一键在 Manus 中运行任何 Skill

init

星标19

分支0

更新时间2026年5月14日 09:19

Initialize a new agentic-usability benchmark pipeline project. Use when setting up a new SDK benchmark, creating a config.json, or starting a new evaluation project.

安装

用 Codex 或 Claude 帮你安装复制这段 Prompt，粘贴到 Codex、Claude 或其他助手里，让它检查 Skill 页面并帮你完成安装。

在 Manus 中运行

来源

PSPDFKit-labs

PSPDFKit-labs/agentic-usability

打开 GitHub 仓库查看创作者相关仓库

下载

在 Manus 中运行

相关职业SOC

基于 SOC 职业分类

软件开发工程师计算机与数学类职业·SOC 15-1252

SKILL.md

readonly

同仓库更多 Skills

同仓库

sandbox

PSPDFKit-labs/agentic-usability

Launch an interactive shell inside a microsandbox for debugging. Supports bare mode, executor setup, or judge setup with optional test case scaffolding.

2026-05-1419

eval

PSPDFKit-labs/agentic-usability

Run the full evaluation pipeline (execute, judge, report) for an SDK usability benchmark. Use when running a complete benchmark end-to-end, resuming an interrupted pipeline, or checking pipeline status.

2026-04-2719

execute

PSPDFKit-labs/agentic-usability

Execute benchmark test cases in sandboxed environments with AI agents. Spins up microsandbox containers for each test case and extracts solutions.

2026-04-2719

export

PSPDFKit-labs/agentic-usability

Export a benchmark pipeline as a zip file for sharing or archiving. Excludes cache and large snapshots.

2026-04-2719

generate

PSPDFKit-labs/agentic-usability

Generate SDK usability test cases by exploring source code. Use when creating benchmark test suites, generating test cases for an SDK, or when the user wants to create evaluation scenarios.

2026-04-2719

insights

PSPDFKit-labs/agentic-usability

Analyze benchmark results and identify SDK improvement areas. Use when reviewing evaluation results, finding failure patterns, identifying documentation gaps, or understanding API design issues.

2026-04-2719

name	init
description	Initialize a new agentic-usability benchmark pipeline project. Use when setting up a new SDK benchmark, creating a config.json, or starting a new evaluation project.
argument-hint	[project-directory]
disable-model-invocation	true
allowed-tools	Bash(agentic-usability *) Write Read Glob

Initialize Pipeline Project

Set up a new agentic-usability benchmark pipeline in the given project directory.

echo "Project directory: $ARGUMENTS"

You have two approaches:

Option 1: Interactive Wizard

Run agentic-usability init -p $ARGUMENTS for a step-by-step interactive setup.

Option 2: Direct Config Creation

If the user has described their SDK, create config.json directly. This is faster and allows you to tailor the config to their exact setup.

Project Directory Structure

After init, the project should have:

<project>/
  config.json       # Configuration (you create this)
  suite.json        # Test suite (created by generate)
  results/          # Run results (created by eval/execute)
  cache/repos/      # Git repo cache (created automatically)

config.json Schema

{
  "privateInfo": [],
  "publicInfo": [],
  "agents": {},
  "targets": [],
  "sandbox": {}
}

`privateInfo` (required, non-empty array)

SDK source code and internal docs. Visible to generator and judge, never to executor. Each entry is a SourceConfig with a type discriminator:

Local source — filesystem path:

{ "type": "local", "path": "./src", "subpath": "packages/core", "additionalContext": "Focus on the Builder API" }

Fields: path (required), subpath (optional), additionalContext (optional)

Git source — clone a repository:

{ "type": "git", "url": "https://github.com/org/sdk.git", "branch": "main", "subpath": "src", "sparse": ["src/api"], "additionalContext": "..." }

Fields: url (required), branch, subpath, sparse (sparse checkout paths), additionalContext (all optional)

URL source — fetch documentation:

{ "type": "url", "url": "https://internal-docs.example.com/api-ref", "additionalContext": "..." }

Fields: url (required), additionalContext (optional)

Package source — metadata about the SDK package:

{ "type": "package", "name": "@example/sdk", "installCommand": "npm install @example/sdk", "language": "typescript", "additionalContext": "..." }

Fields: name (required), installCommand, language, additionalContext (all optional)

`publicInfo` (optional array)

Public docs and package info visible to both executor and judge. Same SourceConfig types as above. Typically includes:

A package source so executors know what to install
A url source for public documentation

`agents` (optional object)

Role	Type	Runs in sandbox?	Secret required?
`generator`	AgentConfig	No (host)	No
`executor`	SandboxAgentConfig	Yes	Yes
`judge`	SandboxAgentConfig	Yes	Yes
`insights`	AgentConfig	No (host)	No

AgentConfig fields (generator, insights):

command (required): "claude", "codex", "gemini", or custom CLI name
systemPrompt (optional): supports {{packageName}} and {{docsUrl}} placeholders

SandboxAgentConfig — extends AgentConfig with required secret:

{
  "command": "claude",
  "secret": { "value": "$ANTHROPIC_API_KEY" }
}

AgentSecretConfig fields:

value (required): raw API key or "$ENV_VAR" reference
envVar: env var name for key inside sandbox — auto-detected for known agents
baseUrl: API base URL — auto-detected for known agents
baseUrlEnvVar: env var for base URL override — auto-detected for known agents

Known agent defaults (auto-filled, user only needs value):

command	envVar	baseUrl	baseUrlEnvVar
`claude`	`ANTHROPIC_API_KEY`	`https://api.anthropic.com`	`ANTHROPIC_BASE_URL`
`codex`	`CODEX_API_KEY`	`https://api.openai.com/v1`	`OPENAI_BASE_URL`
`gemini`	`GEMINI_API_KEY`	`https://generativelanguage.googleapis.com`	`GEMINI_API_BASE_URL`

Custom agents must explicitly set envVar and baseUrl in the secret.

`targets` (required, non-empty array)

Docker images for sandboxed execution:

{ "name": "node-20", "image": "node:20-slim", "timeout": 1200, "additionalContext": "Node.js 20 with npm" }

Fields: name (required), image (required), timeout (optional, seconds), additionalContext (optional, included in generator prompt)

`sandbox` (required object, can be `{}`)

{
  "concurrency": 3,
  "defaultTimeout": 600,
  "memoryMib": 2048,
  "cpus": 2,
  "secrets": {
    "EXTRA_API_KEY": {
      "value": "$EXTRA_KEY",
      "allowHosts": ["api.extra-service.com"],
      "allowHostPatterns": ["*.extra-service.com"]
    }
  },
  "env": {
    "LICENSE_KEY": "$MY_LICENSE_KEY"
  }
}

secrets: TLS-injected secrets that never enter the VM. Each needs value and non-empty allowHosts.
env: Plain env vars passed directly into sandbox. Values can use $VAR to reference host env.

`workspace` (optional)

{ "template": "./workspace-template", "setupScript": "./setup.sh" }

For the full schema with all validation rules, see config-schema.md.

Complete Example

{
  "privateInfo": [
    { "type": "local", "path": "./sdk-source", "additionalContext": "Main SDK source code" }
  ],
  "publicInfo": [
    { "type": "package", "name": "my-sdk", "installCommand": "npm install my-sdk", "language": "typescript" },
    { "type": "url", "url": "https://docs.my-sdk.io/getting-started" }
  ],
  "agents": {
    "generator": { "command": "claude" },
    "executor": { "command": "claude", "secret": { "value": "$ANTHROPIC_API_KEY" } },
    "judge": { "command": "claude", "secret": { "value": "$ANTHROPIC_API_KEY" } }
  },
  "targets": [
    { "name": "node-20", "image": "node:20-slim", "timeout": 1200 }
  ],
  "sandbox": {
    "concurrency": 3,
    "defaultTimeout": 600
  }
}

After creating config.json, run agentic-usability generate -p <project> to create the test suite.

init

同仓库更多 Skills

同仓库更多 Skills

Initialize Pipeline Project

Option 1: Interactive Wizard

Option 2: Direct Config Creation

Project Directory Structure

config.json Schema

privateInfo (required, non-empty array)

publicInfo (optional array)

agents (optional object)

targets (required, non-empty array)

sandbox (required object, can be {})

workspace (optional)

Complete Example

Initialize Pipeline Project

Option 1: Interactive Wizard

Option 2: Direct Config Creation

Project Directory Structure

config.json Schema

privateInfo (required, non-empty array)

publicInfo (optional array)

agents (optional object)

targets (required, non-empty array)

sandbox (required object, can be {})

workspace (optional)

Complete Example

`privateInfo` (required, non-empty array)

`publicInfo` (optional array)

`agents` (optional object)

`targets` (required, non-empty array)

`sandbox` (required object, can be `{}`)

`workspace` (optional)

`privateInfo` (required, non-empty array)

`publicInfo` (optional array)

`agents` (optional object)

`targets` (required, non-empty array)

`sandbox` (required object, can be `{}`)

`workspace` (optional)