inspect

Stars: 19

Forks: 0

Updated: April 27, 2026, 14:26

Open the web UI to visually inspect, edit, and run the benchmark pipeline. Use when the user wants a visual interface for their pipeline.

Source

PSPDFKit-labs

PSPDFKit-labs/agentic-usability

Open Web UI

Launch the web-based inspector for the benchmark pipeline.

echo "Arguments: $ARGUMENTS"

Options

--port <number>: Port for the local server (default: 7373)

Pipeline Folder Structure

The web UI serves data from the project directory:

<project>/
  config.json                     # Pipeline configuration
  suite.json                      # Test suite (array of test cases)
  results/
    <runId>/                      # e.g. run-2026-04-25T10-30-00-000Z
      run.json                    # Run manifest (id, targets, testCount, label)
      pipeline-state.json         # Pipeline progress tracker
      report.json                 # Aggregate scorecard (if pipeline completed)
      <target>/<testId>/          # Per-test results
        generated-solution.json   # Agent's solution
        judge.json                # Judge scores
        agent-notes.md            # Agent's working notes
        agent-output.log          # Raw output
        agent-session.jsonl       # Agent conversation log
        judge-session.jsonl       # Judge conversation log
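
As a rough illustration of the layout above, the following sketch walks one run directory and reports which of the listed per-test files are missing. The function name inventory_run and the EXPECTED_FILES tuple are hypothetical helpers, not part of agentic-usability; only the file names come from the tree above. It also assumes that every subdirectory directly under a run directory is a target.

```python
from pathlib import Path

# File names copied from the folder structure above.
EXPECTED_FILES = (
    "generated-solution.json",
    "judge.json",
    "agent-notes.md",
    "agent-output.log",
    "agent-session.jsonl",
    "judge-session.jsonl",
)


def inventory_run(run_dir):
    """Map "<target>/<testId>" to the expected files missing from that folder.

    Assumption: every directory directly under run_dir is a target, and every
    directory under a target is a test case.
    """
    run_dir = Path(run_dir)
    missing = {}
    for target_dir in sorted(p for p in run_dir.iterdir() if p.is_dir()):
        for test_dir in sorted(p for p in target_dir.iterdir() if p.is_dir()):
            key = f"{target_dir.name}/{test_dir.name}"
            missing[key] = [
                name for name in EXPECTED_FILES if not (test_dir / name).is_file()
            ]
    return missing
```

Calling inventory_run on a run directory returns an empty list for each test whose files are all present, which makes partially executed tests easy to spot before opening the UI.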

Locating Runs

Every subdirectory of results/ that contains a run.json is a run.
The latest run is used by default.
Check pipeline-state.json to see whether a run is complete (stage: "report") or paused.
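
The rules above can be sketched in a few lines of Python. This is a minimal sketch under two assumptions that the text does not confirm: that "latest" can be approximated by sorting run directory names (the example run ID is timestamp-based), and that pipeline-state.json is a JSON object with a top-level "stage" key. The helper names are hypothetical.

```python
import json
from pathlib import Path


def list_runs(project_dir):
    """Return run directories: subdirectories of results/ containing run.json."""
    results = Path(project_dir) / "results"
    if not results.is_dir():
        return []
    # Assumption: timestamp-style run IDs sort chronologically by name.
    return sorted(
        p for p in results.iterdir() if p.is_dir() and (p / "run.json").is_file()
    )


def latest_run(project_dir):
    """Return the last run by name order, or None if there are no runs."""
    runs = list_runs(project_dir)
    return runs[-1] if runs else None


def run_stage(run_dir):
    """Return the "stage" value from pipeline-state.json, or None if unavailable."""
    state_file = Path(run_dir) / "pipeline-state.json"
    if not state_file.is_file():
        return None
    state = json.loads(state_file.read_text(encoding="utf-8"))
    # Assumption: "stage" is a top-level key of a JSON object.
    return state.get("stage") if isinstance(state, dict) else None


def is_complete(run_dir):
    """A run counts as complete when its stage is "report"."""
    return run_stage(run_dir) == "report"
```

A run whose stage is anything other than "report" is treated here as not complete; distinguishing a paused run from one still in progress would need fields this document does not describe.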

Run agentic-usability inspect -p $ARGUMENTS to start the server. It opens the browser automatically. Press Ctrl+C to stop.
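
Before launching, it can help to confirm the port is not already taken. The sketch below builds the command line from the pieces documented here (inspect, -p, --port, default 7373) and checks port availability with the standard socket module. It assumes that -p takes the project directory and that the server listens on 127.0.0.1; neither is confirmed by this page, and both helper names are hypothetical.

```python
import socket

DEFAULT_PORT = 7373  # default stated in the Options section


def port_is_free(port, host="127.0.0.1"):
    """Return True if a TCP socket can currently bind to host:port.

    Assumption: the inspector listens on the loopback interface.
    """
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        try:
            sock.bind((host, port))
        except OSError:
            return False
    return True


def build_inspect_command(project_dir, port=DEFAULT_PORT):
    """Build the argv list for launching the inspector.

    Assumption: -p is followed by the project directory.
    """
    cmd = ["agentic-usability", "inspect", "-p", str(project_dir)]
    if port != DEFAULT_PORT:
        cmd += ["--port", str(port)]
    return cmd
```

The resulting list can be passed to subprocess.run; since the server runs until Ctrl+C, that call blocks for as long as the UI is open.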

For the full file inventory, see pipeline-guide.md.

name	inspect
description	Open the web UI to visually inspect, edit, and run the benchmark pipeline. Use when the user wants a visual interface for their pipeline.
argument-hint	[project-directory] [--port 7373]
disable-model-invocation	true
allowed-tools	Bash(agentic-usability *) Read Glob
