
inspect

Stars: 19

Forks: 0

Last updated: 27 April 2026 at 14:26

Open the web UI to visually inspect, edit, and run the benchmark pipeline. Use when the user wants a visual interface for their pipeline.

Source

PSPDFKit-labs

PSPDFKit-labs/agentic-usability

Related occupations (based on the SOC occupational classification)

Software Developers · Computer and Mathematical Occupations · SOC 15-1252

SKILL.md


name	inspect
description	Open the web UI to visually inspect, edit, and run the benchmark pipeline. Use when the user wants a visual interface for their pipeline.
argument-hint	[project-directory] [--port 7373]
disable-model-invocation	true
allowed-tools	Bash(agentic-usability *) Read Glob

Open Web UI

Launch the web-based inspector for the benchmark pipeline.

echo "Arguments: $ARGUMENTS"

Options

--port <number>: Port for the local server (default: 7373)

Pipeline Folder Structure

The web UI serves data from the project directory:

<project>/
  config.json                     # Pipeline configuration
  suite.json                      # Test suite (array of test cases)
  results/
    <runId>/                      # e.g. run-2026-04-25T10-30-00-000Z
      run.json                    # Run manifest (id, targets, testCount, label)
      pipeline-state.json         # Pipeline progress tracker
      report.json                 # Aggregate scorecard (if pipeline completed)
      <target>/<testId>/          # Per-test results
        generated-solution.json   # Agent's solution
        judge.json                # Judge scores
        agent-notes.md            # Agent's working notes
        agent-output.log          # Raw output
        agent-session.jsonl       # Agent conversation log
        judge-session.jsonl       # Judge conversation log
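
The layout above can be checked mechanically. The following is a minimal Python sketch, assuming every directory directly under a run is a target and every directory under a target is a test. The helper name `missing_artifacts` is hypothetical and not part of agentic-usability.

```python
from pathlib import Path

# Per-test files named in the folder structure above.
EXPECTED_FILES = (
    "generated-solution.json",
    "judge.json",
    "agent-notes.md",
    "agent-output.log",
    "agent-session.jsonl",
    "judge-session.jsonl",
)


def missing_artifacts(run_dir):
    """Map "<target>/<testId>" to the expected files absent from that directory.

    Assumption: every subdirectory of the run is a target, and every
    subdirectory of a target is a test. Hypothetical helper, not a tool API.
    """
    missing = {}
    for target in sorted(p for p in Path(run_dir).iterdir() if p.is_dir()):
        for test in sorted(p for p in target.iterdir() if p.is_dir()):
            absent = [name for name in EXPECTED_FILES if not (test / name).is_file()]
            if absent:
                missing[f"{target.name}/{test.name}"] = absent
    return missing
```

An absent file is not necessarily an error: it may only mean the run has not reached the stage that writes it.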

Locating Runs

All subdirectories in results/ with a run.json are runs
Latest run is used by default
Check pipeline-state.json to see if a run is complete (stage: "report") or paused
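
The three rules above can be sketched in Python as follows. This is an illustration only: the function names are hypothetical, "latest" is assumed to mean the run whose ID sorts last (the example run ID embeds a timestamp), and `stage` is assumed to be a top-level key of pipeline-state.json.

```python
import json
from pathlib import Path


def list_runs(project_dir):
    """Return run directories: subdirectories of results/ that contain run.json."""
    results = Path(project_dir) / "results"
    if not results.is_dir():
        return []
    # Assumption: run IDs embed a timestamp, so sorting by name is chronological.
    return sorted(
        (p for p in results.iterdir() if p.is_dir() and (p / "run.json").is_file()),
        key=lambda p: p.name,
    )


def latest_run(project_dir):
    """Return the run directory that sorts last, or None if there are no runs."""
    runs = list_runs(project_dir)
    return runs[-1] if runs else None


def is_complete(run_dir):
    """True if pipeline-state.json reports stage "report".

    Assumption: "stage" is a top-level key of the JSON object.
    """
    state_file = Path(run_dir) / "pipeline-state.json"
    if not state_file.is_file():
        return False
    state = json.loads(state_file.read_text(encoding="utf-8"))
    return isinstance(state, dict) and state.get("stage") == "report"
```

The tool itself may pick the latest run differently, for example from a field in run.json, so treat the name-based ordering as a stand-in.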

Run agentic-usability inspect -p $ARGUMENTS to start the server. It opens the browser automatically. Press Ctrl+C to stop.
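
When the server is started from a script rather than by hand, the command line can be assembled as in the following sketch. It only mirrors the invocation and the argument hint shown in this document. The helper `build_inspect_command` and the path `./my-benchmark` are hypothetical, and the sketch builds the command without running it.

```python
import shlex


def build_inspect_command(project_dir, port=7373):
    """Build the argv for `agentic-usability inspect -p <project> --port <port>`.

    Hypothetical helper. The flags are the ones named in this document;
    7373 is the documented default port.
    """
    if not 1 <= int(port) <= 65535:
        raise ValueError(f"port out of range: {port}")
    return ["agentic-usability", "inspect", "-p", str(project_dir), "--port", str(port)]


# "./my-benchmark" is a placeholder project directory.
command = build_inspect_command("./my-benchmark", port=8080)
print(shlex.join(command))
# prints: agentic-usability inspect -p ./my-benchmark --port 8080
```

Keeping the arguments as a list avoids shell quoting problems if the project path contains spaces.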

For the full file inventory, see pipeline-guide.md.

More from this repository

init

PSPDFKit-labs/agentic-usability

Initialize a new agentic-usability benchmark pipeline project. Use when setting up a new SDK benchmark, creating a config.json, or starting a new evaluation project.

2026-05-14 · 19 stars

sandbox

PSPDFKit-labs/agentic-usability

Launch an interactive shell inside a microsandbox for debugging. Supports bare mode, executor setup, or judge setup with optional test case scaffolding.

2026-05-14 · 19 stars

eval

PSPDFKit-labs/agentic-usability

Run the full evaluation pipeline (execute, judge, report) for an SDK usability benchmark. Use when running a complete benchmark end-to-end, resuming an interrupted pipeline, or checking pipeline status.

2026-04-27 · 19 stars

execute

PSPDFKit-labs/agentic-usability

Execute benchmark test cases in sandboxed environments with AI agents. Spins up microsandbox containers for each test case and extracts solutions.

2026-04-27 · 19 stars

export

PSPDFKit-labs/agentic-usability

Export a benchmark pipeline as a zip file for sharing or archiving. Excludes cache and large snapshots.

2026-04-27 · 19 stars

generate

PSPDFKit-labs/agentic-usability

Generate SDK usability test cases by exploring source code. Use when creating benchmark test suites, generating test cases for an SDK, or when the user wants to create evaluation scenarios.

2026-04-27 · 19 stars
