تشغيل أي مهارة في Manus بنقرة واحدة

workspaces-performance-system

النجوم١

التفرعات٠

آخر تحديث٢٦ مايو ٢٠٢٦ في ٠٥:١٨

Run and maintain the canonical Workspaces performance contract across debug, installed, release, and CI environments. Use when collecting comparable before/after metrics, asserting scenario budgets, verifying installed-build parity, refreshing baselines, or classifying self-hosted perf lane failures as infrastructure versus product regressions. Prefer this skill over workspaces-optimization when the task is to operate the standardized performance system rather than investigate an unknown slowdown.

التثبيت

التثبيت باستخدام Codex أو Claude انسخ هذا Prompt والصقه في Codex أو Claude أو مساعد آخر ليراجع صفحة Skill ويثبّتها لك.

تشغيل في Manus

المصدر

fairchild

fairchild/workspaces

فتح مستودع GitHub عرض مستودعات المنشئ

تنزيل

تشغيل في Manus

المهن ذات الصلةSOC

استنادا إلى تصنيف SOC المهني

محللو ضمان جودة البرمجيات والمختبرونمهن الحاسوب والرياضيات·SOC 15-1253

مستكشف الملفات

2 ملفات

SKILL.md

readonly

name

workspaces-performance-system

description

Workspaces Performance System

Use this skill when the task is to run the performance system correctly and produce comparable evidence.

Use workspaces-optimization instead when the task is root-cause investigation on a slow machine or an unknown latency path.

Quick Start

Run one canonical debug scenario and assert budgets:

./scripts/perf-runner.sh --scenario debug_no_activate --assert-budget

Compare before and after summaries:

./scripts/perf-compare.py before-summary.json after-summary.json

Verify release-bundle performance signoff on the packaged app:

./scripts/verify-installed-perf.sh build/WorkSpaces.app

Core Workflow

1. Start from the contract

Read these first:

config/performance/contract.json
docs/performance/metrics-reference.md
docs/performance-testing.md

The contract is the source of truth for:

scenario ids
metric coverage
budget formulas
PR versus scheduled-main gate policy

Do not invent new scenario names or hand-tuned thresholds in ad hoc scripts.

2. Pick the smallest canonical scenario

Use these scenario ids exactly:

debug_no_activate
debug_activate
installed_clean_shell
installed_login_shell
installed_input_short_capture

Guidance:

Use debug_no_activate for low-variance local branch comparisons and trend history, not release signoff.
Use debug_activate for interactive local validation.
Use installed_clean_shell to isolate app and terminal surface cost; this is the packaged-app release signoff scenario.
Use installed_login_shell to include shell-init overhead.
Use installed_input_short_capture only for short interactive focused typing captures.

3. Run through the wrapper first

Prefer the shared wrapper over calling collectors manually:

./scripts/perf-runner.sh --scenario installed_clean_shell --assert-budget

The wrapper exists to keep artifact shape consistent across debug and installed runs.

Use the older standalone scripts only when you need a narrower benchmark:

./scripts/perf-baseline.sh
./scripts/new-workspace-perf.sh
./scripts/launch-installed-diagnostics.sh
./.agents/skills/workspaces-optimization/scripts/summarize_perf_log.py
./.agents/skills/workspaces-optimization/scripts/summarize_diagnostic_report.py

4. Compare canonical summaries, not raw impressions

For every optimization or regression check:

capture before
capture after
compare with ./scripts/perf-compare.py
report metric deltas and budget status

Prefer median as the gate signal. Treat max and p95 as diagnostic signals.

5. Treat installed-build parity as mandatory

Before release or packaged-app signoff:

build the packaged app
run ./scripts/verify-release-bundle.sh
run ./scripts/verify-installed-perf.sh

Installed validation must confirm:

bundled ghostty resources exist
bundled terminfo exists
installed clean-shell capture emits terminal_first_output
installed clean-shell capture emits first_prompt_ready
logs do not contain missing-resource warnings

If the verifier fails because the machine is headless or lacks a real display session, classify that as an environment limitation, not a product perf result. Release automation may use ./scripts/verify-installed-perf.sh --allow-skip-noninteractive for that one case only.

Debug scenario failures are useful branch-trend evidence. They do not block a release by themselves when the packaged app verifier passes and the debug environment difference is explicitly classified.

6. Separate infra failures from perf regressions

For CI and scheduled runs, read:

.github/workflows/perf-validation.yml
docs/development/self-hosted-runner-incidents.md

Rules:

runner-lane-health answers whether the lane is available.
perf-validation answers whether the product met the scenario.
Do not report tart-ui offline or runner disconnects as app regressions.
On PRs, local evidence plus build/test is merge-critical; self-hosted perf stays advisory until lane stability is proven.

Outputs To Produce

When this skill is used successfully, produce:

canonical summary.json
short human-readable summary
before/after delta report when comparing changes
explicit classification of any infra-only failure

Guardrails

Do not bypass contract.json with custom thresholds.
Do not compare different scenarios as if they were equivalent.
Do not claim a regression from one noisy run when a lower-variance scenario is available.
Do not treat missing installed-build metrics as product failure before checking Ghostty resources and display/session constraints.
Do not mix root-cause debugging with contract validation unless the standardized run has already classified the regression.

Canonical Files

config/performance/contract.json
scripts/perf-runner.sh
scripts/perf-compare.py
scripts/verify-installed-perf.sh
scripts/perf-baseline.sh
scripts/new-workspace-perf.sh
.github/workflows/perf-validation.yml
.github/workflows/release.yml
docs/performance-testing.md
docs/performance/metrics-reference.md
docs/development/self-hosted-runner-incidents.md

المزيد من هذا المستودع

نفس المستودع

release-screenshot

fairchild/workspaces

Capture deterministic screenshots of the WorkSpaces macOS app in a known UI state. Use when the user asks to "capture a release screenshot", "screenshot the app", "show the Phase 1 state", or wants evidence for a PR / release notes / design brief. Wraps the fixture-mode env vars (WORKSPACES_UI_FIXTURE, WORKSPACES_UI_FIXTURE_AGENT_STATES, and WORKSPACES_UI_FIXTURE_COMMAND_STATUSES) and the existing scripts/sidebar-capture.sh capture pipeline. Triggers on "release screenshot", "screenshot the app", "capture the app", "fixture screenshot", "ui evidence", "/release-screenshot".

2026-05-311

cofounder-contributor

fairchild/workspaces

Run the cofounder contributor workflow for this repo's standing personas. Use when you want April Clearwater or Plat Ironwood to sweep open PRs, discussions, and issues, then produce one structured action: proposal, discussion comment, PR review, or issue execution that pushes a branch and opens or updates a PR.

2026-05-261

peter-planner

fairchild/workspaces

Convert an approved GitHub Discussion idea into actionable GitHub Issues and, when needed, a milestone. Use when planning endorsed ideas, turning a discussion into issue-sized work, reconciling existing planner markers, or running the Peter Planner workflow manually or from automation.

2026-05-101

become-persona

fairchild/workspaces

Assume one of this repo's named agent personas in the current conversation, including its repo-owned persona prompt and available memory context. Use this whenever the user types /become <persona>, says "become april", "assume Plat", "act as Peter", or asks for April Clearwater, Plat Ironwood, or Peter Planner with memory access.

2026-05-041

qa-web

fairchild/workspaces

Run, extend, and heal web/ app tests for this repo's Next.js dashboard. Use when the user asks to QA the web app, explore for bugs, add or heal Playwright tests, check accessibility, or measure coverage against behaviors. Triggers on "/qa", "qa", "test the web app", "explore the dashboard", "heal this test", "what's covered", "a11y check", "check the PR changes", "find regressions". Four phases — Scope (change-aware targeting), Explore (black-box heuristic testing), Author (spec-first test generation), Heal (selector-drift vs regression triage).

2026-04-191

drive

fairchild/workspaces

Execute GitHub milestones from this repo from planning through delivery. Use when the user wants to drive a milestone by number or relative target, for example `/drive m4`, `/drive to milestone 4`, `/drive next milestone`, or "deliver milestone 4". Always refresh the latest milestone, issue, PR, and workflow state from GitHub before planning, then work issues one at a time with issue updates, validation, PR management, review follow-up, and milestone retro notes.

2026-04-181

name

workspaces-performance-system

description

Workspaces Performance System

Use this skill when the task is to run the performance system correctly and produce comparable evidence.

Use workspaces-optimization instead when the task is root-cause investigation on a slow machine or an unknown latency path.

Quick Start

Run one canonical debug scenario and assert budgets:

./scripts/perf-runner.sh --scenario debug_no_activate --assert-budget

Compare before and after summaries:

./scripts/perf-compare.py before-summary.json after-summary.json

Verify release-bundle performance signoff on the packaged app:

./scripts/verify-installed-perf.sh build/WorkSpaces.app

Core Workflow

1. Start from the contract

Read these first:

config/performance/contract.json
docs/performance/metrics-reference.md
docs/performance-testing.md

The contract is the source of truth for:

scenario ids
metric coverage
budget formulas
PR versus scheduled-main gate policy

Do not invent new scenario names or hand-tuned thresholds in ad hoc scripts.

2. Pick the smallest canonical scenario

Use these scenario ids exactly:

debug_no_activate
debug_activate
installed_clean_shell
installed_login_shell
installed_input_short_capture

Guidance:

Use debug_no_activate for low-variance local branch comparisons and trend history, not release signoff.
Use debug_activate for interactive local validation.
Use installed_clean_shell to isolate app and terminal surface cost; this is the packaged-app release signoff scenario.
Use installed_login_shell to include shell-init overhead.
Use installed_input_short_capture only for short interactive focused typing captures.

3. Run through the wrapper first

Prefer the shared wrapper over calling collectors manually:

./scripts/perf-runner.sh --scenario installed_clean_shell --assert-budget

The wrapper exists to keep artifact shape consistent across debug and installed runs.

Use the older standalone scripts only when you need a narrower benchmark:

./scripts/perf-baseline.sh
./scripts/new-workspace-perf.sh
./scripts/launch-installed-diagnostics.sh
./.agents/skills/workspaces-optimization/scripts/summarize_perf_log.py
./.agents/skills/workspaces-optimization/scripts/summarize_diagnostic_report.py

4. Compare canonical summaries, not raw impressions

For every optimization or regression check:

capture before
capture after
compare with ./scripts/perf-compare.py
report metric deltas and budget status

Prefer median as the gate signal. Treat max and p95 as diagnostic signals.

5. Treat installed-build parity as mandatory

Before release or packaged-app signoff:

build the packaged app
run ./scripts/verify-release-bundle.sh
run ./scripts/verify-installed-perf.sh

Installed validation must confirm:

bundled ghostty resources exist
bundled terminfo exists
installed clean-shell capture emits terminal_first_output
installed clean-shell capture emits first_prompt_ready
logs do not contain missing-resource warnings

6. Separate infra failures from perf regressions

For CI and scheduled runs, read:

.github/workflows/perf-validation.yml
docs/development/self-hosted-runner-incidents.md

Rules:

runner-lane-health answers whether the lane is available.
perf-validation answers whether the product met the scenario.
Do not report tart-ui offline or runner disconnects as app regressions.
On PRs, local evidence plus build/test is merge-critical; self-hosted perf stays advisory until lane stability is proven.

Outputs To Produce

When this skill is used successfully, produce:

canonical summary.json
short human-readable summary
before/after delta report when comparing changes
explicit classification of any infra-only failure

Guardrails

Do not bypass contract.json with custom thresholds.
Do not compare different scenarios as if they were equivalent.
Do not claim a regression from one noisy run when a lower-variance scenario is available.
Do not treat missing installed-build metrics as product failure before checking Ghostty resources and display/session constraints.
Do not mix root-cause debugging with contract validation unless the standardized run has already classified the regression.

Canonical Files

config/performance/contract.json
scripts/perf-runner.sh
scripts/perf-compare.py
scripts/verify-installed-perf.sh
scripts/perf-baseline.sh
scripts/new-workspace-perf.sh
.github/workflows/perf-validation.yml
.github/workflows/release.yml
docs/performance-testing.md
docs/performance/metrics-reference.md
docs/development/self-hosted-runner-incidents.md