
qa-web


Last updated: 19 April 2026 at 03:06

Run, extend, and heal web/ app tests for this repo's Next.js dashboard. Use when the user asks to QA the web app, explore for bugs, add or heal Playwright tests, check accessibility, or measure coverage against behaviors. Triggers on "/qa", "qa", "test the web app", "explore the dashboard", "heal this test", "what's covered", "a11y check", "check the PR changes", "find regressions". Four phases — Scope (change-aware targeting), Explore (black-box heuristic testing), Author (spec-first test generation), Heal (selector-drift vs regression triage).


Source: fairchild/workspaces



qa-web

Project-scoped skill for QA of web/ — a Next.js app with Vitest unit tests and Playwright E2E. Invoked by the qa-web-agent subagent (for sandboxed black-box exploration) or directly by the main session (for inline QA work).

Overview

You are testing a Next.js dashboard. The app has an existing test suite (Vitest + Playwright projects fast/full/demo/qa-explore), a behavior ledger at web/tests/LEDGER.md, and an evidence convention at output/qa-agent/<date>/<slug>/.

Your four phases:

- Phase 0 (Scope): inspect what changed on this branch; produce a ranked list of user-visible surfaces to probe. Skip if on main with no caller summary.
- Phase 1 (Explore): black-box heuristic testing (SFDIPOT, FEW HICCUPPS, a11y) scoped to the P0/P1 surfaces from Phase 0. Every finding gets a screenshot + axe snapshot + repro steps.
- Phase 2 (Author): convert findings and uncovered behaviors into spec-first tests. Human approval gate on the spec, then delegate code generation to the playwright-test-generator subagent (installed by mise run web:qa:init-agents).
- Phase 3 (Heal): triage failing Playwright tests via the playwright-test-healer subagent; patch selector drift, escalate behavior changes.
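The Heal distinction between selector drift and behavior change can be sketched as a small classifier. This is only an illustration, not the healer subagent's logic: the `Step` shape (a selector paired with its oracle) and the function name `triage_heal` are hypothetical.

```python
from typing import List, Tuple

# Hypothetical model: each test step is a (selector, oracle) pair.
Step = Tuple[str, str]


def triage_heal(before: List[Step], after: List[Step]) -> str:
    """Classify a proposed patch as no-change, selector-drift, or behavior-change."""
    if before == after:
        return "no-change"
    # Same number of steps and identical oracles means only selectors moved.
    same_flow = len(before) == len(after)
    same_oracles = same_flow and all(b[1] == a[1] for b, a in zip(before, after))
    return "selector-drift" if same_oracles else "behavior-change"
```

A "selector-drift" result is the case that is safe to patch; anything classified as "behavior-change" should be escalated rather than healed.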

Normal flow

Follow this order every time the skill is invoked:

1. Doctor first. Run scripts/doctor.sh from the repo root. If it exits non-zero, the environment is not ready: read references/setup.md and apply fixes. Do not proceed to any phase until doctor passes.
2. Parse the invocation. The caller tells you which phase(s) to run:
   - Bare invocation, or explore with no area → Phase 0 → Phase 1 (scoped to P0/P1). If Phase 0 yields no P0/P1 surfaces (e.g. infra-only branch, or no diff), run Phase 1 as a baseline smoke against the existing web/e2e/explore/ targets. That verifies the environment, catches accidental breakage, and refreshes the ledger's last_verified dates for those behaviors.
   - explore <area> → Phase 1 only, scoped to <area>. Skip Phase 0.
   - Free-form change summary (e.g. "we just rewrote the auth middleware") → Phase 0 with the summary as authoritative intent → Phase 1.
   - author <slug-or-spec-path> → Phase 2. See references/author.md.
   - heal [test-path] → Phase 3. See references/heal.md.
   - run → run mise run web:check && mise run web:e2e only, then give a structured summary and list candidates for /qa heal.
   - ledger → read web/tests/LEDGER.md and summarize gaps ([gap] rows and rows whose last_verified is more than 30 days old).
3. Execute the phase. Follow the referenced procedure file.
4. Render the report. Run .claude/skills/qa-web/scripts/render-report.py --latest (or pass the run directory explicitly). It aggregates every finding.md and axe artifact in the run into:
   - REPORT.md: consolidated markdown, grouped by severity, with a Decisions section.
   - report.html: single-page browser view with inline screenshots.
   - A short terminal summary (findings counts + evidence paths + open-in-browser hint).
5. Present and decide. Emit the terminal summary back to the caller, point them at the HTML, and explicitly ask for a decision per finding: promote (Phase 2 author), log-as-gap (LEDGER row), fix (bug issue), dismiss (document why), or investigate (tighter /qa explore). Do not advance until the caller chooses.
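The invocation dispatch in the "Parse the invocation" step can be sketched as a single function. This is a minimal sketch under assumptions: the `Plan` structure and `plan_invocation` name are hypothetical, and any text that does not start with a known command is treated as a free-form change summary.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Plan:
    """Phases to run plus scoping hints (hypothetical structure)."""
    phases: List[str]
    area: Optional[str] = None
    summary: Optional[str] = None
    target: Optional[str] = None


def plan_invocation(args: str) -> Plan:
    """Map the caller's arguments to phases, following the dispatch list."""
    text = args.strip()
    if not text:
        return Plan(phases=["scope", "explore"])  # bare invocation
    command, _, rest = text.partition(" ")
    rest = rest.strip()
    if command == "explore":
        if rest:
            return Plan(phases=["explore"], area=rest)  # skip Phase 0
        return Plan(phases=["scope", "explore"])
    if command == "author":
        return Plan(phases=["author"], target=rest or None)
    if command == "heal":
        return Plan(phases=["heal"], target=rest or None)
    if command in ("run", "ledger") and not rest:
        return Plan(phases=[command])
    # Anything else: free-form summary, authoritative intent for Phase 0.
    return Plan(phases=["scope", "explore"], summary=text)
```

One limitation of this sketch: a free-form summary that happens to begin with the word "explore", "author", or "heal" would be read as that command, so a real dispatcher would need a stricter convention for summaries.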

Invariants

Hard rules. Violating any of these is a bug in the run.

Black-box during Explore. In Phase 1 do NOT Read or Grep web/src/app/** or web/src/lib/**. Allowed during Explore: web/tests/LEDGER.md, web/e2e/**, web/docs/**, README.md, AGENTS.md, CLAUDE.md. Phase 0 may use git/gh metadata (file names, commit subjects, PR title/body) — that's not a violation. Reading the actual diff content of web/src/** is forbidden unless the caller explicitly authorizes it.
Spec-first during Author. Write the behavior spec under web/specs/<slug>.md before any test code. Pause for human approval. If the only source of truth for the behavior is the implementation, flag it and ask — do not write a tautological mirror test.
Locator priority is non-negotiable. getByRole → getByText → getByLabel → getByTestId → CSS last. Never raw class selectors or nth-child without justification. See references/locator-priority.md.
No snapshot-only assertions. toMatchSnapshot() requires at least one behavioral assertion alongside.
No live-LLM calls inside Vitest. Unit tests use deterministic mocks. For agent-runtime tests, extend web/src/lib/agent-runtime/mock-provider.ts.
Every finding gets evidence. Screenshot + axe snapshot + reproducible steps under output/qa-agent/<ISO-date>/<slug>/. Upload via scripts/evidence.sh when a PR is involved.
Viewport matrix for Explore. 1440×900 desktop and 375×667 mobile, minimum.
Trophy shape over pyramid. Before writing an E2E, justify why integration can't cover it. Decline E2E sprawl.
Heal vs regression. A selector-drift heal (same oracle, different selector) is safe. A behavior-change heal (new oracle, different flow) is not — stop and escalate.
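The black-box invariant lends itself to a mechanical check before any file read during Explore. The sketch below copies the paths from that rule; the function name and the third "unlisted" outcome (for paths the rule does not mention) are assumptions, as is the idea of enforcing the rule in code at all.

```python
from pathlib import PurePosixPath

# Paths copied from the black-box invariant.
FORBIDDEN_DIRS = ("web/src/app", "web/src/lib")
ALLOWED_DIRS = ("web/e2e", "web/docs")
ALLOWED_FILES = ("web/tests/LEDGER.md", "README.md", "AGENTS.md", "CLAUDE.md")


def _is_under(path: PurePosixPath, directory: str) -> bool:
    """True when path is the directory itself or anything below it."""
    root = PurePosixPath(directory)
    return path == root or root in path.parents


def explore_read_policy(raw_path: str) -> str:
    """Return 'forbidden', 'allowed', or 'unlisted' for a repo-relative path."""
    path = PurePosixPath(raw_path)
    if any(_is_under(path, d) for d in FORBIDDEN_DIRS):
        return "forbidden"
    if str(path) in ALLOWED_FILES or any(_is_under(path, d) for d in ALLOWED_DIRS):
        return "allowed"
    return "unlisted"  # not named by the rule; left to the caller to decide
```

The check is purely lexical, so paths containing `..` segments should be normalized before they are passed in.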

Phase details (load as needed)

| Phase | Reference | When to read |
| --- | --- | --- |
| 0: Scope | `references/scope.md` | Bare `/qa`, or caller provided a change summary |
| 1: Explore | `references/explore.md` | Every invocation that probes the app |
| 2: Author | `references/author.md` | `author <slug>` or promoting an Explore finding |
| 3: Heal | `references/heal.md` | `heal [test-path]` or CI went red |

Supporting references (load on-demand):

references/setup.md — what the project must have configured for this skill to work; source of truth for doctor fixes.
references/doctor.md — the verification checklist scripts/doctor.sh enforces.
references/locator-priority.md — Playwright locator rule, why it matters.
references/spec-template.md — the behavior-spec Markdown template for Phase 2.
references/oracles.md — SFDIPOT + FEW HICCUPPS heuristics for exploration.
references/evidence.md — evidence layout and upload conventions.
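As a rough illustration of the evidence layout these references describe, the sketch below builds the per-finding directory and reports which required pieces are missing. The directory pattern and finding.md come from this document; the file-naming conventions (screenshots end in .png, axe artifacts contain "axe" in their name, repro steps live in finding.md) are assumptions, and references/evidence.md remains the source of truth.

```python
from datetime import date
from pathlib import PurePosixPath
from typing import Iterable, List

EVIDENCE_ROOT = PurePosixPath("output/qa-agent")


def evidence_dir(slug: str, day: date) -> PurePosixPath:
    """Directory for one finding: output/qa-agent/<ISO-date>/<slug>."""
    return EVIDENCE_ROOT / day.isoformat() / slug


def missing_evidence(filenames: Iterable[str]) -> List[str]:
    """List the required evidence kinds absent from a finding directory."""
    names = [name.lower() for name in filenames]
    missing = []
    if not any(name.endswith(".png") for name in names):  # assumed screenshot format
        missing.append("screenshot")
    if not any("axe" in name for name in names):  # assumed axe artifact naming
        missing.append("axe snapshot")
    if "finding.md" not in names:  # assumed to hold the repro steps
        missing.append("repro steps")
    return missing
```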

Scripts

scripts/doctor.sh — verify setup; exit 0 if ready, non-zero if not (prints remediation hints).
scripts/scope-report.sh — git + gh reconnaissance; prints the Phase 0 Scope Report.
scripts/render-report.py — aggregate a run's findings into REPORT.md + report.html with screenshots inline. Pass --latest to pick the most recent date dir under output/qa-agent/. Pass --open to open the HTML in the default browser.
web/scripts/qa-probe.js (lives in the project, not the skill) — axe-core + screenshots on a list of URLs. Invoke with QA_SLUG=<slug> QA_BASE_URL=<url> node web/scripts/qa-probe.js.
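The `--latest` behavior described for render-report.py (pick the most recent date directory under output/qa-agent/) could be approximated as follows. This is a sketch and not the script's actual code; the function name is hypothetical, and it assumes run directories are named with plain ISO dates.

```python
from datetime import date
from typing import Iterable, Optional


def latest_run_dir(names: Iterable[str]) -> Optional[str]:
    """Return the directory name with the most recent ISO date, if any."""
    dated = []
    for name in names:
        try:
            dated.append((date.fromisoformat(name), name))
        except ValueError:
            continue  # skip entries that do not parse as an ISO date
    return max(dated)[1] if dated else None
```

In use, the names would come from the filesystem, for example `latest_run_dir(p.name for p in Path("output/qa-agent").iterdir() if p.is_dir())` with `Path` imported from `pathlib`.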

Relevant mise tasks (in web/.mise.toml):

web:e2e:explore — run the qa-explore Playwright project.
web:qa:init-agents — one-shot: scaffold Playwright's agent-loop instruction files (optional, complementary).
web:qa:codegen [url] — record interactions into a starter .spec.ts.

Output format

End every run with a BLUF-first block — TL;DR and recommended actions come first, details go behind a collapsed <details>. This is what the caller sees in chat; it mirrors what render-report.py produces for the HTML/markdown reports.

## TL;DR

<one- or two-sentence natural-language summary: what was found, what's notable, whether there are blockers>

- <Recommended action 1 in plain English — e.g. "Fix in product — copy the prompt below into an agent session">
- <Recommended action 2 — e.g. "Log as gap in web/tests/LEDGER.md">
- <Recommended action 3 — e.g. "Defer / dismiss N cosmetic items">

**Counts:** <severity pill row>  |  <total> total
**Branch / PR:** <branch> / #<pr or "none">
**Scope:** <caller summary, or "bare /qa — derived from diff">
**Ledger updates:** <n> rows added / updated
**Evidence root:** `output/qa-agent/<ISO-date>/`

<details>
<summary>Findings (click to expand)</summary>

- 🚨 **[P0]** <issue> — <path to evidence>
- ⚠️ **[P1]** <issue> — <path to evidence>
- 🔭 **[gap]** <uncovered behavior> — proposed spec: <path>

</details>

📄 Full report: `output/qa-agent/<ISO-date>/REPORT.md`
🌐 Open in browser: `open 'output/qa-agent/<ISO-date>/report.html'`

The details section, report links, and hand-off prompt are for the caller to expand when they want to act. The TL;DR + actions are what they read first.

Do NOT open PRs yourself. Summarize the diff and evidence paths so the caller can review and open the PR.
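The visible part of the output block (TL;DR, actions, and metadata lines) can be assembled with a short helper. This is a sketch under assumptions: the function name and parameters are hypothetical, the "severity pill row" format is not specified here so a `P0: 1 · P1: 2` style is assumed, and the collapsed findings section and report links are left out.

```python
from typing import Dict, List, Optional


def render_tldr(summary: str, actions: List[str], counts: Dict[str, int],
                branch: str, pr: Optional[int], scope: str,
                ledger_rows: int, iso_date: str) -> str:
    """Build the chat-facing block: summary and actions first, metadata after."""
    # Assumed pill format; the template only says "<severity pill row>".
    pills = " · ".join(f"{severity}: {n}" for severity, n in counts.items())
    pr_label = str(pr) if pr is not None else "none"
    lines = ["## TL;DR", "", summary, ""]
    lines += [f"- {action}" for action in actions]
    lines += [
        "",
        f"**Counts:** {pills}  |  {sum(counts.values())} total",
        f"**Branch / PR:** {branch} / #{pr_label}",
        f"**Scope:** {scope}",
        f"**Ledger updates:** {ledger_rows} rows added / updated",
        f"**Evidence root:** `output/qa-agent/{iso_date}/`",
    ]
    return "\n".join(lines)
```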

Rationalization table

If you catch yourself reasoning this way, stop and reread the rule.

| Temptation | Counter |
| --- | --- |
| "I'll just peek at `chat-panel.tsx` to understand what to test" during Explore | That's implementation-mirroring. Explore tests user-visible behavior. Close the file, drive the app instead. |
| "Snapshot is easier than writing assertions" | Snapshots without semantic assertions rot silently. Pair at least one `expect(role).toBeVisible()`-equivalent with any snapshot. |
| "This test is flaky, I'll add `waitForTimeout(5000)`" | Flakiness is a signal about the selector or the oracle, not about timing. Replace with `expect(locator).toBeVisible({ timeout })` or `page.waitForRequest(...)`. |
| "The caller gave a summary but the diff says something else, so I'll trust the diff" | The caller knows intent; the diff doesn't. Treat the summary as authoritative; note the discrepancy in the Scope Report. |
| "Healer proposed changing the assertion; close enough, ship it" | That's a behavior change, not a heal. Stop. Write a regression note. Escalate. |
| "I can skip doctor, it passed last week" | Run doctor. The environment drifts. One wasted minute beats thirty minutes debugging a missing dep. |