Jeden Skill in Manus ausführen
mit einem Klick

Jeden Skill in Manus mit einem Klick ausführen

acceptance-exploration

Verify a finished feature works end-to-end before declaring done. Use after implementation and unit tests pass, when you need a pass/fail verdict on whether the feature actually works from a user's or caller's perspective — not just that tests are green. Works across browser apps, CLI tools, HTTP APIs, and agent/skill bundles at stage-appropriate depth (prototype / MVP / beta / GA). Produces a verdict with evidence (screenshots, transcripts, request logs). Do not use for writing tests (use test-driven-development), code review (use code-review-and-quality), or security audits (use security-and-hardening).

In Manus ausführen

Sterne1

Forks0

Aktualisiert24. April 2026 um 00:08

Quelle

jasonjgarcia24

jasonjgarcia24/claude-dev-team

GitHub-Repository öffnen Creator-Repositorys ansehen

Installationsbefehl

Download

In Manus ausführen

Nützlich fürSOC

Softwarequalitätssicherungsanalysten und -testerInformatik- und Mathematikberufe15-1253L4

Datei-Explorer

5 Dateien

SKILL.md

readonly

name

acceptance-exploration

description

Acceptance Exploration

Run the finished feature end-to-end and return a stage-appropriate pass/fail verdict with evidence. Tests verify code; acceptance exploration verifies the feature.

Invoked primarily by the negev agent (~/.claude/agents/acceptance-explorer.md). Follow directly when no persona is needed.

Two required inputs

Every exploration needs both:

Stage — how deep to probe: prototype / MVP / beta / GA
Surface — how to drive and observe: browser / cli / http-api / agent-skill / other

Stop and ask if either is missing. Stage is semantic (what to check). Surface is mechanical (how to check).

Also confirm: the spec (what the feature should do) and launch instructions (how to start it).

Stages

Deeper stages include everything from lighter stages. Checklists are semantic — apply them across all surfaces.

Prototype — "the core idea demonstrates"

Happy path runs start-to-finish without crashing
Core value proposition is observable in the output
No uncaught errors on the primary flow

MVP — "the feature is usable"

All prototype checks, plus:

Top 3 consumer flows complete successfully
Error states produce useful output (no blank screens, silent exits, or raw stack traces leaked to end users)
Transport layer clean (no 5xx, no non-zero exit on valid input)

Beta — "it handles real use"

All MVP checks, plus:

Edge cases: empty input, very long input, invalid input, duplicate/rapid invocations
Error recovery: retries, degraded-dependency handling, timeouts
Input-variant coverage across realistic consumer patterns

GA — "production-ready"

All beta checks, plus:

Performance meets any stated budget
Observability: failures leave useful traces; non-generic error messages
Graceful degradation (no silent failures, no infinite hangs)
Surface-specific hardening (see surface reference)

Surfaces

Pick the matching reference file and load only that one:

Browser app → references/surface-browser.md
CLI tool → references/surface-cli.md
HTTP API → references/surface-http-api.md
Agent or skill → references/surface-agent-skill.md
Other (daemon, extension, MCP server, library without a public endpoint) — adapt method; document the surface description in the report

Each reference covers: probing tool, how to drive the surface, evidence to capture, and surface-specific hardening items per stage.

If a feature spans multiple surfaces (e.g., CLI + API), run acceptance against each separately and consolidate in the verdict.

Method

Load the spec. Read the task, PRD, issue, or spec file.
Confirm inputs. Stage + surface + launch instructions. Stop and ask if any are missing.
Load the surface reference. Read only the one that matches.
Launch. Start the system. Verify baseline reachability before probing.
Drive the flows. Walk the stage checklist using the surface's probing tool. One flow at a time.
Capture evidence as you go. Save to ./acceptance-evidence/<YYYY-MM-DD-HHMM>/<flow-name>/ if running in a project directory.
Score. For each checklist item: PASS, FAIL, or BLOCKED. BLOCKED ≠ FAIL (BLOCKED means verification was prevented).
Report. Overall verdict + flow-by-flow results + evidence paths.

Output format

## Acceptance Report — <feature>, stage: <stage>, surface: <surface>

**Verdict:** PASS | FAIL | PARTIAL

**Overview:** <1-2 sentences>

### Flows verified
- [PASS] <flow name> — <what was verified>
- [FAIL] <flow name> — <what broke + evidence path + reproduction steps>
- [BLOCKED] <flow name> — <what prevented verification>

### Evidence
- <surface-appropriate paths>

### Observations beyond scope
- <things noticed but outside the stage checklist — optional>

### Blockers
- <what prevented full coverage, and what's needed to unblock — include only if PARTIAL>

Rules

Report PASS only if every checklist item completes successfully.
Capture evidence as you go — no evidence, no verdict.
Report DEFERRED if the system won't launch; do not guess at behavior.
Don't upgrade stage or swap surfaces without the caller asking.
Don't modify the system under test. Observe and report.
Surface breaks with reproduction steps. Do not fix them.
For multi-surface features, run each surface separately and consolidate.

Mehr aus diesem Repository

gleiches Repository

test-driven-development

jasonjgarcia24/claude-dev-team

Drives development with tests. Write a failing test before writing code that makes it pass; for bugs, reproduce with a test before attempting a fix (the Prove-It Pattern). Use when implementing any new logic or behavior, fixing any bug, modifying existing functionality, adding edge case handling, or making any change that could break existing behavior. Do NOT use for pure configuration changes, documentation updates, or static content changes with no behavioral impact. Covers the RED/GREEN/REFACTOR cycle, the test pyramid and test-size model, writing-good-tests patterns (state over interactions, DAMP over DRY, real over mocks, Arrange-Act-Assert), and common anti-patterns. For browser runtime verification, combine with the `browser-testing-with-devtools` skill.

2026-04-241

browser-testing-with-devtools

jasonjgarcia24/claude-dev-team

Verifies browser-rendered changes against live runtime via Chrome DevTools MCP. Use when building or debugging anything that renders in a browser, inspecting the DOM, capturing console errors, analyzing network requests, profiling Core Web Vitals, or verifying visual output with real runtime data. Do NOT use for backend-only changes, CLI tools, non-UI code, or when Pepper + test-driven-development already covers the scenario with automated tests.

2026-04-241

code-review-and-quality

jasonjgarcia24/claude-dev-team

Multi-axis code review before merge across correctness, readability, architecture, security, and performance. Use before merging any PR, after completing a feature, when evaluating code produced by another agent or model, during refactors, or after a bug fix (review both the fix and the regression test). Produces categorized findings (Critical / Important / Suggestion / Nit) and an APPROVE or REQUEST CHANGES verdict. Do not use for deep security audits (use security-and-hardening + Barb), for verifying a running feature end-to-end (use acceptance-exploration + Negev), for writing tests (use test-driven-development), or for committing curated changes (use git-workflow-and-versioning + Hubert).

2026-04-241

git-workflow-and-versioning

jasonjgarcia24/claude-dev-team

Structure git workflow practices — atomic commits, trunk-based branching, descriptive messages, and change summaries. Use when making any code change that gets committed, when splitting a messy working tree, when naming a branch, when writing a commit message, or when cleaning up history. Invoked most often by the `hubert` agent, which layers persona-specific execution (secrets scanning, 70-char subject cap, result-line contract). Do not use for force-pushing, rebasing published history, or PR creation — those are out of scope.

2026-04-241

security-and-hardening

jasonjgarcia24/claude-dev-team

Harden web application code against vulnerabilities during development. Use while writing any feature that accepts untrusted data, handles authentication or sessions, stores or transmits sensitive information, integrates with third-party APIs, accepts file uploads, or exposes webhooks and callbacks. Covers OWASP Top 10 prevention patterns, input validation at system boundaries, parameterized queries, output encoding, secrets management, rate limiting, session hardening, and the three-tier "always / ask first / never" boundary system. Do not use for post-implementation security audits, threat modeling of finished systems, or vulnerability reports — use the `barb` / `security-auditor` agent for that. This skill is for building secure code; Barb is for auditing built code.

2026-04-241

name

acceptance-exploration

description

Acceptance Exploration

Run the finished feature end-to-end and return a stage-appropriate pass/fail verdict with evidence. Tests verify code; acceptance exploration verifies the feature.

Invoked primarily by the negev agent (~/.claude/agents/acceptance-explorer.md). Follow directly when no persona is needed.

Two required inputs

Every exploration needs both:

Stage — how deep to probe: prototype / MVP / beta / GA
Surface — how to drive and observe: browser / cli / http-api / agent-skill / other

Stop and ask if either is missing. Stage is semantic (what to check). Surface is mechanical (how to check).

Also confirm: the spec (what the feature should do) and launch instructions (how to start it).

Stages

Deeper stages include everything from lighter stages. Checklists are semantic — apply them across all surfaces.

Prototype — "the core idea demonstrates"

Happy path runs start-to-finish without crashing
Core value proposition is observable in the output
No uncaught errors on the primary flow

MVP — "the feature is usable"

All prototype checks, plus:

Top 3 consumer flows complete successfully
Error states produce useful output (no blank screens, silent exits, or raw stack traces leaked to end users)
Transport layer clean (no 5xx, no non-zero exit on valid input)

Beta — "it handles real use"

All MVP checks, plus:

Edge cases: empty input, very long input, invalid input, duplicate/rapid invocations
Error recovery: retries, degraded-dependency handling, timeouts
Input-variant coverage across realistic consumer patterns

GA — "production-ready"

All beta checks, plus:

Performance meets any stated budget
Observability: failures leave useful traces; non-generic error messages
Graceful degradation (no silent failures, no infinite hangs)
Surface-specific hardening (see surface reference)

Surfaces

Pick the matching reference file and load only that one:

Browser app → references/surface-browser.md
CLI tool → references/surface-cli.md
HTTP API → references/surface-http-api.md
Agent or skill → references/surface-agent-skill.md
Other (daemon, extension, MCP server, library without a public endpoint) — adapt method; document the surface description in the report

Each reference covers: probing tool, how to drive the surface, evidence to capture, and surface-specific hardening items per stage.

If a feature spans multiple surfaces (e.g., CLI + API), run acceptance against each separately and consolidate in the verdict.

Method

Load the spec. Read the task, PRD, issue, or spec file.
Confirm inputs. Stage + surface + launch instructions. Stop and ask if any are missing.
Load the surface reference. Read only the one that matches.
Launch. Start the system. Verify baseline reachability before probing.
Drive the flows. Walk the stage checklist using the surface's probing tool. One flow at a time.
Capture evidence as you go. Save to ./acceptance-evidence/<YYYY-MM-DD-HHMM>/<flow-name>/ if running in a project directory.
Score. For each checklist item: PASS, FAIL, or BLOCKED. BLOCKED ≠ FAIL (BLOCKED means verification was prevented).
Report. Overall verdict + flow-by-flow results + evidence paths.

Output format

## Acceptance Report — <feature>, stage: <stage>, surface: <surface>

**Verdict:** PASS | FAIL | PARTIAL

**Overview:** <1-2 sentences>

### Flows verified
- [PASS] <flow name> — <what was verified>
- [FAIL] <flow name> — <what broke + evidence path + reproduction steps>
- [BLOCKED] <flow name> — <what prevented verification>

### Evidence
- <surface-appropriate paths>

### Observations beyond scope
- <things noticed but outside the stage checklist — optional>

### Blockers
- <what prevented full coverage, and what's needed to unblock — include only if PARTIAL>

Rules

Report PASS only if every checklist item completes successfully.
Capture evidence as you go — no evidence, no verdict.
Report DEFERRED if the system won't launch; do not guess at behavior.
Don't upgrade stage or swap surfaces without the caller asking.
Don't modify the system under test. Observe and report.
Surface breaks with reproduction steps. Do not fix them.
For multi-surface features, run each surface separately and consolidate.