$pwd:

recipe-fix-ticket

Name: Recipe Fix Ticket
Author: MetaMask

// Fix a MetaMask bug from a Jira/GitHub ticket using recipe-backed validation. Use when an agent needs to reproduce or understand an existing failure, implement a minimal fix, prove the acceptance criteria with a recipe, and prepare reviewer-ready evidence.

$ git log --oneline --stat

stars:15

forks:5

updated: May 28, 2026, 08:01

File explorer

6 files

SKILL.md

readonly

related-skills.json

Same repository

recipe-dev.md

from "MetaMask/skills"

Build a MetaMask feature, investigation, or product change from a clear task/ticket with acceptance criteria and recipe-backed validation. Use when an agent should implement desired behavior without first reproducing an existing bug, prove the happy path in a live Mobile or Extension runtime when applicable, package evidence, and stop for human review.

2026-05-28 · 15 stars

recipe-evidence.md

from "MetaMask/skills"

Format recipe run outputs into concise PR-ready validation evidence for MetaMask reviewers. Use when an agent has recipe artifacts, screenshots, logs, or trace output and needs a clear PR comment or description section.

2026-05-28 · 15 stars

recipe-harness.md

from "MetaMask/skills"

Install, verify, and clean up MetaMask recipe runtimes for Mobile and Extension checkouts. Use before recipe-cook, recipe-wallet-control, recipe-evidence, or recipe-quality when runtime evidence needs CDP/browser/mobile recipe execution, especially on historical commits or fresh checkouts.

2026-05-28 · 15 stars

recipe-wallet-control.md

from "MetaMask/skills"

Control MetaMask debug wallets through harness-backed wallet-aware setup/unlock, account selection, route navigation, screenshots, UI interaction, CDP/state introspection, fixture handling, recovery, and recipe handoff. Use when an agent needs to validate Mobile or Extension wallet behavior end-to-end or collect PR evidence on a live debug runtime.

2026-05-28 · 15 stars

recipe-cook.md

from "MetaMask/skills"

Author, run, and refine executable per-PR validation recipes for MetaMask work. Use when an agent needs to turn acceptance criteria, changed behavior, or reviewer requests into a portable recipe graph with concrete proof targets, project-native actions, and reviewable artifacts. Recipes may use recipe-wallet-control when available, but must not depend on it.

2026-05-27 · 15 stars

recipe-quality.md

from "MetaMask/skills"

Critique per-PR validation recipes and their evidence. Use when an agent or reviewer needs a structured verdict on acceptance-criteria coverage, recipe graph quality, adapter independence, evidence fit, flake risk, and the highest-value fixes before trusting a recipe.

2026-05-27 · 15 stars

package.json

"author": "MetaMask"

"repository": "MetaMask/skills"

$ useful --forSOC

Software Developers · Computer and Mathematical Occupations · 15-1252 · L4

name	recipe-fix-ticket
description	Fix a MetaMask bug from a Jira/GitHub ticket using recipe-backed validation. Use when an agent needs to reproduce or understand an existing failure, implement a minimal fix, prove the acceptance criteria with a recipe, and prepare reviewer-ready evidence.
maturity	experimental

Recipe Fix Ticket

recipe-fix-ticket is for bug-fix work that must end with proof, not just a patch. Unlike /mms-recipe-dev, it starts by reproducing or understanding an existing failure before making the smallest fix.

If the user invoked /mms-recipe-fix-ticket, stay in the fix-ticket protocol even when the ticket is a POC/debug change or looks feature-like. Do not silently downgrade baseline/repro gates to N/A because the change is "new". For visual/stateful tickets, still create a before/no-state recipe or record the specific failed /mms-recipe-cook attempt that made baseline proof impossible.

Runner Invocation Compatibility

Different agent runners expose installed skills differently:

Claude/Cursor: use the slash-command form, for example /mms-recipe-fix-ticket <ticket>.
Codex/OpenAI agents: use the skill trigger form, for example $mms-recipe-fix-ticket <ticket>.

If a human pasted the wrong runner-specific command shape and the runner rejects it, immediately continue by translating to the correct equivalent command. Do not stop or ask the human to re-run the command. Record the runner-specific invocation correction in the evidence package and continue through the full checklist.

Recommended Codex/OpenAI-agent invocation shape:

$mms-recipe-fix-ticket <ticket-or-task-url>

For /mms-recipe-dev, use $mms-recipe-dev <ticket-or-task-url-or-task-prompt>.

Live Checklist File Protocol

Before product edits, before implementation planning, and before telling the human to go get coffee, create a live checklist file from the installed platform reference:

# Pick mobile or extension after identifying the target repo from cwd/ticket.
.agents/skills/mms-recipe-fix-ticket/scripts/init-checklist.sh --platform <mobile|extension> --slug <ticket-or-task-slug>

If the skill is installed somewhere else, run the same script from the installed skill directory, or manually copy the matching reference checklist to:

temp/tasks/<skill>/<timestamp>-<slug>/CHECKLIST.md

The copied CHECKLIST.md is the source of truth for progress. It must contain [ ] checkboxes. After every gate:

edit CHECKLIST.md from [ ] to [x] for the completed gate;
add the artifact path, command, result, or blocker under that gate;
immediately continue to the next unchecked gate.

Do not rely on private scratch notes as the progress record. Do not final-answer with unchecked required gates unless the remaining gates are explicitly marked BLOCKED: <concrete reason> or N/A: <reason> in CHECKLIST.md. No SIGNAL.json is required for interactive skill runs.
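
The checkbox bookkeeping above can be sketched as two small shell helpers. This is a minimal sketch, not part of the installed skill: `open_gates` and `mark_gate_done` are hypothetical names, and it assumes a `BLOCKED:` or `N/A:` annotation sits on the same line as the gate it closes.

```shell
# Hypothetical helper: count gates that are still open in a checklist file.
# Assumption: a gate is open when its line contains "[ ]" and carries no
# "BLOCKED:" or "N/A:" annotation on that same line.
open_gates() {
  checklist="$1"
  grep -F '[ ]' "$checklist" | grep -v -e 'BLOCKED:' -e 'N/A:' | wc -l | tr -d ' '
}

# Hypothetical helper: flip the first unchecked gate whose text contains $2.
mark_gate_done() {
  checklist="$1"
  gate="$2"
  tmp="$(mktemp)"
  awk -v gate="$gate" '
    !done && index($0, "[ ]") && index($0, gate) { sub(/\[ \]/, "[x]"); done = 1 }
    { print }
  ' "$checklist" > "$tmp" && mv "$tmp" "$checklist"
}
```

A run would call `mark_gate_done CHECKLIST.md "<gate text>"` after each gate and treat a non-zero `open_gates CHECKLIST.md` as a reason not to final-answer.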

Karpathy-Style Execution Discipline

Apply this discipline throughout the workflow:

Think before coding: if task source, target surface, ACs, fixture state, or evidence requirement is ambiguous, record the ambiguity in CHECKLIST.md and ask once before product edits.
Simplicity first: implement the smallest reversible change that satisfies the stated ACs. Do not add abstractions, generic actions, or speculative configuration unless the recipe proof requires it.
Surgical changes: every changed product line must trace to an AC. Do not refactor adjacent code, move existing logic, or clean unrelated files.
Goal-driven execution: each checklist gate must have a concrete verifier/path/result. Do not mark [x] from intent, code inspection, or tests that do not cover the gate.

Clean Per-Run Branch Protocol

Every new validation loop must run on a clean, model-specific branch so the human can compare Claude/Codex/Cursor diffs afterwards. Before product edits:

ensure the worktree has no unstaged product changes from a previous loop; if it does, stash them with a descriptive adr58-validation-... message and record the stash in CHECKLIST.md;
record the base branch and base SHA in CHECKLIST.md;
create or switch to a fresh branch named with the runner/model, skill, ticket or task slug, and run id. If the source is a Jira ticket, the branch name must start with the lowercased Jira key followed by a hyphen on both Mobile and Extension targets so regular MetaMask/Farmslot tooling can associate it, for example tat-3216-adr58-codex-mms-recipe-fix-ticket-fresh2. For non-Jira prompts, use a stable sanitized task slug such as adr58-codex-mms-recipe-fix-ticket-demo-fresh1;
keep all product edits for that loop on that branch only;
include branch name, base SHA, and git diff --stat <base>...HEAD in the final evidence package.

If the branch cannot be made clean, mark the branch gate BLOCKED before implementation. Do not mix multiple model attempts on the same product branch.
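
The branch-naming rule above can be sketched as a small shell helper. `run_branch_name` is a hypothetical name, and the sketch assumes the caller has already assembled the runner, skill, and run-id portion of the slug.

```shell
# Hypothetical helper: build a per-run branch name.
# $1 = Jira key (for example TAT-3216), or empty for non-Jira prompts.
# $2 = the remaining slug (runner/model, skill, task slug, run id).
run_branch_name() {
  jira_key="$1"
  slug="$2"
  if [ -n "$jira_key" ]; then
    # Jira-sourced branches start with the lowercased key followed by a hyphen.
    printf '%s-%s\n' "$(printf '%s' "$jira_key" | tr '[:upper:]' '[:lower:]')" "$slug"
  else
    printf '%s\n' "$slug"
  fi
}
```

For example, `git switch -c "$(run_branch_name TAT-3216 adr58-codex-mms-recipe-fix-ticket-fresh2)"` would create the branch named in the example above.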

Clean Generated Harness State Protocol

A clean product worktree is not enough. Generated, ignored harness/runtime outputs can make a fresh run reuse stale recipe code or stale CDP metadata. Before the proof plan or harness install gate, record and clean these generated paths for the current target repo unless the caller explicitly asks to preserve runtime state for debugging:

rm -rf temp/agentic/recipes .agent/recipe-harness/extension .agent/recipe-harness/mobile
rm -rf temp/tasks/<this-run>/harness

Then reinstall the lower-level harness from the currently installed skill. Do not use --force; deleting known generated outputs first is the idempotent refresh. Do not edit .agents/skills/..., .claude/skills/..., or harness source files during product validation.

For Extension, prefer task-local harness output when writing new recipes so each run is isolated from shared temp/agentic/recipes state:

.agents/skills/mms-recipe-harness/scripts/recipe-harness.sh extension install \
  --target . \
  --out temp/tasks/<this-run>/harness/recipes

Use the same task-local validate-recipe.sh path for dry-run and live recipe runs. If an existing shared harness install must be reused, verify its manifest and content hash in CHECKLIST.md; do not silently reuse stale ignored files.
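
The "record and clean" step can be sketched as one helper that writes each generated path into the checklist before deleting it. `clean_generated_state` is a hypothetical name and the log wording is an assumption; pass it only the generated paths listed above.

```shell
# Hypothetical helper: record each generated path in the checklist, then remove it.
# $1 = checklist file; remaining arguments = generated paths to clean.
clean_generated_state() {
  checklist="$1"
  shift
  for path in "$@"; do
    if [ -e "$path" ]; then
      printf '  cleaned generated path: %s\n' "$path" >> "$checklist"
      rm -rf -- "$path"
    else
      printf '  generated path absent: %s\n' "$path" >> "$checklist"
    fi
  done
}
```

Recording absent paths as well keeps the checklist honest about what the run actually started from.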

First Response to the Human

After creating CHECKLIST.md, immediately acknowledge the handoff with a short, friendly message that includes the checklist path the human can monitor. Use this exact spirit, adapted only if the user gave a stricter tone:

Ok, relax and go get a coffee ☕. I’ll take this from ticket → fix → recipe → evidence package. You can monitor live progress in <CHECKLIST.md path>, and I’ll report back when it is done or concretely blocked.

After that message, continue autonomously. Do not wait for the user after the acknowledgement. If Jira/MCP cannot fetch the ticket, ask for the missing ticket text once; after the user pastes it, resume at checklist step 1 and continue to the recipe/evidence gates.

Runtime Startup Approval Gate

Before any command that can start or restart a live runtime, write the exact command and approval state in CHECKLIST.md. This includes Mobile Metro, simulator/app launch, bundle prewarm/cache-warming helpers, recipe-harness live, Extension webpack/watch, Chrome/CDP launch, and any wrapper that would prepare/build runtime artifacts.

Respect the caller/orchestrator policy for the lane. If the current goal, checklist, or human says runtime startup needs explicit approval, do not work around that by creating manual-prewarm, nohup, background tmux, sleep/detached shell, ad-hoc cache-warming helpers, repo aliases such as yarn a:ios / yarn a:android, or direct preflight/start scripts such as scripts/perps/agentic/start-metro.sh --launch. Instead record BLOCKED: pending runtime-start approval with the exact command you would run and continue only after approval is provided.

When approval exists, prefer the installed harness delegate and cache/watch-first commands. Do not run Mobile auto, default, clean, rebuild-native, manual bundle prewarm/cache warming, Extension --start-test-watch, or Extension prepare/build unless that heavier mode was explicitly approved.
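
A minimal sketch of the approval gate, assuming approval is signalled by the `RECIPE_RUNTIME_START_APPROVED=1` variable that this skill names under Portable Runtime Discovery. `runtime_start_gate` is a hypothetical name; it only prints the line to record in CHECKLIST.md and never runs the command itself.

```shell
# Hypothetical gate: print the approval state for a runtime-start command.
# Assumption: RECIPE_RUNTIME_START_APPROVED=1 is the only approval channel modelled.
runtime_start_gate() {
  cmd="$1"
  if [ "${RECIPE_RUNTIME_START_APPROVED:-0}" = "1" ]; then
    printf 'APPROVED: %s\n' "$cmd"
  else
    printf 'BLOCKED: pending runtime-start approval (%s)\n' "$cmd"
    return 1
  fi
}
```

The non-zero return lets a wrapper stop before any runtime process is started.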

Portable Runtime Discovery Gate

Before choosing any runtime command or port, perform read-only discovery in this order and record the result in CHECKLIST.md:

1. caller-provided runtime context: RECIPE_RUNTIME_CONTEXT JSON path, RECIPE_SLOT_ID, RECIPE_CDP_PORT, CDP_PORT, RECIPE_METRO_PORT, METRO_PORT, RECIPE_WATCHER_PORT, WATCHER_PORT, IOS_SIMULATOR, SIMULATOR, ANDROID_SERIAL, ADB_SERIAL, and comparable env vars;
2. repo-local generic runtime context: temp/runtime/agentic-runtime.json (and temp/runtime/agentic-runtime.env if you want to source it). If this file has strict: true, use only the recorded slot/port/device values and do not probe or fall back to other local runtimes. If it has runtimeStart.approved: true plus runtimeStart.command, pass recovery through /mms-recipe-harness launch/live/verify and let the harness run that approved command; outside Farmslot, any developer/tool may provide the same context or RECIPE_RUNTIME_START_APPROVED=1 with RECIPE_RUNTIME_START_CMD;
3. installed recipe-harness/delegate summaries or manifests in the current checkout that identify an already-owned runtime;
4. currently listening local CDP/device endpoints only as fallbacks, never as a reason to ignore caller-provided context.

Do not assume default ports such as 9222 when no context was supplied. Do not turn missing runtime context into a raw product build (yarn build:test, start:test, native rebuild, or direct Chrome/simulator launch). Use static verify/no-start checks where useful, then record the missing runtime context or needed harness command as the blocker.
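
The first discovery step can be sketched for the CDP port alone. `resolve_cdp_port` is a hypothetical name, and the precedence of `RECIPE_CDP_PORT` over `CDP_PORT` is an assumption; the point of the sketch is that a missing value ends in a blocker, never in a guessed default.

```shell
# Hypothetical resolver: take the CDP port from caller-provided env vars only.
# Assumption: RECIPE_CDP_PORT wins over CDP_PORT when both are set.
resolve_cdp_port() {
  if [ -n "${RECIPE_CDP_PORT:-}" ]; then
    printf '%s\n' "$RECIPE_CDP_PORT"
  elif [ -n "${CDP_PORT:-}" ]; then
    printf '%s\n' "$CDP_PORT"
  else
    # No fallback to 9222: missing context is recorded as a blocker.
    printf 'BLOCKED: missing runtime context\n' >&2
    return 1
  fi
}
```

The later discovery steps (repo-local context file, delegate manifests, listening endpoints) are left out here because their file layouts are not specified in this document.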

Harness Boundary Gate

This high-level skill owns product code, recipes, and evidence. It does not own the lower-level harness implementation during a product/ticket run. Do not edit installed delegate files such as .agents/skills/mms-recipe-harness, .claude/skills/mms-recipe-harness, .cursor/rules/mms-recipe-harness, .agent/recipe-harness, or copied harness adapter scripts while fixing a product ticket or validating this high-level workflow.

If the harness fails, inspect only enough logs/summaries to classify the failure and capture artifact paths. Rerun only with documented harness flags and the discovered runtime context. If success would require changing harness code, stop the runtime lane as BLOCKED: harness defect (or PASS-WITH-GAPS for product code already checked) and report the exact summary/log path. Do not patch the harness unless the explicit task is a harness-maintenance task.

Delegate Recovery Decision Tree

When a live proof is blocked, escalate through the lower-level delegates before using ad-hoc commands or packaging partial evidence:

Runtime/CDP unavailable — use /mms-recipe-harness (or the installed delegate path) to launch, live-verify, or recover the runtime with the caller-approved context/prepare command. The high-level skill should not run raw build/watch/Chrome/simulator commands when a harness path exists. If no runtime context or start approval exists, stop with BLOCKED: missing runtime context or BLOCKED: pending runtime-start approval.
Wallet locked, wrong account, onboarding, route, or app-ready blocker — use /mms-recipe-wallet-control (or the installed delegate path) for unlock, account, route, and wallet readiness primitives. Do not invent private aliases or mutate controller/store state to prove user-visible ACs.
Stateful product setup unavailable — use recipe/harness/wallet-control supported flows or documented pre-start fixtures. If no real flow/fixture exists, record a fixture/state setup blocker; do not manufacture the target state.
Delegate cannot recover — stop at the concrete blocker with command/log paths. Fix the delegate/runtime/root cause and restart from clean generated harness state rather than continuing to a partial evidence package, unless the human explicitly asks for partial packaging.

CDP Bootstrap Failure Stop Gate

For Extension visual or mixed ACs, a live recipe that fails before CDP session bootstrap (for example ECONNREFUSED 127.0.0.1:<port>, missing /json/version, no extension target, or no summary.json/trace.json emitted) is not a quality/evidence packaging condition. Stop the product validation lane, record BLOCKED: CDP bootstrap failed, preserve the exact command/log path, and fix the runtime/preflight root cause before restarting from a clean generated harness state.

Only continue to recipe-quality/evidence packaging after either:

the live recipe emitted normal runtime artifacts (summary.json, trace.json, artifact manifest, screenshots/video where applicable); or
the human explicitly asks for a partial package despite the bootstrap blocker.

Do not convert pre-bootstrap CDP failure into pass-with-gaps. Do not keep retrying inside the same dirty product run. Fix the root cause, clean generated outputs, and restart.
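
One way to apply this stop gate mechanically is to check whether the run emitted its normal artifacts. This is a sketch under assumptions: `bootstrap_verdict` is a hypothetical name, and it assumes summary.json and trace.json sit directly in the run's output directory.

```shell
# Hypothetical check: did the live recipe get past CDP session bootstrap?
# Assumption: summary.json and trace.json live directly under the run directory.
bootstrap_verdict() {
  run_dir="$1"
  if [ -s "$run_dir/summary.json" ] && [ -s "$run_dir/trace.json" ]; then
    printf 'CONTINUE: runtime artifacts present\n'
  else
    printf 'BLOCKED: CDP bootstrap failed\n'
    return 1
  fi
}
```

An empty file counts as missing here (`-s`), so a truncated summary.json does not pass as a normal runtime artifact.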

Ticket Source-of-Truth Gate

The ticket text or pasted task details are the source of truth. If Jira, MCP, WebFetch, or browser access returns a login wall, timeout, permission error, empty issue, or ambiguous page, ask the human for the ticket summary, description, requirements, and acceptance criteria before coding. Do this even if branch names, prior artifacts, local task folders, web search results, or repo history look suggestive.

Do not infer or rewrite acceptance criteria from branch names, stale ADR58 artifacts, previous validation runs, or web search. Do not change the target surface (for example Perps home vs market detail), gating condition, exact copy, or required state unless the ticket text says so. If ticket details remain unavailable, stop before product edits and report BLOCKED: missing ticket source of truth; do not implement a guessed patch.

Before editing product code, print the extracted AC matrix with the exact target surface, state precondition, copy, styling, and proof requirement. If any field is inferred rather than stated, label it UNKNOWN and ask for the missing ticket text instead of proceeding.

Load only what applies:

Runtime setup: /mms-recipe-harness
Recipe authoring: /mms-recipe-cook
Recipe critique: /mms-recipe-quality
PR evidence formatting: /mms-recipe-evidence
Target-repo fix notes are appended below when installed.
Target checklist reference: load only one of references/metamask-mobile-checklist.md or references/metamask-extension-checklist.md.

Lower-Level Skill Invocation Contract

The lower-level recipe skills are required gates, not optional background reading. For every delegate gate, do one of these explicitly:

invoke the named skill using the runner-specific command form (slash for Claude/Cursor, $ for Codex) if the runner supports nested skill calls; or
if nested slash calls are unavailable, open the installed skill file and follow it as the delegate protocol:
- .claude/skills/mms-recipe-harness/SKILL.md
- .claude/skills/mms-recipe-cook/SKILL.md
- .claude/skills/mms-recipe-quality/SKILL.md
- .claude/skills/mms-recipe-evidence/SKILL.md
- equivalent .agents/skills/.../SKILL.md or .cursor/rules/.../RULE.md when running under Codex/OpenAI agents or Cursor.

For each delegate, write Invoking mms-recipe-... in CHECKLIST.md and record the delegate output path or blocker. Direct ad-hoc app scripts, controller evals, DOM/fiber checks, screenshots, or unit tests do not satisfy these gates unless they are wrapped into the recipe protocol with an executable recipe path, exact command, summary.json, trace.json, artifact manifest, recipe-quality critique, and evidence package.

Do not claim recipe tooling is absent just because a repo-root validate-recipe.js or scripts/recipe-* file is missing. Before declaring a recipe/harness blocker, inspect the installed delegate locations for the current runner:

.claude/skills/mms-recipe-harness/SKILL.md and scripts/ / adapters/;
.agents/skills/mms-recipe-harness/SKILL.md and scripts/ / adapters/;
.cursor/rules/mms-recipe-harness/RULE.md and copied references/scripts.

If the installed delegate exists, follow it. If no executable recipe run exists (no recipe.json plus summary.json and trace.json from the harness), final status for runtime/visual work is FAIL or BLOCKED: no recipe protocol, not PASS-WITH-GAPS. Ad-hoc CDP probes, handwritten evidence packages, black screenshots, or “human should visually confirm” instructions are supporting notes only; they do not satisfy harness/cook/quality/evidence gates.
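
The delegate lookup described above can be sketched as a loop over the three install locations. `find_delegate` is a hypothetical name, and the search order is an assumption; a real run would prefer the location that matches the current runner.

```shell
# Hypothetical lookup: print the first installed delegate file for a skill name.
# $1 = repo root, $2 = delegate name such as mms-recipe-harness.
find_delegate() {
  root="$1"
  name="$2"
  for candidate in \
    "$root/.claude/skills/$name/SKILL.md" \
    "$root/.agents/skills/$name/SKILL.md" \
    "$root/.cursor/rules/$name/RULE.md"
  do
    if [ -f "$candidate" ]; then
      printf '%s\n' "$candidate"
      return 0
    fi
  done
  return 1
}
```

Only when `find_delegate . mms-recipe-harness` fails for every location is it fair to record a recipe/harness blocker.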

No Manufactured Runtime State

Do not prove a user-visible AC by mutating app/UI/runtime state directly. Forbidden proof setup includes window.stateHooks, stateHooks.submitRequestToBackground, Redux/store writes, React/fiber mutation, DOM injection, controller/provider state mutation, or any ad-hoc helper that directly creates, closes, clears, seeds, or inserts the target value/position/banner into the running app. These may be useful for diagnosis, but they are not valid AC proof.

Valid proof must use one of:

the real user flow encoded as a recipe;
a documented fixture/profile loaded before app start by the harness; or
an honest BLOCKED/PASS-WITH-GAPS verdict that names the missing fixture or runtime capability.

If a recipe uses state injection or controller/background calls to create or clear the exact condition being asserted (for example injecting or closing a BTC Perps position, mutating a controller value, inserting a banner/form value, or changing a DOM node), the corresponding stateful AC is not clean proof. It may be included as diagnostic/supporting evidence only; keep the affected AC at PASS-WITH-GAPS/BLOCKED until a real UI flow or documented harness-owned pre-start fixture proves the state. Do not call it code-proven, visually proven, or all ACs met; classify it as a fixture/recipe gap and feed that back to /mms-recipe-cook.

Non-Negotiable Proof Contract

For user-visible, stateful, or acceptance-criteria-driven tickets, do not stop at a code diff, unit tests, or type checks. A complete fix package must include:

the ticket URL or copied ticket prompt;
a product diff summary excluding harness/generated files;
/mms-recipe-harness install/verify status and artifact path;
an executable recipe path;
the exact recipe run command;
summary.json;
trace.json;
artifact-manifest.json or evidence manifest;
screenshots/video for reviewer-visible UI claims;
/mms-recipe-quality critique;
one improvement/rerun cycle, or an explicit note that the first pass already meets the evidence bar;
a PR-ready evidence summary;
explicit gaps for any unexercised proof target.

If the runtime cannot create the required state (for example a BTC open-position fixture is unavailable), mark that proof target as blocked/gap. Do not claim the acceptance criteria are met from code inspection or unit tests alone. If any acceptance criterion remains unrun, blocked, or covered only by weaker fallback proof, the final verdict is PASS-WITH-GAPS or PARTIAL, not complete, all ACs met, or ready. DOM-rendered fallback screenshots created because native screenshot capture timed out/blank count as weaker fallback proof for visual ACs: package them, read them, and keep the gap in the final verdict unless a native screenshot/video (or explicitly accepted alternate artifact) also exists. Packaging the gap is required, but it does not turn the gap into a pass.

For visual or mixed ACs, "code-proven" is not a valid proof status. Code review can prove minimality or placement intent, but visible copy/color/layout/ordering remain unproved until a live runtime recipe produces screenshot/video evidence that the runner reads visually. If CDP/browser is unavailable, mark those ACs BLOCKED: no runtime visual evidence.

Runtime Failure Recovery Gate

A failed live recipe node is not automatically a final blocker. Before final packaging, inspect the failed node in summary.json/trace.json, read any failure screenshot or last-screen artifact, and decide whether the failure is a recipe/action sequencing issue that can be fixed locally. Typical fixable cases include wrong route, below-the-fold or obscured target, unstable click/press, missing wait for hydration, stale selector, wrong browser context, or a broad shared flow landing on the wrong screen.

If the failure is plausibly recipe/action quality, patch the recipe or shared flow with the smallest navigation/wait/scroll/stable-click correction and rerun the smallest meaningful recipe segment or the full recipe. Do this before returning a final summary. Only mark the proof target BLOCKED after concrete retry attempts still fail, and then report the exact failed node, command, artifact paths, observed screen, and the recipe/harness improvement needed.

Do not loop indefinitely on native rebuilds, Metro/CDP reconnects, or simulator launch churn. After one harness/preflight recovery plus one targeted rerun, cap the lane as BLOCKED_RUNTIME if CDP/Metro remains unstable, and package the transcript plus runtime logs instead of spending unbounded time rebuilding.

For stateful setup flows, branch on the observed state. If the recipe wants to open a BTC position but the account already has BTC, do not wait for an open-long button that is correctly absent; either route directly to the open-position visual AC, or first close/clear via the documented real UI/harness flow and then rerun the no-state lane. Existing-state branching must be recorded in the recipe notes and verdict.

Unit tests, DOM/fiber/controller assertions, or a manual screenshot suggestion do not replace the missing live recipe proof. They may support the code diff, but visual/mixed ACs with a failed runtime node remain PASS-WITH-GAPS, PARTIAL, or BLOCKED by AC number.

Runtime Precondition Recovery Gate

A recipe that fails before the first workflow node because preconditions are not ready is not a final package yet. If summary.json/failure.json reports wallet.unlocked, Login, perps.ready_to_trade, perps.sufficient_balance, CLIENT_NOT_INITIALIZED, no CDP target, or a stopped simulator/browser, check runtime-start approval before attempting recovery:

If runtime-start approval has not been granted, do not run prepare/watch/launch/simulator-boot or any command that starts or restarts a runtime process. Run static/no-start harness checks if useful, record BLOCKED: pending runtime-start approval with the exact command that would be needed, and wait for explicit approval.
If runtime-start approval exists, invoke/follow /mms-recipe-harness verify/preflight for the platform, using the provided RECIPE_RUNTIME_CONTEXT, RECIPE_SLOT_ID, CDP_PORT, RECIPE_CDP_PORT, IOS_SIMULATOR, SIMULATOR, ADB_SERIAL, ANDROID_SERIAL, WATCHER_PORT, METRO_PORT, or equivalent caller-provided env vars first;
for wallet/login readiness, invoke/follow /mms-recipe-wallet-control or the repo harness wallet setup/unlock command rather than asking the human to run a private alias;
record the exact recovery command, output artifact, app-state/status result, and whether it reached a route/account/perps-ready state;
rerun the same recipe command after recovery, or explain the concrete external reason recovery could not run.

Static-only harness verification plus a failed live precondition attempt is a useful artifact, but it is not enough to close a stateful/visual proof lane when the target runtime env was available. If recovery still leaves the app on the login screen or CLIENT_NOT_INITIALIZED, package the lane as PASS-WITH-GAPS/BLOCKED_PRECONDITIONS and name the missing runtime setup explicitly.

Stateful AC Setup Gate

For acceptance criteria that depend on a required app state, the recipe must explicitly create or verify that state before asserting the UI. Do not rely on whatever state happens to be present in the active browser, simulator, wallet, or prior validation run. For example, a ticket with both no BTC position and open BTC position ACs needs separate no-state and with-state setup paths: close/clear through a real UI/harness flow, open/create through a real UI/harness flow, or load a documented harness-owned pre-start fixture.

A read-only observation recipe is acceptable only for investigation or for an AC whose required state is already the subject being observed and cannot be changed safely; in that case the affected AC must be labelled PASS-WITH-GAPS/BLOCKED with the missing setup flow named. If existing shared flows or platform fixtures can attempt the setup, try them before final packaging. Unit tests may support the code path but do not substitute for runtime setup of stateful visual ACs.

Fallback Screenshot Verdict Gate

Treat screenshot artifact metadata as part of the proof, not just the visible bitmap. If a PNG says DOM-rendered fallback evidence, native browser screenshot timed out/blank, or trace.json records a screenshot fallbackReason, then native visual capture failed for that AC. Read and package the fallback PNG, but a visual/mixed AC proven only by that fallback is not a clean visual pass. The final verdict must remain PASS-WITH-GAPS (or PARTIAL/BLOCKED if the fallback does not show the claim), even when summary.json says status: pass and DOM/viewport assertions passed.

Before writing the final verdict or recipe-quality, scan trace.json, screenshot captions, and the PNG header/body for fallback labels. The scan must be explicit in the notes, for example Fallback audit: trace fallbackReason=<n>, PNG DOM fallback labels=<n>. If any visual AC depends on fallback evidence, list the affected AC numbers and the native screenshot/video gap. Do not let a later successful rerun or unit test overwrite this proof-strength classification unless it produced native screenshot/video or another explicitly accepted non-fallback artifact. If recipe-quality or the final summary says native screenshot, no fallback, or clean visual PASS while trace.json or a PNG shows fallback metadata, the quality gate is incomplete; correct the verdict to PASS-WITH-GAPS and rewrite the evidence package before final response.
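
The fallback audit line can be produced with a short scan. This is a sketch under assumptions: `fallback_audit` is a hypothetical name, it counts literal `fallbackReason` occurrences in trace.json, and it assumes the `DOM-rendered fallback` label is stored as uncompressed text inside the PNG. A label that exists only as rendered pixels or compressed metadata would be missed and still needs a visual read.

```shell
# Hypothetical audit: $1 = trace.json; remaining arguments = PNG files to scan.
fallback_audit() {
  trace="$1"
  shift
  # Count every literal occurrence of fallbackReason in the trace.
  trace_hits="$(grep -o 'fallbackReason' "$trace" | wc -l | tr -d ' ')"
  png_hits=0
  for png in "$@"; do
    # -a treats the binary PNG as text so an embedded label can match.
    if grep -a -q 'DOM-rendered fallback' "$png"; then
      png_hits=$((png_hits + 1))
    fi
  done
  printf 'Fallback audit: trace fallbackReason=%s, PNG DOM fallback labels=%s\n' \
    "$trace_hits" "$png_hits"
}
```

Any non-zero count means the affected visual ACs stay at PASS-WITH-GAPS until native screenshot or video evidence exists.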

Mandatory Continuation Gate

After any product code change, the next step is not the final response. The agent must continue into the recipe/evidence pipeline even in an interactive session, even when Jira/Atlassian had to be pasted manually, and even when typecheck/Jest pass.

A final response is forbidden until one of these is true:

A recipe was authored or updated, run against the live target runtime, and the evidence package was produced; or
the runtime proof path was attempted through /mms-recipe-harness, failed for a concrete external reason, and the response clearly labels the recipe proof as BLOCKED rather than claiming the acceptance criteria are proven.

Do not say “all acceptance criteria met”, “all green”, or “ready” from implementation tests alone. Say “code/tests pass; recipe proof still pending” until the recipe artifacts exist.

Do not ask the human whether to proceed with recipe/harness validation. The answer is already yes. If the app, simulator, Metro, browser, or CDP is not currently running, check the Runtime Startup Approval Gate above. If runtime-start approval exists, invoke /mms-recipe-harness and let its verify/preflight path start or recover the runtime. If approval is required and absent, run static/no-start harness checks, record BLOCKED: pending runtime-start approval with the exact command, and wait. Only declare BLOCKED for a concrete external failure after the harness or recipe command was actually attempted with approval.

Honor runtime environment variables first. If RECIPE_RUNTIME_CONTEXT, RECIPE_SLOT_ID, CDP_PORT, RECIPE_CDP_PORT, ADB_SERIAL, simulator/device, or equivalent caller-provided runtime variables are present, use those values in harness verify and recipe commands before probing default ports. Do not claim "no CDP/browser/device" until the env-provided target was attempted and its failure artifact was recorded. If the env-provided runtime fails but a fallback port works, record both; if only fallback probing was done, the runtime gate is incomplete.

Do not ask to commit, create a PR, or package the work as done while any required recipe gate is incomplete. If the product diff and implementation checks are done but the recipe package is missing, the only valid next action is to invoke the next lower-level recipe skill or mark that specific gate BLOCKED with the attempted command and failure artifact.

Minimum post-code sequence for visible/stateful changes:

1. Delegate harness setup/verification to /mms-recipe-harness.
2. Delegate recipe authoring to /mms-recipe-cook.
3. Run the exact recipe command and capture artifacts.
4. Delegate recipe/evidence critique to /mms-recipe-quality; if a subagent tool is available, spawn it as an independent reviewer with the recipe path, AC matrix, and artifact manifest.
5. Read the screenshot PNGs yourself as the final worker gate; verify claimed UI is visible and rerun if weak.
6. Delegate final PR-ready packaging to /mms-recipe-evidence or produce the same evidence package shape.

Final Package Barrier

A recipe PASS is not the end of this skill. After the runtime recipe run, continue through these gates before showing an idle prompt or final response:

open summary.json, trace.json, run log, issue review, and the artifact manifest/evidence manifest;
read every recipe-produced PNG/video, not only ad-hoc screenshots captured before the recipe;
invoke or follow /mms-recipe-quality and record its verdict;
apply one recipe/evidence improvement and rerun, or record the quality verdict that no rerun is needed;
invoke/follow /mms-recipe-evidence and write a PR-ready evidence block/file (for example PR-READY-EVIDENCE.md) with task, diff, commands, artifact paths, screenshot notes, quality verdict, fallback audit, and remaining gaps;
run .agents/skills/mms-recipe-evidence/scripts/package-pr-evidence.js --task <task-dir> (or the runner-equivalent installed path) so the task contains pr-package/pr-desc.md, pr-package/images/ with easy-to-copy filenames, pr-package/package-manifest.json, and pr-package/final-report.md. The pr-desc.md draft must follow the target repo's .github/pull-request-template.md / .github/pull_request_template.md when present. A JSON manifest alone is not a PR-ready evidence package.
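A quick way to confirm the last gate is to check that the four pr-package outputs named above actually exist. The sketch below assumes only those paths; the function name is hypothetical, and it does not inspect file contents or the PR template.

```python
from pathlib import Path
from typing import List

# Paths named in the barrier above, relative to the task directory.
REQUIRED_FILES = (
    "pr-package/pr-desc.md",
    "pr-package/package-manifest.json",
    "pr-package/final-report.md",
)
REQUIRED_DIRS = ("pr-package/images",)


def missing_package_parts(task_dir: str) -> List[str]:
    """Return the required pr-package paths that are absent under task_dir."""
    root = Path(task_dir)
    missing = [p for p in REQUIRED_FILES if not (root / p).is_file()]
    missing += [d for d in REQUIRED_DIRS if not (root / d).is_dir()]
    return missing
```

An empty result means the package shape is present; it says nothing about whether the evidence inside is convincing.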

If you find yourself at the model prompt with steps 13-17 unchecked, that is a workflow failure. Continue immediately from the earliest unchecked gate; do not wait for the human to say "continue".

If /mms-recipe-quality returns pass-with-gaps, continue to evidence packaging, but keep the final verdict as PASS-WITH-GAPS and list each missing AC by number. Do not write "all checklist gates complete" unless every AC proof target is either passed or explicitly outside the ticket scope.
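The verdict rule can be written down as a tiny function. The per-AC status strings used here ("passed", "out_of_scope", anything else counts as missing) are assumptions made for this sketch, not values emitted by /mms-recipe-quality.

```python
from typing import Dict, List, Tuple


def final_verdict(ac_status: Dict[int, str]) -> Tuple[str, List[int]]:
    """Return the final verdict and the AC numbers that still lack proof."""
    missing = sorted(
        number for number, status in ac_status.items()
        if status not in ("passed", "out_of_scope")
    )
    if missing:
        return "PASS-WITH-GAPS", missing
    return "PASS", []
```

Only a `PASS` result with an empty list would justify writing "all checklist gates complete".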

Ordered Checklist Protocol

Maintain the copied CHECKLIST.md file and execute it in order. Also load the target-specific checklist reference installed with this skill (for example .agents/skills/mms-recipe-fix-ticket/references/metamask-mobile-checklist.md or .agents/skills/mms-recipe-fix-ticket/references/metamask-extension-checklist.md; Claude/Cursor installs may expose the same files under their runner-specific skill directories) and use it as the concrete platform checklist. Do not search only for a repo-root references/ folder and declare the checklist absent. After each step, mark it [x], write the artifact/path/result, then immediately continue to the next unchecked step. Do not jump ahead. Do not final-answer with unchecked required steps.

CHECKLIST.md must exist before editing product behavior. Update it after every gate, not only at the end. If you realize you skipped an earlier gate, stop, backfill the missing gate honestly, and continue from the earliest unchecked required step. Do not silently replace this workflow with an implementation/test-only workflow.

Optimization rule: use the lower-level skills as focused delegates instead of manually re-deriving their protocols. If the runner supports subagents/tasks, use a subagent for independent recipe-quality review and evidence-package critique; do not delegate product code editing unless the user explicitly asked for a multi-agent implementation. The main agent remains responsible for ordering, truthfulness, and final evidence claims.

If a step fails, fix that layer and rerun the smallest relevant step; if it cannot be fixed locally, mark the step BLOCKED: <concrete reason> and continue only to package the blocker honestly.

## mms-recipe-fix-ticket checklist
- [ ] 0. Coffee handoff sent to human.
- [ ] 1. Ticket captured: URL or pasted text, summary, requirements, ACs.
- [ ] 2. AC matrix written: each AC numbered verbatim with proof mode (`state`, `visual`, or `mixed`) and primary evidence.
- [ ] 3. Target runtime selected: Mobile/Extension + platform/env + rationale.
- [ ] 4. Repro/baseline plan written before product behavior edits.
- [ ] 5. `/mms-recipe-harness` delegate completed install/verify; manifest path recorded.
- [ ] 6. `/mms-recipe-cook` delegate produced baseline/no-state recipe path and command, or baseline is explicitly blocked with reason.
- [ ] 7. Baseline/no-state recipe command run, or explicitly blocked with reason.
- [ ] 8. Minimal product fix implemented by the main agent.
- [ ] 9. Focused implementation checks run (typecheck/Jest/lint as relevant).
- [ ] 10. `/mms-recipe-cook` delegate updated after/with-state recipe path and command.
- [ ] 11. Runtime recipe command run; `summary.json`, `trace.json`, and manifest paths recorded.
- [ ] 12. Screenshot evidence read visually by the main agent; claimed UI is visible, not hidden/offscreen/wrong tab.
- [ ] 13. `/mms-recipe-quality` delegate/subagent critique completed against AC matrix + artifacts.
- [ ] 14. One improvement/rerun cycle completed, or quality critique says no rerun needed.
- [ ] 15. `/mms-recipe-evidence` delegate/package produced PR-ready evidence text.
- [ ] 16. Asked the human whether to clean up runtime resources now, or recorded that they should stay running for review.
- [ ] 17. Final response includes fix summary, tests, recipe evidence, quality loop, and remaining gaps only if truly blocked.
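Because the checklist uses a regular `- [ ] N.` line shape, the earliest unchecked gate can be found mechanically. The helper names below are hypothetical; the sketch assumes every gate sits on one line in the format shown above.

```python
import re
from typing import List, Optional, Tuple

# Matches lines such as "- [ ] 5. ..." or "- [x] 5. ...".
GATE_RE = re.compile(r"^- \[( |x|X)\] (\d+)\.\s+(.*)$")


def parse_gates(checklist_text: str) -> List[Tuple[int, bool, str]]:
    """Return (gate number, checked?, description) for each checklist line."""
    gates = []
    for line in checklist_text.splitlines():
        match = GATE_RE.match(line.strip())
        if match:
            checked = match.group(1).lower() == "x"
            gates.append((int(match.group(2)), checked, match.group(3)))
    return gates


def earliest_unchecked(checklist_text: str) -> Optional[int]:
    """Return the lowest-numbered unchecked gate, or None if all are checked."""
    for number, checked, _ in sorted(parse_gates(checklist_text)):
        if not checked:
            return number
    return None
```

If a later gate is checked while an earlier one is not, this still returns the earlier number, which matches the rule to backfill skipped gates before continuing.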

Hard ordering gates:

Steps 1–4 happen before behavior/source edits, except tiny locator-only testID additions needed to make baseline evidence executable.
Step 5 must explicitly invoke/follow /mms-recipe-harness; ad-hoc runtime status scripts are not a substitute.
Step 6 and step 10 must explicitly invoke/follow /mms-recipe-cook; ad-hoc screenshots are not a substitute for an executable recipe.
Step 9 is not a stopping point. Passing typecheck/Jest only unlocks steps 10–15.
Step 12 requires reading PNG/video evidence produced by the recipe artifact package, not only screenshots captured by miscellaneous helper scripts.
Visual or mixed ACs require screenshot/video evidence tied to that AC.
State-only assertions cannot prove visible copy/color/layout claims.
Schema warnings on screenshot nodes are quality failures for visual tickets. Add note and claims to screenshots, then rerun validation instead of treating warning-only schema output as clean.
The evidence package must contain artifact-manifest.json from the harness or an explicit evidence manifest you create that lists summary.json, trace.json, logs, screenshots/videos, quality verdict, and recipe path.
After final artifacts are captured, ask the human whether they want runtime resources cleaned up now. Name the concrete resources (for example Metro port, simulator/device, webpack/dev server, browser/CDP process, tmux pane) and only stop/release them after confirmation, unless the user already asked for cleanup.
A runtime blocker is acceptable only after step 5 or the relevant recipe run was actually attempted and the exact failure is recorded.
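When the harness did not produce artifact-manifest.json, the explicit evidence manifest mentioned above could be assembled roughly as follows. The key names, the media suffix list, and the assumption that summary.json and trace.json sit at the top of the run directory are all illustrative; the harness may lay files out differently.

```python
from pathlib import Path
from typing import Any, Dict

MEDIA_SUFFIXES = {".png", ".mp4", ".mov"}  # assumption: typical screenshot/video types


def build_evidence_manifest(run_dir: str, recipe_path: str, quality_verdict: str) -> Dict[str, Any]:
    """List the artifacts found under run_dir; missing core files are reported as None."""
    root = Path(run_dir)

    def relative(paths):
        return sorted(p.relative_to(root).as_posix() for p in paths)

    def core(name: str):
        return name if (root / name).is_file() else None

    files = [p for p in root.rglob("*") if p.is_file()]
    return {
        "recipe": recipe_path,
        "summary": core("summary.json"),
        "trace": core("trace.json"),
        "logs": relative(p for p in files if p.suffix == ".log"),
        "media": relative(p for p in files if p.suffix.lower() in MEDIA_SUFFIXES),
        "quality_verdict": quality_verdict,
    }
```

A `None` for `summary` or `trace` is a signal that the runtime recipe run did not complete, not something to paper over in the package.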

For every visual or mixed acceptance criterion, use the shared visual assertion protocol before screenshot evidence:

{
  "action": "wait_for",
  "test_id": "target-test-id",
  "visibility": "viewport",
  "scroll": { "strategy": "into_view", "settle_ms": 300 },
  "timeout_ms": 10000,
  "poll_ms": 500
}

Then the screenshot node must declare what the image is supposed to prove:

{
  "action": "screenshot",
  "filename": "after-ac1-target-visible.png",
  "note": "AC1: target component is visible with the expected text",
  "claims": {
    "must_show": [{ "test_id": "target-test-id", "visibility": "viewport" }],
    "must_not_show": [{ "text_contains": "Fund your wallet" }]
  }
}

Do not treat wait_for fiber-tree/DOM/native presence, eval_sync, controller state, or a passing recipe as proof that a user can see the element. Visual claims need viewport visibility plus screenshot claims, followed by human/quality review of the PNG/video.
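A pre-run lint can catch screenshot nodes that would not satisfy this protocol. The sketch assumes a recipe is available as a flat list of node dictionaries shaped like the two examples above; the real recipe schema belongs to the harness, and the function name is hypothetical.

```python
from typing import Any, Dict, List


def visual_claim_problems(nodes: List[Dict[str, Any]]) -> List[str]:
    """Flag screenshot nodes lacking a note, claims, or a prior viewport wait_for."""
    problems: List[str] = []
    waited = set()  # test IDs already waited for with viewport visibility
    for index, node in enumerate(nodes):
        action = node.get("action")
        if action == "wait_for" and node.get("visibility") == "viewport":
            waited.add(node.get("test_id"))
        elif action == "screenshot":
            name = node.get("filename", f"node {index}")
            if not node.get("note"):
                problems.append(f"{name}: missing note")
            must_show = (node.get("claims") or {}).get("must_show") or []
            if not must_show:
                problems.append(f"{name}: missing claims.must_show")
            for claim in must_show:
                test_id = claim.get("test_id")
                if test_id is not None and test_id not in waited:
                    problems.append(f"{name}: no viewport wait_for for {test_id}")
    return problems
```

A clean result only shows the recipe is shaped correctly; the PNG or video still has to be read by the agent and the quality reviewer.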

Workflow

Create the live CHECKLIST.md from the embedded platform checklist, then send the coffee handoff message with its path.
Read the ticket, linked PRs/issues, logs, screenshots, and acceptance criteria. If the expected behavior or acceptance criteria are unclear, ask for clarification once; when the user supplies it, continue without stopping.
Write the AC matrix and proof modes before editing behavior code.
Reconstruct the expected behavior and failure mode.
Find the smallest relevant code path and existing tests.
Call /mms-recipe-harness for install/verify; record manifest and verification artifacts.
Call /mms-recipe-cook for baseline/no-state and after/with-state recipe planning for user-visible, stateful, cross-system, historically flaky, or acceptance-criteria-tied bugs.
Add or update focused tests where they directly prove the bug.
Patch the root cause with the smallest product diff.
Run focused implementation checks.
Continue into after/with-state recipe proof; a dry-run is schema-only and is not runtime proof.
Call /mms-recipe-quality or a recipe-quality subagent on the recipe plus evidence.
Classify any weakness into the correct layer: product, recipe, fixture/state setup, harness/runtime, skill instruction, evidence packaging, or runner steering.
Patch the smallest correct layer and rerun from the smallest meaningful point.
Call /mms-recipe-evidence or package equivalent PR-ready evidence.
Return the patch summary and evidence only after the Mandatory Continuation Gate and checklist are satisfied.

Output

Root Cause — concise explanation.
Fix — files changed and why.
Tests — commands run and result.
Recipe Evidence — recipe path, artifacts, and verdict.
Quality Loop — critique result, improvement made, and rerun status.
Remaining Risk — only if something is unproven.

name: recipe-fix-ticket
description: Fix a MetaMask bug from a Jira/GitHub ticket using recipe-backed validation. Use when an agent needs to reproduce or understand an existing failure, implement a minimal fix, prove the acceptance criteria with a recipe, and prepare reviewer-ready evidence.
maturity: experimental

Recipe Fix Ticket

Runner Invocation Compatibility

Different agent runners expose installed skills differently:

Claude/Cursor: use the slash-command form, for example /mms-recipe-fix-ticket <ticket>.
Codex/OpenAI agents: use the skill trigger form, for example $mms-recipe-fix-ticket <ticket>.

Recommended Codex/OpenAI-agent invocation shape:

$mms-recipe-fix-ticket <ticket-or-task-url>

For /mms-recipe-dev, use $mms-recipe-dev <ticket-or-task-url-or-task-prompt>.

Live Checklist File Protocol

Before product edits, before implementation planning, and before telling the human to go get coffee, create a live checklist file from the installed platform reference:

# Pick mobile or extension after identifying the target repo from cwd/ticket.
.agents/skills/mms-recipe-fix-ticket/scripts/init-checklist.sh --platform <mobile|extension> --slug <ticket-or-task-slug>

If the skill is installed somewhere else, run the same script from the installed skill directory, or manually copy the matching reference checklist to:

temp/tasks/<skill>/<timestamp>-<slug>/CHECKLIST.md

The copied CHECKLIST.md is the source of truth for progress. It must contain [ ] checkboxes. After every gate:

edit CHECKLIST.md from [ ] to [x] for the completed gate;
add the artifact path, command, result, or blocker under that gate;
immediately continue to the next unchecked gate.
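The per-gate update can be sketched as a pure text transformation. The function name and the indented `- evidence:` line format are assumptions for illustration; the skill only requires that the artifact path, command, result, or blocker appear under the gate.

```python
import re


def mark_gate_done(checklist_text: str, gate: int, evidence: str) -> str:
    """Flip one gate from [ ] to [x] and record its evidence on the next line."""
    pattern = re.compile(rf"^(\s*- )\[ \]( {gate}\..*)$")
    out, changed = [], False
    for line in checklist_text.splitlines():
        match = pattern.match(line)
        if match and not changed:
            out.append(f"{match.group(1)}[x]{match.group(2)}")
            out.append(f"  - evidence: {evidence}")  # assumed note format
            changed = True
        else:
            out.append(line)
    if not changed:
        raise ValueError(f"gate {gate} is missing or already checked")
    return "\n".join(out) + "\n"
```

Raising on a missing or already-checked gate keeps a gate from being marked twice or marked without a matching line in the file.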

Karpathy-Style Execution Discipline

Apply this discipline throughout the workflow:

Think before coding: if task source, target surface, ACs, fixture state, or evidence requirement is ambiguous, record the ambiguity in CHECKLIST.md and ask once before product edits.
Simplicity first: implement the smallest reversible change that satisfies the stated ACs. Do not add abstractions, generic actions, or speculative configuration unless the recipe proof requires it.
Surgical changes: every changed product line must trace to an AC. Do not refactor adjacent code, move existing logic, or clean unrelated files.
Goal-driven execution: each checklist gate must have a concrete verifier/path/result. Do not mark [x] from intent, code inspection, or tests that do not cover the gate.

Clean Per-Run Branch Protocol

Every new validation loop must run on a clean, model-specific branch so the human can compare Claude/Codex/Cursor diffs afterwards. Before product edits:

ensure the worktree has no unstaged product changes from a previous loop; if it does, stash them with a descriptive adr58-validation-... message and record the stash in CHECKLIST.md;
record the base branch and base SHA in CHECKLIST.md;
create or switch to a fresh branch named with the runner/model, skill, ticket or task slug, and run id. If the source is a Jira ticket, the branch name must start with the lowercased Jira key followed by a hyphen on both Mobile and Extension targets so regular MetaMask/Farmslot tooling can associate it, for example tat-3216-adr58-codex-mms-recipe-fix-ticket-fresh2. For non-Jira prompts, use a stable sanitized task slug such as adr58-codex-mms-recipe-fix-ticket-demo-fresh1;
keep all product edits for that loop on that branch only;
include branch name, base SHA, and git diff --stat <base>...HEAD in the final evidence package.

If the branch cannot be made clean, mark the branch gate BLOCKED before implementation. Do not mix multiple model attempts on the same product branch.
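The naming rule can be expressed as a small helper that reproduces the two example branch names above. The part ordering and the default `adr58` prefix are inferred from those examples and should be treated as assumptions; the function names are hypothetical.

```python
import re
from typing import Optional


def _slug(text: str) -> str:
    """Lowercase and collapse anything that is not a-z or 0-9 into single hyphens."""
    return re.sub(r"[^a-z0-9]+", "-", text.lower()).strip("-")


def branch_name(runner: str, skill: str, run_id: str,
                jira_key: Optional[str] = None, task_slug: Optional[str] = None,
                prefix: str = "adr58") -> str:
    """Build a per-run branch name; Jira-sourced names start with the lowercased key."""
    parts = []
    if jira_key:
        parts.append(jira_key)
    parts += [prefix, runner, skill]
    if task_slug and not jira_key:
        parts.append(task_slug)  # non-Jira prompts carry a sanitized task slug instead
    parts.append(run_id)
    return "-".join(_slug(part) for part in parts if part)
```

For example, `branch_name("codex", "mms-recipe-fix-ticket", "fresh2", jira_key="TAT-3216")` yields the Jira-style name shown in the protocol.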

Clean Generated Harness State Protocol

rm -rf temp/agentic/recipes .agent/recipe-harness/extension .agent/recipe-harness/mobile
rm -rf temp/tasks/<this-run>/harness

For Extension, prefer task-local harness output when writing new recipes so each run is isolated from shared temp/agentic/recipes state:

.agents/skills/mms-recipe-harness/scripts/recipe-harness.sh extension install \
  --target . \
  --out temp/tasks/<this-run>/harness/recipes

First Response to the Human

Ok, relax and go get a coffee ☕. I’ll take this from ticket → fix → recipe → evidence package. You can monitor live progress in <CHECKLIST.md path>, and I’ll report back when it is done or concretely blocked.

Runtime Startup Approval Gate

Portable Runtime Discovery Gate

Before choosing any runtime command or port, perform read-only discovery in this order and record the result in CHECKLIST.md:

caller-provided runtime context: RECIPE_RUNTIME_CONTEXT JSON path, RECIPE_SLOT_ID, RECIPE_CDP_PORT, CDP_PORT, RECIPE_METRO_PORT, METRO_PORT, RECIPE_WATCHER_PORT, WATCHER_PORT, IOS_SIMULATOR, SIMULATOR, ANDROID_SERIAL, ADB_SERIAL, and comparable env vars;
repo-local generic runtime context: temp/runtime/agentic-runtime.json (and temp/runtime/agentic-runtime.env if you want to source it). If this file has strict: true, use only the recorded slot/port/device values and do not probe or fall back to other local runtimes. If it has runtimeStart.approved: true plus runtimeStart.command, pass recovery through /mms-recipe-harness launch/live/verify and let the harness run that approved command; outside Farmslot, any developer/tool may provide the same context or RECIPE_RUNTIME_START_APPROVED=1 with RECIPE_RUNTIME_START_CMD;
installed recipe-harness/delegate summaries or manifests in the current checkout that identify an already-owned runtime;
currently listening local CDP/device endpoints only as fallbacks, never as a reason to ignore caller-provided context.
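The discovery order can be sketched as a read-only function. It covers only the caller-env and repo-local context steps (installed harness manifests are omitted), and the return shape and function name are assumptions; the variable names and file path come from the list above.

```python
import json
from pathlib import Path
from typing import Any, Dict, Mapping

CONTEXT_FILE = "temp/runtime/agentic-runtime.json"
CALLER_VARS = (
    "RECIPE_RUNTIME_CONTEXT", "RECIPE_SLOT_ID", "RECIPE_CDP_PORT", "CDP_PORT",
    "RECIPE_METRO_PORT", "METRO_PORT", "RECIPE_WATCHER_PORT", "WATCHER_PORT",
    "IOS_SIMULATOR", "SIMULATOR", "ANDROID_SERIAL", "ADB_SERIAL",
)


def discover_runtime(repo_root: str, env: Mapping[str, str]) -> Dict[str, Any]:
    """Read-only discovery: caller env first, then the repo-local context file."""
    caller = {name: env[name] for name in CALLER_VARS if env.get(name)}
    path = Path(repo_root) / CONTEXT_FILE
    context = json.loads(path.read_text(encoding="utf-8")) if path.is_file() else {}
    if caller:
        source = "caller-env"
    elif context:
        source = "repo-context"
    else:
        source = "fallback-probe"
    return {
        "source": source,
        "caller": caller,
        "context": context,
        # strict: true forbids probing or falling back to other local runtimes
        "may_probe_fallbacks": context.get("strict") is not True,
    }
```

The result is meant to be written into CHECKLIST.md before any runtime command or port is chosen.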

Harness Boundary Gate

Delegate Recovery Decision Tree

When a live proof is blocked, escalate through the lower-level delegates before using ad-hoc commands or packaging partial evidence:

Runtime/CDP unavailable — use /mms-recipe-harness (or the installed delegate path) to launch, live-verify, or recover the runtime with the caller-approved context/prepare command. The high-level skill should not run raw build/watch/Chrome/simulator commands when a harness path exists. If no runtime context or start approval exists, stop with BLOCKED: missing runtime context or BLOCKED: pending runtime-start approval.
Wallet locked, wrong account, onboarding, route, or app-ready blocker — use /mms-recipe-wallet-control (or the installed delegate path) for unlock, account, route, and wallet readiness primitives. Do not invent private aliases or mutate controller/store state to prove user-visible ACs.
Stateful product setup unavailable — use recipe/harness/wallet-control supported flows or documented pre-start fixtures. If no real flow/fixture exists, record a fixture/state setup blocker; do not manufacture the target state.
Delegate cannot recover — stop at the concrete blocker with command/log paths. Fix the delegate/runtime/root cause and restart from clean generated harness state rather than continuing to a partial evidence package, unless the human explicitly asks for partial packaging.
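The decision tree can be summarized as a lookup from blocker type to next step. The enum, the boolean flags, and the mapping of "no runtime context" versus "no start approval" to the two BLOCKED strings are one reading of the tree above, offered as a sketch rather than the skill's own logic.

```python
from enum import Enum


class Blocker(Enum):
    RUNTIME_UNAVAILABLE = "runtime"
    WALLET_NOT_READY = "wallet"
    STATE_SETUP_UNAVAILABLE = "state"
    DELEGATE_FAILED = "delegate"


def next_step(blocker: Blocker, *, has_runtime_context: bool = True,
              start_approved: bool = True, real_flow_or_fixture: bool = True) -> str:
    """Return the delegate to call, or the BLOCKED/STOP label to record."""
    if blocker is Blocker.RUNTIME_UNAVAILABLE:
        if not has_runtime_context:
            return "BLOCKED: missing runtime context"
        if not start_approved:
            return "BLOCKED: pending runtime-start approval"
        return "/mms-recipe-harness"
    if blocker is Blocker.WALLET_NOT_READY:
        return "/mms-recipe-wallet-control"
    if blocker is Blocker.STATE_SETUP_UNAVAILABLE:
        if real_flow_or_fixture:
            return "use supported flow or documented pre-start fixture"
        return "BLOCKED: fixture/state setup"
    # Delegate could not recover: stop rather than package partial evidence.
    return "STOP: record command/log paths, fix root cause, restart from clean harness state"
```

None of the branches falls back to raw build, watch, Chrome, or simulator commands, and none manufactures the target state.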

CDP Bootstrap Failure Stop Gate

Only continue to recipe-quality/evidence packaging after either:

the live recipe emitted normal runtime artifacts (summary.json, trace.json, artifact manifest, screenshots/video where applicable); or
the human explicitly asks for a partial package despite the bootstrap blocker.

Do not convert pre-bootstrap CDP failure into pass-with-gaps. Do not keep retrying inside the same dirty product run. Fix the root cause, clean generated outputs, and restart.

Ticket Source-of-Truth Gate

Load only what applies:

Runtime setup: /mms-recipe-harness
Recipe authoring: /mms-recipe-cook
Recipe critique: /mms-recipe-quality
PR evidence formatting: /mms-recipe-evidence
Target-repo fix notes are appended below when installed.
Target checklist reference: load only one of references/metamask-mobile-checklist.md or references/metamask-extension-checklist.md.

Lower-Level Skill Invocation Contract

The lower-level recipe skills are required gates, not optional background reading. For every delegate gate, do one of these explicitly:

invoke the named skill using the runner-specific command form (slash for Claude/Cursor, $ for Codex) if the runner supports nested skill calls; or
if nested slash calls are unavailable, open the installed skill file and follow it as the delegate protocol:
- .claude/skills/mms-recipe-harness/SKILL.md
- .claude/skills/mms-recipe-cook/SKILL.md
- .claude/skills/mms-recipe-quality/SKILL.md
- .claude/skills/mms-recipe-evidence/SKILL.md
- equivalent .agents/skills/.../SKILL.md or .cursor/rules/.../RULE.md when running under Codex/OpenAI agents or Cursor.

.claude/skills/mms-recipe-harness/SKILL.md and scripts/ / adapters/;
.agents/skills/mms-recipe-harness/SKILL.md and scripts/ / adapters/;
.cursor/rules/mms-recipe-harness/RULE.md and copied references/scripts.

No Manufactured Runtime State

Valid proof must use one of:

the real user flow encoded as a recipe;
a documented fixture/profile loaded before app start by the harness; or
an honest BLOCKED/PASS-WITH-GAPS verdict that names the missing fixture or runtime capability.

Non-Negotiable Proof Contract

For user-visible, stateful, or acceptance-criteria-driven tickets, do not stop at a code diff, unit tests, or type checks. A complete fix package must include:

the ticket URL or copied ticket prompt;
a product diff summary excluding harness/generated files;
/mms-recipe-harness install/verify status and artifact path;
an executable recipe path;
the exact recipe run command;
summary.json;
trace.json;
artifact-manifest.json or evidence manifest;
screenshots/video for reviewer-visible UI claims;
/mms-recipe-quality critique;
one improvement/rerun cycle, or an explicit note that the first pass already meets the evidence bar;
a PR-ready evidence summary;
explicit gaps for any unexercised proof target.

Runtime Failure Recovery Gate

Runtime Precondition Recovery Gate

If runtime-start approval has not been granted, do not run prepare/watch/launch/simulator-boot or any command that starts or restarts a runtime process. Run static/no-start harness checks if useful, record BLOCKED: pending runtime-start approval with the exact command that would be needed, and wait for explicit approval.
If runtime-start approval exists, invoke/follow /mms-recipe-harness verify/preflight for the platform, using the provided RECIPE_RUNTIME_CONTEXT, RECIPE_SLOT_ID, CDP_PORT, RECIPE_CDP_PORT, IOS_SIMULATOR, SIMULATOR, ADB_SERIAL, ANDROID_SERIAL, WATCHER_PORT, METRO_PORT, or equivalent caller-provided env vars first.
For wallet/login readiness, invoke/follow /mms-recipe-wallet-control or the repo harness wallet setup/unlock command rather than asking the human to run a private alias.
Record the exact recovery command, output artifact, app-state/status result, and whether it reached a route/account/perps-ready state.
Rerun the same recipe command after recovery, or explain the concrete external reason recovery could not run.

Stateful AC Setup Gate

Fallback Screenshot Verdict Gate

Mandatory Continuation Gate

A final response is forbidden until one of these is true:

A recipe was authored or updated, run against the live target runtime, and the evidence package was produced; or
the runtime proof path was attempted through /mms-recipe-harness, failed for a concrete external reason, and the response clearly labels the recipe proof as BLOCKED rather than claiming the acceptance criteria are proven.

Do not say “all acceptance criteria met”, “all green”, or “ready” from implementation tests alone. Say “code/tests pass; recipe proof still pending” until the recipe artifacts exist.

