milestone-execute

Name: Milestone Execute
Author: frank-fs

// Use when implementing a milestone with multiple issues. Thesis-first approach: write failing E2E tests before issues, expert-review acceptance criteria before implementation, use dedicated terminal sessions (not subagents), and verify E2E yourself before merging. Includes review cycle that creates follow-up issues when expert review surfaces new gaps.

stars:176

forks:24

updated: April 26, 2026 at 03:56

related-skills.json

same repository

decompose.md

from "frank-fs/frank"

Break an issue into tasks so small and precise that each has exactly one correct implementation. Refuses to launch execution until the plan contains actual code shapes, verified file paths, and scope locks. Use before any implementation session.

2026-04-26 (176 stars)

techdebt.md

from "frank-fs/frank"

Use when the user wants to find, catalog, or address technical debt. Also use when starting a cleanup sprint, looking for code quality improvements, or asking "what needs cleaning up?"

2026-04-26 (176 stars)

babysit.md

from "frank-fs/frank"

Babysit open PRs — auto-rebase, fix CI, address review comments. Use with /loop 5m /babysit for hands-free PR management.

2026-04-12 (176 stars)

discipline.md

from "frank-fs/frank"

Run Holzmann "Power of Ten" discipline review on changed code. Checks nesting depth, loop bounds, function size, preconditions, mutable state, side effect surfacing, and indirection depth. Use after writing code, before committing, or as part of PR review. Outputs a weighted letter grade (A-F) with stoplight color.

2026-04-11 (176 stars)

expert-review.md

from "frank-fs/frank"

Use when completing a feature, before creating a PR, or when the user asks "what do my experts think?" Dispatches the expert panel as parallel subagents for structured code review from multiple perspectives.

2026-04-04 (176 stars)

commit-push-pr.md

from "frank-fs/frank"

Commit staged changes, push branch, and create a PR with full pre-flight verification.

2026-04-03 (176 stars)

package.json

"author": "frank-fs"

"repository": "frank-fs/frank"


name	milestone-execute
description	Use when implementing a milestone with multiple issues. Thesis-first approach: write failing E2E tests before issues, expert-review acceptance criteria before implementation, use dedicated terminal sessions (not subagents), and verify E2E yourself before merging. Includes review cycle that creates follow-up issues when expert review surfaces new gaps.

Milestone Execute

Thesis-first execution for milestones. The E2E test is the spec. The issue describes the thesis, not the implementation. Dedicated terminal sessions, not subagents.

Why This Process Exists

Four iteration cycles produced checkbox-complete implementations that failed expert review because the underlying thesis was unproven. Root cause: issues described symptoms ("add CE operation X") and agents fixed symptoms without proving the thesis. Subagents optimize for the prompt — if the prompt says "add X," they add X. They don't verify the thesis holds end-to-end.

Prerequisites

Milestone exists with open issues (or findings to convert into issues)
Each issue has thesis-first acceptance criteria (not implementation checkboxes)
E2E test script exists that fails before implementation

Process

Phase 0: Write the Failing E2E Test

Before creating any issues:

Write test-e2e.sh (or equivalent) with concrete HTTP request/response pairs that prove the thesis
Include negative tests — remove a crutch (flat transition, hardcoded value) and prove the real mechanism works
Include falsifiable tests — tests where the correct output cannot be produced without the underlying mechanism being correct
Run the test. It MUST fail. If it passes, the test isn't testing the thesis.

The E2E test IS the spec. Issues are "make these test lines green."
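As a rough illustration of the shape such a script might take, here is a minimal sketch. The `BASE_URL` default, the `/orders/1` route, the expected status codes, and the `RUN_E2E` guard are all hypothetical placeholders, not taken from the Frank repository; only the idea of concrete request/response pairs plus a negative test comes from the list above.

```shell
#!/usr/bin/env bash
# Sketch of a thesis-first test-e2e.sh.
# BASE_URL and the /orders routes are hypothetical placeholders.
BASE_URL="${BASE_URL:-http://localhost:5000}"
failures=0

# Compare an expected value with the observed one and record the result.
expect() {
  local name="$1" expected="$2" actual="$3"
  if [ "$expected" = "$actual" ]; then
    echo "PASS  $name"
  else
    echo "FAIL  $name (expected '$expected', got '$actual')"
    failures=$((failures + 1))
  fi
}

# Issue a request and print only the HTTP status code.
status_of() {
  local method="$1" path="$2"
  curl -s -o /dev/null -w '%{http_code}' -X "$method" "$BASE_URL$path"
}

run_suite() {
  # Positive test: the resource is reachable.
  expect "GET /orders/1 returns 200" "200" "$(status_of GET /orders/1)"
  # Negative test: a request that is invalid in the current state is rejected.
  expect "DELETE /orders/1 is rejected" "405" "$(status_of DELETE /orders/1)"
  # Succeed only when no expectation failed.
  [ "$failures" -eq 0 ]
}

# Run only when explicitly requested, so the helpers can be sourced on their own.
if [ "${RUN_E2E:-0}" = "1" ]; then
  run_suite
  exit $?
fi
```

Running it with `RUN_E2E=1 ./test-e2e.sh` before any implementation exists should print at least one FAIL line and exit non-zero; a clean pass at that point suggests the test is not exercising the thesis.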

Next step: Present the failing E2E test to the user for review.

Phase 1: Create Thesis-First Issues

For each issue, follow the Issue Template below. Key rules:

Put thesis-level acceptance criteria in the issue body VERBATIM
Do NOT translate them into implementation instructions
The agent figures out the implementation by making the acceptance tests pass
Include dependencies on other issues so wave planning is explicit

Next step: Present each issue draft to the user for review before creating on GitHub. Then expert-review the acceptance criteria (not code) before implementation begins.

Phase 2: Expert Review Acceptance Criteria

Before any implementation:

Dispatch 2-4 relevant experts to review the ACCEPTANCE CRITERIA (not code)
Each expert checks: "If these tests pass, does the thesis hold?"
Experts identify tests that can be faked (shortcut to correct output without correct mechanism) and propose falsifiable alternatives
Revise acceptance criteria based on expert feedback

Next step: Present expert feedback and revised criteria to the user.

Phase 3: Wave Planning

Group issues into dependency waves. Query the Project board for current Status + Track in one call (preferred):

gh project item-list 1 --owner frank-fs --format json --limit 200 \
  | jq '.items[] | select(.content.milestone.title == "<name>") | {num: .content.number, title: .content.title, status: .status, track: .track}'

Fall back to gh issue list --milestone "<name>" --state open --json number,title,labels only if the project query is unavailable.

For umbrella-driven milestones, gh api graphql on the umbrella's subIssues { nodes { number, title, state } } gives the full child set with current state.

Wave N+1 depends on Wave N
Issues within a wave have zero shared files
Merge order: fewest shared-file changes first
Set Project Status to In Progress on the wave's items as you start each one (gh project item-edit); the auto-workflow flips them to Done on close.
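The "zero shared files" rule can be checked mechanically once each issue has a planned file list. The sketch below assumes a hypothetical input format of one `<issue> <path>` pair per line (paths without spaces); how that list is produced is left to the planner.

```shell
# Print every path claimed by more than one issue in a wave.
# Input on stdin: one "<issue> <path>" pair per line (hypothetical format).
shared_files() {
  # sort -u drops repeated identical pairs so an issue is counted once per path.
  sort -u \
    | awk '{ count[$2]++ } END { for (p in count) if (count[p] > 1) print p }' \
    | sort
}
```

An empty result means the wave is safe to run in parallel; any printed path is a reason to move one of the conflicting issues to a later wave.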

Next step: Present wave plan for user approval.

Phase 3.5: Design Exploration (library design issues only)

Some issues require creative design work — new APIs, new abstractions, new middleware operations. These cannot be solved by constraints alone. An agent implements designs; it does not create them.

When to use this phase: The issue's thesis is about library/API design (not wiring, fixing, or extending an existing API). Ask: "Does the agent need to invent a new abstraction, or use an existing one?" If invent → do this phase.

Explore the current architecture in a research-only session. Map the relevant code paths, types, and assumptions. Do NOT implement anything.
Present findings to the user — structured summary with file paths, line numbers, and design assumptions that conflict with the thesis.
Iterate on the design with the user. Propose the new API surface, get feedback, revise. The design is done when the user approves it.
Write the design into the issue as context — not as implementation instructions, but as "the middleware API should support X, Y, Z" alongside the architectural constraints and anti-shortcuts.

The design becomes input to the issue. The agent implements the approved design and proves it works via the acceptance tests.

Why this phase exists: #250 failed 5 attempts because the issue had a library design thesis but no design — only desired output. Agents found shortcuts every time. Architectural constraints prevent wrong approaches (90%) but don't guarantee the right approach (55%). Working out the design first raises that to ~85%.

Next step: Present design exploration findings and proposed API to user.

Phase 4: Implementation via Dedicated Terminal Sessions

Boris's approach: use dedicated terminal sessions in worktrees, NOT subagents.

Preparation (orchestrating session does this)

For each issue in the current wave, the orchestrating session:

Creates the worktree:

git worktree add .claude/worktrees/{name} -b feature/{issue}-{short-name} master

Writes PROMPT.md in the worktree root with the full session prompt. IMPORTANT: Write PROMPT.md AFTER posting any addendum comments (architectural constraints, anti-shortcuts). Include both the issue body AND all comments.

gh issue view {number} --json body --jq '.body' > .claude/worktrees/{name}/PROMPT.md
gh api repos/{owner}/{repo}/issues/{number}/comments --jq '.[].body' >> .claude/worktrees/{name}/PROMPT.md

PROMPT.md contains:

The GitHub issue body (thesis + acceptance tests) — fetched via gh issue view
All issue comments (including architectural constraints addenda)
The E2E test instructions
TDD instruction: write failing tests for each acceptance criterion FIRST, then implement to make them pass
Verification instruction: run E2E test, do not claim done without evidence

Template for PROMPT.md content:

# Issue #{number}: {title}

{Full issue body from GitHub — paste verbatim}

---

## Instructions

Make the acceptance tests in the issue above pass.

1. Read the ENTIRE issue — thesis, architectural constraints, anti-shortcuts,
   implementation sequence, and acceptance tests are ALL part of the spec
2. Follow the implementation sequence if one is provided — do not skip phases
3. Respect architectural constraints — if the issue says the solution must be
   in the library, do not hand-code it in the application
4. Check anti-shortcuts before claiming done — if your implementation matches
   a listed anti-shortcut, it is wrong regardless of test results
5. Follow TDD (`superpowers:test-driven-development`): write a failing test
   for each acceptance criterion FIRST, then implement to make it pass
6. Run `DOTNET_SYSTEM_GLOBALIZATION_INVARIANT=1 dotnet build Frank.sln` and
   `DOTNET_SYSTEM_GLOBALIZATION_INVARIANT=1 dotnet test Frank.sln --filter "FullyQualifiedName!~Sample"`
   to verify nothing is broken
7. Run the E2E test if one exists
8. Do not claim done without build + test evidence in your output
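One way to assemble a file in the shape of the template above is sketched here. It assumes the issue body and comments have already been fetched into two files (for example with the `gh` commands shown earlier); the function name `render_prompt` and its argument order are hypothetical, and only the first instruction line is reproduced.

```shell
# Assemble PROMPT.md content from already-fetched pieces.
# Arguments (all supplied by the caller): issue number, issue title,
# file holding the issue body, file holding the concatenated comments.
render_prompt() {
  local number="$1" title="$2" body_file="$3" comments_file="$4"
  printf '# Issue #%s: %s\n\n' "$number" "$title"
  cat "$body_file"        # issue body, verbatim
  printf '\n'
  cat "$comments_file"    # addendum comments (constraints, anti-shortcuts)
  printf '\n---\n\n## Instructions\n\n'
  printf 'Make the acceptance tests in the issue above pass.\n'
}
```

Redirecting the output to `.claude/worktrees/{name}/PROMPT.md` keeps the title header, the body, and the comments in one place, so addenda posted before the file is written are not lost.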

Presents the start command for each session:
```
cd .claude/worktrees/{name} && claude --name "{issue-short-name}"
```
The user opens the session, then pastes the content of PROMPT.md as the first message (or reads it with /read PROMPT.md).

Why terminal sessions over subagents

Full permissions (can run servers, curl, E2E tests)
Persistent context (no context window pressure)
User can observe progress in real-time
User can intervene and redirect
No "agent said success" trust problem — you see the output

TDD integration (`superpowers:test-driven-development`)

Each acceptance criterion becomes a failing test BEFORE implementation
Red → Green → Refactor cycle for each criterion
E2E test is the outer loop; unit/integration tests are the inner loop
The red step is mandatory — if the test passes before implementation, the test isn't testing the right thing

Next step: Create worktrees, write PROMPT.md files, and present the start commands for each terminal session to the user.

Phase 5: Verification (User Runs E2E)

After each terminal session claims completion:

User runs the E2E test — not the agent, not a subagent, the USER
Every acceptance criterion must pass with observable evidence
If any test fails, the issue is not done — send the failure back to the terminal session
Only after E2E passes: run /simplify and /expert-review on the diff

CRITICAL: Do not trust agent self-reports. Do not trust "all tests pass" without seeing the output yourself. Run the test. Read the output.

Next step: Present E2E results to the user. If all pass, proceed to review. If any fail, go back to Phase 4.

Phase 6: Review and Follow-Up Issue Creation

After E2E passes, run /expert-review on the completed work.

Triage expert findings into three buckets:

Blocking — thesis is not proven despite E2E passing (test has a gap)
- Fix before merge. Update the E2E test to cover the gap, then return to Phase 4.
In-scope follow-up — real gap but separable from this issue
- Create a new issue using the Issue Template below
- Add to the current milestone
- Add as a native sub-issue of the relevant umbrella (addSubIssue mutation — see CLAUDE.md "Project board")
- Add a native dependency on the current issue with addIssueDependency (replaces "Depends on: #X" body convention; surfaces in the UI and is queryable)
- Include the expert finding as the "Current problem" section
Out-of-scope — valid concern but belongs to a future milestone
- Create issue with the future label
- Do NOT add to current milestone
- Note in the current PR body: "Deferred: #{new-issue} — {rationale}"

For each new follow-up issue:

Write it using the Issue Template (thesis → problem → definition → solution → acceptance tests → sources)
The acceptance test must be an HTTP exchange, not a code inspection
Add to the wave plan — does it fit in the current wave or the next?
Present to the user before creating on GitHub

CRITICAL: Never defer findings without user consent. Never close an issue when expert review surfaces unaddressed gaps. The review cycle is: E2E passes → expert review → create follow-up issues → user approves merge → merge.

Next step: Present expert findings with triage recommendations and any new issue drafts to the user.

Phase 7: PR and Merge

For each completed issue:

Create PR with per-requirement accounting (every acceptance test → PASS with evidence from the E2E output)
PR body must list any follow-up issues created during review
Wait for CI
Merge in dependency order (fewest shared files first)
git pull to update master before next wave
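The merge-order rule can be expressed as a simple sort. This sketch assumes a hypothetical input of one `<pr-number> <shared-file-count>` pair per line, where the count is however many files the PR shares with other PRs in the wave; ties fall back to the lower PR number.

```shell
# Print PR numbers in merge order: fewest shared files first.
# Input on stdin: "<pr-number> <shared-file-count>" per line (hypothetical format).
merge_order() {
  sort -k2,2n -k1,1n | awk '{ print $1 }'
}
```

For example, `printf '12 3\n15 0\n14 1\n' | merge_order` lists 15, then 14, then 12.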

Next step: After all issues in a wave merge, proceed to next wave (Phase 4) or to completion (Phase 8).

Phase 8: Completion

After all waves merge:

Verify gh project item-list 1 --owner frank-fs --format json --limit 200 | jq '.items[] | select(.content.milestone.title == "<name>" and .status != "Done")' returns only follow-up issues (not original issues). The umbrella's sub-issue progress bar should also show all original children at 100%.
Run the FULL E2E test suite against merged master
Run /retrospective to capture session learnings
Update memory with milestone completion status
Close the umbrella issue once all sub-issues are closed — the "track shipped" event.

Next step: Present completion status and any open follow-up issues to the user.

Issue Template

## Thesis

{What must be true for Frank's thesis to hold in this area}

## Current problem

{What happens today — walk through the request lifecycle showing the gap}

## Definition: "{key term}"

{What the key term means, stated so an external observer can verify
from HTTP responses alone}

## Proposed solution

{High-level approach — NOT file:line instructions}

## Architectural constraints

{Where the solution must live and what boundaries it must respect.
These are NOT implementation instructions — they constrain the
approach without prescribing specific code. Required for library
design issues; optional for wiring/fix issues.}

- The solution MUST be in {library/middleware}, not {application/sample}
- Application code MUST only use {public API surface}
- Application code MUST NOT directly reference {internal types}

## Implementation sequence

{Ordered phases that prevent skipping to the end. Each phase has
a verifiable checkpoint. These describe WHAT to build in what
order, not HOW to build it.}

1. {Library/API change} — checkpoint: {how to verify this phase}
2. {Tests proving the API works} — checkpoint: {tests pass}
3. {Application uses the API} — checkpoint: {compiles using only public API}
4. {E2E acceptance tests} — checkpoint: {HTTP exchanges pass}

## Anti-shortcuts

{Known failure modes from previous attempts. Explicit "do NOT"
with explanation of why it produces correct output but wrong
design. Omit for first-attempt issues — populate after a failed
attempt so the next agent doesn't repeat the mistake.}

- Do NOT {shortcut} — produces correct output because {why} but
  wrong design because {why}

## Acceptance tests

Each test is verified by test-e2e.sh. The issue is not done until every
test produces the specified response.

### 1. {Test name}

{HTTP method} {URL} → {expected status} {Response body / header assertion}


{Why this test is falsifiable — what mechanism must be correct for it to pass}

### 2. {Test name}
...

## Dependencies

- Depends on: #{issue} — {why this must land first}
- Blocks: #{issue} — {why that issue needs this}

## Expert sources

- **{Expert}**: {finding summary}

Anti-Patterns

| Anti-pattern | Why it fails | Correct approach |
| --- | --- | --- |
| Implementation instructions in issues | Agent follows recipe without understanding thesis | Thesis + acceptance tests + architectural constraints + anti-shortcuts |
| HTTP-only acceptance for library design issues | Agent finds shortest path to green output, bypassing the library | Add architectural constraints on where solution lives; do Phase 3.5 design exploration first |
| Retrying same spec after failure | Same spec produces same shortcut every time | After first failure, fix the spec — the spec has a loophole, not the agent |
| Subagents for implementation | Can't run servers, limited permissions, trust problem | Dedicated terminal sessions |
| Agent self-reports as verification | "All tests pass" without evidence | User runs E2E, reads output |
| Expert review after implementation only | Finds same gaps again | Expert review acceptance criteria BEFORE implementation, then review code AFTER |
| Sample app as separate issue | Framework issues close without proving thesis | Sample's E2E test IS the acceptance criteria |
| Checkbox acceptance criteria | Green checkboxes, unproven thesis | HTTP request/response pairs |
| Closing issues when review finds gaps | Gaps ship as "done" | Create follow-up issues, get user approval before merge |
| Deferring findings without consent | Scope quietly shrinks | Present all findings, user decides what's blocking vs follow-up |

Decision Log

Maintain during execution:

| Decision | Rationale | Impact |
| --- | --- | --- |
| {what} | {why} | {scope} |
