Exécutez n'importe quel Skill dans Manus
en un clic

Exécutez n'importe quel Skill dans Manus en un clic

evaluate-spawn

Étoiles1

Forks0

Mis à jour24 mai 2026 à 16:31

This skill should be used when the user asks to evaluate a completed team spawn session, capture session learnings, or review how a spawn went. Use this skill when the user says "evaluate spawn", "evaluate-spawn", "how did the spawn go", "capture learnings", "review spawn session", "spawn retrospective", or at the soft prompt after a team completes its work. This is a voluntary post-spawn evaluation with profile-based questions tailored to all spawn types (build, think, create). Build profile asks up to 4 questions (3 explicit + 1 conditional gate bypass); Think and Create profiles ask up to 3 questions (2 explicit + 1 gate bypass). Hard cap includes auto-derived scenario coverage for Build.

Installation

Installer avec Codex ou Claude Copiez ce prompt, collez-le dans Codex, Claude ou un autre assistant, puis laissez-le vérifier la page du skill et l'installer pour vous.

Exécuter dans Manus

Source

nthplusio

nthplusio/functional-claude

Ouvrir le dépôt GitHub Voir les dépôts du créateur

Téléchargement

Exécuter dans Manus

Métiers associésSOC

Basé sur la classification professionnelle SOC

Spécialistes en gestion de projetsProfessions des affaires et des opérations financières·SOC 13-1082

Explorateur de fichiers

4 fichiers

SKILL.md

readonly

Plus depuis ce dépôt

même dépôt

pm-branches

nthplusio/functional-claude

Use this skill when creating a git branch for a Linear or GitHub issue, writing a PR description, opening a pull request, or linking a PR to its tracked issue. Trigger phrases "create a branch", "new branch for this issue", "branch for

2026-06-091

pm-conventions

nthplusio/functional-claude

Use this skill when integrating, reading, or updating a project's conventions.md file — the document that captures house style for issue titles, labels, branch naming, PR descriptions, handoff phrasing, and review etiquette. Trigger phrases "set conventions", "load conventions", "where are the conventions", "what are our conventions", "add a convention", "edit conventions.md", "/pm-conventions", "the conventions say", "our team uses <X> format", "project conventions", "project title format". Also use proactively when another pm-* skill (pm-issues, pm-review, pm-improve, pm-branches, pm-handoff, pm-agent, pm-projects, pm-project-review) is about to write or evaluate text and a conventions file exists at `<repo>/.claude/pm/conventions.md` — the conventions must be loaded and applied before the skill writes anything user-facing.

2026-06-091

pm-dedup

nthplusio/functional-claude

Use this skill when the user asks to "find duplicate issues", "clean up duplicates", "what dupes are in the backlog", "/pm-dedup", or after a /pm-status briefing flags likely-dup clusters. Runs the `duplicates` lens against the team cache, has the agent judge each cluster, then offers merge / relate / close per cluster. Powered by the shared dedup engine in `hooks/lib/dedup.js`.

2026-06-091

pm-feedback

nthplusio/functional-claude

Use this skill when the user wants to report a bug, an unexpected result, or an improvement idea about the project-manager plugin ITSELF (or one of its pm-* skills) to the functional-claude maintainers — or when the agent has just hit a project-manager failure (a pm-* CLI error, a wrong/empty briefing, a crash, a clearly-incorrect result) and should offer to report it. Trigger phrases "report a bug in pm-review / pm-notes / any pm-* skill", "this plugin is broken", "pm gave the wrong answer", "send feedback about project-manager", "tell the functional-claude team", "/pm-feedback", "file a plugin issue". The defining cue is that the target of the complaint is the plugin or a named pm-* skill, not the user's own product. Assembles a README-shaped report (plugin + version, repro, expected vs actual, safe environment), scrubs every credential via the tested `hooks/lib/feedback-report.js` guard, previews it, then delivers — email to the Linear intake address (via resend-cli), GitHub issue fallback, or copy-paste d

2026-06-091

pm-handoff

nthplusio/functional-claude

Use this skill when the user wants to hand off work to another person or to the next stage of the lifecycle. Trigger phrases "hand this off to", "give this to", "I'm OOO", "passing this to", "covering for", "ready for release", "ready for QA", "ready for review", "release notes", "deploy handoff", "write a handoff", "/pm-handoff". Produces recipient-facing briefs with four required blocks (summary, what's done + how to use it, what's NOT done + next steps, open questions). Routes through configured channels (Linear, GitHub, markdown, chat/email).

2026-06-091

pm-improve

nthplusio/functional-claude

Distinct from `pm-review` (which *scores* whether an issue has the right structural shape), this skill *adds* project-specific context to an existing Linear or GitHub issue — relevant files, recent commits, owning modules, best-practice references, prior-art / related issues, and clarifying questions the author should answer before work starts. Use it when you want to give an issue more material to work from, not when you want to grade it. Trigger phrases "improve this issue", "enrich issue <ID>", "add context to this issue", "what's missing from this issue", "any relevant code for this issue", "research this issue", "qna this issue", "/pm-improve". Also use proactively after `pm-review` flags structural gaps that hint at missing context. Produces a draft enrichment; with `--apply`, posts it as a comment on the issue. Safe to re-run as repo state drifts — the marker mechanic replaces rather than stacks.

2026-06-091

name	evaluate-spawn
description	This skill should be used when the user asks to evaluate a completed team spawn session, capture session learnings, or review how a spawn went. Use this skill when the user says "evaluate spawn", "evaluate-spawn", "how did the spawn go", "capture learnings", "review spawn session", "spawn retrospective", or at the soft prompt after a team completes its work. This is a voluntary post-spawn evaluation with profile-based questions tailored to all spawn types (build, think, create). Build profile asks up to 4 questions (3 explicit + 1 conditional gate bypass); Think and Create profiles ask up to 3 questions (2 explicit + 1 gate bypass). Hard cap includes auto-derived scenario coverage for Build.
version	0.26.0

Post-Spawn Evaluation

Voluntary evaluation skill that captures session learnings after a team completes its work. Uses profile-based evaluation — each spawn category gets questions answerable at the moment of completion. Produces structured retrospective data written to docs/retrospectives/[team-name].md.

Distinct from /after-action-review: this skill evaluates output quality (spec scoring accuracy, scenario coverage, gate-bypass calibration). AAR evaluates team process (effectiveness, coordination patterns). Both write to docs/retrospectives/ and can run in the same session.

When to Use

After any team spawn completes — feature, debug, research, planning, review, design, brainstorm, productivity
Prompted via soft suggestion in spawn completion messages — never blocks session end
Can also be invoked manually at any time: /evaluate-spawn

Profile Selection

Detect the spawn type from the team name pattern (e.g., feature-*, research-*, design-*) or from docs/teams/[team-name]/ artifacts. If the type cannot be determined, ask:

"What type of spawn was this?"

Option	Maps to profile
Feature or Debug	Build
Research, Planning, or Review	Think
Design, Brainstorm, or Productivity	Create

Route to the matching profile below.

Build Profile (feature, debug)

Spec-driven teams that produce code. Has auto-derived scenario coverage, three explicit questions (plus one conditional gate bypass question), and a deferred section.

Auto-Derived: Scenario Coverage

Parse the Tester's ### Scenario Notes table from the team's task output or docs/teams/[team-name]/ artifacts. The expected format (from shared/scenario-collection.md):

### Scenario Notes

| Scenario | Status | Notes |
|---|---|---|
| [name] | Validated / Invalidated / Partial | [details] |

Count statuses:

Validated: scenarios confirmed working
Invalidated: scenarios that failed
Partial: scenarios partially satisfied

Report as: N validated / M total (X%). If no ### Scenario Notes table is found, record scenario-coverage: "no table found".

Question 1: Setup Alignment

"Did the discovery setup capture what mattered for the implementation?"

Option	Description
Yes — setup covered the key areas	Discovery interview and spec captured the important requirements
Partially — missed some areas	Some important aspects weren't surfaced during setup
No — setup missed the point	The discovery interview or spec fundamentally misaligned with the real need

Question 2: Gap Identification (conditional)

Only ask if Question 1 answer is not "Yes":

"What did the setup miss, and where should it have been caught?"

Free text answer for what was missed, followed by:

Option	Description
Discovery interview should have asked	The core interview questions didn't cover this area
Spec refinement should have surfaced it	The refinement step should have caught this edge case
Scoring should have flagged it	A low score on a dimension should have prompted rework
Team should have caught it during work	The spec was fine — the team missed something it covered

Question 3: Score Accuracy

"Did the spec quality score reflect actual implementation difficulty? (A higher score should predict an easier build.)"

Option	Value written to frontmatter
Matched — score reflected actual difficulty	`score-accuracy: matched`
Score too optimistic — build was harder than the score suggested	`score-accuracy: too-optimistic`
Score too pessimistic — build was easier than the score suggested	`score-accuracy: too-pessimistic`
No spec score was used	`score-accuracy: not-scored`

Question 4: Gate Bypass (conditional)

Only ask if spec-score: frontmatter value is present AND the score was below the gate threshold (default: 4/6):

"The spec quality gate showed a score below threshold. Did you proceed anyway, and if so, was that the right call?"

Option	Value written to frontmatter
Did not bypass — I refined the spec first	`gate-bypassed: false`
Bypassed — override was correct (spec was sufficient despite score)	`gate-bypassed: true`, `bypass-verdict: correct`
Bypassed — should have refined spec first	`gate-bypassed: true`, `bypass-verdict: should-have-refined`

If score met or exceeded threshold (no bypass scenario): write gate-bypassed: false to frontmatter without asking.

Deferred Section

These items cannot be evaluated at spawn completion. They are included as checkboxes in the retrospective file for the user to fill in during periodic rubric reviews:

First fix — What was the first thing you had to fix after using the output?

Build Profile Output

Write to docs/retrospectives/[team-name].md using the Build Profile template at ${CLAUDE_PLUGIN_ROOT}/skills/evaluate-spawn/references/build-template.md. The template defines the exact frontmatter schema and body structure.

Think Profile (research, planning, review)

Analysis-driven teams that produce documents, decisions, recommendations.

Question 1: Coverage

"Did the team investigate the right questions and areas?"

Option	Description
Yes — covered the important ground	The analysis addressed the key questions and areas
Partially — some gaps	Important questions were addressed but some areas were missed
No — wrong focus	The team spent effort on the wrong areas

Question 1a: Expected Outcomes Validation (conditional)

Only ask if ### Expected Outcomes section is present in the team's Context block (check docs/teams/[team-name]/ artifacts):

"Did the output address the Expected Outcomes defined before spawning?"

Option	Description
Yes — all outcomes addressed	The output answered the decision question / met the plan criteria at the specified confidence level
Partially — some outcomes addressed	Some outcomes were met but others were not addressed
No — outcomes not addressed	The output did not address the pre-spawn definition of done

Write outcome verdict to retrospective as: outcomes-addressed: all|partial|none

If ### Expected Outcomes section is NOT present: skip Question 1a entirely.

Question 2: Blind Spots (conditional)

Only ask if Question 1 answer is not "Yes":

"What important angle or concern was missed, and where should it have been caught?"

Free text answer for what was missed, followed by:

Option	Description
Discovery interview should have scoped it	The initial framing didn't include this angle
Team lead should have directed it	The lead's task breakdown missed this area
Teammates should have identified it	A teammate with the right focus area should have raised this

Question 3: Gate Bypass

"Did you override the spec quality gate (or skip it) when setting up this spawn? If so, was that the right call?"

Option	Value written to frontmatter
No override — gate was not triggered or spec was not scored	`gate-bypassed: false`
Bypassed — override was correct	`gate-bypassed: true`, `bypass-verdict: correct`
Bypassed — should have refined spec first	`gate-bypassed: true`, `bypass-verdict: should-have-refined`

Think Profile Output

Write to docs/retrospectives/[team-name].md using the Think Profile template at ${CLAUDE_PLUGIN_ROOT}/skills/evaluate-spawn/references/think-template.md.

Create Profile (design, brainstorm, productivity)

Creative/output-driven teams that produce ideas, concepts, artifacts, or workflow improvements.

Question 1: Relevance

"Did the output address the right problem space?"

Option	Description
Yes — targeted the actual need	The deliverables address the real problem or opportunity
Partially — some drift	Parts of the output are relevant but some effort was misdirected
No — wrong problem	The team solved a different problem than what was needed

Question 2: Constraint Adherence (conditional)

Only ask if Question 1 answer is not "Yes":

"Were the stated constraints respected, and what was violated?"

Free text answer for what was violated, followed by:

Option	Description
Constraints weren't communicated clearly	The discovery interview didn't surface the real constraints
Constraints were stated but ignored	The team had the information but didn't follow it
Constraints conflicted — team chose wrong	The team made a reasonable tradeoff but picked the wrong side

Question 3: Gate Bypass

"Did you override the spec quality gate (or skip it) when setting up this spawn? If so, was that the right call?"

Option	Value written to frontmatter
No override — gate was not triggered or spec was not scored	`gate-bypassed: false`
Bypassed — override was correct	`gate-bypassed: true`, `bypass-verdict: correct`
Bypassed — should have refined spec first	`gate-bypassed: true`, `bypass-verdict: should-have-refined`

Create Profile Output

Write to docs/retrospectives/[team-name].md using the Create Profile template at ${CLAUDE_PLUGIN_ROOT}/skills/evaluate-spawn/references/create-template.md.

Rubric Update Process

Evaluations produce input; rubric updates are a manual human step.

The user reads docs/retrospectives/ periodically and updates:

Build profile inputs:

shared/spec-quality-scoring.md — Adjust dimensions or thresholds based on deferred score accuracy data
shared/discovery-interview.md § Feature-Characteristic Heuristics — Add heuristic rows based on recurring "spec should have surfaced it" patterns. Each new row needs: Feature Characteristic, Trigger Signal, Follow-Up Question.
shared/discovery-interview.md — Adjust core questions based on recurring "discovery interview should have asked" patterns

Think profile inputs:

shared/discovery-interview.md — Improve scoping questions based on recurring "discovery should have scoped it" patterns
Team lead task breakdown patterns — Adjust based on "lead should have directed it" data

Create profile inputs:

shared/discovery-interview.md — Improve constraint elicitation based on recurring "constraints weren't communicated" patterns
Team instructions — Adjust based on "constraints were stated but ignored" patterns

Deferred checkboxes (Build profile): After 3+ evaluations, review the deferred sections across retrospective files to identify common first-fix categories. Score accuracy is now captured immediately (Question 3) — aggregate via /calibrate-scoring after 10+ Build profile retrospectives exist.

Reference files

File	Contents
references/build-template.md	Build profile frontmatter + body schema
references/think-template.md	Think profile frontmatter + body schema
references/create-template.md	Create profile frontmatter + body schema