with one click
ln-310-multi-agent-validator
// Use when validating Stories, plans, or tasks through the evaluation platform with mandatory research, parallel evidence lanes, sequential merge, and bounded refinement. Modes: story | plan_review.
// Use when validating Stories, plans, or tasks through the evaluation platform with mandatory research, parallel evidence lanes, sequential merge, and bounded refinement. Modes: story | plan_review.
[HINT] Download the complete skill directory including SKILL.md and all related files
| name | ln-310-multi-agent-validator |
| description | Use when validating Stories, plans, or tasks through the evaluation platform with mandatory research, parallel evidence lanes, sequential merge, and bounded refinement. Modes: story | plan_review. |
| license | MIT |
Paths: File paths (
references/,../ln-*) are relative to this skill directory.
Type: L2 Coordinator Category: 3XX Planning
Evaluation-platform coordinator for:
mode=storymode=plan_reviewThis skill uses the evaluation platform for:
| Input | Required | Source | Description |
|---|---|---|---|
storyId | mode=story | args, git branch, kanban, user | Story to validate |
plan {file} | mode=plan_review | args or auto | Plan file to validate |
Mode detection:
plan or plan {file} -> mode=plan_reviewmode=storyMANDATORY READ: Load references/environment_state_contract.md, references/storage_mode_detection.md, references/input_resolution_pattern.md
MANDATORY READ: Load references/evaluation_coordinator_runtime_contract.md, references/evaluation_summary_contract.md, references/evaluation_parallelism_policy.md, references/evaluation_research_contract.md
MANDATORY READ: Load references/agent_delegation_pattern.md
MANDATORY READ: Load references/penalty_points.md
MANDATORY READ: Load references/researchgraph_mcp_usage.md when researchgraph files changed or the target claims hypothesis, goal, benchmark, or proposal readiness.
Conditional read: load references/phase2_research_audit.md only when the coordinator performs inline criteria mapping instead of consuming ln-312 findings summaries.
Agent review policy: run health check, record skipped reason when no advisor is available, verify every advisor claim before merge, and treat transport/auth/tool failures as operator evidence rather than domain findings. Load references/agent_review_workflow.md only when debugging lifecycle/liveness details outside the evaluation runtime.
The coordinator uses these evaluation workers:
ln-311-review-research-workerln-312-review-findings-workerln-313-review-docs-workerln-314-review-repair-workerln-315-review-merge-workerln-316-review-refinement-workerHost Skill Invocation: Skill(skill: "...", args: "...") is mandatory delegation.
SKILL.md, treat args as $ARGUMENTS, execute that skill workflow, then return here with its result/artifact.Use the Skill tool for delegated workers. Do not inline worker logic inside the coordinator.
TodoWrite format (mandatory):
Resolve target and build runtime manifestLoad target artifacts and metadataLaunch external agents and verify healthRun research and findings workers in parallelGenerate documentation updatesApply accepted low-risk repairsSync agents and merge all evidenceRun refinement (MANDATORY in ALL modes when advisor available — do NOT skip)Compute verdict and write review outputVerify runtime cleanup and self-checkRepresentative invocations:
Skill(skill: "ln-311-review-research-worker", args: "{identifier} research")
Skill(skill: "ln-312-review-findings-worker", args: "{identifier} findings")
Skill(skill: "ln-313-review-docs-worker", args: "{identifier} docs")
Skill(skill: "ln-314-review-repair-worker", args: "{identifier} repair")
Skill(skill: "ln-315-review-merge-worker", args: "{identifier} merge")
Skill(skill: "ln-316-review-refinement-worker", args: "{identifier} refinement")
MANDATORY READ: Load references/loop_health_contract.md
Runtime family:
evaluation-runtimeIdentifier:
story-{storyId} for story modeplan-{slug} for plan reviewPhase order:
PHASE_0_CONFIGPHASE_1_DISCOVERYPHASE_2_AGENT_LAUNCHPHASE_3_EVIDENCE_LANESPHASE_4_DOCSPHASE_5_REPAIRPHASE_6_MERGEPHASE_7_REFINEMENTPHASE_8_APPROVALPHASE_9_SELF_CHECKPhase policy:
delegate_phases = [PHASE_3_EVIDENCE_LANES, PHASE_4_DOCS, PHASE_5_REPAIR, PHASE_6_MERGE, PHASE_7_REFINEMENT]aggregate_phase = PHASE_6_MERGEreport_phase = PHASE_8_APPROVALcleanup_phase = PHASE_9_SELF_CHECKself_check_phase = PHASE_9_SELF_CHECKagent_resolve_before = [PHASE_6_MERGE]required_phases_when_advisor_available = [PHASE_7_REFINEMENT]Allowed overlap:
ln-311ln-312Sequential only:
ln-313ln-314ln-315ln-316mode, identifier, and storage mode.expected_agentsrequired_research=truephase_orderphase_policynode references/scripts/evaluation-runtime/cli.mjs start \
--skill ln-310 \
--identifier {identifier} \
--manifest-file .hex-skills/evaluation/{identifier}_manifest.json
mode=story, resolve Story and child tasks.mode=plan_review, resolve the plan file.H##, G##, run IDs, benchmark manifests, or readiness claims, run read-only researchgraph verification/audits and attach the result as validation evidence..hex-skills/environment_state.json.agents_skipped_reasonnode references/scripts/evaluation-runtime/cli.mjs register-agent \
--skill ln-310 \
--identifier {identifier} \
--agent {name} \
--prompt-file {promptPath} \
--result-file {resultPath} \
--metadata-file {metadataPath}
health_check_done, agents_available, agents_required, and optional agents_skipped_reason.rate_limited, tool_missing, auth_missing, permission_denial, and asked_question are transport/operator states.NO-GO without domain evidence from artifacts or findings.This phase is the mandatory parallel evidence barrier.
worker_plan with:
ln-311 lane research (mandatory)ln-312 lane findings (mandatory)node references/scripts/evaluation-runtime/cli.mjs record-worker-result \
--skill ln-310 \
--identifier {identifier} \
--payload-file {childSummaryArtifactPath}
Research is mandatory in every mode:
For mode=story, findings must still produce penalty-point evidence and coverage analysis.
mode=story, run ln-313-review-docs-worker when documentation changes are required.mode=plan_review, skip only when there is no documentation delta to create.ln-314-review-repair-worker.Preconditions:
Steps:
node references/scripts/evaluation-runtime/cli.mjs sync-agent --skill ln-310 --identifier {identifier}
ln-315-review-merge-worker.aggregation_summary.NEVER SKIP THIS PHASE. Phase 7 applies to ALL modes:
story,plan_review. The ONLY valid skip reason is no advisor available in health check. Mode is NOT a skip reason. Complexity is NOT a skip reason. Time is NOT a skip reason. If you are about to checkpoint Phase 7 without running ln-316 while an advisor is available — STOP. You are making an error.
| Mode | Phase 7 required? | Skip allowed? |
|---|---|---|
story | YES | NO (only if no advisor available) |
plan_review | YES | NO (only if no advisor available) |
Phase 7 is MANDATORY when an advisor is available. The coordinator MUST NOT checkpoint Phase 7 without a recorded review-refinement worker summary from ln-316. The runtime advance command will reject the transition if an advisor was available in health check but no refinement summary exists.
Run ln-316-review-refinement-worker. Refinement uses a 2-stage state machine:
Rules:
agent_runner.mjs (NOT host-native sub-agents)failure_class, progress_signals, and session_usable from agent_runner.mjs; classified transport failures pause/defer instead of becoming domain findingssync-agent; Claude hosts may use Monitor for observabilityStory mode:
| Metric | Before | After | Meaning |
|---|---|---|---|
| Penalty Points | from ln-312 | from ln-314 | 0 = all fixed |
| Readiness Score | clamp(1,10,10-floor(before/5)) | clamp(1,10,10-floor(after/5)) | Quality (1-10) |
| Anti-Hallucination | — | from ln-311 | VERIFIED/FLAGGED |
| AC Coverage | — | N/N | 100% = pass |
| Gate | — | GO/NO_GO | Final verdict |
GO = penalty_after=0 AND no FLAGGED items AND ac_coverage=100%NO_GO = otherwiseNO_GONO_GOGO: mutate Story status to Todo; update kanban_board.md to APPROVED.NO_GO.Plan mode:
Write coordinator summary:
node references/scripts/evaluation-runtime/cli.mjs record-summary \
--skill ln-310 \
--identifier {identifier} \
--payload '{...evaluation-coordinator summary...}'
Required checks:
Then:
node references/scripts/evaluation-runtime/cli.mjs complete --skill ln-310 --identifier {identifier}
Coordinator summary kind:
evaluation-coordinatorRecommended payload fields:
statusfinal_resultreport_pathworker_countagent_countissues_totalseverity_countswarningscleanup_verifiedresearch_completedpenalty_beforepenalty_afterreadiness_scoreac_coveragegate (GO/NO_GO)flagged_itemsevaluation-coordinator summary writtenOptional reference: load references/meta_analysis_protocol.md only when the user asks for post-run meta-analysis or protocol-formatted run reflection.
When requested after the coordinator run, analyze the session per protocol section 7 and include the protocol-formatted output with the final review result.
references/evaluation_coordinator_runtime_contract.md, references/evaluation_summary_contract.mdreferences/evaluation_research_contract.md, references/research_tool_fallback.md, references/plan_review_pipeline.mdreferences/evaluation_parallelism_policy.md../ln-311-review-research-worker/SKILL.md, ../ln-312-review-findings-worker/SKILL.md, ../ln-313-review-docs-worker/SKILL.md, ../ln-314-review-repair-worker/SKILL.md, ../ln-315-review-merge-worker/SKILL.md, ../ln-316-review-refinement-worker/SKILL.mdreferences/phase2_research_audit.md, references/penalty_points.mdreferences/cross_reference_validation.md, references/dependency_validation.md, references/domain_patterns.md, references/templates/mcp_ref_findings_template.md, references/premortem_validation.md, references/quality_validation.md, references/risk_validation.md, references/solution_validation.md, references/standards_validation.md, references/structural_validation.md, references/traceability_validation.md, references/workflow_validation.mdVersion: 8.0.0 Last Updated: 2026-03-22