一键在 Manus 中运行任何 Skill

oracle-review

星标0

分支0

更新时间2026年6月10日 12:02

Run iterative oracle + agent hardening loop on any artifact (designs, plans, beads, architecture) until findings converge to near-zero. Combines /swarm-oracle with /swarm-review in alternating rounds. Use for the full hardening cycle, not just a single oracle pass. For oracle-only, use /swarm-oracle. For bead-only hardening, use /swarm-beads-quality.

安装

用 Codex 或 Claude 帮你安装复制这段 Prompt，粘贴到 Codex、Claude 或其他助手里，让它检查 Skill 页面并帮你完成安装。

在 Manus 中运行

来源

oysteinkrog

oysteinkrog/dotfiles

打开 GitHub 仓库查看创作者相关仓库

下载

在 Manus 中运行

Oracle Review Skill

Oracle policy (2026-06): Fable (claude-fable-5) is the primary oracle. By default the FOR/AGAINST sessions below run as two fresh Fable subagents (Agent tool with model: "fable"; Fable is not reachable through PAL). The PAL 2x GPT-Pro setup is the escalation tier — extremely important or complex validations only, and always paired with a Fable consultation on the same prompt. See /consult-oracles and the Oracle Consultation Policy in ~/CLAUDE.md.

Two-part process: (1) oracle consensus validation with FOR/AGAINST stances, then (2) iterative hardening loop until findings converge to near-zero.

When to Use

After design completion (validate UX + architecture)
After implementation planning (validate feasibility + correctness)
After bead creation (validate readiness + completeness)
After architecture decisions (validate scope + approach)
Any high-stakes decision needing external challenge

When NOT to Use

Quick questions or second opinions (use /swarm-oracle-standalone)
Code review (use /swarm-agents type=review)
Simple validation that doesn't need adversarial challenge

Part 1: Oracle Consensus Validation

Setup

Run 2 concurrent oracle sessions:

Default tier: two Fable subagents (Agent, model: "fable"), one FOR stance, one AGAINST stance, spawned in a single message
Escalation tier (extremely important/complex only): PAL MCP consensus with 2x GPT-5.5-Pro (FOR + AGAINST), plus a parallel Fable consult on the same prompt

Oracle Prompt Template

Evaluate the following [ARTIFACT_TYPE] for [FEATURE]:

[ARTIFACT CONTENT or reference to document]

Score 1-10 on:
1. Correctness — Are the technical decisions sound?
2. Completeness — Are there gaps or missing considerations?
3. Consistency — Do parts contradict each other?
4. Feasibility — Can this be implemented as specified?
5. Quality — Does this meet production standards?

For each issue found, provide:
- Severity: CRITICAL / HIGH / MEDIUM / LOW
- Location: Which section/bead/decision
- Problem: What is wrong
- Recommendation: Specific fix

Do NOT say "looks good" without specific evidence.
Produce at least 3 actionable findings per category.

Interpreting Results

Score	Meaning	Action
9-10	Excellent	Proceed, apply minor findings
7-8	Good with issues	Fix all CRITICAL/HIGH, proceed
5-6	Significant problems	Fix all issues, re-validate
<5	Fundamental issues	Redesign, then re-validate

Both oracles typically converge on similar scores. If they diverge by >2 points, investigate the disagreement — it usually reveals a genuine ambiguity.

Apply Corrections

After each oracle round:

Compile all findings from both stances
Deduplicate (FOR and AGAINST often find same issues from different angles)
Prioritize: CRITICAL first, then HIGH
Apply fixes to the artifact
Document what changed and why

Part 2: Iterative Hardening Loop

After oracle validation, run hardening rounds until convergence.

Round Structure

Each round has 3 steps:

Step A: Review (10 Opus agents via /swarm-agents type=review)

Each agent reviews with a different lens
Finds issues, inconsistencies, gaps
Output: list of findings per agent

Step B: Fix

Compile all findings
Apply fixes (6-8 Opus agents for large artifacts)
Each fix validated against the original plan/design

Step C: Validate Fixes

Oracle round on the fixed artifact
OR agent review of just the changes
Confirm fixes don't introduce new issues

Convergence Signal

Track issues found per round:

Round	Issues Found	Action
1	15-20	Expected — many first-pass issues
2	8-12	Good — deeper issues surfacing
3	3-5	Converging — mostly edge cases
4	0-2	Done — ready to ship

Stop when a round finds <= 2 non-trivial issues. Typical: 3-4 rounds.

Hardening Agent Prompt Template

You are hardening [ARTIFACT] for [FEATURE].
Round {N} of iterative review.

Previous rounds found and fixed:
[SUMMARY OF PRIOR FINDINGS]

Your lens: [SPECIFIC_LENS]

Review the artifact and:
1. Check that prior fixes are correctly applied
2. Find NEW issues not caught in earlier rounds
3. Verify cross-cutting concerns are embedded (not just referenced)
4. Check acceptance criteria are Given/When/Then format
5. Verify file paths exist and are correct
6. Ensure no bead is >3 files (split if needed)

For beads specifically:
- Each bead must be self-contained (implementable without reading other beads)
- Dependencies must form a DAG (no cycles)
- Test beads must reference specific test methods/classes
- Acceptance criteria must be machine-verifiable

Target-Specific Guidance

Design Validation

Oracle focus: UX soundness, information architecture, interaction model consistency, accessibility, progressive disclosure balance, terminology.

Plan Validation

Oracle focus: Feasibility, file-level correctness, dependency ordering, risk coverage, test strategy completeness, migration safety.

Bead Validation

Oracle focus: Self-containment, AC specificity, dependency DAG, file path accuracy, test coverage, priority ordering, cross-cutting embedding.

Architecture Validation

Oracle focus: Rewrite scope (not too much/little), backward compatibility, migration path, performance impact, data integrity, rollback strategy.

Full Hardening Pipeline Example

Phase 1: Oracle (2x Fable FOR/AGAINST; escalate to 2x GPT-5.5-Pro + Fable if extremely important)
  -> Fix CRITICAL/HIGH findings
Phase 2: Agent Review (10 Opus, multi-lens)
  -> Fix all findings
Phase 3: Oracle (2x Fable FOR/AGAINST) on fixes
  -> Verify fixes, find remaining issues
Phase 4: Agent Hardening (8 Opus, fresh eyes)
  -> Embed cross-cutting, convert ACs, split oversized
Phase 5: Final Correctness (10 Opus)
  -> Verify everything, fix last issues
  -> If <= 2 issues: DONE
  -> If > 2 issues: repeat from Phase 3

Key Rules

Always run both FOR and AGAINST — single-stance misses adversarial findings
Fix before re-validating — never run a new round on unfixed artifacts
Track convergence — if issues aren't decreasing, the artifact needs redesign, not more rounds
Validate fixes against plan — hardening must not drift from the original design intent
Oracle before agents, agents before oracle — alternate perspectives for best coverage
Verify PAL MCP is running before launching GPT/Gemini oracle sessions (agents silently fall back to self-analysis without it); Fable subagents need no PAL

同仓库更多 Skills

同仓库

consult-oracles

oysteinkrog/dotfiles

Consult Fable (primary oracle) for expert second opinions; escalate to GPT-5.5-Pro only for extremely important or complex tasks (always paired with Fable). Use for complex decisions, architecture choices, debugging hard problems, or when user says "consult oracles", "ask the experts", or wants a second opinion.

2026-06-100

oracle-consensus

oysteinkrog/dotfiles

Run 2x oracle sessions (FOR + AGAINST stances) to validate design decisions, plans, or bead readiness. Default = two Fable subagents; escalate to PAL 2x GPT-Pro (always paired with Fable) for extremely important or complex validations. Use after design rounds, before implementation, or to challenge architecture decisions.

2026-06-100

sync-human

oysteinkrog/dotfiles

Act as a wise, effective teacher whose goal is to make the human deeply understand the work done in this session (a change, a bug fix, a feature, a design) — i.e. sync the human's mental model up to the agent's. Use when the user says "sync-human", "sync me up", "teach me this session", "make sure I understand", "walk me through what we did", "quiz me on this", or "I want to actually understand this PR/change", or otherwise wants Socratic, gated, incremental teaching with comprehension checks rather than a one-shot summary. Drives understanding at both high level (motivation, impact) and low level (business logic, edge cases) using a running checklist and quizzes.

2026-06-050

agent-mail

oysteinkrog/dotfiles

MCP Agent Mail for multi-agent coordination. Use when agents need file locks, messaging, inboxes, or conflict prevention. Handles macro_start_session, file_reservation_paths, send_message, threading, pre-commit guards.

2026-05-290

secret-lookup

oysteinkrog/dotfiles

Retrieve API tokens, keys, and credentials Oystein has stored locally. Use whenever code, scripts, or shell commands need a secret value: GitHub tokens, Cloudflare, HubSpot, Slack, Zendesk, Jira, Sentry, Anthropic, Apify, Browserbase, Google OAuth, Huma. Use BEFORE searching shell history, session logs, dotfiles, or the filesystem — the canonical store is documented here and the values are reachable via two fish helpers. Also use when adding, rotating, or removing a credential.

2026-05-110

changelog-md-workmanship

oysteinkrog/dotfiles

Rebuild CHANGELOG.md files and release histories from git, tags, releases, and issue trackers. Use when writing changelogs, version timelines, or agent-facing project history summaries.

2026-05-080

name	oracle-review
model	opus
description	Run iterative oracle + agent hardening loop on any artifact (designs, plans, beads, architecture) until findings converge to near-zero. Combines /swarm-oracle with /swarm-review in alternating rounds. Use for the full hardening cycle, not just a single oracle pass. For oracle-only, use /swarm-oracle. For bead-only hardening, use /swarm-beads-quality.
triggers	["oracle review","hardening loop","iterate until converged","full review cycle"]
argument-hint	<target: design\|plan\|beads\|architecture> [--rounds N]

Oracle Review Skill

Oracle policy (2026-06): Fable (claude-fable-5) is the primary oracle. By default the FOR/AGAINST sessions below run as two fresh Fable subagents (Agent tool with model: "fable"; Fable is not reachable through PAL). The PAL 2x GPT-Pro setup is the escalation tier — extremely important or complex validations only, and always paired with a Fable consultation on the same prompt. See /consult-oracles and the Oracle Consultation Policy in ~/CLAUDE.md.

Two-part process: (1) oracle consensus validation with FOR/AGAINST stances, then (2) iterative hardening loop until findings converge to near-zero.

When to Use

After design completion (validate UX + architecture)
After implementation planning (validate feasibility + correctness)
After bead creation (validate readiness + completeness)
After architecture decisions (validate scope + approach)
Any high-stakes decision needing external challenge

When NOT to Use

Quick questions or second opinions (use /swarm-oracle-standalone)
Code review (use /swarm-agents type=review)
Simple validation that doesn't need adversarial challenge

Part 1: Oracle Consensus Validation

Setup

Run 2 concurrent oracle sessions:

Default tier: two Fable subagents (Agent, model: "fable"), one FOR stance, one AGAINST stance, spawned in a single message
Escalation tier (extremely important/complex only): PAL MCP consensus with 2x GPT-5.5-Pro (FOR + AGAINST), plus a parallel Fable consult on the same prompt

Oracle Prompt Template

Evaluate the following [ARTIFACT_TYPE] for [FEATURE]:

[ARTIFACT CONTENT or reference to document]

Score 1-10 on:
1. Correctness — Are the technical decisions sound?
2. Completeness — Are there gaps or missing considerations?
3. Consistency — Do parts contradict each other?
4. Feasibility — Can this be implemented as specified?
5. Quality — Does this meet production standards?

For each issue found, provide:
- Severity: CRITICAL / HIGH / MEDIUM / LOW
- Location: Which section/bead/decision
- Problem: What is wrong
- Recommendation: Specific fix

Do NOT say "looks good" without specific evidence.
Produce at least 3 actionable findings per category.

Interpreting Results

Score	Meaning	Action
9-10	Excellent	Proceed, apply minor findings
7-8	Good with issues	Fix all CRITICAL/HIGH, proceed
5-6	Significant problems	Fix all issues, re-validate
<5	Fundamental issues	Redesign, then re-validate

Both oracles typically converge on similar scores. If they diverge by >2 points, investigate the disagreement — it usually reveals a genuine ambiguity.

Apply Corrections

After each oracle round:

Compile all findings from both stances
Deduplicate (FOR and AGAINST often find same issues from different angles)
Prioritize: CRITICAL first, then HIGH
Apply fixes to the artifact
Document what changed and why

Part 2: Iterative Hardening Loop

After oracle validation, run hardening rounds until convergence.

Round Structure

Each round has 3 steps:

Step A: Review (10 Opus agents via /swarm-agents type=review)

Each agent reviews with a different lens
Finds issues, inconsistencies, gaps
Output: list of findings per agent

Step B: Fix

Compile all findings
Apply fixes (6-8 Opus agents for large artifacts)
Each fix validated against the original plan/design

Step C: Validate Fixes

Oracle round on the fixed artifact
OR agent review of just the changes
Confirm fixes don't introduce new issues

Convergence Signal

Track issues found per round:

Round	Issues Found	Action
1	15-20	Expected — many first-pass issues
2	8-12	Good — deeper issues surfacing
3	3-5	Converging — mostly edge cases
4	0-2	Done — ready to ship

Stop when a round finds <= 2 non-trivial issues. Typical: 3-4 rounds.

Hardening Agent Prompt Template

You are hardening [ARTIFACT] for [FEATURE].
Round {N} of iterative review.

Previous rounds found and fixed:
[SUMMARY OF PRIOR FINDINGS]

Your lens: [SPECIFIC_LENS]

Review the artifact and:
1. Check that prior fixes are correctly applied
2. Find NEW issues not caught in earlier rounds
3. Verify cross-cutting concerns are embedded (not just referenced)
4. Check acceptance criteria are Given/When/Then format
5. Verify file paths exist and are correct
6. Ensure no bead is >3 files (split if needed)

For beads specifically:
- Each bead must be self-contained (implementable without reading other beads)
- Dependencies must form a DAG (no cycles)
- Test beads must reference specific test methods/classes
- Acceptance criteria must be machine-verifiable

Target-Specific Guidance

Design Validation

Oracle focus: UX soundness, information architecture, interaction model consistency, accessibility, progressive disclosure balance, terminology.

Plan Validation

Oracle focus: Feasibility, file-level correctness, dependency ordering, risk coverage, test strategy completeness, migration safety.

Bead Validation

Oracle focus: Self-containment, AC specificity, dependency DAG, file path accuracy, test coverage, priority ordering, cross-cutting embedding.

Architecture Validation

Oracle focus: Rewrite scope (not too much/little), backward compatibility, migration path, performance impact, data integrity, rollback strategy.

Full Hardening Pipeline Example

Phase 1: Oracle (2x Fable FOR/AGAINST; escalate to 2x GPT-5.5-Pro + Fable if extremely important)
  -> Fix CRITICAL/HIGH findings
Phase 2: Agent Review (10 Opus, multi-lens)
  -> Fix all findings
Phase 3: Oracle (2x Fable FOR/AGAINST) on fixes
  -> Verify fixes, find remaining issues
Phase 4: Agent Hardening (8 Opus, fresh eyes)
  -> Embed cross-cutting, convert ACs, split oversized
Phase 5: Final Correctness (10 Opus)
  -> Verify everything, fix last issues
  -> If <= 2 issues: DONE
  -> If > 2 issues: repeat from Phase 3

Key Rules

Always run both FOR and AGAINST — single-stance misses adversarial findings
Fix before re-validating — never run a new round on unfixed artifacts
Track convergence — if issues aren't decreasing, the artifact needs redesign, not more rounds
Validate fixes against plan — hardening must not drift from the original design intent
Oracle before agents, agents before oracle — alternate perspectives for best coverage
Verify PAL MCP is running before launching GPT/Gemini oracle sessions (agents silently fall back to self-analysis without it); Fable subagents need no PAL