adversarial-bug-hunt

Discover bugs through a 3-agent adversarial pipeline (finder → adversarial → referee) that exploits sycophancy for high-fidelity results. Use when reviewing code for bugs, especially when single-agent review isn't sufficient.

Install command

npx skills add https://github.com/AI-Lab-Yonder/ai-lab-agent-skills --skill adversarial-bug-hunt

Copy and paste this command into Claude Code to install the skill

Source

AI-Lab-Yonder/ai-lab-agent-skills

Stars: 18

Forks: 3

Updated: May 21, 2026 at 12:14

SKILL.md

name	adversarial-bug-hunt
description	Discover bugs through a 3-agent adversarial pipeline (finder → adversarial → referee) that exploits sycophancy for high-fidelity results. Use when reviewing code for bugs, especially when single-agent review isn't sufficient.
version	1.0.1
level	advanced
category	code-quality

Adversarial Bug Hunt

Three-agent bug discovery pipeline. A bug-finder over-reports, an adversarial agent disproves, a referee resolves. Each agent runs in a clean context to prevent cross-contamination.

Constraints

Read gotchas.md before starting
Each agent MUST be a separate Agent tool launch — never reuse context between agents
Scoring numbers are a prompt technique: they bias agent behavior via sycophancy and are not tracked
The referee is told that "ground truth exists"; this is intentional design that makes it more careful, not a mistake
Do NOT perform your own code review — your job is orchestration and presentation only
All agents must produce structured JSON output

Phase 0 — Determine Scope

If the user provided a scope (files, directory, "uncommitted changes"), use it. Otherwise ask. Default: uncommitted changes.
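The scope rule above can be sketched as a small fallback chain. This is only an illustration, assuming the scope is handled as a plain string; `Scope`, `resolve_scope`, and `ask_reply` are hypothetical names that do not appear in the skill.

```python
from dataclasses import dataclass
from typing import Optional

DEFAULT_SCOPE = "uncommitted changes"  # the default named in Phase 0


@dataclass(frozen=True)
class Scope:
    description: str   # e.g. a file list, a directory, or "uncommitted changes"
    from_user: bool    # False when the default was applied


def resolve_scope(user_scope: Optional[str], ask_reply: Optional[str] = None) -> Scope:
    """Prefer the scope given up front, then the answer to a follow-up
    question, and fall back to the default only if both are empty."""
    for candidate in (user_scope, ask_reply):
        if candidate and candidate.strip():
            return Scope(candidate.strip(), from_user=True)
    return Scope(DEFAULT_SCOPE, from_user=False)
```

For instance, `resolve_scope(None, "src/api/")` yields a user-provided scope, while `resolve_scope(None)` falls back to the default.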

Phase 1 — Bug-Finder Agent

Read references/agent-prompts.md for the bug-finder prompt template.

Launch an Agent (general-purpose) with the bug-finder prompt, passing the scope. This agent is incentivized to over-report — it produces the superset of all possible bugs.

Returns to the orchestrator: a JSON array of findings that follows the schema in templates/finding-schema.json.
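Because every later phase consumes this array, it may help to check its shape before passing it on. The sketch below is an assumption-laden illustration: the real field list lives in templates/finding-schema.json, which is not reproduced here, so `REQUIRED_KEYS` and `parse_findings` are hypothetical placeholders.

```python
import json

# Hypothetical minimal key set; the authoritative schema is
# templates/finding-schema.json and may require different fields.
REQUIRED_KEYS = {"id", "description"}


def parse_findings(raw: str) -> list[dict]:
    """Parse an agent's JSON output and check it is an array of objects
    that each carry the (assumed) required keys."""
    data = json.loads(raw)
    if not isinstance(data, list):
        raise ValueError("expected a JSON array of findings")
    for index, item in enumerate(data):
        if not isinstance(item, dict):
            raise ValueError(f"finding {index} is not a JSON object")
        missing = REQUIRED_KEYS - item.keys()
        if missing:
            raise ValueError(f"finding {index} is missing keys: {sorted(missing)}")
    return data
```

Failing fast here keeps a malformed bug-finder response from being handed to the adversarial agent.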

Phase 2 — Adversarial Agent

Read references/agent-prompts.md for the adversarial prompt template.

Launch an Agent (general-purpose) with the adversarial prompt, passing the bug-finder's full output. This agent is incentivized to disprove — it produces the subset of likely real bugs.

Returns to the orchestrator: the same findings array, with each finding annotated with adversarial_verdict and adversarial_reasoning.

Phase 3 — Referee Agent

Read references/agent-prompts.md for the referee prompt template.

Launch an Agent (general-purpose) with both agents' outputs. The referee resolves each dispute.

Returns to the orchestrator: the same findings array, with each finding annotated with referee_verdict and referee_reasoning.
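The data flow across Phases 1 to 3 can be sketched as three independent launches, each seeing only what is passed to it explicitly. This is a rough model, not the skill's implementation: `Launch`, `run_pipeline`, and `fake_launch` are hypothetical, the `"finder"`/`"adversarial"` wrapper keys are invented for the example, and the verdict values used by the stub are placeholders.

```python
import json
from typing import Callable

# Hypothetical stand-in for one Agent tool launch: (role, input text) -> JSON text.
# Each call is assumed to start from a clean context.
Launch = Callable[[str, str], str]


def run_pipeline(scope: str, launch: Launch) -> list[dict]:
    """Run finder -> adversarial -> referee, forwarding only explicit outputs."""
    finder_out = launch("bug-finder", scope)
    adversarial_out = launch("adversarial", finder_out)
    # The referee gets both earlier outputs, as Phase 3 describes.
    referee_in = json.dumps({
        "finder": json.loads(finder_out),
        "adversarial": json.loads(adversarial_out),
    })
    return json.loads(launch("referee", referee_in))


def fake_launch(role: str, payload: str) -> str:
    """Canned responses so the sketch runs without any real agent."""
    if role == "bug-finder":
        return json.dumps([{"id": "BUG-001"}])
    if role == "adversarial":
        findings = json.loads(payload)
        return json.dumps([
            {**f, "adversarial_verdict": "DISPROVED", "adversarial_reasoning": "stub"}
            for f in findings
        ])
    findings = json.loads(payload)["adversarial"]
    return json.dumps([
        {**f, "referee_verdict": "UNCERTAIN", "referee_reasoning": "stub"}
        for f in findings
    ])
```

Calling `run_pipeline("uncommitted changes", fake_launch)` returns the single stub finding carrying both the adversarial and the referee annotations.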

Phase 4 — Present Results

Read references/report-format.md for display format. See examples/ for concrete samples.

Show only CONFIRMED and UNCERTAIN findings. If every finding is DISPROVED, say "No confirmed issues found in the reviewed code." and stop.

Otherwise, ask the user which finding IDs to fix:

"Which issues would you like me to fix? You can list IDs (e.g., BUG-001, BUG-003) or say 'all'."

CRITICAL — next turn action: When the user replies, your very first tool call MUST be EnterPlanMode. The plan must reference the specific findings, evidence, and fix recommendations from the report.
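The filtering and selection rules in this phase can be sketched as follows. It is a minimal illustration that assumes each finding is a dict with a hypothetical `id` key alongside `referee_verdict`, and that IDs follow the `BUG-001` pattern from the prompt above; the actual display format is defined in references/report-format.md and is not modeled here.

```python
import re

SHOWN_VERDICTS = {"CONFIRMED", "UNCERTAIN"}
NO_ISSUES = "No confirmed issues found in the reviewed code."
ID_PATTERN = re.compile(r"\bBUG-\d+\b", re.IGNORECASE)


def findings_to_show(findings: list[dict]) -> list[dict]:
    """Drop DISPROVED findings; keep CONFIRMED and UNCERTAIN ones."""
    return [f for f in findings if f.get("referee_verdict") in SHOWN_VERDICTS]


def parse_selection(reply: str, shown_ids: list[str]) -> list[str]:
    """Map a reply such as "BUG-001, BUG-003" or "all" to finding IDs.
    Explicit IDs win over the word "all"; unknown IDs are ignored."""
    wanted = {match.upper() for match in ID_PATTERN.findall(reply)}
    if wanted:
        return [fid for fid in shown_ids if fid in wanted]
    if re.search(r"\ball\b", reply, re.IGNORECASE):
        return list(shown_ids)
    return []
```

When `findings_to_show` returns an empty list, the orchestrator would report `NO_ISSUES` and stop; otherwise the selected IDs feed the plan that the next turn opens with.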
