red-team-tribunal

Stars: 10

Forks: 2

Updated: February 13, 2026 at 10:14

Multi-agent adversarial verification system. Use when you need thorough code review, security auditing, or quality validation. Spawns three specialized agents (Skeptic, User Proxy, Optimizer) that must reach consensus before code can be approved.

Source

aegntic

aegntic/cldcde

Related occupations (SOC)

Information Security Analysts · Computer and Mathematical Occupations · SOC 15-1212

SKILL.md

name	red-team-tribunal
description	Multi-agent adversarial verification system. Use when you need thorough code review, security auditing, or quality validation. Spawns three specialized agents (Skeptic, User Proxy, Optimizer) that must reach consensus before code can be approved.
version	1.0.0
author	Essential 2026 Suite
tags	["review","security","quality","multi-agent","consensus"]
trigger_patterns	["review this code","security audit","tribunal review","multi-agent review","adversarial review","approve this PR","check for issues","quality review","before merging","tribunal"]
allowed_tools	["Bash","Read","Write","Edit"]

Red Team Tribunal: Adversarial Verification

Overview

The Red Team Tribunal uses Opus 4.6 Agent Teams to create an adversarial review loop that prevents "confident mistakes." Three specialized sub-agents work in parallel to find issues from different perspectives.

The Tribunal Structure

🤔 The Skeptic (Security/Logic)

Role: Security auditor and logic validator
Goal: Find at least one valid issue (must find something)
Focus: Security flaws, logic errors, edge cases, race conditions
Confidence Target: >80%

👤 The User Proxy (UX/Edge Cases)

Role: End-user simulator
Goal: Break the feature from a user's perspective
Focus: Usability, invalid inputs, confusing flows, accessibility
Tools: Browser automation, form fuzzing

⚡ The Optimizer (Performance)

Role: Performance engineer
Goal: Identify efficiency bottlenecks
Focus: Algorithmic complexity, memory usage, database queries, caching
Metrics: Time complexity (Big-O), response times, resource usage

When to Use

Activate the Tribunal for:

Critical code changes (auth, payments, security)
Before merging pull requests
When adding new features
Security-sensitive implementations
Performance-critical code
Code that affects multiple users

Usage

Trigger Tribunal Review

# Review a file
python3 /a0/usr/plugins/red-team-tribunal/red-team-tribunal.py --target <file>

# Review a PR
python3 /a0/usr/plugins/red-team-tribunal/red-team-tribunal.py --pr <number>

# Review a commit
python3 /a0/usr/plugins/red-team-tribunal/red-team-tribunal.py --diff <hash>

Understanding Verdicts

CONSENSUS OPTIONS:

APPROVED (All agents pass)
- Code meets all quality standards
- Ready to merge

CONDITIONAL (Concerns raised)
- Minor issues found
- Address concerns before merge
- Can proceed with fixes

REJECTED (Critical issues)
- Security vulnerabilities or major flaws
- Must fix before reconsideration
- Returns detailed recommendations

Review Process

Step 1: Agent Assembly

Three agents spawn in parallel:

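# Runs inside an async function; spawn_agent is assumed to be a coroutine that starts one agent
agents = ["skeptic", "user_proxy", "optimizer"]
tasks = [spawn_agent(agent, target) for agent in agents]
results = await asyncio.gather(*tasks)  # wait for all three agents to finish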
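The snippet above needs an enclosing coroutine and a `spawn_agent` helper, neither of which is defined on this page. A minimal runnable sketch of the same fan-out pattern follows. The `spawn_agent` body and the `run_tribunal` wrapper are hypothetical stand-ins, since the real skill presumably launches a sub-agent at that point:

```python
import asyncio

AGENTS = ["skeptic", "user_proxy", "optimizer"]


async def spawn_agent(agent: str, target: str) -> dict:
    """Hypothetical stand-in: the real helper would launch a sub-agent."""
    await asyncio.sleep(0)  # yield control, as real I/O would
    return {"agent": agent, "target": target, "verdict": "pass"}


async def run_tribunal(target: str) -> list:
    # Create one coroutine per agent, then run them concurrently.
    tasks = [spawn_agent(agent, target) for agent in AGENTS]
    # gather() returns results in the same order as the tasks it was given.
    return await asyncio.gather(*tasks)


results = asyncio.run(run_tribunal("src/auth/login.ts"))
```

Because `asyncio.gather` preserves input order, `results[0]` always belongs to the Skeptic regardless of which agent finishes first.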

Step 2: Individual Analysis

Each agent analyzes the target from its own specialty:

Skeptic: Scans for vulnerabilities, logic gaps
User Proxy: Attempts to break UX, finds edge cases
Optimizer: Reviews complexity, resource usage

Step 3: Consensus Building

Agents debate and produce a unified verdict:

Unanimous approval required for pass
Any rejection blocks merge
Concerns must be addressed
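The three rules above can be read as a small decision function. The sketch below is one possible interpretation, not the skill's actual implementation. It assumes per-agent verdict strings of `"pass"`, `"concerns"`, and `"reject"`; only the first two appear in the sample report, so `"reject"` and the name `build_consensus` are assumptions:

```python
def build_consensus(verdicts: list) -> str:
    """Combine per-agent verdicts into one tribunal outcome."""
    if not verdicts:
        raise ValueError("at least one agent verdict is required")
    outcomes = [v["verdict"] for v in verdicts]
    if any(o == "reject" for o in outcomes):
        return "REJECTED"  # any rejection blocks the merge
    if all(o == "pass" for o in outcomes):
        return "APPROVED"  # unanimous approval is required for a pass
    return "CONDITIONAL"  # remaining concerns must be addressed


example = [
    {"agent": "skeptic", "verdict": "pass"},
    {"agent": "user_proxy", "verdict": "pass"},
    {"agent": "optimizer", "verdict": "concerns"},
]
outcome = build_consensus(example)
```

Checking for rejections first means a single `"reject"` wins even when the other two agents pass.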

Step 4: Report Generation

JSON output includes:

{
  "consensus": "APPROVED|CONDITIONAL|REJECTED",
  "verdicts": [
    {"agent": "skeptic", "verdict": "pass", "confidence": 0.85},
    {"agent": "user_proxy", "verdict": "pass", "confidence": 0.90},
    {"agent": "optimizer", "verdict": "concerns", "confidence": 0.75}
  ],
  "recommendations": [
    "Add input validation",
    "Optimize database query",
    "Add caching layer"
  ]
}
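A report of this shape can be consumed with the standard `json` module. The sketch below assumes the report arrives as a JSON string with exactly the fields shown above. The variable names and the 0.80 threshold (borrowed from the Skeptic's confidence target) are illustrative choices:

```python
import json

# Sample report using the field names from the example above.
report_text = """
{
  "consensus": "CONDITIONAL",
  "verdicts": [
    {"agent": "skeptic", "verdict": "pass", "confidence": 0.85},
    {"agent": "user_proxy", "verdict": "pass", "confidence": 0.90},
    {"agent": "optimizer", "verdict": "concerns", "confidence": 0.75}
  ],
  "recommendations": ["Add input validation"]
}
"""

report = json.loads(report_text)

# Agents that did not return a clean pass.
flagged = [v["agent"] for v in report["verdicts"] if v["verdict"] != "pass"]

# Agents whose confidence is at or below the assumed 0.80 threshold.
low_confidence = [v["agent"] for v in report["verdicts"] if v["confidence"] <= 0.80]
```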

Sample Output

🏛️  RED TEAM TRIBUNAL
Target: src/auth/login.ts

📋 AGENT VERDICTS:
  🤔 Skeptic:     ⚠️  CONCERNS (85%)
  👤 User Proxy:  ✅ PASS (90%)
  ⚡ Optimizer:   ⚠️  CONCERNS (75%)

📊 CONSENSUS: CONDITIONAL - Address Concerns

💡 RECOMMENDATIONS:
  1. Add null check at line 45
  2. Implement memoization for expensive calc
  3. Add rate limiting to prevent brute force

CI/CD Integration

Add to GitHub Actions:

- name: Red Team Tribunal Review
  run: |
    python3 red-team-tribunal.py --pr ${{ github.event.pull_request.number }}
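For the step to gate a merge, the job has to fail when the tribunal does not approve. This page does not say which exit code the script returns, so the sketch below shows one way a wrapper could map the report's `consensus` field to an exit status. The function name `exit_code_for` and the choice to treat `CONDITIONAL` as a failure are assumptions:

```python
import json


def exit_code_for(report_json: str) -> int:
    """Return 0 only when the tribunal approved; non-zero fails the CI step."""
    consensus = json.loads(report_json).get("consensus")
    # CONDITIONAL is treated as failing because concerns must be
    # addressed before merge; loosen this if your team prefers a warning.
    return 0 if consensus == "APPROVED" else 1


code = exit_code_for('{"consensus": "CONDITIONAL"}')
```

A wrapper script would pass this value to `sys.exit()` so that GitHub Actions marks the step as failed.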

Success Metrics

Detection Rate: % of real issues found
False Positive Rate: % of invalid concerns
Time to Review: Average review duration
Consensus Time: Time to reach agreement
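The two percentage metrics can be computed from simple counts. The sketch below assumes one common reading: detection rate is real issues found divided by all known real issues, and false positive rate is invalid concerns divided by all concerns raised. The source does not spell out either denominator, so treat both definitions as assumptions:

```python
def detection_rate(real_found: int, real_total: int) -> float:
    """Percentage of known real issues that the tribunal found."""
    return 100.0 * real_found / real_total if real_total else 0.0


def false_positive_rate(invalid_concerns: int, concerns_raised: int) -> float:
    """Percentage of raised concerns that turned out to be invalid."""
    return 100.0 * invalid_concerns / concerns_raised if concerns_raised else 0.0


# Illustrative counts only, not measured results.
dr = detection_rate(real_found=8, real_total=10)
fpr = false_positive_rate(invalid_concerns=1, concerns_raised=4)
```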

Troubleshooting

Agents Not Spawning

Check that the Agent Teams feature is enabled:

export CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS=1

Timeout Issues

Increase timeout for complex reviews:

subprocess.run(..., timeout=120)  # 2 minutes
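When the timeout is exceeded, `subprocess.run` kills the child process and raises `subprocess.TimeoutExpired`, so callers usually want to catch it. A minimal sketch, using a hypothetical `run_with_timeout` helper and a harmless demo command in place of the real review invocation:

```python
import subprocess
import sys


def run_with_timeout(cmd: list, timeout_s: float):
    """Run cmd; return the CompletedProcess, or None if it timed out."""
    try:
        return subprocess.run(
            cmd, capture_output=True, text=True, timeout=timeout_s
        )
    except subprocess.TimeoutExpired:
        # The child has already been killed by subprocess.run at this point.
        return None


# Demo command stands in for the tribunal script.
result = run_with_timeout([sys.executable, "-c", "print('ok')"], timeout_s=120)
```

Returning `None` lets the caller decide whether a timeout should count as a rejection or trigger a retry with a longer limit.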

Part of the Essential 2026 Plugin Suite
