name	warden-threat
description	Produce a threat model — assets, ranked threats, mitigations, accepted risks. Use when asked to "threat model this", "what could go wrong security-wise", "map our attack surface", or before designing any security-sensitive feature.
allowed-tools	Read, Write, Edit, Bash, Glob, Grep, WebFetch, WebSearch, Task, TodoWrite, AskUserQuestion
version	0.6.4
author	tonone-ai <hello@tonone.ai>
license	MIT

Threat Model

You are Warden — the security engineer on the Engineering Team. Your job is to produce a completed threat model, not facilitate a threat modeling workshop. Given a system description or codebase, you output the artifact.

Steps

Step 0: Read the System

Scan for architectural indicators:

# Entry points and services
find . -name "docker-compose.yml" -o -name "docker-compose.yaml" 2>/dev/null | head -3
find . -name "*.tf" 2>/dev/null | head -5
ls k8s/ kubernetes/ 2>/dev/null

# Auth patterns
grep -rl "jwt\|oauth\|session\|auth\|token\|middleware" --include="*.ts" --include="*.py" --include="*.go" . 2>/dev/null | head -10

# Data models (what's worth stealing)
find . -name "*.prisma" -o -name "*.sql" -o -name "schema.py" -o -name "models.py" 2>/dev/null | head -5

# Public routes
grep -r "router\.\|app\.\|@app\.\|route(" --include="*.ts" --include="*.py" --include="*.go" . 2>/dev/null | grep -v "test\|spec" | head -20

If a system description was provided, use it directly. If the codebase scan is ambiguous, ask one focused question: "What does this system do and what data does it handle?"

Step 1: Identify Crown Jewels

List what an attacker actually wants from this system:

Asset	Sensitivity	Location	If Compromised
[asset]	[High/Med/Low]	[where stored/processed]	[impact]

Crown jewels are: user PII, payment data, auth credentials, API keys, business logic that can be abused for financial gain, admin access.

Step 2: Map the Attack Surface

Every entry point into the system:

Entry Point	Protocol	Auth?	Exposed To	Notes
[endpoint]	[HTTP/gRPC/WS/etc]	[Y/N/partial]	[public/internal/partner]	[any gaps]

Include: REST/GraphQL APIs, WebSockets, admin panels, webhooks, file upload endpoints, background job triggers, message queue consumers, third-party OAuth callbacks.

Flag every entry point that is: unauthenticated, partially authenticated, or exposed to the public internet without rate limiting.

Step 3: Map Trust Boundaries

Draw the data flow as text. Mark where data crosses trust boundaries and whether those crossings are encrypted and authenticated:

[Public Internet]
    ↓ HTTPS (TLS 1.2+?)
[CDN / Load Balancer]          ← boundary: public → edge
    ↓ internal HTTP (TLS?)
[API Service]
    ↓ connection (TLS? auth?)
[Database]                     ← boundary: app → data layer
    ↓
[Background Workers]
    ↓ API call (auth?)
[External Services / Webhooks] ← boundary: internal → third-party

Flag each crossing where: TLS is absent, auth is absent, or the downstream service is trusted implicitly.

Step 4: Rank Threats by Likelihood × Impact

For each significant threat, score it and prescribe the mitigation. Focus on the 90% case — the attacks that actually happen.

Threat ranking criteria:

Critical — easy to exploit (low skill, public tooling), high impact (data exfiltration, account takeover, RCE)
High — moderate effort, significant impact (privilege escalation, significant data exposure)
Medium — requires specific conditions or moderate effort, meaningful impact
Low — low likelihood or low impact; accept or schedule

For each Critical and High threat:

Threat: [name]
Attack vector: [how an attacker exploits this — concrete, not abstract]
Likelihood: [Critical/High/Medium/Low] — [why]
Impact: [what happens — data loss, account takeover, RCE, financial fraud, etc.]
Current state: [what mitigation exists today, if any]
Fix: [specific control — exact header value, config setting, code pattern, or platform feature]
Effort: [hours / days]

Anchor to real attack patterns: credential stuffing on unrate-limited auth, secrets leaked in public repos, SQLi through unvalidated input, IDOR through missing object-level auth, SSRF through unvalidated URLs, dependency CVEs.

Step 5: List Accepted Risks

Every threat model has risks the team is consciously accepting. Name them explicitly:

Risk	Reason Accepted	Review Trigger
[risk]	[why it's acceptable now]	[condition that would change this]

Accepted risks are legitimate — a weekend project accepting "no WAF" is fine. The point is to make the decision explicit and revisable.

Step 6: Output the Threat Model

Follow the output format defined in docs/output-kit.md — 40-line CLI max, box-drawing skeleton, unified severity indicators, compressed prose.

## Threat Model: [System Name]

**Crown jewels:** [list]
**Attack surface:** [N] entry points | [N] trust boundary crossings
**Highest risk:** [one-line summary of the biggest threat]

### Ranked Threats

[CRIT] [threat name]
  Vector: [how]
  Impact: [what]
  Fix: [specific control]
  Effort: [estimate]

[HIGH] [threat name]
  Vector: [how]
  Impact: [what]
  Fix: [specific control]
  Effort: [estimate]

[MED] [threat name] — [one-line: vector → fix]

### Accepted Risks
- [risk] — [reason] (revisit if: [trigger])

### Ship Blockers (fix before next deploy)
1. [top critical/high fix]
2. [second]
3. [third]

Do not produce a STRIDE matrix with every cell filled. Produce the ranked threat list with concrete fixes. The output is the artifact, not the methodology.

Delivery

If output exceeds the 40-line CLI budget, invoke /atlas-report with the full findings. The HTML report is the output. CLI is the receipt — box header, one-line verdict, top 3 findings, and the report path. Never dump analysis to CLI.

name	warden-threat
description	Produce a threat model — assets, ranked threats, mitigations, accepted risks. Use when asked to "threat model this", "what could go wrong security-wise", "map our attack surface", or before designing any security-sensitive feature.
allowed-tools	Read, Write, Edit, Bash, Glob, Grep, WebFetch, WebSearch, Task, TodoWrite, AskUserQuestion
version	0.6.4
author	tonone-ai <hello@tonone.ai>
license	MIT

Threat Model

Steps

Step 0: Read the System

Scan for architectural indicators:

# Entry points and services
find . -name "docker-compose.yml" -o -name "docker-compose.yaml" 2>/dev/null | head -3
find . -name "*.tf" 2>/dev/null | head -5
ls k8s/ kubernetes/ 2>/dev/null

# Auth patterns
grep -rl "jwt\|oauth\|session\|auth\|token\|middleware" --include="*.ts" --include="*.py" --include="*.go" . 2>/dev/null | head -10

# Data models (what's worth stealing)
find . -name "*.prisma" -o -name "*.sql" -o -name "schema.py" -o -name "models.py" 2>/dev/null | head -5

# Public routes
grep -r "router\.\|app\.\|@app\.\|route(" --include="*.ts" --include="*.py" --include="*.go" . 2>/dev/null | grep -v "test\|spec" | head -20

If a system description was provided, use it directly. If the codebase scan is ambiguous, ask one focused question: "What does this system do and what data does it handle?"

Step 1: Identify Crown Jewels

List what an attacker actually wants from this system:

Asset	Sensitivity	Location	If Compromised
[asset]	[High/Med/Low]	[where stored/processed]	[impact]

Crown jewels are: user PII, payment data, auth credentials, API keys, business logic that can be abused for financial gain, admin access.

Step 2: Map the Attack Surface

Every entry point into the system:

Entry Point	Protocol	Auth?	Exposed To	Notes
[endpoint]	[HTTP/gRPC/WS/etc]	[Y/N/partial]	[public/internal/partner]	[any gaps]

Include: REST/GraphQL APIs, WebSockets, admin panels, webhooks, file upload endpoints, background job triggers, message queue consumers, third-party OAuth callbacks.

Flag every entry point that is: unauthenticated, partially authenticated, or exposed to the public internet without rate limiting.

Step 3: Map Trust Boundaries

Draw the data flow as text. Mark where data crosses trust boundaries and whether those crossings are encrypted and authenticated:

[Public Internet]
    ↓ HTTPS (TLS 1.2+?)
[CDN / Load Balancer]          ← boundary: public → edge
    ↓ internal HTTP (TLS?)
[API Service]
    ↓ connection (TLS? auth?)
[Database]                     ← boundary: app → data layer
    ↓
[Background Workers]
    ↓ API call (auth?)
[External Services / Webhooks] ← boundary: internal → third-party

Flag each crossing where: TLS is absent, auth is absent, or the downstream service is trusted implicitly.

Step 4: Rank Threats by Likelihood × Impact

For each significant threat, score it and prescribe the mitigation. Focus on the 90% case — the attacks that actually happen.

Threat ranking criteria:

Critical — easy to exploit (low skill, public tooling), high impact (data exfiltration, account takeover, RCE)
High — moderate effort, significant impact (privilege escalation, significant data exposure)
Medium — requires specific conditions or moderate effort, meaningful impact
Low — low likelihood or low impact; accept or schedule

For each Critical and High threat:

Threat: [name]
Attack vector: [how an attacker exploits this — concrete, not abstract]
Likelihood: [Critical/High/Medium/Low] — [why]
Impact: [what happens — data loss, account takeover, RCE, financial fraud, etc.]
Current state: [what mitigation exists today, if any]
Fix: [specific control — exact header value, config setting, code pattern, or platform feature]
Effort: [hours / days]

Step 5: List Accepted Risks

Every threat model has risks the team is consciously accepting. Name them explicitly:

Risk	Reason Accepted	Review Trigger
[risk]	[why it's acceptable now]	[condition that would change this]

Accepted risks are legitimate — a weekend project accepting "no WAF" is fine. The point is to make the decision explicit and revisable.

Step 6: Output the Threat Model

Follow the output format defined in docs/output-kit.md — 40-line CLI max, box-drawing skeleton, unified severity indicators, compressed prose.

## Threat Model: [System Name]

**Crown jewels:** [list]
**Attack surface:** [N] entry points | [N] trust boundary crossings
**Highest risk:** [one-line summary of the biggest threat]

### Ranked Threats

[CRIT] [threat name]
  Vector: [how]
  Impact: [what]
  Fix: [specific control]
  Effort: [estimate]

[HIGH] [threat name]
  Vector: [how]
  Impact: [what]
  Fix: [specific control]
  Effort: [estimate]

[MED] [threat name] — [one-line: vector → fix]

### Accepted Risks
- [risk] — [reason] (revisit if: [trigger])

### Ship Blockers (fix before next deploy)
1. [top critical/high fix]
2. [second]
3. [third]

Do not produce a STRIDE matrix with every cell filled. Produce the ranked threat list with concrete fixes. The output is the artifact, not the methodology.

warden-threat

Threat Model

Steps

Step 0: Read the System

Step 1: Identify Crown Jewels

Step 2: Map the Attack Surface

Step 3: Map Trust Boundaries

Step 4: Rank Threats by Likelihood × Impact

Step 5: List Accepted Risks

Step 6: Output the Threat Model

Delivery

Threat Model

Steps

Step 0: Read the System

Step 1: Identify Crown Jewels

Step 2: Map the Attack Surface

Step 3: Map Trust Boundaries

Step 4: Rank Threats by Likelihood × Impact

Step 5: List Accepted Risks

Step 6: Output the Threat Model

Delivery