Ejecuta cualquier Skill en Manus
con un clic

Ejecuta cualquier Skill en Manus con un clic

secrets-in-llm-output

Estrellas3

Forks1

Actualizado23 de junio de 2026, 22:40

Reviewer persona for AI-generated code and logs: did the agent embed a real secret in a diff, commit message, log line, error message, comment, README, screenshot, or test fixture? With AI-mediated codebases this is now a distinct attack-surface class — agents see secrets from .env / config files / process env / tool output, and may reproduce them in proposed changes. Use after any agent-authored diff (claude-code, codex, opencode, pi, sqfan-spawned envs), after any agent session that ran with elevated access to env vars or secret stores, and as a pre-commit and pre-push gate. Triggers: AI-generated, agent diff, claude-code commit, codex commit, agent log, agent transcript, leaked secret in PR, agent secret exposure.

Instalación

Instalar con Codex o Claude Copia este prompt, pégalo en Codex, Claude u otro asistente, y deja que revise la página de la skill y la instale por ti.

Ejecutar en Manus

Fuente

robert-chiniquy

robert-chiniquy/dotfiles

Abrir repositorio de GitHub Ver repositorios del creador

Descarga

Ejecutar en Manus

SKILL.md

readonly

name	secrets-in-llm-output
description	Reviewer persona for AI-generated code and logs: did the agent embed a real secret in a diff, commit message, log line, error message, comment, README, screenshot, or test fixture? With AI-mediated codebases this is now a distinct attack-surface class — agents see secrets from .env / config files / process env / tool output, and may reproduce them in proposed changes. Use after any agent-authored diff (claude-code, codex, opencode, pi, sqfan-spawned envs), after any agent session that ran with elevated access to env vars or secret stores, and as a pre-commit and pre-push gate. Triggers: AI-generated, agent diff, claude-code commit, codex commit, agent log, agent transcript, leaked secret in PR, agent secret exposure.
allowed-tools	["Read","Grep","Glob","Bash"]

Secrets in LLM Output Review

LLM-mediated coding pipelines have a new failure mode: the agent has read-access to secret material (.env, ~/.aws/credentials, env vars, process tree) and may reproduce that material verbatim into a diff, commit message, comment, README, log line, screenshot, or test fixture. This skill catches that class on the way out.

When to use

Before merging any agent-authored PR
After any sqfan / squire ephemeral-env session that had access to env-injected secrets
As a pre-commit hook on agent-authored commits
After any incident in which agent transcript files might have captured secrets (sharing transcripts externally, sending logs to a vendor, etc.)
Before publishing any agent-produced artifact: blog post, demo recording, screen capture, slide deck

When NOT to use

Generic secret-detection in static codebases — that's a job for gitleaks / trufflehog; this skill is the LLM-output-specific layer
Runtime secret protection — different concern (memory wiping, storage encryption) covered by key-lifecycle-review and zeroize-audit

Core posture

Treat agent output as if it had the access of the most-privileged context the agent ran in. If the agent could read your .env, assume the agent could reproduce a key from .env in any of: code, comments, commit message, README, error string, test value.

What to scan

Per-PR scan surface

Every changed file's diff hunks
The commit message bodies (full text, not just the first line)
New test fixtures, especially *.test.ts, testdata/, fixtures/, *.snap (snapshot tests are a frequent leak vector — the snapshot captures live output)
Configuration files added/modified: .env*, *.yaml, *.toml, *.json, *.conf
README and docs additions (curl examples with real tokens, etc.)
Screenshots if attached
The PR description itself

Per-session scan surface

Agent transcript files (claude-code's session JSONL, codex's session logs, goose's logs if applicable, pi's session dirs)
Any artifact written to ~/.claude/projects/.../tool-results/
/tmp/agent-context and similar shared-context files (verify they don't carry secret data inadvertently)

Patterns

High-confidence patterns

sk-, sk_live_, sk_test_ (OpenAI, Stripe variants)
xoxb-, xoxp-, xoxa-, xoxs- (Slack)
ghp_, gho_, ghu_, ghs_, ghr_, github_pat_ (GitHub)
AKIA[0-9A-Z]{16} (AWS access key)
AIza[0-9A-Za-z\-_]{35} (Google API key)
eyJ[A-Za-z0-9_-]{8,}\.[A-Za-z0-9_-]{8,}\.[A-Za-z0-9_-]{8,} (JWT)
-----BEGIN (RSA |EC |DSA |OPENSSH |PRIVATE )?PRIVATE KEY-----
Long high-entropy hex / base64 strings near field names token|secret|key|password|api_key|bearer|credential

Pattern packs

gitleaks-style ruleset is a strong starting point; can be invoked directly via gitleaks detect --no-banner --staged on the working tree, or against the diff with git diff | gitleaks --pipe -
trufflehog v3 verified-secrets mode reduces false positives by attempting to authenticate the detected secret

LLM-specific patterns to add

An IdP issuer URL adjacent to a base64 chunk → likely OAuth tokens
"Here is the token I retrieved:" / "Using this credential:" / "From your .env:" — verbatim narration with the value in line
Commit-message references to "I removed the test value" — sometimes the value is still in the diff
console.log(process.env.X) newly added — leak vector if X is sensitive
Test fixtures with realistic-looking IDs that aren't xxx-style placeholders

Review steps

Diff scan: git diff against the merge base; run gitleaks / trufflehog with verified mode; manually scan added lines for the patterns above
Commit message scan: git log <base>..HEAD --format=%B
New-file scan: git diff --name-only --diff-filter=A for every added file; pay special attention to fixtures/, testdata/, docs/
Transcript scan (if scoped to a session, not a PR): walk the agent's transcript JSONL; look for tool calls that read sensitive paths (.env, ~/.aws/credentials, ~/.ssh/, ~/.config/op/), then look for any subsequent tool call that wrote text containing data from those reads
Rotate immediately on any verified hit — assume any exposed value has leaked even if the PR is closed without merge

False-positive shapes

sk-XXXX / xoxb-FAKE / placeholder secrets — fine, but require explicit "example only" comment nearby
Test fixtures with documented aws_access_key_id = "AKIAIOSFODNN7EXAMPLE" (AWS docs example) — fine
Live-looking values in a comment marked // SAFE: not real — trust-but-verify; confirm against the IdP if uncertain

Output format

For each finding:

Location (file:line; transcript:offset; commit:hash)
Pattern class (OAuth / AWS / GitHub / private-key / JWT / generic-high-entropy)
Confidence (high / medium / low)
Recommended action: rotate the credential, then remediate the diff (rewrite history if value reached a public branch)

Pre-commit hook starter

#!/usr/bin/env bash
# .git/hooks/pre-commit (or, better, via lefthook / pre-commit-framework)
# Block commits that smell like leaked secrets in agent-authored diffs.
if git diff --cached | gitleaks --pipe -no-banner --redact - 2>/dev/null; then
  echo "secret-leak heuristic fired; investigate before committing"
  exit 1
fi

Rationalizations to reject

Rationalization	Reality
"It's a test value"	Test values that look real are usually real; the agent grabbed them from env because that was easiest
"It's only in the commit message"	Commit messages are public the moment the branch is pushed
"I'll rotate later"	Rotate now
"The snapshot test will be updated soon"	The snapshot is in the diff right now

References

gitleaks, trufflehog v3, detect-secrets
GitHub's secret-scanning patterns reference
OWASP Cheat Sheet on Secrets Management
AI-specific incidents: 2024–2026 wave of agent-authored PRs leaking vendor tokens — search github.com/secret-scanning leak reports

Status

v0.1 draft — pattern catalog covers the common high-confidence shapes. Expansion: codebase-specific allowlist (e.g., the user's xoxb-1381406198691-… is known and being rotated; transitional period should flag it but not panic); per-agent transcript-format parsers for claude-code, codex, opencode, pi; integration into the disk-emergency style runbook for "secret leaked, here is the recovery flow."

Más de este repositorio

mismo repositorio

authorization-model-review

robert-chiniquy/dotfiles

Reviewer persona for authorization models — RBAC, ABAC, ReBAC, and hybrids. Catches the bugs that ship after auth is correct but authz is wrong: missing tenant scoping, IDOR via predictable IDs, role escalation through unchecked write paths, permission caching staleness, transitive-trust loopholes, RBAC/ReBAC drift between policy doc and code. Use when reviewing endpoints that gate access by user/role/relationship, when adding a new role/permission/scope, when changing tenant isolation, or when designing a permission system from scratch. Triggers: RBAC, ABAC, ReBAC, IDOR, tenant isolation, multi-tenant, permission check, role, scope, principal, Zanzibar, OpenFGA, casbin, authz, can_, has_permission, isAuthorized.

2026-06-233

c1-dev-stack-in-squire

robert-chiniquy/dotfiles

Stand up a full c1 dev stack inside a Squire env — process-compose, postgres, envoy, pub-api, pub-auth, be-* services — wired so an external client can drive c1's gRPC surface end to end with TLS + OAuth2 client_credentials. Use when testing a Latchkey or other c1 client against a real (not stubbed) c1 backend, or when reproducing c1 server-side behavior locally. Triggers on: c1 dev env, squire c1 stack, pc/up, dev-util mint-test-client, test against c1, c1 OAuth client_credentials, run c1 integration tests in squire, repro buildkite integration test, TEST_LOCAL_EXEC, api_no_uplift.

2026-06-233

c1-squire-dispatch

robert-chiniquy/dotfiles

c1-specific values for the general squire dispatch protocols defined in squire-env-management. Provides the c1 gate bundle's contents, the task-family table for c1 work, the c1 always-actives, and the list of c1 skills that should NOT be spent on a squire env. Use when about to spawn a squire env to execute c1 work, when writing a brief for a remote c1 agent, or when filing a c1 bead intended for squire dispatch. Triggers: c1 squire dispatch, c1 squire brief, c1 remote work, c1 ephemeral env, c1 fire-and-forget.

2026-06-233

custom-crypto-detection

robert-chiniquy/dotfiles

Reviewer persona for detecting hand-rolled cryptography. Distinct from `sharp-edges` (which catches footgun APIs) and `key-lifecycle-review` (which covers lifecycle hygiene): this skill catches the class where someone wrote their own MAC, KDF, AEAD, signature scheme, secret-comparison routine, RNG, or password hash. Almost all custom crypto is broken. Use when reviewing any code that does math on bytes, manipulates buffers in a 'crypto-shaped' way, or implements something whose docs reference a named primitive (HMAC, AES-GCM, Argon2, X25519). Triggers: hand-rolled crypto, custom MAC, custom hash, custom KDF, byte XOR, constant-time compare, derived key, password hashing, HKDF, encrypt_then_mac, mac_then_encrypt, AE, AEAD.

2026-06-233

key-lifecycle-review

robert-chiniquy/dotfiles

Reviewer persona for the full lifecycle of cryptographic keys and high-value secrets: generation, storage, distribution, rotation, revocation, and destruction. Trail of Bits' `zeroize-audit` covers the destruction half; this skill covers the other four phases plus closes the loop with destruction. Use when reviewing key management code, secret stores, KMS integrations, rotation logic, key derivation, RNG usage, or any system that issues, holds, or revokes long-lived credentials. Triggers: key generation, key rotation, KMS, HSM, secret store, vault, key derivation, KDF, master key, DEK, KEK, rotation, revocation, RNG, entropy, random, secrets management.

2026-06-233

oauth-oidc-review

robert-chiniquy/dotfiles

Reviewer persona for OAuth 2.0 / 2.1 and OpenID Connect flow implementations. Catches the well-documented attack classes that still ship: missing PKCE, wildcard redirect URIs, mishandled refresh tokens, scope creep, mixed flows on a single endpoint, leaking tokens through referrer or logs, JWT signature bypass. Use when reviewing any code that issues, accepts, validates, exchanges, refreshes, revokes, or stores tokens; when designing a new auth integration; when a PR touches /authorize, /token, /userinfo, /jwks, /introspect, /revoke, OIDC discovery, or a third-party identity provider client. Triggers: OAuth, OIDC, JWT, PKCE, redirect_uri, scope, refresh token, access token, id_token, client_credentials, authorization code, implicit, device code, token exchange, identity provider, IdP, SSO.

2026-06-233

name	secrets-in-llm-output
description	Reviewer persona for AI-generated code and logs: did the agent embed a real secret in a diff, commit message, log line, error message, comment, README, screenshot, or test fixture? With AI-mediated codebases this is now a distinct attack-surface class — agents see secrets from .env / config files / process env / tool output, and may reproduce them in proposed changes. Use after any agent-authored diff (claude-code, codex, opencode, pi, sqfan-spawned envs), after any agent session that ran with elevated access to env vars or secret stores, and as a pre-commit and pre-push gate. Triggers: AI-generated, agent diff, claude-code commit, codex commit, agent log, agent transcript, leaked secret in PR, agent secret exposure.
allowed-tools	["Read","Grep","Glob","Bash"]

Secrets in LLM Output Review

When to use

Before merging any agent-authored PR
After any sqfan / squire ephemeral-env session that had access to env-injected secrets
As a pre-commit hook on agent-authored commits
After any incident in which agent transcript files might have captured secrets (sharing transcripts externally, sending logs to a vendor, etc.)
Before publishing any agent-produced artifact: blog post, demo recording, screen capture, slide deck

When NOT to use

Generic secret-detection in static codebases — that's a job for gitleaks / trufflehog; this skill is the LLM-output-specific layer
Runtime secret protection — different concern (memory wiping, storage encryption) covered by key-lifecycle-review and zeroize-audit

Core posture

What to scan

Per-PR scan surface

Every changed file's diff hunks
The commit message bodies (full text, not just the first line)
New test fixtures, especially *.test.ts, testdata/, fixtures/, *.snap (snapshot tests are a frequent leak vector — the snapshot captures live output)
Configuration files added/modified: .env*, *.yaml, *.toml, *.json, *.conf
README and docs additions (curl examples with real tokens, etc.)
Screenshots if attached
The PR description itself

Per-session scan surface

Agent transcript files (claude-code's session JSONL, codex's session logs, goose's logs if applicable, pi's session dirs)
Any artifact written to ~/.claude/projects/.../tool-results/
/tmp/agent-context and similar shared-context files (verify they don't carry secret data inadvertently)

Patterns

High-confidence patterns

sk-, sk_live_, sk_test_ (OpenAI, Stripe variants)
xoxb-, xoxp-, xoxa-, xoxs- (Slack)
ghp_, gho_, ghu_, ghs_, ghr_, github_pat_ (GitHub)
AKIA[0-9A-Z]{16} (AWS access key)
AIza[0-9A-Za-z\-_]{35} (Google API key)
eyJ[A-Za-z0-9_-]{8,}\.[A-Za-z0-9_-]{8,}\.[A-Za-z0-9_-]{8,} (JWT)
-----BEGIN (RSA |EC |DSA |OPENSSH |PRIVATE )?PRIVATE KEY-----
Long high-entropy hex / base64 strings near field names token|secret|key|password|api_key|bearer|credential

Pattern packs

gitleaks-style ruleset is a strong starting point; can be invoked directly via gitleaks detect --no-banner --staged on the working tree, or against the diff with git diff | gitleaks --pipe -
trufflehog v3 verified-secrets mode reduces false positives by attempting to authenticate the detected secret

LLM-specific patterns to add

An IdP issuer URL adjacent to a base64 chunk → likely OAuth tokens
"Here is the token I retrieved:" / "Using this credential:" / "From your .env:" — verbatim narration with the value in line
Commit-message references to "I removed the test value" — sometimes the value is still in the diff
console.log(process.env.X) newly added — leak vector if X is sensitive
Test fixtures with realistic-looking IDs that aren't xxx-style placeholders

Review steps

Diff scan: git diff against the merge base; run gitleaks / trufflehog with verified mode; manually scan added lines for the patterns above
Commit message scan: git log <base>..HEAD --format=%B
New-file scan: git diff --name-only --diff-filter=A for every added file; pay special attention to fixtures/, testdata/, docs/
Transcript scan (if scoped to a session, not a PR): walk the agent's transcript JSONL; look for tool calls that read sensitive paths (.env, ~/.aws/credentials, ~/.ssh/, ~/.config/op/), then look for any subsequent tool call that wrote text containing data from those reads
Rotate immediately on any verified hit — assume any exposed value has leaked even if the PR is closed without merge

False-positive shapes

sk-XXXX / xoxb-FAKE / placeholder secrets — fine, but require explicit "example only" comment nearby
Test fixtures with documented aws_access_key_id = "AKIAIOSFODNN7EXAMPLE" (AWS docs example) — fine
Live-looking values in a comment marked // SAFE: not real — trust-but-verify; confirm against the IdP if uncertain

Output format

For each finding:

Location (file:line; transcript:offset; commit:hash)
Pattern class (OAuth / AWS / GitHub / private-key / JWT / generic-high-entropy)
Confidence (high / medium / low)
Recommended action: rotate the credential, then remediate the diff (rewrite history if value reached a public branch)

Pre-commit hook starter

#!/usr/bin/env bash
# .git/hooks/pre-commit (or, better, via lefthook / pre-commit-framework)
# Block commits that smell like leaked secrets in agent-authored diffs.
if git diff --cached | gitleaks --pipe -no-banner --redact - 2>/dev/null; then
  echo "secret-leak heuristic fired; investigate before committing"
  exit 1
fi

Rationalizations to reject

Rationalization	Reality
"It's a test value"	Test values that look real are usually real; the agent grabbed them from env because that was easiest
"It's only in the commit message"	Commit messages are public the moment the branch is pushed
"I'll rotate later"	Rotate now
"The snapshot test will be updated soon"	The snapshot is in the diff right now

References

gitleaks, trufflehog v3, detect-secrets
GitHub's secret-scanning patterns reference
OWASP Cheat Sheet on Secrets Management
AI-specific incidents: 2024–2026 wave of agent-authored PRs leaking vendor tokens — search github.com/secret-scanning leak reports