Exécutez n'importe quel Skill dans Manus
en un clic

Exécutez n'importe quel Skill dans Manus en un clic

dark-code-audit

Étoiles0

Forks0

Mis à jour14 avril 2026 à 00:03

Audits a codebase for dark code risk: code that was generated, passed automated checks, and shipped without anyone understanding it. Produces a structured audit report with a hotspot map, comprehension debt scorecard (spec coverage %, context layer coverage %, review depth), ownership gap analysis, top failure scenarios, and a prioritized action plan. Use this skill before a security review, compliance review, or major refactor; when new engineers join and the codebase feels opaque; after a period of high AI-assisted development velocity; quarterly as a health check; or any time you hear "audit for dark code", "comprehension debt", "dark code risk", "what do we not understand about this codebase", "knowledge gap analysis", "who owns what", or "we've been shipping AI code really fast lately". This skill does not recommend "add more monitoring" — it identifies where human comprehension is missing and prescribes structural fixes.

Installation

Installer avec Codex ou Claude Copiez ce prompt, collez-le dans Codex, Claude ou un autre assistant, puis laissez-le vérifier la page du skill et l'installer pour vous.

Exécuter dans Manus

Source

az9713

az9713/dark-code-skills

Ouvrir le dépôt GitHub Voir les dépôts du créateur

Téléchargement

Exécuter dans Manus

Métiers associésSOC

Basé sur la classification professionnelle SOC

Analystes en assurance qualité des logiciels et testeursProfessions informatiques et mathématiques·SOC 15-1253

Explorateur de fichiers

3 fichiers

SKILL.md

readonly

Plus depuis ce dépôt

même dépôt

comprehension-gate

az9713/dark-code-skills

Runs a seven-dimension comprehension review on a code change before it ships: credential exposure, cross-service side effects, blast radius, state/persistence mismatch (the Kiro pattern — AI treating persistent infrastructure as ephemeral), token TTL management, implicit assumptions, and whether the change would be explainable by the person shipping it. Produces a COMPREHENSION_ARTIFACT.md with a findings table and a CLEAR / REVIEW REQUIRED / HOLD verdict. Use this skill before merging any AI-generated code, before any change that touches shared resources (Redis, shared databases, message queues), before changes to auth flows or token handling, when reviewing code for dark code risk, or any time you hear "check blast radius", "review for comprehension", "is this safe to ship", "comprehension gate", "pre-merge review", or "will this cause an incident". This skill catches system-level failures that linters, type checkers, and unit tests cannot detect.

2026-04-140

context-layer-generator

az9713/dark-code-skills

Generates three context layer artifacts for a code module: MODULE_MANIFEST.md (structural map — where things connect), BEHAVIORAL_CONTRACTS.md (semantic contracts — what each interface guarantees), and DECISION_LOG.md (philosophical record — why decisions were made, with explicit warnings about what breaks if reversed). Use this skill whenever working on a module that lacks documentation, when the original author has left, before an AI agent modifies an unfamiliar module, when documenting a module after onboarding, when a codebase audit flagged missing context layers, or any time you hear phrases like "document this module", "make this self-describing", "build context layers", "preserve knowledge before the author leaves", or "what does this module do". This skill is especially important for AI-generated code that was never explained by anyone.

2026-04-140

dark-code-suite-init

az9713/dark-code-skills

Sets up a project to use the full dark code prevention suite in one step: creates the .claude/comprehension/ directory for comprehension artifacts, adds a ## Dark Code Prevention section to CLAUDE.md (or creates CLAUDE.md if missing), creates docs/dark-code-audit/ for audit reports, and runs an initial dark-code-audit to baseline the project's current comprehension debt. Use this skill when starting to use the dark code suite on a new project, when onboarding a codebase to dark code prevention practices, or any time you hear "set up dark code prevention", "initialize the dark code suite", "add comprehension gate to this project", or "how do I start with dark code practices here".

2026-04-140

generate-data-lineage

az9713/dark-code-skills

Assembles a data flow narrative from MODULE_MANIFEST.md and BEHAVIORAL_CONTRACTS.md context files, answering the explainability question: "What does the system do with [data type] for [user journey]?" Use before a compliance or security review, when a dark-code-audit flags "Explainability: Partial", when onboarding a new engineer who needs to understand data flows, or when preparing for GDPR, EU AI Act, or SOC 2 review. Reads context layers across the codebase, interviews for gaps, and writes docs/data-lineage/YYYY-MM-DD-<name>.md with a confidence rating. Invoke as: /generate-data-lineage (all PII-touching flows in the codebase) /generate-data-lineage --journey user-signup (specific user journey) /generate-data-lineage --module path/to/mod (flows for a specific module) /generate-data-lineage --type payment (specific data type)

2026-04-140

generate-eu-ai-act-system-card

az9713/dark-code-skills

Generates a per-service EU AI Act system card documenting AI tool usage, risk classification, human oversight mechanisms, and limitations. Use for any service where AI tools contribute to code generation, decision support, or automated processing — especially before the August 2026 EU AI Act deadline. Use when dark-code-audit flags AI-heavy services, when preparing a compliance package for a regulator or enterprise customer, or when the organization needs to document its AI practices. Reads MODULE_MANIFEST.md and BEHAVIORAL_CONTRACTS.md, conducts a structured interview, and writes docs/compliance/eu-ai-act-system-card-<service>-YYYY-MM-DD.md. Invoke as /generate-eu-ai-act-system-card path/to/service or with --risk-level limited|general|high.

2026-04-140

generate-gdpr-ropa

az9713/dark-code-skills

Generates a draft GDPR Article 30 Record of Processing Activities (ROPA) from MODULE_MANIFEST.md and BEHAVIORAL_CONTRACTS.md context files. Use when preparing for a GDPR audit, when a dark-code-audit flags PII-handling services with incomplete documentation, or when building a compliance package. Reads context layers across the codebase, groups them into logical processing activities, auto-populates what it can, and interviews the user for fields that require human judgment (legal basis, purpose, international transfers). Writes docs/compliance/gdpr-ropa-YYYY-MM-DD.md. Invoke as /generate-gdpr-ropa or /generate-gdpr-ropa --module path/to/module for a single module entry.

2026-04-140

name

dark-code-audit

description

Dark Code Audit

You are conducting a dark code audit. Dark code is code that was never understood by anyone: AI-generated, passed automated checks, shipped — and the comprehension step never happened.

This is different from technical debt. Technical debt is code the author understood but cut corners on. Dark code is code that no one has ever fully understood. The distinction matters because the fix is different: you can pay down technical debt by refactoring; you address dark code by building comprehension artifacts, context layers, and comprehension gates.

Never recommend "add more monitoring" or "add a supervisory layer" as a primary fix. These are the wrong answers — monitoring tells you when something broke, not why, and a supervisory layer just adds another dark layer on top. The fixes must happen upstream of the code running.

Step 1: Automated Discovery

Before the interview, gather everything the codebase can tell you automatically. Run in parallel:

Context layer coverage:

Glob for MODULE_MANIFEST.md across the codebase — list all directories that have one vs. don't
Glob for BEHAVIORAL_CONTRACTS.md — same
Glob for DECISION_LOG.md — same
Glob for COMPREHENSION_ARTIFACT.md in .claude/comprehension/ — what PRs had gate reviews?

Velocity indicators:

If git available: git log --oneline -50 to see recent commit velocity and message patterns
git shortlog -sn --no-merges to see contributor distribution (high concentration in few names, or many one-time contributors, are both risk signals)
Look for commit messages that are obviously AI-generated (comprehensive, impersonal, covering every edge case in the message)

Structural risk patterns:

Grep for shared resource patterns: redis\|cache\|REDIS, database client imports, queue client imports
Glob for *.env.example or similar — what external services does this system touch?
List top-level directories and identify which represent distinct modules or services

Existing documentation:

Read root README and any CLAUDE.md
Check for existing ADRs (docs/adr/, decisions/, architecture/)
Check for spec documents (specs/, requirements/, *.spec.md)

Present a discovery summary before starting the interview:

Auto-discovered:
• Modules/directories: N
• With MODULE_MANIFEST.md: N/N (X%)
• With BEHAVIORAL_CONTRACTS.md: N/N (X%)  
• With DECISION_LOG.md: N/N (X%)
• Comprehension artifacts (prior gate reviews): N
• Shared infrastructure clients found in: [list of modules]
• Contributor concentration: [N engineers touched N% of commits]
• ADRs / spec docs: found / not found

Starting interview. Correct anything that looks wrong.

Step 2: Interview (Four Groups)

Conduct the interview in four groups. Ask each group's questions together — don't ask one at a time. Make it clear that short answers are fine; you're building a picture, not writing an essay.

Group 1: Architecture and system scope

What are the main services or modules in this system? (if not clear from directory structure)
Which services communicate with each other? Any cross-service data flows not obvious from imports?
Are there any services that were built primarily by AI tools in the last 6–12 months?
Is there any part of the system where "nobody really knows how it works"?

Group 2: AI tool usage

Are AI coding tools (Claude, Copilot, Cursor, etc.) actively used on this codebase?
Is there a requirement for spec/design approval before generating code, or does code go directly from prompt to PR?
Are AI-generated PRs reviewed differently than human-written PRs — more scrutiny, less, or the same?
Has the team ever shipped something from an AI that caused an incident? What happened?

Group 3: Team and ownership

Who owns each major module or service? Is that documented anywhere?
Have there been significant team changes in the last year — layoffs, departures, team restructuring?
Are there modules where the original author has left and knowledge transferred? Or didn't?
Is there any code that the team is afraid to touch? ("We don't change that file.")

Group 4: Development and deployment practices

Does the team write specs or requirements before implementing features, or is the ticket the spec?
Are there pre-merge reviews that specifically check for comprehension (blast radius, side effects) — not just "does it work"?
Is there a process for capturing architectural decisions before they're implemented?
Are there automated tests? What do they cover? What don't they cover?

Step 3: Analyze

With auto-discovery data and interview answers, analyze across three categories:

Velocity dark code (code that moved too fast to understand):

Services with high AI usage + no spec requirement + no comprehension gate review
Modules modified by many different engineers with no central owner
High commit velocity with low test coverage
"We just got it working" modules with no documented decisions

Structural dark code (emergent behavior nobody designed):

Cross-service data flows identified in discovery that have no documenting schema or contract
Shared resources (Redis, databases) accessed by multiple services with no ownership clarity
Non-engineer-accessible workflows (scheduled jobs, event triggers, webhooks) with no manifest
Services where the answer to "what does this do?" is "it depends on what called it"

Compounding factors:

Knowledge loss: original authors of key modules have left
Ownership dissolution: modules with no clear owner
The "observability trap": the team believes Datadog/logs/metrics = comprehension (it doesn't)
Regulatory exposure: services handling PII/payments/user data without behavioral contracts

Step 4: Write the Audit Report

Write the report to docs/dark-code-audit/YYYY-MM-DD-audit.md (create directory if needed). Use today's date. If a previous audit exists in that directory, note what has changed.

Use the template in references/audit-template.md and the scoring guide in references/scoring-rubric.md.

The report must include:

Executive summary — 3–5 bullet points: what the audit found, the risk level, and the single most important thing to do next
Hotspot map — a table of modules rated by dark code risk, with columns: Module, Risk level, Context layers, Owner known, AI-heavy, Notes
Top 3 failure scenarios — specific, plausible incidents that the current state makes possible. Not abstract risks. Write them as brief narratives: "Service X calls Service Y's /compute endpoint. Y has no BEHAVIORAL_CONTRACTS.md. If Y's team changes the retry behavior, X's caller will [specific consequence]."
Ownership gap analysis — modules with no identified owner, and the risk each poses
Comprehension debt scorecard — using references/scoring-rubric.md:
- Spec coverage % (what % of features/modules have a written spec)
- Context layer coverage % (what % of modules have MODULE_MANIFEST.md + BEHAVIORAL_CONTRACTS.md + DECISION_LOG.md)
- Review depth score (0–3: no review / existence check / functional review / comprehension gate)
- Explainability score (can the team answer: what does X do with customer data on day Y?)
- Overall comprehension debt level: LOW / MEDIUM / HIGH / CRITICAL
Prioritized action plan — 3–5 specific, executable actions. Each action should: name the specific module or practice to address, say exactly what to do (run /context-layer-generator on X, set up /comprehension-gate for PRs to Y), and explain what risk it reduces.

Guardrails

Do not recommend:

"Add more monitoring or logging" as a fix for dark code. Monitoring tells you when something broke. It does not give anyone comprehension of why.
"Add a supervisory AI layer." A layer that watches dark code is itself dark if it wasn't understood when built.
"Rewrite it." Rewriting dark code without first building comprehension just creates new dark code faster.

Do distinguish:

Confirmed findings (from auto-discovery or direct interview answer) vs. risks (inferred from patterns)
Velocity dark code (addressable now with context layers + gate) vs. structural dark code (requires runtime tooling beyond Claude Code's scope)

After Writing

Report:

Dark code audit complete.

Comprehension debt level: [LOW / MEDIUM / HIGH / CRITICAL]
Hotspots: [N modules at HIGH risk]
Top action: [single most important next step]

Full report: docs/dark-code-audit/YYYY-MM-DD-audit.md

Recommended next steps:
  • /context-layer-generator [highest-risk module] — build context layers
  • /comprehension-gate — run on any pending PRs touching hotspot modules