Run any Skill in Manus with one click

$pwd:

deep-audit

Name: Deep Audit
Author: av1155

// Exhaustive post-work audit for any codebase. Run after finishing a feature, sprint, bug fix, or refactoring pass to find overlooked edge cases, discarded response data, missing cross-field validations, misleading success signals, untested failure paths, and implicit assumptions. Invoke with /deep-audit or /deep-audit followed by context. Use this skill whenever the user says audit, review my changes, check my work, what did I miss, post-mortem, sanity check, edge cases, or asks to verify completeness of recent code changes.

Run Skill in Manus

$ git log --oneline --stat

stars:6

forks:0

updated:May 14, 2026 at 07:10

File Explorer

3 files

SKILL.md

readonly

related-skills.json

same repository

commenting.md

from "av1155/.dotfiles"

Universal commenting and documentation conventions across all languages. Load when writing or reviewing substantial comments, docstrings, file headers, or migration documentation. Project-specific commenting standards in `docs/commenting-standard.md` (when present) extend or override this baseline.

2026-05-156

agentic-coding-harnesses.md

from "av1155/.dotfiles"

Use when modifying the agentic coding environment, including dotfiles, skills/rules wiring, AGENTS.md/CLAUDE.md loading, Stow symlinks, Claude/Codex/OpenCode/Pi config, plugins, MCP, hooks, or harness compatibility. Loads the runbook to avoid breaking cross-harness setup. Defer to skill-creator-global for skill authoring craft, description quality, and progressive disclosure.

2026-05-126

find-docs.md

from "av1155/.dotfiles"

Retrieves up-to-date documentation, API references, and code examples for any developer technology via the ctx7 (Context7) CLI. Use this skill whenever the user mentions "ctx7" or "context7", or asks about a specific library, framework, SDK, CLI tool, or cloud service -- even for well-known ones like React, Next.js, Prisma, Express, Tailwind, Django, or Spring Boot. Your training data may not reflect recent API changes or version updates. Always use for: API syntax questions, configuration options, version migration issues, "how do I" questions mentioning a library name, debugging that involves library-specific behavior, setup instructions, and CLI tool usage. Use even when you think you know the answer -- do not rely on training data for API details, signatures, or configuration options as they are frequently outdated. Always verify against current docs. Prefer this over web search for library documentation and API details.

2026-05-126

frontend-design-global.md

from "av1155/.dotfiles"

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, artifacts, posters, or applications (examples include websites, landing pages, dashboards, React components, HTML/CSS layouts, or when styling/beautifying any web UI). Generates creative, polished code and UI design that avoids generic AI aesthetics.

2026-05-126

skill-creator-global.md

from "av1155/.dotfiles"

Create, revise, or review Agent Skills for authoring quality. Use when writing SKILL.md content, improving skill descriptions and trigger words, deciding what belongs in SKILL.md versus references/scripts, or diagnosing why a skill over-triggers or under-triggers. Defer to agentic-coding-harnesses for dotfiles, Stow, symlinks, harness visibility, AGENTS.md, plugins, MCP, or environment wiring.

2026-05-126

firecrawl-cli.md

from "av1155/.dotfiles"

Search, scrape, and interact with the web via the Firecrawl CLI. Use this skill whenever the user wants to search the web, find articles, research a topic, look something up online, scrape a webpage, grab content from a URL, get data from a website, crawl documentation, download a site, or interact with pages that need clicks or logins. Also use when they say "fetch this page", "pull the content from", "get the page at https://", or reference external websites. This provides real-time web search with full page content and interact capabilities — beyond what Claude can do natively with built-in tools. Do NOT trigger for local file operations, git commands, deployments, or code editing tasks.

2026-05-106

package.json

"author": "av1155"

"repository": "av1155/.dotfiles"

View GitHub Repository View Creator Repositories

$ install --global

$ download --local

Run Skill in Manus

$ useful --forSOC

Software Quality Assurance Analysts and TestersComputer and Mathematical Occupations15-1253L4

name	deep-audit
description	Exhaustive post-work audit for any codebase. Run after finishing a feature, sprint, bug fix, or refactoring pass to find overlooked edge cases, discarded response data, missing cross-field validations, misleading success signals, untested failure paths, and implicit assumptions. Invoke with /deep-audit or /deep-audit followed by context. Use this skill whenever the user says audit, review my changes, check my work, what did I miss, post-mortem, sanity check, edge cases, or asks to verify completeness of recent code changes.
argument-hint	[description or --range HEAD~N..HEAD or --files path1 path2]
disable-model-invocation	true
allowed-tools	Read Grep Glob Bash(git ) Bash(find ) Bash(wc ) Bash(head ) Bash(tail ) Bash(cat ) Bash(jq *) Agent

Deep Audit — Exhaustive Post-Work Review

You are performing an exhaustive, multi-dimensional audit of recent code changes. Your goal is to find every overlooked edge case, every piece of discarded data, every missing validation, every misleading success signal, and every implicit assumption in the changed code. You are not a linter. You are not a generic code reviewer. You are hunting for the kinds of bugs that only surface when a real user does something the developer never anticipated.

Why this exists

A real incident: a ping() function called GET /api/v3/system/status and received a JSON response containing an appName field identifying the remote application. The code checked whether the HTTP request returned 200 but discarded the response body entirely. A user configured a Sonarr URL under a "Radarr" type selector. The connection test passed (Sonarr responds 200 to the same endpoint). The system then sent Radarr-format commands to Sonarr, which returned HTTP 500. Debugging consumed hours across two people. The fix was a single field comparison using data already present in the response that was being thrown away. This skill ensures that class of oversight never recurs.

Invocation

/deep-audit — audit all changes vs. the default branch (auto-detects main/master/develop)
/deep-audit description of what I just built — same, but focuses the audit on the described area
/deep-audit --range HEAD~5..HEAD — audit a specific commit range
/deep-audit --files src/foo.py src/bar.py — audit specific files only

Execution Flow

Step 0: Parse arguments, determine scope, set up tracking

ARGS = "$ARGUMENTS"

If ARGS is empty, diff against the default branch. If ARGS starts with --range, extract the range. If ARGS starts with --files, use those paths. Otherwise, treat the entire argument string as context that focuses the audit without changing the scope.

Determine the default branch:

git symbolic-ref refs/remotes/origin/HEAD 2>/dev/null | sed 's@^refs/remotes/origin/@@' || echo "main"

Get the list of changed files:

git diff --name-only --diff-filter=ACMR <base>

If no files are changed, report that and stop.

Progress tracking: If the diff touches more than 15 files, use ToolSearch to load TaskCreate and TaskUpdate. Create a tracked task named "Deep Audit" so progress is visible during the run. Update it at each major phase transition (Reconnaissance → Build Verification → Plan → Execute → Report). For smaller diffs, skip task tracking and work inline.

Step 1: Reconnaissance (use subagents heavily)

Before building any audit plan, gather full context. Do NOT skip this phase.

1a. Classify the project

Read config files to auto-detect the project's stack. Check for (non-exhaustive — look for whatever exists):

Language/runtime: package.json, pyproject.toml, Cargo.toml, go.mod, build.gradle, pom.xml, Gemfile, mix.exs, composer.json
Framework: next.config., nuxt.config., vite.config., angular.json, svelte.config., manage.py, config/routes.rb
Package manager: detect from lockfile — pnpm-lock.yaml, yarn.lock, package-lock.json, uv.lock, Pipfile.lock, poetry.lock, Cargo.lock, go.sum
Test framework: vitest.config., jest.config., pytest.ini, .rspec, *_test.go files
Linter/formatter: .eslintrc.*, biome.json, .prettierrc, ruff.toml, .rubocop.yml, rustfmt.toml
CI: .github/workflows/, .gitlab-ci.yml, Jenkinsfile, .circleci/
Database: migrations/ directories, prisma/schema.prisma, supabase/ directory, alembic/, knex migrations
Project conventions: CLAUDE.md, README.md, CONTRIBUTING.md

Store this classification — it determines which audit dimensions are relevant and which tools to invoke.

1b. Discover project-specific resources

Scan for resources the audit can leverage. This skill is global and must adapt to whatever project it lands in.

Project slash commands and skills:

ls .claude/commands/ .claude/skills/ 2>/dev/null

Read any that look relevant to quality checking (names containing "review", "test", "lint", "check", "verify", "validate", "ci", "qa"). If the project has a quality-gate command (like a /review that runs lint+typecheck+test), note it — you will invoke it in Step 2.

Build and test scripts: Check the package manager config for available scripts:

package.json scripts field (look for test, lint, typecheck, build, check, validate)
Makefile targets
pyproject.toml [tool.pytest], [tool.ruff], scripts
Cargo.toml — cargo test, cargo clippy, cargo build are always available for Rust

Issue tracker and PR context: Determine what's available and use the best source of intent:

Linear: Check if the Linear MCP is connected AND the project references Linear (search CLAUDE.md, README, or .claude/ configs for "linear" mentions). If so, use Linear MCP to look up the issue linked to the current branch. Linear issues often carry the ground-truth acceptance criteria.
GitHub (PR, issues, reviews): If the remote is a GitHub repo, pull PR and issue context using whichever method is available, in this preference order:
- gh CLI (check which gh): fastest and most flexible. Use gh pr view --json title,body,comments,reviews,url, gh pr diff, gh issue view <number> --json title,body,comments, etc. The gh api subcommand can hit any GitHub REST or GraphQL endpoint if you need more.
- Raw git: git log --oneline <base>..HEAD for commit messages if gh is unavailable. Prefer gh CLI; fall back to raw git when needed.
Neither: Fall back to git log messages for intent context.

The issue/PR context tells you what the code is supposed to do, making it far easier to spot where implementation diverges from requirement.

1c. Explore changed files with subagents

For each changed file (or tightly-coupled group), spawn an Explore subagent to research:

What calls this file (callers/importers)
What this file calls (dependencies/imports)
What data flows through it (function signatures, return types, response handling)
Every external interaction: HTTP calls, DB queries, file I/O, message queues, IPC
For each external interaction: what data is sent, what data is received, what is done with the response, and critically — what is NOT done with the response
The corresponding test file(s) — do they exist? What do they test?

One subagent per major file or tightly-coupled group. Collect all reports before proceeding.

1d. Read the actual diffs

Don't just know which files changed — read what changed within them:

git diff <base> -- <file>

1e. Library and API verification (when available)

For external libraries, APIs, or frameworks used in the changed code, verify correct usage:

ctx7 (first choice): Use it to fetch current docs for any library or framework the changed code interacts with. Check whether the code uses the API correctly, handles all documented error cases, and isn't ignoring return values or response fields that the docs say are important.
WebSearch + WebFetch (general research): Use ToolSearch to load WebSearch and WebFetch. Search for known issues, CVEs, deprecation notices, or migration guides related to the specific dependency versions in use. Check whether any external API the code calls has documented constraints the code doesn't enforce.
firecrawl (JS-heavy fallback): If WebFetch fails to properly render a page (returns empty content, incomplete HTML, or garbled output from a JavaScript-heavy documentation site) and firecrawl is available, use it as a fallback. Reserve it for sites standard fetching cannot handle.

Skip this entire substep if no web tools are available. The audit continues without it.

Step 2: Build verification

Before spending time on semantic analysis, confirm the code compiles and passes existing quality gates.

2a. Build check

Using the detected package manager and build system from Step 1a, run the build command:

Node/pnpm/yarn/npm: the build script if it exists
Python: python -m py_compile on changed .py files, or the project's build command
Rust: cargo check
Go: go build ./...
Other: whatever the project's standard build command is

If the build fails, report the failure immediately. Do not proceed with the full audit — the developer needs to fix compilation first. List the errors and stop.

2b. Run existing quality gates

If Step 1b found project-specific quality commands (a /review slash command, or individual lint/typecheck/test scripts), invoke them now. Specifically:

Run the linter on changed files if a linter exists
Run the type checker if one exists
Run tests related to changed files if a test runner exists

Capture all output. Failures become automatic FAIL findings in the audit report — you don't need to rediscover what the toolchain already caught. Passing quality gates doesn't mean the code is correct; it means the audit can focus on the things machines don't catch.

If no quality gates are found, note it as a WARN ("No automated quality gates detected in this project") and continue.

2c. Security guidance check

If the security-guidance plugin is available (check via ToolSearch), invoke it against the changed files. Capture its output. Any findings become inputs to Dimension 7 (Security Surface) of the audit — you don't need to duplicate its work, but you should verify its findings and add any it missed.

If the plugin is unavailable, Dimension 7 runs entirely from the hand-written checks in the audit dimensions reference.

Step 3: Build the audit plan

Using reconnaissance data and build verification results, construct a detailed plan organized by the ten audit dimensions. Each item must be a concrete, specific check against actual code — never generic advice.

Enter plan mode if not already in it. Present the plan to the user for review before executing.

If the plan has more than 15 checklist items, lead with a scannable summary table before the full detail. This lets the user see the entire scope at a glance and decide whether to approve, adjust, or skip dimensions before reading every finding:

| ID     | Dimension              | File / Location         | Concern (one line)                        |
|--------|------------------------|-------------------------|-------------------------------------------|
| D1-01  | Discarded Data         | lib/client.ts:53        | getUser() response fields unused           |
| D2-01  | Cross-Field Validation | migration.sql:129       | max_shares not checked >= min_shares       |
| ...    | ...                    | ...                     | ...                                        |

Then present the full detail by dimension below the table. For audits with 15 or fewer items, skip the summary table and go straight to the full detail.

Read references/audit-dimensions.md for the detailed specification of all ten dimensions with examples and detection strategies.

Read references/anti-patterns.md for the catalog of common oversight patterns to hunt for.

The ten dimensions are:

Discarded Data — response bodies, return values, and fields that are fetched but unused
Cross-Field Validation — logical relationships between user inputs that go unchecked
Failure Mode Distance — how far errors travel from their root cause before surfacing
Happy Path Bias — code paths that only work when everything goes right
Boundary Validation — completeness of checks at every trust boundary
Error Handling — swallowed errors, unactionable messages, missing timeout handling
Security Surface — injection, auth bypass, SSRF, secrets exposure scoped to changes (incorporate security-guidance plugin output if available)
Concurrency and State — race conditions, shared mutable state, async safety
Idempotency and Retry Safety — double-execution risk, missing rollback paths
Observable Behavior Gaps — what the user sees (or doesn't) during partial failures

Additional cross-cutting checks:

Test Coverage Gaps — for each changed source file:
- Does a corresponding test file exist? If not, FAIL.
- Do the tests cover the new/changed code paths? Specifically, do they test non-happy paths, error conditions, and edge cases introduced by the changes?
- If new conditional branches were added, are there tests for both sides?
Schema and Contract Consistency — if any of the following changed, verify downstream consumers are updated:
- TypeScript interfaces/types that other files import
- Database schemas, migrations, or ORM models
- API request/response shapes (OpenAPI specs, GraphQL schemas, tRPC routers)
- Environment variable requirements
- Configuration file formats
- If a schema changed but its consumers didn't, FAIL.

For each dimension, generate specific checklist items derived from the actual code. Each item has:

An ID (e.g., D1-03 for Dimension 1, item 3; T-02 for Test Coverage, item 2; S-01 for Schema Consistency, item 1)
A concrete description of what to check
The file(s) and line(s) involved

Presentation order for the plan: If the total checklist exceeds 15 items, present a condensed summary table first so the user can scan the full scope before reading detail:

| ID | File(s) | Check | Likely Severity |
|----|---------|-------|-----------------|
| D1-01 | retriever.ts | similarity score usage | WARN |
| D2-01 | migration 1, L129 | min/max shares constraint | FAIL |
| ... | ... | ... | ... |

Then present the full detail per dimension below the table. This prevents a 28-item wall of text where the user cannot see the forest for the trees.

Step 4: Execute the audit

For each checklist item, determine:

PASS: The code handles this correctly. State the evidence.
FAIL: The code has a concrete deficiency. State exactly what is wrong and provide a fix.
WARN: The code might be fine but deserves human review. State the concern.

Do NOT be lazy. Do NOT skip items. Do NOT produce generic observations. Every finding must cite specific files, line numbers, and code. If a dimension has zero items (e.g., no concurrency in a pure synchronous script), state that explicitly and move on.

Observable behavior verification (when possible): If the changes affect a web UI and playwright-cli is available, consider spawning a subagent to actually load the affected page, trigger the new feature, and verify what the user sees, instead of only inferring behavior from source code. This is optional and should only be attempted if the project has a working dev server configuration. Do not block the audit on browser verification.

Step 5: Summary report

After all checks complete, produce:

## Deep Audit Report

### Scope
- Base: <branch or range>
- Files audited: <count>
- Dimensions evaluated: <count>/12
- Issue/PR context: <Linear issue, GitHub PR, or "none found">
- Build verification: <passed/failed/skipped>
- Quality gates run: <list what ran, or "none detected">

### Results
| Status | Count |
|--------|-------|
| PASS   | N     |
| FAIL   | N     |
| WARN   | N     |
| SKIP   | N     |

### Critical Findings (FAIL)
<each finding with full context, code reference, and recommended fix>

### Warnings (WARN)
<each warning with context and what to look for>

### Passed Checks
<abbreviated list — ID and one-line description only>

### Meta-Observations
<patterns across findings, if any — e.g., "3 of 4 FAILs involve discarded response data"
or "all changed files lack corresponding test files">

After reporting: If any FAIL was novel (doesn't match an existing anti-pattern in references/anti-patterns.md), tell the user: "This finding doesn't match any cataloged anti-pattern. Consider adding it to ~/.claude/skills/deep-audit/references/anti-patterns.md so future audits hunt for it specifically." Describe the pattern shape so the user can append it.

Step 6: Create tracking issues (requires approval)

If the report contains any FAIL or WARN findings, offer to create tracking issues in the project's issue tracker. Do not create issues without explicit user approval. Present the proposed issues first and wait for confirmation.

Detect which tracker to use (same discovery as Step 1b):

Linear MCP — if connected and the project uses Linear, create issues with:
- Title: finding ID + short description (e.g., "S7-01: Change CASCADE to RESTRICT on financial table FKs")
- Description: file paths, line numbers, problem explanation, and recommended fix from the audit report
- Priority: high for FAIL findings in the Priority 1 tier, medium for everything else
- Labels: match the project's existing label conventions (check what labels exist before creating)
- Set dependency relationships between issues where one fix must precede another (e.g., tests blocked by the code changes they test)
GitHub Issues via gh CLI — if gh is available and the remote is GitHub:
```
gh issue create --title "<ID>: <description>" --body "<details from report>"
```
Apply labels matching project conventions. If no conventions exist, use bug for FAILs and enhancement for WARNs.
None available — list the proposed issues in the chat so the user can create them manually.

Presentation before approval:

I found N FAIL and M WARN findings. Want me to create tracking issues?

Priority 1 (FAIL — high):
- S7-01: Change CASCADE to RESTRICT on financial table FKs
- D5-01: Add CHECK constraint on investors.withholding_rate [0, 1]

Priority 2 (FAIL/WARN — medium):
- D2-01: Add cross-field CHECKs on distribution_tiers
- T-01: Write tests for Supabase client factories

Tracker: Linear (detected) | Labels: type:bug for FAILs, Improvement for WARNs

Create these issues? [wait for user confirmation]

Only proceed after the user confirms. If the user modifies the list (removes items, changes priorities, adjusts grouping), apply their changes. Skip Priority 3 findings by default — only include them if the user asks.

Rules

Never abbreviate. Run every dimension against every changed file where it applies.
Never produce generic advice. "Consider adding error handling" is not a finding. "Line 47 of src/client.py catches httpx.HTTPError but swallows the exception with pass, losing the error message and status code that the caller needs for retry logic" is a finding.
Always cite code. Every PASS, FAIL, and WARN must reference a specific file and line range.
Use subagents for heavy research. Don't try to hold the entire codebase in context. Farm out exploration to Explore subagents and synthesize their reports.
Prioritize findings by user impact. A FAIL that causes silent data loss ranks higher than a WARN about a missing log message.
Don't audit generated files, lockfiles, or vendored dependencies unless explicitly requested.
If the diff is very large (100+ files), focus on non-test source files first, then audit test files for coverage gaps.
Adapt to the project. Use whatever quality gates, slash commands, test runners, linters, and issue trackers exist in the current project. Never hardcode tool names — discover them dynamically from config files and available resources.
Degrade gracefully. Every optional tool (ctx7, WebSearch, firecrawl, Linear, gh, playwright-cli, security-guidance) enhances the audit when present but must not block it when absent. If a tool is unavailable, note what was skipped and continue.
PASS means PASS. If code correctly handles something, mark it PASS and move on. Do not manufacture concerns about correct code, do not hedge PASSes with "but you could also...", and do not invent hypothetical scenarios that require ignoring the code's actual behavior. The audit is a quality gate, not a wishlist. A clean audit should produce mostly PASSes with zero FAILs — that is the goal, not a sign that the audit wasn't thorough enough.

deep-audit

More from this repository

More from this repository

Deep Audit — Exhaustive Post-Work Review

Why this exists

Invocation

Execution Flow

Step 0: Parse arguments, determine scope, set up tracking

Step 1: Reconnaissance (use subagents heavily)

1a. Classify the project

1b. Discover project-specific resources

1c. Explore changed files with subagents

1d. Read the actual diffs

1e. Library and API verification (when available)

Step 2: Build verification

2a. Build check

2b. Run existing quality gates

2c. Security guidance check

Step 3: Build the audit plan

Step 4: Execute the audit

Step 5: Summary report

Step 6: Create tracking issues (requires approval)

Rules

Deep Audit — Exhaustive Post-Work Review

Why this exists

Invocation

Execution Flow

Step 0: Parse arguments, determine scope, set up tracking

Step 1: Reconnaissance (use subagents heavily)

1a. Classify the project

1b. Discover project-specific resources

1c. Explore changed files with subagents

1d. Read the actual diffs

1e. Library and API verification (when available)

Step 2: Build verification

2a. Build check

2b. Run existing quality gates

2c. Security guidance check

Step 3: Build the audit plan

Step 4: Execute the audit

Step 5: Summary report

Step 6: Create tracking issues (requires approval)

Rules