with one click
understand
// Analyze a codebase to produce an interactive knowledge graph for understanding architecture, components, and relationships
// Analyze a codebase to produce an interactive knowledge graph for understanding architecture, components, and relationships
| name | understand |
| description | Analyze a codebase to produce an interactive knowledge graph for understanding architecture, components, and relationships |
| argument-hint | ["[path] [--full|--auto-update|--no-auto-update|--review|--language <lang>]"] |
Analyze the current codebase and produce a knowledge-graph.json file in .understand-anything/. This file powers the interactive dashboard for exploring the project's architecture.
$ARGUMENTS may contain:
--full — Force a full rebuild, ignoring any existing graph--auto-update — Enable automatic graph updates on commit (writes autoUpdate: true to .understand-anything/config.json)--no-auto-update — Disable automatic graph updates (writes autoUpdate: false to .understand-anything/config.json)--review — Run full LLM graph-reviewer instead of inline deterministic validation--language <lang> — Generate all textual content (summaries, descriptions, tags, titles, languageNotes, languageLesson) in the specified language. Accepts ISO 639-1 codes (zh, ja, ko, en, es, fr, de, etc.) or friendly names (chinese, japanese, korean, english, spanish, etc.). Locale variants supported: zh-TW, zh-HK, etc. Defaults to en (English). Stores preference in .understand-anything/config.json for consistency across incremental updates./path/to/repo or ../other-project) — Analyze the given directory instead of the current working directoryDetermine whether to run a full analysis or incremental update.
Resolve PROJECT_ROOT:
Parse $ARGUMENTS for a non-flag token (any argument that does not start with --). If found, treat it as the target directory path.
test -d <path>). If it does not exist or is not a directory, report an error to the user and STOP.PROJECT_ROOT to the resolved absolute path.If no directory path argument is found, set PROJECT_ROOT to the current working directory.
Worktree redirect. If PROJECT_ROOT is inside a git worktree (not the main checkout), redirect output to the main repository root. Worktrees managed by Claude Code are ephemeral — .understand-anything/ written there is destroyed when the session ends, taking the knowledge graph with it (issue #133). Detect a worktree by comparing git rev-parse --git-dir against git rev-parse --git-common-dir; in a normal checkout or submodule they resolve to the same path, in a worktree they differ and the parent of --git-common-dir is the main repo root.
COMMON_DIR=$(git -C "$PROJECT_ROOT" rev-parse --git-common-dir 2>/dev/null)
GIT_DIR=$(git -C "$PROJECT_ROOT" rev-parse --git-dir 2>/dev/null)
if [ -n "$COMMON_DIR" ] && [ -n "$GIT_DIR" ]; then
COMMON_ABS=$(cd "$PROJECT_ROOT" && cd "$COMMON_DIR" 2>/dev/null && pwd -P)
GIT_ABS=$(cd "$PROJECT_ROOT" && cd "$GIT_DIR" 2>/dev/null && pwd -P)
if [ -n "$COMMON_ABS" ] && [ "$COMMON_ABS" != "$GIT_ABS" ]; then
MAIN_ROOT=$(dirname "$COMMON_ABS")
if [ -d "$MAIN_ROOT" ] && [ "${UNDERSTAND_NO_WORKTREE_REDIRECT:-0}" != "1" ]; then
echo "[understand] Detected git worktree at $PROJECT_ROOT"
echo "[understand] Redirecting output to main repo root: $MAIN_ROOT"
echo "[understand] (Set UNDERSTAND_NO_WORKTREE_REDIRECT=1 to keep PROJECT_ROOT as the worktree.)"
PROJECT_ROOT="$MAIN_ROOT"
fi
fi
fi
Set UNDERSTAND_NO_WORKTREE_REDIRECT=1 if you intentionally want a per-worktree graph (rare — most users want the redirect).
1.5. Ensure the plugin is built. Later phases invoke Node scripts that import @understand-anything/core. On a fresh install packages/core/dist/ does not exist yet — build once.
Important: do not assume the plugin root is simply two directories above the skill path string. In many installations ~/.agents/skills/understand is a symlink into the real plugin checkout. Prefer runtime-provided plugin roots first (for Claude), then fall back to universal symlinks, skill symlink resolution, and common clone-based install paths.
Resolve the plugin root like this:
SKILL_REAL=$(realpath ~/.agents/skills/understand 2>/dev/null || readlink -f ~/.agents/skills/understand 2>/dev/null || echo "")
SELF_RELATIVE=$([ -n "$SKILL_REAL" ] && cd "$SKILL_REAL/../.." 2>/dev/null && pwd || echo "")
COPILOT_SKILL_REAL=$(realpath ~/.copilot/skills/understand 2>/dev/null || readlink -f ~/.copilot/skills/understand 2>/dev/null || echo "")
COPILOT_SELF_RELATIVE=$([ -n "$COPILOT_SKILL_REAL" ] && cd "$COPILOT_SKILL_REAL/../.." 2>/dev/null && pwd || echo "")
PLUGIN_ROOT=""
for candidate in \
"${CLAUDE_PLUGIN_ROOT}" \
"$HOME/.understand-anything-plugin" \
"$SELF_RELATIVE" \
"$COPILOT_SELF_RELATIVE" \
"$HOME/.codex/understand-anything/understand-anything-plugin" \
"$HOME/.opencode/understand-anything/understand-anything-plugin" \
"$HOME/.pi/understand-anything/understand-anything-plugin" \
"$HOME/understand-anything/understand-anything-plugin"; do
if [ -n "$candidate" ] && [ -f "$candidate/package.json" ] && [ -f "$candidate/pnpm-workspace.yaml" ]; then
PLUGIN_ROOT="$candidate"
break
fi
done
if [ -z "$PLUGIN_ROOT" ]; then
echo "Error: Cannot find the understand-anything plugin root."
echo "Checked:"
echo " - ${CLAUDE_PLUGIN_ROOT:-<unset CLAUDE_PLUGIN_ROOT>}"
echo " - $HOME/.understand-anything-plugin"
echo " - ${SELF_RELATIVE:-<unresolved path derived from ~/.agents/skills/understand>}"
echo " - ${COPILOT_SELF_RELATIVE:-<unresolved path derived from ~/.copilot/skills/understand>}"
echo " - $HOME/.codex/understand-anything/understand-anything-plugin"
echo " - $HOME/.opencode/understand-anything/understand-anything-plugin"
echo " - $HOME/.pi/understand-anything/understand-anything-plugin"
echo " - $HOME/understand-anything/understand-anything-plugin"
echo "Make sure the plugin is installed correctly."
exit 1
fi
if [ ! -f "$PLUGIN_ROOT/packages/core/dist/index.js" ]; then
cd "$PLUGIN_ROOT" && (pnpm install --frozen-lockfile 2>/dev/null || pnpm install) && pnpm --filter @understand-anything/core build
fi
If pnpm is missing, report to the user: "Install Node.js ≥ 22 and pnpm ≥ 10, then re-run /understand."
Get the current git commit hash:
git rev-parse HEAD
Create the intermediate and temp output directories:
mkdir -p $PROJECT_ROOT/.understand-anything/intermediate
mkdir -p $PROJECT_ROOT/.understand-anything/tmp
3.5. Auto-update configuration:
- If --auto-update is in $ARGUMENTS: write {"autoUpdate": true} to $PROJECT_ROOT/.understand-anything/config.json
- If --no-auto-update is in $ARGUMENTS: write {"autoUpdate": false} to $PROJECT_ROOT/.understand-anything/config.json
- These flags only set the config — analysis proceeds normally regardless.
3.6. Language configuration:
- Parse $ARGUMENTS for --language <lang> flag. If found, extract the language code.
- Language code normalization: Map friendly names to ISO codes:
- chinese → zh, japanese → ja, korean → ko, english → en, spanish → es, french → fr, german → de, portuguese → pt, russian → ru, arabic → ar, etc.
- Locale variants: zh-TW, zh-HK, zh-CN, pt-BR, etc. are preserved as-is.
- If --language is NOT specified:
- Check $PROJECT_ROOT/.understand-anything/config.json for an existing outputLanguage field. If present, use that.
- If no stored preference, default to en (English).
- If --language IS specified:
- Update $PROJECT_ROOT/.understand-anything/config.json with the new language: merge {"outputLanguage": "<lang>"} into existing config.
- Store as $OUTPUT_LANGUAGE for use throughout all phases.
- Language directive template: Store as $LANGUAGE_DIRECTIVE:
markdown > **Language directive**: Generate all textual content (summaries, descriptions, tags, titles, languageNotes, languageLesson) in **{language}**. Maintain technical accuracy while using natural, native-level phrasing in the target language. Keep technical terms in English when no standard translation exists (e.g., "middleware", "hook", "barrel").
*knowledge-graph*.json files in $PROJECT_ROOT/.understand-anything/ excluding knowledge-graph.json itself (e.g. frontend-knowledge-graph.json, backend-knowledge-graph.json). If any subdomain graphs exist, run the merge script bundled with this skill (located next to this SKILL.md file — use the skill directory path, not the project root):python <SKILL_DIR>/merge-subdomain-graphs.py $PROJECT_ROOT
The script discovers subdomain graphs, loads the existing knowledge-graph.json as a base (if present), and merges everything into knowledge-graph.json (deduplicating nodes and edges). Report the merge summary to the user, then continue with the merged graph.
Check if $PROJECT_ROOT/.understand-anything/knowledge-graph.json exists. If it does, read it.
Check if $PROJECT_ROOT/.understand-anything/meta.json exists. If it does, read it to get gitCommitHash.
Decision logic:
| Condition | Action |
|---|---|
--full flag in $ARGUMENTS | Full analysis (all phases) |
| No existing graph or meta | Full analysis (all phases) |
--review flag + existing graph + unchanged commit hash | Skip to Phase 6 (review-only — reuse existing assembled graph) |
| Existing graph + unchanged commit hash | Ask the user: "The graph is up to date at this commit. Would you like to: (a) run a full rebuild (--full), (b) run the LLM graph reviewer (--review), or (c) do nothing?" Then follow their choice. If they pick (c), STOP. |
| Existing graph + changed files | Incremental update (re-analyze changed files only) |
Review-only path: Copy the existing knowledge-graph.json to $PROJECT_ROOT/.understand-anything/intermediate/assembled-graph.json, then jump directly to Phase 6 step 3.
For incremental updates, get the changed file list:
git diff <lastCommitHash>..HEAD --name-only
If this returns no files, report "Graph is up to date" and STOP.
Collect project context for subagent injection:
README.md (or README.rst, readme.md) from $PROJECT_ROOT if it exists. Store as $README_CONTENT (first 3000 characters).package.json, pyproject.toml, Cargo.toml, go.mod, pom.xml) if it exists. Store as $MANIFEST_CONTENT.find $PROJECT_ROOT -maxdepth 2 -type f -not -path '*/node_modules/*' -not -path '*/.git/*' -not -path '*/dist/*' | head -100
Store as $DIR_TREE.src/index.ts, src/main.ts, src/App.tsx, index.js, main.py, manage.py, app.py, wsgi.py, asgi.py, run.py, __main__.py, main.go, cmd/*/main.go, src/main.rs, src/lib.rs, src/main/java/**/Application.java, Program.cs, config.ru, index.php. Store first match as $ENTRY_POINT.Set up and verify the .understandignore file before scanning.
$PROJECT_ROOT/.understand-anything/.understandignore exists.$PROJECT_ROOT (reads .gitignore and deduplicates against built-in defaults):
node -e "
const fs = require('fs');
const path = require('path');
const root = process.cwd();
const defaults = ['node_modules/','node_modules','.git/','vendor/','venv/','.venv/','__pycache__/','dist/','dist','build/','build','out/','coverage/','coverage','.next/','.cache/','.turbo/','target/','obj/','*.lock','package-lock.json','yarn.lock','pnpm-lock.yaml','*.png','*.jpg','*.jpeg','*.gif','*.svg','*.ico','*.woff','*.woff2','*.ttf','*.eot','*.mp3','*.mp4','*.pdf','*.zip','*.tar','*.gz','*.min.js','*.min.css','*.map','*.generated.*','.idea/','.vscode/','LICENSE','.gitignore','.editorconfig','.prettierrc','.eslintrc*','*.log'];
const norm = p => p.replace(/\/+$/, '');
const defaultSet = new Set(defaults.map(norm));
const header = '# .understandignore — patterns for files/dirs to exclude from analysis\n# Syntax: same as .gitignore (globs, # comments, ! negation, trailing / for dirs)\n# Lines below are suggestions — uncomment to activate.\n# Use ! prefix to force-include something excluded by defaults.\n#\n# Built-in defaults (always excluded unless negated):\n# node_modules/, .git/, dist/, build/, obj/, *.lock, *.min.js, etc.\n#\n';
let body = '';
const gitignorePath = path.join(root, '.gitignore');
if (fs.existsSync(gitignorePath)) {
const gi = fs.readFileSync(gitignorePath, 'utf-8').split('\n').map(l => l.trim()).filter(l => l && !l.startsWith('#')).filter(p => !defaultSet.has(norm(p)));
if (gi.length) { body += '# --- From .gitignore (uncomment to exclude) ---\n\n' + gi.map(p => '# ' + p).join('\n') + '\n\n'; }
}
const dirs = ['__tests__','test','tests','fixtures','testdata','docs','examples','scripts','migrations','.storybook'];
const found = dirs.filter(d => fs.existsSync(path.join(root, d)));
if (found.length) { body += '# --- Detected directories (uncomment to exclude) ---\n\n' + found.map(d => '# ' + d + '/').join('\n') + '\n\n'; }
body += '# --- Test file patterns (uncomment to exclude) ---\n\n# *.test.*\n# *.spec.*\n# *.snap\n';
const outDir = path.join(root, '.understand-anything');
if (!fs.existsSync(outDir)) fs.mkdirSync(outDir, { recursive: true });
fs.writeFileSync(path.join(outDir, '.understandignore'), header + body);
"
Generated
.understand-anything/.understandignorewith suggested exclusions based on your project structure. Please review it and uncomment any patterns you'd like to exclude from analysis. When ready, confirm to continue.
Found
.understand-anything/.understandignore. Review it if needed, then confirm to continue.
Dispatch a subagent using the project-scanner agent definition (at agents/project-scanner.md). Append the following additional context:
Additional context from main session:
Project README (first 3000 chars):
$README_CONTENTPackage manifest:
$MANIFEST_CONTENTUse this context to produce more accurate project name, description, and framework detection. The README and manifest are authoritative — prefer their information over heuristics.
$LANGUAGE_DIRECTIVE
Pass these parameters in the dispatch prompt:
Scan this project directory to discover all project files (including non-code files like configs, docs, infrastructure), detect languages and frameworks. Project root:
$PROJECT_ROOTWrite output to:$PROJECT_ROOT/.understand-anything/intermediate/scan-result.json
After the subagent completes, read $PROJECT_ROOT/.understand-anything/intermediate/scan-result.json to get:
fileCategory per file (code, config, docs, infra, data, script, markup)importMap): pre-resolved project-internal imports per file (non-code files have empty arrays)Store importMap in memory as $IMPORT_MAP for use in Phase 2 batch construction.
Store the file list as $FILE_LIST with fileCategory metadata for use in Phase 2 batch construction.
Gate check: If >100 files, inform the user and suggest scoping with a subdirectory argument. Proceed only if user confirms or add guidance that this may take a while.
If the scan result includes filteredByIgnore > 0, report:
Excluded {filteredByIgnore} files via
.understandignore.
Batch the file list from Phase 1 into groups of 20-30 files each (aim for ~25 files per batch for balanced sizes).
Batching strategy for non-code files:
depends_on Dockerfile)fileCategory from Phase 1 must be included in the batch file listFor each batch, dispatch a subagent using the file-analyzer agent definition (at agents/file-analyzer.md). Run up to 5 subagents concurrently using parallel dispatch. Append the following additional context:
Additional context from main session:
Project:
<projectName>—<projectDescription>Languages:<languages from Phase 1>$LANGUAGE_DIRECTIVE
Before dispatching each batch, construct batchImportData from $IMPORT_MAP:
batchImportData = {}
for each file in this batch:
batchImportData[file.path] = $IMPORT_MAP[file.path] ?? []
Fill in batch-specific parameters below and dispatch:
Analyze these files and produce GraphNode and GraphEdge objects. Project root:
$PROJECT_ROOTProject:<projectName>Languages:<languages>Batch index:<batchIndex>Skill directory (for bundled scripts):<SKILL_DIR>Write output to:$PROJECT_ROOT/.understand-anything/intermediate/batch-<batchIndex>.jsonPre-resolved import data for this batch (use this for all import edge creation — do NOT re-resolve imports from source):
<batchImportData JSON>Files to analyze in this batch (every entry MUST be passed through to
batchFileswith all four fields —path,language,sizeLines,fileCategory):
<path>( lines, language:<language>, fileCategory:<fileCategory>)<path>( lines, language:<language>, fileCategory:<fileCategory>) ...
After ALL batches complete, run the merge-and-normalize script bundled with this skill (located next to this SKILL.md file — use the skill directory path, not the project root):
python <SKILL_DIR>/merge-batch-graphs.py $PROJECT_ROOT
This script reads all batch-*.json files from $PROJECT_ROOT/.understand-anything/intermediate/, then in one pass:
low→simple, medium→moderate, high→complex, etc.)(source, target, type)The merge script also runs a tested_by linker that canonicalizes test-coverage edges in two passes. Pass 1 walks LLM-emitted tested_by edges and flips inverted ones in place (the LLM systematically emits test → production because it sees the import only when analyzing the test file); semantically broken edges (test↔test, prod↔prod, orphan endpoints) are dropped. Pass 2 supplements with path-convention pairings (X.ts ↔ X.test.ts, JS/TS __tests__/ and <dir>/test/ walk-out, Python in-package tests/, Go _test.go sibling, Maven/Gradle src/test/... ↔ src/main/..., .NET <svc>/tests/ ↔ <svc>/src/... and <App>.Tests/ ↔ <App>/). Production nodes that end up sourcing any tested_by edge get a "tested" tag. All resulting edges run production → test.
Output: $PROJECT_ROOT/.understand-anything/intermediate/assembled-graph.json
Include the script's warnings in $PHASE_WARNINGS for the reviewer.
Use the changed files list from Phase 0. Batch and dispatch file-analyzer subagents using the same process as above (20-30 files per batch, up to 5 concurrent, with batchImportData constructed from $IMPORT_MAP), but only for changed files.
After batches complete:
filePath matches any changed file from the existing graphsource or target references a removed nodebatch-existing.json in the intermediate directorybatch-existing.json with the fresh batch-*.json files:
python <SKILL_DIR>/merge-batch-graphs.py $PROJECT_ROOT
Dispatch a subagent using the assemble-reviewer agent definition (at agents/assemble-reviewer.md).
Pass these parameters in the dispatch prompt:
Review the assembled graph at
$PROJECT_ROOT/.understand-anything/intermediate/assembled-graph.json. Project root:$PROJECT_ROOTBatch files are at:$PROJECT_ROOT/.understand-anything/intermediate/batch-*.jsonWrite review output to:$PROJECT_ROOT/.understand-anything/intermediate/assemble-review.jsonMerge script report:
<paste the full stderr output from merge-batch-graphs.py>Import map for cross-batch edge verification:
$IMPORT_MAP
After the subagent completes, read $PROJECT_ROOT/.understand-anything/intermediate/assemble-review.json and add any notes to $PHASE_WARNINGS.
Build the combined prompt template:
architecture-analyzer agent definition (at agents/architecture-analyzer.md).python, markdown, dockerfile, yaml, sql, terraform, graphql, protobuf, shell, html, css), read the file at ./languages/<language-id>.md (e.g., ./languages/python.md, ./languages/dockerfile.md) and append its content after the base template under a ## Language Context header. If the file does not exist for a detected language, skip it silently and continue. These files are in the languages/ subdirectory next to this SKILL.md file. Include non-code language snippets — they provide edge patterns and summary styles for non-code files.Django), read the file at ./frameworks/<framework-id-lowercase>.md (e.g., ./frameworks/django.md) and append its full content after the language context. If the file does not exist for a detected framework, skip it silently and continue. These files are in the frameworks/ subdirectory next to this SKILL.md file.$OUTPUT_LANGUAGE is NOT en (English), read the locale guidance file at ./locales/<language-code>.md (e.g., ./locales/zh.md, ./locales/ja.md, ./locales/ko.md) and append its content after the framework addendums under a ## Output Language Guidelines header. This provides language-specific guidance for tag naming conventions, summary style, and layer name translations. If the locale file does not exist for the specified language, skip silently — the $LANGUAGE_DIRECTIVE still applies. These files are in the locales/ subdirectory next to this SKILL.md file.Append the language/framework context and the following additional context to the agent's prompt:
Additional context from main session:
Frameworks detected:
<frameworks from Phase 1>Directory tree (top 2 levels):
$DIR_TREEUse the directory tree, language context, and framework addendums (appended above) to inform layer assignments. Directory structure is strong evidence for layer boundaries. Non-code files (config, docs, infrastructure, data) should be assigned to appropriate layers — see the prompt template for guidance.
$LANGUAGE_DIRECTIVE
Pass these parameters in the dispatch prompt:
Analyze this codebase's structure to identify architectural layers. Project root:
$PROJECT_ROOTWrite output to:$PROJECT_ROOT/.understand-anything/intermediate/layers.jsonProject:<projectName>—<projectDescription>File nodes (all node types — includes code files, config, document, service, pipeline, table, schema, resource, endpoint):
[list of {id, type, name, filePath, summary, tags} for ALL file-level nodes — omit complexity, languageNotes]Import edges:
[list of edges with type "imports"]All edges (for cross-category analysis — includes configures, documents, deploys, triggers, etc.):
[list of ALL edges — include all edge types]
After the subagent completes, read $PROJECT_ROOT/.understand-anything/intermediate/layers.json and normalize it into a final layers array. Apply these steps in order:
{ "layers": [...] } instead of a plain array, extract the inner array. (The prompt requests a plain array, but LLMs may still produce an envelope.)nodes field instead of nodeIds, rename nodes → nodeIds. If nodes entries are objects with an id field rather than plain strings, extract just the id values into nodeIds.id, generate one as layer:<kebab-case-name>.nodeIds entries are raw file paths without a known prefix (file:, config:, document:, service:, pipeline:, table:, schema:, resource:, endpoint:), convert them to file:<relative-path>.nodeIds entries that do not exist in the merged node set.Each element of the final layers array MUST have this shape:
[
{
"id": "layer:<kebab-case-name>",
"name": "<layer name>",
"description": "<what belongs in this layer>",
"nodeIds": ["file:src/App.tsx", "config:tsconfig.json", "document:README.md"]
}
]
All four fields (id, name, description, nodeIds) are required.
For incremental updates: Always re-run architecture analysis on the full merged node set, since layer assignments may shift when files change.
Context for incremental updates: When re-running architecture analysis, also inject the previous layer definitions:
Previous layer definitions (for naming consistency):
[previous layers from existing graph]Maintain the same layer names and IDs where possible. Only add/remove layers if the file structure has materially changed.
Dispatch a subagent using the tour-builder agent definition (at agents/tour-builder.md). Append the following additional context:
Additional context from main session:
Project README (first 3000 chars):
$README_CONTENTProject entry point:
$ENTRY_POINTUse the README to align the tour narrative with the project's own documentation. Start the tour from the entry point if one was detected. The tour should tell the same story the README tells, but through the lens of actual code structure.
$LANGUAGE_DIRECTIVE
Pass these parameters in the dispatch prompt:
Create a guided learning tour for this codebase. Project root:
$PROJECT_ROOTWrite output to:$PROJECT_ROOT/.understand-anything/intermediate/tour.jsonProject:<projectName>—<projectDescription>Languages:<languages>Nodes (all file-level nodes — includes code files, config, document, service, pipeline, table, schema, resource, endpoint):
[list of {id, name, filePath, summary, type} for ALL file-level nodes — do NOT include function or class nodes]Layers:
[list of {id, name, description} for each layer — omit nodeIds]Edges (all types — includes imports, calls, configures, documents, deploys, triggers, etc.):
[list of ALL edges — include all edge types for complete graph topology analysis]
After the subagent completes, read $PROJECT_ROOT/.understand-anything/intermediate/tour.json and normalize it into a final tour array. Apply these steps in order:
{ "steps": [...] } instead of a plain array, extract the inner array. (The prompt requests a plain array, but LLMs may still produce an envelope.)nodesToInspect instead of nodeIds, rename it → nodeIds. If any step has whyItMatters instead of description, rename it → description.nodeIds entries are raw file paths without a known prefix (file:, config:, document:, service:, pipeline:, table:, schema:, resource:, endpoint:), convert them to file:<relative-path>.nodeIds entries that do not exist in the merged node set.order before saving.Each element of the final tour array MUST have this shape:
[
{
"order": 1,
"title": "Project Overview",
"description": "Start with the README to understand the project's purpose and architecture.",
"nodeIds": ["document:README.md"]
},
{
"order": 2,
"title": "Application Entry Point",
"description": "This step explains how the frontend boots and mounts.",
"nodeIds": ["file:src/main.tsx", "file:src/App.tsx"]
}
]
Required fields: order, title, description, nodeIds. Preserve optional languageLesson when present.
Assemble the full KnowledgeGraph JSON object:
{
"version": "1.0.0",
"project": {
"name": "<projectName>",
"languages": ["<languages>"],
"frameworks": ["<frameworks>"],
"description": "<projectDescription>",
"analyzedAt": "<ISO 8601 timestamp>",
"gitCommitHash": "<commit hash from Phase 0>"
},
"nodes": [<all nodes from assembled-graph.json after Phase 3 review>],
"edges": [<all edges from assembled-graph.json after Phase 3 review>],
"layers": [<layers from Phase 4>],
"tour": [<steps from Phase 5>]
}
Before writing the assembled graph, validate that:
layers is an array of objects with these required fields: id, name, description, nodeIdstour is an array of objects with these required fields: order, title, description, nodeIdstour[*].languageLesson is allowed as an optional string fieldlayers[*].nodeIds entry exists in the merged node settour[*].nodeIds entry exists in the merged node setIf validation fails, automatically normalize and rewrite the graph into this shape before saving. If the graph still fails final validation after the normalization pass, save it with warnings but mark dashboard auto-launch as skipped.
Write the assembled graph to $PROJECT_ROOT/.understand-anything/intermediate/assembled-graph.json.
Check $ARGUMENTS for --review flag. Then run the appropriate validation path:
--review): inline deterministic validationWrite the following Node.js script to $PROJECT_ROOT/.understand-anything/tmp/ua-inline-validate.cjs:
#!/usr/bin/env node
const fs = require('fs');
const graphPath = process.argv[2];
const outputPath = process.argv[3];
try {
const graph = JSON.parse(fs.readFileSync(graphPath, 'utf8'));
const issues = [], warnings = [];
if (!Array.isArray(graph.nodes)) { issues.push('graph.nodes is missing or not an array'); graph.nodes = []; }
if (!Array.isArray(graph.edges)) { issues.push('graph.edges is missing or not an array'); graph.edges = []; }
const nodeIds = new Set();
const seen = new Map();
graph.nodes.forEach((n, i) => {
if (!n.id) { issues.push(`Node[${i}] missing id`); return; }
if (!n.type) issues.push(`Node[${i}] '${n.id}' missing type`);
if (!n.name) issues.push(`Node[${i}] '${n.id}' missing name`);
if (!n.summary) issues.push(`Node[${i}] '${n.id}' missing summary`);
if (!n.tags || !n.tags.length) issues.push(`Node[${i}] '${n.id}' missing tags`);
if (seen.has(n.id)) issues.push(`Duplicate node ID '${n.id}' at indices ${seen.get(n.id)} and ${i}`);
else seen.set(n.id, i);
nodeIds.add(n.id);
});
graph.edges.forEach((e, i) => {
if (!nodeIds.has(e.source)) issues.push(`Edge[${i}] source '${e.source}' not found`);
if (!nodeIds.has(e.target)) issues.push(`Edge[${i}] target '${e.target}' not found`);
});
const fileLevelTypes = new Set(['file', 'config', 'document', 'service', 'pipeline', 'table', 'schema', 'resource', 'endpoint']);
const fileNodes = graph.nodes.filter(n => fileLevelTypes.has(n.type)).map(n => n.id);
const assigned = new Map();
if (!Array.isArray(graph.layers)) { if (graph.layers) warnings.push('graph.layers is not an array'); graph.layers = []; }
if (!Array.isArray(graph.tour)) { if (graph.tour) warnings.push('graph.tour is not an array'); graph.tour = []; }
graph.layers.forEach(layer => {
(layer.nodeIds || []).forEach(id => {
if (!nodeIds.has(id)) issues.push(`Layer '${layer.id}' refs missing node '${id}'`);
if (assigned.has(id)) issues.push(`Node '${id}' appears in multiple layers`);
assigned.set(id, layer.id);
});
});
fileNodes.forEach(id => {
if (!assigned.has(id)) issues.push(`File node '${id}' not in any layer`);
});
graph.tour.forEach((step, i) => {
(step.nodeIds || []).forEach(id => {
if (!nodeIds.has(id)) issues.push(`Tour step[${i}] refs missing node '${id}'`);
});
});
const withEdges = new Set([
...graph.edges.map(e => e.source),
...graph.edges.map(e => e.target)
]);
graph.nodes.forEach(n => {
if (!withEdges.has(n.id)) warnings.push(`Node '${n.id}' has no edges (orphan)`);
});
const stats = {
totalNodes: graph.nodes.length,
totalEdges: graph.edges.length,
totalLayers: graph.layers.length,
tourSteps: graph.tour.length,
nodeTypes: graph.nodes.reduce((a, n) => { a[n.type] = (a[n.type]||0)+1; return a; }, {}),
edgeTypes: graph.edges.reduce((a, e) => { a[e.type] = (a[e.type]||0)+1; return a; }, {})
};
fs.writeFileSync(outputPath, JSON.stringify({ issues, warnings, stats }, null, 2));
process.exit(0);
} catch (err) { process.stderr.write(err.message + '\n'); process.exit(1); }
Execute it:
node $PROJECT_ROOT/.understand-anything/tmp/ua-inline-validate.cjs \
"$PROJECT_ROOT/.understand-anything/intermediate/assembled-graph.json" \
"$PROJECT_ROOT/.understand-anything/intermediate/review.json"
If the script exits non-zero, read stderr, fix the script, and retry once.
--review path: full LLM reviewerIf --review IS in $ARGUMENTS, dispatch the LLM graph-reviewer subagent as follows:
Dispatch a subagent using the graph-reviewer agent definition (at agents/graph-reviewer.md). Append the following additional context:
Additional context from main session:
Phase 1 scan results (file inventory):
[list of {path, sizeLines} from scan-result.json]Phase warnings/errors accumulated during analysis:
- [list any batch failures, skipped files, or warnings from Phases 2-5]
Cross-validate: every file in the scan inventory should have a corresponding node in the graph (node types may vary:
file:,config:,document:,service:,pipeline:,table:,schema:,resource:,endpoint:). Flag any missing files. Also flag any graph nodes whosefilePathdoesn't appear in the scan inventory.
Pass these parameters in the dispatch prompt:
Validate the knowledge graph at
$PROJECT_ROOT/.understand-anything/intermediate/assembled-graph.json. Project root:$PROJECT_ROOTRead the file and validate it for completeness and correctness. Write output to:$PROJECT_ROOT/.understand-anything/intermediate/review.json
Read $PROJECT_ROOT/.understand-anything/intermediate/review.json.
If issues array is non-empty:
issues listtags -> ["untagged"], empty summary -> "No summary available")If issues array is empty: Proceed to Phase 7.
Write the final knowledge graph to $PROJECT_ROOT/.understand-anything/knowledge-graph.json.
Generate structural fingerprints baseline. This creates the basis for future automatic incremental updates and must succeed before meta.json is written — otherwise auto-update sees a fresh commit hash with no fingerprints to compare against, classifies every file as STRUCTURAL, and escalates to FULL_UPDATE on every subsequent commit (issue #152).
Write the input file:
cat > $PROJECT_ROOT/.understand-anything/intermediate/fingerprint-input.json <<EOF
{
"projectRoot": "$PROJECT_ROOT",
"sourceFilePaths": [<all source file paths from Phase 1, as JSON array>],
"gitCommitHash": "<current commit hash>"
}
EOF
Then invoke the bundled script (located next to this SKILL.md):
node <SKILL_DIR>/build-fingerprints.mjs \
$PROJECT_ROOT/.understand-anything/intermediate/fingerprint-input.json
The script uses TreeSitterPlugin + PluginRegistry exactly like extract-structure.mjs, so the baseline matches the comparison logic used during auto-updates.
If the script exits non-zero or stdout does not include Fingerprints baseline:, abort Phase 7 and report the error. Do NOT proceed to step 3 (writing meta.json).
Write metadata to $PROJECT_ROOT/.understand-anything/meta.json (only after step 2 succeeded):
{
"lastAnalyzedAt": "<ISO 8601 timestamp>",
"gitCommitHash": "<commit hash>",
"version": "1.0.0",
"analyzedFiles": <number of files analyzed>
}
Clean up intermediate files:
rm -rf $PROJECT_ROOT/.understand-anything/intermediate
rm -rf $PROJECT_ROOT/.understand-anything/tmp
Report a summary to the user containing:
$PROJECT_ROOT/.understand-anything/knowledge-graph.jsonOnly automatically launch the dashboard by invoking the /understand-dashboard skill if final graph validation passed after normalization/review fixes.
If final validation did not pass, report that the graph was saved with warnings and dashboard launch was skipped.
$PHASE_WARNINGS list. When using --review, pass this list to the graph-reviewer in Phase 6. On the default path, include accumulated warnings in the Phase 7 final report.| Type | Description | ID Convention |
|---|---|---|
file | Source code file | file:<relative-path> |
function | Function or method | function:<relative-path>:<name> |
class | Class, interface, or type | class:<relative-path>:<name> |
module | Logical module or package | module:<name> |
concept | Abstract concept or pattern | concept:<name> |
config | Configuration file (YAML, JSON, TOML, env) | config:<relative-path> |
document | Documentation file (Markdown, RST, TXT) | document:<relative-path> |
service | Deployable service definition (Dockerfile, K8s) | service:<relative-path> |
table | Database table or migration | table:<relative-path>:<table-name> |
endpoint | API endpoint or route definition | endpoint:<relative-path>:<endpoint-name> |
pipeline | CI/CD pipeline configuration | pipeline:<relative-path> |
schema | Schema definition (GraphQL, Protobuf, Prisma) | schema:<relative-path> |
resource | Infrastructure resource (Terraform, CloudFormation) | resource:<relative-path> |
| Category | Types |
|---|---|
| Structural | imports, exports, contains, inherits, implements |
| Behavioral | calls, subscribes, publishes, middleware |
| Data flow | reads_from, writes_to, transforms, validates |
| Dependencies | depends_on, tested_by, configures |
| Semantic | related, similar_to |
| Infrastructure | deploys, serves, provisions, triggers |
| Schema/Data | migrates, documents, routes, defines_schema |
| Edge Type | Weight |
|---|---|
contains | 1.0 |
inherits, implements | 0.9 |
calls, exports, defines_schema | 0.8 |
imports, deploys, migrates | 0.7 |
depends_on, configures, triggers | 0.6 |
tested_by, documents, provisions, serves, routes | 0.5 |
| All others | 0.5 (default) |
Use when you need to ask questions about a codebase or understand code using a knowledge graph
Use when you need to analyze git diffs or pull requests to understand what changed, affected components, and risks
Use when you need a deep-dive explanation of a specific file, function, or module in the codebase
Use when you need to generate an onboarding guide for new team members joining a project
Extract business domain knowledge from a codebase and generate an interactive domain flow graph. Works standalone (lightweight scan) or derives from an existing /understand knowledge graph.
Analyze a Karpathy-pattern LLM wiki knowledge base and generate an interactive knowledge graph with entity extraction, implicit relationships, and topic clustering.