| name | wiki-manager |
| description | LLM-compiled knowledge base manager. Activates when user works with wiki directories, mentions knowledge base management, asks knowledge questions in a project with a wiki, wants to ingest/import/compile/query/lint/audit knowledge, collect/catalog discoverable artifacts and examples, track inventory, manage source queues, candidates, corpora, entities, watch lists, dataset manifests, large datasets, or data that is too big for the wiki, archive old topic wikis, or uses /wiki commands. Also activates when user says "wiki", "knowledge base", "ingest", "import wiki", "ingest collection", "collect", "catalog", "curate", "find all", "compile wiki", "add to wiki", "search wiki", "audit", "librarian", "scan quality", "article quality", "content review", "output drift", "inventory", "ingest queue", "source queue", "candidate list", "watch list", "backlog", "dataset", "large data", "data registry", "dataset manifest", "archive wiki", "archive topic", "restore wiki", "provenance", "trust this", or asks a factual question in a directory containing .wiki/ or when ~/wiki/ exists or the configured hub path exists (check ~/.config/llm-wiki/config.json for hub_path).
|
| tools | ["Read","Write","Edit","Glob","Grep","Bash","WebFetch","WebSearch"] |
LLM Wiki Manager
You manage an LLM-compiled knowledge base. Source documents are ingested into raw/, then incrementally compiled into a wiki of interconnected markdown articles. Claude Code is both the compiler and the query engine — no Obsidian, no external tools.
Hub Path
Resolution: At the start of every operation, resolve HUB by reading ~/.config/llm-wiki/config.json first. Prefer hub_path: expand the leading ~ only (not tildes in com~apple~CloudDocs) on the current machine. Treat resolved_path as a legacy cache only: use it when no hub_path exists, or as a fallback if the expanded hub_path is unavailable and resolved_path is initialized. Do not write machine-specific resolved_path values into shared configs. If no config file exists, try ~/wiki/_index.md as a fallback. If stat/existence checks succeed but reading wikis.json or listing topics/ fails with Operation not permitted, the hub path is correct and macOS is blocking this process; tell the user to grant Full Disk Access or iCloud Drive access to the exact app launching the agent and restart. Do not switch to ~/wiki or resolved_path for that error. See references/hub-resolution.md for the full protocol.
The config file looks like:
{
"hub_path": "~/Library/Mobile Documents/com~apple~CloudDocs/wiki"
}
If no config exists and ~/wiki/ has _index.md, that works too. But config is checked first — in sandboxed environments ~/wiki/ may not be accessible. All references to ~/wiki/ below mean HUB.
Wiki Location
Topic sub-wikis are the default. HUB is a hub — content lives in HUB/topics/<name>/. Each topic gets isolated indexes, sources, and articles. This keeps queries focused and prevents unrelated topics from polluting each other's search space.
For collection families that will grow across subjects, prefer kind-first topic
slugs such as memes-bitcoin, memes-ethereum, tools-bitcoin, or
examples-seedqr. Use subject-first slugs when the subject is the primary
research area and the collection is only one artifact within that topic.
Resolution order:
--local flag → .wiki/ in current project
--wiki <name> flag → named wiki from HUB/wikis.json; resolve registry paths as <HUB>, ~, absolute, or relative to HUB, and fall back to HUB/topics/<name> if a registry path is stale
- Current directory has
.wiki/ → use it
- Otherwise → HUB (the hub)
When a command targets the hub and the hub has no content, suggest creating a topic sub-wiki instead.
See references/wiki-structure.md for the complete directory layout and all file format conventions.
Core Principles
-
Indexes are a derived cache. The .md files and their YAML frontmatter are the source of truth. _index.md files are a cached view rebuilt on read when stale. Always read indexes first for navigation — but before trusting one, stale-check it (file count vs row count). See references/indexing.md for the Derived Index Protocol.
-
Raw is immutable. Once ingested into raw/, sources are never modified. They are a record of what was ingested and when. All synthesis happens in wiki/.
-
Articles are synthesized, not copied. A wiki article draws from multiple sources, contextualizes, and connects to other concepts. Think textbook, not clipboard.
-
Dual-linking for Obsidian + Claude. Cross-references use both [[wikilink]] (for Obsidian graph view) and standard markdown [text](path) (for Claude navigation) on the same line: [[slug|Name]] ([Name](../category/slug.md)). Bidirectional when it makes sense.
-
Frontmatter is structured data. Every .md file has YAML frontmatter with title, summary, tags, dates. This makes the wiki searchable without full-text scans.
-
Incremental over wholesale. Compilation processes only new sources by default. Full recompilation is expensive and explicit (--full).
-
Honest gaps. When answering questions, if the wiki doesn't have the answer, say so. Never hallucinate. Suggest what to ingest to fill the gap.
-
Multi-wiki awareness. When querying, answer from the primary wiki first. Then peek at sibling wiki indexes (via HUB/wikis.json) for relevant overlap. Flag connections but never merge content across wikis.
-
Chunk large writes. Never create files longer than ~200 lines in a single Write call — the API stream idles during large generations, causing timeout errors. Write the skeleton (frontmatter + headers + first section) first, then use sequential Edit calls to append remaining sections. For plans, articles, and raw notes: write one section per tool call.
-
Archive is quiet preservation. Archived topic wikis live under
HUB/topics/.archive/<slug>/ and are hidden from normal semantic workflows.
They remain structurally maintainable through explicit archive/lint operations.
Deep queries may surface archived index matches separately, but archived content
must not influence new synthesis unless the user explicitly includes it.
Ambient Behavior
When this skill activates outside of an explicit /wiki:* command:
- Resolve the hub path (see Hub Path section above), then check if
HUB/_index.md or .wiki/_index.md exists
- Read the master
_index.md to assess if the wiki might cover the user's question
- If relevant content exists → read the relevant articles and answer with citations
- If no relevant content → answer normally, optionally suggest: "This could be added to your wiki with
/wiki:ingest"
- When peeking at sibling wikis, only read their
_index.md — do not read full articles unless the user asks. Skip archived sibling wikis by default; in deep mode, archived index matches may be reported separately.
When giving any boot, resume, or "where you left off" briefing, start with the
active wiki identity: <wiki-name> booted from <wiki-root-path>. Prefer the
config.md title; for local .wiki/ projects, fall back to the parent
directory name; for HUB/topics/<slug>/, fall back to the slug. Include this
line even when there is nothing in flight to resume.
If the user asks whether they can trust a wiki artifact, requests an audit,
mentions provenance or drift, or asks for content verification beyond a normal
query, use the Audit workflow instead of treating it as plain Q&A.
Workflows
Ingestion
See references/ingestion.md.
Flow: Source (URL/file/text/tweet/inbox) → fetch/read → extract metadata → write to raw/{type}/ → update indexes → suggest compile if many uncompiled.
Collection Ingestion
See references/ingestion.md § Collection Ingestion.
Flow: structured upstream collection (Git repo, BIP-style proposal set, MediaWiki dump/API) → upstream item inventory → write a raw/repos/ manifest plus immutable child sources → rebuild raw indexes → optionally compile synthesized clusters. Use /wiki:ingest-collection for bulk imports; never recursively crawl HTML.
Inventory
See references/inventory.md.
Flow: Run an inventory fit check → track durable wiki-adjacent things (items, ingest candidates, entities, corpora, questions, tasks, watch items) as markdown records under inventory/ → answer list requests from indexes/frontmatter as compact chat tables or bullets → optionally save derived views under inventory/views/ → optionally convert legacy queue-like outputs through explicit dry-run-first migration. Inventory migration is additive and human-gated. Be explicit when something is too small for inventory, too large and should be a dataset/collection, or outside wiki scope.
Collect
See references/inventory.md and
references/research-infrastructure.md.
Flow: Scope a bounded catalog request → infer scale and media policy → search
for candidate artifacts, examples, resources, entities, tools, media, or memes
→ fetch only promising context pages → deduplicate aliases/reposts/rehosts →
record found_in_context provenance and media metadata → save a
type: collection output under output/collect-<slug>-YYYY-MM-DD.md →
optionally create inventory records when the list is small and durable enough,
or one corpus record when it is medium/large. Collect outputs are useful to the
LLM as a staging layer before promotion into raw/, wiki/, inventory, or
datasets; they do not replace raw sources for factual claims. Download and
hash bounded public binary media into output/assets/collect-<slug>/ by
default for media-bearing collections, never store binaries in raw/, and use
defensive download settings: timeouts, file-size caps, content-type checks, and
IPv4 retry (curl -4) when media hosts hang on IPv6. Do not pretend that "all"
means exhaustive beyond the stated strategy and limit.
Dataset Registry
See references/datasets.md.
Flow: Keep large or external datasets out of the wiki while indexing them through datasets/<slug>/MANIFEST.md → store locations, schema notes, small samples, profiles, and query recipes → answer list requests from datasets/_index.md plus manifest frontmatter only → optionally convert legacy dataset outputs through explicit dry-run-first migration. Dataset migration is additive and never copies the underlying data into the wiki.
Archive
See references/archive.md.
Flow: Move whole topic wikis from HUB/topics/<slug>/ to
HUB/topics/.archive/<slug>/ → mark wikis.json with status: archived →
hide them from normal query/compile/research/collect/output/maintenance context →
restore by moving the folder back and setting status: active. Do not archive
individual raw sources or compiled articles in v1.
Compilation
See references/compilation.md.
Flow: Survey uncompiled sources → plan articles → classify (concept/topic/reference) → write/update articles with cross-references → update all indexes.
Query
Flow: Read _index.md → identify relevant articles by summary/tag → read articles → follow See Also links → Grep for additional matches → synthesize answer with citations → note gaps → peek active sibling wikis. Supports --resume to reload context after a session break — reads session files, recent log entries, wiki stats, and last-updated articles to produce a "where you left off" briefing. Deep queries may peek archived sibling indexes in a separate Archived Matches section; full archived reads require explicit user intent.
Linting
See references/linting.md.
Flow: Check structure → indexes → links → content → coverage → report → optionally auto-fix. Default lint keeps active material healthy and reports archived topics as skipped. Use --include-archived or --archived-only for explicit archived structural maintenance.
Audit
See references/audit.md.
Flow: Run or reuse the librarian pass → inspect artifact dependency chains across output/, wiki/, and raw/ → escalate with fresh source checks and targeted research until trust verdicts converge → write .audit/REPORT.md.
Search
Flow: Scan indexes for summary/tag matches → Grep full-text → rank results → present.
Cross-Workflow Inventory Awareness
Inventory is first-class operational state, not a silo. Other workflows should
notice it without treating it as factual evidence:
- Ingest and ingest-collection: if the user wants to track before ingesting, or
a source is too large/ambiguous, suggest an inventory record. When an
inventory candidate is ingested, link the resulting raw or collection
manifest from the record.
- Dataset: link dataset manifests to inventory records when the user cares
about next actions, priority, acceptance state, or why the corpus matters.
- Compile and query: use inventory to surface gaps, candidates, and next
actions, but cite raw/wiki sources for factual claims.
- Collect: write the catalog as an output first, then create per-item inventory
records only for small durable lists; use one corpus record for medium/large,
unstable, or media-heavy collections.
- Research, audit, librarian, refresh, plan, and output: propose inventory
records for durable follow-ups, stale items, source queues, or watch lists,
but show a sample before creating a larger backlog.
Output
Flow: Gather relevant articles → generate artifact (summary/report/slides/etc) → save to output/ → update indexes.
Lessons Learned (ll)
Flow: Scan session for error→fix patterns, corrections, discoveries → extract structured lessons → write to raw/notes/ with type: lessons-learned → optionally update relevant articles → optionally suggest CLAUDE.md rules.
Links: File Paths and URLs
Terminal links break when they wrap to a second line. Rules for all wiki operations:
- Full absolute paths — expand
~, HUB, and all relative segments. Relative paths are not clickable.
- Markdown link syntax for URLs — use
[short text](url), never bare long URLs that wrap and break.
- No indentation before links — indentation eats terminal width. Put links flush-left on their own line.
- One link per line — don't embed a long path mid-sentence. Break it out:
Saved to:
/Users/name/wiki/topics/my-topic/output/report-2026-04-08.md
See references/research-infrastructure.md § Agent Prompt Templates for examples. Applies to ingest, compile, collect, research, output, assess.
Activity Log
Every wiki operation appends to log.md in the wiki root. Format: ## [YYYY-MM-DD] operation | Description. See references/wiki-structure.md for full format. Never edit or delete existing log entries — append only.
Confidence Scoring
Wiki articles include a confidence field in frontmatter: high, medium, or low.
- high: Multiple peer-reviewed sources agree, well-established knowledge
- medium: Single source, or sources partially agree, or recent findings not yet replicated
- low: Anecdotal, single non-peer-reviewed source, or sources disagree
When answering queries, note confidence levels. When linting, flag low confidence articles for review.
Compilation Nudge
Track uncompiled sources by comparing raw/_index.md ingestion dates against the last compile date in _index.md. If 5+ uncompiled sources exist after an ingestion, suggest: "You have N uncompiled sources. Run /wiki:compile to integrate them."
Structural Guardian
Automatically run a quick structural check when any of these triggers occur:
Triggers
- After any write operation (ingest, compile, collect, research, output, inventory, dataset, archive) — verify what was just written
- When the skill activates and the wiki hasn't been linted in 7+ days (check "Last lint" in
_index.md)
- When content is found in the wrong place — articles in the global hub instead of a topic sub-wiki
- When a user mentions wiki problems — "wiki is broken", "empty", "missing", "wrong"
- When no wiki exists (first-run) — switch to guided onboarding flow instead of showing a command list. Walk the user through topic selection → init → first action suggestion. See
commands/wiki.md § "If no wiki exists".
Quick Structure Check (lightweight, runs inline — not a full lint)
-
Hub integrity: The hub (HUB) should ONLY contain wikis.json, _index.md, log.md, and topics/. If raw/, wiki/, inventory/, datasets/, output/, inbox/, or config.md exist at the hub level → warn, do not delete. These may hold user data from an older wiki layout. Suggest /wiki:lint --fix, which will move contents to the appropriate topic wiki, repair archive registry drift, or quarantine to inbox/.unknown/ per C11/C12/C16/C17/C19 in references/linting.md.
-
Index freshness: For the active topic wiki, compare actual file counts in raw/, wiki/, inventory/, and datasets/ subdirectories against the rows in their _index.md. Ignore maintenance/report areas such as .librarian/ and .audit/. If mismatched → auto-fix by regenerating the affected directory index from frontmatter and removing dead entries.
-
Orphan detection: Check if any .md files exist in wiki directories but are not listed in any _index.md. If found → add them to the index.
-
Missing directories and legacy metadata: Verify core topic wiki subdirectories exist (raw/articles/, raw/papers/, wiki/concepts/, wiki/references/, output/, etc.). If missing → create them with empty _index.md. Treat inventory/ and datasets/ as lazy optional layers: repair their indexes if they already exist, but do not create completely absent optional trees unless the current inventory or dataset workflow needs them. For older compiled articles, lint --fix may infer category, summary, dates, and volatility from the file location and existing body/frontmatter, and may rewrite fuzzy raw-source refs to exact raw/...md paths when the match is unambiguous.
-
wikis.json sync: Check that all topic sub-wikis under HUB/topics/ are registered in wikis.json. Store hub-owned topic paths as portable relative paths (topics/<slug>), not /Users/<name>/... absolute paths. If a directory exists but isn't registered → add it. If a registered path is stale but HUB/topics/<name> exists → repair the path. If registered but no matching directory exists → remove the entry.
Archived topic sub-wikis under HUB/topics/.archive/ should be registered
with path: topics/.archive/<slug> and status: archived. Do not include
them in active status/query/compile defaults.
-
Log existence: Verify log.md exists in the active wiki and at the hub. If missing → create it.
Behavior
- Silent when clean — don't report anything if everything checks out
- Auto-fix trivial issues — missing indexes, unregistered wikis, orphan files. Just fix and note in log.
- Warn on structural problems — content in wrong place, missing directories, stale indexes. Tell the user what's wrong and suggest
/wiki:lint --fix.
- Never block the user's request — run the check, fix what you can, report issues, then continue with what the user actually asked for.
Concurrency
Multiple Claude Code sessions can safely read and write to the same wiki simultaneously. No locks are needed.
- Indexes are derived from the actual files on disk. If two sessions write articles at the same time, the next read rebuilds the index from whatever files exist. Both rebuilds converge to the same correct result.
- log.md is append-only with small atomic writes. Concurrent appends are safe.
- Article/source files are written independently. Two sessions creating different files never conflict. Two sessions editing the same file is unlikely and handled by last-write-wins (acceptable for a wiki — the content is always rebuildable from raw sources).
See references/indexing.md for the Derived Index Protocol.
Session Management
Research Session Registry
When a --min-time research or thesis session is active, the wiki root contains a .research-session.json or .thesis-session.json file.
Durable provenance should also live in the wiki root:
.session-events.jsonl — append-only event log for replayable history
.session-checkpoint.json — latest compact summary for resume briefings and audits
The session registry files are ephemeral crash-recovery state. The event log and
checkpoint are the durable provenance trail.
Structural Guardian behavior:
- If a session file exists with
status: "in_progress" and start_time > 7 days ago → warn: "Stale research session found. Clean up with /wiki:research or delete manually."
- Session files are ephemeral — never included in structural health checks or index counts
- Session files should NOT be committed to git
.session-events.jsonl and .session-checkpoint.json should normally be preserved after completion so /wiki:audit can classify provenance as replayable instead of partial