Generate interactive validation reports with quality scoring, missing data analysis, and type checking. Combines Pandas validation, Plotly visualization, and YAML configuration for comprehensive data quality reporting.

2026-06-1511

hermes-local-configuration

vamseeachanta/workspace-hub

Class-level Hermes local configuration and setup workflows, including config audit gotchas and Windows installation.

2026-06-1511

model-selection

vamseeachanta/workspace-hub

Guide AI model selection based on task complexity, cost constraints, and latency requirements

2026-06-1511

modular-architecture-documentation-1-module-definition-framework

vamseeachanta/workspace-hub

Sub-skill of modular-architecture-documentation: 1. Module Definition Framework (+9).

2026-06-1511

modular-architecture-documentation-overview

vamseeachanta/workspace-hub

Sub-skill of modular-architecture-documentation: Overview (+6).

2026-06-1511

name	workspace-knowledge-doc-contracts
version	1.0.0
category	workspace-hub
description	Class-level workspace knowledge, LLM-wiki, repo mission contracts, stale doc references, semantic taxonomy, and knowledge-source reconnaissance.
tags	["knowledge","documentation","llm-wiki","contracts"]

Workspace Knowledge Doc Contracts

When to Use

Use when building/repairing LLM-wiki knowledge, aligning repo mission contracts, auditing stale documentation references, extracting learnings into issues, or turning domain gaps into issue roadmaps.

Class-Level Workflow

Inventory knowledge sources and distinguish canonical docs from generated/session artifacts.
Repair metadata/frontmatter before indexing or downstream issue creation.
Keep repo mission/contract documents aligned with live repository purpose.
Convert real gaps into deduplicated issues only after source-grounded evidence exists.
Maintain semantic taxonomy terms consistently across reports and docs.

Metadata-only LLM-wiki execution for large/sensitive corpora

Use this sub-pattern when an approved plan promotes a large or sensitive corpus (external-drive archives, standards libraries, training decks, client/project files) into LLM-wiki pages without copying raw data:

Gate first: confirm explicit plan approval (status:plan-approved or a local approval marker) before writing wiki pages.
RED test first: add focused tests that initially fail for expected page existence and boundary fields: extraction_policy, raw_copy_allowed: false, ocr_allowed: false where applicable, source-of-record absolute paths, and index/log updates.
Create pointer/shell pages, not extraction dumps:
- standards/spec libraries use extraction_policy: metadata-only; no clauses, copied standards text, or detailed licensed titles.
- training/client corpora use extraction_policy: metadata-first; curated summaries only; no full deck text, speaker notes, OCR output, screenshots, figures, or raw bytes.
- standards resolver stubs use public metadata only (code_id, publisher, revision or explicit unverified/deferred marker, revision_source, verified_on, public_url).
Update discoverability surfaces: update wiki/index.md, wiki/log.md, and parent source-catalog cross-links so new pages are not orphaned.
Verify broadly: run targeted pytest, scripts/knowledge/tests/test_llm_wiki.py, llm_wiki.py lint/status for affected domains, and a raw-data guard such as git show --name-only --format='' HEAD | grep -E 'knowledge/wikis/.*/raw/|\.(pdf|pptx|ppt|xlsx|xls|docx|doc|png|jpg|jpeg|dwf|db)$' || true.
Force-add intentionally ignored wiki files: workspace-hub may ignore knowledge/wikis/*; use git add -f only for intended wiki pages/tests/reports and check targeted git status to avoid unrelated provider/report churn.
Treat fetch/rev-list as push truth: if git push reports a transient remote lock/rejected ref but the remote already points at the new commit, verify with git fetch origin main and git rev-list --left-right --count HEAD...origin/main; 0 0 is the sync ground truth.
Close issues with evidence: include commit hash, test/lint results, raw-data guard result, and preserved boundaries in GitHub comments.

Consolidated Session Learnings

The references/ directory contains archived narrow skills absorbed during the 2026-04-29 umbrella consolidation pass. Use the subsections below as the class-level index, then open the named reference when a case-specific recipe is needed.

Absorbed Narrow Skills (2026-04-29)

`docs-stale-reference-guardrails`

Former skill demoted to references/docs-stale-reference-guardrails.md.
Preserved insight: Prevent deleted workflow/path references from creeping back into live docs by combining strict scans, legacy allowlists, and shared regex helpers.

`extract-learnings-to-issues`

Former skill demoted to references/extract-learnings-to-issues.md.
Preserved insight: Extract unstructured user reflections and learnings, distill core themes, route insights to existing GitHub issues as contextual comments rather than creating duplicates.

`knowledge-source-recon`

Former skill demoted to references/knowledge-source-recon.md.
Preserved insight: Reconnaissance pattern to inventory all knowledge sources across the workspace-hub ecosystem's existing intelligence infrastructure. Maps raw sources for LLM Wiki ingestion planning. Leverages pre-built registries and indexes rather than re-scanning directories.

`llm-wiki-ecosystem-gap-to-issues`

Former skill demoted to references/llm-wiki-ecosystem-gap-to-issues.md.
Preserved insight: Review the workspace-hub LLM-wiki/document-intelligence ecosystem, identify high-leverage gaps, and create grounded GitHub feature issues without duplicating existing work.

`llm-wiki-pattern`

Former skill demoted to references/llm-wiki-pattern.md.
Preserved insight: Build and maintain persistent, compounding knowledge bases using the LLM Wiki pattern (Karpathy). Unlike RAG (retrieve chunks at query time), the wiki is a persistent artifact that gets compiled and maintained by the LLM.

`llm-wiki-roadmap-integration`

Former skill demoted to references/llm-wiki-roadmap-integration.md.
Preserved insight: Integrate repo-ecosystem work into an existing llm-wiki / knowledge-roadmap issue without creating duplicate GitHub issues.

`parallel-llm-wiki-gap-to-issues`

Former skill demoted to references/parallel-llm-wiki-gap-to-issues.md.
Preserved insight: Use parallel subagents to mine remaining LLM-wiki/document-intelligence gaps, de-duplicate against existing GitHub issues, then create only the strongest bounded follow-on issues.

`repair-legacy-llm-wiki-frontmatter-dates`

Former skill demoted to references/repair-legacy-llm-wiki-frontmatter-dates.md.
Preserved insight: Diagnose and repair legacy llm-wiki source pages that have ingested timestamps but are missing required added/last_updated frontmatter dates.

`repo-mission-portfolio-audit`

Former skill demoted to references/repo-mission-portfolio-audit.md.
Preserved insight: Audit the workspace-hub repo portfolio to extract each repo's mission, identify documentation gaps, and prioritize a plan/approval sequence with explicit LLM-wiki weighting for future issue triage.

`semantic-taxonomy-reporting-consistency`

Former skill demoted to references/semantic-taxonomy-reporting-consistency.md.
Preserved insight: Keep semantic-diff taxonomy summaries consistent with evidence tables when adding richer categories to legacy comparison/reporting pipelines.

`workspace-hub-mission-contract-first-packet`

Former skill demoted to references/workspace-hub-mission-contract-first-packet.md.
Preserved insight: Plan the first repo-mission canonicalization packet for the workspace ecosystem by reusing the existing control-plane issue, locking Wave-1 scope, and making review criteria deterministic.

`tier1-indexing-scorecard-and-freshness-audit`

Former skill demoted to references/tier1-indexing-scorecard-and-freshness-audit.md.
Preserved insight: Audit tier-1 repos for code-placement/retrieval readiness, write a scorecard report, create follow-up GitHub issues, and add a daily freshness cron while avoiding legacy product-doc reference patterns.

`exclude-wiki-Codex-md-from-harness-line-limit-hook`

Former skill demoted to references/exclude-wiki-Codex-md-from-harness-line-limit-hook.md.
Preserved insight: Fix false-positive pre-commit failures where workspace-hub's AGENTS.md line-limit hook blocks edits to auto-generated wiki schema files under knowledge/wikis/.

`memory-bridge-commit-fallbacks`

Former skill demoted to references/memory-bridge-commit-fallbacks.md.
Preserved insight: Fallback procedures when the Hermes ↔ Codex memory bridge writes .Codex/memory outputs but the internal git commit/push path fails because of dirty, stale, or broken submodule state.