一键在 Manus 中运行任何 Skill

verify-invariants

星标4

分支2

更新时间2026年3月27日 12:55

Implement and verify design doc invariants by annotating tests and source code with [INV-*] / @spec tags, then driving tx spec coverage from BUILD toward HARDEN (100% FCI). Works with any design doc that has an invariants block.

安装

用 Codex 或 Claude 帮你安装复制这段 Prompt，粘贴到 Codex、Claude 或其他助手里，让它检查 Skill 页面并帮你完成安装。

在 Manus 中运行

来源

jamesaphoenix

jamesaphoenix/tx

打开 GitHub 仓库查看创作者相关仓库

下载

在 Manus 中运行

Verify Invariants

Drive invariants from uncovered (BUILD phase) to fully verified (HARDEN phase) by annotating integration tests and source code with invariant IDs.

Scope:

With argument (e.g. /verify-invariants team-tags-design): verify that doc only
Without argument (/verify-invariants): verify ALL design docs and PRDs with uncovered invariants

How tx spec Discovery Works

tx spec discover scans files in two passes:

Test files (matched by test_patterns in .tx/config.toml) — scanned for both [INV-*] bracket tags in test names AND // @spec INV-* comments
Source files (all programming languages) — scanned ONLY for // @spec INV-* comments (structural annotations)

This means @spec comments work in ANY file (test or source), but [INV-*] bracket tags are only picked up from test files.

Annotation Formats

Format	Where It Works	Example
`[INV-TAG-001]` in test name	Test files only	`it('creates tag [INV-TAG-001]', ...)`
`// @spec INV-TAG-001` comment	Any file (test or source)	`// @spec INV-TAG-001` above a schema definition
`-- @spec INV-TAG-001` SQL comment	`.sql` files	`-- @spec INV-TAG-001` in pgTAP test
`# @spec INV-TAG-001` hash comment	`.py`, `.rb`, etc.	`# @spec INV-TAG-001` above a test

Critical: IDs Are Case-Sensitive

INV-TAG-001 and inv-tag-001 are NOT the same. Always use UPPERCASE as shown in the design doc's invariants: YAML block.

Critical: Config Matters

The test_patterns in .tx/config.toml control which files are scanned as test files. The defaults include:

test_patterns = [
  "**/*.test.{ts,js,tsx,jsx}",
  "**/*.integration.test.{ts,js,tsx,jsx}",
  "**/*.spec.{ts,js,tsx,jsx}",
  "**/*.pgtap.sql",
  # ... and more for Go, Python, Rust, Java, Ruby, C/C++
]

If your test file isn't being found, check that its pattern matches one of these globs.

Workflow State Machine

START
  │
  ▼
┌─────────────────────────────────────────────────────┐
│ Step 1: LOAD GAPS                                    │
│                                                      │
│ If <name> given:                                     │
│   tx spec gaps --doc <name>                          │
│   tx spec fci --doc <name>                           │
│   tx doc show <name> --md                            │
│                                                      │
│ If NO argument:                                      │
│   tx spec gaps        (all docs)                     │
│   tx spec fci         (global)                       │
│   tx doc list         (enumerate all docs)           │
│   For each doc with gaps:                            │
│     tx doc show <doc> --md                           │
│                                                      │
│ ├─ FCI = 100%? → Already HARDEN. Done.              │
│ └─ FCI < 100%? → Continue to Step 2                 │
└─────────────────────────────────────────────────────┘
  │
  ▼
┌─────────────────────────────────────────────────────┐
│ Step 2: CLASSIFY INVARIANTS                          │
│                                                      │
│ For each uncovered invariant, decide:                │
│                                                      │
│ A) TESTABLE — can be verified by an integration      │
│    test or unit test exercising the behavior.        │
│    Examples: API returns 401, cascade deletes,       │
│    filter returns correct results.                   │
│    → Annotate in test file with [INV-*] tag          │
│                                                      │
│ B) STRUCTURAL — enforced by code structure, lint,    │
│    or DB constraint. Cannot be meaningfully tested   │
│    in isolation but IS enforced in source.           │
│    Examples: "tag names unique per org" enforced     │
│    by DB index, "no retention policy" enforced by    │
│    absence from retentionTableNames literal.         │
│    → Annotate in source code with // @spec INV-*     │
│                                                      │
│ C) PGTAP — enforced by database constraints.         │
│    → Annotate in pgTAP test with -- @spec INV-*      │
│                                                      │
│ Prefer A over B. Use B only when A is genuinely      │
│ not possible.                                        │
└─────────────────────────────────────────────────────┘
  │
  ▼
┌─────────────────────────────────────────────────────┐
│ Step 3: IMPLEMENT + ANNOTATE                         │
│                                                      │
│ For TESTABLE invariants:                             │
│   Write or extend integration test files.            │
│   Annotate each test with the invariant ID:          │
│                                                      │
│   it('creates tag [INV-TAG-001]', async () => {     │
│     // test body                                     │
│   })                                                 │
│                                                      │
│   OR use comment form:                               │
│   // @spec INV-TAG-001                               │
│   it('creates tag', async () => { ... })             │
│                                                      │
│ For STRUCTURAL invariants:                           │
│   Place // @spec comment in the source file where    │
│   the invariant is enforced (schema, literals, etc.) │
│                                                      │
│ For PGTAP invariants:                                │
│   Place -- @spec comment in the .pgtap.sql file      │
│                                                      │
│ RULE: Every invariant must get exactly one            │
│ annotation somewhere in the codebase.                │
└─────────────────────────────────────────────────────┘
  │
  ▼
┌─────────────────────────────────────────────────────┐
│ Step 4: DISCOVER                                     │
│                                                      │
│ tx spec discover --doc <name>                        │
│ tx spec discover          (if verifying all)         │
│                                                      │
│ Verify output shows:                                 │
│   Scanned N file(s)                                  │
│   Discovered links: M                                │
│   By source: tag=X, comment=Y                        │
│                                                      │
│ If 0 links: check annotation format, file path,      │
│ config.toml patterns, ID case sensitivity.           │
└─────────────────────────────────────────────────────┘
  │
  ▼
┌─────────────────────────────────────────────────────┐
│ Step 5: RUN TESTS + RECORD RESULTS                   │
│                                                      │
│ For vitest tests:                                    │
│   pnpm vitest run <file> --reporter=json             │
│     2>/dev/null | tx spec batch --from vitest        │
│                                                      │
│ For structural annotations:                          │
│   tx spec run "<file>::spec@line-N" --passed         │
│                                                      │
│ For pgTAP:                                           │
│   pnpm test:db:pgtap                                 │
│   tx spec run "<pgtap-file>::<test>" --passed        │
└─────────────────────────────────────────────────────┘
  │
  ▼
┌─────────────────────────────────────────────────────┐
│ Step 6: CHECK PROGRESS                               │
│                                                      │
│ tx spec fci --doc <name>                             │
│ tx spec gaps --doc <name>                            │
│                                                      │
│ ├─ FCI = 100% → HARDEN phase reached. Done.         │
│ ├─ Gaps remain → Loop back to Step 3 for remaining. │
│ └─ Failures → Fix failing tests, re-run Step 5.     │
└─────────────────────────────────────────────────────┘
  │
  ▼
┌─────────────────────────────────────────────────────┐
│ Step 7: REPORT                                       │
│                                                      │
│ tx spec matrix --doc <name>                          │
│ tx spec status --doc <name>                          │
│                                                      │
│ Print summary with invariant coverage breakdown.     │
└─────────────────────────────────────────────────────┘
  │
  ▼
DONE

Step 1 — Load Gaps

If a doc name was provided:

tx spec gaps --doc $ARGUMENTS
tx spec fci --doc $ARGUMENTS
tx spec matrix --doc $ARGUMENTS
tx doc show $ARGUMENTS --md

If no argument (verify all):

tx spec gaps
tx spec fci
tx doc list
# Then for each doc with gaps:
tx doc show <doc-name> --md

Read each design doc markdown to understand each invariant's:

id — the invariant ID (e.g. INV-TAG-001) — must match annotation exactly
statement — what must be true
severity — critical/high/medium/low (prioritize critical first)
verified_by — hint about which test file should cover it
traces_to — which PRD requirement it maps to

If FCI is already 100%, report success and stop.

Step 2 — Classify Each Uncovered Invariant

For each gap from tx spec gaps, decide the verification strategy:

Category	When to use	Annotation style	Where
TESTABLE	Behavior can be exercised via API call, function call, or DB query	`[INV-]` in test name or `// @spec INV-` near test	Test file (`.test.ts`, `.integration.test.ts`, `.pgtap.sql`)
STRUCTURAL	Enforced by schema, lint rule, type system, or code structure — no meaningful test possible	`// @spec INV-*` comment	Source file where enforcement lives (schema.ts, literals.ts, service file)
PGTAP	Database constraint, trigger, or index behavior	`-- @spec INV-*` SQL comment	`.pgtap.sql` test file

Prefer TESTABLE. Most invariants should be verifiable by a test. Only use STRUCTURAL when the invariant is genuinely about code structure (e.g., "table X is NOT in retentionTableNames" — verified by inspecting the literal array, not by running a test).

Step 3 — Implement and Annotate

Integration test annotation (preferred)

The [INV-*] bracket tag in the test name is the cleanest approach — tx spec discover extracts both the invariant ID and the test name from the same line:

it('creates tag with name and color [INV-TAG-001]', async () => {
  const res = await fetch(`${ctx.baseUrl}/v1/organizations/${orgId}/tags`, {
    method: 'POST',
    headers: { 'content-type': 'application/json', authorization: `Bearer ${adminToken}` },
    body: JSON.stringify({ name: 'Engineering', color: '#3B82F6' })
  })
  expect(res.status).toBe(201)
})

The comment form also works — place it on the line immediately before or up to 2 lines above the it():

// @spec INV-TAG-100
it('returns 401 without auth token', async () => {
  const res = await fetch(`${ctx.baseUrl}/v1/organizations/${orgId}/tags`)
  expect(res.status).toBe(401)
})

Source code annotation (structural)

tx spec discover scans ALL source files (any language) for @spec comments. Place the annotation directly above the code that enforces the invariant:

// @spec INV-TAG-200
export const teamTags = pgTable('team_tags', {
  // ...columns
}, (table) => ({
  orgNameCiUnique: uniqueIndex('team_tags_org_name_ci_unique')
    .on(table.organizationId, sql`lower(trim(${table.name}))`)
}))

Important: Structural annotations create a link with a test ID like packages/infra/db/src/schema.ts::spec@line-264. You must manually record these as passed since there's no test to run:

# Get the exact test ID
tx spec tests INV-TAG-200
# Output: packages/infra/db/src/schema.ts::spec@line-264 [comment]

# Record as passed
tx spec run "packages/infra/db/src/schema.ts::spec@line-264" --passed

pgTAP annotation

-- @spec INV-TAG-202
SELECT ok(
  EXISTS (
    SELECT 1 FROM pg_constraint
    WHERE conname = 'team_tag_assignments_tag_id_fkey'
    AND confdeltype = 'c'
  ),
  'tag assignments cascade-delete when tag is deleted'
);

Rules

Every invariant gets exactly one annotation somewhere in the codebase
One test/annotation can cover multiple invariants: [INV-TAG-001] [INV-TAG-002]
Critical severity invariants should be covered first
Integration tests are preferred over unit tests (hard requirement from design doc skill)

Step 4 — Discover

# For a specific doc
tx spec discover --doc $ARGUMENTS

# For all docs
tx spec discover

Expected output:

Scanned 123 file(s)
Discovered links: 14
Upserted links: 14
By source: tag=12, comment=2, manifest=0

Troubleshooting: 0 Links Discovered

Symptom	Cause	Fix
`Scanned 0 file(s)`	Test patterns in `.tx/config.toml` don't match your test files	Add patterns like `"*/.test.{ts,js}"` or `"*/.integration.test.{ts,js}"`
Scanned files but 0 links	Annotation format wrong	Must be `[INV-TAG-001]` (with brackets) or `// @spec INV-TAG-001` — no other formats work
Links found but wrong count	ID mismatch	IDs are case-sensitive. `INV-TAG-001` is not `inv-tag-001` or `INV-Tag-001`
Source `@spec` not found	Source file excluded	Check file isn't in `node_modules`, `dist`, `.git`, or other skip dirs

How Test IDs Are Built

tx spec discover builds test IDs as {relative-file-path}::{test-name}. For example:

apps/api/src/routes/team-tags.test.ts::creates tag with name and color [INV-TAG-001]
packages/infra/db/src/schema.ts::spec@line-264 (structural — no test name, uses line number)

Step 5 — Run Tests and Record Results

Vitest (primary path)

# Run specific test file and pipe JSON results to tx
pnpm vitest run apps/api/src/routes/team-tags.test.ts --reporter=json 2>/dev/null | tx spec batch --from vitest

tx spec batch --from vitest does the following:

Parses the vitest JSON report
Normalizes absolute file paths to relative (strips repo root)
Matches test results to discovered spec_tests by {file}::{title}
Records PASS/FAIL runs for each matched invariant

Important: vitest outputs absolute paths but tx spec discover stores relative paths. The batch parser handles this automatically by stripping common path prefixes (/apps/, /packages/, /src/, etc.).

Important: vitest fullName includes describe ancestry (e.g. suite name > test name) but discover stores just the it() title. The batch parser emits both variants and matches on the title (shorter form) first.

Structural annotations (manual recording)

Structural annotations (source code @spec comments) have no test to run. After confirming the code correctly enforces the invariant, record them manually:

# Get the exact test ID from tx spec tests
tx spec tests INV-TAG-200
# Output: packages/infra/db/src/schema.ts::spec@line-264 [comment]

# Record as passed
tx spec run "packages/infra/db/src/schema.ts::spec@line-264" --passed

pgTAP tests

# Run pgTAP suite
pnpm test:db:pgtap

# Record results manually (pgTAP doesn't output vitest-compatible JSON)
tx spec run "packages/infra/db/pgtap/003_team_tags.pgtap.sql::cascade delete" --passed

Step 6 — Check Progress and Iterate

tx spec fci --doc $ARGUMENTS
tx spec gaps --doc $ARGUMENTS

FCI Phases

FCI	Phase	Meaning
0%	BUILD	No invariants verified — just starting
1-99%	BUILD	Some invariants verified, gaps remain
100%	HARDEN	All invariants linked + passing — ready for sign-off
100% + sign-off	COMPLETE	Human confirmed feature is done

What "untested" Means

tx spec fci distinguishes between:

covered — invariant has a linked test/annotation (via discover)
passing — linked test has a recorded PASS run (via batch or manual)
untested — covered but no run recorded yet

A common state is covered=14, passing=12, untested=2 — meaning 2 invariants are linked (discovered) but you haven't yet run tx spec run --passed for them. This typically happens with structural annotations.

Iteration Loop

If gaps remain:

Check which invariants are uncovered: tx spec gaps --doc <name>
Add the missing annotation (test or source)
Re-run tx spec discover
Run tests + record results
Check FCI again

Step 7 — Report

tx spec matrix --doc $ARGUMENTS
tx spec status --doc $ARGUMENTS

The matrix shows every invariant with its linked test and latest result:

INV-TAG-001: Organizations can create, list, update, and delete tags via API
  - apps/api/src/routes/team-tags.test.ts::creates tag [INV-TAG-001] [tag] latest=PASS
INV-TAG-200: Tag names are unique per org via DB index
  - packages/infra/db/src/schema.ts::spec@line-264 [comment] latest=PASS

Print a summary:

Total invariants
Covered by test annotations vs source annotations
FCI percentage and phase
Any remaining gaps or failures

After Verification

FCI = 100% → phase is HARDEN. All invariants are mechanically verified.
Human sign-off: tx spec complete --doc <name> --by <name> → moves to COMPLETE.
Regression detection: Re-run tests periodically → tx spec batch → FCI drops if a test fails, phase reverts to BUILD.

Proven Example (team-tags-design)

This workflow was tested end-to-end with the team-tags-design doc:

14 invariants (design doc)
  ↓ tx spec discover
12 [INV-*] tags in test file + 2 @spec comments in schema.ts = 14 links
  ↓ vitest run | tx spec batch --from vitest
12 test results recorded (PASS)
  ↓ tx spec run --passed (2 structural)
2 structural results recorded
  ↓ tx spec fci
FCI: 100% (HARDEN)
  ↓ tx spec matrix
All 14 invariants → PASS

name	verify-invariants
description	Implement and verify design doc invariants by annotating tests and source code with [INV-*] / @spec tags, then driving tx spec coverage from BUILD toward HARDEN (100% FCI). Works with any design doc that has an invariants block.
argument-hint	[design-doc-name] (optional - omit to verify ALL docs)

verify-invariants

同仓库更多 Skills

同仓库更多 Skills

Verify Invariants

How tx spec Discovery Works

Annotation Formats

Critical: IDs Are Case-Sensitive

Critical: Config Matters

Workflow State Machine

Step 1 — Load Gaps

Step 2 — Classify Each Uncovered Invariant

Step 3 — Implement and Annotate

Integration test annotation (preferred)

Source code annotation (structural)

pgTAP annotation

Rules

Step 4 — Discover

Troubleshooting: 0 Links Discovered

How Test IDs Are Built

Step 5 — Run Tests and Record Results

Vitest (primary path)

Structural annotations (manual recording)

pgTAP tests

Step 6 — Check Progress and Iterate

FCI Phases

What "untested" Means

Iteration Loop

Step 7 — Report

After Verification

Proven Example (team-tags-design)

Verify Invariants

How tx spec Discovery Works

Annotation Formats

Critical: IDs Are Case-Sensitive

Critical: Config Matters

Workflow State Machine

Step 1 — Load Gaps

Step 2 — Classify Each Uncovered Invariant

Step 3 — Implement and Annotate

Integration test annotation (preferred)

Source code annotation (structural)

pgTAP annotation

Rules

Step 4 — Discover

Troubleshooting: 0 Links Discovered

How Test IDs Are Built

Step 5 — Run Tests and Record Results

Vitest (primary path)

Structural annotations (manual recording)

pgTAP tests

Step 6 — Check Progress and Iterate

FCI Phases

What "untested" Means

Iteration Loop

Step 7 — Report

After Verification

Proven Example (team-tags-design)