Run any Skill in Manus with one click

tdd

Stars0

Forks0

UpdatedJune 15, 2026 at 22:34

Use when changing the deterministic core (dmbot/rules/, engine.py, marker.py, any new pure function with verifiable behaviour) and you want the change driven by a failing test first. Enforces red-green-refactor against the repo's fixed-seed pytest setup: write one failing test, confirm red, implement minimal green, refactor. Not for prose/persona/prompt edits or RAG ingestion.

Installation

Install with Codex or Claude Copy this prompt, paste it into Codex, Claude, or another assistant, and let it review the skill page and install it for you.

Run Skill in Manus

Source

Pr0degie

Pr0degie/dungeonmaster

View GitHub Repository View Creator Repositories

Download

Run Skill in Manus

SKILL.md

readonly

name	tdd
description	Use when changing the deterministic core (dmbot/rules/, engine.py, marker.py, any new pure function with verifiable behaviour) and you want the change driven by a failing test first. Enforces red-green-refactor against the repo's fixed-seed pytest setup: write one failing test, confirm red, implement minimal green, refactor. Not for prose/persona/prompt edits or RAG ingestion.

Test-driven a change to the deterministic core

Golden rule #2 (dice = code, deterministic) makes the engine the ideal TDD target: pure functions, fixed-seed, exact outcomes. This skill enforces the order Claude does not follow by default — it writes implementation first and rewrites tests to pass. Here the test leads.

Scope: dmbot/rules/ (engine, marker, profile loader) and any new pure helper with a checkable result. Not for LLM-shaped work (persona, prompts, recaps), RAG, or Discord glue — those aren't deterministically assertable. For a whole new subsystem use rules-subsystem (it already bakes in fixed-seed tests); reach for tdd when you're adding/changing behaviour in code that already exists.

The loop — one test at a time

RED — write one failing test. Add a single test to tests/test_<area>.py, modelled on tests/test_psyker.py / tests/test_augmetics.py (seeded RNG, assert the exact outcome). Describe the behaviour you want (public function in/out), not the implementation. One behaviour per cycle — never a batch of speculative tests.
Confirm red. Run uv run --with pytest python -m pytest tests/test_<area>.py -q and verify the new test fails for the right reason (assertion, not an import/typo error). A test that was never red proves nothing.
GREEN — minimal implementation. Write the smallest change to engine.py (or the target module) that makes the test pass. Read the profile block as data; keep the function pure.
Confirm green. Re-run the file, then the full suite (uv run --with pytest python -m pytest) so the change didn't break a sibling.
REFACTOR. Only once green: dedupe, rename, extract helpers. Tests stay untouched and must stay green throughout.
Repeat from 1 for the next behaviour.

Guardrails (the whole point)

Never edit a test to make it pass. If a test is wrong, say so explicitly and fix it as a deliberate, separate step — never silently to chase green. The pre-commit gate + Stop hook already run the suite; a test quietly relaxed is the failure mode they can't catch.
No green before red. If you can't show the test failing first, the test isn't pinning the behaviour — fix the test, not the implementation.
One behaviour per cycle. Bulk tests written upfront end up asserting mocks instead of real code paths.
Test public behaviour, not internals. Assert what resolve_* returns, not how it got there — so the test survives the refactor step.
Deterministic only. Seed the RNG; assert exact bands incl. crit/fumble/edge. If the result isn't deterministic, it doesn't belong to the engine (golden rule #2).

Close-out

Full suite green: uv run --with pytest python -m pytest.
Commit test + implementation together, imperative scoped message (rules(im): success-level edge band). Direct to main (no feature branch for this repo).
A real design trade-off → write an ADR; pure red-green of existing behaviour → no ADR needed.

More from this repository

same repository

improve-architecture

Pr0degie/dungeonmaster

Find deepening opportunities across the codebase — refactors that turn shallow pass-through modules into deep ones, for testability and AI-navigability. Informed by architecture.md and the ADRs in docs/decisions/; never re-litigates a decided ADR. Use when Tobi wants to improve architecture, find refactoring opportunities, consolidate tightly-coupled modules, or make a part of the bot more testable. Whole-codebase altitude — not /simplify or /code-review (those are diff-scoped).

2026-06-150

to-prd

Pr0degie/dungeonmaster

Turn the current conversation context — ideally after a /grill-me session — into a PRD written to the repo. Do NOT interview; synthesize what's already been decided. Use when Tobi wants to turn a grilled-out plan into a written spec, or says "to-prd" / "schreib den Plan".

2026-06-150

grill-me

Pr0degie/dungeonmaster

Interview Tobi relentlessly about a plan or design decision until reaching shared understanding, resolving each branch of the decision tree one fork at a time. Use when he wants to stress-test a plan before building, asks to "get grilled", or says "grill me". For design/architecture forks — not for mechanical edits.

2026-06-150

playtest-triage

Pr0degie/dungeonmaster

Use when the user pastes a live Discord session log, transcript, or annotated complaints from a play session and wants a fix round. Diagnoses the failure chain from the logs, extracts player improvement-wishes, applies deterministic fixes, runs the test suite, commits per round, and updates progress.md (and an ADR when a real trade-off was decided).

2026-06-140

session-ritual

Pr0degie/dungeonmaster

Use at the start of a working session for the handshake (read CLAUDE.md → progress.md → the latest ADR, then state where we are and what's next), and at the end of a session or on "wrap up" / "update progress" to update progress.md and scaffold the next-numbered ADR. Keeps continuity across context clears and model switches.

2026-06-140

character-build

Pr0degie/dungeonmaster

Use when a player's Imperium Maledictum character JSON comes in and needs validating + deploying, when building a new IM character from scratch, or when backfilling backstory onto an existing sheet. Validates budgets/formulas/psyker powers/augmetics against the active profile, writes data/party/<player>.json, generates the PDF sheet, and proposes (confirm first) the merge into the session characters.json + aliases.

2026-06-130

name	tdd
description	Use when changing the deterministic core (dmbot/rules/, engine.py, marker.py, any new pure function with verifiable behaviour) and you want the change driven by a failing test first. Enforces red-green-refactor against the repo's fixed-seed pytest setup: write one failing test, confirm red, implement minimal green, refactor. Not for prose/persona/prompt edits or RAG ingestion.

Test-driven a change to the deterministic core

The loop — one test at a time

RED — write one failing test. Add a single test to tests/test_<area>.py, modelled on tests/test_psyker.py / tests/test_augmetics.py (seeded RNG, assert the exact outcome). Describe the behaviour you want (public function in/out), not the implementation. One behaviour per cycle — never a batch of speculative tests.
Confirm red. Run uv run --with pytest python -m pytest tests/test_<area>.py -q and verify the new test fails for the right reason (assertion, not an import/typo error). A test that was never red proves nothing.
GREEN — minimal implementation. Write the smallest change to engine.py (or the target module) that makes the test pass. Read the profile block as data; keep the function pure.
Confirm green. Re-run the file, then the full suite (uv run --with pytest python -m pytest) so the change didn't break a sibling.
REFACTOR. Only once green: dedupe, rename, extract helpers. Tests stay untouched and must stay green throughout.
Repeat from 1 for the next behaviour.

Guardrails (the whole point)

Never edit a test to make it pass. If a test is wrong, say so explicitly and fix it as a deliberate, separate step — never silently to chase green. The pre-commit gate + Stop hook already run the suite; a test quietly relaxed is the failure mode they can't catch.
No green before red. If you can't show the test failing first, the test isn't pinning the behaviour — fix the test, not the implementation.
One behaviour per cycle. Bulk tests written upfront end up asserting mocks instead of real code paths.
Test public behaviour, not internals. Assert what resolve_* returns, not how it got there — so the test survives the refactor step.
Deterministic only. Seed the RNG; assert exact bands incl. crit/fumble/edge. If the result isn't deterministic, it doesn't belong to the engine (golden rule #2).

Close-out

Full suite green: uv run --with pytest python -m pytest.
Commit test + implementation together, imperative scoped message (rules(im): success-level edge band). Direct to main (no feature branch for this repo).
A real design trade-off → write an ADR; pure red-green of existing behaviour → no ADR needed.