
characterize

Name: Characterize
Author: hashintel

Create characterization tests (Golden Master) for existing code so you can refactor or replace safely. Use before refactoring, strangler-replacing, or modifying code with unclear behavior. Captures observable behavior with a minimal harness, producing tests, fixtures, and a coverage report.


stars:35

forks:14

updated:March 23, 2026 at 12:12

SKILL.md


name	characterize
description	Create characterization tests (Golden Master) for existing code so you can refactor or replace safely. Use before refactoring, strangler-replacing, or modifying code with unclear behavior. Captures observable behavior with a minimal harness, producing tests, fixtures, and a coverage report.
argument-hint	[subsystem / entrypoint / behavior surface to lock down]

Characterization Tests (Golden Master / Feathers)

Create characterization tests for an existing subsystem whose behavior is unclear, messy, or AI-generated. Pin down current observable behavior so that refactoring and incremental replacement are safe.

Input

Subsystem / entrypoint / behavior surface to lock down: $ARGUMENTS

Preconditions:

At least one runnable entry point (function, CLI command, server route, etc.)
A concept capsule is NOT required, but use its vocabulary if one exists

Procedure

1. Identify the behavior surface

List 1-3 surfaces to lock down: public functions, CLI commands, HTTP endpoints, file-in→file-out, serialized outputs. Prefer the most stable public surface available.

2. Minimal fixture set

One canonical happy path
1-3 meaningful edge cases
One "weird but real" case you suspect is fragile

Keep it small — a suite that's too big becomes unmaintainable.
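As a rough illustration, the fixture set described above can be written down as plain data with a small guard against growth. The `Fixture` type, the `kind` labels, and the example inputs are hypothetical; they are not part of the skill.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class Fixture:
    name: str                      # doubles as the golden file name
    kind: str                      # "happy", "edge", or "weird" (hypothetical labels)
    args: dict = field(default_factory=dict)


# Hypothetical inputs for an imagined invoice-total surface.
FIXTURES = [
    Fixture("basic_invoice", "happy", {"items": [10, 20], "currency": "USD"}),
    Fixture("empty_items", "edge", {"items": [], "currency": "USD"}),
    Fixture("negative_amount", "edge", {"items": [-5], "currency": "USD"}),
    Fixture("lowercase_currency", "weird", {"items": [10], "currency": "usd"}),
]


def check_fixture_budget(fixtures):
    """Return a list of problems if the set drifts from one happy / 1-3 edge / one weird."""
    counts = Counter(f.kind for f in fixtures)
    problems = []
    if counts["happy"] != 1:
        problems.append("expected exactly one happy-path fixture")
    if not 1 <= counts["edge"] <= 3:
        problems.append("expected between one and three edge-case fixtures")
    if counts["weird"] != 1:
        problems.append("expected exactly one weird-but-real fixture")
    return problems
```

A guard like this is optional; its only job is to make fixture growth a visible decision rather than an accident.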

3. Stabilize nondeterminism

Before capturing outputs, neutralize noise sources:

Timestamps: freeze time
Randomness: seed or stub
Ordering: sort keys, canonicalize arrays where order is not meaningful
OS-dependent paths/newlines: normalize
Network calls: stub/record once — do not hit live services
Concurrency: force single-threaded if needed for determinism

If behavior is inherently nondeterministic, define a tolerant comparator (ignore specific fields, assert shape/membership rather than equality).
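A minimal sketch of a few of these stabilizers, assuming the captured output is JSON-like data. The helper names (`scrub`, `canonical_json`, `seeded_rng`) and the field names in the usage example are hypothetical. Time freezing and network stubbing are not shown because they depend on the tooling already in the project.

```python
import json
import random


def seeded_rng(seed=0):
    """Return an isolated, seeded generator instead of relying on global random state."""
    return random.Random(seed)


def scrub(value, ignore_fields=frozenset(), unordered_fields=frozenset()):
    """Recursively drop noisy fields, sort order-free lists, and normalize newlines."""
    if isinstance(value, dict):
        cleaned = {}
        for key, item in value.items():
            if key in ignore_fields:
                continue  # tolerant comparison: this field is excluded on purpose
            item = scrub(item, ignore_fields, unordered_fields)
            if key in unordered_fields and isinstance(item, list):
                # Sort by serialized form so mixed or nested items still order stably.
                item = sorted(item, key=lambda x: json.dumps(x, sort_keys=True))
            cleaned[key] = item
        return cleaned
    if isinstance(value, list):
        return [scrub(item, ignore_fields, unordered_fields) for item in value]
    if isinstance(value, str):
        return value.replace("\r\n", "\n").replace("\r", "\n")
    return value


def canonical_json(value, ignore_fields=frozenset(), unordered_fields=frozenset()):
    """Serialize with sorted keys so dict ordering never shows up as a diff."""
    cleaned = scrub(value, ignore_fields, unordered_fields)
    return json.dumps(cleaned, sort_keys=True, indent=2) + "\n"
```

For example, two runs that differ only in a request ID, tag order, and line endings serialize to the same text:

```python
run_a = {"id": "a1", "request_id": "x9f", "tags": ["b", "a"], "note": "one\r\ntwo"}
run_b = {"note": "one\ntwo", "tags": ["a", "b"], "request_id": "k2m", "id": "a1"}
opts = {"ignore_fields": {"request_id"}, "unordered_fields": {"tags"}}
assert canonical_json(run_a, **opts) == canonical_json(run_b, **opts)
```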

4. Capture golden outputs

For each fixture: run the surface, record observable output (return value / stdout+stderr / exit code / HTTP status+body), store as snapshot/golden file.

Stable, readable format (text, JSON)
Keep goldens small
Make it obvious how to intentionally update them (a "regenerate" command)
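One way the capture step with an explicit regenerate switch might look, sketched under assumptions: the `.golden.json` naming, the `check_or_record` helper, and the `UPDATE_GOLDENS` environment variable are all hypothetical choices, not something the skill prescribes.

```python
import json
import os
from pathlib import Path

# Hypothetical switch: goldens are rewritten only when this is set explicitly.
REGENERATE = os.environ.get("UPDATE_GOLDENS") == "1"


def golden_path(golden_dir, fixture_name):
    """Location of the golden file for one fixture."""
    return Path(golden_dir) / f"{fixture_name}.golden.json"


def check_or_record(golden_dir, fixture_name, observed, regenerate=REGENERATE):
    """Compare observed output with its golden file; write the file only on request."""
    path = golden_path(golden_dir, fixture_name)
    text = json.dumps(observed, sort_keys=True, indent=2) + "\n"
    if regenerate:
        path.parent.mkdir(parents=True, exist_ok=True)
        path.write_text(text, encoding="utf-8")
        return True
    if not path.exists():
        # Never create a golden silently; a missing file is an error until regenerated.
        raise FileNotFoundError(
            f"no golden for {fixture_name!r}; rerun with regenerate enabled"
        )
    return path.read_text(encoding="utf-8") == text
```

Refusing to create a missing golden during a normal run keeps updates intentional: a new snapshot appears only when someone asked for it.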

5. Write characterization tests

Assert: given fixture inputs → output matches golden (or tolerant comparator).

Assert observable behavior only, not internal structure
Do NOT refactor production code while writing these tests
Few strong assertions over many weak ones

6. Coverage report

Surfaces covered
Fixture list (what cases are locked down)
Known nondeterminism and how it was stabilized
Behavior gaps (what remains unknown/unlocked)
Recommended next step: usually /pragma:refactor or strangler seams
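The report itself can be plain text. A small sketch of a renderer for the five sections above follows; the function name `render_report` and the indented layout are assumptions, not a required format.

```python
def render_report(surfaces, fixtures, stabilized, gaps, next_step):
    """Render the coverage report sections as plain text, one item per line."""
    sections = [
        ("Surfaces covered", surfaces),
        ("Fixture list", fixtures),
        ("Known nondeterminism and how it was stabilized", stabilized),
        ("Behavior gaps", gaps),
        ("Recommended next step", [next_step]),
    ]
    lines = []
    for title, items in sections:
        lines.append(title)
        # An empty section is stated explicitly so a gap is never mistaken for an omission.
        lines.extend(f"  - {item}" for item in (items or ["(none recorded)"]))
        lines.append("")
    return "\n".join(lines).rstrip() + "\n"
```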

7. Commit (tests only)

Commit only the tests and fixtures, with a message of the form: Characterization: lock down current behavior of <subsystem>

Constraints

No refactor during characterization. Don't touch production code — you're measuring the thing you're locking down.
Keep fixtures minimal. If you are tempted to add many cases, you need a capsule and real spec work instead.
Tests must be deterministic (or explicitly tolerant in controlled ways).
Golden updates must be intentional. Accidental snapshot churn destroys trust.

Output

The coverage report
The test/fixture locations
The exact command(s) to run the characterization suite
The recommended next step (/pragma:refactor or strangler approach)
Lifecycle: State: stabilizing, Next: /pragma:refactor, Loop: /pragma:consult (default unless user explicitly continues directly)

related-skills.json

same repository

petrinaut.md

from "hashintel/labs"

Read and write a Petri net (SDCPN) document by Automerge URL. Use when creating, editing, or querying Petri nets — adding or removing places, transitions, arcs, color types, differential equations, and parameters.

2026-04-30 · 35 stars

assumptions.md

from "hashintel/labs"

Create and maintain an Assumption Ledger — a persistent record of assumptions, their confidence, and validation status. Use when starting a new slice, resuming work in a new context window, or when implicit assumptions risk causing drift. Tracks requirements, architecture, and implementation assumptions.

2026-03-23 · 35 stars

capsule.md

from "hashintel/labs"

Create or update a concept capsule — the conceptual anchor for a project or feature area. Use before writing code on a new project or feature, or when terms and boundaries feel unclear. Defines glossary, invariants, happy-path scenario, and non-goals.

2026-03-23 · 35 stars

card.md

from "hashintel/labs"

Write a tracer-bullet card — a precise specification for one thin end-to-end slice of work. Use when scoping a new slice, defining what to build next, or breaking a feature into provable increments. Covers target behavior, boundary crossings, risks, and definition of done.

2026-03-23 · 35 stars

consult.md

from "hashintel/labs"

Methodology triage consultant for tracer-bullet development. Use when unsure which pragma skill to run next, when starting a new project, or when the current approach feels stuck. Interviews the user, assesses state, and recommends the next pragma skill.

2026-03-23 · 35 stars

contract.md

from "hashintel/labs"

Turn capsule invariants and boundary crossings into executable contracts. Use after creating a concept capsule, or when invariants need to be enforced in code. Covers preconditions, postconditions, constructor validation, domain types, and contract tests.

2026-03-23 · 35 stars

package.json

"author": "hashintel"

"repository": "hashintel/labs"


