Ejecuta cualquier Skill en Manus
con un clic

Ejecuta cualquier Skill en Manus con un clic

$pwd:

align-op

Name: Align Op
Author: tile-ai

// Per-op orchestrator that brings a single op into alignment with its manifest entry. Classifies the op into one of three cases (green field / interface redesign / minor delta), dispatches to the right path (scaffold-op for new, archive+rescaffold+port for redesign, implement-op for minor), then runs the shared downstream (test → bench → validate → flip status → report). Complements the family-scoped `align-family`; per-op entry when you know the op you want to touch.

Ejecutar en Manus

$ git log --oneline --stat

stars:126

forks:37

updated:8 de mayo de 2026, 16:03

SKILL.md

readonly

related-skills.json

mismo repositorio

package.json

"author": "tile-ai"

"repository": "tile-ai/TileOPs"

Abrir repositorio de GitHub Ver repositorios del creador

$ install --global

$ download --local

Ejecutar en Manus

$ useful --forSOC

Desarrolladores de softwareOcupaciones informáticas y matemáticas15-1252L4

Ejecuta cualquier Skill con un clic

name

align-op

description

Per-op orchestrator that brings a single op into alignment with its manifest entry. Classifies the op into one of three cases (green field / interface redesign / minor delta), dispatches to the right path (scaffold-op for new, archive+rescaffold+port for redesign, implement-op for minor), then runs the shared downstream (test → bench → validate → flip status → report). Complements the family-scoped `align-family`; per-op entry when you know the op you want to touch.

Arguments

op_name (positional) — manifest key, e.g. CumsumFwdOp.
--mode=green|redesign|minor (optional) — override the automatic classification. When omitted, CLASSIFY decides (auto if unambiguous, otherwise prompt).
--classify-only (optional) — stop after CLASSIFY; write mode.json and return without executing any case path. Use to ask "which case is this op in?" without side effects.

Contract

Input: op_name must be present in tileops/manifest/ with status: spec-only and a non-empty source.kernel_map (same preconditions as scaffold-op; see PRE_CHECK).
Path and data bindings used throughout this skill (resolved by the orchestrator once at PRE_CHECK when the manifest entry is first loaded, then passed into every sub-skill invocation):
- <source_op> — manifest source.op path (e.g., tileops/ops/reduction/cumsum.py).
- <source_test> — manifest source.test path (e.g., tests/ops/test_cumulative.py).
- <source_bench> — manifest source.bench path.
- <source_kernel> — manifest source.kernel path (the primary kernel implementation file).
- <manifest_signature> — the signature sub-tree from the op's manifest entry, passed verbatim to test-op / implement-op.
- <pytorch_equivalent> — manifest ref_api value (e.g., "torch.cumsum") or null if the op has no PyTorch reference. Required by test-op.
Output (SUCCESS path): op file at source.op aligned with the manifest; test file source.test aligned; __init__.py registrations consistent; status flipped spec-only → implemented (single commit). Side-artefacts in .foundry/plan/<op_name>/: mode.json (classification), plan.json (scaffold-op's §1/§2/§3 when that skill ran), kernel-check.json (redesign case only), pre-rewrite/source.py (redesign case, removed at CLEANUP on SUCCESS).
Termination (success): python scripts/validate_manifest.py --check-op <op_name> reports no errors + python -m pytest <source_test> -v passes + benchmark produces numbers + manifest status flipped.
Termination (blocked): any sub-skill (scaffold-op / test-op / implement-op / bench-op) returns blocked; or scaffold-op §1 drift; or REVALIDATE fails. Kernel-layer mismatches surfaced by KERNEL_CHECK are informational only and never cause BLOCKED by themselves — BLOCKED is reached only if a kernel drift propagates into a downstream sub-skill failure (e.g., bench-op runtime error, REVALIDATE regression). Archives are kept for post-mortem.
Constraints:
- Only align-op (and only at FLIP_STATUS) may modify tileops/manifest/. Sub-skills never touch the manifest.
- MUST NOT modify kernel code. Kernel-layer work, if needed, is surfaced via kernel-check.json as a separate follow-up.
- MUST NOT expand to multi-op scope; that is align-family's role.

Trust model

CLASSIFY, DISPATCH, FLIP_STATUS, CLEANUP, REPORT are orchestrator stages (align-op itself). Every other stage delegates to an atomic skill as a separate sub-agent invocation:

Stage Sub-skill
GREEN path scaffold-op
REDESIGN path scaffold-op (after ARCHIVE + CLEAR)
MINOR path implement-op
TEST test-op
IMPLEMENT implement-op (green / redesign only; minor already did it)
BENCH bench-op

Separate invocations preserve the per-skill contracts (e.g., scaffold-op's §1 fact-freeze; implement-op's no-test-modification rule).
After each sub-skill returns, align-op verifies git status --porcelain is empty before dispatching the next. If a sub-skill left an uncommitted change, align-op commits on its behalf with Sub-skill [name]: [summary] before proceeding.

Workflow

stateDiagram-v2
    [*] --> PRE_CHECK
    PRE_CHECK --> CLASSIFY: manifest preconditions pass
    PRE_CHECK --> BLOCKED: prereq missing
    CLASSIFY --> CLASSIFY_ONLY_EXIT: --classify-only flag
    CLASSIFY --> DISPATCH: mode decided (auto or --mode or user-declared)
    DISPATCH --> GREEN_PATH: case = green
    DISPATCH --> REDESIGN_PATH: case = redesign
    DISPATCH --> MINOR_PATH: case = minor
    GREEN_PATH --> TEST: scaffold-op succeeded
    REDESIGN_PATH --> KERNEL_CHECK: rescaffold + port done
    KERNEL_CHECK --> TEST: kernel-check.json written
    MINOR_PATH --> TEST: implement-op succeeded (minor-case main stage ran here)
    TEST --> IMPLEMENT: tests fail on current code (expected gap to close)
    TEST --> BENCH: tests already pass (DONE_SKIP), usually minor path
    IMPLEMENT --> BENCH: implementation closes the gap, tests pass
    IMPLEMENT --> BLOCKED: gap beyond op-layer (e.g. kernel rewrite required)
    BENCH --> REVALIDATE: benchmark produces numbers
    REVALIDATE --> FLIP_STATUS: check-op and pytest both pass
    REVALIDATE --> BLOCKED: regression
    FLIP_STATUS --> CLEANUP: manifest status flipped
    CLEANUP --> REPORT: pre-rewrite dropped on redesign, plan artefacts kept
    REPORT --> [*]
    CLASSIFY_ONLY_EXIT --> [*]
    GREEN_PATH --> BLOCKED: scaffold failed (§1 drift or validator error)
    REDESIGN_PATH --> BLOCKED: scaffold or port failed
    MINOR_PATH --> BLOCKED: implement-op failed
    BLOCKED --> [*]: return to caller with reason

Steps

1. PRE_CHECK

Preconditions identical to scaffold-op's — orchestrator enforces them up front so sub-skills never see ill-formed input:

op_name in tileops/manifest/ → proceed; otherwise BLOCKED ("op not in manifest").
status: spec-only → proceed; implemented → BLOCKED ("already aligned; flip status to spec-only in a manifest PR first if you intend to re-align"); missing/other → BLOCKED.
source.kernel_map declared and non-empty → proceed; missing → BLOCKED with the same guidance scaffold-op uses (add in a prerequisite manifest PR).
Every value in source.kernel_map resolves to an importable symbol → proceed; otherwise BLOCKED ("kernel class not found at expected path" — kernel must exist for op layer to align, regardless of case).

2. CLASSIFY

Decide which case applies. Machine-decidable input: does source.op exist?

Input	Auto case	User prompt?
`source.op` does not exist	`green`	no
`source.op` exists	(unknown)	yes — "redesign (rewrite + port) or minor (in-place edit)?"

--mode=<case> overrides prompting only when consistent with file presence. The orchestrator validates this during CLASSIFY and BLOCKs immediately on invalid combinations so sub-skills never see contradictory input:

source.op missing → only green is valid. --mode=minor → BLOCKED ("source.op is missing; cannot edit a non-existent op file; use --mode=green or omit --mode"). --mode=redesign → BLOCKED ("source.op is missing; no archive source to rewrite; use --mode=green or omit --mode").
source.op exists → --mode=green → BLOCKED ("source.op already exists; green-field scaffold would silently overwrite; use --mode=redesign for rewrite+port or --mode=minor for in-place edit").

First-op bias. If no op in the same family has status: implemented and follows the canonical pattern (docs/design/ops-design.md § Step 3), set auto-case = redesign (skip the prompt). mode.json: decided_by: "auto", reason: "no canonical-pattern precedent in family <name>". User may override with --mode=minor.

Write .foundry/plan/<op_name>/mode.json:

{
  "op_name": "CumsumFwdOp",
  "case": "redesign",
  "file_present": true,
  "kernel_class_importable": true,
  "decided_by": "user_prompt",
  "reason": "User declared: manifest _static_axes shape rewritten; structural redesign.",
  "classified_at": "YYYY-MM-DDTHH:MM:SSZ"
}

decided_by is one of auto / user_prompt / flag_override. reason is free-form text.

If --classify-only was passed, terminate here and print the mode.json content. No other side effects.

3. DISPATCH — case-specific main stage

Each path produces the aligned op file under source.op plus whatever artefacts the sub-skill creates under .foundry/plan/<op_name>/.

3a. GREEN path (`case = green`)

scaffold-op <op_name>

Sub-skill does PRE_CHECK → DRY_RUN (plan.json) → EMIT → REGISTER → VALIDATE → REPORT. align-op waits for SUCCESS or BLOCKED; on BLOCKED, surface the row and terminate.

3b. REDESIGN path (`case = redesign`)

Sequence:

ARCHIVE — mkdir -p .foundry/plan/<op_name>/pre-rewrite/, copy source.op there as source.py (rename: strip family path, keep basename). The archive is the source of truth for manual porting. It persists until CLEANUP.
CLEAR — remove source.op from the tree; remove the op's from .<module> import <ClassName> line and its __all__ entry from the package __init__.py. Commit as [Chore] align-op: archive <op_name> before rescaffold.
SCAFFOLD — scaffold-op <op_name>. Target now absent, PRE_CHECK passes, emits the 17 mechanical slots.
PORT — read pre-rewrite/source.py and port op-specific content that the scaffold cannot produce:
- Optional hooks (_pad_value, _validate_dim, _pre_kernel, _post_kernel, _cache_key override).
- Family-specific protocol variables (_op_kind, _kernel_key, _kernel_cls, etc.) if the op was a T1 thin wrapper.
- Any forward body specifics beyond the universal pattern (kernel-specific reshape/movedim choreography).
- Any class-level non-slot attributes the old file had that still make sense under the new spec. Commit as [Feat] align-op: port business logic for <op_name> from pre-rewrite. If the agent is uncertain whether a specific override should be ported, record an open_questions item in plan.json §3 (needs_human_decision) and port conservatively.
KERNEL_CHECK — see §5 below.

3c. MINOR path (`case = minor`)

implement-op(
  op_name=<op_name>,
  manifest_signature=<manifest_signature>,
  source_op=<source_op>,
  source_test=<source_test>
)

Sub-skill does ANALYZE → DIAGNOSE → IMPLEMENT → VALIDATE → MARK_DONE → COMMIT. align-op waits for SUCCESS or BLOCKED.

4. Skipped anchor (reserved)

5. KERNEL_CHECK (redesign path only)

Determine whether the kernel layer also needs work. align-op does not modify kernel code; it surfaces the question.

For each Kernel class referenced in source.kernel_map:

Inspect the kernel's __init__ / forward / _build_program signatures (wherever applicable) in its source file.
Compare against the new op's kernel-build call emitted by scaffold-op (self.kernel_map[<key>](<args>)). Specifically check:
- Argument names and positional order.
- Argument types.
- Any layout / dtype expectations the kernel documents.
Classify per kernel:
- aligned — new op's kernel invocation matches the kernel's ctor; no kernel work.
- signature_drift — arg names or order differ; kernel ctor must be adjusted.
- semantic_drift — the kernel expects a different data layout / dtype than the new op provides (e.g. op now passes (M, N) where kernel expects (N, M)).
- unknown — cannot determine from static inspection.

Write .foundry/plan/<op_name>/kernel-check.json:

{
  "op_name": "CumsumFwdOp",
  "checked_at": "YYYY-MM-DDTHH:MM:SSZ",
  "kernels": [
    {
      "dispatch_key": "cumulative_fwd",
      "kernel_class": "CumulativeKernel",
      "kernel_source": "tileops/kernels/reduction/cumulative.py",
      "classification": "aligned",
      "op_call": "self.kernel_map['cumulative_fwd'](M, N, 'sum', self.dtype, tune=self.tune)",
      "kernel_ctor": "__init__(self, M, N, op_kind, dtype, *, tune=False)",
      "notes": "Positional and named args match; no kernel work required."
    }
  ]
}

Non-aligned entries surface in REPORT as needs_kernel_work follow-ups. align-op itself continues to TEST — the downstream path may still pass if the kernel drift only affects performance (not correctness), or fail fast if the kernel mismatch causes runtime errors, which REVALIDATE will catch.

6. TEST

test-op(
  op_name=<op_name>,
  manifest_signature=<manifest_signature>,
  pytorch_equivalent=<pytorch_equivalent>,
  source_test=<source_test>
)

Sub-skill writes tests against the new spec. Termination:

tests fail on current code (expected TDD seed) → proceed to IMPLEMENT.
DONE_SKIP (tests already pass, e.g. a sibling migration fixed the base class, or minor-path implement-op already closed the gap in Step 3c) → skip IMPLEMENT, proceed to BENCH.

7. IMPLEMENT

implement-op(
  op_name=<op_name>,
  manifest_signature=<manifest_signature>,
  source_op=<source_op>,
  source_test=<source_test>
)

Closes the gap between the emitted op file and the tests from Step 6. Applies to:

Green path: scaffold-op produced the 17 mechanical slots, but not optional hooks or family protocol vars; implement-op fills any that are required for the tests to pass.
Redesign path: the PORT sub-step in Step 3b did a first pass; implement-op closes residual gaps surfaced by the tests.
Minor path: skipped — implement-op already ran as the minor-case main stage in Step 3c. If TEST didn't DONE_SKIP here, that signals spec-drift beyond the minor-case scope and becomes BLOCKED.

BLOCKED if the gap requires kernel-layer changes (align-op is op-layer only; kernel work surfaces via kernel-check.json from KERNEL_CHECK or as a blocked return from implement-op).

8. BENCH

bench-op(
  op_name=<op_name>,
  source_bench=<source_bench>,
  source_op=<source_op>
)

Produces numbers. Sub-skill unchanged. If BLOCKED and reason is not kernel-related, propagate blocked.

9. REVALIDATE

python scripts/validate_manifest.py --check-op <op_name>
python -m pytest <source_test> -v

Both must pass. Regression after benchmark changes → BLOCKED.

10. FLIP_STATUS

Orchestrator (not a sub-skill) edits the manifest:

ops.<op_name>.status: spec-only → status: implemented
Commit as [Refactor][Manifest] promote <op_name> to implemented.

This is the only manifest write in the entire workflow, and it MUST stay within the Status flip carve-out; any contractual-field change requires a separate manifest-only PR.

11. CLEANUP

On SUCCESS path:

Delete .foundry/plan/<op_name>/pre-rewrite/ (redesign case only; archive purpose is served).
Keep mode.json, plan.json, kernel-check.json as audit trail — they are under .foundry/plan/ which is gitignored but persists in the local worktree.

On BLOCKED path: keep all artefacts for post-mortem.

12. REPORT

Single-page summary printed to stdout. Always includes:

Status: SUCCESS | BLOCKED
Op: <op_name>
Case: green | redesign | minor
Mode decided by: auto | user_prompt | flag_override
File: <source.op> (<lines>)

Sub-skills run:
  - scaffold-op: <SUCCESS|BLOCKED|skipped>
  - implement-op: <...>
  - test-op: <...>
  - bench-op: <...>

Plan artefacts (.foundry/plan/<op_name>/):
  - mode.json
  - plan.json (if scaffold-op ran)
  - kernel-check.json (if redesign path)
  - pre-rewrite/ (redesign path, cleaned on SUCCESS)

Status flipped: spec-only → implemented (commit <sha>)

Follow-ups:
  - <needs_kernel_work for kernel X> (from kernel-check.json non-aligned entries)
  - <needs_doc_fix for slot S21> (from plan.json §3)
  - <needs_human_decision about port of _pad_value> (from port observations)

On BLOCKED, replace "Status flipped" line with the blocking error and list remaining follow-ups.

Interaction with `align-family`

align-family is the family-scoped orchestrator and delegates every per-op stage to align-op. Its workflow is AUDIT → GROUP_BY_BASE → ROUTE → (per op: ALIGN_OP) → CLEANUP_GATE → CLEANUP → CREATE_PR; the family orchestrator never invokes the atomic per-op skills (scaffold-op / test-op / implement-op / bench-op) directly — every per-op stage runs inside align-op's contract.

Use align-op <op> for per-op work (green field, redesign, or minor delta).
Use align-family <family> for family-scoped historical migration of many ops at once.

They do not conflict. align-op never manages cross-op cleanup gates; that remains align-family's. align-op's FLIP_STATUS is the sole manifest-write site, observed by align-family via align-op's SUCCESS return.

Non-goals

Kernel scaffolding / kernel-layer edits. align-op surfaces kernel work as a follow-up via kernel-check.json; a separate (future) kernel-scaffold / kernel-align skill will own that layer.
Family-level cleanup. Cross-op dual-path removal lives in align-family and is not a concern of per-op alignment.
General auto-detection of "redesign vs minor." The distinction is a design judgement; align-op prompts or accepts --mode. The one exception is the first-op bias in CLASSIFY (no canonical-pattern precedent in the family → auto redesign). Beyond that one case, no auto-detection.
Manifest changes (other than FLIP_STATUS). Per the trust model, manifest changes live in separate manifest PRs.

name

align-op

description

Arguments

op_name (positional) — manifest key, e.g. CumsumFwdOp.
--mode=green|redesign|minor (optional) — override the automatic classification. When omitted, CLASSIFY decides (auto if unambiguous, otherwise prompt).
--classify-only (optional) — stop after CLASSIFY; write mode.json and return without executing any case path. Use to ask "which case is this op in?" without side effects.

Contract

Input: op_name must be present in tileops/manifest/ with status: spec-only and a non-empty source.kernel_map (same preconditions as scaffold-op; see PRE_CHECK).
Path and data bindings used throughout this skill (resolved by the orchestrator once at PRE_CHECK when the manifest entry is first loaded, then passed into every sub-skill invocation):
- <source_op> — manifest source.op path (e.g., tileops/ops/reduction/cumsum.py).
- <source_test> — manifest source.test path (e.g., tests/ops/test_cumulative.py).
- <source_bench> — manifest source.bench path.
- <source_kernel> — manifest source.kernel path (the primary kernel implementation file).
- <manifest_signature> — the signature sub-tree from the op's manifest entry, passed verbatim to test-op / implement-op.
- <pytorch_equivalent> — manifest ref_api value (e.g., "torch.cumsum") or null if the op has no PyTorch reference. Required by test-op.
Output (SUCCESS path): op file at source.op aligned with the manifest; test file source.test aligned; __init__.py registrations consistent; status flipped spec-only → implemented (single commit). Side-artefacts in .foundry/plan/<op_name>/: mode.json (classification), plan.json (scaffold-op's §1/§2/§3 when that skill ran), kernel-check.json (redesign case only), pre-rewrite/source.py (redesign case, removed at CLEANUP on SUCCESS).
Termination (success): python scripts/validate_manifest.py --check-op <op_name> reports no errors + python -m pytest <source_test> -v passes + benchmark produces numbers + manifest status flipped.
Termination (blocked): any sub-skill (scaffold-op / test-op / implement-op / bench-op) returns blocked; or scaffold-op §1 drift; or REVALIDATE fails. Kernel-layer mismatches surfaced by KERNEL_CHECK are informational only and never cause BLOCKED by themselves — BLOCKED is reached only if a kernel drift propagates into a downstream sub-skill failure (e.g., bench-op runtime error, REVALIDATE regression). Archives are kept for post-mortem.
Constraints:
- Only align-op (and only at FLIP_STATUS) may modify tileops/manifest/. Sub-skills never touch the manifest.
- MUST NOT modify kernel code. Kernel-layer work, if needed, is surfaced via kernel-check.json as a separate follow-up.
- MUST NOT expand to multi-op scope; that is align-family's role.

Trust model

CLASSIFY, DISPATCH, FLIP_STATUS, CLEANUP, REPORT are orchestrator stages (align-op itself). Every other stage delegates to an atomic skill as a separate sub-agent invocation:

Stage Sub-skill
GREEN path scaffold-op
REDESIGN path scaffold-op (after ARCHIVE + CLEAR)
MINOR path implement-op
TEST test-op
IMPLEMENT implement-op (green / redesign only; minor already did it)
BENCH bench-op

Separate invocations preserve the per-skill contracts (e.g., scaffold-op's §1 fact-freeze; implement-op's no-test-modification rule).
After each sub-skill returns, align-op verifies git status --porcelain is empty before dispatching the next. If a sub-skill left an uncommitted change, align-op commits on its behalf with Sub-skill [name]: [summary] before proceeding.

Workflow

stateDiagram-v2
    [*] --> PRE_CHECK
    PRE_CHECK --> CLASSIFY: manifest preconditions pass
    PRE_CHECK --> BLOCKED: prereq missing
    CLASSIFY --> CLASSIFY_ONLY_EXIT: --classify-only flag
    CLASSIFY --> DISPATCH: mode decided (auto or --mode or user-declared)
    DISPATCH --> GREEN_PATH: case = green
    DISPATCH --> REDESIGN_PATH: case = redesign
    DISPATCH --> MINOR_PATH: case = minor
    GREEN_PATH --> TEST: scaffold-op succeeded
    REDESIGN_PATH --> KERNEL_CHECK: rescaffold + port done
    KERNEL_CHECK --> TEST: kernel-check.json written
    MINOR_PATH --> TEST: implement-op succeeded (minor-case main stage ran here)
    TEST --> IMPLEMENT: tests fail on current code (expected gap to close)
    TEST --> BENCH: tests already pass (DONE_SKIP), usually minor path
    IMPLEMENT --> BENCH: implementation closes the gap, tests pass
    IMPLEMENT --> BLOCKED: gap beyond op-layer (e.g. kernel rewrite required)
    BENCH --> REVALIDATE: benchmark produces numbers
    REVALIDATE --> FLIP_STATUS: check-op and pytest both pass
    REVALIDATE --> BLOCKED: regression
    FLIP_STATUS --> CLEANUP: manifest status flipped
    CLEANUP --> REPORT: pre-rewrite dropped on redesign, plan artefacts kept
    REPORT --> [*]
    CLASSIFY_ONLY_EXIT --> [*]
    GREEN_PATH --> BLOCKED: scaffold failed (§1 drift or validator error)
    REDESIGN_PATH --> BLOCKED: scaffold or port failed
    MINOR_PATH --> BLOCKED: implement-op failed
    BLOCKED --> [*]: return to caller with reason

Steps

1. PRE_CHECK

Preconditions identical to scaffold-op's — orchestrator enforces them up front so sub-skills never see ill-formed input:

op_name in tileops/manifest/ → proceed; otherwise BLOCKED ("op not in manifest").
status: spec-only → proceed; implemented → BLOCKED ("already aligned; flip status to spec-only in a manifest PR first if you intend to re-align"); missing/other → BLOCKED.
source.kernel_map declared and non-empty → proceed; missing → BLOCKED with the same guidance scaffold-op uses (add in a prerequisite manifest PR).
Every value in source.kernel_map resolves to an importable symbol → proceed; otherwise BLOCKED ("kernel class not found at expected path" — kernel must exist for op layer to align, regardless of case).

2. CLASSIFY

Decide which case applies. Machine-decidable input: does source.op exist?

Input	Auto case	User prompt?
`source.op` does not exist	`green`	no
`source.op` exists	(unknown)	yes — "redesign (rewrite + port) or minor (in-place edit)?"

source.op missing → only green is valid. --mode=minor → BLOCKED ("source.op is missing; cannot edit a non-existent op file; use --mode=green or omit --mode"). --mode=redesign → BLOCKED ("source.op is missing; no archive source to rewrite; use --mode=green or omit --mode").
source.op exists → --mode=green → BLOCKED ("source.op already exists; green-field scaffold would silently overwrite; use --mode=redesign for rewrite+port or --mode=minor for in-place edit").

Write .foundry/plan/<op_name>/mode.json:

{
  "op_name": "CumsumFwdOp",
  "case": "redesign",
  "file_present": true,
  "kernel_class_importable": true,
  "decided_by": "user_prompt",
  "reason": "User declared: manifest _static_axes shape rewritten; structural redesign.",
  "classified_at": "YYYY-MM-DDTHH:MM:SSZ"
}

decided_by is one of auto / user_prompt / flag_override. reason is free-form text.

If --classify-only was passed, terminate here and print the mode.json content. No other side effects.

3. DISPATCH — case-specific main stage

Each path produces the aligned op file under source.op plus whatever artefacts the sub-skill creates under .foundry/plan/<op_name>/.

3a. GREEN path (`case = green`)

scaffold-op <op_name>

Sub-skill does PRE_CHECK → DRY_RUN (plan.json) → EMIT → REGISTER → VALIDATE → REPORT. align-op waits for SUCCESS or BLOCKED; on BLOCKED, surface the row and terminate.

3b. REDESIGN path (`case = redesign`)

Sequence:

ARCHIVE — mkdir -p .foundry/plan/<op_name>/pre-rewrite/, copy source.op there as source.py (rename: strip family path, keep basename). The archive is the source of truth for manual porting. It persists until CLEANUP.
CLEAR — remove source.op from the tree; remove the op's from .<module> import <ClassName> line and its __all__ entry from the package __init__.py. Commit as [Chore] align-op: archive <op_name> before rescaffold.
SCAFFOLD — scaffold-op <op_name>. Target now absent, PRE_CHECK passes, emits the 17 mechanical slots.
PORT — read pre-rewrite/source.py and port op-specific content that the scaffold cannot produce:
- Optional hooks (_pad_value, _validate_dim, _pre_kernel, _post_kernel, _cache_key override).
- Family-specific protocol variables (_op_kind, _kernel_key, _kernel_cls, etc.) if the op was a T1 thin wrapper.
- Any forward body specifics beyond the universal pattern (kernel-specific reshape/movedim choreography).
- Any class-level non-slot attributes the old file had that still make sense under the new spec. Commit as [Feat] align-op: port business logic for <op_name> from pre-rewrite. If the agent is uncertain whether a specific override should be ported, record an open_questions item in plan.json §3 (needs_human_decision) and port conservatively.
KERNEL_CHECK — see §5 below.

3c. MINOR path (`case = minor`)

implement-op(
  op_name=<op_name>,
  manifest_signature=<manifest_signature>,
  source_op=<source_op>,
  source_test=<source_test>
)

Sub-skill does ANALYZE → DIAGNOSE → IMPLEMENT → VALIDATE → MARK_DONE → COMMIT. align-op waits for SUCCESS or BLOCKED.

4. Skipped anchor (reserved)

5. KERNEL_CHECK (redesign path only)

Determine whether the kernel layer also needs work. align-op does not modify kernel code; it surfaces the question.

For each Kernel class referenced in source.kernel_map:

Inspect the kernel's __init__ / forward / _build_program signatures (wherever applicable) in its source file.
Compare against the new op's kernel-build call emitted by scaffold-op (self.kernel_map[<key>](<args>)). Specifically check:
- Argument names and positional order.
- Argument types.
- Any layout / dtype expectations the kernel documents.
Classify per kernel:
- aligned — new op's kernel invocation matches the kernel's ctor; no kernel work.
- signature_drift — arg names or order differ; kernel ctor must be adjusted.
- semantic_drift — the kernel expects a different data layout / dtype than the new op provides (e.g. op now passes (M, N) where kernel expects (N, M)).
- unknown — cannot determine from static inspection.

Write .foundry/plan/<op_name>/kernel-check.json:

{
  "op_name": "CumsumFwdOp",
  "checked_at": "YYYY-MM-DDTHH:MM:SSZ",
  "kernels": [
    {
      "dispatch_key": "cumulative_fwd",
      "kernel_class": "CumulativeKernel",
      "kernel_source": "tileops/kernels/reduction/cumulative.py",
      "classification": "aligned",
      "op_call": "self.kernel_map['cumulative_fwd'](M, N, 'sum', self.dtype, tune=self.tune)",
      "kernel_ctor": "__init__(self, M, N, op_kind, dtype, *, tune=False)",
      "notes": "Positional and named args match; no kernel work required."
    }
  ]
}

6. TEST

test-op(
  op_name=<op_name>,
  manifest_signature=<manifest_signature>,
  pytorch_equivalent=<pytorch_equivalent>,
  source_test=<source_test>
)

Sub-skill writes tests against the new spec. Termination:

tests fail on current code (expected TDD seed) → proceed to IMPLEMENT.
DONE_SKIP (tests already pass, e.g. a sibling migration fixed the base class, or minor-path implement-op already closed the gap in Step 3c) → skip IMPLEMENT, proceed to BENCH.

7. IMPLEMENT

implement-op(
  op_name=<op_name>,
  manifest_signature=<manifest_signature>,
  source_op=<source_op>,
  source_test=<source_test>
)

Closes the gap between the emitted op file and the tests from Step 6. Applies to:

Green path: scaffold-op produced the 17 mechanical slots, but not optional hooks or family protocol vars; implement-op fills any that are required for the tests to pass.
Redesign path: the PORT sub-step in Step 3b did a first pass; implement-op closes residual gaps surfaced by the tests.
Minor path: skipped — implement-op already ran as the minor-case main stage in Step 3c. If TEST didn't DONE_SKIP here, that signals spec-drift beyond the minor-case scope and becomes BLOCKED.

BLOCKED if the gap requires kernel-layer changes (align-op is op-layer only; kernel work surfaces via kernel-check.json from KERNEL_CHECK or as a blocked return from implement-op).

8. BENCH

bench-op(
  op_name=<op_name>,
  source_bench=<source_bench>,
  source_op=<source_op>
)

Produces numbers. Sub-skill unchanged. If BLOCKED and reason is not kernel-related, propagate blocked.

9. REVALIDATE

python scripts/validate_manifest.py --check-op <op_name>
python -m pytest <source_test> -v

Both must pass. Regression after benchmark changes → BLOCKED.

10. FLIP_STATUS

Orchestrator (not a sub-skill) edits the manifest:

ops.<op_name>.status: spec-only → status: implemented
Commit as [Refactor][Manifest] promote <op_name> to implemented.

This is the only manifest write in the entire workflow, and it MUST stay within the Status flip carve-out; any contractual-field change requires a separate manifest-only PR.

11. CLEANUP

On SUCCESS path:

Delete .foundry/plan/<op_name>/pre-rewrite/ (redesign case only; archive purpose is served).
Keep mode.json, plan.json, kernel-check.json as audit trail — they are under .foundry/plan/ which is gitignored but persists in the local worktree.

On BLOCKED path: keep all artefacts for post-mortem.

12. REPORT

Single-page summary printed to stdout. Always includes:

Status: SUCCESS | BLOCKED
Op: <op_name>
Case: green | redesign | minor
Mode decided by: auto | user_prompt | flag_override
File: <source.op> (<lines>)

Sub-skills run:
  - scaffold-op: <SUCCESS|BLOCKED|skipped>
  - implement-op: <...>
  - test-op: <...>
  - bench-op: <...>

Plan artefacts (.foundry/plan/<op_name>/):
  - mode.json
  - plan.json (if scaffold-op ran)
  - kernel-check.json (if redesign path)
  - pre-rewrite/ (redesign path, cleaned on SUCCESS)

Status flipped: spec-only → implemented (commit <sha>)

Follow-ups:
  - <needs_kernel_work for kernel X> (from kernel-check.json non-aligned entries)
  - <needs_doc_fix for slot S21> (from plan.json §3)
  - <needs_human_decision about port of _pad_value> (from port observations)

On BLOCKED, replace "Status flipped" line with the blocking error and list remaining follow-ups.

Interaction with `align-family`

Use align-op <op> for per-op work (green field, redesign, or minor delta).
Use align-family <family> for family-scoped historical migration of many ops at once.

Non-goals

Kernel scaffolding / kernel-layer edits. align-op surfaces kernel work as a follow-up via kernel-check.json; a separate (future) kernel-scaffold / kernel-align skill will own that layer.
Family-level cleanup. Cross-op dual-path removal lives in align-family and is not a concern of per-op alignment.
General auto-detection of "redesign vs minor." The distinction is a design judgement; align-op prompts or accepts --mode. The one exception is the first-op bias in CLASSIFY (no canonical-pattern precedent in the family → auto redesign). Beyond that one case, no auto-detection.
Manifest changes (other than FLIP_STATUS). Per the trust model, manifest changes live in separate manifest PRs.

Stage	Sub-skill
GREEN path	`scaffold-op`
REDESIGN path	`scaffold-op` (after ARCHIVE + CLEAR)
MINOR path	`implement-op`
TEST	`test-op`
IMPLEMENT	`implement-op` (green / redesign only; minor already did it)
BENCH	`bench-op`

align-op

Más de este repositorio

Más de este repositorio

Arguments

Contract

Trust model

Workflow

Steps

1. PRE_CHECK

2. CLASSIFY

3. DISPATCH — case-specific main stage

3a. GREEN path (case = green)

3b. REDESIGN path (case = redesign)

3c. MINOR path (case = minor)

4. Skipped anchor (reserved)

5. KERNEL_CHECK (redesign path only)

6. TEST

7. IMPLEMENT

8. BENCH

9. REVALIDATE

10. FLIP_STATUS

11. CLEANUP

12. REPORT

Interaction with align-family

Non-goals

Arguments

Contract

Trust model

Workflow

Steps

1. PRE_CHECK

2. CLASSIFY

3. DISPATCH — case-specific main stage

3a. GREEN path (case = green)

3b. REDESIGN path (case = redesign)

3c. MINOR path (case = minor)

4. Skipped anchor (reserved)

5. KERNEL_CHECK (redesign path only)

6. TEST

7. IMPLEMENT

8. BENCH

9. REVALIDATE

10. FLIP_STATUS

11. CLEANUP

12. REPORT

Interaction with align-family

Non-goals

3a. GREEN path (`case = green`)

3b. REDESIGN path (`case = redesign`)

3c. MINOR path (`case = minor`)

Interaction with `align-family`

3a. GREEN path (`case = green`)

3b. REDESIGN path (`case = redesign`)

3c. MINOR path (`case = minor`)

Interaction with `align-family`