Run any Skill in Manus with one click

$pwd:

elodin-cranelift

Name: Elodin Cranelift
Author: elodin-sys

// Work with the Cranelift JIT MLIR backend. Use when modifying libs/cranelift-mlir/, adding new StableHLO ops, debugging simulation correctness issues, running the checkpoint diagnostic tool, or working on the pointer-ABI tensor runtime.

Run Skill in Manus

$ git log --oneline --stat

stars:528

forks:34

updated:April 19, 2026 at 21:10

SKILL.md

readonly

name	elodin-cranelift
description	Work with the Cranelift JIT MLIR backend. Use when modifying libs/cranelift-mlir/, adding new StableHLO ops, debugging simulation correctness issues, running the checkpoint diagnostic tool, or working on the pointer-ABI tensor runtime.

Elodin Cranelift Backend

libs/cranelift-mlir/ compiles StableHLO MLIR to native code via Cranelift JIT — the default CPU backend for Elodin simulations. Deep internals live in ARCHITECTURE.md; profiling is in PERFORMANCE.md.

Default backend

Cranelift is the default (backend="cranelift" in WorldBuilder.run / .build). Override per-run:

ELODIN_BACKEND=jax-cpu python examples/<name>/main.py run   # XLA native, reference for correctness
ELODIN_BACKEND=jax-gpu python examples/<name>/main.py run   # XLA CUDA

Or in Python:

w.run(system, backend="jax-cpu")

Where things live

Path	Purpose
`src/ir.rs`	Internal IR: `Module`, `FuncDef`, `Instruction` variants
`src/parser.rs`	StableHLO text → IR (Winnow parser, child contexts for while/case)
`src/lower.rs`	IR → Cranelift JIT. Dual ABI, cross-ABI marshaling, SIMD, slot pooling
`src/tensor_rt.rs`	Runtime: broadcast_nd, slice, transpose, reduce, gather_nd, scatter, matmul
`tests/ops.rs`	Per-op golden tests, both ABI paths
`tests/checkpoint_test.rs`	XLA-vs-Cranelift comparator
`libs/nox-py/src/cranelift_compile.rs`	JAX → StableHLO, XLA reference checkpoint
`libs/nox-py/src/cranelift_exec.rs`	`CraneliftExec` tick loop
`libs/nox-py/src/exec.rs`	`WorldExec` enum (dispatches Cranelift vs JAX)

Everyday commands

Always run inside nix develop.

cargo test -p cranelift-mlir                    # all unit + golden tests
cargo test -p cranelift-mlir --test ops         # per-op only
cargo clippy -p cranelift-mlir -- -Dwarnings
cargo fmt -p cranelift-mlir -- --check

# Full simulation regression (requires `just install` first):
just install
source .venv/bin/activate
ELODIN_BACKEND=cranelift bash scripts/ci/regress.sh --all                                 # every example
ELODIN_BACKEND=cranelift bash scripts/ci/regress.sh ball examples/ball/main.py            # one example
ELODIN_BACKEND=cranelift bash scripts/ci/regress.sh --update ball examples/ball/main.py   # re-baseline after verifying correctness

Baselines live in scripts/ci/baseline/; tolerances in scripts/ci/baseline/tolerances.json.

Adding a new op

IR: add Instruction variant in src/ir.rs
Parser: add arm in parse_op() in src/parser.rs
Lowering (scalar): add match arm in lower_instruction() in src/lower.rs
Lowering (pointer-ABI): add match arm in lower_instruction_mem() in src/lower.rs
Runtime (if N-D): add tensor_<op>_f64 in src/tensor_rt.rs, register in TensorRtIds
Tests: golden-value tests in tests/ops.rs exercising both run_mlir and run_mlir_mem
cargo test -p cranelift-mlir

The dual-ABI / SIMD / tensor-runtime design constraints each step has to satisfy are documented in ARCHITECTURE.md.

Adding a new simulation example

Dump MLIR: ELODIN_BACKEND=cranelift ELODIN_CRANELIFT_DEBUG_DIR=/tmp/dbg python examples/<name>/main.py run
Catalog ops: python3 libs/cranelift-mlir/scripts/catalog_ops.py /tmp/dbg/stablehlo.mlir (safe on multi-GB files — do NOT grep them)
Copy the MLIR into testdata/, create an e2e test
Implement any missing ops (see "Adding a new op")
Regression: ELODIN_BACKEND=cranelift bash scripts/ci/regress.sh <name> examples/<name>/main.py

Debugging

Simulation produces wrong values

The tick checkpoint diagnostic tool is the workhorse. Full usage and MLIR-bisection workflow in ARCHITECTURE.md — Checkpoint Diagnostic Tool section. Quick commands:

# Capture XLA reference + Cranelift outputs for every tick input:
ELODIN_BACKEND=cranelift ELODIN_CRANELIFT_DEBUG_DIR=/tmp/ckpt \
  bash scripts/ci/regress.sh <example> examples/<example>/main.py

# Compare, per output, element-by-element:
ELODIN_CRANELIFT_DEBUG_DIR=/tmp/ckpt \
  cargo test -p cranelift-mlir --test checkpoint_test --release -- --ignored --nocapture

Once the diverging output is identified, reduce it to a minimal tests/ops.rs reproducer before fixing.

Simulation crashes (segfault / abort)

Try --release first: some JIT paths trip ptr::copy_nonoverlapping debug-mode UB checks.
Stack overflow on complex sims: the checkpoint test pins a 64 MB thread stack; mirror this if reproducing outside the test.
NULL pointer in JIT code is almost always a cross-ABI marshaling bug.

Compiler says "unsupported instruction" or "unsupported custom_call target"

Follow "Adding a new op".

Environment variables

Only two matter day-to-day:

ELODIN_BACKEND — cranelift (default), jax-cpu, jax-gpu.
ELODIN_CRANELIFT_DEBUG_DIR=<dir> — single flag for every diagnostic (profile probes, op-category sampling, tick waveform, instr/fold reports, inliner and slot-pool traces, MLIR dump, first-tick XLA reference checkpoint). Files land flat under <dir>. Zero overhead when unset.

Full outputs and reading guide: PERFORMANCE.md.

libs/cranelift-mlir/ARCHITECTURE.md — compilation pipeline, dual ABI, SIMD (LaneRepr), JIT memory layout, tensor runtime, LAPACK via faer, gather patterns, while-loop scoping, checkpoint tool, testing strategy, opportunities.
libs/cranelift-mlir/PERFORMANCE.md — ELODIN_CRANELIFT_DEBUG_DIR outputs, profile report fields, diff + waveform scripts, Tracy workflow.
libs/cranelift-mlir/README.md — crate-level quick start.

related-skills.json

same repository

elodin-dev.md

from "elodin-sys/elodin"

Develop and contribute to the Elodin codebase. Use when building Elodin from source, running tests, modifying core libraries, working on the Rust workspace, or onboarding as a contributor.

2026-05-21528

elodin-editor-dev.md

from "elodin-sys/elodin"

Contribute to the Elodin Editor, the 3D viewer and graphing tool. Use when editing files in libs/elodin-editor/ or apps/elodin/, working on the Bevy/Egui UI, modifying viewport rendering, telemetry graphs, video streaming, KDL schematics, or the command palette.

2026-05-21528

bevy.md

from "elodin-sys/elodin"

Tips for working with a Bevy application

2026-05-08528

elodin-aleph.md

from "elodin-sys/elodin"

Deploy and configure AlephOS on flight computers, write flight software services, and manage NixOS modules for the Aleph platform. Use when working with aleph/, deploying to Jetson Orin hardware, writing NixOS modules, flashing firmware, or composing a flight software stack.

2026-04-22528

elodin-simulation.md

from "elodin-sys/elodin"

Create and modify physics simulations using the Elodin Python SDK. Use when writing or editing simulation Python files, defining components or systems, spawning entities, configuring 6DOF physics, setting up visualization, or integrating with SITL/HITL workflows.

2026-04-19528

elodin-tracy.md

from "elodin-sys/elodin"

Profile Elodin with Tracy. Use when profiling the editor, simulation, building with tracy features, capturing traces, analyzing performance, or adding custom instrumentation.

2026-04-19528

package.json

"author": "elodin-sys"

"repository": "elodin-sys/elodin"

View GitHub Repository View Creator Repositories

$ install --global

$ download --local

Run Skill in Manus

$ useful --forSOC

Software DevelopersComputer and Mathematical Occupations15-1252L4

name	elodin-cranelift
description	Work with the Cranelift JIT MLIR backend. Use when modifying libs/cranelift-mlir/, adding new StableHLO ops, debugging simulation correctness issues, running the checkpoint diagnostic tool, or working on the pointer-ABI tensor runtime.

Elodin Cranelift Backend

Default backend

Cranelift is the default (backend="cranelift" in WorldBuilder.run / .build). Override per-run:

ELODIN_BACKEND=jax-cpu python examples/<name>/main.py run   # XLA native, reference for correctness
ELODIN_BACKEND=jax-gpu python examples/<name>/main.py run   # XLA CUDA

Or in Python:

w.run(system, backend="jax-cpu")

Where things live

Path	Purpose
`src/ir.rs`	Internal IR: `Module`, `FuncDef`, `Instruction` variants
`src/parser.rs`	StableHLO text → IR (Winnow parser, child contexts for while/case)
`src/lower.rs`	IR → Cranelift JIT. Dual ABI, cross-ABI marshaling, SIMD, slot pooling
`src/tensor_rt.rs`	Runtime: broadcast_nd, slice, transpose, reduce, gather_nd, scatter, matmul
`tests/ops.rs`	Per-op golden tests, both ABI paths
`tests/checkpoint_test.rs`	XLA-vs-Cranelift comparator
`libs/nox-py/src/cranelift_compile.rs`	JAX → StableHLO, XLA reference checkpoint
`libs/nox-py/src/cranelift_exec.rs`	`CraneliftExec` tick loop
`libs/nox-py/src/exec.rs`	`WorldExec` enum (dispatches Cranelift vs JAX)

Everyday commands

Always run inside nix develop.

cargo test -p cranelift-mlir                    # all unit + golden tests
cargo test -p cranelift-mlir --test ops         # per-op only
cargo clippy -p cranelift-mlir -- -Dwarnings
cargo fmt -p cranelift-mlir -- --check

# Full simulation regression (requires `just install` first):
just install
source .venv/bin/activate
ELODIN_BACKEND=cranelift bash scripts/ci/regress.sh --all                                 # every example
ELODIN_BACKEND=cranelift bash scripts/ci/regress.sh ball examples/ball/main.py            # one example
ELODIN_BACKEND=cranelift bash scripts/ci/regress.sh --update ball examples/ball/main.py   # re-baseline after verifying correctness

Baselines live in scripts/ci/baseline/; tolerances in scripts/ci/baseline/tolerances.json.

Adding a new op

IR: add Instruction variant in src/ir.rs
Parser: add arm in parse_op() in src/parser.rs
Lowering (scalar): add match arm in lower_instruction() in src/lower.rs
Lowering (pointer-ABI): add match arm in lower_instruction_mem() in src/lower.rs
Runtime (if N-D): add tensor_<op>_f64 in src/tensor_rt.rs, register in TensorRtIds
Tests: golden-value tests in tests/ops.rs exercising both run_mlir and run_mlir_mem
cargo test -p cranelift-mlir

The dual-ABI / SIMD / tensor-runtime design constraints each step has to satisfy are documented in ARCHITECTURE.md.

Adding a new simulation example

Dump MLIR: ELODIN_BACKEND=cranelift ELODIN_CRANELIFT_DEBUG_DIR=/tmp/dbg python examples/<name>/main.py run
Catalog ops: python3 libs/cranelift-mlir/scripts/catalog_ops.py /tmp/dbg/stablehlo.mlir (safe on multi-GB files — do NOT grep them)
Copy the MLIR into testdata/, create an e2e test
Implement any missing ops (see "Adding a new op")
Regression: ELODIN_BACKEND=cranelift bash scripts/ci/regress.sh <name> examples/<name>/main.py

Debugging

Simulation produces wrong values

The tick checkpoint diagnostic tool is the workhorse. Full usage and MLIR-bisection workflow in ARCHITECTURE.md — Checkpoint Diagnostic Tool section. Quick commands:

# Capture XLA reference + Cranelift outputs for every tick input:
ELODIN_BACKEND=cranelift ELODIN_CRANELIFT_DEBUG_DIR=/tmp/ckpt \
  bash scripts/ci/regress.sh <example> examples/<example>/main.py

# Compare, per output, element-by-element:
ELODIN_CRANELIFT_DEBUG_DIR=/tmp/ckpt \
  cargo test -p cranelift-mlir --test checkpoint_test --release -- --ignored --nocapture

Once the diverging output is identified, reduce it to a minimal tests/ops.rs reproducer before fixing.

Simulation crashes (segfault / abort)

Try --release first: some JIT paths trip ptr::copy_nonoverlapping debug-mode UB checks.
Stack overflow on complex sims: the checkpoint test pins a 64 MB thread stack; mirror this if reproducing outside the test.
NULL pointer in JIT code is almost always a cross-ABI marshaling bug.

Compiler says "unsupported instruction" or "unsupported custom_call target"

Follow "Adding a new op".

Environment variables

Only two matter day-to-day:

ELODIN_BACKEND — cranelift (default), jax-cpu, jax-gpu.
ELODIN_CRANELIFT_DEBUG_DIR=<dir> — single flag for every diagnostic (profile probes, op-category sampling, tick waveform, instr/fold reports, inliner and slot-pool traces, MLIR dump, first-tick XLA reference checkpoint). Files land flat under <dir>. Zero overhead when unset.

Full outputs and reading guide: PERFORMANCE.md.

elodin-cranelift

Elodin Cranelift Backend

Default backend

Where things live

Everyday commands

Adding a new op

Adding a new simulation example

Debugging

Simulation produces wrong values

Simulation crashes (segfault / abort)

Compiler says "unsupported instruction" or "unsupported custom_call target"

Environment variables

Further reading

Elodin Cranelift Backend

Default backend

Where things live

Everyday commands

Adding a new op

Adding a new simulation example

Debugging

Simulation produces wrong values

Simulation crashes (segfault / abort)

Compiler says "unsupported instruction" or "unsupported custom_call target"

Environment variables

Further reading

elodin-cranelift

Elodin Cranelift Backend

Default backend

Where things live

Everyday commands

Adding a new op

Adding a new simulation example

Debugging

Simulation produces wrong values

Simulation crashes (segfault / abort)

Compiler says "unsupported instruction" or "unsupported custom_call target"

Environment variables

Further reading

More from this repository

More from this repository

Elodin Cranelift Backend

Default backend

Where things live

Everyday commands

Adding a new op

Adding a new simulation example

Debugging

Simulation produces wrong values

Simulation crashes (segfault / abort)

Compiler says "unsupported instruction" or "unsupported custom_call target"

Environment variables

Further reading