Run any Skill in Manus with one click

$pwd:

ttir-model-op-analysis

Name: Ttir Model Op Analysis
Author: tenstorrent

// Given a `.mlir` file (or a directory of `.mlir` files) with TTIR ops, run the same TTIR normalization passes as `D2MFrontendPipeline` before D2M, then produce per-file outputs: `preprocessed.mlir`, `ttir-op-report.txt` (op counts from normalized IR), and `ops.mlir` (one func per unique op configuration, golden-style). Optional: per-pass IR dumps.

Run Skill in Manus

$ git log --oneline --stat

stars:274

forks:130

updated:May 5, 2026 at 16:45

SKILL.md

readonly

related-skills.json

same repository

add-op.md

from "tenstorrent/tt-mlir"

How to add a new operation (op) to the tt-mlir compiler across all layers: TTIR/TTNN dialect definitions, StableHLO composite conversion, TTIR-to-TTNN conversion, EmitC/EmitPy conversions, flatbuffer schema and serialization, runtime implementation, OpModel, ttir_builder, golden functions, and all associated tests. Use this skill whenever the user asks to add an op, implement an op, create a new operation, add support for a TTNN op, or mentions adding an op to the compiler pipeline. Also trigger when the user wants to know what files to change for a new op, or asks about the op-adding workflow.

2026-05-16274

ttir-decomposition-for-ttmetal.md

from "tenstorrent/tt-mlir"

Add a new composite op decomposition pattern to the TTMetal pipeline. Use when the user wants to decompose/lower a high-level TTIR op (e.g. rms_norm, sdpa, layer_norm, softmax) into primitive TTIR ops (matmul, add, multiply, etc.) for the D2M/TTMetal backend. Also trigger when the user mentions "decomposition pattern", "decompose op for ttmetal", or "lower op to primitives".

2026-05-05274

run-ops-mlir-snippets.md

from "tenstorrent/tt-mlir"

Compile and optionally execute every func.func in an ops.mlir-style snippet file (or every .mlir file in a directory) using `run_ops_mlir_snippets.py`. Use when the user wants to compile or run TTIR op snippets on device, test ops.mlir files, or check which ops compile/execute successfully.

2026-04-30274

add-ttir-d2m-lowering.md

from "tenstorrent/tt-mlir"

Elementwise TTIR→D2M→TTMetal path: tablegen, TTIRToD2M.cpp, D2MToTTKernel.cpp, and — only when the kernel API callee is new — TTKernelIncludesMap.h (per-op api/compute/eltwise_unary/*.h mapping for JIT). Does not edit D2MGenericRegionOps.cpp or TTKernelToCpp.cpp. Not for reductions, matmul, views, or CCL.

2026-04-22274

validate-tt-mlir-against-tt-xla.md

from "tenstorrent/tt-mlir"

Validate a tt-mlir PR against tt-xla by creating a cherry-picked branch and triggering CI. Invoked as: /validate-tt-mlir-against-tt-xla <PR number or URL>. Use this skill whenever the user wants to test, validate, qualify, or check a tt-mlir PR in tt-xla, or mentions running uplift qualification test suite, or asks to trigger tt-xla CI for a tt-mlir change. Also triggers when the user mentions "xla validate", "xla test", or "validate in xla".

2026-04-17274

add-ttir-builder-op.md

from "tenstorrent/tt-mlir"

Add full builder API support (@tag, @parse, @split) for a TTIR op. Use this skill whenever the user wants to add builder support for a new TTIR op, upgrade an existing _op_proxy-based op to use @tag/@parse/@split decorators, or asks about how to add builder API for an op in ttir_builder.py. Also trigger when the user mentions adding tag/parse/split for an op, or wants to make an op work with the parse/split test infrastructure.

2026-04-01274

package.json

"author": "tenstorrent"

"repository": "tenstorrent/tt-mlir"

View GitHub Repository View Creator Repositories

$ install --global

$ download --local

Run Skill in Manus

$ useful --forSOC

Software DevelopersComputer and Mathematical Occupations15-1252L4

name	ttir-model-op-analysis
description	Given a `.mlir` file (or a directory of `.mlir` files) with TTIR ops, run the same TTIR normalization passes as `D2MFrontendPipeline` before D2M, then produce per-file outputs: `preprocessed.mlir`, `ttir-op-report.txt` (op counts from normalized IR), and `ops.mlir` (one func per unique op configuration, golden-style). Optional: per-pass IR dumps.

TTIR model op analysis (post-frontend-normalization)

Given a .mlir file (or a directory of .mlir files, e.g. a multi-graph model) containing TTIR ops (from vLLM, torch-mlir, or tt-forge), normalize it the way the compiler does before D2M, then inventory which TTIR ops (and shapes) appear. Do not open TTIRToD2M.cpp, grep populateTTIRToD2MPatterns, or produce "D2M coverage" unless the user explicitly asks for that comparison.

The input can be:

A single .mlir file (e.g. model.mlir)
A directory of .mlir files (e.g. vllm_opt/ containing graph1.mlir, graph2.mlir). Each file is processed independently with per-graph reports.

Step 1: Run the frontend TTIR normalization pipeline

These passes match createD2MFrontendPipeline in lib/Dialect/D2M/Pipelines/D2MPipelines.cpp:

#	Pass flag	Short name (for filenames)
1	`--canonicalize`	`canonicalize`
2	`--ttir-predicate-type-alignment`	`predicate-type-alignment`
3	`--ttir-element-type-normalization`	`element-type-normalization`
4	`--ttir-to-ttir-decomposition`	`decomposition`
5	`--ttir-explicate-tms`	`explicate-tms`
6	`--ttir-erase-inverse-ops`	`erase-inverse-ops`
7	`--ttir-move-reshape-to-constant`	`move-reshape-to-constant`
8	`--ttir-fold-constant-reshape-broadcast`	`fold-constant-reshape-broadcast`
9	`--ttir-implicit-broadcast-fold`	`implicit-broadcast-fold`

Single-file mode

Create <basename>/ next to the input (basename = filename without .mlir):

<basename>/
  preprocessed.mlir              # IR after all passes above
  ttir-op-report.txt             # counts + sorted mnemonic list (see below)
  ops.mlir                       # one func per unique TTIR op configuration

source env/activate
INPUT="<input.mlir>"
BASENAME="$(basename "$INPUT" .mlir)"
mkdir -p "$BASENAME"

ttmlir-opt \
  --canonicalize \
  --ttir-predicate-type-alignment \
  --ttir-element-type-normalization \
  --ttir-to-ttir-decomposition \
  --ttir-explicate-tms \
  --ttir-erase-inverse-ops \
  --ttir-move-reshape-to-constant \
  --ttir-fold-constant-reshape-broadcast \
  --ttir-implicit-broadcast-fold \
  -o "$BASENAME/preprocessed.mlir" \
  "$INPUT"

Directory mode (multi-graph models)

When the input is a directory like vllm_opt/ containing graph1.mlir, graph2.mlir, etc., loop over each file and create a subdirectory per graph. Each subdirectory uses clean names -- the folder provides the graph context:

vllm_opt/
  graph1.mlir                        # original input
  graph2.mlir
  ttir-op-report.txt                 # combined report across all graphs
  graph1/                            # per-graph output folder
    preprocessed.mlir
    ttir-op-report.txt
    ops.mlir
  graph2/
    preprocessed.mlir
    ttir-op-report.txt
    ops.mlir

source env/activate
INPUT_DIR="<dir>"
for f in "$INPUT_DIR"/*.mlir; do
  STEM="$(basename "$f" .mlir)"
  mkdir -p "$INPUT_DIR/$STEM"
  ttmlir-opt \
    --canonicalize \
    --ttir-predicate-type-alignment \
    --ttir-element-type-normalization \
    --ttir-to-ttir-decomposition \
    --ttir-explicate-tms \
    --ttir-erase-inverse-ops \
    --ttir-move-reshape-to-constant \
    --ttir-fold-constant-reshape-broadcast \
    --ttir-implicit-broadcast-fold \
    -o "$INPUT_DIR/$STEM/preprocessed.mlir" \
    "$f" || { echo "FAILED on $f"; continue; }
done

`ttir-op-report.txt` and `ops.mlir`

Do not hand-roll parsing. From the repo root (any Python 3.12+; no venv import deps). The script accepts multiple paths, so pass all preprocessed files in one command:

python tools/scripts/model_breakdown/ttir_model_op_inventory.py "$INPUT_DIR"/*/preprocessed.mlir

This writes ttir-op-report.txt and ops.mlir next to each preprocessed.mlir, plus a combined ttir-op-report.txt at the common parent directory with per-file summary and merged mnemonic counts.

Optional flags: --report PATH, --ops PATH (single-file only), -v.

The script reports: total "ttir.*" instances, per-mnemonic counts (with optional percentages), distinct mnemonic count, and distinct op configurations (SSA-normalized op lines). ops.mlir follows the same shape as test/python/golden/mlir_snippets/models/qwen3_4b/ops.mlir: one func.func per unique configuration (<mnemonic>_<index>), args %arg0, ..., single op + return. For multi-result ops such as ttir.sort and ttir.topk, the generated op must use MLIR multi-result binding syntax (%0:2 = ...) and return projected results (return %0#0, %0#1 : ...). If you see operation defines 2 results but was provided 1 to bind, regenerate with tools/scripts/model_breakdown/ttir_model_op_inventory.py instead of hand-editing every snippet.

ttir.full and ttir.constant are excluded from ops.mlir unless their result is directly returned by the module -- they almost always just produce values consumed by other ops, and are already inlined as const producers inside those ops' test functions. The report still counts them.

It assumes one generic TTIR op per line (normal ttmlir-opt output). If needed, sanity-check parse with ttmlir-opt on ops.mlir.

Optional: dump IR after each pass

Only if the user asks. Same pass loop as above; write <basename>/<basename>.<short-name>.mlir for each stage. Final stage is the source for ttir-op-report.txt and ops.mlir (same as *.implicit-broadcast-fold.mlir when you ran all nine).

source env/activate
INPUT="<input.mlir>"
BASENAME="$(basename "$INPUT" .mlir)"
mkdir -p "$BASENAME"
PREV="$INPUT"
declare -a PASSES=(
  "--canonicalize|canonicalize"
  "--ttir-predicate-type-alignment|predicate-type-alignment"
  "--ttir-element-type-normalization|element-type-normalization"
  "--ttir-to-ttir-decomposition|decomposition"
  "--ttir-explicate-tms|explicate-tms"
  "--ttir-erase-inverse-ops|erase-inverse-ops"
  "--ttir-move-reshape-to-constant|move-reshape-to-constant"
  "--ttir-fold-constant-reshape-broadcast|fold-constant-reshape-broadcast"
  "--ttir-implicit-broadcast-fold|implicit-broadcast-fold"
)
for entry in "${PASSES[@]}"; do
  IFS='|' read -r FLAG SHORTNAME <<< "$entry"
  OUT="$BASENAME/$BASENAME.$SHORTNAME.mlir"
  ttmlir-opt $FLAG -o "$OUT" "$PREV" || { echo "FAILED at: $FLAG"; break; }
  PREV="$OUT"
done

NOTE: If a pass fails, check include/ttmlir/Dialect/TTIR/Transforms/Passes.td for the registered flag. Some setups need --ttcore-register-device="system-desc-path=..." before other passes.

Step 2: Fill the report and `ops.mlir`

Run tools/scripts/model_breakdown/ttir_model_op_inventory.py on each preprocessed.mlir file (single-file mode). See above.

Next: compile/run the snippets

To compile or execute the generated ops.mlir on device, see the run-ops-mlir-snippets skill.

ttir-model-op-analysis

More from this repository

TTIR model op analysis (post-frontend-normalization)

Step 1: Run the frontend TTIR normalization pipeline

Single-file mode

Directory mode (multi-graph models)

ttir-op-report.txt and ops.mlir

Optional: dump IR after each pass

Step 2: Fill the report and ops.mlir

Next: compile/run the snippets

TTIR model op analysis (post-frontend-normalization)

Step 1: Run the frontend TTIR normalization pipeline

Single-file mode

Directory mode (multi-graph models)

ttir-op-report.txt and ops.mlir

Optional: dump IR after each pass

Step 2: Fill the report and ops.mlir

Next: compile/run the snippets

More from this repository

`ttir-op-report.txt` and `ops.mlir`

Step 2: Fill the report and `ops.mlir`

`ttir-op-report.txt` and `ops.mlir`

Step 2: Fill the report and `ops.mlir`