profile

Name: Profile
Author: pytorch

// Profile ExecuTorch model execution. Use when measuring performance, analyzing operator timing, or debugging slow models.



stars: 4652

forks: 1005

updated: February 8, 2026, 05:21

SKILL.md


name	profile
description	Profile ExecuTorch model execution. Use when measuring performance, analyzing operator timing, or debugging slow models.

Profile

1. Enable ETDump when loading

from executorch.runtime import Runtime
program = Runtime.get().load_program("model.pte", enable_etdump=True, debug_buffer_size=int(1e7))

2. Execute and save

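# inputs: a sequence of input values matching the method's signature
outputs = program.load_method("forward").execute(inputs)
# Write the ETDump and the debug buffer to disk
program.write_etdump_result_to_file("etdump.etdp", "debug.bin")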
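
Steps 1 and 2 can be wrapped in a single helper. This is a minimal sketch, not the project's own code. `profile_forward` is a hypothetical name, and the early file check is an addition for clearer errors. Getting the runtime handle via `Runtime.get()` from `executorch.runtime` is assumed here. The import is done lazily so the helper can be defined on a machine without ExecuTorch installed.

```python
from pathlib import Path


def profile_forward(pte_path, inputs, etdump_path="etdump.etdp", debug_path="debug.bin"):
    """Run `forward` once with ETDump enabled and write the profiling files.

    Hypothetical helper: the name and the default paths are illustrative.
    """
    pte = Path(pte_path)
    if not pte.is_file():
        # Fail early with a clear message instead of a loader error.
        raise FileNotFoundError(f"no .pte file at {pte}")

    # Assumption: the runtime handle is obtained with Runtime.get().
    from executorch.runtime import Runtime

    runtime = Runtime.get()
    program = runtime.load_program(
        str(pte), enable_etdump=True, debug_buffer_size=int(1e7)
    )
    outputs = program.load_method("forward").execute(inputs)
    # Persist the ETDump and the debug buffer for the Inspector step.
    program.write_etdump_result_to_file(etdump_path, debug_path)
    return outputs
```

Calling `profile_forward("model.pte", inputs)` would then leave `etdump.etdp` and `debug.bin` in the working directory for analysis.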

3. Analyze with Inspector

from executorch.devtools import Inspector
inspector = Inspector(etrecord="model.etrecord", etdump_path="etdump.etdp")
inspector.print_data_tabular()
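
Once the table is printed, a common next step when debugging a slow model is to rank events by latency. The sketch below assumes the timing data has already been copied into plain dictionaries. The keys `event_name` and `avg_ms`, the helper `slowest_events`, and the numbers themselves are hypothetical and are not output from any real model or from the Inspector API.

```python
def slowest_events(rows, top_n=3):
    """Return the top_n rows sorted by average latency, slowest first."""
    return sorted(rows, key=lambda row: row["avg_ms"], reverse=True)[:top_n]


# Made-up rows for illustration only; not measurements from a real run.
rows = [
    {"event_name": "conv2d", "avg_ms": 4.2},
    {"event_name": "add", "avg_ms": 0.3},
    {"event_name": "linear", "avg_ms": 1.9},
    {"event_name": "relu", "avg_ms": 0.1},
]

for row in slowest_events(rows, top_n=2):
    print(f'{row["event_name"]}: {row["avg_ms"]} ms')
```

Sorting by average latency is only one choice; ranking by total time or by a high percentile may be more useful when a cheap operator runs many times.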

related-skills.json

same repository

qualcomm.md

from "pytorch/executorch"

Build, test, or develop the QNN (Qualcomm AI Engine Direct) backend. Use when working on backends/qualcomm/, building QNN (use backends/qualcomm/scripts/build.sh), adding new ops or passes, running QNN delegate tests, or exporting models for Qualcomm HTP/GPU targets. Also exposes a Buck-vs-CMake parity workflow — invoke as `/qualcomm buck-fix`, `/qualcomm buck-cmake fix`, `/qualcomm buck-parity`, or any user request to fix `test-qnn-buck-build-linux` CI failures or check buck/cmake drift in backends/qualcomm/.

2026-05-20, 4.7k stars

zephyr.md

from "pytorch/executorch"

Build and configure ExecuTorch as a Zephyr RTOS module for embedded boards. Use when setting up a Zephyr workspace with ET, adding board support (overlays, confs, memory layout), building with west, or debugging linker memory overflow.

2026-04-29, 4.7k stars

executorch-kb.md

from "pytorch/executorch"

Search the ExecuTorch tribal knowledge base covering QNN, XNNPACK, Vulkan, CoreML, Arm, and Cadence backends, quantization recipes, export pitfalls, runtime errors, and SoC compatibility. Use when debugging ExecuTorch errors, choosing quantization configs, checking backend op support, or answering questions about Qualcomm HTP / Snapdragon / Apple Neural Engine behavior.

2026-04-21, 4.7k stars

building.md

from "pytorch/executorch"

Build ExecuTorch from source — Python package, C++ runtime, runners, cross-compilation, and backend-specific builds. Use when compiling anything in the ExecuTorch repo, diagnosing build failures, or setting up platform-specific builds.

2026-03-11, 4.7k stars

binary-size.md

from "pytorch/executorch"

Analyze and reduce ExecuTorch binary size. Use when investigating binary size, running size tests, or optimizing the runtime for size-constrained deployments.

2026-03-07, 4.7k stars

export.md

from "pytorch/executorch"

Export a PyTorch model to .pte format for ExecuTorch. Use when converting models, lowering to edge, or generating .pte files.

2026-02-19, 4.7k stars

package.json