veomni-uv-update

Name: Veomni Uv Update
Author: ByteDance-Seed

Use this skill when updating dependencies managed by uv: bumping a package version, upgrading the uv tool itself, updating torch/CUDA stack, switching transformers version, or regenerating the lockfile. Trigger: 'update dependency', 'bump version', 'upgrade uv', 'update torch', 'update lockfile', 'uv sync fails'.

stars:1,957

forks:197

updated:May 19, 2026 at 11:20

SKILL.md

Before You Start

Read .agents/knowledge/uv.md for the full dependency architecture. The key things that make VeOmni's uv setup non-trivial:

uv version is pinned in three places (must update together)
torch uses direct wheel URLs (not just version bumps)
hardware extras (gpu, npu, npu_aarch64) are mutually conflicting

Scenario 1: Update uv Version

uv is pinned to a specific version. Update all three locations together:

pyproject.toml -> [tool.uv] -> required-version = "==X.Y.Z"
docker/cuda/Dockerfile.cu129 -> COPY --from=ghcr.io/astral-sh/uv:X.Y.Z
docker/ascend/Dockerfile.ascend_* -> same pattern (if present)

Then regenerate the lockfile:

uv lock
uv sync --extra gpu --dev

Verify that the lockfile diff is reasonable: git diff uv.lock should show only version changes, not wholesale rewrites.
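The "update all three locations together" rule can be checked mechanically. The following is a minimal sketch, assuming the pins appear exactly in the two textual forms listed above (required-version = "==X.Y.Z" and ghcr.io/astral-sh/uv:X.Y.Z). The helper names, the Ascend Dockerfile name, and the version numbers are hypothetical and used only for illustration.

```python
import re

# Patterns follow the two pin formats described in this scenario;
# real files may use different spacing or quoting.
PYPROJECT_PIN = re.compile(r'required-version\s*=\s*"==\s*([0-9][^"\s]*)"')
DOCKER_PIN = re.compile(r"ghcr\.io/astral-sh/uv:([0-9][\w.\-]*)")


def find_uv_pins(files):
    """Map each file name to the uv version it pins, or None if no pin is found.

    `files` maps a file name to that file's text content.
    """
    pins = {}
    for name, text in files.items():
        pattern = PYPROJECT_PIN if name.endswith("pyproject.toml") else DOCKER_PIN
        match = pattern.search(text)
        pins[name] = match.group(1) if match else None
    return pins


def pins_agree(pins):
    """Return True when all files that contain a pin use the same version."""
    found = {version for version in pins.values() if version is not None}
    return len(found) <= 1


# Illustrative contents only: the versions and the Ascend file name are made up.
example = {
    "pyproject.toml": '[tool.uv]\nrequired-version = "==0.9.8"\n',
    "docker/cuda/Dockerfile.cu129": "COPY --from=ghcr.io/astral-sh/uv:0.9.8 /uv /bin/uv\n",
    "docker/ascend/Dockerfile.ascend_example": "COPY --from=ghcr.io/astral-sh/uv:0.9.7 /uv /bin/uv\n",
}
pins = find_uv_pins(example)
print(pins)
print("consistent:", pins_agree(pins))
```

In a real checkout the dictionary would be built by reading the files from disk; the check is kept on plain strings here so it can run anywhere.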

Scenario 2: Update a Regular Dependency

1. Edit the version constraint in pyproject.toml, either in the dependencies list under [project] or in the relevant [project.optional-dependencies] extra.
2. Regenerate the lockfile and sync:

uv lock
uv sync --extra gpu --dev

3. Run tests: pytest tests/
4. Commit pyproject.toml and uv.lock together.
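The final step (committing both files together) lends itself to a small pre-commit check. This is a sketch under the assumption that the list of staged files comes from git diff --cached --name-only; the function name is hypothetical.

```python
def staged_lockfile_problem(staged_output):
    """Return a warning if pyproject.toml is staged without uv.lock, else None.

    `staged_output` is text with one file path per line, as printed by
    `git diff --cached --name-only`.
    """
    staged = {line.strip() for line in staged_output.splitlines() if line.strip()}
    if "pyproject.toml" in staged and "uv.lock" not in staged:
        return "pyproject.toml is staged without uv.lock"
    # A lockfile-only change is allowed (see the lockfile-only scenario).
    return None


print(staged_lockfile_problem("pyproject.toml\nveomni/utils/import_utils.py\n"))
print(staged_lockfile_problem("pyproject.toml\nuv.lock\n"))
```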

Scenario 3: Update torch / CUDA Stack

This is the most complex update. torch versions are pinned in multiple places:

For GPU (gpu extra):

pyproject.toml -> [project.optional-dependencies] -> gpu list
pyproject.toml -> [tool.uv] -> override-dependencies (the extra == 'gpu' entries)
pyproject.toml -> [tool.uv.sources] -> torch (direct wheel URL — must update to matching wheel)
Related packages: torchvision, torchaudio, torchcodec, nvidia-cudnn-cu12

For NPU (npu / npu_aarch64 extras):

The same locations apply, but the pinned torch versions carry a +cpu suffix or no suffix.

Steps:

1. Identify the target torch version and the matching wheel URLs at https://download.pytorch.org/whl/
2. Update all pinned versions in pyproject.toml (extras, overrides, sources).
3. Check flash-attn / flash-attn-3 wheel compatibility; these are tied to specific torch versions via direct URLs in [tool.uv.sources].
4. Update the torchcodec version if needed (see the compatibility note in pyproject.toml).
5. Regenerate the lockfile and sync:

uv lock
uv sync --extra gpu --dev

6. Run tests: pytest tests/
7. Update the Docker images if the torch version changed.
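Because the version pin and the direct wheel URL are edited separately, it is easy for them to drift apart. The sketch below relies on the standard wheel file-name layout ({distribution}-{version}-{python}-{abi}-{platform}.whl, with an optional build tag) to read the version out of a URL and compare it with a pin. The helper names are hypothetical, and the example URL is illustrative only: it has not been checked against the PyTorch index.

```python
from urllib.parse import unquote, urlparse


def wheel_version_from_url(url):
    """Return (distribution, version) parsed from a wheel URL.

    "+" in a local version is often percent-encoded as %2B in URLs,
    so the file name is decoded before it is split.
    """
    filename = unquote(urlparse(url).path.rsplit("/", 1)[-1])
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel URL: {url}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) not in (5, 6):  # 6 parts when a build tag is present
        raise ValueError(f"unexpected wheel file name: {filename}")
    return parts[0], parts[1]


def pin_matches_wheel(pin, url):
    """Check that a 'name==version' pin agrees with the wheel behind `url`.

    Only the public version is compared, so "2.9.1" matches "2.9.1+cu129".
    """
    name, _, pinned = pin.partition("==")
    dist, wheel_version = wheel_version_from_url(url)
    same_name = name.strip().lower().replace("-", "_") == dist.lower()
    same_version = pinned.strip().split("+")[0] == wheel_version.split("+")[0]
    return same_name and same_version


# Illustrative URL; the version and platform tags are assumptions.
url = "https://download.pytorch.org/whl/cu129/torch-2.9.1%2Bcu129-cp311-cp311-manylinux_2_28_x86_64.whl"
print(wheel_version_from_url(url))
print(pin_matches_wheel("torch==2.9.1", url))
print(pin_matches_wheel("torch==2.8.0", url))
```

The same comparison could be run for each related package whose source is a direct URL, before uv lock is invoked.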

Scenario 4: Update transformers Version

transformers is pinned by the transformers-stable dependency group (pyproject.toml -> [dependency-groups] transformers-stable), which is listed in [tool.uv] default-groups so uv sync installs it automatically.

Bump within v5 (e.g. 5.2.0 → 5.3.0):

1. Edit the pinned version in [dependency-groups] transformers-stable.
2. Regenerate the lockfile and sync:

uv lock
uv sync --extra gpu --dev

3. Check for API breakage and adjust veomni/ accordingly. Forward-looking guards can be written with is_transformers_version_greater_or_equal_to() from veomni/utils/import_utils.py.
4. Run tests: pytest tests/models/ tests/e2e/
5. Regenerate model patches: make patchgen (with the target transformers version installed).
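To illustrate what a forward-looking version guard does, here is a self-contained stand-in. It does not import or reproduce is_transformers_version_greater_or_equal_to(), whose signature is not shown here; version_at_least and pick_code_path are hypothetical names, and the comparison deliberately ignores pre-release suffixes.

```python
import re


def version_at_least(installed, minimum):
    """Compare the numeric major.minor.patch parts of two version strings.

    Simplification: pre-release and local suffixes are ignored, so
    "5.3.0rc1" is treated like "5.3.0".
    """

    def release(version):
        numbers = re.findall(r"\d+", version.split("+")[0])[:3]
        numbers += ["0"] * (3 - len(numbers))
        return tuple(int(n) for n in numbers)

    return release(installed) >= release(minimum)


def pick_code_path(installed_version):
    """Choose a branch based on the installed version (labels are illustrative)."""
    if version_at_least(installed_version, "5.3.0"):
        return "path for 5.3.0 and later"
    return "path for earlier releases"


print(pick_code_path("5.2.0"))
print(pick_code_path("5.10.1"))
```

Comparing integer tuples rather than raw strings matters: as strings, "5.10.1" would sort before "5.3.0".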

Scenario 5: Regenerate Lockfile Only

When uv.lock is out of sync or corrupt:

uv lock
uv sync --extra gpu --dev

If uv lock fails due to version conflicts, check:

[tool.uv] -> conflicts declarations
override-dependencies markers
Direct wheel URL availability
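For the last item, a quick way to see which URLs need checking is to list every url entry under [tool.uv.sources]. The sketch below works on an already-parsed pyproject dictionary (for example, one produced by the standard-library tomllib module on Python 3.11+). The function names and the example data are hypothetical, and the reachability probe is defined but not called so that the example runs offline.

```python
import urllib.request


def direct_urls(pyproject):
    """Return (package, url) pairs for every url entry in [tool.uv.sources].

    A source may be a single table or a list of tables, so both are handled.
    """
    sources = pyproject.get("tool", {}).get("uv", {}).get("sources", {})
    found = []
    for package, entry in sources.items():
        entries = entry if isinstance(entry, list) else [entry]
        for item in entries:
            if isinstance(item, dict) and "url" in item:
                found.append((package, item["url"]))
    return found


def url_is_reachable(url, timeout=10.0):
    """Send a HEAD request and report whether the server answered successfully."""
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return 200 <= response.status < 400
    except OSError:  # covers URLError, HTTPError, and timeouts
        return False


# Made-up data in the shape of a parsed pyproject.toml.
example = {
    "tool": {
        "uv": {
            "sources": {
                "torch": [
                    {"url": "https://example.invalid/torch-a.whl", "extra": "gpu"},
                    {"index": "some-index", "extra": "npu"},
                ],
                "flash-attn": {"url": "https://example.invalid/flash_attn-b.whl"},
            }
        }
    }
}
for package, url in direct_urls(example):
    print(package, url)
```

Each returned URL can then be passed to url_is_reachable() on a machine with network access.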

Common Pitfalls

Forgetting to update Docker: uv and torch version changes must be reflected in the Dockerfiles under docker/; otherwise CI builds will fail.
Partial torch updates: updating torch without moving torchvision/torchaudio/torchcodec to matching versions causes import errors.
flash-attn wheel mismatch: flash-attn wheels are built for specific torch+CUDA combinations, so a torch version bump requires finding or building new wheels.
Committing only pyproject.toml: always commit uv.lock with it. Docker builds use --locked, which requires the lockfile to match.
override-dependencies markers: the extra == 'gpu' markers in the overrides are critical. Removing them causes uv to download the wrong torch variants from PyPI.
no-build-isolation: flash-attn and flash-attn-3 are listed under no-build-isolation-package, so they require torch to be installed first. If sync fails, run uv sync without these extras first, then add them.

package.json

"author": "ByteDance-Seed"

"repository": "ByteDance-Seed/VeOmni"
