원클릭으로 Manus에서 모든 스킬 실행

$pwd:

fastvideo-model-porting-alignment

Name: Fastvideo Model Porting Alignment
Author: hao-ai-lab

// Ports new models into FastVideo with strict numerical alignment to official implementations. Use when adding a FastVideo model/pipeline, porting an official or Diffusers checkpoint, or debugging parity/alignment.

Manus에서 실행

$ git log --oneline --stat

stars:17

forks:5

updated:2026년 2월 20일 04:04

SKILL.md

readonly

name	FastVideo Model Porting & Alignment
description	Ports new models into FastVideo with strict numerical alignment to official implementations. Use when adding a FastVideo model/pipeline, porting an official or Diffusers checkpoint, or debugging parity/alignment.
variables	["goal","model_name","official_repo","official_weights_path"]
category	skill

FastVideo Model Porting & Alignment

Goal

Inputs

Model name: {{model_name}}
Official repo: {{official_repo}}
Official weights path: {{official_weights_path}}

If an input is missing, infer a sensible value and continue.

Source of Truth

Read and follow:

docs/contributing/coding_agents.md
docs/design/overview.md
docs/contributing/testing.md

Reference:

https://haoailab.com/FastVideo/contributing/coding_agents/#faq

Non-Negotiable Workflow

Isolate workspace first (worktree if already in FastVideo; clone otherwise).
Set isolated directory as default task root.
Fetch official code + weights.
Write PLAN.md before implementation.
Execute PLAN.md step-by-step.
Treat parity/alignment as first-class acceptance criteria.

Never jump from idea to direct edits.

Step 0: Workspace Isolation (Required)

If current directory is a FastVideo repo

MODEL_SLUG="${MODEL_NAME:-{{model_name}}}"
ROOT="$(git rev-parse --show-toplevel)"
STAMP="$(date +%Y%m%d-%H%M%S)"
BRANCH="codex/port-${MODEL_SLUG}-${STAMP}"
WT_DIR="$(dirname "$ROOT")/FastVideo-${MODEL_SLUG}-${STAMP}"

git -C "$ROOT" worktree add -b "$BRANCH" "$WT_DIR"
cd "$WT_DIR"

If current directory is NOT a FastVideo repo

MODEL_SLUG="${MODEL_NAME:-{{model_name}}}"
STAMP="$(date +%Y%m%d-%H%M%S)"
TARGET_DIR="$HOME/dev/FastVideo-${MODEL_SLUG}-${STAMP}"

git clone https://github.com/hao-ai-lab/FastVideo.git "$TARGET_DIR"
cd "$TARGET_DIR"
git checkout -b "codex/port-${MODEL_SLUG}-${STAMP}"

From this point on, use the isolated directory as the root for all commands.

Step 0.5: Fetch Official Repo + Weights

Clone the official implementation under the FastVideo root.
Download official weights to official_weights/<model_name>/.
If a valid Diffusers-format repo exists, prefer direct download.

Example:

python scripts/huggingface/download_hf.py \
  --repo_id "{{official_repo}}" \
  --local_dir "official_weights/{{model_name}}" \
  --repo_type model

Step 1: Plan First (`PLAN.md`)

Create PLAN.md with checkboxes and acceptance criteria.

Minimum sections:

Scope and constraints
Architecture mapping (official -> FastVideo)
Component parity milestones (DiT first)
Pipeline wiring milestones
Testing milestones (component parity, pipeline parity, SSIM)
Example + docs milestones
Risks and mitigations

Step 2: Implement in Parity-First Order

2.1 Model + mapping first

Add/extend model in fastvideo/models/...
Add config + param_names_mapping in fastvideo/configs/models/...
Reuse existing FastVideo layers where possible
Attention convention:
- DistributedAttention for full-sequence self-attention in DiT
- LocalAttention for cross-attention / non-global attention

FAQ rule: implement FastVideo model shape/naming first, then finalize conversion/mapping.

2.2 Numerical alignment immediately

Add component parity tests under tests/local_tests/...
Compare official vs FastVideo outputs with fixed seeds/inputs
Start with atol=1e-4, rtol=1e-4
Keep dtype consistent (bf16 if available, else fp32)
Use load_state_dict(strict=False) while iterating mapping

If parity fails, debug in this order:

Fix key mapping (param_names_mapping)
Align attention backend (for example FASTVIDEO_ATTENTION_BACKEND=TORCH_SDPA)
Align scheduler/sigma/timestep behavior
Add activation logging to locate first divergence

2.3 Repeat for each component

Repeat model+mapping+parity for each required component:

DiT
VAE
Encoder/tokenizer
Any model-specific extras

2.4 Pipeline integration

Add pipeline config in fastvideo/configs/pipelines/
Add sampling defaults in fastvideo/configs/sample/
Register via explicit register_configs(...) in fastvideo/registry.py
Add pipeline logic in fastvideo/pipelines/basic/<pipeline>/
Add reusable stages in fastvideo/pipelines/stages/

2.5 End-to-end validation

Add pipeline parity tests: tests/local_tests/pipelines/
Add SSIM tests: fastvideo/tests/ssim/
Add minimal runnable example: examples/inference/basic/
Run locally and generate a video sample

2.6 Documentation

Update docs/ with:

usage
constraints
memory/speed caveats
backend requirements

Diffusers vs Conversion Rule

If Diffusers-format exists and loads correctly:

skip conversion script
focus on mapping + parity

If not:

add conversion script
stage converted output at converted_weights/<model_name>/
still validate parity against official implementation

Alignment Gate (Must Pass)

Mapping rules explicit and reviewed
Missing/unexpected key mismatches resolved
Component parity tests passing
Pipeline parity checks passing
SSIM regression checks passing
Example script generates expected video
Documentation updated

Required Output

When using this skill, always provide:

Isolated workspace path
Branch name
Completed PLAN.md
Execution progress per plan item
Final parity/test summary + residual risks

related-skills.json

같은 저장소

alert-handler.md

from "hao-ai-lab/research-agent"

System prompt for handling experiment alerts. Provides diagnosis guidance, GPU wrapper context, action suggestions, and structured response from allowed choices.

2026-02-2017

agent-mode-research-assistant.md

from "hao-ai-lab/research-agent"

Default system prompt for agent chat mode. Provides identity, environment context, compute awareness, API-driven job submission, and workflow reflection.

2026-02-2017

plan-mode-planning-assistant.md

from "hao-ai-lab/research-agent"

Generates a structured experiment plan with compute-aware recommendations and saves it via the plan API endpoint.

2026-02-2017

wild-v2-steer.md

from "hao-ai-lab/research-agent"

Wraps user steering input with context signals for the model during a wild loop session

2026-02-2017

wild-v2-execution-ops-protocol.md

from "hao-ai-lab/research-agent"

Single source of truth protocol for Wild V2 preflight, sweep/run auditability, GPU discovery, and parallel scheduling

2026-02-2017

wild-v2-gpu-discovery-parallel-scheduling.md

from "hao-ai-lab/research-agent"

Protocol for GPU discovery and parallel run scheduling across local GPU and Slurm clusters

2026-02-2017

package.json

"author": "hao-ai-lab"

"repository": "hao-ai-lab/research-agent"

GitHub 저장소 열기 Creator 저장소 보기

$ install --global

$ download --local

Manus에서 실행

$ useful --forSOC

소프트웨어 개발자컴퓨터 및 수학직15-1252L4

name	FastVideo Model Porting & Alignment
description	Ports new models into FastVideo with strict numerical alignment to official implementations. Use when adding a FastVideo model/pipeline, porting an official or Diffusers checkpoint, or debugging parity/alignment.
variables	["goal","model_name","official_repo","official_weights_path"]
category	skill

FastVideo Model Porting & Alignment

Goal

Inputs

Model name: {{model_name}}
Official repo: {{official_repo}}
Official weights path: {{official_weights_path}}

If an input is missing, infer a sensible value and continue.

Source of Truth

Read and follow:

docs/contributing/coding_agents.md
docs/design/overview.md
docs/contributing/testing.md

Reference:

https://haoailab.com/FastVideo/contributing/coding_agents/#faq

Non-Negotiable Workflow

Isolate workspace first (worktree if already in FastVideo; clone otherwise).
Set isolated directory as default task root.
Fetch official code + weights.
Write PLAN.md before implementation.
Execute PLAN.md step-by-step.
Treat parity/alignment as first-class acceptance criteria.

Never jump from idea to direct edits.

Step 0: Workspace Isolation (Required)

If current directory is a FastVideo repo

MODEL_SLUG="${MODEL_NAME:-{{model_name}}}"
ROOT="$(git rev-parse --show-toplevel)"
STAMP="$(date +%Y%m%d-%H%M%S)"
BRANCH="codex/port-${MODEL_SLUG}-${STAMP}"
WT_DIR="$(dirname "$ROOT")/FastVideo-${MODEL_SLUG}-${STAMP}"

git -C "$ROOT" worktree add -b "$BRANCH" "$WT_DIR"
cd "$WT_DIR"

If current directory is NOT a FastVideo repo

MODEL_SLUG="${MODEL_NAME:-{{model_name}}}"
STAMP="$(date +%Y%m%d-%H%M%S)"
TARGET_DIR="$HOME/dev/FastVideo-${MODEL_SLUG}-${STAMP}"

git clone https://github.com/hao-ai-lab/FastVideo.git "$TARGET_DIR"
cd "$TARGET_DIR"
git checkout -b "codex/port-${MODEL_SLUG}-${STAMP}"

From this point on, use the isolated directory as the root for all commands.

Step 0.5: Fetch Official Repo + Weights

Clone the official implementation under the FastVideo root.
Download official weights to official_weights/<model_name>/.
If a valid Diffusers-format repo exists, prefer direct download.

Example:

python scripts/huggingface/download_hf.py \
  --repo_id "{{official_repo}}" \
  --local_dir "official_weights/{{model_name}}" \
  --repo_type model

Step 1: Plan First (`PLAN.md`)

Create PLAN.md with checkboxes and acceptance criteria.

Minimum sections:

Scope and constraints
Architecture mapping (official -> FastVideo)
Component parity milestones (DiT first)
Pipeline wiring milestones
Testing milestones (component parity, pipeline parity, SSIM)
Example + docs milestones
Risks and mitigations

Step 2: Implement in Parity-First Order

2.1 Model + mapping first

Add/extend model in fastvideo/models/...
Add config + param_names_mapping in fastvideo/configs/models/...
Reuse existing FastVideo layers where possible
Attention convention:
- DistributedAttention for full-sequence self-attention in DiT
- LocalAttention for cross-attention / non-global attention

FAQ rule: implement FastVideo model shape/naming first, then finalize conversion/mapping.

2.2 Numerical alignment immediately

Add component parity tests under tests/local_tests/...
Compare official vs FastVideo outputs with fixed seeds/inputs
Start with atol=1e-4, rtol=1e-4
Keep dtype consistent (bf16 if available, else fp32)
Use load_state_dict(strict=False) while iterating mapping

If parity fails, debug in this order:

Fix key mapping (param_names_mapping)
Align attention backend (for example FASTVIDEO_ATTENTION_BACKEND=TORCH_SDPA)
Align scheduler/sigma/timestep behavior
Add activation logging to locate first divergence

2.3 Repeat for each component

Repeat model+mapping+parity for each required component:

DiT
VAE
Encoder/tokenizer
Any model-specific extras

2.4 Pipeline integration

Add pipeline config in fastvideo/configs/pipelines/
Add sampling defaults in fastvideo/configs/sample/
Register via explicit register_configs(...) in fastvideo/registry.py
Add pipeline logic in fastvideo/pipelines/basic/<pipeline>/
Add reusable stages in fastvideo/pipelines/stages/

2.5 End-to-end validation

Add pipeline parity tests: tests/local_tests/pipelines/
Add SSIM tests: fastvideo/tests/ssim/
Add minimal runnable example: examples/inference/basic/
Run locally and generate a video sample

2.6 Documentation

Update docs/ with:

usage
constraints
memory/speed caveats
backend requirements

Diffusers vs Conversion Rule

If Diffusers-format exists and loads correctly:

skip conversion script
focus on mapping + parity

If not:

add conversion script
stage converted output at converted_weights/<model_name>/
still validate parity against official implementation

Alignment Gate (Must Pass)

Mapping rules explicit and reviewed
Missing/unexpected key mismatches resolved
Component parity tests passing
Pipeline parity checks passing
SSIM regression checks passing
Example script generates expected video
Documentation updated

Required Output

When using this skill, always provide:

Isolated workspace path
Branch name
Completed PLAN.md
Execution progress per plan item
Final parity/test summary + residual risks

fastvideo-model-porting-alignment

FastVideo Model Porting & Alignment

Goal

Inputs

Source of Truth

Non-Negotiable Workflow

Step 0: Workspace Isolation (Required)

If current directory is a FastVideo repo

If current directory is NOT a FastVideo repo

Step 0.5: Fetch Official Repo + Weights

Step 1: Plan First (PLAN.md)

Step 2: Implement in Parity-First Order

2.1 Model + mapping first

2.2 Numerical alignment immediately

2.3 Repeat for each component

2.4 Pipeline integration

2.5 End-to-end validation

2.6 Documentation

Diffusers vs Conversion Rule

Alignment Gate (Must Pass)

Required Output

이 저장소의 다른 Skills

이 저장소의 다른 Skills

FastVideo Model Porting & Alignment

Goal

Inputs

Source of Truth

Non-Negotiable Workflow

Step 0: Workspace Isolation (Required)

If current directory is a FastVideo repo

If current directory is NOT a FastVideo repo

Step 0.5: Fetch Official Repo + Weights

Step 1: Plan First (PLAN.md)

Step 2: Implement in Parity-First Order

2.1 Model + mapping first

2.2 Numerical alignment immediately

2.3 Repeat for each component

2.4 Pipeline integration

2.5 End-to-end validation

2.6 Documentation

Diffusers vs Conversion Rule

Alignment Gate (Must Pass)

Required Output

Step 1: Plan First (`PLAN.md`)

Step 1: Plan First (`PLAN.md`)