一键在 Manus 中运行任何 Skill

$pwd:

ff-new-model

Name: Ff New Model
Author: X-GenGroup

// Complete workflow for adding a new model adapter. Covers analysis, sample dataclass, adapter implementation (4 abstract methods + per-modality encoder overrides), registry, example YAML, and verification. Trigger: 'add model', 'support new model', 'integrate model', 'new adapter'.

在 Manus 中运行

$ git log --oneline --stat

stars:539

forks:40

updated:2026年4月25日 14:24

SKILL.md

readonly

related-skills.json

同仓库

ff-new-algorithm.md

from "X-GenGroup/Flow-Factory"

Complete workflow for adding a new RL training algorithm. Covers paradigm selection, TrainingArguments subclass, trainer implementation, registry, example config, and verification. Trigger: 'add algorithm', 'new trainer', 'new training method', 'implement algorithm'.

2026-05-24539

ff-review.md

from "X-GenGroup/Flow-Factory"

Mandatory pre-commit code review gate. Checks constraint violations, cross-module consistency, and implementation quality. Trigger proactively when changes span multiple files or touch shared infrastructure. Trigger: 'review', 'check before commit'.

2026-05-17539

ff-develop.md

from "X-GenGroup/Flow-Factory"

Feature development with cross-module impact analysis. Covers trainer hierarchy, model adapters, reward pipeline, config system, sample dataclasses, and distributed training paths. Trigger: 'add feature', 'implement', 'refactor', 'reorganize', 'new capability'.

2026-04-25539

ff-debug.md

from "X-GenGroup/Flow-Factory"

Bug fixing and debugging for ANY error, crash, loss divergence, gradient explosion, distributed hang, NaN, or unexpected behavior. Covers quick fixes and full protocol with 5-phase investigation. Trigger: 'fix bug', 'fix error', 'broken', 'crash', 'doesn't work', 'fails with', 'loss NaN', 'training hangs', 'OOM'.

2026-04-08539

ff-new-reward.md

from "X-GenGroup/Flow-Factory"

Complete workflow for adding a new reward model. Covers pointwise vs groupwise design, __call__ contract, registration, YAML config, multi-reward setup, and verification. Trigger: 'add reward', 'new reward model', 'custom reward', 'scoring function'.

2026-04-06539

package.json

"author": "X-GenGroup"

"repository": "X-GenGroup/Flow-Factory"

打开 GitHub 仓库查看创作者相关仓库

$ install --global

$ download --local

在 Manus 中运行

$ useful --forSOC

软件开发工程师计算机与数学类职业15-1252L4

name	ff-new-model
description	Complete workflow for adding a new model adapter. Covers analysis, sample dataclass, adapter implementation (4 abstract methods + per-modality encoder overrides), registry, example YAML, and verification. Trigger: 'add model', 'support new model', 'integrate model', 'new adapter'.

New Model Adapter Integration

Authoritative reference: guidance/new_model.md — read it first.

Prerequisites

Before starting, ensure you understand:

The target model's diffusers pipeline (or that you'll need a pseudo-pipeline)
The task type: Text-to-Image, Image-to-Image, Text-to-Video, Image-to-Video
Which Sample dataclass to extend

Phase 1: Analysis

Identify the diffusers pipeline for the target model
- Check if it exists in diffusers: from diffusers import <Pipeline>
- If not, you'll need a pseudo-pipeline (see guidance/new_model.md advanced section)
Study an existing adapter of the same task type:
- T2I: models/flux/flux1.py or models/stable_diffusion/sd3_5.py
- I2I: models/flux/flux1_kontext.py or models/qwen_image/qwen_image_edit_plus.py
- T2V: models/wan/wan2_t2v.py
- I2V: models/wan/wan2_i2v.py
Map pipeline components to adapter responsibilities:
- Text encoders → encode_prompt(), preprocessing_modules
- VAE → encode_image() / decode_latents(), preprocessing_modules
- Audio encoder/VAE (if any) → encode_audio(), preprocessing_modules
- Transformer/UNet → forward(), target_module_map, inference_modules
Also read: topics/adapter_conventions.md for upstream alignment rules; topics/dtype_precision.md for precision handling in cast_latents().

Phase 2: Implementation

Step 1 — Define Sample Dataclass

# src/flow_factory/models/<family>/<model>.py
@dataclass
class MyModelSample(T2ISample):  # or appropriate base
    _shared_fields: ClassVar[frozenset[str]] = frozenset({})
    # Add model-specific fields if needed

Step 2 — Create Adapter Class

class MyModelAdapter(BaseAdapter):

    @property
    def preprocessing_modules(self) -> List[str]:
        return ["text_encoder", "vae"]  # Components for Stage 1

    @property
    def inference_modules(self) -> List[str]:
        return ["vae"]  # Components needed at inference time

    @property
    def target_module_map(self) -> Dict[str, str]:
        return {"transformer": "transformer"}  # Trainable components

Step 3 — Implement Required Methods

Method	Purpose	Stage	Abstract?
`load_pipeline()`	Load diffusers pipeline	Init	Yes
`decode_latents()`	Latents → pixels	3	Yes
`inference()`	Full multi-step denoising	3	Yes
`forward()`	Single-step denoising loss	6	Yes
`encode_prompt()`	Text → embeddings	1	No (no-op default; override if your model consumes text)
`encode_image()`	Image → latents	1	No (no-op default; override if your model consumes images)
`encode_video()`	Video frames → latents	1	No (no-op default; override if your model consumes videos)
`encode_audio()`	Audio → embeddings/features	1	No (no-op default; override if your model consumes audio)
`preprocess_func()`	Raw inputs → cached tensors (dispatches to the 4 encoders)	1	No (concrete, override only for cross-modal preprocessing)

Step 4 — Register

Add to _MODEL_ADAPTER_REGISTRY in src/flow_factory/models/registry.py:

'my-model': 'flow_factory.models.<family>.<model>.MyModelAdapter',

Phase 3: Configuration

Create example YAML config in examples/grpo/lora/<model>/default.yaml:

model:
  model_type: "my-model"
  model_path: "org/model-name"
  finetune_type: "lora"
  target_components: ["transformer"]

Phase 4: Verification

Also read: topics/parity_testing.md for the 4-layer verification protocol.

load_pipeline() successfully loads the model
preprocess_func() produces correct cached tensors
inference() generates valid images/videos
forward() computes loss without errors
Training runs end-to-end with GRPO for ≥2 steps
LoRA weights save and reload correctly
Registry entry resolves correctly: get_model_adapter_class('my-model')
Example YAML config is valid and complete

Common Pitfalls

Forgetting to set preprocessing_modules — causes text encoder to stay on GPU, OOM during training
Wrong target_module_map — LoRA applied to wrong components, no training effect
Mismatched _shared_fields — data corruption during batch collation
Not handling enable_preprocess=False — encoding components not loaded at inference time
Inconsistent custom field types across samples — if a custom sample field is Tensor on some samples and List[Tensor] on others, gather_samples will fall back to slow pickle-based gather_object. Always canonicalize to a single type in __post_init__; prefer List[Tensor] for variable-length data.
Wrong images/condition_images/audios convention — preprocess_func(), encode_image(), encode_video(), encode_audio(), and inference() all operate at batch level: images is List[List[Image.Image]] (MultiImageBatch), condition_images is List[List[Tensor(C,H,W)]], and audios is List[List[Tensor]] (MultiAudioBatch), where the outer list indexes samples in the batch and the inner list holds each sample's items. Empty samples contribute [] (never None); single-item samples contribute [item] (never a bare element). Never pass a flat List[Image] / List[Tensor] or unwrap single-element lists — that breaks Arrow's homogeneous-column requirement and forces every downstream consumer to handle three input shapes. For single-condition models, _standardize_image_input / _standardize_video_input must detect the nested format with is_multi_image_batch / is_multi_video_batch, extract the first element per sample ([batch[0] for batch in images]), and warn if extra conditions are discarded (e.g. Wan2_I2V._standardize_image_input, LTX2_I2AV._standardize_image_input). See topics/adapter_conventions.md Gotcha #5 and #6.

ff-new-model

同仓库更多 Skills

同仓库更多 Skills

New Model Adapter Integration

Prerequisites

Phase 1: Analysis

Phase 2: Implementation

Step 1 — Define Sample Dataclass

Step 2 — Create Adapter Class

Step 3 — Implement Required Methods

Step 4 — Register

Phase 3: Configuration

Phase 4: Verification

Common Pitfalls

New Model Adapter Integration

Prerequisites

Phase 1: Analysis

Phase 2: Implementation

Step 1 — Define Sample Dataclass

Step 2 — Create Adapter Class

Step 3 — Implement Required Methods

Step 4 — Register

Phase 3: Configuration

Phase 4: Verification

Common Pitfalls