review-pr

Name: Review PR
Author: intel

// Review a pull request for the AutoRound repository with a structured checklist covering code quality, test coverage, documentation, Chinese translations, and quantization-specific concerns. Use when reviewing or preparing to submit a PR.

$ git log --oneline --stat

stars: 1,425

forks: 134

updated: April 17, 2026 at 02:35

SKILL.md

name	review-pr
description	Review a pull request for the AutoRound repository with a structured checklist covering code quality, test coverage, documentation, Chinese translations, and quantization-specific concerns. Use when reviewing or preparing to submit a PR.

Pull Request Review Workflow for AutoRound

Overview

This skill provides a structured workflow for reviewing pull requests in the AutoRound repository. It covers code quality, testing, documentation, and project-specific requirements like Chinese translation parity.

Review Checklist

1. Code Quality

Code follows existing patterns in the codebase (decorator registration, factory patterns, etc.)
No hardcoded paths or credentials
Proper error handling at system boundaries
No unnecessary abstractions or over-engineering
Import organization follows existing conventions

Apache 2.0 license header present on new files:

# Copyright (c) 2025 Intel Corporation
#
# Licensed under the Apache License, Version 2.0 (the "License");
# ...
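
The header check can be automated during review. The following is a minimal sketch, assuming that matching two marker strings from the header above is sufficient; the helper names `has_license_header` and `files_missing_header` are hypothetical and not part of AutoRound.

```python
from pathlib import Path

# Marker strings taken from the header shown above. The copyright year is
# deliberately not matched, since it can differ from file to file.
REQUIRED_MARKERS = (
    "Intel Corporation",
    'Licensed under the Apache License, Version 2.0 (the "License");',
)


def has_license_header(text: str, max_lines: int = 20) -> bool:
    """Return True if every marker appears within the first max_lines lines."""
    head = "\n".join(text.splitlines()[:max_lines])
    return all(marker in head for marker in REQUIRED_MARKERS)


def files_missing_header(paths):
    """Return the paths whose contents lack the license header (hypothetical helper)."""
    return [
        p for p in paths
        if not has_license_header(Path(p).read_text(encoding="utf-8"))
    ]
```

A reviewer could pass the list of newly added `.py` files from the PR to `files_missing_header` and flag anything it returns.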

2. Quantization-Specific Concerns

Numerical stability: scale computation avoids division by zero
Gradient flow: uses round_ste() or equivalent STE for differentiable rounding
Tensor shapes: group_size reshaping handles padding correctly
dtype consistency: scale_dtype, compute_dtype used properly
Memory efficiency: no unnecessary tensor copies on GPU
Device handling: tensors moved to correct device before operations
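
To make the first and third items concrete, here is a pure-Python sketch of group-wise symmetric quantization with a zero-scale guard and explicit padding. It is an illustration under stated assumptions, not AutoRound's implementation: the function names, the `eps` floor, and the zero-padding strategy are all hypothetical, and plain lists stand in for tensors so the example runs without PyTorch.

```python
def pad_to_group_size(values, group_size):
    """Zero-pad a flat list so its length is a multiple of group_size."""
    remainder = len(values) % group_size
    if remainder == 0:
        return list(values)
    return list(values) + [0.0] * (group_size - remainder)


def symmetric_group_scales(values, group_size, bits=4, eps=1e-8):
    """Compute one symmetric scale per group, floored at eps."""
    padded = pad_to_group_size(values, group_size)
    qmax = 2 ** (bits - 1) - 1  # 7 for signed 4-bit
    scales = []
    for start in range(0, len(padded), group_size):
        group = padded[start:start + group_size]
        max_abs = max(abs(v) for v in group)
        # An all-zero group would give scale 0 and a division by zero later,
        # so the scale is floored at a small positive eps.
        scales.append(max(max_abs / qmax, eps))
    return scales


def fake_quantize(values, group_size, bits=4, eps=1e-8):
    """Quantize then dequantize, returning a list of the original length."""
    padded = pad_to_group_size(values, group_size)
    scales = symmetric_group_scales(values, group_size, bits, eps)
    qmax = 2 ** (bits - 1) - 1
    qmin = -qmax - 1
    out = []
    for i, v in enumerate(padded):
        scale = scales[i // group_size]
        q = min(max(round(v / scale), qmin), qmax)
        out.append(q * scale)
    return out[:len(values)]  # drop the padding again
```

When reviewing real code, the equivalent questions are whether an all-zero group can reach the division and whether the padded tail is removed before the result is written back.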

3. Registration Points

When the PR adds new functionality, verify all registration points are updated:

| Feature | Registration Location |
| --- | --- |
| Data type | `auto_round/data_type/__init__.py` import + `@register_dtype` |
| Export format | `auto_round/formats.py` `@OutputFormat.register()` |
| VLM model | `special_model_handler.py` `SPECIAL_MULTIMODAL_BLOCK` + lists |
| Backend | `auto_round/inference/backend.py` `BackendInfos` dict |
| Dataset | `auto_round/calib_dataset.py` `@register_dataset` |
| Scheme preset | `auto_round/schemes.py` `PRESET_SCHEMES` dict |
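
For readers unfamiliar with decorator-based registration, the sketch below shows the general pattern and why a forgotten import or decorator leaves a feature undiscoverable. Everything in it is a toy stand-in: `TOY_DTYPE_REGISTRY`, `register_toy_dtype`, and `missing_registrations` are hypothetical names, and AutoRound's real registries and decorator signatures may differ.

```python
from typing import Callable, Dict

# Hypothetical registry used only for illustration.
TOY_DTYPE_REGISTRY: Dict[str, Callable] = {}


def register_toy_dtype(name: str):
    """Decorator factory that stores the decorated function under name."""
    def decorator(func: Callable) -> Callable:
        if name in TOY_DTYPE_REGISTRY:
            raise ValueError(f"dtype {name!r} is already registered")
        TOY_DTYPE_REGISTRY[name] = func
        return func
    return decorator


@register_toy_dtype("toy_int4")
def quantize_toy_int4(value: float, scale: float) -> int:
    """Round to the nearest step and clamp to the signed 4-bit range."""
    return max(-8, min(7, round(value / scale)))


def missing_registrations(expected_names):
    """Return the expected names that never reached the registry."""
    return sorted(set(expected_names) - set(TOY_DTYPE_REGISTRY))
```

Registration only happens when the defining module is imported, which is why the table pairs the decorator with an import in `__init__.py` for data types.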

4. Test Coverage

New functionality has corresponding tests
Tests use existing fixtures (tiny_opt_model_path, dataloader, etc.)
Tests are placed in the correct backend directory (test_cpu/, test_cuda/, etc.)
Tests use minimal iterations (iters=2, nsamples=2) for speed
No flaky assertions (avoid exact float comparisons)
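
The last item is easy to get wrong. The sketch below uses a hypothetical `dequantize` helper to show why exact float comparison fails and how a tolerance-based assertion avoids it; in a pytest suite, `pytest.approx` or `torch.testing.assert_close` would be the usual choices.

```python
import math


def dequantize(q_values, scale):
    """Toy dequantization: integer code times scale (hypothetical helper)."""
    return [q * scale for q in q_values]


def test_dequantize_matches_reference():
    result = dequantize([1, 2, 3], 0.1)
    expected = [0.1, 0.2, 0.3]
    # 3 * 0.1 is not exactly 0.3 in binary floating point, so
    # `assert result == expected` would fail here.
    assert all(
        math.isclose(r, e, rel_tol=1e-6, abs_tol=1e-9)
        for r, e in zip(result, expected)
    )
```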

5. Documentation

README.md updated if user-facing features change
Chinese translation updated: Any changes to *.md files must have corresponding updates in their *_CN.md counterparts:
- README.md → README_CN.md
- docs/step_by_step.md → docs/step_by_step_CN.md
- docs/environments.md → docs/environments_CN.md
Translation maintains equivalent content and structure (not just copied English text)
Docstrings added for new public APIs

6. Contributing Requirements

Commits are signed off (git commit -s) per DCO
No unrelated changes mixed in
PR description clearly explains the motivation and changes
Breaking changes are called out explicitly
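
The sign-off requirement can be checked mechanically. `git commit -s` appends a `Signed-off-by: Name <email>` trailer to the commit message, so a sketch like the following can scan the messages for it. The function names are hypothetical, and the messages are assumed to have been collected beforehand, for example from `git log --format=%B`.

```python
import re

# Matches the trailer that `git commit -s` appends to a commit message.
SIGN_OFF = re.compile(r"^Signed-off-by: .+ <[^<>\s]+@[^<>\s]+>$", re.MULTILINE)


def is_signed_off(message: str) -> bool:
    """Return True if the commit message contains a sign-off trailer."""
    return bool(SIGN_OFF.search(message))


def unsigned_commits(commits):
    """Given a mapping of commit id to full message, return ids lacking a sign-off."""
    return [sha for sha, message in commits.items() if not is_signed_off(message)]
```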

Chinese Translation Verification

This is a hard requirement for the AutoRound project. Use this procedure:

Identify modified markdown files:
```
git diff --name-only HEAD~1 -- '*.md'
```
Check for corresponding CN files: For each modified .md file, verify a _CN.md counterpart exists and is also modified:
- README.md → README_CN.md
- docs/step_by_step.md → docs/step_by_step_CN.md
- docs/environments.md → docs/environments_CN.md
Compare structure:
- Same number of sections/headings
- Same tables, code blocks, and links
- Equivalent content (not machine-translated gibberish)
Files that do NOT need CN translation (no _CN counterpart exists):
- CONTRIBUTING.md, CODE_OF_CONDUCT.md, SECURITY.md
- test/README.md
- docs/publication_list.md, docs/tips_and_tricks.md, accuracy result docs
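
The procedure above can be sketched as a small script. It assumes the list of changed files has already been obtained (for example from the `git diff --name-only` command shown earlier); the function names are hypothetical. The exemption set contains only the paths named explicitly above, so the "accuracy result docs" would need to be added by hand.

```python
import re
from pathlib import PurePosixPath

# Paths listed above as having no _CN counterpart.
NO_CN_NEEDED = {
    "CONTRIBUTING.md", "CODE_OF_CONDUCT.md", "SECURITY.md",
    "test/README.md",
    "docs/publication_list.md", "docs/tips_and_tricks.md",
}


def cn_counterpart(path: str) -> str:
    """Map docs/foo.md to docs/foo_CN.md."""
    p = PurePosixPath(path)
    return str(p.with_name(f"{p.stem}_CN{p.suffix}"))


def missing_cn_updates(changed_files):
    """Return changed .md files whose _CN.md counterpart was not also changed."""
    changed = set(changed_files)
    missing = []
    for path in sorted(changed):
        if not path.endswith(".md") or path.endswith("_CN.md"):
            continue
        if path in NO_CN_NEEDED:
            continue
        if cn_counterpart(path) not in changed:
            missing.append(path)
    return missing


def heading_count(markdown_text: str) -> int:
    """Count ATX headings, skipping lines inside fenced code blocks."""
    count, in_fence = 0, False
    for line in markdown_text.splitlines():
        if line.lstrip().startswith("```"):
            in_fence = not in_fence
        elif not in_fence and re.match(r"#{1,6}\s", line):
            count += 1
    return count
```

Comparing `heading_count` for a file and its `_CN.md` counterpart gives a quick structural check. It cannot judge translation quality, which still needs a human reader.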

Common Issues to Watch For

Quantization Bugs

Scale overflow: Large models with small group_size can produce FP16 overflow in scales. Check for torch.clamp or torch.finfo guards.
Asymmetric zero-point drift: Zero-points must be integer-rounded for INT quantization.
GGUF super-block alignment: GGUF formats require specific block sizes (typically 256 elements). Verify padding/alignment logic.
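
The three checks above reduce to small pieces of arithmetic. The following sketch shows what a guard for each might look like in plain Python. The helper names are hypothetical, the 256-element block size is the typical value mentioned above, and real code would apply the same logic to tensors (for example with `torch.clamp`).

```python
FP16_MAX = 65504.0       # largest finite IEEE 754 half-precision value
GGUF_SUPER_BLOCK = 256   # typical super-block size; format-specific in practice


def clamp_scale_to_fp16(scale: float) -> float:
    """Keep a scale inside the finite FP16 range so it cannot overflow to inf."""
    return max(-FP16_MAX, min(FP16_MAX, scale))


def padded_length(n: int, block: int = GGUF_SUPER_BLOCK) -> int:
    """Round n up to the next multiple of block using ceiling division."""
    return -(-n // block) * block


def integer_zero_point(min_val: float, scale: float, qmin: int = 0, qmax: int = 15) -> int:
    """Round the asymmetric zero-point to an integer and clamp it to [qmin, qmax]."""
    zero_point = round(-min_val / scale)
    return max(qmin, min(qmax, zero_point))
```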

Export Compatibility

Format detection: Verify quantize_config.json or equivalent metadata is saved correctly for the target framework to detect.
Weight name mapping: Ensure packed weight names match what the inference framework expects.
Mixed-precision layers: Layers excluded from quantization (e.g., lm_head) must be saved in their original format.

Backend Selection

Priority conflicts: New backends should not override existing backends unless intentional. Check priority values.
Feature checker coverage: Ensure checkers don't silently reject valid layers (test with real model shapes).

related-skills.json

same repository

adapt-new-diffusion-model.md

from "intel/auto-round"

Adapt AutoRound to support a new diffusion model architecture (DiT, UNet, hybrid AR+DiT). Use when a new diffusion model fails quantization, needs custom output configs, requires a custom pipeline function, or is a hybrid architecture with both autoregressive and diffusion components.

2026-05-14 · 1.4k

adapt-new-llm.md

from "intel/auto-round"

Adapt AutoRound to support a new LLM architecture that doesn't work out-of-the-box. Use when quantization fails for a new model type, block detection doesn't find layers, MoE models need unfusing, custom forward passes are needed, or non-standard linear layer types need handling.

2026-05-14 · 1.4k

add-vlm-model.md

from "intel/auto-round"

Add support for a new Vision-Language Model (VLM) to AutoRound, including multimodal block handler, calibration dataset template, and special model handling. Use when integrating a new VLM like LLaVA, Qwen2-VL, GLM-Image, Phi-Vision, or similar multi-modal models for quantization.

2026-05-14 · 1.4k

add-inference-backend.md

from "intel/auto-round"

Add a new hardware inference backend to AutoRound for deploying quantized models (e.g., CUDA/Marlin, Triton, CPU, HPU, ARK). Use when implementing QuantLinear kernels, registering backend capabilities, or enabling quantized model inference on a new hardware platform.

2026-05-11 · 1.4k

add-export-format.md

from "intel/auto-round"

Add a new model export format to AutoRound (e.g., auto_round, auto_gptq, auto_awq, gguf, llm_compressor). Use when implementing a new quantized model serialization format, adding a new packing method, or extending export compatibility for deployment frameworks like vLLM, SGLang, or llama.cpp.

2026-04-17 · 1.4k

add-quantization-datatype.md

from "intel/auto-round"

Add a new quantization data type to AutoRound (e.g., INT, FP8, MXFP, NVFP, GGUF variants). Use when implementing a new weight/activation quantization scheme, registering a new quant function, or extending the data_type registry.

2026-04-17 · 1.4k

package.json

"author": "intel"

"repository": "intel/auto-round"

$ useful --forSOC

Software Quality Assurance Analysts and Testers · Computer and Mathematical Occupations · 15-1253 · L4
