documentation-generator

Name: Documentation Generator
Author: albumentations-team

Updates benchmark documentation with latest results including README tables, speedup plots, and library metadata. Use when updating documentation, generating comparison tables, or when the user mentions update_docs.sh or documentation generation.

stars:87

forks:3

updated: May 6, 2026, 15:45

Related skills (same repository)

paper-coverage-validator.md

from "albumentations-team/benchmark"

Validates whether benchmark artifacts cover the paper's required RGB micro and RGB DataLoader sections. Use when checking missing RGB runs, deciding what to run next, validating gcp_runs/output folders, or preparing paper tables.

2026-05-06 (87 stars)

benchmark-runner.md

from "albumentations-team/benchmark"

Automates running image/video augmentation benchmarks for single or multiple libraries, validates outputs, generates comparison reports, and updates documentation. Use when running benchmarks, comparing library performance, or when the user mentions benchmark, benchmark.cli, pyperf, GCP benchmark runs, or performance testing.

2026-05-06 (87 stars)

gcp-benchmark-triage.md

from "albumentations-team/benchmark"

Triage detached GCP benchmark runs, DONE/FAILED sentinels, VM cleanup, vm.log, gcp_last_run.json, and partial result downloads. Use when GCP benchmark logs mention DONE, FAILED, exit_code.txt, VM disappeared, STOPPING, gcloud machine type errors, or missing artifacts.

2026-05-06 (87 stars)

library-integration.md

from "albumentations-team/benchmark"

Guides adding support for new image/video augmentation libraries to the benchmark suite. Use when integrating a new library, adding library support, or when the user mentions adding a new augmentation library to test.

2026-05-06 (87 stars)

paper-benchmark-execution.md

from "albumentations-team/benchmark"

Executes the paper benchmark plan for RGB, multichannel, DataLoader, and video benchmarks. Use when the user mentions the paper benchmark, deadline plan, machine matrix, RGB micro, multichannel, DataLoader, video GPU, c4/c4d/g2 machines, or what to run next.

2026-05-06 (87 stars)

performance-analysis.md

from "albumentations-team/benchmark"

Analyzes benchmark results to identify slow transforms, warmup issues, and performance regressions. Compares speedups across libraries and generates optimization recommendations. Use when analyzing performance, investigating slow benchmarks, or comparing library results.

2026-05-06 (87 stars)

package.json

"author": "albumentations-team"

"repository": "albumentations-team/benchmark"

Documentation Generator

Automate updating benchmark documentation with the latest results.

Quick Update

# Update all documentation
./tools/update_docs.sh

# Update with custom paths
./tools/update_docs.sh \
  --image-results output/ \
  --video-results output_videos/ \
  --docs-dir docs/

What Gets Updated

Architecture / Policy Docs

docs/benchmark_architecture.md - Control-plane and runner architecture.
docs/benchmark_scope.md - Paper benchmark scope, transform selection, pipeline recipes, and architecture source of truth.
docs/good_plots.md - Claim-to-plot guidance for benchmark paper figures.
.cursor/skills/benchmark-runner/SKILL.md - Agent-facing benchmark execution policy.
.cursor/skills/paper-benchmark-execution/SKILL.md - Agent-facing paper run policy.

Image Benchmarks

docs/images/README.md - Detailed results table
docs/images/images_speedup_analysis.webp - Speedup visualization
docs/images/images_speedups.csv - Raw speedup data
README.md - Main speedup summary

Video Benchmarks

docs/videos/README.md - Detailed results table
docs/videos/videos_speedup_analysis.webp - Speedup visualization
docs/videos/videos_speedups.csv - Raw speedup data
README.md - Main speedup summary
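One quick way to confirm that an update produced the generated artifacts listed above is to check that each file exists and is non-empty. The sketch below is a minimal illustration under that assumption; `EXPECTED_OUTPUTS` and `missing_outputs` are hypothetical names, not part of the repository's tooling.

```python
from pathlib import Path

# Generated files named in the Image Benchmarks and Video Benchmarks lists.
EXPECTED_OUTPUTS = [
    "README.md",
    "docs/images/README.md",
    "docs/images/images_speedup_analysis.webp",
    "docs/images/images_speedups.csv",
    "docs/videos/README.md",
    "docs/videos/videos_speedup_analysis.webp",
    "docs/videos/videos_speedups.csv",
]


def missing_outputs(repo_root):
    """Return the expected paths that are absent or empty under repo_root."""
    root = Path(repo_root)
    missing = []
    for rel in EXPECTED_OUTPUTS:
        path = root / rel
        if not path.is_file() or path.stat().st_size == 0:
            missing.append(rel)
    return missing


if __name__ == "__main__":
    # Run from the repository root; prints nothing when every file is present.
    for rel in missing_outputs("."):
        print(f"missing or empty: {rel}")
```

An empty result only shows that the files exist; it says nothing about whether their contents are current.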

Manual Documentation Steps

1. Generate Comparison Tables

Image benchmarks:

python tools/compare_results.py \
  --results-dir output/ \
  --update-readme docs/images/README.md

Video benchmarks:

python tools/compare_video_results.py \
  --results-dir output_videos/ \
  --update-readme docs/videos/README.md

2. Generate Speedup Plots

Image benchmarks:

python tools/generate_speedup_plots.py \
  --results-dir output/ \
  --output-dir docs/images \
  --type images \
  --reference-library albumentationsx

Video benchmarks:

python tools/generate_speedup_plots.py \
  --results-dir output_videos/ \
  --output-dir docs/videos \
  --type videos \
  --reference-library albumentationsx

3. Update Main README

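The script automatically rewrites the speedup summaries in README.md between paired markers (the IMAGE_SPEEDUP_SUMMARY and VIDEO_SPEEDUP_SUMMARY markers checked under Validation):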

 ... 
 ...

Manual update if needed:

import pandas as pd

# Speedup table for image benchmarks; the first column (transform name) is the index.
df = pd.read_csv('docs/images/images_speedups.csv', index_col=0)
median = df['albumentationsx'].median()
max_val = df['albumentationsx'].max()
max_transform = df['albumentationsx'].idxmax()  # transform with the largest speedup

summary = (
    "AlbumentationsX is generally the fastest library for image augmentation, "
    f"with a median speedup of {median:.1f}× compared to other libraries. "
    f"For some transforms, the speedup can be as high as {max_val:.1f}× ({max_transform})."
)
print(summary)  # paste the printed text between the README markers
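The summary text built above still has to be placed between the README markers. The following is a minimal sketch of that step, not the logic of update_docs.sh: the `START` and `END` strings are hypothetical (the real marker text in README.md may differ), and `replace_between` is an illustrative helper.

```python
import re

# Hypothetical marker strings; check README.md for the real ones.
START = "<!-- IMAGE_SPEEDUP_SUMMARY_START -->"
END = "<!-- IMAGE_SPEEDUP_SUMMARY_END -->"


def replace_between(text, start, end, new_body):
    """Replace whatever sits between start and end, keeping both markers."""
    pattern = re.compile(re.escape(start) + r".*?" + re.escape(end), re.DOTALL)
    if not pattern.search(text):
        raise ValueError("markers not found or out of order")
    replacement = f"{start}\n{new_body}\n{end}"
    # A function replacement keeps backslashes in new_body from being
    # interpreted as regex escape sequences.
    return pattern.sub(lambda _match: replacement, text, count=1)


readme = f"# Benchmark\n{START}\nold summary\n{END}\nfooter\n"
updated = replace_between(readme, START, END, "new summary")
print(updated)
```

Raising when the markers are missing avoids silently writing a README with no summary, which matches the first troubleshooting hint about verifying markers.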

Library Metadata

Create metadata files for new libraries:

Image: docs/images/{library}_metadata.yaml
Video: docs/videos/{library}_metadata.yaml

library_name: LibraryName
version: "1.2.3"
description: Brief description of the library
documentation: https://library.readthedocs.io
repository: https://github.com/org/library
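When several libraries are added at once, the metadata files can be rendered from a small helper. This is a sketch using only the five fields shown above; `render_metadata` and `metadata_path` are hypothetical names, and every value is quoted, which differs cosmetically from the unquoted example but stays valid YAML.

```python
import json


def metadata_path(kind, library):
    """Build the metadata path; kind is 'images' or 'videos'."""
    return f"docs/{kind}/{library}_metadata.yaml"


def render_metadata(library_name, version, description, documentation, repository):
    """Render the five metadata fields as YAML text."""
    fields = {
        "library_name": library_name,
        "version": version,
        "description": description,
        "documentation": documentation,
        "repository": repository,
    }
    # json.dumps emits double-quoted strings, which are valid YAML scalars,
    # so a version such as 1.10 is not read back as the number 1.1.
    return "".join(f"{key}: {json.dumps(value)}\n" for key, value in fields.items())


text = render_metadata(
    "LibraryName",
    "1.2.3",
    "Brief description of the library",
    "https://library.readthedocs.io",
    "https://github.com/org/library",
)
print(metadata_path("images", "libraryname"))
print(text)
```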

Documentation Structure

docs/
├── good_plots.md                      # Paper figure and claim-to-plot policy
├── benchmark_scope.md                 # Benchmark scope and paper policy
├── images/
│   ├── README.md                      # Detailed benchmark results
│   ├── images_speedup_analysis.webp   # Main visualization
│   ├── images_speedups.csv            # Speedup data
│   ├── albumentationsx_metadata.yaml  # Library info
│   └── ...
└── videos/
    ├── README.md
    ├── videos_speedup_analysis.webp
    ├── videos_speedups.csv
    └── ...metadata.yaml files

Comparison Tools

compare_results.py (images)

python tools/compare_results.py --results-dir output/

Output format:

| Transform | albumentationsx | torchvision | kornia |
|-----------|-----------------|-------------|--------|
| HorizontalFlip | 1234 ± 45 | 567 ± 23 | ... |
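To make the table layout concrete, here is a minimal sketch that renders rows in the same "value ± spread" style. It is not the code in compare_results.py: `render_table` is a hypothetical helper, the input structure is assumed, and the numbers are the placeholder values from the example above.

```python
def render_table(results, libraries):
    """Render a Markdown table.

    results maps transform -> library -> (mean, std); a library without an
    entry for a transform is shown as '-'.
    """
    header = "| Transform | " + " | ".join(libraries) + " |"
    separator = "|" + "|".join(["---"] * (len(libraries) + 1)) + "|"
    rows = []
    for transform in sorted(results):
        cells = []
        for lib in libraries:
            entry = results[transform].get(lib)
            cells.append("-" if entry is None else f"{entry[0]:.0f} ± {entry[1]:.0f}")
        rows.append(f"| {transform} | " + " | ".join(cells) + " |")
    return "\n".join([header, separator, *rows])


example = {
    "HorizontalFlip": {"albumentationsx": (1234.0, 45.0), "torchvision": (567.0, 23.0)},
}
table = render_table(example, ["albumentationsx", "torchvision", "kornia"])
print(table)
```

Building the header and separator from the same `libraries` list keeps the column counts in step, which is the usual cause of tables that fail to render.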

compare_video_results.py (videos)

python tools/compare_video_results.py --results-dir output_videos/

Includes CPU vs GPU comparisons.

generate_speedup_plots.py

python tools/generate_speedup_plots.py \
  --results-dir output/ \
  --output-dir docs/images \
  --type images \
  --reference-library albumentationsx

Generates:

Speedup bar chart
CSV with speedup factors
Statistical summary
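How generate_speedup_plots.py defines a speedup is not spelled out here. One plausible definition, used purely for illustration, is the reference library's throughput divided by the fastest competing library's throughput for the same transform. The sketch below assumes higher values are better; `speedup_factors` and the sample numbers are hypothetical.

```python
from statistics import median


def speedup_factors(throughput, reference):
    """Map transform -> reference throughput / fastest other library.

    throughput maps transform -> library -> items per second (assumed).
    Transforms with no reference result or no competitor are skipped.
    """
    factors = {}
    for transform, per_lib in throughput.items():
        ref = per_lib.get(reference)
        others = [v for lib, v in per_lib.items() if lib != reference and v > 0]
        if ref is None or not others:
            continue  # no direct comparison possible
        factors[transform] = ref / max(others)
    return factors


sample = {
    "HorizontalFlip": {"albumentationsx": 1000.0, "torchvision": 500.0, "kornia": 250.0},
    "Rotate": {"albumentationsx": 300.0, "torchvision": 400.0},
    "Solarize": {"albumentationsx": 800.0},  # no competitor, so it is skipped
}
factors = speedup_factors(sample, "albumentationsx")
print(factors)
print("median speedup:", median(factors.values()))
```

A factor below 1.0 means another library was faster for that transform, so a median above 1.0 does not imply the reference wins everywhere.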

Validation

After updating documentation:

Check markdown syntax:

# Tables should render correctly
# Links should be valid

Verify images:

ls -lh docs/images/*.webp
ls -lh docs/videos/*.webp

Check CSV data:

import pandas as pd
df = pd.read_csv('docs/images/images_speedups.csv', index_col=0)
print(df.head())
print(f"Shape: {df.shape}")
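Printing the head and shape catches an empty file but not bad values. A slightly stricter check, sketched below with only the standard library, flags non-numeric, non-finite, or non-positive speedups. It assumes the first column holds transform names and that an empty cell means the library does not support that transform; `check_speedup_csv` is a hypothetical helper.

```python
import csv
import math


def check_speedup_csv(path):
    """Return a list of problems found in a speedup CSV (empty means OK)."""
    problems = []
    with open(path, newline="", encoding="utf-8") as handle:
        rows = list(csv.reader(handle))
    if len(rows) < 2:
        return ["file has no data rows"]
    header = rows[0]
    for line_no, row in enumerate(rows[1:], start=2):
        if len(row) != len(header):
            problems.append(f"line {line_no}: expected {len(header)} cells, got {len(row)}")
            continue
        # Skip the first column, which is assumed to be the transform name.
        for column, cell in zip(header[1:], row[1:]):
            if cell == "":
                continue  # assumed to mean "unsupported transform"
            try:
                value = float(cell)
            except ValueError:
                problems.append(f"line {line_no}: {column} is not numeric: {cell!r}")
                continue
            if not math.isfinite(value) or value <= 0:
                problems.append(f"line {line_no}: {column} has implausible speedup {value}")
    return problems
```

Calling `check_speedup_csv("docs/images/images_speedups.csv")` from the repository root returns an empty list when nothing looks wrong.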

Validate README markers:

grep -n "IMAGE_SPEEDUP_SUMMARY" README.md
grep -n "VIDEO_SPEEDUP_SUMMARY" README.md
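The same marker check can be scripted where grep is unavailable. This is a minimal sketch that mirrors `grep -n` by reporting 1-based line numbers; `marker_lines` is a hypothetical helper, and the sample text, including its `_START` and `_END` suffixes, is invented for the demonstration.

```python
def marker_lines(text, name):
    """Return 1-based numbers of the lines containing name, like grep -n."""
    return [i for i, line in enumerate(text.splitlines(), start=1) if name in line]


# Hypothetical README content used only to exercise the helper.
sample = "\n".join([
    "# Benchmark",
    "<!-- IMAGE_SPEEDUP_SUMMARY_START -->",
    "summary text",
    "<!-- IMAGE_SPEEDUP_SUMMARY_END -->",
])
print(marker_lines(sample, "IMAGE_SPEEDUP_SUMMARY"))
print(marker_lines(sample, "VIDEO_SPEEDUP_SUMMARY"))
```

An empty list for either name corresponds to grep printing nothing, which is the case covered by the first troubleshooting entry.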

Validate paper figures against claims:

git diff docs/good_plots.md _internal/paper/generated/insights.md

When figure recommendations change, ensure each recommended main-text figure has a stated claim, regime, metric, support denominator when applicable, and source CSV provenance.

Workflow

Complete documentation update workflow:

# 1. Run benchmarks (if needed)
python -m benchmark.cli run \
  --config configs/examples/local_rgb_micro_cpu.yaml \
  --data-dir /path/to/imagenet/val \
  --output output/rgb_micro \
  --num-items 2000

# 2. Update all documentation
./tools/update_docs.sh

# 3. Review changes
git diff README.md
git diff docs/

# 4. Commit if satisfied
git add README.md docs/
git commit -m "docs: update benchmark results"

Benchmark Policy Notes

Keep README guidance aligned with these policies:

Benchmark architecture docs should say that benchmark/matrix.py owns scenario/library/mode support, benchmark/policy.py owns media defaults and slow-skip thresholds, benchmark/jobs.py owns command construction, and benchmark/orchestrator.py owns backend dispatch.
Benchmark architecture docs should say that benchmark/config/models.py owns BenchmarkRunConfig validation, benchmark/config/resolve.py owns YAML loading/CLI overrides/payload shaping, benchmark/config/plan.py owns dry-run job expansion, and benchmark/config/env.py owns resolved-config metadata handoff.
Benchmark docs should use python -m benchmark.cli plan --config ... and python -m benchmark.cli run --config ... examples for reproducible runs. Do not add flag-only benchmark run examples; checked-in run examples live under configs/.
Benchmark docs should mention benchmark/output_naming.py for result filename policy and benchmark/cloud/paths.py for detached GCP VM path policy whenever those rules are described.
If the benchmark matrix changes, update docs/benchmark_architecture.md, docs/benchmark_scope.md, and the relevant skill docs in the same change.
Cloud benchmark docs should show --gcp-gcs-data-uri pointing at one dataset tarball, not a directory of individual images/videos. For macOS-created tarballs, document COPYFILE_DISABLE=1, tar --no-xattrs, and excludes for .DS_Store, AppleDouble ._*, and __MACOSX.
Micro benchmark docs should state that media is preloaded once per library and reused across transform measurements.
Pyperf docs should mention per-transform subprocess isolation, media-cache reuse, lazy transform construction, and slow-transform preflight/early-stop behavior.
Benchmark policy docs should mention lazy output materialization: micro timing should force returned outputs to contiguous memory, including contiguous NumPy conversion for Pillow/PIL Image.Image outputs. Checksums belong only in diagnostics.
Benchmark policy docs should state that library tables include only direct transform support. Missing transforms should remain unsupported instead of being recreated with benchmark-side helper code.
Environment docs should mention joined environments and cached dependency installs, including the detached GCP venv cache.
Local rerun examples should include --no-refresh-requirements when dependency versions are intentionally fixed.
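The macOS tarball note above lists what must be kept out of the dataset archive. As an illustration only (the documented route is the tar invocation with COPYFILE_DISABLE=1 and --no-xattrs), the sketch below builds a single gzip tarball with Python's tarfile module and filters the same junk names. `make_dataset_tarball` and `is_macos_junk` are hypothetical helpers.

```python
import fnmatch
import tarfile
from pathlib import PurePosixPath

JUNK_NAME_PATTERNS = (".DS_Store", "._*")  # Finder metadata and AppleDouble files


def is_macos_junk(name):
    """True for .DS_Store, AppleDouble ._* files, and anything under __MACOSX."""
    parts = PurePosixPath(name).parts
    if not parts:
        return False  # the archive root itself
    if "__MACOSX" in parts:
        return True
    return any(fnmatch.fnmatchcase(parts[-1], pattern) for pattern in JUNK_NAME_PATTERNS)


def _drop_junk(info):
    # Returning None tells TarFile.add to skip the member; for a directory
    # this also skips everything inside it.
    return None if is_macos_junk(info.name) else info


def make_dataset_tarball(source_dir, tarball_path):
    """Pack source_dir into one .tar.gz, leaving out macOS metadata files."""
    with tarfile.open(tarball_path, "w:gz") as tar:
        tar.add(source_dir, arcname=".", filter=_drop_junk)
```

The resulting file is one tarball, which is the shape `--gcp-gcs-data-uri` is expected to point at according to the policy note.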

Troubleshooting

Missing speedup summary in README:

Check CSV file exists: docs/images/images_speedups.csv
Verify markers in README.md
Run update_docs.sh again

Plot generation fails:

Ensure matplotlib and seaborn are installed: pip install -r requirements-dev.txt
Check result files are valid JSON
Verify all libraries have results

Table formatting issues:

Check all result files have same transform names
Verify no special characters in transform names
Ensure consistent JSON structure
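The table-formatting checks above can be partly automated by comparing transform names across result files. The sketch below is a hedged example: the result JSON layout is not documented here, so `load_transform_names` assumes (hypothetically) a top-level `"results"` object keyed by transform name, and both helper names are illustrative.

```python
import json
from pathlib import Path


def load_transform_names(path, key="results"):
    """Read transform names from one result file.

    Assumes the file is a JSON object whose `key` entry maps transform
    names to measurements; adjust `key` to the real layout.
    """
    data = json.loads(Path(path).read_text(encoding="utf-8"))
    return set(data[key])


def name_mismatches(names_by_file):
    """Map each file to the transform names other files have but it lacks."""
    all_names = set()
    for names in names_by_file.values():
        all_names |= set(names)
    report = {}
    for file_name, names in names_by_file.items():
        lacking = sorted(all_names - set(names))
        if lacking:
            report[file_name] = lacking
    return report


example = {
    "albumentationsx.json": {"HorizontalFlip", "Rotate"},
    "kornia.json": {"HorizontalFlip"},
}
print(name_mismatches(example))
```

A missing name is not necessarily an error, since library tables include only direct transform support; the report is mainly useful for spotting spelling differences between files.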
