gcp-benchmark-triage

Name: Gcp Benchmark Triage
Author: albumentations-team

Description: Triage detached GCP benchmark runs, DONE/FAILED sentinels, VM cleanup, vm.log, gcp_last_run.json, and partial result downloads. Use when GCP benchmark logs mention DONE, FAILED, exit_code.txt, VM disappeared, STOPPING, gcloud machine type errors, or missing artifacts.

stars:87

forks:3

updated: May 6, 2026 at 15:45

SKILL.md

name	gcp-benchmark-triage
description	Triage detached GCP benchmark runs, DONE/FAILED sentinels, VM cleanup, vm.log, gcp_last_run.json, and partial result downloads. Use when GCP benchmark logs mention DONE, FAILED, exit_code.txt, VM disappeared, STOPPING, gcloud machine type errors, or missing artifacts.

GCP Benchmark Triage

Detached runs upload artifacts under the run prefix recorded in gcp_last_run.json.

Interpret Run State

DONE: benchmark succeeded; fetch results/, then wait for VM cleanup/cache upload.
FAILED: benchmark failed; fetch vm.log, FAILED, exit_code.txt, and any partial results/.
exit_code.txt without DONE: use the exit code and vm.log as source of truth.
VM gone with no terminal marker: treat as failure until logs prove otherwise.
VM STOPPING after DONE: usually cache upload/self-delete; wait before relaunching quota-heavy jobs.
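
The rules above can be read as a small decision function. The following is a minimal sketch, not the repository's own code: the function name `classify_run`, the verdict strings, the choice to let FAILED win if both markers are present, and the "in-progress" fallback for a live VM with no markers are all assumptions made for illustration.

```python
from typing import Optional, Set


def classify_run(markers: Set[str], vm_status: Optional[str]) -> str:
    """Map uploaded marker files and VM status to a triage verdict.

    markers: names of marker objects found under the run prefix,
             e.g. {"DONE"} or {"FAILED", "exit_code.txt"}.
    vm_status: Compute Engine status string such as "RUNNING" or
               "STOPPING", or None if the VM no longer exists.
    """
    if "FAILED" in markers:
        # Assumption: FAILED takes precedence if both markers exist.
        return "failed"
    if "DONE" in markers:
        # A STOPPING VM after DONE is usually cache upload/self-delete.
        if vm_status == "STOPPING":
            return "succeeded-cleanup-in-progress"
        return "succeeded"
    if "exit_code.txt" in markers:
        # No DONE marker: the exit code and vm.log decide the outcome.
        return "check-exit-code"
    if vm_status is None:
        # VM gone with no terminal marker: treat as failure.
        return "presumed-failed"
    # Assumption: a live VM with no markers is still working.
    return "in-progress"


if __name__ == "__main__":
    print(classify_run({"DONE"}, "STOPPING"))
    print(classify_run(set(), None))
```

A "succeeded-cleanup-in-progress" verdict is the case where it pays to wait before relaunching a quota-heavy job.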

Useful Commands

Read local metadata:

python - path/to/gcp_last_run.json <<'PY'
import json, sys

# Pretty-print the metadata file passed as the first argument.
with open(sys.argv[1]) as f:
    print(json.dumps(json.load(f), indent=2))
PY

Fetch artifacts:

gcloud storage rsync --recursive "$RUN_PREFIX/results/" "$LOCAL_OUT/"
gcloud storage cat "$RUN_PREFIX/vm.log"
gcloud storage cat "$RUN_PREFIX/exit_code.txt"
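
Which artifacts to fetch depends on the run state, so it can help to build the command list from it. This is a hedged sketch that only assembles argv lists from the gcloud subcommands shown above. The helper name `fetch_commands`, the `failed` flag, and the `gs://example-bucket/...` prefix are hypothetical.

```python
import shlex
from typing import List


def fetch_commands(run_prefix: str, local_out: str, failed: bool) -> List[List[str]]:
    """Return gcloud argv lists for the artifacts worth fetching."""
    prefix = run_prefix.rstrip("/")
    out_dir = local_out.rstrip("/")
    cmds: List[List[str]] = []
    if failed:
        # On failure, read the log and marker files first.
        for name in ("vm.log", "FAILED", "exit_code.txt"):
            cmds.append(["gcloud", "storage", "cat", f"{prefix}/{name}"])
    # Full results on success, partial results on failure.
    cmds.append(
        ["gcloud", "storage", "rsync", "--recursive",
         f"{prefix}/results/", f"{out_dir}/"]
    )
    return cmds


if __name__ == "__main__":
    # Hypothetical prefix; the real one is recorded in gcp_last_run.json.
    for cmd in fetch_commands("gs://example-bucket/run-1", "out", failed=True):
        print(shlex.join(cmd))
```

Printing the commands rather than executing them keeps the sketch safe to run without credentials. Each list can be passed to `subprocess.run` unchanged.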

Check machine availability before launching:

gcloud compute machine-types list \
  --project albumentations \
  --zones us-central1-b \
  --filter='name=(c4-standard-16 c4d-standard-16 g2-standard-16)'

Check CPU and GPU quota:

python - <<'PY'
import json
import subprocess

raw = subprocess.check_output(
    ["gcloud", "compute", "project-info", "describe", "--project", "albumentations", "--format=json"],
    text=True,
)
for quota in json.loads(raw).get("quotas", []):
    metric = quota.get("metric", "")
    if "CPU" in metric or "GPU" in metric or "NVIDIA" in metric:
        print(f"{metric}: limit={quota.get('limit')} usage={quota.get('usage')}")
PY
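
The printed limit and usage values can be turned into a go/no-go check before launching. A minimal sketch follows, assuming quota entries shaped like those read above (`metric`, `limit`, `usage`). The helper names `quota_headroom` and `fits` are hypothetical, and the sample numbers are illustrative, not real project data.

```python
from typing import Iterable, Mapping, Optional


def quota_headroom(quotas: Iterable[Mapping], metric: str) -> Optional[float]:
    """Return limit minus usage for one metric, or None if it is absent."""
    for quota in quotas:
        if quota.get("metric") == metric:
            return float(quota.get("limit", 0)) - float(quota.get("usage", 0))
    return None


def fits(quotas: Iterable[Mapping], metric: str, needed: float) -> bool:
    """True if the metric exists and has at least `needed` units free."""
    headroom = quota_headroom(quotas, metric)
    return headroom is not None and headroom >= needed


if __name__ == "__main__":
    # Illustrative values only: 48 of 64 vCPUs already in use.
    sample = [
        {"metric": "CPUS_ALL_REGIONS", "limit": 64.0, "usage": 48.0},
        {"metric": "GPUS_ALL_REGIONS", "limit": 1.0, "usage": 0.0},
    ]
    # A 16-vCPU VM still fits; a second one would not.
    print(fits(sample, "CPUS_ALL_REGIONS", 16))
    print(fits(sample, "CPUS_ALL_REGIONS", 32))
```

A GPU VM needs both checks to pass, since it draws on vCPU quota as well as GPU quota.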

Common Gotchas

c3-standard-16 does not exist; use c3d-standard-16 or c3-standard-22.
G2/A2/A3/A4 GPU machine types require --maintenance-policy TERMINATE. They also need a CUDA-capable image; do not treat g2-standard-16 as a CPU VM just because --gcp-gpu-type is omitted.
The error "Quota 'GPUS_ALL_REGIONS' exceeded. Limit: 0.0 globally." means the project has zero GPU quota anywhere. Changing zone will not help; request global GPU quota and regional L4/G2 quota before retrying.
Current quota is 64 vCPUs and 1 GPU. CPU quota errors can still occur when the remaining vCPUs are consumed by overlapping VMs, by stale STOPPING instances, or by a second launch made after a transient client error wrongly suggested the VM was gone. A g2-standard-16 GPU VM also consumes 16 vCPUs.
The error "No image files found in dataset tarball" for a video scenario usually means the detached job.json has the wrong typed run_config.selection.scenario or wrong media-derived staging fields. Detached runs should not carry flag argv payloads.
A ModuleNotFoundError from benchmark/cloud/stage_dataset.py before "Installing uv..." appears means bootstrap staging imported a dependency too early. That script must stay stdlib-only because it runs before the control venv and library venvs are created.
Kornia image GPU Shear failures with mixed CPU/CUDA tensors are a known Kornia CUDA limitation. Current benchmark jobs filter Shear only for Kornia image GPU micro/DataLoader rows; rerun with the patched code instead of removing Shear from the global paper transform sets.
Result directories contain both summary JSON and raw pyperf JSON; docs should load only *_results.json.
Do not assume VM disappearance means success.
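
The note above about loading only *_results.json can be enforced with a small filter. This is a sketch under assumptions: the helper name `select_summary_files` and the example file names are hypothetical, and the actual naming of the raw pyperf files is not stated here.

```python
from fnmatch import fnmatch
from pathlib import Path
from typing import Iterable, List


def select_summary_files(paths: Iterable[str]) -> List[str]:
    """Keep only paths whose file name matches *_results.json, sorted."""
    return sorted(p for p in paths if fnmatch(Path(p).name, "*_results.json"))


if __name__ == "__main__":
    # Hypothetical listing of a downloaded results directory.
    listing = [
        "out/b_results.json",
        "out/b_pyperf.json",
        "out/a_results.json",
    ]
    print(select_summary_files(listing))
```

To apply it to a real directory, pass it the output of `Path(results_dir).rglob("*.json")` converted to strings.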

package.json

"author": "albumentations-team"

"repository": "albumentations-team/benchmark"
