Exécutez n'importe quel Skill dans Manus
en un clic

Exécutez n'importe quel Skill dans Manus en un clic

$pwd:

preflight-check

Name: Preflight Check
Author: archibate

// Resource-aware pre-launch checklist for long-running or heavy tasks — prevents OOM, wasted compute, and daytime disruption. TRIGGER before: any `babysit run` of compute work, builds/benchmarks/renders/training jobs, large-scale scrapes or scripted browser runs, spawning 2+ parallel subagents, full-repo test runs, or any Bash call with `run_in_background=true` whose runtime is uncertain.

Exécuter dans Manus

$ git log --oneline --stat

stars:27

forks:3

updated:19 mai 2026 à 05:30

Explorateur de fichiers

4 fichiers

SKILL.md

readonly

name	preflight-check
description	Resource-aware pre-launch checklist for long-running or heavy tasks — prevents OOM, wasted compute, and daytime disruption. TRIGGER before: any `babysit run` of compute work, builds/benchmarks/renders/training jobs, large-scale scrapes or scripted browser runs, spawning 2+ parallel subagents, full-repo test runs, or any Bash call with `run_in_background=true` whose runtime is uncertain.
allowed-tools	["Bash(free:)","Bash(df:)","Bash(nproc:)","Bash(nvidia-smi:)","Bash(ps aux:)","Bash(babysit list:)","Bash(babysit status:)","Bash(resource_snapshot:)","Read","Grep"]
compatibility	Claude Code

Preflight Check

Pre-launch checklist to prevent OOM kills, wasted compute, and daytime disruption. Run this mentally before every heavy task.

Project-specific data tables (cost lookup, I/O dependencies) should be maintained in a file like references/task-costs.md in the project root. If the project has no such file, initialize one from the template at examples/task-costs.md.

When to Use

Before launching any task via babysit run or background shell jobs
Before starting parallel workers, sweeps, grid searches, or hyperparameter optimization
Before running data pipelines, ETL jobs, or batch processing on large datasets
Before any computation estimated to take >10 minutes or use >2 GB memory
When stacking a new task while other heavy tasks are already running
When resuming or re-launching a previously OOM-killed task
When running an unfamiliar script or tool for the first time at scale

When NOT to Use

Quick interactive commands (e.g., git status, ruff check, uv run pytest on a small suite)
One-off file reads, edits, or searches that complete in seconds
Tasks with well-known, negligible resource footprint (linting, formatting, single-file compilation)
When the task has already been classified as Light in the project's Cost Lookup Table and no other heavy tasks are running

Steps

1. Classify Task Cost

Look up the task in the project's Cost Lookup Table and determine its category:

Category	Runtime	Memory/worker	Action
Light	<10 min	<2 GB	Proceed freely
Moderate	10-60 min	2-5 GB	Check server load (Step 2)
Heavy	>1 hr	>5 GB/worker	Full checklist (Steps 2-5)
Unknown	?	?	Probe first (Step 1a)

Classification rule: If runtime and memory point to different categories, take the higher one. A task that runs for 2 min but uses 5 GB is Moderate (memory), not Light.

GPU tasks: If the task offloads to GPU (PyTorch, TensorFlow, cuDF, CUDA kernels, etc.), also classify by VRAM: <2 GB Light, 2-8 GB Moderate, >8 GB/worker Heavy. VRAM has no swap fallback — an OOM on GPU kills the process instantly with no warning.

For batches of N instances, compute aggregate cost:

Aggregate runtime = per_instance_runtime × count / parallelism
Aggregate memory = per_instance_memory × parallelism
Any batch with aggregate runtime >1 hr or memory >10 GB is Heavy.

Note: Runtime does not always scale as 1/parallelism. I/O-bound tasks (disk, network, shared locks) suffer contention under parallel execution, reducing the speedup. Assume parallelism buys at most 50-70% of ideal speedup unless measured otherwise. Be conservative in time estimates.

1a. Probe Unknown Tasks

When a task is not in the lookup table:

Run a minimal probe — 1-2 iterations, 1 worker, smallest input possible

Measure immediately after probe starts:

ps aux --sort=-%mem | grep <keyword> | awk '{printf "PID=%s RSS=%.0fMB\n", $2, $6/1024}'

Time the probe — note wall clock for the minimal run
Extrapolate — estimate full runtime from probe
Classify — based on measured RSS and extrapolated runtime, assign Light/Moderate/Heavy
Record — add to the project's Cost Lookup Table so future runs skip probing

1b. Check Data Race Conflicts

Check for running tasks (e.g., babysit list) and compare I/O dependencies using the project's I/O Dependency Table:

Identify what the planned task reads and writes
Check if any running task writes to files the planned task reads, or vice versa
If conflict exists → BLOCK. Schedule after the conflicting task completes

If the task is not in the I/O table, assume it may conflict with anything until verified. Check the script source for file reads/writes and CLI --help for I/O flags, then add to the table.

2. Check Server Resource Headroom

Measure current resource usage before adding load:

scripts/resource_snapshot.sh

Assess current usage and decide:

Current CPU/Mem Usage	Action
<40%	Proceed freely
40–80%	Proceed, but estimate whether the planned task would push past 80%
80–90%	Caution — only add Light tasks. Wait for running tasks to finish before launching Moderate/Heavy
>90%	Block — wait for the rush to pass, or suggest scheduling for later

Decision rule: Estimate post-launch usage = current usage + planned task's cost (from the Cost Lookup Table). If that sum would push CPU or memory into a higher threshold band, do not launch — wait or reduce parallelism.

Be conservative on cost estimates. Tasks often use more resources than expected, especially under load. For unknown tasks (not yet documented in the Cost Lookup Table), assume worst-case memory and treat as Heavy until measured via Step 1a. It is far cheaper to wait unnecessarily than to OOM-kill a multi-hour job.

Swap usage >50%? → Server is already under memory pressure. Do not add load regardless of the threshold table above.

If the task uses GPU, also check GPU headroom:

nvidia-smi --query-gpu=index,utilization.gpu,memory.used,memory.total --format=csv,noheader,nounits

Apply the same threshold logic: if GPU utilization >80% or free VRAM cannot fit the task's estimated VRAM, do not launch.

3. Smoke Test First

For any sweep or optimization (hyperparameter search, grid search, parallel workers):

Run 1-2 trials with the exact same code path before launching the full sweep
Verify the output makes sense (correct sign, reasonable magnitude, no NaN)
Only then launch the full run

Why: A bug discovered after hundreds of trials wastes hours. A couple of trials take minutes and catch sign errors, data loading issues, wrong configurations, and misconfigured objectives.

4. Shortcut Check

Ask: Can I skip this step entirely and still get useful results?

Common shortcuts:

Reuse existing results instead of re-running from scratch
Run a single evaluation before committing to a full sweep
Spot-check with 1 worker before launching N parallel workers

The fastest path to a result is always preferred. Only run expensive steps when the cheap alternative is insufficient.

5. Launch with Guardrails

When launching heavy tasks:

Set parallelism conservatively — start with fewer workers than the max. 2 workers is safer than 3 if memory is tight.
Know per-worker memory — multiply by N workers and compare to available RAM (and VRAM for GPU tasks).
Minimize data loading — load only the columns/rows needed, not the entire dataset.
Set up monitoring (cron or follow) for tasks >1 hour.
Plan for OOM restarts — use shared state (databases, checkpoints) so killed workers don't lose all progress. If a task is OOM-killed, follow references/post-oom-triage.md before retrying.

6. Verify Assumptions Periodically

After launching, periodically check actual resource usage against your initial estimate:

ps aux --sort=-%mem | grep <task_keyword> | awk '{printf "PID=%s RSS=%.0fMB\n", $2, $6/1024}'
free -m

If reality breaks your assumption:

Estimated <5 min but elapsed >10 min → re-classify as moderate/heavy
Estimated <5 GB but grew to >10 GB → investigate (e.g., data accumulation, memory leak)
Update the project's Cost Lookup Table with the corrected estimate

If a "lightweight" task turns heavy and competes with other tasks:

Consider killing or pausing it
Re-schedule behind actual lightweight tasks
Re-run later when resources are free

related-skills.json

même dépôt

agent-browser.md

from "archibate/dotfiles-claude"

Inspect or debug web pages in a headless browser — fill forms, click buttons, take screenshots, test web apps, review frontend UI/UX aesthetics. Use for headless browser automation. This skill should be used when user says "test web app", "headless browser", "browser automation", "chromium --headless", "screenshot of this page", "look the page".

2026-05-2927

babysit.md

from "archibate/dotfiles-claude"

Run long-running tasks under babysit — supervised background runner with cgroup-enforced memory/CPU caps and observability-stall kill. Use BEFORE any long-running task (>2 min), compute-intensive job, or background work — or when the user says "run in background", "use babysit". This skill defines guardrails and the mandatory workflow, not just a how-to.

2026-05-2827

claude-dm.md

from "archibate/dotfiles-claude"

Send peer-to-peer message and control to Claude Code sessions. Use when coordinating multiple simultaneous Claude sessions, or user says "dm another claude", "list claude sessions", "peek peer claude", "spawn claude in tmux", "monitor remote claude", "notify peer in tmux", "handover to claude", "inspect peer transcript", "orchestrate claude sessions".

2026-05-2827

memory-add.md

from "archibate/dotfiles-claude"

Append a single bullet to staging memory. Use when a durable fact, knowledge, lesson, or pitfall (mistake-correction pattern) emerged mid-conversation that's potentially worth persisting into long-term memory. Also use when user corrected your mistake, says "remember X", "remember not to X", or "next time, do Y instead of X", "do not mistake on X again".

2026-05-2027

note.md

from "archibate/dotfiles-claude"

Add a note only visible to user. A private scratchpad not visible in agent context.

2026-05-2027

mini-prompt.md

from "archibate/dotfiles-claude"

Form a minimal self-contained prompt to reproduce this session

2026-05-1927

package.json

"author": "archibate"

"repository": "archibate/dotfiles-claude"

Ouvrir le dépôt GitHub Voir les dépôts du créateur

$ install --global

$ download --local

Exécuter dans Manus

$ useful --forSOC

Administrateurs de réseaux et de systèmes informatiquesProfessions informatiques et mathématiques15-1244L4

name	preflight-check
description	Resource-aware pre-launch checklist for long-running or heavy tasks — prevents OOM, wasted compute, and daytime disruption. TRIGGER before: any `babysit run` of compute work, builds/benchmarks/renders/training jobs, large-scale scrapes or scripted browser runs, spawning 2+ parallel subagents, full-repo test runs, or any Bash call with `run_in_background=true` whose runtime is uncertain.
allowed-tools	["Bash(free:)","Bash(df:)","Bash(nproc:)","Bash(nvidia-smi:)","Bash(ps aux:)","Bash(babysit list:)","Bash(babysit status:)","Bash(resource_snapshot:)","Read","Grep"]
compatibility	Claude Code

Preflight Check

Pre-launch checklist to prevent OOM kills, wasted compute, and daytime disruption. Run this mentally before every heavy task.

When to Use

Before launching any task via babysit run or background shell jobs
Before starting parallel workers, sweeps, grid searches, or hyperparameter optimization
Before running data pipelines, ETL jobs, or batch processing on large datasets
Before any computation estimated to take >10 minutes or use >2 GB memory
When stacking a new task while other heavy tasks are already running
When resuming or re-launching a previously OOM-killed task
When running an unfamiliar script or tool for the first time at scale

When NOT to Use

Quick interactive commands (e.g., git status, ruff check, uv run pytest on a small suite)
One-off file reads, edits, or searches that complete in seconds
Tasks with well-known, negligible resource footprint (linting, formatting, single-file compilation)
When the task has already been classified as Light in the project's Cost Lookup Table and no other heavy tasks are running

Steps

1. Classify Task Cost

Look up the task in the project's Cost Lookup Table and determine its category:

Category	Runtime	Memory/worker	Action
Light	<10 min	<2 GB	Proceed freely
Moderate	10-60 min	2-5 GB	Check server load (Step 2)
Heavy	>1 hr	>5 GB/worker	Full checklist (Steps 2-5)
Unknown	?	?	Probe first (Step 1a)

Classification rule: If runtime and memory point to different categories, take the higher one. A task that runs for 2 min but uses 5 GB is Moderate (memory), not Light.

For batches of N instances, compute aggregate cost:

Aggregate runtime = per_instance_runtime × count / parallelism
Aggregate memory = per_instance_memory × parallelism
Any batch with aggregate runtime >1 hr or memory >10 GB is Heavy.

1a. Probe Unknown Tasks

When a task is not in the lookup table:

Run a minimal probe — 1-2 iterations, 1 worker, smallest input possible

Measure immediately after probe starts:

ps aux --sort=-%mem | grep <keyword> | awk '{printf "PID=%s RSS=%.0fMB\n", $2, $6/1024}'

Time the probe — note wall clock for the minimal run
Extrapolate — estimate full runtime from probe
Classify — based on measured RSS and extrapolated runtime, assign Light/Moderate/Heavy
Record — add to the project's Cost Lookup Table so future runs skip probing

1b. Check Data Race Conflicts

Check for running tasks (e.g., babysit list) and compare I/O dependencies using the project's I/O Dependency Table:

Identify what the planned task reads and writes
Check if any running task writes to files the planned task reads, or vice versa
If conflict exists → BLOCK. Schedule after the conflicting task completes

If the task is not in the I/O table, assume it may conflict with anything until verified. Check the script source for file reads/writes and CLI --help for I/O flags, then add to the table.

2. Check Server Resource Headroom

Measure current resource usage before adding load:

scripts/resource_snapshot.sh

Assess current usage and decide:

Current CPU/Mem Usage	Action
<40%	Proceed freely
40–80%	Proceed, but estimate whether the planned task would push past 80%
80–90%	Caution — only add Light tasks. Wait for running tasks to finish before launching Moderate/Heavy
>90%	Block — wait for the rush to pass, or suggest scheduling for later

Swap usage >50%? → Server is already under memory pressure. Do not add load regardless of the threshold table above.

If the task uses GPU, also check GPU headroom:

nvidia-smi --query-gpu=index,utilization.gpu,memory.used,memory.total --format=csv,noheader,nounits

Apply the same threshold logic: if GPU utilization >80% or free VRAM cannot fit the task's estimated VRAM, do not launch.

3. Smoke Test First

For any sweep or optimization (hyperparameter search, grid search, parallel workers):

Run 1-2 trials with the exact same code path before launching the full sweep
Verify the output makes sense (correct sign, reasonable magnitude, no NaN)
Only then launch the full run

Why: A bug discovered after hundreds of trials wastes hours. A couple of trials take minutes and catch sign errors, data loading issues, wrong configurations, and misconfigured objectives.

4. Shortcut Check

Ask: Can I skip this step entirely and still get useful results?

Common shortcuts:

Reuse existing results instead of re-running from scratch
Run a single evaluation before committing to a full sweep
Spot-check with 1 worker before launching N parallel workers

The fastest path to a result is always preferred. Only run expensive steps when the cheap alternative is insufficient.

5. Launch with Guardrails

When launching heavy tasks:

Set parallelism conservatively — start with fewer workers than the max. 2 workers is safer than 3 if memory is tight.
Know per-worker memory — multiply by N workers and compare to available RAM (and VRAM for GPU tasks).
Minimize data loading — load only the columns/rows needed, not the entire dataset.
Set up monitoring (cron or follow) for tasks >1 hour.
Plan for OOM restarts — use shared state (databases, checkpoints) so killed workers don't lose all progress. If a task is OOM-killed, follow references/post-oom-triage.md before retrying.

6. Verify Assumptions Periodically

After launching, periodically check actual resource usage against your initial estimate:

ps aux --sort=-%mem | grep <task_keyword> | awk '{printf "PID=%s RSS=%.0fMB\n", $2, $6/1024}'
free -m

If reality breaks your assumption:

Estimated <5 min but elapsed >10 min → re-classify as moderate/heavy
Estimated <5 GB but grew to >10 GB → investigate (e.g., data accumulation, memory leak)
Update the project's Cost Lookup Table with the corrected estimate

If a "lightweight" task turns heavy and competes with other tasks:

Consider killing or pausing it
Re-schedule behind actual lightweight tasks
Re-run later when resources are free

preflight-check

Preflight Check

When to Use

When NOT to Use

Steps

1. Classify Task Cost

1a. Probe Unknown Tasks

1b. Check Data Race Conflicts

2. Check Server Resource Headroom

3. Smoke Test First

4. Shortcut Check

5. Launch with Guardrails

6. Verify Assumptions Periodically

Plus depuis ce dépôt

Plus depuis ce dépôt

Preflight Check

When to Use

When NOT to Use

Steps

1. Classify Task Cost

1a. Probe Unknown Tasks

1b. Check Data Race Conflicts

2. Check Server Resource Headroom

3. Smoke Test First

4. Shortcut Check

5. Launch with Guardrails

6. Verify Assumptions Periodically