Skip to main content
Run any Skill in Manus
with one click
$pwd:

experiment-execution

// Execution runner for Orbit Wars experiments under `backend/pipeline/`. Takes a single hypothesis (one line in `hypotheses.md` or one `iterN_plan.md`) and drives the full cycle inline in the main session: case implementation → local smoke test (1-episode self-play, mandatory) → `dev/test-backend` → push & RunPod GPU launch → in-flight monitoring (progress / steps / loss / GPU·CPU·memory) → on failure, stop & terminate the pod and relaunch (auto-recover) → evaluation → `iterN_result.md`. Honors the `hypotheses.md` skip list (e.g. `300 対戦 skip` → evaluation is reduced to training-log only, `smoke skip` → skip the 1-ep self-play, `RunPod 不使用` → full local pipeline). Use whenever the user types `/experiment-execution`, or asks to run / execute / iterate / kick off an experiment, train a new model, launch a RunPod run, propose a new case, or write up an experiment result — even if they don't explicitly say "execute", phrases like "imitation/case1 で dropout を試したい", "rulebase/case2 を改良して回したい", "runpod で学習を回して結果まとめて",

$ git log --oneline --stat
stars:0
forks:0
updated:May 6, 2026 at 03:08
File Explorer
2 files
SKILL.md
readonly