workspace-planner-benchmark

Stars: 1

Forks: 0

Updated: May 24, 2026 at 09:01

Benchmark Moleworks workspace planner heuristics using the standardized HTML bundle generator. Use when regenerating workspace planner benchmark bundles, leaderboards, manifest/progress CSVs, trench and fan visualizations, or comparing free-lane pointwise_score and max_strip_volume planners.


Source

Idate96

Idate96/codex_skills


SKILL.md


name	workspace-planner-benchmark
description	Benchmark Moleworks workspace planner heuristics using the standardized HTML bundle generator. Use when regenerating workspace planner benchmark bundles, leaderboards, manifest/progress CSVs, trench and fan visualizations, or comparing free-lane pointwise_score and max_strip_volume planners.

Workspace Planner Benchmark

Use this skill when the user asks for workspace planner benchmark runs, standardized HTML bundles, leaderboards, trench/fan visualization indexes, or comparisons between planner selector policies.

Canonical Rule

Use only the standardized bundle generator:

high_level_planning/workspace_planner/workspace_planner/benchmark_html_bundle.py

Do not create ad hoc benchmark HTML folders or one-off direct simulator scripts as the final artifact. If a direct simulator script is needed for diagnosis, label it as temporary and follow up with a standardized bundle run before reporting benchmark results.

A complete standardized bundle has:

index.html
leaderboard.md
manifest.csv
progress.csv
run_metadata.json

Per-case Plotly animation HTML files are present only when --write-animations is used.
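
One way to tell whether a run was generated with --write-animations is to count the HTML files other than index.html under the run directory. This is a minimal sketch, not part of the generator: the helper name count_animation_html is hypothetical, and it assumes the per-case animations are .html files somewhere below the run directory.

```shell
# Hypothetical helper: count HTML files under a run directory, excluding index.html.
# Assumes per-case animations are .html files somewhere below the run directory.
count_animation_html() {
  local run_dir="$1"
  find "$run_dir" -type f -name '*.html' ! -name 'index.html' | wc -l | tr -d '[:space:]'
}
```

A result of 0 from count_animation_html "$RUN_DIR" suggests a metrics-only run.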

Repo Setup

Run commands from:

cd /home/lorenzo/moleworks/ros2_ws/src/moleworks_ros

Set PYTHONPATH so the workspace_planner package is importable:

PYTHONPATH=high_level_planning/workspace_planner

Default output root:

/home/lorenzo/tmp/workspace_planner_benchmarks

Default Focused Bundle

For the standard trench + fan bundle with free_lane pointwise_score and max_strip_volume, run:

RUN_NAME="$(date -u +%Y%m%d_%H%M%S)_workspace_planner_focused_standard"
PYTHONPATH=high_level_planning/workspace_planner \
python3 high_level_planning/workspace_planner/workspace_planner/benchmark_html_bundle.py \
  --output-root /home/lorenzo/tmp/workspace_planner_benchmarks \
  --run-name "$RUN_NAME" \
  --scope focused \
  --soil-mode standard \
  --trench-case centered_trench \
  --trench-case deep_centered_trench \
  --geometry-case center_fan \
  --write-animations

This should produce the standardized bundle under runs/$RUN_NAME, including:

/home/lorenzo/tmp/workspace_planner_benchmarks/runs/$RUN_NAME/index.html
/home/lorenzo/tmp/workspace_planner_benchmarks/runs/$RUN_NAME/leaderboard.md
/home/lorenzo/tmp/workspace_planner_benchmarks/runs/$RUN_NAME/manifest.csv

Variants

For a faster metrics-only run, omit --write-animations. Do this only when the user does not need visual inspection.

To include both focused fan geometries, either omit the --geometry-case filter entirely or pass both cases explicitly:

--geometry-case center_fan --geometry-case deep_center_fan

For all focused cases and default focused selectors, use:

--scope focused

with no case filters.

For the larger planner comparison suite, use:

--scope full

Expect this to take substantially longer. Keep the same output contract.
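
The variants above differ only in scope and in which filter flags are appended, so the command can be assembled by a small wrapper. The sketch below only prints the command and runs nothing. The function name bundle_command and the RUN_NAME and OUTPUT_ROOT overrides are hypothetical. The run-name pattern for scopes other than focused is an assumption extrapolated from the focused example.

```shell
# Hypothetical wrapper: print the generator command for a given scope.
# Usage: bundle_command SCOPE [extra generator flags...]
# Only flags that appear in this document are used; nothing is executed.
bundle_command() {
  local scope="$1"
  shift
  # Assumed naming: mirrors the focused example, with the scope substituted.
  local run_name="${RUN_NAME:-$(date -u +%Y%m%d_%H%M%S)_workspace_planner_${scope}_standard}"
  printf '%s ' \
    "PYTHONPATH=high_level_planning/workspace_planner" \
    python3 high_level_planning/workspace_planner/workspace_planner/benchmark_html_bundle.py \
    --output-root "${OUTPUT_ROOT:-/home/lorenzo/tmp/workspace_planner_benchmarks}" \
    --run-name "$run_name" \
    --scope "$scope" \
    --soil-mode standard \
    "$@"
  printf '\n'
}
```

For example, bundle_command focused --geometry-case center_fan --geometry-case deep_center_fan --write-animations prints the two-fan variant, and bundle_command full prints the larger suite without animations. The output is for review only; arguments containing spaces are not re-quoted.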

Reporting

Before reporting that a benchmark is complete, inspect:

RUN_DIR=/home/lorenzo/tmp/workspace_planner_benchmarks/runs/<run_name>
sed -n '1,120p' "$RUN_DIR/leaderboard.md"
sed -n '1,120p' "$RUN_DIR/manifest.csv"
tail -20 "$RUN_DIR/progress.csv"

Report these paths, plus any problem rows:

index.html for visual inspection
leaderboard.md for ranked summary
manifest.csv for machine-readable rows
failed or incomplete rows from manifest.csv

If a run is killed or interrupted, use progress.csv only as partial progress. Do not call the bundle complete unless index.html, leaderboard.md, manifest.csv, and run_metadata.json all exist.
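
The completeness rule above can be checked mechanically before reporting. This is a minimal sketch; the helper name bundle_is_complete is hypothetical and not provided by the generator.

```shell
# Hypothetical helper: succeed only if the four required bundle files exist.
# Prints each missing file to stderr so an interrupted run is easy to spot.
bundle_is_complete() {
  local run_dir="$1" missing=0 name
  for name in index.html leaderboard.md manifest.csv run_metadata.json; do
    if [ ! -f "$run_dir/$name" ]; then
      echo "missing: $run_dir/$name" >&2
      missing=1
    fi
  done
  [ "$missing" -eq 0 ]
}
```

A call such as bundle_is_complete "$RUN_DIR" || echo "treat progress.csv as partial progress only" keeps an interrupted run from being reported as complete.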

Interpretation Checks

For fan-like geometry bugs, review both the animation HTML and manifest.csv. Specifically call out:

whether center_fan uses the expected angular range and radial limits
whether free_lane__pointwise_score.html and free_lane__max_strip_volume.html both exist
whether finish_step, executed_steps, final_remaining_m3, and residual_tolerance_m3 are comparable between selectors
whether planner success and residual completion disagree
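
The last check in the list can be approximated with awk over manifest.csv. This is a sketch under stated assumptions: final_remaining_m3 and residual_tolerance_m3 are the column names listed above, while the success column name and its True/False encoding are guesses about the manifest schema. The helper name residual_disagreements is hypothetical, and the naive comma split assumes no quoted commas in the file.

```shell
# Hypothetical helper: list manifest rows where reported success and residual
# completion disagree. The "success" column name and its true/false encoding
# are assumptions; adjust them to the real manifest header.
residual_disagreements() {
  awk -F',' '
    NR == 1 {
      # Map header names to column indexes.
      for (i = 1; i <= NF; i++) col[$i] = i
      if (!("success" in col) || !("final_remaining_m3" in col) || !("residual_tolerance_m3" in col)) {
        print "manifest is missing an expected column" > "/dev/stderr"
        exit 2
      }
      next
    }
    {
      ok = (tolower($(col["success"])) == "true")
      complete = ($(col["final_remaining_m3"]) + 0 <= $(col["residual_tolerance_m3"]) + 0)
      if (ok != complete) print "row " NR ": " $0
    }
  ' "$1"
}
```

Running residual_disagreements "$RUN_DIR/manifest.csv" prints one line per disagreeing row, numbered by file line with the header as line 1. No output means the two signals agree for every row.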

When comparing runs, prefer the newest standardized bundle with matching scope, soil mode, cases, and selectors. Do not compare a standardized bundle against an ad hoc HTML-only folder without saying that the artifact contracts differ.
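
Because the run names used here start with a UTC timestamp from date -u +%Y%m%d_%H%M%S, lexicographic order matches chronological order, which makes the newest run easy to find. This is a minimal sketch; the helper name latest_run_dir is hypothetical, and matching on the name suffix is only a first filter. Scope, soil mode, cases, and selectors should still be confirmed from the bundle itself.

```shell
# Hypothetical helper: print the newest run directory whose name ends with SUFFIX.
# Relies on run names starting with a UTC timestamp, so the sorted glob
# expansion visits older runs first and the last match is the newest.
latest_run_dir() {
  local runs_root="$1" suffix="$2" dir latest=""
  for dir in "$runs_root"/*"$suffix"; do
    [ -d "$dir" ] || continue
    latest="$dir"
  done
  [ -n "$latest" ] && printf '%s\n' "$latest"
}
```

For example, latest_run_dir /home/lorenzo/tmp/workspace_planner_benchmarks/runs _workspace_planner_focused_standard prints the most recent focused standard run, and returns a non-zero status when nothing matches.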

