autoexperiment

Name: Autoexperiment
Author: damionrashford

Autonomous time-budget experiment loop. Modify a training script, train for a fixed wall-clock budget, evaluate, record, repeat. Inspired by karpathy/autoresearch. Use for overnight architecture search, systematic hyperparameter sweeps, or any iterative model improvement workflow.

stars:2

forks:0

updated: April 9, 2026 at 01:21

File explorer

5 files

SKILL.md

readonly

related-skills.json

same repository

claude-code.md

from "damionrashford/mlx"

Comprehensive Claude Code knowledge base — plugins, hooks, skills, agents, MCP, channels, headless mode, permissions, settings, and all extensibility features. Use when building, configuring, debugging, or extending Claude Code.

2026-04-09 · 2

skill-creator.md

from "damionrashford/mlx"

Use when building a skill, creating a SKILL.md, packaging a workflow, making a slash command, or asked "how do I make a skill". Scaffolds the folder, generates SKILL.md from a template, validates against spec. Produces a complete ready-to-deploy skill folder: scripts, references, assets. Also use to review or improve an existing skill.

2026-04-09 · 2

analyze.md

from "damionrashford/mlx"

Statistical analysis, hypothesis testing, A/B testing, cohort analysis, segmentation, trend detection, business metrics, pre-delivery validation, and data visualization. Use when the user asks to "analyze this data", "run a statistical test", "compare groups", "find trends", "do A/B test analysis", "segment customers", "calculate KPIs", "validate this analysis", "check my work", "sanity check", "review my numbers", "make a chart", "create a dashboard", "plot the data", "visualize results", or mentions hypothesis testing, cohort analysis, business analytics, data validation, bar charts, line charts, heatmaps, scatter plots, or data storytelling.

2026-04-09 · 2

context-engineering.md

from "damionrashford/mlx"

Context engineering for building production LLM applications: context window management, degradation patterns, optimization strategies, memory system selection, multi-agent architecture, filesystem context patterns, and tool design principles. Use when building LLM apps, RAG pipelines, AI agents, multi-agent systems, or when designing memory, tool APIs, or context strategies for any language model application.

2026-04-09 · 2

data-prep.md

from "damionrashford/mlx"

Explore, clean, and engineer datasets end-to-end: statistical profiling, distribution checks, missing value analysis, duplicate detection, outlier removal, type fixing, encoding, create features, encode categories, transform columns, add rolling windows, build interaction terms, and feature engineering. Supports pandas, polars, and PySpark. Use when the user wants to explore data, profile columns, understand a dataset, clean data, handle missing values, remove duplicates, fix data types, preprocess a dataset before modeling, create features, encode categories, transform columns, add rolling windows, build interaction terms, or do feature engineering.

2026-04-09 · 2

drift-detect.md

from "damionrashford/mlx"

Detect data drift, concept drift, and model performance degradation in production. Uses PSI, KS-test, and chi-squared for statistical drift, plus evidently and nannyml for automated reports. Use when monitoring a deployed model or comparing training vs production data distributions.

2026-04-09 · 2

package.json

"author": "damionrashford"

"repository": "damionrashford/mlx"

$ useful --forSOC

Data Scientists | Computer and Mathematical Occupations | 15-2051 | L4

name	autoexperiment
description	Autonomous time-budget experiment loop. Modify a training script, train for a fixed wall-clock budget, evaluate, record, repeat. Inspired by karpathy/autoresearch. Use for overnight architecture search, systematic hyperparameter sweeps, or any iterative model improvement workflow.
allowed-tools	Bash(uv run * scripts/time_budget_train.py *) Bash, Read, Write, Edit, Glob, Grep
argument-hint	path to train.py or description of experiment goal
model	opus
effort	max
disable-model-invocation	true
context	fork
agent	mlx:ml-engineer
compatibility	>=1.0
metadata	{"category":"model-training","tags":["experiment-tracking","hyperparameter-search","autonomous","time-budget","iteration"],"phase":"train"}

Autoexperiment Skill

Run autonomous time-budget experiment loops. Each iteration modifies train.py, trains for a fixed wall-clock budget, evaluates, records in results.tsv, and repeats.

Setup

Ensure results.tsv exists with a baseline (exp000) before iterating
Create EXPERIMENT.md with your goal, baseline, hypothesis, and constraints
Run: /mlx:autoexperiment path/to/train.py
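
The first setup step could be sketched as follows. The skill does not state the results.tsv schema, so the column names below (`exp_id`, `change`, `val_bpb`, `status`, `notes`) and the helper `init_results` are hypothetical; adapt them to whatever your own baseline run records.

```python
import csv
from pathlib import Path

# Hypothetical column layout; the skill does not specify the real schema.
COLUMNS = ["exp_id", "change", "val_bpb", "status", "notes"]


def init_results(path, baseline_val_bpb):
    """Create results.tsv with a header and an exp000 baseline row.

    Returns True if the file was created, False if it already existed.
    """
    path = Path(path)
    if path.exists():
        return False  # never overwrite existing experiment history
    with path.open("w", newline="") as f:
        writer = csv.writer(f, delimiter="\t", lineterminator="\n")
        writer.writerow(COLUMNS)
        writer.writerow(
            ["exp000", "baseline", f"{baseline_val_bpb:.4f}", "KEEP", "unmodified train.py"]
        )
    return True
```

Calling `init_results("results.tsv", measured_baseline)` once before the first iteration should be enough; later calls leave the file untouched.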

Protocol

Before each iteration

Read EXPERIMENT.md for the current hypothesis
Read results.tsv for experiment history
Identify one change to make (ONE variable only)
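
Reading the history before choosing the next change might look like the sketch below. It assumes a tab-separated file with hypothetical `exp_id`, `val_bpb`, and `status` columns, and that a lower val_bpb is better; the sample rows are made-up placeholders, not real results.

```python
import csv
import io


def load_history(tsv_text):
    """Parse the contents of results.tsv into a list of row dicts."""
    return list(csv.DictReader(io.StringIO(tsv_text), delimiter="\t"))


def best_kept(rows, metric="val_bpb"):
    """Return the KEEP row with the lowest metric, or None if nothing was kept."""
    kept = [r for r in rows if r["status"] == "KEEP"]
    return min(kept, key=lambda r: float(r[metric])) if kept else None


# Made-up placeholder rows, for illustration only.
SAMPLE = (
    "exp_id\tval_bpb\tstatus\n"
    "exp000\t1.250\tKEEP\n"
    "exp001\t1.231\tKEEP\n"
    "exp002\t1.300\tDISCARD\n"
    "exp003\t\tCRASH\n"
)

best = best_kept(load_history(SAMPLE))
print(best["exp_id"])  # exp001
```

Comparing each new run against the best kept row, rather than the most recent row, keeps a single DISCARD from shifting the reference point.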

Iteration loop

Edit train.py with the single change
Run with TIME_BUDGET: timeout $BUDGET uv run train.py
Capture exit code and metrics
Record the outcome in results.tsv as KEEP, DISCARD, or CRASH
If CRASH 3× in a row on the same error → stop, report diagnosis
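
The loop above can be sketched in Python as a rough illustration. The helpers `run_with_budget`, `classify`, and `should_stop` are hypothetical names, not part of the skill. The sketch assumes that any non-zero exit (including a timeout) counts as a CRASH, that a lower val_bpb than the best so far means KEEP, and it reuses exit code 124, which is what GNU `timeout` returns when the limit expires.

```python
import subprocess
import sys

TIMEOUT_EXIT_CODE = 124  # same value GNU `timeout` uses when the limit expires


def run_with_budget(cmd, budget_s):
    """Run cmd with a wall-clock limit; return (exit_code, stdout, stderr)."""
    try:
        proc = subprocess.run(cmd, capture_output=True, text=True, timeout=budget_s)
    except subprocess.TimeoutExpired:
        # The child is killed by subprocess.run before the exception is raised.
        return TIMEOUT_EXIT_CODE, "", ""
    return proc.returncode, proc.stdout, proc.stderr


def classify(exit_code, val_bpb, best_val_bpb):
    """Map one run to KEEP / DISCARD / CRASH (assumed rule: lower val_bpb wins)."""
    if exit_code != 0 or val_bpb is None:
        return "CRASH"
    return "KEEP" if val_bpb < best_val_bpb else "DISCARD"


def should_stop(error_history, limit=3):
    """True when the last `limit` runs all crashed with the same error signature.

    error_history holds one entry per run: an error string for a CRASH,
    or None for a run that finished.
    """
    recent = error_history[-limit:]
    return (
        len(recent) == limit
        and recent[0] is not None
        and len(set(recent)) == 1
    )
```

With the skill's command, `cmd` would be something like `["uv", "run", "train.py"]`; a trivial stand-in such as `[sys.executable, "-c", "print('ok')"]` is enough to try the helper without a training script.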

After each iteration

Update EXPERIMENT.md "Next to try" section
Summarize: what changed, what happened, what's next

Templates

See references/EXPERIMENT.md.template for the hypothesis file format. See scripts/time_budget_train.py for a complete training script template with all patterns.

Key patterns

TIME_BUDGET: a budget in wall-clock seconds, not epochs. At 300 s per run this allows ~12 experiments/hour (3600 / 300).
val_bpb: total_nats / (math.log(2) * total_bytes), i.e. validation bits per byte. Normalizing by bytes rather than tokens makes the metric vocab-independent.
GC freeze: freezing the garbage collector after step 0 eliminates ~500 ms stalls
Fast fail: if math.isnan(loss) or loss > 100: sys.exit(1)
Circuit breaker: 3 consecutive CRASHes on the same error → escalate to the user
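
The val_bpb and fast-fail patterns can be written out as small helpers. This is a minimal sketch: the function names `val_bpb` and `check_loss` are illustrative, while the formula and the thresholds come straight from the list above.

```python
import math
import sys


def val_bpb(total_nats, total_bytes):
    """Convert summed cross-entropy (in nats) to bits per byte.

    Dividing by math.log(2) converts nats to bits; dividing by the byte
    count of the evaluated text removes the dependence on the tokenizer.
    """
    return total_nats / (math.log(2) * total_bytes)


def check_loss(loss):
    """Fast fail: exit with code 1 as soon as the loss is NaN or has blown up."""
    if math.isnan(loss) or loss > 100:
        sys.exit(1)
```

Because `check_loss` exits non-zero, a diverged run surfaces through the exit code instead of consuming the rest of its time budget.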

See references/autoexperiment-guide.md for full documentation.
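
As a rough illustration of the TIME_BUDGET and GC freeze patterns, the sketch below runs a step function until a wall-clock budget is used up. It assumes that "GC freeze" refers to Python's `gc.freeze()`, called once after the first step; `train_for_budget` and `step_fn` are hypothetical names, and the actual scripts/time_budget_train.py may be structured differently.

```python
import gc
import time


def train_for_budget(step_fn, budget_s):
    """Call step_fn(step) repeatedly until budget_s wall-clock seconds elapse.

    Returns the number of completed steps.
    """
    start = time.monotonic()  # monotonic clock is unaffected by system time changes
    step = 0
    while time.monotonic() - start < budget_s:
        step_fn(step)
        if step == 0:
            # After the first step the long-lived objects exist. Collect once,
            # then move the survivors to the permanent generation so later
            # collections do not rescan them.
            gc.collect()
            gc.freeze()
        step += 1
    return step
```

Since the loop only checks the clock between steps, a run can overshoot the budget by up to one step; an external `timeout` around the process acts as a backstop for steps that hang.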
