with one click
setup-guide
// Use when the user wants to set up LLM training for the first time, or when training_hub is not yet installed/configured in the current environment.
// Use when the user wants to set up LLM training for the first time, or when training_hub is not yet installed/configured in the current environment.
Use when the user wants to estimate GPU memory (VRAM) requirements for a training configuration, check if a model will fit on their GPUs, or plan GPU allocation for training.
Use when the user wants to run a training job using a saved configuration. For algorithm selection, hyperparameter advice, or troubleshooting, use the training-hub-guide skill instead.
Guides users through LLM post-training with Training Hub, including installation, algorithm selection (SFT, OSFT, LoRA), hyperparameter tuning, troubleshooting OOM errors, interpreting loss curves, and leveraging backend-specific features. Use when the user is working with training_hub, fine-tuning language models, asking about SFT/OSFT/LoRA training, or debugging GPU/CUDA training issues.
| name | setup-guide |
| description | Use when the user wants to set up LLM training for the first time, or when training_hub is not yet installed/configured in the current environment. |
| allowed-tools | ["Bash(${CLAUDE_PLUGIN_ROOT}/scripts/th_detect.sh:*)","Bash(${CLAUDE_PLUGIN_ROOT}/scripts/th_estimate.sh:*)"] |
You are helping the user set up LLM training. For algorithm selection guidance, hyperparameter tuning, and troubleshooting, consult the training-hub-guide skill.
"${CLAUDE_PLUGIN_ROOT}/scripts/th_detect.sh"
If library=missing:
installer=uv: run uv pip install training-hubinstaller=pip: run pip install training-hubinstaller=none: tell the user they need Python and pip/uv installed firsttraining-hub[cuda] — flash-attn, bitsandbytes for GPU accelerationtraining-hub[lora] — Unsloth, TRL for parameter-efficient fine-tuningtraining-hub[grpo] — ART, veRL for reinforcement learningFor installation issues, consult the training-hub-guide skill (installation-troubleshooting section).
If gpu=unavailable, warn: "No GPU detected. Training requires CUDA-capable GPUs. You can still configure, but training will fail without a GPU."
Report GPU count if available.
If the user has a clear task ("fine-tune Llama on my data"), offer a fast path with sensible defaults:
"I detected N GPU(s). I can set up with these defaults:
- Algorithm:
lora_sft(parameter-efficient, works on a single GPU)- Learning rate:
1e-5- Epochs:
2- Batch size:
64- Max sequence length:
4096You'll just need to provide your model path and data path. Accept these defaults, or customize?"
If the user accepts, ask only for model path and data path, then skip to Step 7.
If the user wants to customize, proceed with the full configuration.
Ask these questions one at a time:
training-hub-guide skill for algorithm selection guidance if the user is unsure.meta-llama/Llama-3.1-8B-Instruct, or a local path.messages field../outputCollect hyperparameters based on the chosen algorithm. Consult the training-hub-guide skill (hyperparameter-guide section) for recommended defaults by dataset size and algorithm.
Ask: "Do you want to configure experiment tracking?" (W&B, MLflow, or TensorBoard). See the training-hub-guide skill for logger details.
Write the config to .training-hub/config.json:
{
"algorithm": "<algorithm>",
"model_path": "<model_path>",
"data_path": "<data_path>",
"ckpt_output_dir": "<output_dir>",
"nproc_per_node": N,
"hyperparams": { ... },
"algorithm_config": { ... },
"logging": { ... }
}
Add .training-hub/ to .gitignore if not already present.
Ask: "Want me to estimate GPU memory requirements before training?"
If yes, run:
"${CLAUDE_PLUGIN_ROOT}/scripts/th_estimate.sh"
If this skill is invoked again and a config already exists, ask: "You already have a configuration. Do you want to update it or start fresh?"