Use when designing or reviewing self-distillation workflows for agentic models, including trace collection, teacher or judge feedback, rejection sampling, critique, conversion to SFT or preference data, iterative TRL training loops, and safeguards against self-reinforcing errors.

2026-06-15

hugging-face-cli-workflows

Software Developers

Use when working with Hugging Face CLI or Hub workflows for TRL training, including auth, repositories, uploads, downloads, Jobs, buckets, model persistence, dataset checks, Space links, and remote artifact movement.

2026-06-15

openenv-agentic-rl

Software Developers

Use when designing, reviewing, or implementing OpenEnv-style environment interfaces for agentic RL with TRL, including reset/step/state contracts, tasksets, Docker or HTTP/WebSocket serving, MCP compatibility, reward separation, and GRPO environment rollouts.

2026-06-15

trackio-observability

Software Developers

Use when instrumenting or inspecting TRL training runs with Trackio, run names, metric schemas, dashboards, logs, grep or ripgrep, SFTP, Hugging Face Job logs, remote artifacts, or experiment result summaries.

2026-06-15

trl-post-training

Software Developers

Use when building, reviewing, or editing TRL post-training workflows for agentic applications, including SFT, DPO, GRPO, RLOO, reward modeling, dataset formats, chat templates, assistant/completion-only losses, tool-calling data, reward functions, and challenge progression from SFT to environment-based RL.

2026-06-15

trl-sft

Software Developers

Use when designing, implementing, reviewing, or debugging supervised fine-tuning with TRL SFTTrainer or `trl sft`, especially for agentic models trained on chat messages, prompt/completion data, tool-calling examples, assistant-only loss, completion-only loss, LoRA/PEFT adapters, Trackio logging, or agent trace datasets such as `julien-c/synthtraces`.

2026-06-15

#002

multiautoresearch

5 skills · 30141 · Updated 2026-05-02

33% of creator's skills

Skill

Occupation

Description

Last updated

huggingface-local-models

Software Developers

Use to select models to run locally with llama.cpp and GGUF on CPU, Mac Metal, CUDA, or ROCm. Covers finding GGUFs, quant selection, running servers, exact GGUF file lookup, conversion, and OpenAI-compatible local serving.

2026-05-02

autolab-hermes-delegation

Other Computer Occupations

Use Hermes delegate_task cleanly in this repo for planner, reviewer, researcher, reporter, experiment-worker, and memory-keeper roles.

2026-05-02

autolab-managed-experiment

Software Developers

Run one Autolab benchmark experiment safely on Hugging Face Jobs. Use when a planner, reviewer, or experiment worker is preparing, auditing, launching, or reviewing a single train.py hypothesis against the current local promoted master.

2026-05-02

autolab-reporter

Software Developers

Operate the local Trackio reporter for Autolab HF Jobs. Use when a reporter or planner needs to inspect scores, active jobs, worker anomalies, duplicate launches, or the overall experiment board.

2026-05-02

hf-cli

Software Developers

Hugging Face Hub CLI (`hf`) for downloading, uploading, and managing repositories, models, datasets, and Spaces on the Hugging Face Hub. Replaces the now-deprecated `huggingface-cli` command.

2026-05-02

#003

normies

2 skills · 70 · Updated 2026-03-10

13% of creator's skills

Skill

Occupation

Description

Last updated

normies-workflow

Software Developers

Orchestrate Docker-isolated, branch-based multi-agent git workflows with normies. Use for parallel edits, retries, and explicit review/integration gates.

2026-03-10

normies-workflow

Software Developers

Orchestrate Docker-isolated, branch-based multi-agent git workflows with normies. Use for parallel edits, batched tasks, retries, and explicit review/integration gates. Prefer this over ad-hoc shell loops for multi-step coordination; skip for trivial one-file edits that do not need orchestration.

2026-02-18

#004

kernel-skill

1 skill · 162 · Updated 2026-02-11

6.7% of creator's skills

Skill

Occupation

Description

Last updated

cuda-kernels

Software Developers

Provides guidance for writing and benchmarking optimized CUDA kernels for NVIDIA GPUs (H100, A100, T4) targeting the Hugging Face diffusers and transformers libraries. Supports models such as LTX-Video, Stable Diffusion, LLaMA, Mistral, Qwen, and Qwen3. Integrates with the Hugging Face Kernels Hub (`get_kernel`) for loading pre-compiled kernels, and includes benchmarking scripts that compare kernel performance against baseline implementations.

2026-02-11

#005

openenv-scaling

1 skill · 140 · Updated 2026-01-10

6.7% of creator's skills

Skill

Occupation

Description

Last updated

openenv-benchmark

Software Developers

Run OpenEnv scaling and concurrency benchmark experiments. Use when deploying benchmark infrastructure (local uvicorn, local docker, HF Spaces, SLURM single-node, SLURM multi-node), running test_scaling.py tests, or analyzing experiment results. Triggers on requests to benchmark, test scaling, measure concurrency, compare HTTP vs WebSocket performance, or review experiment reports.

2026-01-10

Showing 5 of 5 repositories