GitHub creator profile

NVIDIA-NeMo

Browse 85 collected skills across 12 GitHub repositories, grouped by repository, with approximate occupation coverage.

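Collected skills
85
Repositories
12
Occupation fields
1
Updated
2026-05-22
Occupation coverage
The main occupation categories this creator covers.
The top 8 repositories are shown here; the full repository list continues below.
Repository browser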

Repositories and representative skills

#001
Megatron-Bridge
29 skills · 653324 · updated 2026-05-22
34% of this creator's skills
adding-model-support
Software Developer

Guide for adding support for new LLM or VLM models in Megatron-Bridge. Covers bridge, provider, recipe, tests, docs, and examples.

2026-05-22
perf-sequence-packing
Data Scientist

Validate and use packed sequences and long-context training in Megatron-Bridge, distinguishing offline packed SFT for LLMs from in-batch packing for VLMs, and applying the right CP constraints.

2026-05-22
perf-hierarchical-context-parallel
Software Developer

Operational guide for enabling hierarchical context parallelism in Megatron-Bridge, including config knobs, code anchors, pitfalls, and verification.

2026-05-20
perf-parallelism-strategies
Software Developer

Operational guide for choosing and combining parallelism strategies in Megatron Bridge, including sizing rules, hardware topology mapping, and combined parallelism configuration.

2026-05-20
resiliency
Software Developer

Resiliency features in Megatron Bridge including fault tolerance, straggler detection, in-process restart, preemption, and re-run state machine.

2026-05-20
multi-node-slurm
Network and Computer Systems Administrator

Convert single-node scripts to multi-node Slurm sbatch jobs and debug common multi-node failures. Covers srun-native vs uv run torch.distributed approaches, container setup, NCCL timeouts, OOM sizing for MoE models, and interactive allocation.

2026-05-18
nemo-rl-e2e-testing
Software Quality Assurance Analyst and Tester

External NeMo-RL end-to-end validation workflow for Megatron-Bridge model/provider changes, including downstream compatibility checks, external RL lifecycle behavior, Megatron policy setup, HF import/export, checkpoint/resume, non-colocated vLLM refit, delta weight transfer, optional LoRA/generation variants, and questions such as "does this model work in NeMo-RL", "run NeMo-RL e2e", or "external RL loop validation". Covers running NeMo-RL Megatron policy jobs from a Bridge checkout, choosing GRPO/SFT/checkpoint/non-colocated refit variants, setting PYTHONPATH so NeMo-RL imports the local Bridge tree, and reporting pass/fail evidence.

2026-05-18
verl-e2e-testing
Software Quality Assurance Analyst and Tester

External verl end-to-end validation workflow for Megatron-Bridge model/provider changes. Covers running a small verl Megatron backend job from a Bridge checkout, choosing LoRA/DDP plus optional save/resume and parallelism variants, setting PYTHONPATH so verl imports the local Bridge tree, and reporting pass/fail evidence.

2026-05-18
Showing the top 8 of 29 collected skills for this repository.
#002
RL
14 skills · 1.6k · 391 · updated 2026-05-20
16% of this creator's skills
auto-research
Data Scientist

Autonomous NeMo-RL research agent workflow for directed hypothesis testing and open-ended discovery. Guides agents through the full experiment lifecycle: understanding recipes and environments, wiring RL or NeMo-gym runs, launching reproducible baselines and iterations, analyzing results, preserving human oversight, and using git plus TSV logs as the research ledger.

2026-05-20
brev-etiquette
Network and Computer Systems Administrator

Brev instance operating guidance for NeMo-RL agents working in /home/ubuntu/RL with limited workspace disk, a larger /ephemeral volume, and optional /home/ubuntu/RL/.env secrets. Use when running auto-research campaigns, experiments, training jobs, model or dataset downloads, shared cache-heavy commands, log-producing runs, checkpoint generation, W&B or Hugging Face authenticated workflows, or any workflow that may create large files on Brev.

2026-05-20
session-memory
Software Developer

Manage durable working-session memory for coding agents. Use when a user asks to preserve or recover agent context across disconnects, VS Code restarts, long-running work, handoffs, or any session where important state should be written periodically under the repo's session directory.

2026-05-20
config-conventions
Software Developer

Configuration conventions for NeMo-RL. YAML is the single source of truth for defaults. Covers TypedDict usage, exemplar YAML updates, and forbidden default patterns.

2026-05-14
build-and-dependency
Software Developer

Build and dependency management for NeMo-RL. Covers Docker image building and running, uv usage, venv setup, and adding dependencies.

2026-05-01
cicd
Software Developer

CI/CD reference for NeMo-RL. Covers GitHub Actions pipeline structure, CI triggering via /ok to test, and CI failure investigation.

2026-05-01
contributing
Software Developer

Contribution conventions for NeMo-RL. Covers PR title format, commit sign-off, and CI triggering.

2026-05-01
copyright
Software Developer

NVIDIA copyright header requirements for NeMo-RL. Covers which files need headers and the exact header text.

2026-05-01
Showing the top 8 of 14 collected skills for this repository.
#003
Automodel
10 skills · 521160 · updated 2026-05-19
12% of this creator's skills
fern-docs
Web Developer

Maintain the NeMo AutoModel Fern docs site under fern/ — add, update, move, or remove pages; manage redirects, slugs, navigation, and version aliases; run validation and previews.

2026-05-19
model-onboarding
Software Developer

Guide for onboarding new model families into NeMo AutoModel, including architecture discovery, implementation patterns, registration, and validation.

2026-05-15
build-and-dependency
Software Developer

Dev environment setup for NeMo AutoModel — container-based development, uv package management, installation options, environment variables, and common build pitfalls.

2026-05-04
cicd
Software Developer

CI/CD reference for NeMo AutoModel — pipeline structure, commit and PR workflow, CI failure investigation, and common failure patterns.

2026-05-04
distributed-training
Software Developer

Guide for selecting and configuring distributed training strategies in NeMo AutoModel, including FSDP2, Megatron FSDP, DDP, and parallelism settings.

2026-05-04
launcher-config
Software Developer

Configure NeMo AutoModel job launches for interactive runs, Slurm clusters, and SkyPilot cloud execution.

2026-05-04
linting-and-formatting
Software Developer

Code style and quality rules for NeMo AutoModel — ruff configuration, naming conventions, type hints, docstrings, copyright headers, and the code review checklist.

2026-05-04
parity-testing
Software Quality Assurance Analyst and Tester

Verify numerical parity between NeMo AutoModel implementations and reference HuggingFace models, including state dict and forward-pass checks.

2026-05-04
Showing the top 8 of 10 collected skills for this repository.
#004
DataDesigner
8 skills · 1.9k · 173 · updated 2026-05-21
9.4% of this creator's skills
#005
Gym
8 skills · 914151 · updated 2026-05-19
9.4% of this creator's skills
nemo-gym-debugging
Software Developer

Use when debugging a Nemo Gym run or reward profiling job. Covers rollout collection failures, empty or partial JSONL outputs, stale materialized inputs, verifier/schema errors, Ray or Slurm issues, vLLM readiness, judge failures, tool/sandbox failures, cache problems, and throughput bottlenecks.

2026-05-19
nemo-gym-pivot-datasets
Database Architect

Use when creating, validating, or documenting Nemo Gym pivot datasets from rollout, trajectory, chat-completion, Responses API, or tool-call artifacts. Covers Gym Responses-style row conversion, pivot selection, single-step tool-use configs, agent_ref alignment, verifier knobs, expected-action row contracts, and train/eval usage.

2026-05-19
nemo-gym-reward-profiling
Data Scientist

Use to help users get started with Nemo Gym reward profiling. Covers the basic ng_run, ng_collect_rollouts, and ng_reward_profile workflow, repeated rollouts, materialized inputs, rollout JSONL artifacts, task and rollout identity, output inspection, partial profiling, and rollout_infos. For failed jobs, prefer nemo-gym-debugging.

2026-05-19
nemo-gym-pivot-datasets
Data Scientist

Use when creating, validating, or documenting Nemo Gym pivot datasets from rollout, trajectory, chat-completion, Responses API, or tool-call artifacts. Covers Gym Responses-style row conversion, pivot selection, single-step tool-use configs, agent_ref alignment, verifier knobs, expected-action row contracts, and train/eval usage.

2026-05-12
nemo-gym-reward-profiling
Data Scientist

Use to help users get started with Nemo Gym reward profiling. Covers the basic ng_run, ng_collect_rollouts, and ng_reward_profile workflow, repeated rollouts, materialized inputs, rollout JSONL artifacts, task and rollout identity, output inspection, partial profiling, and rollout_infos. For failed jobs, prefer nemo-gym-debugging.

2026-05-11
nemo-gym-docs
Software Developer

Maintain the NeMo Gym Fern docs site — add, update, move, or remove pages under fern/. Use for any documentation change. Triggered by: "edit docs", "add doc page", "update docs", "rename page", "fix broken link", "add redirect", "preview docs", "publish docs", any request that touches `fern/`.

2026-05-11
nemo-gym-debugging
Software Developer

Use when debugging a Nemo Gym run or reward profiling job. Covers rollout collection failures, empty or partial JSONL outputs, stale materialized inputs, verifier/schema errors, Ray or Slurm issues, vLLM readiness, judge failures, tool/sandbox failures, cache problems, and throughput bottlenecks.

2026-04-28
add-benchmark
Software Developer

Guide for adding a new benchmark or training environment to NeMo-Gym. Use when the user asks to add, create, or integrate a benchmark, evaluation, training environment, or resources server into NeMo-Gym. Also use when wrapping an existing 3rd-party benchmark library. Covers the full workflow: data preparation, resources server implementation, agent wiring, YAML config, testing, and reward profiling (baselining). Triggered by: "add benchmark", "new resources server", "integrate benchmark", "wrap benchmark", "add training environment", "add eval".

2026-03-25
#006
NeMo
4 skills · 17.3k · 3.4k · updated 2026-04-17
4.7% of this creator's skills
#007
Evaluator
4 skills · 28348 · updated 2026-05-22
4.7% of this creator's skills
launching-evals
Software Quality Assurance Analyst and Tester

Run, monitor, analyze, and debug LLM evaluations via nemo-evaluator-launcher. Covers running evaluations, checking status and live progress, debugging failed runs, exporting artifacts and logs, and analyzing results. ALWAYS triggers on mentions of running evaluations, checking progress, debugging failed evals, analyzing or analysing runs or results, run directories or artifact paths on clusters, Slurm job issues, invocation IDs, or inspecting logs (client logs, server logs, SSH to cluster, tail logs, grep logs). Do NOT use for creating or modifying evaluation configs.

2026-05-22
byob
Software Quality Assurance Analyst and Tester

Create custom LLM evaluation benchmarks using the BYOB decorator framework. Use when the user wants to (1) create a new benchmark from a dataset, (2) pick or write a scorer, (3) compile and run a BYOB benchmark, (4) containerize a benchmark, or (5) use LLM-as-Judge evaluation. Triggers on mentions of BYOB, custom benchmark, bring your own benchmark, scorer, or benchmark compilation.

2026-05-07
nel-assistant
Software Developer

Interactive config wizard for NeMo Evaluator Launcher (NEL). Use when the user wants to create a new evaluation config from scratch, set up an evaluation from existing configs, or modify a NEL config (deployment, tasks, multi-node, interceptors). ALWAYS triggers on mentions of creating configs, setting up evaluations, configuring models for evaluation, or modifying NEL YAML files. Do NOT use for monitoring, debugging, or analyzing already-running evaluations.

2026-04-21
accessing-mlflow
Data Scientist

Query and browse evaluation results stored in MLflow. Use when the user wants to look up runs by invocation ID, compare metrics across models, fetch artifacts (configs, logs, results), or set up the MLflow MCP server. ALWAYS triggers on mentions of MLflow, experiment results, run comparison, invocation IDs in the context of results, or MLflow MCP setup.

2026-03-20
#008
Safe-Synthesizer
4 skills · 223 · updated 2026-05-14
4.7% of this creator's skills
git-worktrees
Software Developer

Create, manage, and clean up git worktrees for isolated development, PR review, and A/B testing of agent configurations. Trigger keywords - worktree, worktrees, git worktree, parallel branches, isolated workspace, worktree cleanup, worktree prune, PR review, address PR comments, work on branch, work on PR.

2026-05-14
uv-build
Software Developer

uv package management, dependency groups, PyTorch index handling, hatch build system, and versioning for this repo. Triggers on: uv, uv sync, uv lock, uv add, uv build, dependency, pyproject.toml, extras, cpu, cu128, hatch, wheel, version, publish.

2026-05-14
safe-synthesizer
Software Developer

Use NeMo Safe Synthesizer through task-specific routing: running the CLI or SDK, configuring parameters, troubleshooting runtime failures, inspecting artifacts, and interpreting evaluation outputs. Use when the user asks about safe-synthesizer, NeMo Safe Synthesizer, synthetic data pipeline runs, DP settings, generation failures, artifacts, logs, offline/GPU setup, config overrides, or evaluation metrics.

2026-05-07
github-cli
Software Developer

Interact with the Safe-Synthesizer GitHub repository using the gh CLI. Activate when users want to list or create pull requests, check out PRs, work on someone else's PR, check CI status, investigate workflow failures, view job logs, create or triage issues, check review and approval status, manage releases, or inspect repo metadata. Trigger keywords - pull request, PR, issue, workflow, CI, actions, failed job, job log, release, review, approve, CODEOWNERS, labels, milestone, checkout, gh, GitHub.

2026-05-06
Showing 12 of 12 repositories
All repositories shown
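
The per-repository share figures on this page appear to follow directly from the skill counts. As a rough sketch, assuming each share is the repository's skill count divided by the 85 collected skills and displayed to two significant figures (the rounding rule is inferred from the listed values, not stated on the page), with `share_percent` as a hypothetical helper name:

```python
# Skill counts for the eight repositories listed on this page.
skills_per_repo = {
    "Megatron-Bridge": 29,
    "RL": 14,
    "Automodel": 10,
    "DataDesigner": 8,
    "Gym": 8,
    "NeMo": 4,
    "Evaluator": 4,
    "Safe-Synthesizer": 4,
}

TOTAL_SKILLS = 85  # total collected skills across all 12 repositories, per the page header


def share_percent(count: int, total: int = TOTAL_SKILLS) -> str:
    """Format count/total as a percentage with two significant figures.

    The two-significant-figure rounding is an assumption inferred from the
    values shown on the page (34%, 16%, 12%, 9.4%, 4.7%).
    """
    pct = 100 * count / total
    return f"{pct:.2g}%"


if __name__ == "__main__":
    for repo, count in skills_per_repo.items():
        print(f"{repo}: {count} skills, {share_percent(count)} of this creator's skills")
```

Under these assumptions the computed shares match the figures listed for each repository above.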