Troubleshooter for agentic-RL training, evaluation, and experiment design on LLM agents (single or multi-agent, multi-turn, tool-augmented). Routes a user's symptom to fixes anchored in the corpus. TRIGGER when: user is training, evaluating, or designing experiments for an RL-trained LLM agent; symptoms like reward not moving, eval flat, KL/entropy/length blow-ups, retokenization drift, tool-call parse failures, credit assignment, async-rollout staleness, judge inconsistency, benchmark contamination, pass@k vs pass@1; choices about ablation, baseline, framework, algorithm, reward, or data curation; user names GRPO, PPO, DAPO, veRL, OpenRLHF, slime, AReaL, RAGEN, or similar. SKIP: generic supervised LLM fine-tuning with no RL component; classical RL theory or tabular RL; non-LLM agents. Distilled from the AgentsMeetRL awesome list, snapshot 2026-06-20.

2026-06-20

#002

BeforeSubmitSkill

1 个 skills40更新于 2026-05-25

占该创作者 50%

skill

职业分类

描述

更新

before-submit

软件开发工程师

Comprehensive pre-submission quality check for an academic paper (LaTeX + BibTeX/biblatex). Use when the user is preparing to submit a paper and wants to verify: bibliography correctness (references actually exist, aren't hallucinated, aren't retracted, metadata matches), LaTeX formatting & writing quality, internal faithfulness (numbers in the text match the tables, figures match what the prose claims about them, no broken/empty citations or cross-references), double-blind / anonymization compliance, and venue-specific template rules (page limits, mandatory sections, checklists, style files). Triggers: "check my paper before I submit", "before submit", "is my paper ready for ACL/EMNLP/NeurIPS/CVPR/ICLR/...", "verify my references / .bib", "find fake or retracted citations", "do my numbers match the tables / figures", "check faithfulness / internal consistency", "double-blind / anonymization check", "did I follow the <venue> template". Works on a single .tex/.bib pair OR a whole multi-file LaTeX project direc

2026-05-25

已展示 2 / 2 个仓库

已展示全部仓库