| "install", "setup", "environment", "venv", "dependencies", "API keys", "Node", "Python", or missing native tools | references/environment-setup.md |
| "install AgentV", "install ASSERT", "setup eval tools", "eval runner install", or native eval validation setup | references/install-eval-tools.md |
| "install GEPA", "install Trace", "install Agent Lightning", "install SkillOpt", "setup optimizer", or improvement library dependencies | references/install-improvement-libs.md |
| "create an eval", "judge", "grader", "rubric", "EVAL.yaml", or no eval standard | references/agentevals.md and references/agentv.md |
| "which eval standard", "convert eval", "compare standards", or mixed eval formats | references/eval-standards-guide.md |
"Agent Skills eval", evals.json, "skill quality", "with_skill", or "without_skill" | references/agent-skills-evals.md |
"ASSERT", assert-ai, "judge-traces", "spec-driven", "behavior taxonomy", "trace-aware", "policy failure modes", or eval_config.yaml | references/assert.md |
| "eval starter", "eval lint", "eval workspace contract", or expected eval artifacts | references/eval-workspace-contracts.md |
| "optimize a skill", "progressive disclosure", "Table of Contents", "Index Page", "conditional access", "top-level links", "scripted workflow", or "deterministic workflow generation" | references/skill-optimization-strategy.md |
| "which technique", "optimize this", "improvement plan", or mixed artifacts | references/techniques-guide.md |
| "GEPA", "Pareto", "reflective mutation", "prompt evolution", or "optimize anything" | references/gepa.md |
| "Trace", "OptoPrime", "computation graph", "node", "bundle", or end-to-end generative optimization | references/microsoft-trace.md |
| "VISTA", "interpretable APO", "hypothesis agent", "random restart", or "epsilon-greedy" | references/vista.md |
| "Agent Lightning", "RL", "reward", "policy reward", "governed training", or skill improvement with policy constraints | references/agent-lightning.md |
"SkillOpt", "SkillOpts", "skill evolution", best_skill.md, "held-out gate", "bounded edits", "textual learning rate", or "SkillOpt-Sleep" | references/skillopt.md |
| "eval failures", "agent traces", "span logs", "benchmark deltas", or "release evidence" | references/eval-trace-improvement.md |
| "synthetic data", "simulation data", "Simula", "QDC", "Source2Synth", "MAG-V", "MetaSynth", "BARE", "Condor", "data auditor", "generate data", or "simulate" | references/simulation-data.md |
| "CLI", "init", "improve", "eval", "simulate", "lint", "workspace", or "deterministic improvement artifacts" | references/workspace-contracts.md |