wandb-improve
Analyze Weave traces and eval results, then improve Mistral prompts for the Promus task agent
updated: March 1, 2026, 00:02
SKILL.md
| name | wandb-improve |
| description | Analyze Weave traces and eval results, then improve Mistral prompts for the Promus task agent |
You have access to the W&B MCP Server. Follow this loop:
Run `cd evals && uv run python scripts/run_eval.py` to get baseline scores.
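The baseline step can also be scripted. The following is a minimal sketch, assuming `uv` is on the PATH and that the script prints its scores to standard output (the source does not say where the scores go). The `run_eval` helper and its `runner` parameter are hypothetical names introduced for illustration.

```python
import subprocess
from pathlib import Path

# The command from the step above, split into argv form.
EVAL_COMMAND = ["uv", "run", "python", "scripts/run_eval.py"]


def run_eval(evals_dir="evals", runner=subprocess.run):
    """Run the eval script inside `evals_dir` and return its stdout as text.

    `runner` is injectable (hypothetical design choice) so the helper can be
    exercised without uv installed.
    """
    result = runner(
        EVAL_COMMAND,
        cwd=Path(evals_dir),   # equivalent to `cd evals` before running
        capture_output=True,   # collect stdout/stderr instead of printing
        text=True,             # decode output as str
        check=True,            # raise CalledProcessError on non-zero exit
    )
    return result.stdout
```

Calling `run_eval()` from the repository root and saving the returned text gives a baseline to compare against after the prompts are edited.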
Use the W&B MCP tools to:
Read the current prompts in `evals/scripts/run_eval.py`:
- `INTENT_CLASSIFIER_PROMPT`: intent classification system prompt
- `TASK_PLANNER_PROMPT`: task planning system prompt

Compare prompt instructions against the failures found in traces.
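One way to organize that comparison is to tally failures per prompt and reason, so the most frequent problems are addressed first. This is a sketch only: the record shape (`prompt`, `reason`, `passed`) and the example reasons are assumptions for illustration, not the format that Weave or the W&B MCP tools return.

```python
from collections import Counter


def tally_failures(records):
    """Count failed records per (prompt, reason) pair, most frequent first.

    Each record is a dict with hypothetical keys:
      prompt: which prompt produced the output
      reason: a short label for why the output failed
      passed: whether the eval scored it as a pass
    """
    counts = Counter(
        (r["prompt"], r["reason"]) for r in records if not r["passed"]
    )
    return counts.most_common()


# Hypothetical, hand-written records for illustration only.
example = [
    {"prompt": "INTENT_CLASSIFIER_PROMPT", "reason": "ambiguous intent", "passed": False},
    {"prompt": "INTENT_CLASSIFIER_PROMPT", "reason": "ambiguous intent", "passed": False},
    {"prompt": "TASK_PLANNER_PROMPT", "reason": "missing step", "passed": False},
    {"prompt": "TASK_PLANNER_PROMPT", "reason": "missing step", "passed": True},
]
print(tally_failures(example))
```

Each pair at the top of the tally points to the prompt section worth rereading first.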
Edit the prompts to address identified issues. Common improvements:
Run `cd evals && uv run python scripts/run_eval.py` again to generate new scores.
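Once both runs exist, the before/after comparison reduces to per-metric deltas. A minimal sketch, assuming each run's scores are available as a flat mapping from metric name to number. The metric names below are hypothetical; the source does not list the metrics `run_eval.py` reports.

```python
def score_deltas(baseline, improved):
    """Return {metric: (before, after, delta)} for metrics present in both runs."""
    shared = sorted(set(baseline) & set(improved))
    return {
        m: (baseline[m], improved[m], round(improved[m] - baseline[m], 4))
        for m in shared
    }


def regressions(deltas):
    """List the metrics whose score went down after the prompt edit."""
    return [m for m, (_, _, delta) in deltas.items() if delta < 0]


# Hypothetical scores for illustration only.
before = {"intent_accuracy": 0.5, "plan_quality": 0.75}
after = {"intent_accuracy": 0.75, "plan_quality": 0.5}

deltas = score_deltas(before, after)
print(deltas)
print("regressed:", regressions(deltas))
```

Checking for regressions before writing the report helps catch a prompt edit that fixes one failure mode while degrading another.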
Use the W&B MCP `create_wandb_report_tool` to create a comparison report showing before/after metrics.
Document what changed and why in a commit message.