Skip to main content
Exécutez n'importe quel Skill dans Manus
en un clic

ab-test

A/B test an agent's current prompt against a candidate variant from policy/<agent>/candidates/. Runs both on the same scenarios, aggregates rewards, recommends promotion if candidate beats current by >1 stderr with n≥10. Triggers on /ab-test, "compare prompts", "test the candidate", "is the new prompt better".

Étoiles0
Forks0
Mis à jour10 mai 2026 à 15:17
SKILL.md
readonly