arksim-simulate
// Use when the user wants to simulate multi-turn conversations against an AI agent. Alias for the arksim-test skill; the canonical flow lives there.
Use when the user wants to re-evaluate a previous arksim simulation with different metrics, thresholds, or judge model without re-running the agent. Cheaper than re-simulating.
Use when the user wants to inspect arksim evaluation results, debug specific failures turn by turn, or compare two runs to measure improvement.
Use when the user wants to generate, edit, or extend arksim test scenarios. Reads the agent's source code to derive realistic scenarios; can build regression scenarios from past failures.
Use when the user wants to test, simulate, or evaluate an AI agent against multi-turn scenarios (also exposed as the arksim-simulate alias). Discovers the agent, generates scenarios, runs simulation and evaluation, surfaces failures.
Use when the user wants to launch the arksim web dashboard to browse evaluation results visually rather than in CLI output.
| name | arksim-simulate |
| description | Use when the user wants to simulate multi-turn conversations against an AI agent. Alias for the arksim-test skill; the canonical flow lives there. |
| allowed-tools | ["mcp__arksim__init_project","mcp__arksim__simulate_evaluate","mcp__arksim__read_result","Read","Write","Edit","Glob","Grep"] |
This skill is an alias for arksim-test. The canonical multi-turn
simulation + evaluation flow is documented there. Both names exist so
users can ask for "test" or "simulate" interchangeably.
When invoked, follow the instructions in the arksim-test skill.
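
The alias relationship can be pictured as a simple name lookup. The sketch below is only an illustration of that idea, not arksim's actual mechanism: the `SKILL_ALIASES` table and the `resolve_skill` helper are hypothetical names introduced here, and only the two skill names come from this document.

```python
# Hypothetical alias table. The skill names mirror the ones in this document;
# the lookup mechanism itself is an assumption made for illustration.
SKILL_ALIASES = {"arksim-simulate": "arksim-test"}


def resolve_skill(name: str) -> str:
    """Follow alias links until a canonical (non-alias) skill name is reached."""
    seen = set()
    while name in SKILL_ALIASES:
        if name in seen:
            # Guard against a misconfigured table where aliases point at each other.
            raise ValueError(f"alias cycle detected at {name!r}")
        seen.add(name)
        name = SKILL_ALIASES[name]
    return name


if __name__ == "__main__":
    # Both names end up at the same canonical skill.
    print(resolve_skill("arksim-simulate"))
    print(resolve_skill("arksim-test"))
```

Under this reading, asking for either name leads to the same set of instructions, which is why this skill carries no flow of its own.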