con un clic
add-benchmark
// Add a new SWE benchmark task from a real GitHub bug-fix. Use when the user provides a GitHub issue or PR URL and wants to add it to the bench-swe pipeline.
// Add a new SWE benchmark task from a real GitHub bug-fix. Use when the user provides a GitHub issue or PR URL and wants to add it to the bench-swe pipeline.
Refresh or rebuild the bundled Lumen index for the current project, preferring MCP-driven refreshes and using the CLI only for an explicit clean rebuild.
Run a health check on the bundled Lumen semantic search setup for the current project, verify backend reachability and index freshness, and summarize remediation steps.
| name | add-benchmark |
| description | Add a new SWE benchmark task from a real GitHub bug-fix. Use when the user provides a GitHub issue or PR URL and wants to add it to the bench-swe pipeline. |
| argument-hint | <github-issue-or-pr-url> <language> |
| disable-model-invocation | true |
Add a new benchmark task to the bench-swe pipeline from a real GitHub bug-fix. The human provides the GitHub issue or PR URL; the agent handles extraction, validation, and file creation.
https://github.com/gorilla/mux/issues/534 or
https://github.com/gorilla/mux/pull/585)Good benchmark repos are focused libraries with a clear bug — not large applications. Before submitting a URL, prefer repos that are:
The agent will reject repos that exceed these limits.
Dispatch the task-curator agent with the provided arguments. The agent
will:
bench-swe/tasks/{language}/ and patch to
bench-swe/patches/Report the result including: