with one click
improve-gap-task
// Researches, plans, implements, validates, and opens a focused PR for one selected self-improvement gap. Use when executing one chosen backlog item from the nightly self-improvement workflow.
// Researches, plans, implements, validates, and opens a focused PR for one selected self-improvement gap. Use when executing one chosen backlog item from the nightly self-improvement workflow.
Investigate reported auth, credential, permission, API proxy, 401, 403, 503, OAuth, token, or secret-resolution failures by querying VictoriaLogs first. Use when a user says an integration auth failed, a tool got unauthorized/forbidden/service unavailable, an api-proxy request failed, or asks why credentials/secrets are not working.
Scaffold and build new tool integrations in tools/. Use when asked to create a new tool, add an API integration, or build a new client for an external service.
Analyzes a day of Slack-thread user sessions to discover opportunities for new skills, personas, tools, workflows, and system-level improvements. Use alongside gap-analysis for the nightly self-improvement workflow.
Run comprehensive QA and integration tests against the local Centaur stack. Use when asked to QA the stack, run integration tests, verify a deployment, or check stack health after a refactor.
| name | improve-gap-task |
| description | Researches, plans, implements, validates, and opens a focused PR for one selected self-improvement gap. Use when executing one chosen backlog item from the nightly self-improvement workflow. |
Execute one focused self-improvement fix from research through PR creation.
Use this skill only when the workflow has already selected a single fix packet.
The workflow phases are fixed and must stay focused:
Keep the work narrow. One selected backlog item should become one focused PR.
git-branch <owner/repo> because the mounted repo is read-only.prompt_tweak is a real tool but an over-prescribed one. Before committing to it, confirm that perfect prompt compliance would actually fix the failure. If the failure would recur even with the prompt read verbatim, choose a code, workflow, or tool fix instead.Produce a short structured answer covering:
Falsifiability rule: If you cannot articulate how to verify the fix worked, the fix is speculative. Downgrade confidence and say so.
Produce a short implementation plan covering:
Make the change in the writable clone.
Run the smallest relevant checks. Prefer targeted tests or repo checks over broad full-stack suites unless the change touches infra-wide behavior.
If the fix relies on a heuristic, regex, keyword trigger, or prompt wording change, the validation phase must include adversarial examples:
Do not ship a prompt tweak or routing heuristic that only matches the exact phrasing from the source thread.
Open one focused PR.
Before committing to a fix type, work through this decision tree:
Is this failure caused by the agent not knowing what to do, or not being able to do it?
prompt_tweak, new_skill, or new_personabug_fix, tool_improvement, or workflow_fixDoes this failure pattern appear in 2+ reviewed tasks?
Can the fix be verified by re-running the failing scenario?
Does the fix risk regressing any currently-passing behavior?
Is this a missing capability (recurring demand with no existing support)?
new_skill (procedural, repeatable workflow) or new_persona (stance, style, decision framing)bug_fix, workflow_fix, prompt_tweak, or tool_improvementAllowed fix types:
bug_fixworkflow_fixprompt_tweaktool_improvementnew_skillnew_personaTreat new_skill and new_persona as first-class fix types.
If the selected fix type is new_skill or new_persona, include an explicit justification answering all three questions:
Do not hand-wave this. Make the justification concrete and tied to evidence from the reviewed tasks.
Write the PR like a senior engineer explaining a thoughtful improvement to the codebase. The PR title and body are the public face of the self-improvement loop — they should be clear, well-reasoned, and useful to a reviewer.
Use this structure:
## Summary
1-3 bullet points: what changed and why.
## Problem
Explain the failure pattern or capability gap that motivated this fix.
Describe the root cause concretely — cite the specific code, prompt, or
workflow behavior that was wrong or missing. Do NOT describe individual
user sessions, tasks, thread contents, or conversations.
## Fix
Explain the change and why it addresses the root cause. A reviewer who
has never seen the gap-analysis output should understand the reasoning.
## Verification
What checks were run. Include command output summaries where useful.
The PR body, title, and commit messages must NEVER contain:
Frame everything in terms of the system behavior that was wrong, not the user interaction where it was observed. For example:
If you need to cite evidence, describe the pattern ("3 of 5 reviewed tasks hit the context limit without using handoff") not the specific sessions.
The upstream gap-analysis / learning-synthesis pass separately emits a
slack_narrative that DOES name users and sessions — that narrative is used
only for the internal ai-v2 scorecard post and is stripped from the
fix_packet before it reaches you. If you somehow receive a slack_narrative
field, ignore it entirely and never echo it into the PR or commits. Every
name, handle, and thread reference you see anywhere in your context is
treated as private.
The PR body must stay concise and human-readable. Use labels as the required machine-readable handoff:
self-improvefix-type:<type> where <type> is the selected fix typeAfter opening the PR, verify with gh pr view that the required labels are present.
Do NOT add a hidden HTML-comment metadata block. Source-thread notification is best-effort only and may be unavailable when a PR body intentionally omits private thread metadata.
Return JSON only in each phase.
{
"root_cause": "",
"fix_type": "workflow_fix",
"affected_files": [""],
"acceptance_criteria": [""],
"verification_plan": [""],
"risks": [""],
"confidence": "high",
"new_capability_justification": ""
}
{
"files": [""],
"plan": [""],
"validation": [""],
"pr_title": "",
"expected_impact": ""
}
{
"changed_files": [""],
"summary": ""
}
{
"checks": [
{
"command": "",
"status": "passed"
}
],
"summary": "",
"regression_check": ""
}
{
"branch": "",
"commit": "",
"pr_number": 0,
"pr_url": "",
"pr_title": "",
"verified_handoff": true
}
references/history.md for the intervention log format and prior fix context.