一键导入
autoresearch
// Evolve a skill by generating variations, evaluating them, and updating the best version
// Evolve a skill by generating variations, evaluating them, and updating the best version
| name | Autoresearch |
| description | Evolve a skill by generating variations, evaluating them, and updating the best version |
| var | |
| tags | ["meta","dev"] |
${var} — Name of the skill to evolve (e.g.
token-movers). Required.
If ${var} is empty, abort with: "autoresearch requires var= set to a skill name" and exit.
Read memory/MEMORY.md for context.
Improve an existing skill by researching better approaches, generating 4 distinct variations, scoring them against a rubric, and committing the winning version as a PR.
Read skills/${var}/SKILL.md. If the file doesn't exist, abort and notify: "Skill '${var}' not found."
Parse the skill's:
Save the original content — you'll need it for the PR diff later.
Search the web for better approaches to what this skill does:
Also review:
memory/cron-state.json — has this skill been failing?Create 4 distinct improved versions of the SKILL.md, each with a different thesis:
Variation A — Better inputs: Improve data sources. Add alternative/complementary APIs, better search queries, more reliable endpoints. Fix any broken or deprecated sources found in step 2.
Variation B — Sharper output: Improve the output format and content quality. Make notifications more actionable, articles more substantive, analysis more insightful. Reduce noise, improve signal.
Variation C — More robust: Improve reliability and edge-case handling. Add fallback logic for when APIs fail, better deduplication, graceful handling of empty data, clearer error messages.
Variation D — Rethink: Take a fundamentally different approach to achieving the same goal. Different methodology, different angle, or a creative combination of techniques the original didn't consider.
Each variation must:
./notify)<!-- autoresearch: variation X — thesis description -->Score each variation on a 1-5 scale across these criteria:
| Criterion | What to evaluate |
|---|---|
| Clarity | Will Claude execute this correctly? Are instructions unambiguous? |
| Data quality | Are sources reliable, diverse, and likely to return useful data? |
| Output value | Is the output actionable and worth reading? Low noise? |
| Robustness | Does it handle failures, empty data, and edge cases? |
| Conventions | Does it follow Aeon patterns? (memory, logging, notify, var usage) |
| Improvement | How much better is this than the original? |
Write out your scoring with brief justification for each score. Calculate a weighted total:
Pick the highest-scoring variation. If scores are very close (within 2 points total), prefer the variation that makes the biggest single improvement rather than small incremental changes.
Write the winning variation to skills/${var}/SKILL.md, replacing the original.
Create a branch named autoresearch/${var} and commit the change:
git checkout -b autoresearch/${var}
git add skills/${var}/SKILL.md
git commit -m "improve(${var}): autoresearch evolution
Variation chosen: [A/B/C/D] — [thesis]
Key changes: [1-2 sentence summary]"
git push -u origin autoresearch/${var}
Open a PR with:
improve(${var}): autoresearch evolutiongh pr create --title "improve(${var}): autoresearch evolution" --body "..."
Important: step 6 left you on the autoresearch/${var} feature branch. Checkout main before writing the log so the entry lands on the default branch where skill-evals scans — otherwise the log is trapped on the feature branch and the skill appears to have never run (ISS-010).
git checkout main
git pull --ff-only origin main
Send via ./notify -f (write the body to a temp file first — multi-line argv trips the sandbox, ISS-009):
TEMP=$(mktemp -t autoresearch.XXXXXX.md)
cat > "$TEMP" <<'MSG'
*Autoresearch — ${var}*
Winner: Variation [X] — [thesis]
Score: [total]/50
Key changes: [summary]
PR: [url]
MSG
./notify -f "$TEMP"
Append to memory/logs/${today}.md with an explicit header so the skill-evals pattern (Autoresearch|autoresearch|AUTORESEARCH) matches:
## Autoresearch — ${today}
- **Target:** ${var}
- **Winner:** Variation [X] ([score]/50)
- **Thesis:** [description]
- **PR:** [url]
- **Runners-up:** [brief scores]
The workflow's auto-commit picks up the log on main.
5 concrete real-life actions for today based on everything known about Aaron
Top 10 tweets by influence mentioning AI agents or OpenClaw
Weekly tracker of AI agent substitution signals — which roles, companies, and industries show real headcount displacement. Named roles + real deployments only. Closes topic-momentum "agent substitution" gap.
Cross-domain market pulse from AIXBT's free grounding endpoint — crypto, macro, tradfi, geopolitics. Refreshes taxonomy references (clusters, chains) as a bonus.
Research a trending topic in Aaron's interest areas and write an opinionated article in his voice
Analyze a URL and generate a tailored aeon.yml schedule with skill suggestions