Jeden Skill in Manus ausführen
mit einem Klick

Jeden Skill in Manus mit einem Klick ausführen

$pwd:

autoresearch

Name: Autoresearch
Author: TheGoatedDev

// Use when the user wants to run the autonomous skill optimization loop. Iteratively improves skills and their agents in the marketplace by modifying SKILL.md and agent files, evaluating changes via head-to-head ELO ranking, and keeping improvements.

In Manus ausführen

$ git log --oneline --stat

stars:0

forks:0

updated:22. März 2026 um 12:58

SKILL.md

readonly

related-skills.json

gleiches Repository

ubiquitous-language.md

from "TheGoatedDev/AutoSkillResearch-Marketplace"

Use when user wants to define domain terms, build a glossary, harden terminology, create a ubiquitous language, audit codebase terminology, or mentions "domain model" or "DDD". Also triggers on "audit" with terminology context.

2026-03-300

ux-audit.md

from "TheGoatedDev/AutoSkillResearch-Marketplace"

Cognitive load audit of UI code. Use this skill when the user wants to audit, review, or analyze UI components for cognitive load, usability, or UX quality. Triggers on: "ux audit", "cognitive load audit", "audit my form", "review this UI for usability", "check cognitive load", "is this form too complex", "analyze UX of these components", or any request to evaluate UI code against cognitive load principles, usability heuristics, or form design best practices. Also triggers when the user provides UI file paths and asks about complexity, overwhelm, or user confusion.

2026-03-220

deep-research.md

from "TheGoatedDev/AutoSkillResearch-Marketplace"

Multi-phase deep research pipeline that discovers, validates, and synthesizes information from many sources into a comprehensive report. Uses parallel Researcher subagents for maximum coverage and depth. Use this skill whenever the user says "deep research" followed by a topic, when they explicitly invoke it, or when you determine that a question requires thorough, multi-source investigation rather than a quick lookup. This is different from a simple documentation search or a single web query — use this when the user needs a comprehensive understanding of a topic drawn from many sources, cross-validated and synthesized into a structured report. Triggers include: "deep research on X", "research X thoroughly", "I need a full report on X", "dive deep into X", "investigate X comprehensively", or any context where surface-level answers won't cut it and the user (or you) needs authoritative, multi-source coverage of a topic.

2026-03-220

interrogate.md

from "TheGoatedDev/AutoSkillResearch-Marketplace"

Devil's advocate interrogation of a feature or idea. Use this skill whenever the user wants to pressure-test, challenge, refine, or deeply explore a feature, idea, plan, or proposal. Triggers on phrases like "interrogate this", "challenge my idea", "poke holes in this", "help me think this through", "is this a good idea", "what am I missing", or any request to stress-test thinking. Also use when the user describes a feature and seems uncertain or is asking for validation.

2026-03-200

example-greeter.md

from "TheGoatedDev/AutoSkillResearch-Marketplace"

Use when the user greets you with hello, hi, hey, good morning, good afternoon, good evening, howdy, or any greeting — including greetings that also mention what they're working on. Responds with a friendly greeting and offers to help.

2026-03-200

package.json

"author": "TheGoatedDev"

"repository": "TheGoatedDev/AutoSkillResearch-Marketplace"

GitHub-Repository öffnen Creator-Repositorys ansehen

$ install --global

$ download --local

In Manus ausführen

$ useful --forSOC

SoftwareentwicklerInformatik- und Mathematikberufe15-1252L4

name	autoresearch
description	Use when the user wants to run the autonomous skill optimization loop. Iteratively improves skills and their agents in the marketplace by modifying SKILL.md and agent files, evaluating changes via head-to-head ELO ranking, and keeping improvements.
disable-model-invocation	true
allowed-tools	Read, Write, Edit, Bash, Agent, Glob, Grep

AutoSkillResearch Orchestrator

You are the autonomous skill optimization loop. You modify skills and their agents, evaluate them via head-to-head comparison, and keep improvements.

Invocation

/autoresearch — optimize all skills in round-robin rotation
/autoresearch <skill-name> — optimize a single skill

Startup

Check the concurrency lock:
```
python3 scripts/lockfile.py check --lock-path .autoresearch.lock
```
If locked, print the lock info and stop.

Acquire the lock:

python3 scripts/lockfile.py acquire --lock-path .autoresearch.lock --skill "<first-skill>"

Read program.md for the global optimization strategy.
List all optimized-tier skills:
- Glob for plugins/*/evals/cases.json
- For each, verify it has >=5 eval cases
- If $ARGUMENTS specifies a skill name, filter to just that skill
For the current skill, create or switch to the working branch:
```
git checkout -B autoresearch/<skill-name>
```
This creates the branch if it doesn't exist, or switches to it if it does.

The Loop

For each skill in rotation, run iterations_per_rotation iterations (read from evals/config.json, default 10):

Per-Iteration Steps

1. Read Context Read these files for the current skill:

plugins/<skill>/evals/config.json
plugins/<skill>/evals/cases.json
plugins/<skill>/experiments/log.json
plugins/<skill>/skills/<skill>/SKILL.md
plugins/<skill>/optimize.md (if exists)
program.md

2. Hypothesize Based on the experiment log (what worked, what failed) and the optimization strategy:

Form a clear hypothesis about what change will improve the skill
Write the hypothesis down before making any changes
Make ONE focused change per iteration
Save the current SKILL.md content as champion_content (you need it for comparison)

Write the champion content to a temp file for cache lookups:

cp plugins/<skill>/skills/<skill>/SKILL.md /tmp/ar-champion.md

3. Modify

Edit the SKILL.md file to implement your hypothesis

Commit the change:

git add plugins/<skill>/skills/<skill>/SKILL.md
git commit -m "autoresearch: <one-line hypothesis>"

4. Evaluate For each eval case in cases.json:

a. Run candidate skill — dispatch a skill-executor agent with:

The modified SKILL.md content
The eval case input and context

b. Run champion skill (with caching) — first check the cache:

python3 scripts/champion_cache.py get --cache-dir plugins/<skill>/experiments/.cache --champion-content-file /tmp/ar-champion.md --case-id <case_id>

If the result contains "hit": true, use the cached output, token_count, and triggered values. Do NOT dispatch a skill-executor agent.

If the result contains "hit": false, dispatch a skill-executor agent with the saved champion_content and the eval case input/context. Then cache the result:

python3 scripts/champion_cache.py put --cache-dir plugins/<skill>/experiments/.cache --champion-content-file /tmp/ar-champion.md --case-id <case_id> --result '<executor_result_json>'

c. Judge — dispatch a skill-judge agent with:

Both outputs (randomized order, anonymized as A/B)
The eval case rubric

d. Record results — for each case, record:

Winner (candidate/champion/draw)
Trigger status (did the skill trigger?)
Token counts

If a subagent fails, log it and continue with remaining cases. If >=50% of cases error, abort this iteration and revert.

5. Compute Metrics Write match and trigger results to temp files, then compute:

echo '<matches_json>' > /tmp/ar-matches.json
echo '<triggers_json>' > /tmp/ar-triggers.json
python3 scripts/metrics.py --matches-file /tmp/ar-matches.json --triggers-file /tmp/ar-triggers.json --champion-tokens <n> --candidate-tokens <n>

6. Update ELO For each match result:

python3 scripts/elo.py update --elo-path plugins/<skill>/experiments/elo.json --candidate-commit <hash> --result <A|B|draw>

7. Decide: Keep or Discard Write metrics to temp files:

echo '<metrics_old_json>' > /tmp/ar-metrics-old.json
echo '<metrics_new_json>' > /tmp/ar-metrics-new.json
python3 scripts/promotion.py --config plugins/<skill>/evals/config.json --elo <candidate_elo> --matches <n> --metrics-old-file /tmp/ar-metrics-old.json --metrics-new-file /tmp/ar-metrics-new.json

If keep:

Invalidate the champion cache (the champion is about to change):

python3 scripts/champion_cache.py invalidate --cache-dir plugins/<skill>/experiments/.cache

Update experiment log:

python3 scripts/experiment_log.py add --log-path plugins/<skill>/experiments/log.json --entry '<json>'
python3 scripts/experiment_log.py prune --log-path plugins/<skill>/experiments/log.json

Update ELO state to promote:

python3 scripts/elo.py update --elo-path plugins/<skill>/experiments/elo.json --candidate-commit <hash> --result promote

Generate changelog:

python3 scripts/changelog.py --skill-name <name> --log-path plugins/<skill>/experiments/log.json --output plugins/<skill>/experiments/changelog.md

Bump version:

python3 scripts/version.py --plugin-json plugins/<skill>/.claude-plugin/plugin.json

Commit all state files:

git add plugins/<skill>/experiments/ plugins/<skill>/.claude-plugin/plugin.json
git commit -m "autoresearch: keep — <hypothesis summary>"

If discard:

Revert the SKILL.md change:
```
git reset --hard HEAD~1
```

Finalize ELO state for the discarded candidate:

python3 scripts/elo.py update --elo-path plugins/<skill>/experiments/elo.json --candidate-commit <hash> --result discard

Log the discarded attempt:

python3 scripts/experiment_log.py add --log-path plugins/<skill>/experiments/log.json --entry '<json>'

Commit the log update:

git add plugins/<skill>/experiments/log.json plugins/<skill>/experiments/elo.json
git commit -m "autoresearch: discard — <hypothesis summary>"

8. Check Exit Conditions

If iterations_per_rotation reached -> move to next skill
If all metrics at ceiling (eval_quality >= 0.95, trigger_accuracy >= 0.95) -> move to next skill early
Otherwise -> loop back to Step 1

9. Refresh Lock Every 30 minutes (track wall clock time):

python3 scripts/lockfile.py refresh --lock-path .autoresearch.lock

Agent Rotation

After completing all skill iterations for a plugin, check if the plugin has agents to optimize:

Read plugins/<skill>/.claude-plugin/plugin.json and check for an optimizable_agents array.
If absent or empty, skip to the next skill.
If present, read program-agents.md for the agent optimization strategy.

For each agent filename in optimizable_agents (e.g., researcher.md), run a separate rotation of iterations_per_rotation iterations:

Agent Target Naming

The target name is the agent filename without extension (e.g., researcher for researcher.md). This is used to construct per-target state file paths:

ELO: plugins/<skill>/experiments/elo-<target>.json
Log: plugins/<skill>/experiments/log-<target>.json
Changelog: plugins/<skill>/experiments/changelog-<target>.md

If the ELO or log file doesn't exist yet, initialize them:

python3 scripts/elo.py init --elo-path plugins/<skill>/experiments/elo-<target>.json --champion-commit $(git rev-parse HEAD)
python3 scripts/experiment_log.py init --log-path plugins/<skill>/experiments/log-<target>.json

Per-Iteration Steps (Agent)

1. Read Context Read these files:

plugins/<skill>/evals/config.json
plugins/<skill>/evals/cases.json
plugins/<skill>/experiments/log-<target>.json
plugins/<skill>/agents/<agent-filename> (the agent to optimize)
plugins/<skill>/skills/<skill>/SKILL.md (the frozen champion skill)
program-agents.md

2. Hypothesize Based on the agent experiment log and the agent optimization strategy:

Form a clear hypothesis about what agent change will improve the skill's end-to-end output
Write the hypothesis down before making any changes
Make ONE focused change per iteration

Save the current agent file as champion_agent_content:

cp plugins/<skill>/agents/<agent-filename> /tmp/ar-champion-agent.md

3. Modify

Edit the agent file to implement your hypothesis

Commit the change:

git add plugins/<skill>/agents/<agent-filename>
git commit -m "autoresearch: agent/<target> — <one-line hypothesis>"

4. Evaluate For each eval case in cases.json:

a. Run candidate — dispatch a skill-executor agent with:

The champion SKILL.md content (frozen — do not modify)
The eval case input and context
The agents dict including the modified agent file: {"<agent-filename>": "<modified content>"}

b. Run champion (with caching) — check cache with agent contents included:

python3 scripts/champion_cache.py get --cache-dir plugins/<skill>/experiments/.cache --champion-content-file /tmp/ar-champion.md --case-id <case_id> --agent-content-files /tmp/ar-champion-agent.md

If hit, use cached results.

If miss, dispatch a skill-executor with the champion SKILL.md and champion agent content, then cache:

python3 scripts/champion_cache.py put --cache-dir plugins/<skill>/experiments/.cache --champion-content-file /tmp/ar-champion.md --case-id <case_id> --result '<json>' --agent-content-files /tmp/ar-champion-agent.md

c. Judge — same as skill rotation: dispatch skill-judge with both outputs and the rubric.

d. Record results — same format as skill rotation.

5-7. Compute Metrics, Update ELO, Decide Same as skill rotation, but use per-target state files:

python3 scripts/elo.py update --elo-path plugins/<skill>/experiments/elo-<target>.json --candidate-commit <hash> --result <A|B|draw>
python3 scripts/promotion.py --config plugins/<skill>/evals/config.json --elo <candidate_elo> --matches <n> --metrics-old-file /tmp/ar-metrics-old.json --metrics-new-file /tmp/ar-metrics-new.json

If keep:

Invalidate the champion cache:

python3 scripts/champion_cache.py invalidate --cache-dir plugins/<skill>/experiments/.cache

Update per-target experiment log and ELO:

python3 scripts/experiment_log.py add --log-path plugins/<skill>/experiments/log-<target>.json --entry '<json>'
python3 scripts/experiment_log.py prune --log-path plugins/<skill>/experiments/log-<target>.json
python3 scripts/elo.py update --elo-path plugins/<skill>/experiments/elo-<target>.json --candidate-commit <hash> --result promote

Generate per-target changelog:

python3 scripts/changelog.py --skill-name "<skill>/<target>" --log-path plugins/<skill>/experiments/log-<target>.json --output plugins/<skill>/experiments/changelog-<target>.md

Bump plugin version:

python3 scripts/version.py --plugin-json plugins/<skill>/.claude-plugin/plugin.json

Commit state:

git add plugins/<skill>/experiments/ plugins/<skill>/.claude-plugin/plugin.json
git commit -m "autoresearch: keep agent/<target> — <hypothesis summary>"

If discard:

Revert the agent file change:
```
git reset --hard HEAD~1
```

Finalize ELO and log:

python3 scripts/elo.py update --elo-path plugins/<skill>/experiments/elo-<target>.json --candidate-commit <hash> --result discard
python3 scripts/experiment_log.py add --log-path plugins/<skill>/experiments/log-<target>.json --entry '<json>'

Commit log update:

git add plugins/<skill>/experiments/log-<target>.json plugins/<skill>/experiments/elo-<target>.json
git commit -m "autoresearch: discard agent/<target> — <hypothesis summary>"

8. Check Exit Conditions Same as skill rotation — check iteration count and metric ceilings.

Shutdown

On completion or interruption:

python3 scripts/lockfile.py release --lock-path .autoresearch.lock

Error Recovery

If you encounter an error:

Log it in the experiment log with outcome "error"
Revert any uncommitted changes: git checkout -- .
Continue to the next iteration
If 3 consecutive errors on the same skill, move to the next skill

Key Rules

Do NOT pause to ask the human if you should continue
Do NOT make multiple changes per iteration
Do NOT skip the hypothesis step — always write it down before editing
During skill rotation: Do NOT modify any file other than the skill's SKILL.md during the modify step
During agent rotation: Do NOT modify any file other than the target agent file during the modify step. The SKILL.md is frozen as the champion.
Always save champion content BEFORE modifying (SKILL.md for skill rotation, agent file for agent rotation)
Use program.md for skill hypotheses, program-agents.md for agent hypotheses

autoresearch

Mehr aus diesem Repository

Mehr aus diesem Repository

AutoSkillResearch Orchestrator

Invocation

Startup

The Loop

Per-Iteration Steps

Agent Rotation

Agent Target Naming

Per-Iteration Steps (Agent)

Shutdown

Error Recovery

Key Rules

AutoSkillResearch Orchestrator

Invocation

Startup

The Loop

Per-Iteration Steps

Agent Rotation

Agent Target Naming

Per-Iteration Steps (Agent)

Shutdown

Error Recovery

Key Rules