一键在 Manus 中运行任何 Skill

evo-end-to-end

Run a Codex planning-to-Evo workflow for evo-hq/evo v0.4.4+. Use when the user wants to start from a vague performance, architecture, refactor, flaky-test, slow-build, or code-quality problem; optionally use grill-me/grill-with-docs/improve-codebase-architecture; produce an Evo-ready experiment brief; then hand the brief to `$evo discover`, `$evo optimize`, and, when needed, Evo backend/runtime setup with safe scope, metric, gate, backend, host, budget, stall rule, and merge rules.

在 Manus 中运行

概览

安装命令

npx skills add https://github.com/Hellfrosted/agents --skill evo-end-to-end

复制此命令并粘贴到 Claude Code 中以安装该技能

来源

Hellfrosted/agents

星标0

分支0

更新时间2026年6月4日 00:35

文件资源管理器

2 个文件

SKILL.md

readonly

name

evo-end-to-end

description

Evo End To End

Turn a fuzzy improvement request into an Evo-ready experiment, then run Evo only after approval.

Target Evo release line: evo-hq/evo v0.4.4 or newer.

Flow

Clarify only what local repo inspection cannot answer. Use $grill-me when the problem, constraints, non-goals, success metric, or forbidden changes are unclear.
Use $grill-with-docs only for unclear terminology, ownership, or ADR/CONTEXT decisions.
Use $improve-codebase-architecture only when architecture or testability must be decomposed before choosing a metric.
Inspect the repo for manifests, tests, docs, benchmarks, and likely editable scope.
Verify Evo is installable and version-aligned before running plugin skills:
- Run evo --version; it must report evo-hq-cli, not the unrelated evo SLAM package.
- Compare the CLI version to the installed evo plugin skill version. If the plugin skill is tagged evo_version: 0.4.4, evo --version must print exactly evo-hq-cli 0.4.4.
- Do not auto-install or upgrade the CLI unless the user explicitly asks.
Draft an Evo brief with: goal, metric, baseline command/data, pass gate, editable scope, read-only context, forbidden changes, host, backend, runtime/env needs, budget, stall rule, merge rule.
Stop for approval before Evo edits production behavior, APIs, persistence, auth/security, tests, packaging, dependencies, deployment, user-visible behavior, dependency manifests, or remote/cloud infrastructure.
Run $evo discover with the approved brief; optimize only after discovery records a baseline.
If using an existing Evo workspace from before v0.4.0, silently migrate host metadata with evo host show; if it prints <not set>, run evo host set codex.
For remote, pool, or non-default runtime setup, configure Evo explicitly before optimizing:

Local default: worktree backend.
Faster local reuse: pool backend with a fixed workspace list.
Remote: configure the provider first, using Evo's infra-setup guidance for Modal, E2B, Daytona, AWS, Azure, SSH, manual, or custom providers.
Runtime commands/env belong in evo config runtime ... and evo env ..., not hard-coded into benchmark scripts.

Run evo run <exp_id> --check when wiring risk is material and a non-mutating validation is available.
Before optimizing, resolve run behavior the same way $evo optimize does:

autonomous defaults on unless the user or stored defaults turn it off.
subagents-only defaults on unless the user or stored defaults turn it off.
Arm the resolved state with evo autonomous on|off and evo subagents-only on|off.

Run $evo optimize subagents=<n> budget=<n> stall=<n> within the approved scope. Size the round from benchmark/backend resources first; use the presets below only as fallbacks or user-facing shorthand.
Use evo direct "<text>" only for mid-run steering of an already-running Evo session. If an agent receives an [EVO DIRECTIVE id=...] banner, it must run evo ack <event_id> before proceeding.
Manually review Evo output before merging behavior, API, persistence, security, packaging, deployment, or user-visible changes.

Optimize Presets

Use these as fallbacks for $evo optimize when the benchmark resource profile is unknown and the user gave no exact values:

tiny: subagents=3 budget=5 stall=2
small: subagents=3 budget=8 stall=3
medium: subagents=4 budget=10 stall=4
big: subagents=5 budget=14 stall=5
huge: subagents=8 budget=20 stall=6

Default to medium only when the benchmark is light, isolated, and no better sizing signal is available. Reduce subagents to 1 for exclusive resources such as a GPU, fixed port, shared database, or serialized fixture. Cap pool runs at the pool slot count. Use tiny or small when the editable scope is narrow or risky. Use big or huge only when the metric is stable, the baseline is repeatable, and the approved scope can absorb broader exploration.

Brief Template

Goal:
Metric:
Baseline:
Gate:
Editable scope:
Read-only context:
Forbidden changes:
Host: codex
Backend: worktree | pool | remote:<provider>
Runtime/env:
Budget:
Stall rule:
Autonomous: on | off
Subagents-only: on | off
Optimize preset:
Merge rule:

Evo v0.4.4 Notes

evo init --host <claude-code|codex|cursor|opencode|openclaw|hermes|pi|generic> is required for new workspaces. For this skill on Codex, use codex.
New workspaces default to the pareto_per_task frontier strategy instead of argmax. Existing workspaces keep their configured strategy.
Local execution has two backends: worktree and pool. Pool mode is useful when setup is expensive, but it changes commit discipline because warm workspace state should stay out of commits.
Pool mode defaults to commit_strategy=tracked-only; subagents must git add new source files and pass --i-staged-new-files yes to evo run.
Remote experiments can run through Modal, E2B, Daytona, AWS, Azure, SSH, manual, or custom providers. Treat provider SDK installation, credentials, and cloud allocation as explicit user-approved setup.
In remote mode, subagent briefs must state the experiment id explicitly and require --exp-id <id> on every evo bash/read/write/edit/glob/grep command.
Backend provider credentials and benchmark runtime environment are separate concerns. Configure benchmark variables with evo env, and do not copy secrets into worktrees or docs.
evo run <exp_id> --check validates benchmark/gate wiring without committing, evaluating, or consuming retry budget.
$evo optimize defaults to autonomous, subagents-only operation. The user can override either explicitly, or via evo config get default-autonomous, evo defaults get autonomous, evo config get default-subagents-only, and evo defaults get subagents-only.
evo direct "<text>" --wait expects an agent to acknowledge delivered directives with evo ack <event_id>.
Use evo gc to clean worktrees, pool slots, and remote sandboxes across configured backends.
Use evo config show, evo config backend show, evo config runtime show, and evo env show to inspect setup before changing it.

同仓库更多 Skills

同仓库

confidence-loop

Hellfrosted/agents

Stress-tests a strategy, plan, implementation approach, or answer until remaining uncertainty is explicit and evidence-backed, then reports a 0-100 confidence score. Use when the user asks whether Codex is 100% confident, asks to find loopholes or failure modes, requests a confidence audit, says to run a loop until the strategy is factually solid, or invokes confidence-loop hard for up to four sub-agent second opinions.

2026-05-090

icm-recall

Hellfrosted/agents

Searches ICM persistent memory from Codex. Use when the user invokes `icm-recall`, asks to recall or search ICM memory, asks what ICM remembers, or provides a query that should be looked up in long-term memory.

2026-05-090

icm-remember

Hellfrosted/agents

Stores information in ICM persistent memory from Codex. Use when the user invokes `icm-remember`, asks to remember something, asks to store/save a note in ICM, or provides durable context that should be kept for future sessions.

2026-05-090

icm

Hellfrosted/agents

Provides the ICM (Infinite Context Memory) persistent-memory rule for Codex. Use when persistent memory should be consulted or maintained for a task, when the user asks to use ICM generally, or when durable context such as user preferences, resolved errors, architecture decisions, or significant project progress should persist across Codex sessions.

2026-05-090

tuck

Hellfrosted/agents

Tucks completed local changes into focused, reviewable git commits with mandatory read-only per-file review subagents before staging. Use when the user explicitly invokes $tuck; do not use for ordinary, small, or natural-language commit requests.

2026-05-090

codex-goal-control

Hellfrosted/agents

Opens the local Codex goal panel and manages the current Codex thread goal through bundled helper scripts. Use when the user asks to open the goal panel, inspect/set/pause/resume/complete/clear a thread goal, manage goals without the CLI, or make the local goal web app target the current thread.

2026-05-090

来源

Hellfrosted

Hellfrosted/agents

打开 GitHub 仓库查看创作者相关仓库

安装命令

下载

在 Manus 中运行

适用职业SOC

软件开发工程师计算机与数学类职业15-1252L4

name

evo-end-to-end

description

Evo End To End

Turn a fuzzy improvement request into an Evo-ready experiment, then run Evo only after approval.

Target Evo release line: evo-hq/evo v0.4.4 or newer.

Flow

Clarify only what local repo inspection cannot answer. Use $grill-me when the problem, constraints, non-goals, success metric, or forbidden changes are unclear.
Use $grill-with-docs only for unclear terminology, ownership, or ADR/CONTEXT decisions.
Use $improve-codebase-architecture only when architecture or testability must be decomposed before choosing a metric.
Inspect the repo for manifests, tests, docs, benchmarks, and likely editable scope.
Verify Evo is installable and version-aligned before running plugin skills:
- Run evo --version; it must report evo-hq-cli, not the unrelated evo SLAM package.
- Compare the CLI version to the installed evo plugin skill version. If the plugin skill is tagged evo_version: 0.4.4, evo --version must print exactly evo-hq-cli 0.4.4.
- Do not auto-install or upgrade the CLI unless the user explicitly asks.
Draft an Evo brief with: goal, metric, baseline command/data, pass gate, editable scope, read-only context, forbidden changes, host, backend, runtime/env needs, budget, stall rule, merge rule.
Stop for approval before Evo edits production behavior, APIs, persistence, auth/security, tests, packaging, dependencies, deployment, user-visible behavior, dependency manifests, or remote/cloud infrastructure.
Run $evo discover with the approved brief; optimize only after discovery records a baseline.
If using an existing Evo workspace from before v0.4.0, silently migrate host metadata with evo host show; if it prints <not set>, run evo host set codex.
For remote, pool, or non-default runtime setup, configure Evo explicitly before optimizing:

Local default: worktree backend.
Faster local reuse: pool backend with a fixed workspace list.
Remote: configure the provider first, using Evo's infra-setup guidance for Modal, E2B, Daytona, AWS, Azure, SSH, manual, or custom providers.
Runtime commands/env belong in evo config runtime ... and evo env ..., not hard-coded into benchmark scripts.

Run evo run <exp_id> --check when wiring risk is material and a non-mutating validation is available.
Before optimizing, resolve run behavior the same way $evo optimize does:

autonomous defaults on unless the user or stored defaults turn it off.
subagents-only defaults on unless the user or stored defaults turn it off.
Arm the resolved state with evo autonomous on|off and evo subagents-only on|off.

Run $evo optimize subagents=<n> budget=<n> stall=<n> within the approved scope. Size the round from benchmark/backend resources first; use the presets below only as fallbacks or user-facing shorthand.
Use evo direct "<text>" only for mid-run steering of an already-running Evo session. If an agent receives an [EVO DIRECTIVE id=...] banner, it must run evo ack <event_id> before proceeding.
Manually review Evo output before merging behavior, API, persistence, security, packaging, deployment, or user-visible changes.

Optimize Presets

Use these as fallbacks for $evo optimize when the benchmark resource profile is unknown and the user gave no exact values:

tiny: subagents=3 budget=5 stall=2
small: subagents=3 budget=8 stall=3
medium: subagents=4 budget=10 stall=4
big: subagents=5 budget=14 stall=5
huge: subagents=8 budget=20 stall=6

Brief Template

Goal:
Metric:
Baseline:
Gate:
Editable scope:
Read-only context:
Forbidden changes:
Host: codex
Backend: worktree | pool | remote:<provider>
Runtime/env:
Budget:
Stall rule:
Autonomous: on | off
Subagents-only: on | off
Optimize preset:
Merge rule:

Evo v0.4.4 Notes

evo init --host <claude-code|codex|cursor|opencode|openclaw|hermes|pi|generic> is required for new workspaces. For this skill on Codex, use codex.
New workspaces default to the pareto_per_task frontier strategy instead of argmax. Existing workspaces keep their configured strategy.
Local execution has two backends: worktree and pool. Pool mode is useful when setup is expensive, but it changes commit discipline because warm workspace state should stay out of commits.
Pool mode defaults to commit_strategy=tracked-only; subagents must git add new source files and pass --i-staged-new-files yes to evo run.
Remote experiments can run through Modal, E2B, Daytona, AWS, Azure, SSH, manual, or custom providers. Treat provider SDK installation, credentials, and cloud allocation as explicit user-approved setup.
In remote mode, subagent briefs must state the experiment id explicitly and require --exp-id <id> on every evo bash/read/write/edit/glob/grep command.
Backend provider credentials and benchmark runtime environment are separate concerns. Configure benchmark variables with evo env, and do not copy secrets into worktrees or docs.
evo run <exp_id> --check validates benchmark/gate wiring without committing, evaluating, or consuming retry budget.
$evo optimize defaults to autonomous, subagents-only operation. The user can override either explicitly, or via evo config get default-autonomous, evo defaults get autonomous, evo config get default-subagents-only, and evo defaults get subagents-only.
evo direct "<text>" --wait expects an agent to acknowledge delivered directives with evo ack <event_id>.
Use evo gc to clean worktrees, pool slots, and remote sandboxes across configured backends.
Use evo config show, evo config backend show, evo config runtime show, and evo env show to inspect setup before changing it.