Run any Skill in Manus with one click

Get Started

experiment-running

Stars378

Forks27

UpdatedJune 16, 2026 at 15:03

Execute the plan by dispatching fresh subagents per task, monitoring status, and collecting results

Installation

Install with Codex or Claude Copy this prompt, paste it into Codex, Claude, or another assistant, and let it review the skill page and install it for you.

Run Skill in Manus

Source

yogsoth-ai

yogsoth-ai/de-anthropocentric-research-engine

View GitHub Repository View Creator Repositories

Download

Run Skill in Manus

Related occupationsSOC

Based on SOC occupation classification

Project Management SpecialistsBusiness and Financial Operations Occupations·SOC 13-1082

File Explorer

2 files

SKILL.md

readonly

name	experiment-running
description	Execute the plan by dispatching fresh subagents per task, monitoring status, and collecting results
version	1.0.0
category	experiment-execution
type	strategy
sops	["implementer-dispatch","execution-monitoring","result-collection"]
tactics	["subagent-execution-loop","checkpoint-and-recover"]
dependencies	{"sops":["execution-monitoring","implementer-dispatch","result-collection","ponytail:ponytail","ponytail:ponytail-debt","superpowers:executing-plans","superpowers:finishing-a-development-branch","superpowers:subagent-driven-development","superpowers:using-git-worktrees","superpowers:verification-before-completion"],"tactics":["checkpoint-and-recover","subagent-execution-loop"]}

Strategy: Experiment Running

Key Question: How to execute?

Methodology

实现的执行直接交给 superpowers 现成链路，不再自写 fresh-subagent / 三段 review。 plan（上游 plan-writing 产出）就绪后，本策略是一串决策节点：

Skill load superpowers:using-git-worktrees —— 建隔离工作区 + 跑 baseline 测试。
Skill load ponytail:ponytail —— 进入写代码前开启精简反射（边写边 lean）。
二选一执行引擎：
- superpowers:executing-plans（本 session 批量执行，带 checkpoint），或
- superpowers:subagent-driven-development（每任务 fresh 子代理 + 两段 review）。多数实验实现用前者；任务高度独立、需强隔离时用后者（详见 subagent-execution-loop）。
Skill load superpowers:verification-before-completion —— claim 完成前先跑证明命令。
Skill load ponytail:ponytail-debt —— 收尾前收集 ponytail: 欠债标记。
Skill load superpowers:finishing-a-development-branch —— 验证测试 → merge/PR/branch。

DARE 原生的 checkpoint-and-recover（高风险操作前存档）与 subagent-execution-loop （执行循环细节）作为 tactic 仍在编排内保留。

Execution Flow

[plan from plan-writing]
    → superpowers:using-git-worktrees   (隔离区 + baseline)
    → ponytail:ponytail                 (精简反射开启)
    → superpowers:executing-plans  或  superpowers:subagent-driven-development
    → superpowers:verification-before-completion  (claim 前验证)
    → ponytail:ponytail-debt            (收欠债)
    → superpowers:finishing-a-development-branch  (收尾)

Budget Gate

Step	Max Budget	Output
Per-task execution	50% of execution budget / N tasks	Task result
Monitoring overhead	5% of execution budget	Status log
Retry budget	10% of execution budget	Unblocked tasks

Key Decisions

执行引擎二选一：批量 → executing-plans；强隔离/逐任务审查 → subagent-driven-development
model 选择 / retry / 并行：交给所选 superpowers 引擎，不在本策略重定义
Abort：>50% 关键路径 BLOCKED 时中止并报告（DARE 排程层判据）

Available Tactics

Optional, no fixed order; the final leaf is always a sop.

Tactic	When to use
checkpoint-and-recover	Checkpoint state before risky operations, detect anomalies, and recover gracefully
subagent-execution-loop	Orchestrate task execution via fresh subagents with dispatch, monitoring, and result collection

Available SOPs

Optional, no fixed order; the final leaf is always a sop.

SOP	When to use
execution-monitoring	Monitor execution progress, detect anomalies, and report status
implementer-dispatch	Dispatch execution subagent — select model by complexity, construct prompt with full task context
ponytail:ponytail	Lazy-senior reflex: simplest thing that holds; mark every deliberate shortcut
ponytail:ponytail-debt	Harvest ponytail debt markers before finishing
result-collection	Collect experiment outputs — metrics, logs, artifacts — into structured result set
superpowers:executing-plans	Execute the plan task-by-task in the current session with checkpoints
superpowers:finishing-a-development-branch	Verify tests -> merge / PR / branch cleanup
superpowers:subagent-driven-development	Execute the plan via a fresh subagent per task with two-stage review
superpowers:using-git-worktrees	Create an isolated worktree + run baseline tests before implementing
superpowers:verification-before-completion	Run the proving command and confirm output before claiming done

More from this repository

same repository

isomorphism-falsification

yogsoth-ai/de-anthropocentric-research-engine

Strategy: Attack an isomorphism claim by demanding an explicit structure-preserving map and trying to break it. Targets any multi-language claim of the form 'X ≅ Y ≅ … across N mathematical languages'. Forces the claim to either earn the word 'isomorphism' or be demoted to 'analogy'. Methods: category theory (functor/natural-iso criteria), model theory, Lakatos monster-barring.

2026-06-21378

adversarial-debate-truthseeking

yogsoth-ai/de-anthropocentric-research-engine

Strategy: Dialectic engine retuned for truth-seeking, not survival. A defender steelmans a claim into its MOST falsifiable form, a critic attacks to refute it, a judge classifies the exchange into BROKEN/CORROBORATED/UNFALSIFIABLE — the judge does NOT pick a winner or score persuasiveness. Methods: Irving debate (repurposed), Toulmin argumentation, Mayo severe testing.

2026-06-21378

circular-validation-audit

yogsoth-ai/de-anthropocentric-research-engine

Strategy: Run BEFORE building any validator (sandbox/simulation/benchmark). Builds a non-circularity matrix of theory-claim × validator-assumption to detect when a validator would 'confirm' a theory only because it was built on the theory's own premises. A circular validator's PASS carries zero evidential weight. Methods: Cartwright nomological machines, Winsberg sanctioning-of-simulations, tautology detection.

2026-06-21378

elegance-trap-probe

yogsoth-ai/de-anthropocentric-research-engine

Strategy: Attack a beautiful unified result on the suspicion that its beauty is the bug. Distinguishes EARNED simplicity (forbids/predicts/subsumes) from DECORATIVE simplicity (re-describes/relabels/accommodates). Directly serves the Occam aesthetic by making it a falsifiable bar, not a vibe. Methods: Sober parsimony-as-evidence, MDL, Meehl risky prediction, accommodation-vs-prediction.

2026-06-21378

falsification-first-stress-test

yogsoth-ai/de-anthropocentric-research-engine

Campaign: Truth-seeking adversarial validation for scientific research artifacts (NOT publication defense). Core question: Where have we fooled ourselves, and is each load-bearing claim even falsifiable? Win-condition is INVERTED from survival/resilience to active refutation. Methods: Popper falsificationism, Lakatos Proofs and Refutations, Mayo severe testing, Platt strong inference.

2026-06-21378

independent-convergence-audit

yogsoth-ai/de-anthropocentric-research-engine

Strategy: Attack the evidential weight of an 'independent convergence' claim. When N reasoning paths all reach the same conclusion, the confidence boost is real only if the paths were actually independent. Measures shared-prior / shared-blindspot contamination and corrects the over-counted confidence. Methods: Bayesian agreement-as-evidence, correlated-error analysis, jury theorem assumptions.

2026-06-21378

Strategy: Experiment Running

Key Question: How to execute?

Methodology

实现的执行直接交给 superpowers 现成链路，不再自写 fresh-subagent / 三段 review。 plan（上游 plan-writing 产出）就绪后，本策略是一串决策节点：

Skill load superpowers:using-git-worktrees —— 建隔离工作区 + 跑 baseline 测试。

Skill load ponytail:ponytail —— 进入写代码前开启精简反射（边写边 lean）。

二选一执行引擎：

superpowers:executing-plans（本 session 批量执行，带 checkpoint），或
superpowers:subagent-driven-development（每任务 fresh 子代理 + 两段 review）。多数实验实现用前者；任务高度独立、需强隔离时用后者（详见 subagent-execution-loop）。

Skill load superpowers:verification-before-completion —— claim 完成前先跑证明命令。

Skill load ponytail:ponytail-debt —— 收尾前收集 ponytail: 欠债标记。

Skill load superpowers:finishing-a-development-branch —— 验证测试 → merge/PR/branch。

DARE 原生的 checkpoint-and-recover（高风险操作前存档）与 subagent-execution-loop （执行循环细节）作为 tactic 仍在编排内保留。

Execution Flow

[plan from plan-writing] → superpowers:using-git-worktrees (隔离区 + baseline) → ponytail:ponytail (精简反射开启) → superpowers:executing-plans 或 superpowers:subagent-driven-development → superpowers:verification-before-completion (claim 前验证) → ponytail:ponytail-debt (收欠债) → superpowers:finishing-a-development-branch (收尾)

Budget Gate

Step

Max Budget

Output

Per-task execution

50% of execution budget / N tasks

Task result

Monitoring overhead

5% of execution budget

Status log

Retry budget

10% of execution budget

Unblocked tasks

Key Decisions

执行引擎二选一：批量 → executing-plans；强隔离/逐任务审查 → subagent-driven-development

model 选择 / retry / 并行：交给所选 superpowers 引擎，不在本策略重定义

Abort：>50% 关键路径 BLOCKED 时中止并报告（DARE 排程层判据）

Available Tactics

Optional, no fixed order; the final leaf is always a sop.

Tactic

When to use

checkpoint-and-recover

Checkpoint state before risky operations, detect anomalies, and recover gracefully

subagent-execution-loop

Orchestrate task execution via fresh subagents with dispatch, monitoring, and result collection

Available SOPs

Optional, no fixed order; the final leaf is always a sop.

SOP

When to use

execution-monitoring

Monitor execution progress, detect anomalies, and report status

implementer-dispatch

Dispatch execution subagent — select model by complexity, construct prompt with full task context

ponytail:ponytail

Lazy-senior reflex: simplest thing that holds; mark every deliberate shortcut

ponytail:ponytail-debt

Harvest ponytail debt markers before finishing

result-collection

Collect experiment outputs — metrics, logs, artifacts — into structured result set

superpowers:executing-plans

Execute the plan task-by-task in the current session with checkpoints

superpowers:finishing-a-development-branch

Verify tests -> merge / PR / branch cleanup

superpowers:subagent-driven-development

Execute the plan via a fresh subagent per task with two-stage review

superpowers:using-git-worktrees

Create an isolated worktree + run baseline tests before implementing

superpowers:verification-before-completion

Run the proving command and confirm output before claiming done