Run any Skill in Manus with one click

$pwd:

wagf-quickstart

Name: Wagf Quickstart
Author: WenyuChiou

// First-time WAGF setup walkthrough — environment check, smoke test, first experiment, and handoff to the four lifecycle skills. Use when the user says "I just cloned WAGF", "set up WAGF", "first WAGF run", "I'm new to this", "where do I start with WAGF", or opens a Claude Code session in a freshly-cloned WAGF repo without a clear task.

Run Skill in Manus

$ git log --oneline --stat

stars:0

forks:0

updated:April 26, 2026 at 02:10

File Explorer

6 files

SKILL.md

readonly

related-skills.json

same repository

llm-agent-audit-trace-analyzer.md

from "WenyuChiou/WAGF"

Turn raw WAGF audit traces (household_governance_audit.csv + raw/*.jsonl) into paper-ready governance metrics — IBR, EHE, rejection taxonomy, retry outcomes, model-condition comparisons. Use when the user says "analyze these traces", "compute governance metrics", "summarize rejection and retry outcomes", or hands over a results directory and asks "what does this say".

2026-05-260

wagf-domain-builder.md

from "WenyuChiou/WAGF"

Walk a researcher (PhD, collaborator, lab-mate) through building their first single-agent WAGF domain — from "I have a research question + maybe an external model" to "I have a working WAGF experiment producing audit traces." Conducts a structured S0-S7 interview, invokes `broker.tools.scaffold_domain` at S4, guides 4 surgical edits in S5, and runs `broker.tools.validate_prompt` after every change. Hands off to `wagf-coupling-designer` for any coupling work and to `wagf-experiment-designer` / `abm-reproducibility-checker` once the domain runs green. Use when the user says "I want to build a WAGF model for <my domain>", "help me set up a new domain", "I'm new to WAGF and have a research question", or "scaffold a domain from scratch".

2026-05-260

model-coupling-contract-checker.md

from "WenyuChiou/WAGF"

Verify the contract between WAGF/ABM agents and an external model (flood, hydrology, irrigation, seismic, catastrophe) — units, time steps, state mutation direction, feedback-loop double-counting. Use when the user says "check ABM-model coupling", "audit feedback loop", "verify units between WAGF and X model", or asks to confirm an external-model integration is safe.

2026-05-170

wagf-coupling-designer.md

from "WenyuChiou/WAGF"

Walk a researcher through designing the LLM↔external-model interface — decision flow IN, observation flow OUT — for a single-agent WAGF domain. Emits a coupling contract, a working mock adapter, and a pattern-specific real-model adapter scaffold so the WAGF side can be built and smoke-tested BEFORE the real model is wired in. Use when the user says "I want to couple my LLM agents to <my simulator>", "help me design the WAGF↔X interface", "scaffold the external model adapter", "draft a coupling contract", "I have a Python / R / CSV-based model and want WAGF to drive it". Sister skill to `model-coupling-contract-checker` (which AUDITS existing contracts; this one DESIGNS new ones).

2026-05-170

abm-reproducibility-checker.md

from "WenyuChiou/WAGF"

Verify another researcher can reproduce a WAGF experiment — manifests, seeds, configs, runnable commands, data provenance vs git blame, figure-script outputs match references. Use when the user says "audit reproducibility", "prepare for submission", "check this experiment folder", or any time a results directory needs a pre-publication integrity sweep.

2026-04-260

wagf-experiment-designer.md

from "WenyuChiou/WAGF"

Turn a WAGF research question into a reproducible experiment matrix (model × governance × seed × metric × artefact path). Use when the user says "design an experiment", "plan an ablation", "compare strict vs disabled", "set up cross-model evaluation", or wants a runnable matrix written to .research/.

2026-04-260

package.json

"author": "WenyuChiou"

"repository": "WenyuChiou/WAGF"

View GitHub Repository View Creator Repositories

$ install --global

$ download --local

Run Skill in Manus

$ useful --forSOC

Network and Computer Systems AdministratorsComputer and Mathematical Occupations15-1244L4

name	wagf-quickstart
description	First-time WAGF setup walkthrough — environment check, smoke test, first experiment, and handoff to the four lifecycle skills. Use when the user says "I just cloned WAGF", "set up WAGF", "first WAGF run", "I'm new to this", "where do I start with WAGF", or opens a Claude Code session in a freshly-cloned WAGF repo without a clear task.

WAGF Quickstart — First-Time Setup

The single entry-point skill for a researcher who just cloned this repo and wants to be productive within ~40 minutes. This skill itself does very little; it orchestrates four phases and hands off to the existing lifecycle skills at the right moments.

Researcher progress is tracked in .wagf-quickstart-status.json so a returning user resumes from the last completed phase rather than starting over.

When to Use

Load this skill the moment a user opens a session in a WAGF repo and says any of:

"I just cloned WAGF, help me set this up."
"Set up WAGF."
"First WAGF run."
"I'm new to this — where do I start?"
"Help me get WAGF working."

Also load this skill PROACTIVELY when:

The session opens in a directory containing broker/INVARIANTS.md AND no .research/ folder AND no .wagf-quickstart-status.json — this signals a fresh clone with no work done yet.

Do NOT use this skill for:

An experienced WAGF user who knows what they want → load the matching lifecycle skill directly (wagf-experiment-designer, llm-agent-audit-trace-analyzer, model-coupling-contract-checker, abm-reproducibility-checker).
Generic "how do I run a Python project" questions → defer to whatever the user actually wants.

The four-phase workflow

Each phase has an entry condition, a runnable artefact, and a clear hand-off rule. Skip nothing; refuse to advance if the prior phase did not produce its expected output.

Phase 1 — Environment check (~3 min)

Goal: confirm the user can run any WAGF script at all.

Run:

python .claude/skills/wagf-quickstart/scripts/check_env.py

What it validates:

Python ≥ 3.10
pip install -r requirements.txt resolves (probe-mode; doesn't install)
Ollama daemon is reachable at http://localhost:11434
At least one supported model is pulled. gemma3:4b is the recommended onboarding model (small enough to run on a CPU laptop in a pinch, large enough to produce sensible WAGF behaviour).

Outputs:

Verdict: GREEN / YELLOW / RED.
Numbered remediation list if YELLOW or RED (e.g., "1. Install Ollama: https://ollama.com/download"); "2. Run: ollama pull gemma3:4b").
Records phase-1 status to .wagf-quickstart-status.json.

Refuse to advance if RED. Show the remediation list and wait for the user to fix.

Phase 2 — Smoke test (~5 min)

Goal: confirm the broker pipeline produces meaningful behavioural diff (governed vs ungoverned).

Run (in order):

python examples/quickstart/01_barebone.py
python examples/quickstart/02_governance.py

01_barebone.py runs without governance; 02_governance.py runs the same scenario through the full broker pipeline. Both are short (~5 agents × 2 years).

What to show the user after the runs:

A 3-line diff: skill distribution from 01_ vs 02_ (typically governance reduces the increase / inaction rate).
Confirm examples/quickstart/results/<run_dir>/simulation_log.csv was written.

Refuse to advance if either script crashes or simulation_log.csv is missing/empty. Inspect stdout for the actual error and recommend the fix from references/troubleshooting.md.

Record phase-2 status with the path of each run dir, so Phase 4's analyser can find them later.

Phase 3 — First real experiment (~30 min planning + multi-hour run)

Goal: turn the user's research question into a runnable matrix.

This phase is delegated to the wagf-experiment-designer skill. Do NOT replicate its workflow here. Instead:

Ask the user (in plain language): "What question do you want to answer with WAGF? E.g., 'Does governance reduce hallucinated actions in flood adaptation?'"
Pre-fill defaults appropriate for a first-time user:
- Domain: irrigation or flood (ask).
- Models: 1 model, default gemma3:4b (already pulled in Phase 1).
- Conditions: [strict, disabled].
- Seeds: 3 (smallest meaningful paired-t; bump to 5 later).
- Time horizon: domain default (irrigation 42 yr, flood 10 yr).
Hand off explicitly: "Now loading wagf-experiment-designer with these defaults — confirm or override."
After the matrix is written, wagf-experiment-designer produces .research/wagf_experiment_matrix.yml, .research/metrics_plan.md, and .research/run_plan.md. The researcher then runs the bat in run_plan.md.

Refuse to advance if the user has not chosen a domain or has not provided a research question (even one sentence).

Record phase-3 status with the .research/ artefact paths.

Phase 4 — First analysis (~5 min after run completes)

Goal: turn the run output into paper-ready governance metrics.

This phase is delegated to the llm-agent-audit-trace-analyzer skill.

Wait for simulation_log.csv to appear in the run output dir (the user runs the bat themselves; this skill does not babysit the LLM run).
When the user returns and says "the run is done" or "analyse the results", hand off: "Now loading llm-agent-audit-trace-analyzer to compute governance metrics."
After the analyser writes analysis/governance_summary.md, point the user at the next two skills:
- model-coupling-contract-checker — if they added an external model.
- abm-reproducibility-checker — before they submit a paper.

Record phase-4 status with the analysis/ artefact paths.

State file

The skill maintains .wagf-quickstart-status.json at the repo root:

{
  "phase_1_env": {"completed": true, "verdict": "GREEN", "ts": "..."},
  "phase_2_smoke": {"completed": true, "ts": "...", "run_dirs": ["..."]},
  "phase_3_experiment": {"completed": false, "matrix_path": null},
  "phase_4_analysis": {"completed": false, "report_path": null}
}

When invoked, the skill reads this file FIRST and resumes at the first incomplete phase rather than starting over.

Refusal Protocol

The skill MUST refuse to:

Pretend the environment is fine when check_env.py reports RED or any required tool is missing. Show the remediation list and stop.
Skip phases. No Phase 3 without successful Phase 2; no Phase 4 without a real simulation_log.csv from Phase 3.
Auto-fill the user's research question. Phase 3 input must come from the user, even if the rest is defaulted.
Continue if the smoke test produces zero output. The broker is broken in that case; debug rather than mask.
Replace the lifecycle skills' content. Always hand off via explicit "now load " cues.

Bundled resources

references/environment_check.md — full env-check rubric with per-platform install instructions.
references/smoke_test_recipe.md — the exact commands to run Phase 2 and how to interpret the output diff.
references/first_experiment_template.md — the pre-filled defaults for Phase 3 (per domain).
references/troubleshooting.md — common failures (Ollama not running, model not pulled, Python version mismatch) with one-line fixes.
scripts/check_env.py — runnable environment validator.

Acceptance criteria

The skill is ready when:

A user typing "I just cloned WAGF, help me set this up" in a freshly-cloned repo gets a useful response within 5 messages (env check + smoke test + Phase 3 prompt).
check_env.py returns GREEN on this repo (Python 3.14, Ollama with gemma3:4b, gemma4:e2b present).
examples/quickstart/01_barebone.py and 02_governance.py both produce a simulation_log.csv after Phase 2.
The hand-off to wagf-experiment-designer produces a valid .research/wagf_experiment_matrix.yml with the user's research question filled in (not auto-invented).
.wagf-quickstart-status.json is created and updated correctly across phases.

wagf-quickstart

More from this repository

More from this repository

WAGF Quickstart — First-Time Setup

When to Use

The four-phase workflow

Phase 1 — Environment check (~3 min)

Phase 2 — Smoke test (~5 min)

Phase 3 — First real experiment (~30 min planning + multi-hour run)

Phase 4 — First analysis (~5 min after run completes)

State file

Refusal Protocol

Bundled resources

Acceptance criteria

WAGF Quickstart — First-Time Setup

When to Use

The four-phase workflow

Phase 1 — Environment check (~3 min)

Phase 2 — Smoke test (~5 min)

Phase 3 — First real experiment (~30 min planning + multi-hour run)

Phase 4 — First analysis (~5 min after run completes)

State file

Refusal Protocol

Bundled resources

Acceptance criteria