| name | ascent-research |
| description | Full ascent-research CLI — incremental, resumable research sessions driven from a Claude Code or Codex instance. Orchestrate postagent (HTTP API) + actionbook browser (CDP) + local file ingest to build reproducible sessions with a persistent wiki layer, autonomous LLM loop, and editorial HTML reports. Covers all command surfaces — online fetch (add / batch / route), local ingest (add-local), session lifecycle (new / list / status / resume / close / rm / series / fork via --from), autonomous loop, wiki knowledge layer (list / show / rm / query / lint), user-editable SCHEMA.md, and renderers (synthesize / report / coverage / diff). Use for any "build a reproducible report on topic X" or "investigate source tree Y" or "compare technologies A and B with citations" request. |
| triggers | research, deep dive, deep-dive, investigate, analyze topic, survey, literature review, compare frameworks, review source, source tree, build knowledge base, library analysis, codebase analysis, ascent-research, ascent-research loop, session report |
| force_tool_turns | 15 |
ascent-research — Full CLI Skill
Build reproducible, figure-rich research reports with a persistent per-session wiki. One CLI, three input modes (HTTP API / browser fallback / local file tree), three output surfaces (narrative report / entity wiki / event log), autonomous loop optional.
Installation (MANDATORY — run before any playbook)
Three binaries must be on PATH: ascent-research (the CLI itself), postagent (HTTP ingest — npm), actionbook (browser fallback — npm).
cargo install ascent-research --features "provider-claude provider-codex"
npm install -g postagent @actionbookdev/cli
Doctor check — run this FIRST in every new session:
ascent-research --json doctor || { echo "INSTALL_REQUIRED"; exit 1; }
If the doctor check exits non-zero, STOP and run the install block above. Do NOT narrate the workflow in chat as a substitute — the CLI is the only thing that actually produces report.html.
For any playbook that will fetch online sources through add, batch, or route, run tool smoke as well:
ascent-research --json doctor --tool-smoke
If a required tool-smoke check fails, STOP and surface the failing tool check. Optional warnings such as postagent_public_dry_run should be treated as routing guidance: prefer browser or local ingest if public postagent fetches are not accepted by the installed postagent contract.
When online hands fail, preserve fallback provenance instead of silently
switching tools. If you cache a web page/source note locally, ingest it with:
ascent-research add-local ./cache --slug <slug> \
--original-url https://example.com/source \
--origin-tool curl \
--origin-note "actionbook daemon unavailable; cached with curl"
Rules for fallback sources:
- Use source notes only as a last resort when postagent/browser cannot fetch.
- Every source note must list the original URLs and explain why direct ingest failed.
- In the final reply, disclose which hand failed and which fallback path was used.
- For legal, medical, financial, current, or compliance claims, do not present a
source note as high-confidence primary evidence unless the report labels the
conclusion limited-confidence or later re-fetches the original URLs.
For any playbook that will call loop, wiki query, or synthesize --bilingual, also run a live provider smoke check first:
ascent-research --json doctor --provider-smoke --provider codex
ascent-research --json doctor --provider-smoke --provider claude
If provider smoke fails, STOP and surface the failing provider check. Do not start the research loop or bilingual synthesis until the provider is healthy.
Data home: all sessions, user preset overrides, wiki pages, and rendered reports live under ~/.actionbook/ascent-research/. Override with ACTIONBOOK_RESEARCH_HOME for sandboxing. Upgraders from v0.2: the legacy ~/.actionbook/research/ tree is read-only — new writes land in the v0.3 canonical root.
V2 Browser Backend Setup (one-time, REQUIRED for add/batch on JS pages)
ascent-research defaults to the V2 actionbook MCP backend
(ACTIONBOOK_BACKEND=v2-mcp). Four things must be set up once — without
all four, every browser-based fetch will fail.
1. Actionbook Chrome extension + dedicated profile (recommended)
Install Actionbook Cloud (v2)
in Chrome, v0.2.0-alpha.4 or later.
Strongly recommend a dedicated Chrome profile, for two reasons:
- chrome.debugger conflict: if actionbook shares a profile with a
password manager / AI sidebar / translation extension that injects
content frames into every page, Chrome will refuse to attach with
Cannot access a chrome-extension:// URL of different extension, and
every V2 attach call fails.
- Cookie scoping per profile: fetching logged-in content (X / GitHub
private / SaaS dashboards) requires actionbook to be in the same
profile that is logged in. A mixed-profile setup = unlogged-in fetches.
How to set up: Chrome top-right avatar → Add → new profile named
"Actionbook Research" → install only actionbook in this profile → log
into every research-target site here. macOS one-shot launcher:
open -na "Google Chrome" --args --profile-directory="Profile X"
If a user reports "V2 doesn't fetch / always EXTENSION_OFFLINE / always
attach fails", your first question is whether the profile is dedicated.
2. ACTIONBOOK_API_KEY permanent export
echo 'export ACTIONBOOK_API_KEY=ak_xxxxxxxxxxxxxxxx' > ~/.actionbook.env
chmod 600 ~/.actionbook.env
echo '[ -f ~/.actionbook.env ] && source ~/.actionbook.env' >> ~/.zshrc
⚠️ Never paste the token into chat / IDE prompt / git commit.
ascent-research never echoes the token in stderr / logs.
3. Claude Code permission allow rule
Claude Code's default classifier blocks certain "advanced" actionbook
patterns (XHR interception / screenshot return / sensitive cookie probes),
even when these are explicitly recommended by actionbook's official
manuals. One settings rule whitelists the whole MCP tool permanently:
{
"permissions": {
"allow": [
"mcp__*__actionbook"
]
}
}
Whitelist actionbook narrowly; other tools (reading Chrome profile,
calling arbitrary external APIs) remain protected by the classifier.
Do not use --dangerously-skip-permissions to bypass globally.
4. postagent secrets — one-time register per site
postagent does not read shell env / gh CLI / macOS Keychain — it has its
own secret store:
postagent auth github
postagent auth x
postagent auth openai
postagent auth status
Without a GitHub token, every github.com/<owner>/<repo> URL in the
tech preset hits unauthenticated GitHub API and will almost certainly
hit rate-limit 403.
When to use V2 vs postagent vs add-local
Research target
│
├── Public API (api.github.com / api.openai.com / etc.)
│ → preset auto-routes to postagent → fast (< 500ms)
│ Requires: postagent auth configured (step 4 above)
│
├── Static / semi-static page (official blog / docs / changelog)
│ → preset fallback routes to V2 browser → ~3-5s
│ Requires: dedicated profile + actionbook online
│
├── Heavy SPA + public content (HN / Reddit feed / x.com timeline)
│ → from inside Claude Code, call the actionbook MCP tool directly
│ (Claude writes its own run-code script + controls timeout)
│ → write captured content to local .md files
│ → `ascent-research add-local <dir>` injects them into the session
│ Requires: step 3 allow rule (otherwise classifier blocks XHR interception)
│
└── Login-required SaaS (Linear / Notion / GitHub private repo)
→ same as above, AND the actionbook profile must be logged into that site
→ cannot use postagent (no public API token, or too expensive)
Pitfalls to avoid
document.cookie is not a reliable login check. X, GitHub, and
most SaaS set their session token (e.g. X's auth_token, GitHub's
user_session) as HttpOnly cookies — JavaScript cannot read them.
To verify "is the user logged in", issue a request to an
authenticated endpoint and check status 200 vs 401, do not grep
document.cookie.
- Don't expect 3 s
networkidle on heavy SPAs. GitHub PR pages and
x.com search results take 5-10 s to hydrate. The V2 inline JS now uses
a three-stage wait (DOMContentLoaded + networkidle + body-content
poll, ~16 s worst case) to handle this; if you write a custom
run-code for an even heavier site, mirror the same pattern.
add-local accepts UTF-8 text now (including dense CJK / Japanese
/ Korean / emoji). Pre-2026-05-17 the text detector mis-flagged these
as binary; that's fixed in this build.
- GitHub PR / repo URLs route through postagent, not V2 browser —
if
postagent auth github is not configured, you'll get
fetch_failed even though V2 looks healthy. Step 4 above prevents
this.
- x.com / twitter.com URLs use a dedicated
XTweet runcode flavor
since v0.4.1. The generic networkidle + body-text poll returned only
X's left-nav chrome (~160 bytes) because X's tweet <article> is
GraphQL-hydrated AFTER networkidle, and the page virtualizes (unmounts
scrolled-off articles). XTweet uses waitForSelector('article[data-testid="tweet"]', …) instead, plus snapshot-collect across scrolls keyed by tweetId so
the main tweet survives virtualization. Thread + media (tweet attach
- link card + video poster) are returned as markdown
 so
rich-html renders them as <img> and Obsidian / VS Code preview shows
them. Three explicit preset rules — x-tweet-status (3-seg path),
x-search-live (/search), x-profile (1-seg) — route through this
flavor; everything else x.com routes via fallback (also XTweet by host
sniff).
- Default
--min-bytes is 500 — short single tweets need --min-bytes 200.
A typical single tweet returns ~400-500 bytes (text + author + metrics);
multi-article threads naturally exceed 500. If a known-good single X
URL gets SMELL_REJECTED reject_reason=empty_content, pass
--min-bytes 200. Future spec may auto-adjust per x-tweet-status
kind.
Mandatory Tail (MANDATORY — finish is preferred)
ascent-research loop does NOT render report.html. Before you declare the task done, prefer the single completion protocol:
ascent-research finish <slug> [--bilingual] [--open]
finish runs coverage -> synthesize -> audit and returns non-zero unless all three stages pass. If you need to debug a stage, run the inspection commands individually:
ascent-research coverage <slug>
ascent-research synthesize <slug> [--bilingual] [--open]
ascent-research --json audit <slug>
Rules:
- Always run
finish after loop (or after manual add / write work) before replying.
- If the user asks for Chinese, bilingual output, or 中英文 output, you MUST run
finish <slug> --bilingual; plain finish renders English-only HTML.
- If
finish fails at coverage, do NOT claim the report is complete. Surface the blockers from report_ready_blockers and keep working or ask the user what to relax.
- If
finish fails at synthesize, do NOT claim report.html exists. Surface the synthesize error and keep working or ask the user what to relax.
- If
finish fails at audit, do NOT claim the session is验收-complete. Surface audit_blockers and keep working or ask the user what to relax.
- If
--bilingual reports bilingual_skipped, do NOT claim Chinese output is complete. Fix the provider and rerun finish <slug> --bilingual.
- In the final reply, include the exact
<session>/report.html path and audit_status.
Chinese/bilingual generation:
ascent-research finish <slug> --bilingual --open
ASR_BILINGUAL_PROVIDER=codex ascent-research finish <slug> --bilingual --open
--bilingual produces an English/中文 toggle in report.html by injecting <p class="tr-zh"> siblings. Without --bilingual, the report intentionally has no Chinese paragraphs.
Dynamic Fact Topics Require Fact Check
For live, sports, news, current roster, current price, latest version, market, legal, medical, financial, or other time-sensitive factual research, create the session with --tag fact-check.
ascent-research new "<topic>" --slug <slug> --preset tech --tag fact-check
For sports/current-roster tasks, prefer the sports preset and seed at least one roster source URL from an official roster page, Basketball-Reference team season page, or ESPN roster page before synthesis:
ascent-research new "<topic>" --slug <slug> --preset sports --tag fact-check
ascent-research add "https://www.nba.com/<team>/roster" --slug <slug>
ascent-research add "https://www.basketball-reference.com/teams/<TEAM>/<YEAR>.html" --slug <slug>
When --tag fact-check is present, the loop must emit fact_check actions for concrete person, team, date, number, price, roster, standing, release version, or current-status claims before the final report depends on them. If evidence is stale or conflicting, emit fact_check with outcome:"uncertain" and continue fetching instead of writing a confident assertion.
Before final synthesis on these sessions, run:
ascent-research coverage <slug> --json
Verify fact_check_required=true and fact_checks_total >= 1. If report_ready_blockers contains fact_checks_total or fact_check_invalid_sources, do NOT run synthesize; continue fetching accepted sources and emitting fact_check actions.
GitHub Trust / Fake-Star Audit
For “fake GitHub stars”, repository trust, suspicious star growth, or investor/developer due diligence tasks, start with the deterministic audit hand before asking the LLM to interpret anything. Do not ask the model to decide whether stars are fake from vibes.
ascent-research github-audit <owner>/<repo> --depth timeline --sample 500 --out audit.json --html audit.html
ascent-research new "<owner>/<repo> GitHub trust audit" --slug <owner>-<repo>-trust --preset github-trust --tag fact-check
ascent-research add-local audit.json --slug <owner>-<repo>-trust
ascent-research loop <owner>-<repo>-trust --provider codex --iterations 8
ascent-research finish <owner>-<repo>-trust --open
Rules:
github-audit outputs trust score, risk score, band, confidence, reasons, and evidence; never call a repo “fake” or “real” deterministically.
- If the user asks whether a repo looks trustworthy, show
audit.html first. It is the deterministic scorecard: trust score, risk score, confidence, metric dashboard, reasons, and evidence gaps.
- Use
--depth repo for anonymous quick checks; use stargazers or timeline only when postagent can resolve $POSTAGENT.GITHUB.TOKEN.
- The audit JSON is the evidence artifact. Add it with
add-local before loop so the report cites the deterministic signals instead of re-deriving them.
--preset github-trust is for contextual follow-up sources; it does not replace github-audit scoring.
- Finish still goes through the normal completion protocol:
coverage -> synthesize -> audit.
Mental Model
One research project = one session under ~/.actionbook/ascent-research/<slug>/
session.toml metadata (topic, preset, tags, parent slug)
SCHEMA.md user-editable loop guidance
session.md narrative — Overview / 01·WHY / 02·HOW / ... report spine
session.jsonl append-only event log — authoritative machine state
raw/ every accepted source, one file
wiki/<slug>.md persistent entity + concept + analysis pages
diagrams/*.svg hand-drawn figures, inlined in the HTML report
report.html rendered editorial output with wiki TOC + bilingual toggle
Three-layer control flow:
LLM orchestrator (this skill / active-research / custom agent)
| [CLI ONLY — observability > terseness]
v
research CLI ──> route presets ──> postagent (HTTP)
──> actionbook browser (CDP)
──> local file reader (file://)
| [infra-enforced smell test on every fetched body]
v
session.md + session.jsonl + wiki/ + diagrams/ + report.html
Everything downstream of the CLI is stateless between turns — the agent addresses state only by --slug. The CLI holds the event log and the preset registry.
Command Surface (all of it)
Session lifecycle
ascent-research new <topic> --slug <s> [--preset tech] [--tag t]... [--from <parent>] [--force]
ascent-research list [--tag t] [--tree]
ascent-research show <slug>
ascent-research status [<slug>]
ascent-research resume <slug>
ascent-research close [<slug>]
ascent-research rm <slug> [--force]
new seeds SCHEMA.md with a starter template and sets the session active.
--from <parent> forks a session — parent's Overview becomes the new Context, tags are inherited. Wiki does NOT auto-fork (by design).
--force on new overwrites an existing slug; on rm skips the dry-run confirmation.
list --tree renders parent→child hierarchy as ASCII.
Ingest — online
ascent-research add <url> [--slug <s>] [--timeout <ms>] [--readable | --no-readable]
[--min-bytes N] [--on-short-body {reject|warn}]
[--frame-id <u32>] [--run-code-args <json>] [--reseed]
[--actionbook-backend {v2-mcp|v1-cli}]
ascent-research batch <url>... [--slug <s>] [--concurrency 1..16] [--timeout <ms>] [--readable | --no-readable]
[--frame-id <u32>] [--run-code-args <json>] [--reseed]
[--actionbook-backend {v2-mcp|v1-cli}]
ascent-research sources [<slug>] [--rejected]
ascent-research route <url> [--rules <file>] [--preset <name>] [--prefer browser]
add routes via preset (tech.toml default) — HN/arXiv/GitHub hit postagent directly, other hosts fall through to actionbook browser.
- Default per-source timeout is 90 s since v0.4.0 (V2 server inner run-code default is 60 s; the extra 30 s covers edge + transport overhead). Override with
--timeout <ms>.
--frame-id targets a specific iframe for V2 run-code. --run-code-args injects structured JSON arguments into the script.
--reseed re-probes the V2 catalog even when a wiki entry already exists (the default pre-fetch step skips already-seeded sites).
--actionbook-backend v1-cli forces the legacy subprocess path for this single call (useful when the V2 extension is offline). Otherwise the binary-wide default (env ACTIONBOOK_BACKEND, default v2-mcp) applies.
batch fetches in parallel workers; each call runs the smell test independently.
route prints the decision without fetching — useful for debugging preset rules.
- Smell test fails →
SMELL_REJECTED with a reason (too_short, wrong_url, browser_chrome_error, etc.). The URL attempt is always logged in jsonl.
- A composite route rule (one rule, N parts merged under
composite-v1 schema) is a single source; if any part rejects, the source is rejected and composite_failed_part in the jsonl event names the failing label.
Ingest — local (v3)
ascent-research add-local <path> [--slug <s>] [--glob '...']... [--max-file-bytes N] [--max-total-bytes N]
[--original-url URL] [--origin-tool TOOL] [--origin-note TEXT]
<path> can be file://abs/path, /abs/path, ./rel/path, ~/rel/path, or a bare path.
--glob is repeatable; prefix ! excludes. Default glob matches all files.
- Caps enforced at walk time: default 256 KB per file, 2 MB per walk. Direct
add file:///… calls get an 8 MB fetch-stage backstop.
- Binary files (null-byte probe) are rejected; only text survives the gate.
- Each accepted file becomes an independent source with
file:// URL — same pipeline as online add, goes through smell test, appears in sources and coverage.
--original-url / --origin-tool / --origin-note record fallback provenance in session.jsonl when a local file is standing in for a failed online hand.
GitHub trust audit
ascent-research github-audit <owner>/<repo> [--depth repo|stargazers|timeline] [--sample N] [--out audit.json] [--html audit.html]
ascent-research github-audit https://github.com/<owner>/<repo> --depth timeline --sample 500 --out audit.json --html audit.html
- Produces a deterministic trust evidence envelope: repo ratios, sampled stargazer profile signals, timeline burst signals, trust score, risk score/band/confidence/reasons/evidence.
- Default depth is
stargazers, default sample is 200. Use repo when GitHub token auth is unavailable.
stargazers and timeline use postagent with Authorization: Bearer $POSTAGENT.GITHUB.TOKEN; no raw token should appear in stdout/stderr/session files.
--out writes the full JSON envelope for add-local ingestion into a follow-up --preset github-trust report.
--html writes the deterministic scorecard HTML. This is not the generic research report template and does not call an LLM.
Autonomous loop (feature: autoresearch)
ascent-research loop [<slug>] --provider {fake|claude|codex} [--iterations N]
[--max-actions M] [--dry-run] [--fake-responses 'r1;r2;...']
- Default iteration budget is 8, default action budget is 20 — both raisable.
fake provider replays scripted JSON turns; used by tests and manual debug runs.
claude provider uses cc-sdk (requires --features provider-claude at build time).
codex provider spawns codex app-server (requires --features provider-codex).
opencode-go provider (new in v0.4.2) hits the OpenCode Go $10/mo subscription via OpenAI- or Anthropic-compatible HTTP (requires --features provider-opencode-go). Required env: OPENCODE_API_KEY, ASR_OPENCODE_MODEL (pick from the OpenCode Go catalog; no default). Optional: ASR_OPENCODE_PROTOCOL (openai default | anthropic), ASR_OPENCODE_TEMPERATURE (default 0.2), ASR_OPENCODE_MAX_TOKENS (default 16384), ASR_OPENCODE_TIMEOUT_MS (default 120000, clamped [5000, 600000]). Useful when Claude/ChatGPT subscriptions are out of reach (cost / region / payment).
- Loop reads
SCHEMA.md each turn; user edits via schema edit take effect on the next iteration.
- Action types the loop accepts:
write_plan, write_overview, write_aside, write_section, write_diagram, note_diagram_needed, digest_source, fact_check, add, batch, write_wiki_page, append_wiki_page, actionbook_search, actionbook_manual, actionbook_run_code.
- New in v0.4.0 — the three
actionbook_* actions are dispatch arms for the V2 MCP backend. Per-iteration caps: actionbook_search ≤ 5, actionbook_manual ≤ 5, actionbook_run_code ≤ 3. Long outputs get a […truncated to <N>KB…] marker. Each emits an ActionbookCalled jsonl event.
- Termination reasons:
report_ready, iterations_exhausted, max_actions_exhausted, provider_done, provider_unavailable, diverged (same coverage signature 3 turns in a row).
User-editable loop guidance (v3)
ascent-research schema show [--slug <s>]
ascent-research schema edit [--slug <s>] # opens $EDITOR
- Starter template has five sections: Goal / Wiki conventions / What to emphasize / What to deprioritize / House style.
- Edits that actually change the body emit a
SchemaUpdated jsonl event; no-op edits (e.g. :q) don't.
- Loop strips HTML comments before injecting, so placeholder hints in the starter don't leak into the prompt.
Wiki layer (v3)
ascent-research wiki list [--slug <s>]
ascent-research wiki show <page> [--slug <s>]
ascent-research wiki rm <page> [--slug <s>] [--force]
ascent-research wiki query "<question>" [--slug <s>] [--save-as <slug>]
[--format prose|comparison|table]
[--provider fake|claude|codex]
ascent-research wiki lint [--slug <s>] [--stale-days N]
- Page slug rules:
[a-z0-9_-]{1,64}.
- Frontmatter fields:
kind (entity / concept / source-summary / comparison / analysis), sources (URL list), related (slug list), updated (date).
- Cross-links use
[[slug]]; the renderer resolves existing pages to <a href="#wiki-<slug>">, flags broken targets as <span class="wiki-broken">.
wiki query retrieval: token-overlap against page bodies + slug names, plus one-hop BFS along outbound [[slug]] links from the top seeds. Top-N default 5, capped at 2×N after BFS.
wiki query --save-as <slug> persists the answer as a kind: analysis page with sources: [wiki:a, wiki:b, ...] frontmatter citing the retrieved pages.
wiki lint checks: orphans (no inbound link), broken outbound [[...]], stale updated: dates, missing cross-refs (two pages share a source but don't [[ref]] each other), kind conflicts (slug variants with mismatched kind:). Non-blocking — diagnostic only.
Output / QA
ascent-research synthesize [<slug>] [--no-render] [--open] [--bilingual] [--pdf] [--pdf-output <path>]
ascent-research report <slug> --format rich-html|brief-md [--open | --no-open] [--stdout] [--output <path>]
ascent-research series <tag> [--open]
ascent-research coverage [<slug>]
ascent-research diff [<slug>] [--unused-only]
synthesize is the full path: renders report.json + inline-SVG + wiki TOC + sources list + optional bilingual (--bilingual calls Claude to inject <p class="tr-zh"> siblings).
- PDF export is opt-in only:
synthesize <slug> --pdf converts the rendered report.html to <session>/report.pdf through the local backend. It prefers isolated Playwright Chromium/headless_shell and keeps the HTML on the user's machine. If no safe local Chromium is installed, run npx playwright install chromium or set ASR_PDF_CHROME_BIN. Do not auto-launch the user's desktop Google Chrome; only opt into that fallback with ASR_PDF_ALLOW_SYSTEM_CHROME=1 if explicitly requested. --pdf-output <path> overrides the output and implies --pdf.
report --format brief-md dumps a lean markdown digest — useful for PR descriptions or quick sharing.
series <tag> renders an HTML index for every session carrying that tag.
coverage returns metrics + report_ready_blockers (array of human-readable reasons). If report_ready: true, the session is done. For --tag fact-check sessions, inspect fact_check_required, fact_checks_total, and fact_check_invalid_sources.
diff surfaces two sets: unused (accepted but never cited) and hallucinated (cited URLs that weren't accepted). --unused-only trims to the first set.
Global flags (apply to every command)
--json machine-readable envelope (ok/data/error/meta)
-v / --verbose stderr verbosity
--no-color disable ANSI
--help clap-generated help; also `ascent-research help`
Envelope shape:
{
"ok": true,
"command": "ascent-research add",
"context": {"session": "tokio-v3", "url": "..."},
"data": {"...": "..."},
"error": null,
"meta": {"duration_ms": 1820, "warnings": []}
}
On failure, error.code is machine-readable — never parse error.message for routing decisions.
Scenario Playbooks
A. Survey a technology topic from public sources
RBIN=~/.cargo/bin/ascent-research
$RBIN new "state-space models vs attention 2026" --slug ssm-vs-attn --preset tech
$RBIN batch \
https://arxiv.org/abs/2111.00396 \
https://arxiv.org/abs/2312.00752 \
https://huggingface.co/papers/2111.00396 \
--concurrency 4
$RBIN loop ssm-vs-attn --provider claude --iterations 10
$RBIN wiki query "what breaks when you scale S6 past 10B params?" \
--format comparison --save-as s6-scaling
$RBIN synthesize ssm-vs-attn --bilingual --open
B. Deep-dive a Rust library's source tree
$RBIN new "tokio internals 2026" --slug tokio-v3 --preset tech
$RBIN schema edit
$RBIN add-local ~/tokio/tokio/src/runtime/scheduler \
--glob '**/*.rs' --glob '!**/tests/**' \
--max-file-bytes 65536 --max-total-bytes 524288
$RBIN add-local ~/tokio/tokio/src/runtime/task \
--glob '**/*.rs' --glob '!**/tests/**'
$RBIN loop tokio-v3 --provider claude --iterations 12 --max-actions 40
$RBIN wiki query "how does the scheduler balance work across threads?" \
--save-as scheduler-balancing
$RBIN wiki lint --slug tokio-v3
$RBIN synthesize tokio-v3 --open
C. Paper + companion codebase
$RBIN new "S4 state space model" --slug s4 --preset tech
$RBIN add https://arxiv.org/abs/2111.00396
$RBIN add https://github.com/HazyResearch/state-spaces
$RBIN add-local ~/state-spaces/src --glob '**/*.py' --max-file-bytes 65536
$RBIN loop s4 --provider claude --iterations 8
$RBIN synthesize s4 --bilingual --open
D. Compare two frameworks with a dedicated analysis page
$RBIN new "tokio vs async-std scheduling 2026" --slug cmp-tokio-async-std
$RBIN batch https://github.com/tokio-rs/tokio \
https://github.com/async-rs/async-std
$RBIN loop cmp-tokio-async-std --provider claude --iterations 10
$RBIN wiki query "scheduling strategy: work-stealing vs single-queue" \
--format comparison --save-as cmp-scheduling
$RBIN synthesize cmp-tokio-async-std --open
E. Fork a session, refocus
$RBIN new "tokio task system isolation" --slug tokio-tasks --from tokio-v3 \
--tag rust-deep-dive --tag task-system
$RBIN schema edit --slug tokio-tasks
$RBIN loop tokio-tasks --provider claude --iterations 8
F. Resume a stale session
$RBIN list --tag rust-deep-dive
$RBIN resume tokio-v3
$RBIN status
$RBIN schema edit
$RBIN loop tokio-v3 --provider claude --iterations 6
G. Series index for many sibling sessions
for topic in axum actix hyper rocket; do
$RBIN new "$topic internals 2026" --slug "${topic}-deep" --tag rust-web
$RBIN add "https://github.com/tokio-rs/$topic"
$RBIN loop "${topic}-deep" --provider claude --iterations 6
done
$RBIN series rust-web --open
H. Manual curation (no LLM)
$RBIN new "skim axum routing" --slug axum-skim --preset tech
$RBIN add-local ~/axum/axum/src/routing --glob '**/*.rs'
$RBIN sources axum-skim
$RBIN synthesize axum-skim --open
I. Debug a preset rule
$RBIN route "https://some.obscure.host/foo" --prefer browser
$RBIN add "https://some.obscure.host/foo"
$RBIN sources --rejected
Loop Contracts (what the autoresearch prompts enforce)
These rules are encoded in autoresearch/executor.rs and surfaced to the agent as non-negotiable:
- First-iteration contract. A fresh session accepts only
write_plan. Other actions are rejected with plan_required.
- Every accepted source must be digested.
sources_unused > 0 is a report_ready blocker. The agent cannot skip a URL the user added.
- Dynamic facts need explicit fact checks. Sessions tagged
fact-check require at least one FactChecked event. Use fact_check for live/sports/news/current roster/current price/latest-version claims and keep working while fact_checks_total or fact_check_invalid_sources appears in blockers.
- Wiki-first for durable entities. Source summaries, recurring concepts, library components →
write_wiki_page. Numbered sections cite [[slug]] pages.
- Figure-rich contract. Target ≥ 1 SVG per numbered section. Every
 requires a matching write_diagram same-or-earlier turn; every write_diagram should have a body reference. The user prompt nags about unresolved references and orphan SVG files at the top of each turn.
write_section preserves figures. If the current section body references  and your new body omits it, the CLI re-appends the reference automatically — agents never silently orphan figures even if they try.
- No plan re-authoring. The
## Plan block is pinned at the top of the prompt from iteration 2 onward. Emitting write_plan after iter 1 is wasted tokens unless you're materially revising.
- SVG safety.
write_diagram bodies must start with <svg, declare xmlns="http://www.w3.org/2000/svg", and must NOT contain <script>, <foreignObject>, on*= handlers, or javascript: URLs. Max 3 write_diagram per turn. Violations become DiagramRejected events with a reason code.
Output Shape
<session>/report.html
<lang-switch EN|ZH> (sticky top-right)
<eyebrow>Research report
<h1>{{topic}}
<sub>Session: code + tags + preset
<aside> (optional epigraph)
<numbered sections>
## 01 · WHY / 02 · HOW / 03 · WHAT ...
with inline <svg> figures + <p class="tr-zh"> siblings in --bilingual mode
<wiki-root>
<h2>WIKI · Entity & concept pages
<nav class="wiki-toc"> (pill grid, kind + slug + updated)
<section id="wiki-<slug>"> (one per page, with ↑index back-link)
<orphan-diagrams> (safety net — stays empty in normal ops)
<sources> (auto-generated from session.jsonl)
Error-Code Triage
| Code | Meaning | Fix |
|---|
NO_ACTIVE_SESSION | No active session set | ascent-research new or ascent-research resume <slug> |
SESSION_NOT_FOUND | Slug doesn't exist | ascent-research list to enumerate |
SLUG_EXISTS | new collision | --force to overwrite, or pick fresh slug |
PARENT_NOT_FOUND | --from <x> unknown | Create parent first |
PATH_NOT_FOUND | add-local path missing | Check ~ expansion, use absolute path |
actionbook_browser_* failed in doctor | Browser hand unhealthy | Run actionbook browser doctor --start --json or actionbook browser restart --json when available; otherwise use add-local --original-url ... and disclose fallback |
WALK_FAILED | Dir walk error | Usually permissions; try ls -la |
SMELL_REJECTED | Fetched body failed quality gate | See sources --rejected for reason; try --readable for browser fetches |
PROVIDER_NOT_AVAILABLE | Build lacks LLM feature | cargo build --features "autoresearch provider-claude provider-codex" |
PROVIDER_CALL_FAILED | LLM call reached the wire but errored | Retry or check auth / rate limit |
WIKI_EMPTY | wiki query with no pages | Run loop first |
WIKI_PAGE_NOT_FOUND | Bad slug on wiki show/rm | wiki list |
INVALID_ARGUMENT | Bad flag value | Check envelope message for specifics |
IO_ERROR | FS failure | Disk space, permissions |
FEATURE_DISABLED | Command requires disabled feature | Rebuild with feature |
DIAGRAM_OUT_OF_BOUNDS | write_diagram path escapes dir | path must be bare filename ending .svg |
DIAGRAM_REJECTED | SVG safety failure | See warnings for specific rule |
Build Targets
cargo build -p research --release
cargo build -p research --release --features autoresearch
cargo build -p research --release --features "autoresearch provider-claude provider-codex"
cargo build -p research --release --features "autoresearch provider-codex"
Environment Variables
| Var | Effect |
|---|
ACTIONBOOK_RESEARCH_HOME | Override ~/.actionbook/ascent-research/ (tests use this) |
ACTIONBOOK_BIN | Path to actionbook binary (default: from $PATH) |
ACTIONBOOK_BROWSER_SESSION | Reuse an existing browser session when a human is using the Chrome profile |
JSON_UI_BIN | Path to json-ui for legacy synthesize path |
ACTIONBOOK_RESEARCH_ADD_TIMEOUT_MS | Default per-URL fetch timeout |
ACTIONBOOK_FAKE_QUERY_RESPONSE | Scripted wiki query answer for the fake provider (test-only) |
RESEARCH_NO_OPEN, SYNTHESIZE_NO_OPEN, CI | Suppress --open side effects |
EDITOR | schema edit target |
When NOT to Use This Skill
- Quick one-shot web lookup → use a browser-only skill like
active-research that doesn't need a persistent session.
- Live-data dashboards (market prices, infra monitoring) → the session model assumes sources are stable enough to survive a loop run; use Grafana / log-search for real-time.
- Interactive coding / debugging → not a research task; normal Edit/Bash tooling is faster.
- One-file reads → if
cat foo.rs | head -50 answers the question, don't spin up a session.
Quality Heuristics
- Every wiki page cites at least one URL in its frontmatter.
- Numbered sections cite
[[wiki-slug]] pages rather than restating the wiki content.
- Hand-drawn SVG figures, not screenshots or PNGs — readable in any browser, zero external assets.
wiki lint reports 0 orphans and ≤ 3 broken links before calling the session done.
coverage returns report_ready: true with no blockers.
diff --unused-only is empty — no accepted source went uncited.