Run any Skill in Manus with one click

ask

Name: Ask
Author: grahama1970

Zero cognitive-load learning and querying skill. Learn about a topic or persona (e.g., "Lisa Feldman Barrett") by discovering, ingesting, and extracting knowledge — or ask questions against what's been learned. Supports multi-hour deep learning with progress tracking, persona profiles, and nightly incremental updates. Uses Federated Taxonomy for multi-hop graph traversal across knowledge domains. Composes: dogpile, discover-books, ingest-youtube, fetcher, extractor, memory, taxonomy, task-monitor.

Run Skill in Manus

Skill metadata

Stars3

Forks1

UpdatedMay 30, 2026 at 14:39

File Explorer

100 files

SKILL.md

readonly

ask

Zero cognitive-load learning and querying interface. Eight modes:

Learn Mode — Discover, ingest, and extract knowledge about a topic or persona
Ask Mode — Query accumulated knowledge with Federated Taxonomy multi-hop traversal
Auto-Learn Mode — Ask a question; if no knowledge exists, automatically learn then answer
Nightly Mode — Scheduled incremental updates to persona knowledge bases
OS Mode — Learn about and query embry-os internals, skills, packages, and runtime health
Deep Review Mode — High-reasoning, read-only review with review.md and review.json
CAE Gap Review Mode — Evidence-case-backed QRA review with bounded reviewer/judge rerouting
Image Generation Mode — Generate image artifacts through /scillm /v1/images/generations

Literal Runtime Contract

When a human names $ask, /ask, or asks to use the ask skill, the project agent must use this skill's ./run.sh runtime unless the human explicitly asks for a fallback or the runtime is unavailable.

Do not replace $ask with:

spawn_agent
an informal subagent prompt
a plain model call
a hand-written reviewer summary
a web search
a local-only critique that bypasses ask artifacts

For review requests, pass the complete target artifact through the documented ask mode instead of summarizing it. Examples:

Use --deep-review --deep-review-target <path> for Web-GPT-style prompt, schema, code, plan, or artifact reviews.
Use --parallel-review for independent reviewer fanout.
Use --roundtable only when the user asks for persona deliberation.
Use --cae-gap-review only for evidence-case-backed CAE/QRA gap review.

For $ask --deep-review, the target must be readable review material. Do not pass compressed archives such as .zip, .tar, .tar.gz, .tgz, .7z, or binary bundles directly as --deep-review-target; the runtime will treat them as opaque bytes and the reviewer cannot inspect the contents. If the only available artifact is a compressed review bundle, first extract or render the specific relevant files into readable text files, then pass an explicit space-separated list of those files as the deep-review target. Directory targets are metadata-only in deep-review target resolution, so do not pass a directory and assume its contents will be inspected.

Proof of a real $ask run is the ask artifact set, not an assistant summary. Return the relevant artifact paths, such as:

.request.json
.status.json
.events.jsonl
review.md
review.json
mode-specific generated artifacts

If the runtime is unavailable, report that directly and ask before substituting spawn_agent or another fallback.

Release readiness is evidence-based, not implied. /ask uses ask.config.yml, config doctor, live sanity reports, and Docker preflights to say what is ready, what needs user attention, and what is not established.

Every ask call also writes runtime artifacts so long oracle/review runs are inspectable without guessing whether the runner is blocked in retrieval, persona routing, oracle synthesis, or artifact verification. Direct scillm oracle calls use SSE streaming and record oracle_scillm_call_started, oracle_scillm_stream_progress, oracle_scillm_call_finished, and oracle_scillm_call_failed events so project agents can distinguish active model work from hard-deadline failure. The same runtime protocol is available for learn, nightly, os learn, os ask, and os health.

Browser Oracle Backends — routing (read first)

Project agents choose a browser oracle by task type, then confirm Chrome vs Cursor.

Team defaults (which backend for which work)

Work type	Use	Site / transport
Code — review bundles, architecture, implementation, tests, tech-lead loops	`$ask webgpt`	`chatgpt.com` in Chrome via surf-cli; `--no-activate`
Prose — papers, voice, clarity, long-form writing	`$ask webkimi`	`kimi.com` in Chrome
Design — mockups, UX, visual hierarchy, design critique	`$ask webgemini`	`gemini.google.com` in Chrome
Research — fresh facts, citations, current events, web synthesis	`$ask webperplexity`	Perplexity one-shot (no standing review tab)
Inside Cursor — ChatGPT in embedded Browser (self-contained; no external Chrome)	`$ask cursor-browser`	Cursor Browser + viewId + cursor-browser-bridge

Rules of thumb:

In Cursor IDE with ChatGPT open in the embedded Browser → prefer cursor-browser, not webgpt.
Need background Chrome while working in another window → webgpt + --webgpt-tab-id or --webgpt-project + --no-activate (via surf).
Do not use webperplexity for multi-round review loops on the same thread.
Do not pass Chrome tab ids to --cursor-browser-view-id (different namespace from viewId).

Shared limitations (all browser backends)

Tabs cannot read local paths you only mention in the prompt. Use a concatenated .md/.txt bundle (inlined under ## Attached files) or a zip (≤5 files, webgpt only).
Proof is the ask artifact set (.ask_artifacts/runs/<ask_id>/), not an assistant paraphrase.
Normal work: $ask … (orchestration). Raw $surf … only when debugging transport.

Backend reference

Shorthand	`--oracle-backend`	Multi-turn tab	Tab binding
`$ask webgpt`	`webgpt`	Yes	`--webgpt-project`, `--webgpt-tab-id`
`$ask webgemini`	`webgemini`	Yes	`--gemini-tab-id`, `--gemini-url`
`$ask webkimi`	`webkimi`	Yes	`--kimi-tab-id`, `--kimi-url`
`$ask webperplexity`	`webperplexity`	No (one-shot)	—
`$ask cursor-browser`	`cursor-browser`	Yes	`--cursor-browser-project`, `--cursor-browser-view-id`

Detailed contracts: WebGPT and Cursor Browser sections below; Gemini/Kimi/Perplexity share WebGPT-style bundle rules and Chrome tab resolution via surf tab.list (fail-closed when 0 or >1 tabs).

WebGPT Oracle Backend

--oracle-backend webgpt (or the $ask webgpt … shorthand) routes oracle synthesis through the user's already-authenticated ChatGPT tab in Chrome via surf webgpt.submit --no-activate. The tab is controlled in the background; it never foregrounds.

Boundary contract:

$ask owns WebGPT orchestration: routing, run artifacts, rate limiting, project tab binding, multi-turn review loops, and oracle/review semantics.
$surf owns browser transport and proof: controlled tab selection, webgpt.submit, sentinel injection, completion waiting, clean/raw/meta outputs, screenshots, and no-activate focus preservation.
$scillm owns direct model-provider calls through localhost:4001, including GPT-5.5 high reasoning, Gemini, Claude, Chutes, OpenCode Go, image generation, streaming, batching, JSON repair, fallback, and usage logging.
Do not add WebGPT browser-tab access to $scillm. WebGPT is not a provider API call; it depends on the user's live authenticated browser session and the $surf sentinel proof contract. Callers that want WebGPT must use $ask webgpt or --oracle-backend webgpt.
Do not call $surf directly for ask/oracle/review work unless debugging the transport layer. Normal callers should use $ask so request/status/events artifacts, WebGPT rate limits, tab bindings, file auto-attachment, and reviewer loop semantics are preserved.

# Auto-resolve tab id (when exactly one chatgpt.com tab is open)
./run.sh ask "to perform the review on /tmp/code-runner-reliability-review/review-bundle.md" \
  --oracle --oracle-backend webgpt

# Equivalent shorthand
./run.sh ask webgpt to perform the review on /tmp/code-runner-reliability-review/review-bundle.md

# Explicit tab id (from the Tab ID Viewer Chrome extension)
./run.sh ask "summarise the review bundle" \
  --oracle --oracle-backend webgpt --webgpt-tab-id 837343564

# Resolve by ChatGPT conversation URL
./run.sh ask "summarise the review bundle" \
  --oracle --oracle-backend webgpt \
  --webgpt-url "https://chatgpt.com/c/6a0097ff-e7e0-83ea-93c2-3a6b88e2a67f"

# Autonomous mode: let surf pick or create a background ChatGPT tab.
./run.sh ask webgpt summarise the review bundle --webgpt-create-tab

# Per-project tab binding — first call auto-creates and persists; subsequent
# calls reuse the same tab so the conversation context survives across days.
./run.sh ask webgpt summarise the review bundle --webgpt-project code-runner-review

Per-project tab bindings

--webgpt-project NAME binds the call to a per-project ChatGPT tab whose id is persisted at ~/.pi/webgpt-projects/<name>.json. Two workflows:

Autonomous (default — ephemeral tasks):

# First call: auto-create a background tab and remember it under "pr-1234".
./run.sh ask webgpt review this PR diff --webgpt-project pr-1234

# Subsequent calls reuse the same tab; ChatGPT preserves conversation state.
./run.sh ask webgpt and now check the migration path --webgpt-project pr-1234

If the bound tab gets closed, the runtime silently creates a new one and re-binds.

Manual (long-lived projects you want to babysit):

# Open exactly the ChatGPT tab you want, copy its id from the Tab ID Viewer
# Chrome extension, then bind once:
./run.sh webgpt-project bind code-runner-review --tab-id 837343543 \
  --url "https://chatgpt.com/c/<conv-id>"

# All future calls for that project use this exact tab — never replaced.
./run.sh ask webgpt continue the review --webgpt-project code-runner-review

Manual bindings are recorded with bound_manually: true. If the tab is later closed, the runtime raises a clear ProjectBindingError asking the human to re-bind explicitly — it will not silently swap in a new tab.

Management commands (./run.sh webgpt-project ...):

bind NAME --tab-id ID [--url URL] [--manual|--auto] — create or update a binding
list [--json] [--verify] — show all projects; --verify checks tab still open in Chrome
show NAME [--json] — full state dump for one project
verify NAME [--json] — re-check whether the bound tab is still open
unbind NAME — remove a binding (does not close the Chrome tab)
gc [--days 30] — remove stale auto-bindings (manual bindings are never collected)

Behavior:

Tab resolution. Priority: --webgpt-tab-id → --webgpt-url → --webgpt-project NAME (with a valid binding) → --webgpt-create-tab → auto-resolve from surf tab.list filtered to chatgpt.com. Auto-resolve fails closed when 0 or >1 candidates exist — the call refuses to run rather than guess. When the project agent hits this, it must ask the human to either: (a) open exactly one ChatGPT tab so auto-resolve picks it, (b) provide a tab id from the Tab ID Viewer extension to pass through --webgpt-tab-id, (c) re-invoke with --webgpt-create-tab for the agent to acquire a tab autonomously (surf picks the most-recent existing chatgpt.com tab without foregrounding, or creates a fresh background one if none are open), or (d) pass --webgpt-project NAME to bind this call to a persistent per-project tab (see "Per-project tab bindings" below). The resolved tab id surfaces in oracle_model_served: webgpt:<id> so the agent can pass it explicitly to follow-up rounds.
File auto-attachment. Absolute paths embedded in the question (e.g. /tmp/foo.md, ~/notes.md) are read from disk and inlined under ## Attached files in the prompt. Truncated at 2 MB per file.
Path-only bundle rejection (fail closed, friendly). WebGPT, WebGemini, WebKimi, and WebPerplexity cannot read the local filesystem. If the project agent passes bare path lists or unreadable bundles, $ask stops before surf webgpt.submit and returns a project-agent message: I'm a web-based agent and I can't read local file paths. Please provide either a zip review bundle of no more than 5 files, or give me a concatenated text file. The run records needs_attention (exit code 2), not an opaque transport failure. Valid inputs: one concatenated .md/.txt path whose content is inlined, or one .zip path (≤5 files, $ask webgpt only) passed via --attach-file.
Focus preservation. The controlled tab is never foregrounded. The caller's active tab and focused window are unchanged across the call; meta.focus_changed must be false.
Default multi-turn. Unless --once is passed, explicit $ask webgpt uses ASK_WEBGPT_DEFAULT_ITERATIONS (default 2) on the same controlled tab via --oracle-iterations. Use --once for a single round; use --oracle-iterations N to override explicitly.
No DAG on plain webgpt. Product/architecture questions routed through $ask webgpt do not auto-draft coding DAGs; use --orchestrate only when you intend skill-graph execution.
Background collaboration loops → see /collab (collab webgpt --profile …), which shells to /ask webgpt and fires notify-send on HUMAN_REQUIRED / BLOCKED.
Multi-turn iteration. Each $ask webgpt call is one round on the same controlled tab. ChatGPT keeps the conversation context per tab, so a second call refines naturally. The canonical pattern is: project agent reads the first round's answer, decides whether to push back, and re-invokes $ask webgpt … to send the follow-up — no special iteration flag needed. Internal --oracle-iterations N is also honoured: each iteration sends one follow-up nudge ("identify the weakest claim and address it").
Proof contract. Inherits the WebGPT sentinel contract from surf: controlled_tab_id == requested_tab_id, sentinel present in raw response, stripped from clean response, no clean-response contamination from page chrome. oracle_webgpt_call_started / _finished / _failed events are recorded to the run state.
Other consumers. /review-prompt, /review-design, /review-code, /review-plan compose /ask and inherit this backend for free — pass --oracle-backend webgpt (or set ASK_ORACLE_BACKEND=webgpt) to the underlying /ask call.
Live sanity. skills/ask/sanity-webgpt.sh exercises the full path end-to-end against a real ChatGPT tab and asserts the proof contract, oracle wiring, and focus invariance. Modes: --tab-id ID, --url URL, --create-tab, or no flag (which auto-picks a single chatgpt.com tab or prints a 4-option help block when 0 or >1 candidates exist). Run after changes to webgpt_runtime.py, the oracle dispatcher, or the model alias router.
Rate-limit guard. Multi-round ping-pong can fan out to many ChatGPT rounds quickly. webgpt_rate_limit.py enforces a per-hour budget per account (default 30 rounds) and refuses to start new rounds once the cap is hit. Tunable via ASK_WEBGPT_MAX_ROUNDS_PER_HOUR; bypass via ASK_WEBGPT_RATE_LIMIT_DISABLE=1 (tests only). State at ~/.pi/webgpt-rate-limit.json.
Verdict JSON parsing. ChatGPT occasionally emits technically-invalid JSON (raw control chars inside string fields, trailing commas, missing quotes). extract_verdict layers strict json.loads → json_repair → plain-text VERDICT: regex, so transient model misbehavior never silently drops the verdict label.

Tech lead vs code runner (read this before WebGPT rounds)

When the human says WebGPT is the tech lead and the project agent is the code runner, roles are fixed:

Role	Who	Does	Does not
Tech lead	`$ask webgpt` on a bound tab (`--webgpt-project`)	Adjudicate policy, page contracts, screenshots; author `$test-interactions` manifest JSON; ask clarifying questions	Patch the repo, run shells, run `$test-interactions`, replace deterministic gates
Code runner	Project agent (you)	Implement allowlisted fixes, `cargo check`, `uv run maturin develop`, re-extract/materialize, run gates, write artifacts	Pretend WebGPT output is proof; skip local verification

How the tech lead is contacted (two modes):

Clarifying question — one focused ambiguity (policy, human vs pipeline, closure count semantics). Minimal context: page id, counts, artifact paths.
Review bundle — one concatenated markdown packet for real adjudication. Build /tmp/pdf-oxide-review-bundle.md with request text, gate output, diff summary, and JSON excerpts inlined — not a directory of separate files and not a zip archive.

Example (single file the runtime can inline):

/tmp/pdf-oxide-review-bundle.md
  # Review request (one paragraph)
  ## Gate output (inlined JSON)
  ## Diagnosis / diff summary (inlined)

Invoke WebGPT with that one file path (auto-attached and validated):

./run.sh ask webgpt "Review /tmp/pdf-oxide-review-bundle.md.
Return PASS, NEEDS_CHANGES, HUMAN_REQUIRED, or BLOCKED with specific guidance only.
Do not implement fixes."   --webgpt-project pdf-oxide   --oracle-iterations 1

For bounded background loops with desktop notify, use /collab webgpt --profile … (shells to this same /ask webgpt path; does not change roles).

Not the same lane as code repair:

WebGPT guidance → $ask webgpt / /collab (surf, background tab).
Discrepancy + patch → project agent + optional scillm exec graph (scillm_call + oc-kimi for multimodal review, pi-opencode-kimi for allowlisted edits) or scripts/pdf_lab/exec_two_call_page_repair.py.
Do not describe WebGPT rounds as "two scillm exec oc-kimi calls"; oc-kimi is HTTP chat, not scillm exec, and is for extraction repair—not the ChatGPT tab.

Proof of a WebGPT round is under .ask_artifacts/runs/<ask_id>/ (and oracle_webgpt_call_finished with no_activate: true), not an assistant paraphrase.

Bounded reviewer/executor workflow

Use this pattern when the human wants WebGPT as a reviewer/oracle while the project agent remains the executor:

human defines intent
  -> optional /interview for acceptance criteria
  -> project agent implements or gathers evidence
  -> /ask webgpt reviews the evidence bundle
  -> project agent applies corrections
  -> repeat until gate passes, max rounds is reached, or a human decision is required

Start with /interview when the acceptance criteria are not concrete enough to verify without guessing. Examples: product workflow direction, UI purpose, schema ownership, status vocabulary, prompt contract expected outputs, or any case where the reviewer would otherwise invent the definition of done. Skip /interview when the human already provided executable acceptance criteria or the task is a narrow bug fix with an obvious failing proof.

The project agent should not expose every internal reviewer turn to the human. Human-facing status should be limited to:

current state
blocker
proposed decision
evidence link or artifact path
what changed since the previous round
whether a human decision is required

Recommended bounded loop:

./run.sh ask webgpt "Review /tmp/review-bundle.md (concatenated evidence file).
Return PASS, NEEDS_CHANGES, or BLOCKED with specific fixes only." \
  --webgpt-project <project-review> \
  --oracle-iterations 1

Then the project agent applies fixes locally, regenerates the evidence bundle, and calls /ask webgpt again on the same --webgpt-project until one of these terminal conditions occurs:

PASS + local PASS (dual agreement): WebGPT returns PASS (or VERDICT: PASS) on the same This round acceptance bullets and deterministic local gates pass. Only then set Goals met this round: YES in COLLABORATION_STATUS.md. Eligible for $plan-iterate close-phase when that phase's contract matches.
NEEDS_CHANGES: WebGPT names specific fixes; project agent patches, refreshes status, re-runs local proof, re-asks WebGPT.
BLOCKED: WebGPT names a missing dependency or unresolved product decision the agent cannot clear → human escalation (see below).
MAX_ROUNDS: configured round cap reached → human or narrower scope.
NEEDS_HUMAN_DECISION: acceptance/product policy, not implementation.

Do not use this loop for unbounded brainstorming, bulk review queues, or as a substitute for deterministic tests. WebGPT review is a gate over concrete evidence, not proof by itself.

Collaboration status file (required for multi-round WebGPT)

For any multi-round project-agent + WebGPT collaboration on the same --webgpt-project, maintain a shared COLLABORATION_STATUS.md at an absolute path. Refresh it before every $ask webgpt round. Embed that path in the question so file auto-attachment inlines it.

Full contract: docs/ASK_COLLABORATION_STATUS_CONTRACT.md
Copy template: docs/templates/COLLABORATION_STATUS.template.md

Required sections: Goals (table with per-goal status: complete | outstanding | blocked | pending), North star, Accomplished, Standing (not closed), Blockers (agent-actionable vs human-required), This round acceptance, Agreement (round N) with local gates, WebGPT verdict, Goals met this round, Human needed.

WebGPT reviews only This round acceptance. It must not invent criteria or declare overall project/phase complete unless both parties recorded PASS on those bullets.

Dual agreement: Neither WebGPT alone nor the project agent alone may claim goals complete. WebGPT PASS without local live e2e/tests when bullets require them is not closure. Local PASS without a successful $ask artifact when review is required is not closure.

Human assistance when: BLOCKED, non-empty Human-required blockers, MAX_ROUNDS, persistent local/WebGPT disagreement after one reconcile attempt, or $plan-iterate returns HUMAN_REQUIRED. Human-facing updates stay limited to state, blocker, decision, evidence paths, delta since last round, and whether a human decision is required.

Recommended invocation:

./run.sh ask webgpt "Read /path/to/COLLABORATION_STATUS.md.
Review ONLY 'This round acceptance'. Return VERDICT: PASS | NEEDS_CHANGES | BLOCKED."   --webgpt-project <project>   --ask-id <round-id>   --run-output-root /tmp/ask-webgpt-<project>   --overwrite

Proof of WebGPT review is the ask artifact set (<ask-id>.status.json, etc.), not an assistant summary. Update Agreement in the status file after each round.

Explorer / UI page review loop (WebGPT + test-interactions)

Use when the human wants WebGPT to review live product pages (screenshots + contracts) and to author interaction manifests that the project agent runs. WebGPT is tech lead; the project agent is the only code runner.

Role split (non-negotiable):

Output	Tech lead (WebGPT)	Code runner (project agent)
Per-page verdict	pass / degraded / fail + reason	Re-check against live gates after fixes
Overall package verdict	PASS / NEEDS_CHANGES / BLOCKED	Do not claim product-ready without local proof
Interaction manifest JSON	Draft `[data-qid]` workflow + assertions	Run `/test-interactions/run.sh run`
Code / predicate fixes	Describe required change only	Patch repo, re-capture, re-run gates
systemd / human policy	Flag HUMAN_REQUIRED	Execute only after human approves

WebGPT never runs ./run.sh, uv run, patches, or $test-interactions. Those are always project-agent steps after the ask artifact lands.

Review bundle (one concatenated markdown file — auto-inlined):

Build /tmp/<project>-page-review-rN.md with request text, contract audit JSON, gate readouts, and screenshot paths described in prose only when screenshots are reviewed separately via multimodal $scillm routes. Do not pass a directory path or a manifest of bare /tmp/... file paths — browser reviewers cannot read the filesystem.

/tmp/<project>-page-review-rN.md
  # Review request (one paragraph)
  ## page-contract-audit.json (inlined)
  ## coverage-health.json (inlined)
  ## page-purpose excerpt (inlined)
  ## prior verdict (optional, inlined)

Capture screenshots with $test-interactions (deterministic run) or $surf before the WebGPT round. Prefer crops of page-purpose strip, primary table/panel, and fail-closed empty states—not only UX Lab chrome.

Ask WebGPT for manifest output in the same round:

In REVIEW_REQUEST.md, require a fenced JSON block named test-interactions-manifest (or a separate file path WebGPT names explicitly) with semantic workflow steps the scaffold generator cannot infer:

navigate to Final Site + tab (if needed)
wait_ready on page-specific root qid
refresh / row drill / keyboard path
deterministic assertions (assert_visible, assert_text, monitor row counts)

The project agent saves that JSON and runs:

/home/graham/.codex/skills/test-interactions/run.sh run   --manifest /tmp/<project>-coverage-manifest.json   --output-dir /tmp/<project>-coverage-ti-rN/

/home/graham/.codex/skills/test-interactions/run.sh review   --captures /tmp/<project>-coverage-ti-rN/   --persona nico-bailon   # or brandon-bailey for compliance surfaces

PASS/FAIL comes from $test-interactions, not from WebGPT. WebGPT comments on evidence; it does not override deterministic verdicts.

Example invoke (tab-bound, review-only wording):

cd ~/.codex/skills/ask
./run.sh ask webgpt "Review /tmp/sparta-page-review-r4/REVIEW_REQUEST.md and attached screenshots.
For each Explorer page: verdict pass|degraded|fail, blockers, and next evidence.
Return overall PASS, NEEDS_CHANGES, or BLOCKED.
Also emit test-interactions-manifest JSON for Coverage (semantic workflow, all [data-qid] selectors).
Do not run commands or edit the repo."   --webgpt-tab-id 837344161   --webgpt-project sparta-explorer-review   --oracle-iterations 1

Prompt vocabulary (avoid ask DAG misfire):

Natural-language $ask prompts containing implement, fix, patch, or refactor can auto-draft an ask.dag.v1 with an implement node and fail with AskDagError on review-only work. For WebGPT review rounds, prefer:

"review", "adjudicate", "verdict", "guidance", "manifest JSON", "evidence bundle"
explicit "Do not run commands or edit the repo"

If DAG mode is required for coding, use --dag-file explicitly—not accidental orchestrate triggers inside WebGPT review prompts.

Proof chain for the human:

.ask_artifacts/runs/<ask_id>/ — request, status, events, WebGPT clean response
Project-agent captures + results.json from $test-interactions
Updated contract audit / monitor health JSON after local fixes

Do not treat accepted plan-iterate phases or WebGPT PASS as product closure until (2) and (3) agree.

Cursor Browser Oracle Backend (within Cursor IDE)

Use --oracle-backend cursor-browser or $ask cursor-browser … when you want ChatGPT inside Cursor's embedded Browser pane, not external Chrome + surf-cli.

Tab identity: Cursor uses viewId (e.g. f53e74) from browser_tabs / MCP — NOT Chrome numeric tab ids from surf tab.list or the Tab ID Viewer extension.

Context	Identifier	How to get it
Chrome + `$ask webgpt`	Chrome tab id	Tab ID Viewer; `surf tab.list`; `--webgpt-project`
Cursor Browser + `$ask cursor-browser`	viewId	MCP `browser_tabs` list; `surf cursor-browser.tab.list`; `--cursor-browser-project`

Prerequisites

Shell automation requires cursor-browser-bridge (one-time install + Cursor window reload). It exposes Cursor Browser to /ask and /surf via HTTP on /tmp/cursor-browser-bridge-port.

Without the bridge, use @Browser in chat (agent MCP) or install the bridge for ./run.sh.

Quick start

# List Cursor Browser tabs (viewId, title, url)
cd ~/.claude/skills/surf
./run.sh cursor-browser.tab.list

# Bind a project to a viewId (recommended for iterative work)
cd ~/.claude/skills/ask
./run.sh cursor-browser-project bind scilla-harness --view-id f53e74 \
  --url "https://chatgpt.com/c/6a15cd46-9768-83ea-b7e7-88b601dab985"

# Ask via Cursor Browser (same artifact contract as webgpt)
./run.sh ask cursor-browser "what is the capital of Texas" \
  --oracle --oracle-backend cursor-browser --once \
  --cursor-browser-project scilla-harness \
  --ask-id texas-cursor --run-output-root /tmp/ask-cursor --overwrite

Tab resolution (fail-closed)

Priority: --cursor-browser-view-id → --cursor-browser-url → --cursor-browser-project binding → auto-resolve from cursor-browser.tab.list filtered to chatgpt.com (exactly one tab required).

When 0 or >1 ChatGPT tabs exist, /ask refuses to guess — pass --cursor-browser-view-id or bind a project.

Proof artifacts

Same layout as WebGPT rounds under .ask_artifacts/runs/<ask_id>/, with oracle_cursor_browser_call_started / _finished events and surf cursor-browser.submit meta JSON (controlled_view_id, sentinel fields).

When to use which path

Goal	Use
Code review / tech lead in Chrome (background)	`$ask webgpt` + `--webgpt-project` or `--webgpt-tab-id`
Prose / writing in Chrome	`$ask webkimi` + `--kimi-tab-id`
Design review in Chrome	`$ask webgemini` + `--gemini-tab-id`
Research (fresh web, one shot)	`$ask webperplexity`
Cursor IDE — ChatGPT in embedded Browser (self-contained)	`$ask cursor-browser` + `--cursor-browser-project` or `--cursor-browser-view-id`
Ad-hoc one question, agent already in chat	`@Browser` on current page (no `./run.sh` artifacts unless you re-run ask)

Do not pass Chrome tab ids to --cursor-browser-view-id. They are different namespaces.

WebGemini Oracle Backend (design)

--oracle-backend webgemini or $ask webgemini … — design work (mockups, UX, visual critique) via an authenticated Gemini tab in Chrome.

Transport: surf gemini.submit --no-activate (surf-cli extension required; /tmp/surf.sock).
Tab resolution: --gemini-tab-id → --gemini-url → auto-resolve when exactly one gemini.google.com tab is open (fail-closed otherwise).
Same bundle rules as WebGPT: concatenate evidence into one .md/.txt; no zip attach on this backend.
Multi-turn: re-invoke on the same controlled tab; no --gemini-project bindings yet (pass tab id explicitly).
Events: oracle_gemini_call_started / _finished / _failed.

./run.sh ask webgemini "Review /tmp/design-review-bundle.md for visual hierarchy and NVIS compliance."   --oracle --gemini-tab-id <id> --once

WebKimi Oracle Backend (prose)

--oracle-backend webkimi or $ask webkimi … — prose work (papers, voice, clarity, long-form writing) via an authenticated Kimi tab in Chrome.

Transport: surf kimi.submit --no-activate.
Tab resolution: --kimi-tab-id → --kimi-url → auto-resolve when exactly one kimi.com tab is open (fail-closed otherwise).
Same bundle rules as WebGPT (concatenated text; no zip).
Multi-turn on the same tab; no project bindings yet.
Events: oracle_kimi_call_started / _finished / _failed.

./run.sh ask webkimi "Review /tmp/paper-section.md for voice and clarity."   --oracle --kimi-tab-id <id> --once

WebPerplexity Oracle Backend (research)

--oracle-backend webperplexity or $ask webperplexity … — research questions needing fresh web synthesis (not multi-round review on a standing tab).

Transport: surf perplexity (one-shot CDP session per query).
No standing tab — each call is independent; do not use for bounded code/design review loops.
Bundle: concatenated .md/.txt only (paths inlined); no zip.
Events: oracle_perplexity_call_started / _finished / _failed.

./run.sh ask webperplexity "What changed in NIST SP 800-171 Rev 3 draft in the last 12 months?"   --oracle --once

Image Generation Mode

Use /ask --image-generate when the answer should be an image artifact rather than retrieved memory or an oracle text response. /ask sends the prompt to /scillm POST /v1/images/generations, writes generated image files under the ask run directory by default, and records an image_generation.json manifest.

./run.sh ask "a precise architecture diagram of ask calling scillm for image generation" \
  --image-generate \
  --image-model gpt-image-2 \
  --image-size 1024x1024 \
  --image-quality high

Image generation is standalone: do not combine it with memory retrieval, oracle, roundtable, argue, parallel-review, deep-review, or CAE gap-review options. Use --image-output to choose a file or directory, and --image-output-format png|jpeg|webp to choose the artifact format.

Zero Cognitive Load for Project Agents

Project agents should just ask — the skill handles all discovery complexity:

# Project agent just asks a question
./run.sh ask "Lisa Feldman Barrett how might we improve our /memory system?" --auto-learn

# What happens automatically:
# 1. Memory check (existing knowledge?)
# 2. /dogpile deep research (Brave, Perplexity, ArXiv, YouTube, GitHub)
# 3. YouTube transcript ingestion (lectures, interviews)
# 4. Web content fetching (blogs, articles)
# 5. QRA extraction + Federated Taxonomy tagging
# 6. Memory storage with persona profile
# 7. Answer synthesis with multi-hop traversal

Human Chat Usage

As the human using chat, use natural $ask phrasing. The agent should translate these into the correct /ask CLI call. See docs/HUMAN_CHAT_EXAMPLES.md for the complete human-facing example catalog. Representative examples are enforced by sanity.sh via tests/test_human_chat_examples.py; do not add new chat patterns without adding route coverage.

Agent translation rules:

Preserve the human's natural topic text; do not over-normalize domain terms.
Treat a named persona before the question as --oracle-persona <name>.
Treat multiple named personas with "debate", "roundtable", or "discuss" as --roundtable.
Treat "parallel reviewers", "adversarial reviewers", or "N reviewers" as --parallel-review.
Treat "argue whether", "debate whether", or "make the case for and against" as --argue.
Treat "review then roundtable" as both --parallel-review and --roundtable.
Treat "CAE gap review", "QRA gap review", or "CAE reviewers" as --cae-gap-review.
Treat "generate an image", "image generation", or "make an image" as --image-generate; keep it standalone from memory/oracle/review modes.
Treat "deep review", "comprehensive review", "safe to proceed", or "production readiness" as --deep-review; require or infer a concrete --deep-review-target.
Treat leading model shorthand such as $ask oc kimi ..., $ask opencode qwen ..., $ask chutes kimi ..., $ask oc-kimi ..., or $ask chutes-kimi ... as --oracle --oracle-backend scillm with the resolved provider model.
Treat leading $ask webgpt ... (or $ask chatgpt ...) as --oracle --oracle-backend webgpt. This drives an already-authenticated ChatGPT tab in the user's Chrome via the surf-cli extension; the controlled tab never foregrounds (--no-activate).
Treat multi-round WebGPT review with a bound --webgpt-project as requiring an absolute-path COLLABORATION_STATUS.md (refresh before each round; see docs/ASK_COLLABORATION_STATUS_CONTRACT.md). Dual agreement: local PASS and WebGPT PASS on the same This round acceptance bullets before claiming goals met.
Treat Ask Nico ..., Bring Nico into this conversation ..., and other visible named-subagent conversation requests as /ask-owned persistent tmux-visible subagent sessions using --oracle-backend subagent-runner. Do not satisfy these with direct /subagent-runner, a one-shot oracle answer, or a hidden fallback. The proof artifact must include the tmux attach command, session directory, transcript, status, events, and send-input command.
Treat date-sensitive words (2026, current, latest, today, recent) as --dogpile auto.
Default high-value analytical questions to --oracle --oracle-model gpt-5.5 --oracle-reasoning high.

Human chat prompt	Route
`$ask what do we know about the release checklist?`	Memory-backed ask synthesis
`$ask What is the state of Python packaging in 2026?`	Oracle with auto persona selection and `--dogpile auto`
`$ask What is the state of space-based cybersecurity in 2026?`	SPARTA-scoped oracle: `--scope sparta --oracle`
`$ask oc kimi explain this design tradeoff`	scillm OpenCode Go oracle using live model discovery and capability metadata, currently `opencode-go/kimi-k2.6`
`$ask oc kimi for a $review-design with maximum 3 rounds`	Ask-backed review-design loop using `opencode-go/kimi-k2.6`; capture fresh screenshots, ask Kimi for a verdict, patch locally, re-render, and stop after PASS/blocker/3 rounds
`$ask oc-qwen compare these options`	Hyphenated OpenCode Go shorthand, currently `opencode-go/qwen3.6-plus`
`$ask chutes kimi explain this design tradeoff`	scillm Chutes oracle using configured alias `chutes-kimi`
`$ask chutes-kimi explain this design tradeoff`	Hyphenated Chutes shorthand using configured alias `chutes-kimi`
`$ask webgpt review round 2 using COLLABORATION_STATUS.md`	WebGPT oracle with status file auto-attached; update Agreement after artifact; dual agreement before closure
`$ask webgpt to perform the review on /tmp/review-bundle.md`	WebGPT oracle backed by the user's signed-in ChatGPT tab (via `surf webgpt.submit --no-activate`). File paths in the prompt are auto-attached. Tab id auto-resolves when exactly one chatgpt.com tab is open; otherwise pass `--webgpt-tab-id`.
`$ask webgpt again — refine your answer`	Multi-turn: each `$ask webgpt` call is one round against the same controlled tab. ChatGPT preserves conversation context, so iterations form a coherent dialogue.
`$ask Bring Nico into this conversation to review the mockups`	Persistent visible Nico session. Human attaches with `tmux a -t nico-<project>`; project agent sends follow-up turns through the `/ask` artifact's `send_input_command`.
`$ask Brandon what is the state of space-based cybersecurity in 2016?`	Brandon persona oracle over `--scope sparta`
`$ask Brandon, Margaret, and Jennifer personas to roundtable about the topic: What is the state of cybersecurity in 2026?`	SPARTA-scoped sequential persona roundtable
`$ask Brandon what is the best way to review this API boundary?`	Brandon persona oracle subagent
`$ask Brandon persona about whether this retry design fails closed`	Brandon persona oracle over memory/project context
`$ask Brandon critique this architecture`	Brandon persona critique
`$ask Brandon ask Margaret where are we weak?`	Safe Brandon→Margaret peer deliberation
`$ask Brandon, Margaret, and Jennifer personas to roundtable about the topic: Should this service use retries or queues?`	Sequential protocolized persona roundtable
`$ask run 3 parallel adversarial reviewers on this implementation`	Independent parallel review plus moderator synthesis
`$ask cae gap review AC-2 MFA evidence for the production tenant`	Evidence-case-backed QRA review: `--cae-gap-review --cae-max-rounds 3`
`$ask generate an image of the ask to scillm image route`	Image artifact generation: `--image-generate`
`$ask argue whether we should ship this change`	Two parallel `/scillm` advocates plus sequential judge and verifier
`$ask deep review this implementation --deep-review-target src/ask/ask.py`	Read-only deep review with markdown and JSON artifacts
`$ask review then roundtable with Brandon, Margaret, Jennifer`	Parallel findings first, then sequential persona debate
`$ask oracle should we use subagent-runner here?`	GPT-5.5 high-reasoning oracle
`$ask learn Lisa Feldman Barrett`	Learn/persona-ingest mode
`$ask is memory healthy?`	OS/runtime health mode

Natural persona syntax is supported by the CLI too:

./run.sh ask Brandon what is the best way to review this API boundary?

This is equivalent to:

./run.sh ask "what is the best way to review this API boundary?" \
  --oracle \
  --oracle-backend subagent-runner \
  --oracle-persona Brandon \
  --oracle-model gpt-5.5 \
  --oracle-reasoning high

The subagent, not the chat router, decides whether /dogpile is necessary after checking /memory. The /ask controller decides the timeout and records execution telemetry in /memory for future timeout policy.

Plain broad analytical questions, such as "what is the state of X", are routed to the oracle with persona consultation even when the human does not name a persona. The oracle should choose the best ready persona using /memory recall over persona lessons/lore; /monitor-personas is only an optional readiness check.

Quick Start

cd .pi/skills/ask

# Learn about a persona (deep learning)
./run.sh learn "Lisa Feldman Barrett" --scope behavioral --depth deep

# Ask with auto-learn (discovers + ingests if no knowledge exists)
./run.sh ask "What does Sapolsky say about free will?" --scope behavioral --auto-learn

# Interactive mode (asks about learning preferences)
./run.sh learn "Robert Sapolsky" --scope behavioral --interactive

# Check learning progress (includes task-monitor state with ETA)
./run.sh status --scope behavioral

# Run nightly persona update
./run.sh nightly --scope behavioral

Commands

`ask` — Query Accumulated Knowledge

./run.sh ask <question> [options]
./run.sh ask Brandon what is the state of space-based cybersecurity in 2016? --scope sparta

Options:
  --scope <scope>         Memory scope to query (default: "ask")
  --k <n>                 Number of results (default: 5)
  --bridges               Also traverse bridge attributes (multi-hop)
  --auto-learn            Auto-discover and learn if no knowledge found
  --collection <coll>     Taxonomy collection for auto-learn (default: behavioral)
  --consult-personas      Find and suggest relevant personas to consult
  --persona-scope <scope> Scope to search for personas (default: personas)
  --hybrid                Use hybrid RAG+QRA retrieval
  --oracle                Use scillm/Codex for final oracle synthesis
  --oracle-backend <b>    Oracle backend: auto, scillm, subagent-runner, webgpt
  --oracle-model <model>  Oracle synthesis model (default: gpt-5.5)
  --oracle-reasoning <r>  Oracle reasoning effort (default: high; deep-review default: xhigh)
  --oracle-timeout <sec>  Oracle HTTP timeout in seconds (default: 300)
  --oracle-idle-timeout <sec> Subagent silence timeout before stalled recovery (default: 300)
  --oracle-heartbeat-interval <sec> Memory heartbeat write interval (default: 30)
  --oracle-persona <p>    Primary persona/subagent for oracle synthesis
  --oracle-peer <p>       Second persona/subagent for oracle deliberation
  --oracle-persona-model <m> Model for primary persona turns
  --oracle-peer-model <m> Model for peer persona turns
  --oracle-iterations <n> Sequential oracle deliberation calls (default: 1)
  --oracle-image <path> Attach an image to a direct scillm oracle call (repeatable)
  --roundtable          Run sequential protocolized persona deliberation
  --roundtable-personas <p> Comma-separated persona[:protocol_role] participants
  --roundtable-role-preset <p> Role preset (default: adversarial-review)
  --roundtable-rounds <n> Number of full participant rounds (default: 2)
  --roundtable-mode <m> Mode label (default: adversarial)
  --roundtable-persist <summary|full> Persist compact protocol state or full turns
  --argue                Run two parallel /scillm advocates followed by a judge
  --decision-required    Force FOR/AGAINST with uncertainty disclosure
  --tie-breaker <policy> Tie-breaker for --decision-required
  --parallel-review     Run independent parallel adversarial reviewers
  --parallel-reviewers <n> Number of default reviewers (default: 3)
  --parallel-review-personas <p> Comma-separated reviewer persona[:protocol_role] specs
  --parallel-review-focus <f> Comma-separated focus labels for default reviewers
  --parallel-review-role-preset <p> Role preset for parallel reviewers
  --cae-gap-review      Run evidence-case-backed CAE/QRA gap review
  --cae-reviewers <p>   Comma-separated CAE persona:role pairs
  --cae-judge <p>       CAE judge persona label
  --cae-max-rounds <n>  Maximum CAE clarify/reroute rounds
  --deep-review          Run read-only deep review with review.md and review.json artifacts
  --deep-review-target <target> Explicit target: paths, diff, plan, manifest, or artifact
  --deep-review-profile <p> Deep-review profile label (default: max_available)
  --deep-reviewers <n> Reviewer breadth requested for deep review (default: 5)
  --deep-review-focus <f> Comma-separated deep-review focus labels
  --deep-review-fallback-policy <fail_closed|warn> Downgrade behavior
  --deep-review-persist <summary|full> Persist compact metadata or full review state
  --deep-review-output-root <dir> Artifact root (default: .ask_artifacts/deep-review)
  --chain <name|path>   Saved review chain spec (e.g. deep-review-safety)
  --reviewer-spec <name|path> Reviewer role/focus spec (repeatable)
  --orchestrate          Draft and execute an ask DAG from the natural-language request
  --dag-json <json>     Execute an inline ask/scillm-style DAG before synthesis
  --dag-file <path>     Execute an ask/scillm-style DAG JSON file before synthesis
  --dogpile <auto|off|force> Freshness policy for date-sensitive oracle prompts
  --dry-run             Emit execution spec/risk analysis without mutation
  --ask-id <id>          Stable runtime artifact id for this ask call
  --run-output-root <dir> Runtime artifact root (default: .ask_artifacts/runs or ASK_RUN_OUTPUT_DIR)
  --overwrite            Replace an existing run directory for --ask-id
  --resume               Resume a non-terminal existing run directory for --ask-id
  --raw                   Return raw memory results (no synthesis)
  --image-generate        Generate image artifact(s) through scillm
  --image-model <model>   Image generation model (default: gpt-image-2)
  --image-size <size>     Image size, for example auto or 1024x1024
  --image-quality <q>     Image quality, for example auto, medium, or high
  --image-count <n>       Number of images to generate (default: 1)
  --image-output <path>   Output file or directory for generated image(s)
  --image-output-format <fmt> Image file format: png, jpeg, or webp
  --image-timeout <sec>   Image generation timeout in seconds (default: 300)
  --json                  JSON output
  --debug                 Enable debug logging

Runtime artifacts:

.ask_artifacts/runs/<ask_id>/
  <ask_id>.request.json
  <ask_id>.status.json
  <ask_id>.events.jsonl
.ask_artifacts/runs/index.jsonl

request.json captures the normalized routed request before mutation or oracle execution. status.json is atomically replaced as the run progresses. events.jsonl is append-only and records lifecycle events such as request_written, ask_started, memory_recall_started, memory_recall_finished, evidence_case_started, synthesis_finished, finished, and failed.

Natural-Language DAG Orchestration

/ask --orchestrate drafts an executable ask.dag.v1 from a clear natural language request. This is the zero-cognitive-load interface for project agents: the human names the desired skill orchestration, and /ask converts it into memory-first, optional dogpile, oracle, and sibling-skill nodes.

Rules:

/ask owns skill orchestration, dependencies, joins, run-state artifacts, and final synthesis.
$scillm remains a skill/backend boundary. /ask may request a model/profile for an ask.oracle node, but it does not expose model-pool, queue, provider-capacity, fallback, or transport choices.
/memory and /dogpile own their own recall/research internals.
/interview is used only when a natural-language request lacks a target skill, output artifact, or acceptance criteria that cannot be safely inferred.
skill.run validation uses runnable sibling skill contracts (SKILL.md plus run.sh) instead of a narrow product-specific skill list.

Natural-language DAG drafting also auto-engages for prompts that explicitly ask for a DAG, workflow, skill graph, orchestration, or coding implementation (implement, fix, patch, refactor). When the request is ambiguous, /ask fails closed with a needs_attention artifact pointing to /interview rather than guessing.

DAG JSON Mode

/ask can execute a project-agent DAG directly with --dag-json or --dag-file. This is the preferred interface when an agent can express the workflow more clearly as JSON than as a long list of flags.

Public graph envelope:

ask.dag.v1, the native ask DAG schema.

Compatibility graph envelope:

scillm.exec.graph.v1, normalized at the /ask boundary for React Flow, migration, and diagnostic tooling. This is not the preferred human/project agent input surface; normal callers use natural language or ask.dag.v1.

Supported node types:

memory.recall — run /memory recall and feed returned items into later nodes and synthesis.
dogpile.search — run /dogpile search for fresh research context.
ask.oracle — run a one-shot /scillm chat completion through /ask with the required ask caller headers and scillm metadata.
skill.run — run a runnable sibling skill with a SKILL.md and run.sh, such as create-report, review-code, memory, or dogpile.
scillm.agent_turn — run a standing Codex app-server worker through /v1/scillm/agents/{worker_id}/* (handoff → lease → turn → result). Use this for coding (worktree, declared_write_set, multi-turn, steer). Do not route implementation through /v1/chat/completions or default skill.run code-runner.

Coding DAG defaults:

Natural-language requests with implement/fix/patch/refactor intent draft an scillm.agent_turn node (id: implement, sandbox: workspace-write).
Worker id: --agent-worker, else ASK_AGENT_WORKER_ID, else implementation.
Registry must expose an implementation worker in config/scillm-agents.yaml.

Parallel review implementation:

--implement-with code-runner — handoff artifact only (read-only /ask).
--implement-with scillm-agent — same review path; no --code-runner-dod-command required (worker worktree enforces bounds).

Dependencies are expressed with depends_on. Independent nodes in the same layer run concurrently up to max_concurrency; dependent nodes run after their parents and receive upstream context. Each node may set max_attempts from 1 to 10. DAG runs always write request/status/events artifacts plus dag/manifest.json and one JSON artifact per node.

Each node may set allow_failure (boolean, default false). When a node finishes with ok: false:

allow_failure: false (default): the layer is fail-closed. /ask records dag_layer_failed with failed_node_ids and safe_default: stop_before_dependent_nodes, then raises AskDagError before any dependent layer runs.
allow_failure: true: the failed node is recorded in dag/manifest.json and its node artifact, but the DAG continues to dependent nodes that still declare the failed parent in depends_on. Use only for optional probes, shadow checks, or best-effort enrichment that must not block the main workflow.

Bounded auto-repair runs after a node attempt when /ask recognizes a known, deterministic failure mode. Repairs are recorded on the node artifact (repairs, auto_repaired) and in run-state events:

dag_node_auto_repair_started / dag_node_auto_repair_finished
Current repair: dogpile.search with returncode == -2 (skill timeout) when input.timeout is below 360 seconds — one retry at 360s with reason dogpile_timeout_budget_too_low.
Auto-repair does not bypass the required-node gate. If the repaired attempt still fails and allow_failure is false, the layer still fails closed.

Repair policy is data-driven:

Global policy: config/ask_dag_repair_policy.yaml
Timeout profiles: config/ask_dag_timeout_profiles.yaml
Skill hints: skills/<skill>/config/ask_dag_repair_hints.yaml (for example skills/dogpile/config/ask_dag_repair_hints.yaml)

Supported repair actions:

bump_timeout — retry once with a higher input.timeout when the skill subprocess was killed for budget (returncode == -2).
consume_partial_artifact — when a sibling skill wrote usable partial output (for example dogpile_partial_results.json with final_report), promote that artifact into the node result instead of failing immediately.
True ambiguity (missing target skill, artifact, or acceptance criteria) still routes to /interview via needs_attention, not silent DAG repair.

For layers containing ask.oracle nodes, /ask creates an internal scillm.exec.graph.v1 subgraph artifact such as dag/layer-001-scillm.subgraph.json and records dag_scillm_partition_started / dag_scillm_partition_finished events. /ask still owns dependencies, sibling skills, joins, run state, and final artifacts; the partition artifact is a diagnostic handoff receipt for the scillm-owned model lane, not user-authored orchestration input. /ask does not expose model-pool, queue, provider-capacity, or transport choices in the DAG contract; those remain $scillm internals. $ask submits model nodes with the requested model/profile and required metadata, and $scillm handles routing, streaming, provider queues, fallback, pooling, and telemetry.

The live sanity check for this contract is:

./scripts/dag_e2e_sanity.py --output-root /tmp/ask-dag-e2e-proof --ask-id ask-dag-e2e-proof

That check writes a realistic scillm.exec.graph.v1 file, runs two concurrent memory.recall nodes, joins them into a sequential create-report node, and verifies request/status/events, dag/manifest.json, per-node artifacts, and the generated Markdown report.

./scripts/dag_negative_sanity.py --output-root /tmp/ask-dag-negative-proof --ask-id ask-dag-negative-proof

That check runs a required skill.run node with an invalid flag, records dag_layer_failed, leaves the failed node artifact, and does not create the dependent node artifact.

For a costful live model lane, add --include-oracle; the same DAG inserts two concurrent ask.oracle nodes before the final create-report join.

Example native graph:

{
  "schema_version": "ask.dag.v1",
  "max_concurrency": 2,
  "nodes": [
    {
      "id": "memory_first",
      "type": "memory.recall",
      "input": {"query": "prior lessons for this report", "scope": "project"}
    },
    {
      "id": "brandon",
      "type": "ask.oracle",
      "depends_on": ["memory_first"],
      "input": {
        "prompt": "Brandon: read the report and comment against the criteria.",
        "model": "gpt-5.5",
        "reasoning_effort": "high"
      }
    },
    {
      "id": "margaret",
      "type": "ask.oracle",
      "depends_on": ["memory_first"],
      "input": {
        "prompt": "Margaret: read the report and comment against the criteria.",
        "model": "gpt-5.5",
        "reasoning_effort": "high"
      }
    },
    {
      "id": "final_report",
      "type": "skill.run",
      "depends_on": ["brandon", "margaret"],
      "input": {"skill": "create-report", "args": ["--help"]}
    }
  ]
}

Concurrent lanes and sequential joins

/ask schedules nodes in topological layers. Nodes with no unresolved depends_on in the same layer run concurrently (up to max_concurrency). Nodes that list parents in depends_on wait until those parents finish, then run in a later layer — that is the sequential join.

RIGHT — Layer 1: two concurrent memory.recall nodes; Layer 2: one sequential skill.run join (matches ./scripts/dag_e2e_sanity.py):

{
  "schema_version": "ask.dag.v1",
  "graph_id": "memory-fanout-report-join",
  "graph_goal": "Recall two memory scopes in parallel, then create one report.",
  "max_concurrency": 2,
  "nodes": [
    {
      "id": "memory_ask_contract",
      "type": "memory.recall",
      "input": {
        "query": "ask DAG scillm metadata oracle review artifacts",
        "scope": "ask",
        "k": 3
      }
    },
    {
      "id": "memory_report_contract",
      "type": "memory.recall",
      "input": {
        "query": "best practices report source-of-truth inventory findings",
        "scope": "ask",
        "k": 3
      }
    },
    {
      "id": "final_report",
      "type": "skill.run",
      "depends_on": ["memory_ask_contract", "memory_report_contract"],
      "input": {
        "skill": "create-report",
        "args": [
          "--title", "Ask DAG Proof Report",
          "--input", "${dag_context_json}",
          "--output", "${dag_node_output}"
        ]
      }
    }
  ]
}

Layer semantics for this graph:

Layer	Runs	Why
1	`memory_ask_contract` and `memory_report_contract` in parallel	No `depends_on`; same topological layer; `max_concurrency: 2` allows both at once.
2	`final_report` after both parents succeed	`depends_on` lists both memory nodes — sequential join with merged `${dag_context_json}`.

Optional third layer (live model lane): add two concurrent ask.oracle nodes that both depends_on the memory pair, then point final_report.depends_on at the oracle ids as well. See ./scripts/dag_e2e_sanity.py --include-oracle.

WRONG — expecting a join without depends_on:

{
  "nodes": [
    {"id": "memory_a", "type": "memory.recall", "input": {"query": "a"}},
    {"id": "memory_b", "type": "memory.recall", "input": {"query": "b"}},
    {"id": "final_report", "type": "skill.run", "input": {"skill": "create-report", "args": ["--help"]}}
  ]
}

All three nodes are in one concurrent layer. final_report can start before either memory recall finishes and may see empty upstream context.

PHART renderer (preferred)

Terminal charts use PHART — Python Hierarchical ASCII Representation Tool. Source of truth: https://github.com/scottvr/phart.git (PHART 1.5+; PyPI is behind git).

Preferred path (mockup-style bboxes): sibling skill $phart-dag-chart (skills/phart-dag-chart/) pins PHART from git on Python 3.14+ and renders layout=layered with bboxes=True (matches scillm/docs/goals/DAG-ux-mockup.html middle pane).

# One-time: install Python 3.14 for uv (if needed)
uv python install 3.14

cd skills/phart-dag-chart
./run.sh validate path/to/plan.dag.json
./run.sh chart path/to/plan.dag.json

/ask dry-run and ./scripts/render_dag_chart.py delegate to $phart-dag-chart when uv + Python 3.14 are available.

ask_dag.ascii_renderer: phart-git | phart-pypi | fallback
phart-git: $phart-dag-chart subprocess (PHART 1.5 git)
phart-pypi: in-process PyPI phart 1.1.x on Python 3.11–3.12 (bracket style)
fallback: built-in renderer if PHART unavailable

./scripts/render_dag_chart.py /tmp/memory-fanout.dag.json

Legacy skills/ask/phart-renderer/ is deprecated.

Terminal ASCII DAG chart

Before a costful run, print the layer plan to the terminal so humans and project agents share the same mental model.

Preferred — dry-run prints the chart automatically:

./run.sh ask "Plan the memory fanout DAG" \
  --dag-file /tmp/memory-fanout-report-join.dag.json \
  --dry-run

The dry-run payload includes ask_dag.layers (machine-readable) and prints ask_dag.ascii_chart to stdout before the JSON spec. The chart mirrors the middle pane of scillm/docs/goals/DAG-ux-mockup.html: boxed nodes, vertical trunk, concurrent fanout row, then join — simplified for the terminal.

DAG decision tree · memory-fanout-report-join
schema=ask.dag.v1 · max_concurrency=2

┌───────────────┐   ┌───────────────┐
│ memory_a      │   │ memory_b      │
│ memory.recall │   │ memory.recall │
│ [req]         │   │ [req]         │
└───────────────┘   └───────────────┘
        │                   │
        ──────────┴──────────
                  │
         ┌────────────────┐
         │ final_report   │
         │ skill.run      │
         │ [req]          │
         └────────────────┘

Trunk + 3-way fanout + join (same mockup shape):

                   ┌───────────────┐
                   │ shape         │
                   └───────────────┘
                           │
        ┬──────────────────┼───────────────────┬
┌──────────────┐   ┌──────────────┐   ┌────────────────┐
│ payload      │   │ exec         │   │ nico           │
└──────────────┘   └──────────────┘   └────────────────┘
        └──────────────────┴───────────────────┘
                            │
                    ┌──────────────┐
                    │ gate         │
                    └──────────────┘

Also valid — JSON only (automation / logs):

./run.sh ask "Plan the DAG" --dag-file /tmp/graph.dag.json --dry-run --json \
  | jq -r '.options.ask_dag.ascii_chart'

After a real run, correlate the chart with events.jsonl (dag_layer_started, dag_layer_finished, dag_layer_failed) and dag/manifest.json.

Required-node failure: who fixes the bug?

Bounded auto-repair (timeout bump, partial artifact consume) handles known deterministic modes. It does not replace debugging.

When a required node fails (allow_failure: false, the default):

/ask records dag_layer_failed, writes the failed node artifact (dag/<node_id>.json), and does not run dependent layers.
The project agent (or a /scillm exec subagent such as scillm exec pi-chutes-kimi / pi-opencode-kimi in read-only sandbox) must inspect evidence, fix the root cause, and re-run:
- dag/<failed_node>.json — error, stderr, returncode, repairs
- events.jsonl — dag_node_failed, dag_layer_failed, repair events
- Upstream skill logs under the run output root
Fixes are usually: correct input.args, skill bug patch, timeout profile, missing artifact path, or auth/routing (for ask.oracle, check /v1/scillm/auth before blaming the DAG).
Re-submit the same DAG with --resume when appropriate, or a corrected --dag-file after the code/config fix lands.

WRONG — treating auto-repair or answered as proof the workflow succeeded:

# dogpile still failed after repair; final_report never ran
jq '.status' .ask_artifacts/runs/<ask-id>/status.json   # may still show failure
grep dag_layer_failed .ask_artifacts/runs/<ask-id>/events.jsonl

WRONG — marking a broken required probe optional instead of fixing it:

{"id": "fresh_context", "type": "dogpile.search", "allow_failure": true}

Use allow_failure: true only for shadow/best-effort enrichment. If the main path needs dogpile output, fix timeout/skill/code — do not hide a required failure.

RIGHT — project agent loop after dag_layer_failed:

1. Read ascii_chart + dag_layer_failed event
2. Open dag/<node>.json and sibling skill stderr
3. Patch skill/CLI/config OR adjust DAG input
4. Re-run: ./run.sh ask "..." --dag-file fixed.dag.json --ask-id <id> --resume
5. Confirm dag/manifest.json shows downstream node artifacts

DAG JSON usage — good vs bad:

Use ask.dag.v1 for project-agent handoffs. Prefer --dag-file when the graph is non-trivial; use --dag-json only for small inline graphs.

RIGHT — layered workflow with explicit dependencies, safe timeouts, and a single final join:

{
  "schema_version": "ask.dag.v1",
  "graph_id": "report-with-fresh-research",
  "max_concurrency": 2,
  "nodes": [
    {
      "id": "memory_first",
      "type": "memory.recall",
      "input": {"query": "prior lessons for this report", "scope": "project", "k": 5}
    },
    {
      "id": "fresh_context",
      "type": "dogpile.search",
      "input": {
        "query": "current state of LLM workflow orchestration boundaries",
        "timeout": 360
      }
    },
    {
      "id": "final_report",
      "type": "skill.run",
      "depends_on": ["memory_first", "fresh_context"],
      "input": {
        "skill": "create-report",
        "args": [
          "--title", "Ask DAG Proof Report",
          "--input", "${dag_context_json}",
          "--output", "${dag_node_output}"
        ]
      }
    }
  ]
}

./run.sh ask "Run the report DAG"   --dag-file /tmp/report-with-fresh-research.dag.json   --ask-id report-dag-proof   --run-output-root /tmp/ask-dag-proof/runs   --overwrite   --json

Why this is good:

schema_version is ask.dag.v1.
Node id values are stable snake_case identifiers.
depends_on expresses the real join order.
dogpile.search gets a realistic timeout (or omits it and uses the profile default).
skill.run targets a runnable sibling skill and passes DAG context placeholders.
Model-pool / queue / provider-capacity fields are not in the graph; only ask.oracle nodes name a model/profile when needed.

RIGHT — optional probe that must not block the main path:

{
  "id": "shadow_dogpile",
  "type": "dogpile.search",
  "allow_failure": true,
  "input": {"query": "best-effort fresh sources", "timeout": 120}
}

Use allow_failure: true only for enrichment probes. Required nodes stay fail-closed (default allow_failure: false).

WRONG — treating the DAG like a scillm transport/routing config:

{
  "schema_version": "ask.dag.v1",
  "nodes": [
    {
      "id": "lane",
      "type": "ask.oracle",
      "input": {
        "prompt": "Review this",
        "model_pool": "gpt-pool",
        "provider_queue": "chutes",
        "fallback_chain": ["gemini", "claude"]
      }
    }
  ]
}

/ask owns orchestration and artifacts. $scillm owns pools, queues, fallback, and telemetry. Do not put transport knobs in project-agent DAG JSON.

WRONG — dependent node with no depends_on (race / empty context):

{
  "schema_version": "ask.dag.v1",
  "nodes": [
    {"id": "memory_first", "type": "memory.recall", "input": {"query": "context"}},
    {"id": "final_report", "type": "skill.run", "input": {"skill": "create-report", "args": ["--help"]}}
  ]
}

Without depends_on, final_report can run in parallel with memory_first and receive empty ${dag_context_json}.

WRONG — required node with an impossible budget, then expecting dependents to run:

{
  "schema_version": "ask.dag.v1",
  "nodes": [
    {
      "id": "fresh_context",
      "type": "dogpile.search",
      "input": {"query": "broad research query", "timeout": 30}
    },
    {
      "id": "final_report",
      "type": "skill.run",
      "depends_on": ["fresh_context"],
      "input": {"skill": "create-report", "args": ["--help"]}
    }
  ]
}

If fresh_context stays failed after auto-repair, /ask records dag_layer_failed and does not create final_report.json. Fix the timeout/profile or mark the probe allow_failure: true when it is optional.

WRONG — inventing node types or skills:

{
  "schema_version": "ask.dag.v1",
  "nodes": [
    {"id": "x", "type": "scillm.batch", "input": {}},
    {"id": "y", "type": "skill.run", "input": {"skill": "made-up-skill", "args": []}}
  ]
}

Allowed node types: memory.recall, dogpile.search, ask.oracle, skill.run. skill.run skills must have a sibling SKILL.md and run.sh.

WRONG — using --orchestrate and --dag-file together without a clear source of truth:

./run.sh ask "Use $memory and $dogpile then $create-report"   --orchestrate   --dag-file /tmp/already-authored.dag.json

Pick one authoring path:

Natural language only: --orchestrate ( /ask drafts the DAG ).
Explicit graph: --dag-file or --dag-json (do not also --orchestrate unless you intend to replace the hand-authored graph).

When validation fails, /ask returns AskDagError at load time or dag_layer_failed at runtime; inspect dag/manifest.json, per-node artifacts, and <ask_id>.events.jsonl instead of guessing from the final answer text.

React Flow or migration tooling may still hand /ask compatibility graph nodes with exec_graph_version: "scillm.exec.graph.v1" plus execution and prompt_payload. /ask normalizes those at the boundary into ask.dag.v1 before execution, so callers still do not choose model pools, queues, provider capacity, fallback, or transport.

When a run cannot safely continue, status.json uses state: needs_attention and includes a structured needs_attention object with reason, question, safe_default, and resume_hint. Deep review pauses this way when the target is missing instead of guessing repo scope.

Inspect a run:

./run.sh status --run <ask_id> --tail-events 25
./run.sh status --run .ask_artifacts/runs/<ask_id> --json
./run.sh status --run <ask_id> --watch --watch-timeout-seconds 300
./run.sh status --run <ask_id> --serve --open
./run.sh status --runs --limit 10
./run.sh status --prune --older-than-days 14 --dry-run

status --run <ask_id> --serve writes a read-only HTML monitor into the run directory and serves it from 127.0.0.1 with a random query token. The viewer polls status.json, events.jsonl, and request.json so long-running argue, parallel-review, deep-review, and SPARTA evidence-case routes are not black boxes. The server auto-shuts down after terminal state plus TTL.

Runtime safety:

Generated run IDs include timestamp, question digest, and random suffix to avoid same-second collisions.
Explicit --ask-id reuse is rejected by default so event logs cannot mix separate runs.
--overwrite is explicit and replaces a prior run directory; --resume is explicit and only allowed for non-terminal runs.
Plain /ask degrades to a no-op runtime state if artifact writes fail; deep review and parallel review fail closed.
status --prune removes only validated direct-child ask.runtime.v1 run directories whose ask_id and artifacts.run_dir match the directory.
status --watch has a bounded timeout and exits nonzero if the run never reaches a terminal state.
status --serve is read-only, localhost-bound, token-gated, and intended for human inspection of run artifacts; it must not mutate answers or retry nodes.
Runtime artifacts are validated by doctor against the deterministic request/status/event schema.
status --runs reads the append-only index.jsonl first, then falls back to directory scanning.

Dry-run preview:

./run.sh ask "safe to proceed?" --deep-review --dry-run --json
./run.sh learn "Lisa Feldman Barrett" --dry-run
./run.sh nightly --dry-run --json
./run.sh os learn --dry-run --json

Dry-run mode emits ask.dry_run.v1 execution specs with planned steps, external calls, filesystem writes, memory writes, and risk notes. It exits before runtime artifacts, memory writes, oracle calls, dogpile calls, or ingestion subprocesses.

Saved review specs:

./run.sh ask "review this runtime layer" \
  --chain deep-review-safety \
  --reviewer-spec security \
  --reviewer-spec qa \
  --deep-review-target src/ask/run_state.py

Built-in specs live under docs/chains/ and docs/reviewers/. Historical and active implementation/orchestration plans live under docs/plans/. Chain specs set deterministic workflow options; reviewer specs contribute protocol role/focus labels without making the agent infer the review contract.

Preflight the runtime:

./run.sh config doctor --profile release --json
./run.sh config init
./run.sh doctor
./run.sh doctor --json
./run.sh doctor --live --json

config doctor is non-interactive and safe for CI/release sanity. Missing config, credentials, Docker storage, or companion service paths return needs_attention with safe_default=do_not_claim_release_ready. config init is the interactive repair path and may call /interview to collect missing local values.

Oracle Synthesis:

--oracle keeps /ask retrieval, bridge traversal, and optional auto-learn local, then makes focused oracle calls for final synthesis. The default model is gpt-5.5 with high reasoning. Backend auto uses direct scillm for simple one-shot calls and subagent-runner for persona/iteration deliberation:

./run.sh ask "What should we do next?" --oracle
./run.sh ask "What should we do next?" --oracle --oracle-backend subagent-runner --oracle-model gpt-5.5 --oracle-reasoning high

Direct scillm oracle calls can include screenshots or other images with --oracle-image. /ask converts each local file to an OpenAI-compatible image_url data URI, sends it through the same SSE /scillm /v1/chat/completions route, and records oracle_image_paths in request artifacts:

./run.sh ask oc-kimi "Brandon, review this Posture page screenshot against the design brief" \
  --oracle-image /tmp/posture-page.png \
  --ask-id posture-page-brandon-review \
  --run-output-root /tmp/ask-runs \
  --overwrite

For deliberation, set --oracle-iterations and optional persona roles. /ask will run sequential subagent-style oracle calls, feeding each turn into the next:

./run.sh ask "What should we do next?" --oracle \
  --oracle-backend subagent-runner \
  --oracle-persona "systems architect" \
  --oracle-peer "skeptical reviewer" \
  --oracle-iterations 3

Natural peer syntax is also supported:

./run.sh ask Brandon ask Margaret where are we weak?

This maps to --oracle-persona Brandon --oracle-peer Margaret --oracle-iterations 2 with subagent-runner. Do not have subagents recursively call /ask for peer questions unless the human explicitly asks for recursive calls; /ask should orchestrate persona-to-persona turns. For same-model persona dialogue, /ask uses one subagent session and has it switch personas dynamically so it keeps the full subagent conversation context. Separate sessions are only for isolation, parallelism, or different peer model backends such as DeepSeek via /scillm.

Visible persistent subagents:

When the human asks to bring a named collaborator such as Nico into the current conversation, /ask must create or reuse a persistent, human-visible session. This is different from a bounded oracle answer.

Contract:

/ask owns the runtime. Do not manually call /subagent-runner, send tmux keys, or scrape terminal text as a hidden fallback.
The public abstraction is ask_agent(agent_id, message): resolve the named collaborator's durable conversation state, run the next turn, persist transcript/events, and return the collaborator's final_text.
Conversation identity is provider-neutral state. Store agent_id, backend_type, conversation_id, role/cwd, and timestamps in a dedicated memory-backed registry such as codex_subagent_state; do not mix runtime ids into lesson/persona lore.
Codex App Server is one backend adapter. Its conversation_id is the Codex thread.id, and each message uses turn/start against that thread. Normal follow-up turns must not create a new thread unless the user explicitly resets/forks the collaborator.
thread/resume is for reconnecting/reloading a stored thread before the next turn/start; turn/steer is only for appending input to an active in-flight turn.
Proof is the visible conversation text plus artifacts: request/status/events, backend event log, result JSON, thread_id, turn_id, observed App Server methods, and returned final_text.

Example:

./run.sh ask "Bring Nico into this conversation to review the PDF annotator mockups" \
  --ask-id nico-pdf-oxide \
  --run-output-root /tmp/ask-visible-subagents \
  --overwrite

Required proof paths from the resulting /ask artifacts:

<ask-id>.request.json
<ask-id>.status.json
<ask-id>.events.jsonl
scillm/App Server event log path
scillm result JSON path
visible terminal rendering of Nico -> project agent: <final_text>

Optional tmux/TUI views are diagnostic only. They are not the machine interface and are not proof unless backed by the structured event/result artifacts above.

Ask-backed design review loops:

When $ask is combined with $review-design, route the request as a bounded review-design critique loop, not as a generic Q&A answer. Leading model shorthand still applies. For example, $ask oc kimi for a $review-design with maximum 3 rounds resolves the reviewer model to opencode-go/kimi-k2.6 through /scillm.

Required behavior:

Capture or accept the current screenshot bundle for the UI surface.
Send the screenshot(s), design constraints, and review-design verdict schema to the resolved reviewer model.
Require a structured verdict: satisfied, needs_changes, or blocked.
If the verdict is needs_changes, the project agent patches the UI, captures a fresh screenshot, and asks the reviewer again.
Stop at the first satisfied verdict, concrete blocker, or requested maximum round count.

The round cap is literal. "maximum 3 rounds" means no more than three reviewer verdict calls. The reviewer must inspect a fresh rendered screenshot before marking the design satisfied.

Peer turns can use any one-shot model supported by /scillm. This lets a Codex subagent converse with DeepSeek V4, MiniMax, Gemini, or other scillm routes:

./run.sh ask "What should we do next?" --oracle \
  --oracle-backend subagent-runner \
  --oracle-model gpt-5.5 \
  --oracle-persona "systems architect" \
  --oracle-peer "DeepSeek V4 critic" \
  --oracle-peer-model opencode-go/deepseek-v4-pro \
  --oracle-iterations 3

If --consult-personas is also set, suggested personas are included as advisory subagent context, and the top suggestion is used as the deliberation peer when --oracle-peer is omitted.

When --oracle-persona, --oracle-peer, or --consult-personas names a stored persona, /ask recalls the actual persona profile from /memory and includes it in the oracle context. For example, --oracle-persona Brandon resolves to the stored Brandon Bailey profile when present. Runner-backed Codex subagents also receive explicit instructions and environment access to call the core oracle tool belt themselves during the session:

$ASK_ORACLE_MEMORY_RUN recall --q "Persona: Brandon Bailey"
$ASK_ORACLE_SCILLM_RUN warm-check --json
$ASK_ORACLE_DOGPILE_RUN --help

Core tool rules for oracle subagents:

/memory is mandatory for stored facts, actual personas, persona lessons, persona lore, prior lessons, and database state.
If no persona is specified, the oracle should use /memory recall as the primary persona selector because it already combines semantic, BM25, and graph traversal over persona lessons/lore; /monitor-personas is only an optional readiness/ops check.
Persona subagents must assume their persona may have stored lessons/lore and query it before answering from that persona's perspective.
Persona subagents may store a concise /memory learn record when the conversation produces a durable, reusable lesson; do not store transient chatter or secrets.
/scillm is available for one-shot peer model checks, not batch loops.
/dogpile is available for fresh external discovery only; do not use it for private/internal facts.
Subagents must state whether an answer used memory context, scillm peer checks, dogpile discovery, or inference.

Use oracle mode for single high-value questions, not nightly runs or batch ingestion loops.

Roundtable, Argue, Parallel Review, and CAE Gap Review Modes:

/ask supports four distinct review protocols:

--parallel-review: independent reviewers inspect the same artifact/question concurrently, then a neutral moderator synthesizes findings.
--argue: a FOR advocate and AGAINST advocate run in parallel through /scillm; a sequential judge decides or abstains, then a deterministic verifier gates the verdict.
--roundtable: selected personas speak sequentially through a state-machine protocol; each turn must reference prior claims and critiques.
--cae-gap-review: /create-evidence-case builds or loads the QRA/evidence snapshot first; Brandon, Margaret, and Jennifer review policy evidence, technical enforcement, and control mapping; a judge reroutes one missing evidence item per round before halting.

Use both together when you want independent findings first, followed by persona debate over those findings.

Citation rule: /ask uses ask.citations.v1 across answer surfaces. Memory citations support knowledge/persona/project-context answers, but never code or review safety claims. Safe review claims must cite target/file/diff/artifact sources, and verifier gates reject missing or inadmissible citations.

The narrow contract for reviewer fanout is documented in docs/ASK_PARALLEL_REVIEW_CONTRACT.md. The key boundary is that /ask owns target resolution, read-only reviewer roles, synthesis, verifier gates, and artifacts; /code-runner owns implementation, and Pi/subagent adapters only provide bounded execution mechanics.

Personas and protocol roles are separate:

persona = domain/voice/source-of-judgment
protocol_role = job in the review loop

Explicit role assignment:

./run.sh ask "Formal Methods in large scale aerospace projects in 2026" \
  --roundtable \
  --roundtable-personas "Brandon:failure_mode,Margaret:evidence_auditor,Jennifer:complexity_minimizer" \
  --roundtable-rounds 2 \
  --dogpile auto

Natural roundtable syntax:

./run.sh ask Brandon, Margaret, and Jennifer to debate the relevance of Formal Methods in large scale aerospace projects in 2026

This maps to:

./run.sh ask "the relevance of Formal Methods in large scale aerospace projects in 2026" \
  --roundtable \
  --roundtable-personas "Brandon,Margaret,Jennifer" \
  --roundtable-role-preset adversarial-review \
  --roundtable-rounds 2 \
  --oracle-backend subagent-runner \
  --dogpile auto

Parallel review:

./run.sh ask "Review this implementation" \
  --parallel-review \
  --parallel-reviewers 3 \
  --parallel-review-focus correctness,tests,maintainability

Argue:

./run.sh ask "Should we ship this reversible runtime change?" \
  --argue

Forced binary decision:

./run.sh ask "Should we ship this reversible runtime change?" \
  --argue \
  --decision-required \
  --tie-breaker more-reversible

Review then roundtable:

./run.sh ask "Review this architecture" \
  --parallel-review \
  --roundtable \
  --roundtable-personas "Brandon:failure_mode,Margaret:evidence_auditor,Jennifer:complexity_minimizer"

CAE/QRA gap review:

./run.sh ask "cae gap review AC-2 MFA evidence for the production tenant" \
  --cae-reviewers "Brandon:cae_policy_evidence,Margaret:cae_technical_enforcement,Jennifer:cae_control_mapping" \
  --cae-judge "CAE Gap Judge" \
  --cae-max-rounds 3

CAE gap review is a post-evidence-case review layer, not the QRA generator and not a compliance oracle. The QRA claim, answer, controls, and evidence_case snapshot stay fixed. The only adaptive behavior is targeted recurrence: when the judge returns NEEDS_CLARIFICATION, /ask reroutes exactly one missing evidence item to the matching CAE reviewer role, then asks the judge again. It halts on a terminal judge decision, repeated missing evidence, invalid judge JSON, model failure, or the max round limit.

Use it in the QRA lifecycle as:

generated QRA
  → candidate QRA
  → CAE gap review
  → human review
  → approve / edit / reject / defer
  → promote to sparta_qra or keep as gap

Protocol rules:

Critiques must anchor to specific claims.
Each participant must add a non-trivial disagreement or justify why none exists.
Critique and synthesis are separate; the moderator performs final synthesis.
Default persistence stores compact state, durable lessons, unresolved issues, and critique summaries.
SPARTA and space-cybersecurity questions use the deterministic SPARTA preflight contract in docs/ASK_SPARTA_PREFLIGHT_CONTRACT.md: preserve the question text, run /extract-entities and /memory recall first, route grounded SPARTA-corpora matches to /create-evidence-case, fail closed with needs_attention and safe_default=do_not_answer_as_grounded when required evidence-case creation is unavailable or fails, and continue normal /ask routing only when no grounded match is found.
SPARTA-corpora match signals are only extractor-grounded resolved control IDs, control metadata for SPARTA/CWE/NIST/CAPEC/ATT&CK, related/crosswalk pairs, taxonomy tags, or SPARTA recall items. Unresolved or fabricated SPARTA-looking references require needs_attention; never fabricate a control, crosswalk, relationship, or compliance status. All CAE/evidence-case outputs default to NEEDS_VERIFICATION and require human review before any status change. structured citations, and what would change the verdict. See docs/ASK_ARGUE_CONTRACT.md.

Deep Review Mode:

Use deep review when the human wants a comprehensive, Web-GPT-style review without copy-paste into the browser. Deep review is not /code-runner; it must produce analysis and artifacts, not patches.

Deep-review targets must be directly inspectable. Pass concrete readable files, diffs, manifests, plans, or rendered review artifacts. Do not pass compressed archives or opaque binary files as the target. If a phase or review package exists only as a ZIP or other archive, extract the archive or create a readable target manifest, then pass the individual files that the reviewer must inspect:

# WRONG: reviewer receives compressed bytes, not readable evidence.
./run.sh ask "deep review this phase" \
  --deep-review \
  --deep-review-target /tmp/phase-review.zip \
  --oracle-backend scillm \
  --oracle-model gpt-5.5 \
  --oracle-reasoning high

# RIGHT: reviewer receives the actual readable phase artifacts.
./run.sh ask "deep review this phase" \
  --deep-review \
  --deep-review-target "/tmp/phase-review/PHASE_STATUS.json /tmp/phase-review/PHASE_REVIEW_REQUEST.md /tmp/phase-review/progress-context.md /tmp/phase-review/plan.dag.json" \
  --oracle-backend scillm \
  --oracle-model gpt-5.5 \
  --oracle-reasoning high

./run.sh ask "deep review this implementation" \
  --deep-review \
  --deep-review-target src/ask/ask.py \
  --deep-reviewers 5 \
  --deep-review-focus boundaries,fail-closed,tests,auditability \
  --oracle-backend subagent-runner \
  --oracle-model gpt-5.5 \
  --oracle-reasoning xhigh

Deep review writes:

.ask_artifacts/deep-review/<timestamp>/review.md
.ask_artifacts/deep-review/<timestamp>/review.json

When deep review runs under an --ask-id, those review.md and review.json paths are also registered in <ask_id>.status.json under artifacts.

Deep review verifier rules:

Reject missing or not_assessed required sections.
Reject SAFE or SAFE_WITH_CONDITIONS without inspected evidence.
Reject major findings that lack evidence, impact, fix, or verification.
Reject unexpected non-artifact file changes from before/after git status.
Treat JSON as an audit gate, not proof of reasoning quality.

Every /ask execution is logged to /memory collection ask_call_log with question, scope, persona, oracle backend, model, iteration count, status, and duration. This telemetry is the basis for future data-driven timeout selection.

Runner-backed oracle calls also have heartbeat/recovery telemetry:

/subagent-runner emits transcript delta and heartbeat events while the PTY session is alive.
/ask follows runner events.jsonl as the primary liveness channel and falls back to status polling only when necessary.
If transcript output is silent for --oracle-idle-timeout, the runner marks the session stalled and terminates the process group.
/ask writes sparse heartbeat snapshots to /memory collection ask_subagent_heartbeat.
Heartbeat records include session id, artifact dir, persona, model, turn number, status, transcript byte count, last_output_age_ms, timeout, and idle timeout.
On stalled, timed_out, failed, or cancelled, /ask returns the terminal status plus the transcript tail so the project agent can recover or retry with a different timeout/model.
E2E validators must fail empty answers, No answer could be synthesized, refusal-style non-answers, missing domain grounding, wrong persona routing, or missing roundtable participants.

Oracle Backend Decision Table:

User intent	Use	Why
One direct high-reasoning answer	`--oracle --oracle-backend scillm`	Fast path through `/scillm`
Browser/session-backed ChatGPT answer or review	`$ask webgpt ...` or `--oracle --oracle-backend webgpt`	Preserves `/ask` artifacts while using `/surf` sentinel proof against the authenticated ChatGPT tab
Focused Codex agent answer	`--oracle --oracle-backend subagent-runner`	Runs a real Codex CLI subagent session
Persona or peer deliberation	`--oracle --oracle-backend auto --oracle-iterations 2+`	`auto` selects `subagent-runner`
N-persona sequential debate	`--roundtable --roundtable-personas ...`	State-machine review protocol with claim anchoring
Two-sided calibrated decision	`--argue`	Two parallel `/scillm` advocates, sequential judge, verifier gate
N independent adversarial reviewers	`--parallel-review --parallel-reviewers N`	Parallel breadth before moderator synthesis
Independent findings then debate	`--parallel-review --roundtable`	Best for high-stakes review
Web-GPT-style deep review	`--deep-review --deep-review-target <target>`	Pass-based review with `review.md` and `review.json`
GPT-5.5 vs DeepSeek/Gemini/MiniMax debate	`--oracle-peer-model <scillm-model>`	Codex turn uses runner; peer turn uses `/scillm`
Batch/nightly ingestion	Do not use oracle	OAuth models and subagents are not batch lanes

How to Prompt an Agent to Use /ask Oracle:

Natural user prompts should map to these commands:

Use $ask oracle on "should subagent-runner replace direct scillm for focused agent calls?"

./run.sh ask "should subagent-runner replace direct scillm for focused agent calls?" \
  --oracle \
  --oracle-backend subagent-runner \
  --oracle-model gpt-5.5 \
  --oracle-reasoning high

Use $ask oracle with 3 rounds: architect vs skeptical reviewer.

./run.sh ask "<question>" \
  --oracle \
  --oracle-backend subagent-runner \
  --oracle-persona "systems architect" \
  --oracle-peer "skeptical reviewer" \
  --oracle-iterations 3

Use $ask oracle and have GPT-5.5 converse with DeepSeek V4.

./run.sh ask "<question>" \
  --oracle \
  --oracle-backend subagent-runner \
  --oracle-model gpt-5.5 \
  --oracle-persona "GPT-5.5 architect" \
  --oracle-peer "DeepSeek V4 critic" \
  --oracle-peer-model opencode-go/deepseek-v4-pro \
  --oracle-iterations 3

Correct Usage Examples:

# Ask memory first, then use GPT-5.5 high reasoning as the final oracle.
./run.sh ask "What is the best architecture for X?" --oracle

# Force direct scillm when you want a single fast oracle call.
./run.sh ask "Summarize the tradeoff in one paragraph." \
  --oracle --oracle-backend scillm

# Force a real Codex subagent session.
./run.sh ask "Analyze this design decision." \
  --oracle --oracle-backend subagent-runner

# Let /ask suggest personas and include them in oracle context.
./run.sh ask "How should we improve visual hierarchy?" \
  --consult-personas --oracle --oracle-iterations 2

Incorrect Usage Examples:

# WRONG: Asking deep-review to inspect a compressed package.
./run.sh ask "deep review this review bundle" \
  --deep-review \
  --deep-review-target /tmp/review-bundle.zip

# RIGHT: Extract or render the bundle, then pass concrete readable files.
./run.sh ask "deep review this review bundle" \
  --deep-review \
  --deep-review-target "/tmp/review-bundle/PHASE_STATUS.json /tmp/review-bundle/review-request.md /tmp/review-bundle/manifest.json"

# WRONG: Using oracle mode for many independent items.
for q in "${questions[@]}"; do ./run.sh ask "$q" --oracle; done

# RIGHT: Use normal /ask or batch-capable /scillm lanes for bulk work.
./run.sh ask "one high-value question" --oracle

# WRONG: Assuming --oracle-peer changes the model.
./run.sh ask "question" --oracle --oracle-peer "DeepSeek critic"

# RIGHT: Set the peer model explicitly for scillm one-shot model turns.
./run.sh ask "question" --oracle \
  --oracle-peer "DeepSeek critic" \
  --oracle-peer-model opencode-go/deepseek-v4-pro \
  --oracle-iterations 2

# WRONG: Asking for persona deliberation with --raw.
./run.sh ask "question" --raw --oracle --oracle-iterations 3

# RIGHT: Oracle synthesis requires synthesized context, so omit --raw.
./run.sh ask "question" --oracle --oracle-iterations 3

# WRONG: Treating /ask as a replacement for fresh web search.
./run.sh ask "latest price/news today" --oracle

# RIGHT: Use /dogpile or a search-capable skill first, then ask the oracle over gathered context.
./run.sh ask "question over learned or retrieved context" --oracle

Persona Routing:

When --consult-personas is enabled, /ask uses Federated Taxonomy bridges to find personas best suited to answer the question:

./run.sh ask "How should we improve visual hierarchy in the UI?" --consult-personas

# Output:
#   Suggested personas to consult:
#     - Paula Scher (Graphic Designer) [Precision]
#     - Don Norman (Cognitive Scientist) [Precision, Loyalty]

This enables cross-persona knowledge queries where Embry can automatically identify that Paula Scher should be consulted for typography questions.

Auto-Learn Flow:

Question → Memory Recall → No results?
  YES → Learn Pipeline (dogpile → YouTube → web → QRA → store)
      → Re-query Memory → Return answer with multi-hop traversal
  NO  → Return answer directly with bridge connections

`learn` — Discover and Ingest Knowledge

./run.sh learn <topic> [options]

Options:
  --scope <scope>         Memory scope (default: "ask")
  --collection <coll>     Taxonomy collection (lore, operational, sparta, behavioral)
  --depth <level>         Learning depth: quick (5-10min), standard (30-60min), deep (hours)
  -i, --interactive       Use /interview to ask about learning preferences
  --youtube <url>         Specific YouTube URL to ingest (repeatable)
  --books-only            Only discover and process books
  --youtube-only          Only process YouTube content
  --max-books <n>         Max books to discover (default: 5)
  --max-videos <n>        Max YouTube videos to process (default: 3)
  --dry-run               Preview what would be ingested without storing
  --ask-id <id>           Stable runtime artifact id for this learn call
  --run-output-root <dir> Runtime artifact root (default: .ask_artifacts/runs or ASK_RUN_OUTPUT_DIR)
  --overwrite             Replace an existing run directory for --ask-id
  --resume                Resume a non-terminal existing run directory for --ask-id
  --debug                 Enable debug logging

Learning Depths:

Depth	Time	Videos	Books	ArXiv	Use Case
`quick`	5-10 min	3	0	0	Quick overview, verify facts
`standard`	30-60 min	5	3	3	Moderate understanding
`deep`	2-6 hours	10+	5	10	Comprehensive persona building

Persona Detection: When the topic looks like a person's name (e.g., "Lisa Feldman Barrett", "Dr. Robert Sapolsky"), the skill automatically:

Stores a persona profile to memory
Searches for additional lectures by the person
Downloads books by/about the person
Creates a queryable knowledge base

`status` — Learning Progress

./run.sh status [options]

Options:
  --scope <scope>         Filter by scope
  --run <id|path>         Show runtime status for an ask id, run directory, or status file
  --tail-events <n>       Include the last N runtime events with --run
  --watch                 Watch runtime status until terminal
  --watch-timeout-seconds <n> Maximum seconds to wait with --watch
  --poll-interval-seconds <n> Polling interval for --watch
  --serve                 Serve a local read-only HTML viewer for --run
  --open                  Open the local HTML viewer in a browser
  --serve-port <n>        Port for --serve; 0 selects a free port
  --serve-ttl-seconds <n> Seconds to keep viewer alive after terminal state
  --runs                  List recent runtime runs
  --limit <n>             Maximum runs to list with --runs
  --prune                 Prune old runtime run directories
  --older-than-days <n>   Age threshold for --prune (default: 14)
  --dry-run               Preview --prune without deleting
  --run-output-root <dir> Runtime artifact root for --run ids
  --json                  JSON output
  --debug                 Enable debug logging

Shows:

Total knowledge items in scope
Persona profiles
Q-R-A pairs count
Last task-monitor state (steps, timing, stats, ETA)

`nightly` — Scheduled Persona Updates

./run.sh nightly [options]

Options:
  --scope <scope>         Memory scope to update (default: ask)
  --persona <name>        Update a single persona by name
  --dry-run               Preview without storing
  --ask-id <id>           Stable runtime artifact id for this nightly call
  --run-output-root <dir> Runtime artifact root (default: .ask_artifacts/runs or ASK_RUN_OUTPUT_DIR)
  --overwrite             Replace an existing run directory for --ask-id
  --resume                Resume a non-terminal existing run directory for --ask-id
  --json                  Output summary as JSON
  --debug                 Enable debug logging

Nightly Update Flow:

Query memory for all stored persona profiles
For each persona, search for new content since last update
Ingest new YouTube videos, papers, news articles
Update persona profile with new sources
Report summary via task-monitor

`os` — Query Embry-OS Internals

# Index OS knowledge (skills, packages, config)
./run.sh os learn --depth quick --dry-run
./run.sh os learn --depth standard
./run.sh os learn --depth quick --dry-run --ask-id os-preview

# Query OS knowledge
./run.sh os ask "what does the /dogpile skill do?"
./run.sh os ask "which skills provide memory?" --json
./run.sh os ask "which skills provide memory?" --ask-id os-memory-query

# Query runtime health
./run.sh os health "is memory healthy?"
./run.sh os health "check workstation" --subsystem workstation
./run.sh os health "is memory healthy?" --ask-id os-memory-health

# Classify query intent
./run.sh intent "how does the memory skill work?"

OS Learn crawls .pi/skills/*/SKILL.md, packages/*/package.json, .pi/config.toml, and .pi/extensions/*.ts. Generates QRA triples tagged with scope=os, bridge attributes, and source metadata (skill, package, config, extension).

OS Health dispatches to the relevant monitor/ops skill (e.g., monitor-memory health --json) and combines runtime data with static knowledge from memory.

Intent Classifier routes queries through a 3-stage pipeline:

Memory cache (~1ms) — cached classifications
Rule-based heuristics (<5ms) — regex pattern matching
LLM fallback (1-5s) — scillm classification when uncertain

Architecture

Agent: "Lisa Feldman Barrett how might we improve our memory system?"
                    │
                    ▼
            ┌──────────────┐
            │ Persona Detect│  ← Is this a person?
            └──────┬───────┘
                   │
            ┌──────┴───────┐
            │ Memory Recall │  ← Check what we already know
            └──────┬───────┘
                   │
              Items found?
              ┌────┴────┐
             YES        NO + --auto-learn
              │          │
              │    ┌─────┴──────────────────────────────────┐
              │    │ Learn Loop (multi-source, multi-hour)  │
              │    │                                         │
              │    │ 1. /dogpile (Brave + Perplexity + etc)  │
              │    │ 2. YouTube ingest (lectures, interviews)│
              │    │ 3. Web fetch (blogs, articles)          │
              │    │ 4. discover-books (OpenLibrary)         │
              │    │ 5. extractor --format qra               │
              │    │ 6. memory learn + persona profile       │
              │    │                                         │
              │    │ (tracked via task-monitor with ETA)     │
              │    └─────┬──────────────────────────────────┘
              │          │
              │    ┌─────┴──────┐
              │    │ Re-query   │
              │    └─────┬──────┘
              │          │
              └────┬─────┘
                   │
            ┌──────┴───────┐
            │  Synthesize  │  ← Combine results
            └──────┬───────┘
                   │
            ┌──────┴────────────────┐
            │ Federated Taxonomy    │  ← Multi-hop bridge traversal
            │ (Corruption, Precision,│
            │  Resilience, etc.)     │
            └───────────────────────┘

Task-Monitor Integration

Every learn session registers with /task-monitor:

Registry: ~/.pi/task-monitor/registry.json
State file: .pi/skills/ask/ask_task_state.json (atomic writes)
Steps tracked: memory_check → dogpile → ingest_youtube → fetch_web → extractor_qra → store
Sub-steps: Individual items within each step (e.g., each video URL)
ETA: Estimated time remaining based on depth and progress
Stats: books_discovered, youtube_ingested, web_fetched, qra_extracted, items_stored

# View real-time progress
uv run --project .pi/skills/ask python -m json.tool < .pi/skills/ask/ask_task_state.json

# Example state output:
{
  "completed": 3,
  "total": 6,
  "progress_pct": 55.0,
  "current_item": "ingest_youtube",
  "current_detail": "https://youtube.com/watch?v=...",
  "substep_current": 2,
  "substep_total": 5,
  "eta_seconds": 180.0,
  "eta_display": "~3 min remaining",
  "depth": "standard"
}

# Via task-monitor TUI
cd ~/.pi/skills/task-monitor
uv run python monitor.py tui --filter ask

Federated Taxonomy Integration

Knowledge is tagged with bridge attributes for multi-hop graph traversal:

Bridge	Meaning	Example Topics
Corruption	Moral decay, entropy	Power dynamics, institutional failure
Precision	Exactness, clarity	Scientific method, measurement
Resilience	Recovery, adaptation	Stress response, neuroplasticity
Fragility	Vulnerability	Trauma, system failure
Stealth	Hidden operations	Unconscious processes

Multi-hop Query Example:

Query: "How does stress affect decision-making?"
          │
          ├── Direct hits: Stress research papers
          │
          └── Bridge traversal:
              ├── [Resilience] → Neuroplasticity studies
              ├── [Fragility] → Trauma responses
              └── [Corruption] → Decision biases under stress

Memory Scopes

Scope	Use
`behavioral`	Psychology, neuroscience, behavioral studies
`ask`	General learning (default)
Custom	Any scope name you provide

Environment

Variable	Purpose
`ASK_DEFAULT_SCOPE`	Override default memory scope
`ASK_NIGHTLY_SCOPE`	Scope for nightly updates
`ASK_MAX_BOOKS`	Override default max books to discover
`ASK_MAX_VIDEOS`	Override default max videos
`ASK_ORACLE_MODEL`	Override default oracle model (default: `gpt-5.5`)
`ASK_ORACLE_REASONING`	Override default oracle reasoning effort (default: `high`)
`ASK_ORACLE_TIMEOUT`	Override default oracle timeout seconds (default: `300`)
`ASK_ORACLE_IDLE_TIMEOUT`	Override subagent silence timeout before `stalled` (default: `300`)
`ASK_ORACLE_HEARTBEAT_INTERVAL`	Override heartbeat write interval seconds (default: `30`)
`ASK_ORACLE_BACKEND`	Override oracle backend (default: `auto`)
`ASK_RUN_OUTPUT_DIR`	Override runtime artifact root for request/status/events files
`ASK_SUBAGENT_RUNNER`	Path to subagent-runner `run.sh`
`ASK_SUBAGENT_OUTPUT_DIR`	Output directory for oracle subagent artifacts
`ASK_MEMORY_RUN`	Path to memory `run.sh` exposed to oracle subagents
`ASK_SCILLM_RUN`	Path to scillm `run.sh` exposed to oracle subagents
`ASK_DOGPILE_RUN`	Path to dogpile `run.sh` exposed to oracle subagents
`ASK_MONITOR_PERSONAS_RUN`	Path to monitor-personas `run.sh` exposed to oracle subagents
`SCILLM_BASE_URL`	scillm base URL for oracle synthesis (default: `http://localhost:4001`)
`SCILLM_API_KEY`	scillm bearer token for oracle synthesis (default: `sk-dev-proxy-123`)
`ASK_DEBUG`	Enable debug logging (set to any value)
`TASK_MONITOR_URL`	Task-monitor API URL for remote push

Related Skills

Skill	Relationship
`/dogpile`	Primary discovery engine (Brave, Perplexity, ArXiv, YouTube, GitHub)
`/memory`	Knowledge storage and retrieval
`/surf`	Browser transport and sentinel proof layer for `$ask webgpt`
`/project-knowledge`	Curated human-readable current-state document for `/ask` development
`/discover-books`	Book discovery via OpenLibrary
`/ingest-youtube`	YouTube transcript extraction
`/fetcher`	Web content fetching
`/extractor`	Document extraction with QRA mode
`/taxonomy`	Federated Taxonomy tagging
`/interview`	Interactive preference gathering
`/task-monitor`	Progress tracking with ETA
`/scheduler`	Nightly update scheduling
`/prompt-lab`	Prompt optimization for scillm calls

Persona Profiles

When learning about a person, a persona profile is stored to memory:

{
  "name": "Lisa Feldman Barrett",
  "scope": "behavioral",
  "sources": {
    "dogpile_sections": 5,
    "books": 3,
    "youtube": 8,
    "web": 4
  },
  "stats": {
    "qra_extracted": 45,
    "stored": 46
  },
  "last_updated": "2024-01-15T10:30:00"
}

Query a persona:

./run.sh ask "How does Barrett define emotions?" --scope behavioral --bridges

Nightly Scheduling

To run nightly persona updates automatically, use the /scheduler skill or cron:

# Via scheduler skill
./path/to/scheduler/run.sh add ask-nightly \
  --command ".agent/skills/ask/run.sh nightly --scope behavioral" \
  --schedule "0 3 * * *"  # 3 AM daily

# Via cron
# Add to crontab -e:
# 0 3 * * * /path/to/.agent/skills/ask/run.sh nightly --scope behavioral >> /var/log/ask-nightly.log 2>&1

name	ask
description	Zero cognitive-load learning and querying skill. Learn about a topic or persona (e.g., "Lisa Feldman Barrett") by discovering, ingesting, and extracting knowledge — or ask questions against what's been learned. Supports multi-hour deep learning with progress tracking, persona profiles, and nightly incremental updates. Uses Federated Taxonomy for multi-hop graph traversal across knowledge domains. Composes: dogpile, discover-books, ingest-youtube, fetcher, extractor, memory, taxonomy, task-monitor.
allowed-tools	["Bash","Read","Write"]
triggers	["I want to learn about","I want to ask","ask about","learn about","/ask","$ask Brandon","ask Brandon","ask <persona> what","ask <persona> how","ask <persona> critique","ask <persona> review","ask <persona>, <persona>, and <persona> to debate","ask roundtable","ask parallel reviewers","ask adversarial review","ask deep review","ask image generation","ask generate image","ask CAE gap review","ask QRA gap review","ask safe to proceed","ask comprehensive review","ask persona roundtable about","ask N parallel reviewers","ask NIST control","ask SPARTA countermeasure","ask current architecture risk","ask the oracle as","ask oracle with persona","ask webgpt","$ask webgpt","ask chatgpt","$ask chatgpt","webgpt review","webgpt oracle","chatgpt oracle","ask webgpt to review","ask webgpt about","ask cursor-browser","$ask cursor-browser","ask webgemini","$ask webgemini","ask webkimi","$ask webkimi","ask webperplexity","$ask webperplexity","teach me about","what does X say about","what does Sapolsky say","what does Barrett say","learn from","ask about embry","how does X skill work","what does the X skill do","is memory healthy","is X healthy","what skills provide","os health","os learn"]
metadata	{"short-description":"Zero cognitive-load learning and querying for personas, topics, and OS internals","author":"Horus","version":"0.6.1"}
provides	["ask","oracle-query","os-knowledge"]
composes	["phart-dag-chart","memory","dogpile","extract-entities","create-evidence-case","discover-books","ingest-youtube","fetcher","extractor","taxonomy","scillm","surf","subagent-runner","create-context","monitor-memory","monitor-skills","monitor-skill-health","monitor-security","monitor-sparta","monitor-personas","monitor-taxonomy","project-knowledge","ops-workstation","ops-arango","ops-docker","ops-llm","ops-chutes","task-monitor"]

STOP. READ THIS ENTIRE SKILL.MD BEFORE CALLING ANY ENDPOINT.

ask

Zero cognitive-load learning and querying interface. Eight modes:

Learn Mode — Discover, ingest, and extract knowledge about a topic or persona
Ask Mode — Query accumulated knowledge with Federated Taxonomy multi-hop traversal
Auto-Learn Mode — Ask a question; if no knowledge exists, automatically learn then answer
Nightly Mode — Scheduled incremental updates to persona knowledge bases
OS Mode — Learn about and query embry-os internals, skills, packages, and runtime health
Deep Review Mode — High-reasoning, read-only review with review.md and review.json
CAE Gap Review Mode — Evidence-case-backed QRA review with bounded reviewer/judge rerouting
Image Generation Mode — Generate image artifacts through /scillm /v1/images/generations

Literal Runtime Contract

Do not replace $ask with:

spawn_agent
an informal subagent prompt
a plain model call
a hand-written reviewer summary
a web search
a local-only critique that bypasses ask artifacts

For review requests, pass the complete target artifact through the documented ask mode instead of summarizing it. Examples:

Use --deep-review --deep-review-target <path> for Web-GPT-style prompt, schema, code, plan, or artifact reviews.
Use --parallel-review for independent reviewer fanout.
Use --roundtable only when the user asks for persona deliberation.
Use --cae-gap-review only for evidence-case-backed CAE/QRA gap review.

Proof of a real $ask run is the ask artifact set, not an assistant summary. Return the relevant artifact paths, such as:

.request.json
.status.json
.events.jsonl
review.md
review.json
mode-specific generated artifacts

If the runtime is unavailable, report that directly and ask before substituting spawn_agent or another fallback.

Browser Oracle Backends — routing (read first)

Project agents choose a browser oracle by task type, then confirm Chrome vs Cursor.

Team defaults (which backend for which work)

Work type	Use	Site / transport
Code — review bundles, architecture, implementation, tests, tech-lead loops	`$ask webgpt`	`chatgpt.com` in Chrome via surf-cli; `--no-activate`
Prose — papers, voice, clarity, long-form writing	`$ask webkimi`	`kimi.com` in Chrome
Design — mockups, UX, visual hierarchy, design critique	`$ask webgemini`	`gemini.google.com` in Chrome
Research — fresh facts, citations, current events, web synthesis	`$ask webperplexity`	Perplexity one-shot (no standing review tab)
Inside Cursor — ChatGPT in embedded Browser (self-contained; no external Chrome)	`$ask cursor-browser`	Cursor Browser + viewId + cursor-browser-bridge

Rules of thumb:

In Cursor IDE with ChatGPT open in the embedded Browser → prefer cursor-browser, not webgpt.
Need background Chrome while working in another window → webgpt + --webgpt-tab-id or --webgpt-project + --no-activate (via surf).
Do not use webperplexity for multi-round review loops on the same thread.
Do not pass Chrome tab ids to --cursor-browser-view-id (different namespace from viewId).

Shared limitations (all browser backends)

Tabs cannot read local paths you only mention in the prompt. Use a concatenated .md/.txt bundle (inlined under ## Attached files) or a zip (≤5 files, webgpt only).
Proof is the ask artifact set (.ask_artifacts/runs/<ask_id>/), not an assistant paraphrase.
Normal work: $ask … (orchestration). Raw $surf … only when debugging transport.

Backend reference

Shorthand	`--oracle-backend`	Multi-turn tab	Tab binding
`$ask webgpt`	`webgpt`	Yes	`--webgpt-project`, `--webgpt-tab-id`
`$ask webgemini`	`webgemini`	Yes	`--gemini-tab-id`, `--gemini-url`
`$ask webkimi`	`webkimi`	Yes	`--kimi-tab-id`, `--kimi-url`
`$ask webperplexity`	`webperplexity`	No (one-shot)	—
`$ask cursor-browser`	`cursor-browser`	Yes	`--cursor-browser-project`, `--cursor-browser-view-id`

WebGPT Oracle Backend

Boundary contract:

$ask owns WebGPT orchestration: routing, run artifacts, rate limiting, project tab binding, multi-turn review loops, and oracle/review semantics.
$surf owns browser transport and proof: controlled tab selection, webgpt.submit, sentinel injection, completion waiting, clean/raw/meta outputs, screenshots, and no-activate focus preservation.
$scillm owns direct model-provider calls through localhost:4001, including GPT-5.5 high reasoning, Gemini, Claude, Chutes, OpenCode Go, image generation, streaming, batching, JSON repair, fallback, and usage logging.
Do not add WebGPT browser-tab access to $scillm. WebGPT is not a provider API call; it depends on the user's live authenticated browser session and the $surf sentinel proof contract. Callers that want WebGPT must use $ask webgpt or --oracle-backend webgpt.
Do not call $surf directly for ask/oracle/review work unless debugging the transport layer. Normal callers should use $ask so request/status/events artifacts, WebGPT rate limits, tab bindings, file auto-attachment, and reviewer loop semantics are preserved.

# Auto-resolve tab id (when exactly one chatgpt.com tab is open)
./run.sh ask "to perform the review on /tmp/code-runner-reliability-review/review-bundle.md" \
  --oracle --oracle-backend webgpt

# Equivalent shorthand
./run.sh ask webgpt to perform the review on /tmp/code-runner-reliability-review/review-bundle.md

# Explicit tab id (from the Tab ID Viewer Chrome extension)
./run.sh ask "summarise the review bundle" \
  --oracle --oracle-backend webgpt --webgpt-tab-id 837343564

# Resolve by ChatGPT conversation URL
./run.sh ask "summarise the review bundle" \
  --oracle --oracle-backend webgpt \
  --webgpt-url "https://chatgpt.com/c/6a0097ff-e7e0-83ea-93c2-3a6b88e2a67f"

# Autonomous mode: let surf pick or create a background ChatGPT tab.
./run.sh ask webgpt summarise the review bundle --webgpt-create-tab

# Per-project tab binding — first call auto-creates and persists; subsequent
# calls reuse the same tab so the conversation context survives across days.
./run.sh ask webgpt summarise the review bundle --webgpt-project code-runner-review

Per-project tab bindings

--webgpt-project NAME binds the call to a per-project ChatGPT tab whose id is persisted at ~/.pi/webgpt-projects/<name>.json. Two workflows:

Autonomous (default — ephemeral tasks):

# First call: auto-create a background tab and remember it under "pr-1234".
./run.sh ask webgpt review this PR diff --webgpt-project pr-1234

# Subsequent calls reuse the same tab; ChatGPT preserves conversation state.
./run.sh ask webgpt and now check the migration path --webgpt-project pr-1234

If the bound tab gets closed, the runtime silently creates a new one and re-binds.

Manual (long-lived projects you want to babysit):

# Open exactly the ChatGPT tab you want, copy its id from the Tab ID Viewer
# Chrome extension, then bind once:
./run.sh webgpt-project bind code-runner-review --tab-id 837343543 \
  --url "https://chatgpt.com/c/<conv-id>"

# All future calls for that project use this exact tab — never replaced.
./run.sh ask webgpt continue the review --webgpt-project code-runner-review

Management commands (./run.sh webgpt-project ...):

bind NAME --tab-id ID [--url URL] [--manual|--auto] — create or update a binding
list [--json] [--verify] — show all projects; --verify checks tab still open in Chrome
show NAME [--json] — full state dump for one project
verify NAME [--json] — re-check whether the bound tab is still open
unbind NAME — remove a binding (does not close the Chrome tab)
gc [--days 30] — remove stale auto-bindings (manual bindings are never collected)

Behavior:

Tab resolution. Priority: --webgpt-tab-id → --webgpt-url → --webgpt-project NAME (with a valid binding) → --webgpt-create-tab → auto-resolve from surf tab.list filtered to chatgpt.com. Auto-resolve fails closed when 0 or >1 candidates exist — the call refuses to run rather than guess. When the project agent hits this, it must ask the human to either: (a) open exactly one ChatGPT tab so auto-resolve picks it, (b) provide a tab id from the Tab ID Viewer extension to pass through --webgpt-tab-id, (c) re-invoke with --webgpt-create-tab for the agent to acquire a tab autonomously (surf picks the most-recent existing chatgpt.com tab without foregrounding, or creates a fresh background one if none are open), or (d) pass --webgpt-project NAME to bind this call to a persistent per-project tab (see "Per-project tab bindings" below). The resolved tab id surfaces in oracle_model_served: webgpt:<id> so the agent can pass it explicitly to follow-up rounds.
File auto-attachment. Absolute paths embedded in the question (e.g. /tmp/foo.md, ~/notes.md) are read from disk and inlined under ## Attached files in the prompt. Truncated at 2 MB per file.
Path-only bundle rejection (fail closed, friendly). WebGPT, WebGemini, WebKimi, and WebPerplexity cannot read the local filesystem. If the project agent passes bare path lists or unreadable bundles, $ask stops before surf webgpt.submit and returns a project-agent message: I'm a web-based agent and I can't read local file paths. Please provide either a zip review bundle of no more than 5 files, or give me a concatenated text file. The run records needs_attention (exit code 2), not an opaque transport failure. Valid inputs: one concatenated .md/.txt path whose content is inlined, or one .zip path (≤5 files, $ask webgpt only) passed via --attach-file.
Focus preservation. The controlled tab is never foregrounded. The caller's active tab and focused window are unchanged across the call; meta.focus_changed must be false.
Default multi-turn. Unless --once is passed, explicit $ask webgpt uses ASK_WEBGPT_DEFAULT_ITERATIONS (default 2) on the same controlled tab via --oracle-iterations. Use --once for a single round; use --oracle-iterations N to override explicitly.
No DAG on plain webgpt. Product/architecture questions routed through $ask webgpt do not auto-draft coding DAGs; use --orchestrate only when you intend skill-graph execution.
Background collaboration loops → see /collab (collab webgpt --profile …), which shells to /ask webgpt and fires notify-send on HUMAN_REQUIRED / BLOCKED.
Multi-turn iteration. Each $ask webgpt call is one round on the same controlled tab. ChatGPT keeps the conversation context per tab, so a second call refines naturally. The canonical pattern is: project agent reads the first round's answer, decides whether to push back, and re-invokes $ask webgpt … to send the follow-up — no special iteration flag needed. Internal --oracle-iterations N is also honoured: each iteration sends one follow-up nudge ("identify the weakest claim and address it").
Proof contract. Inherits the WebGPT sentinel contract from surf: controlled_tab_id == requested_tab_id, sentinel present in raw response, stripped from clean response, no clean-response contamination from page chrome. oracle_webgpt_call_started / _finished / _failed events are recorded to the run state.
Other consumers. /review-prompt, /review-design, /review-code, /review-plan compose /ask and inherit this backend for free — pass --oracle-backend webgpt (or set ASK_ORACLE_BACKEND=webgpt) to the underlying /ask call.
Live sanity. skills/ask/sanity-webgpt.sh exercises the full path end-to-end against a real ChatGPT tab and asserts the proof contract, oracle wiring, and focus invariance. Modes: --tab-id ID, --url URL, --create-tab, or no flag (which auto-picks a single chatgpt.com tab or prints a 4-option help block when 0 or >1 candidates exist). Run after changes to webgpt_runtime.py, the oracle dispatcher, or the model alias router.
Rate-limit guard. Multi-round ping-pong can fan out to many ChatGPT rounds quickly. webgpt_rate_limit.py enforces a per-hour budget per account (default 30 rounds) and refuses to start new rounds once the cap is hit. Tunable via ASK_WEBGPT_MAX_ROUNDS_PER_HOUR; bypass via ASK_WEBGPT_RATE_LIMIT_DISABLE=1 (tests only). State at ~/.pi/webgpt-rate-limit.json.
Verdict JSON parsing. ChatGPT occasionally emits technically-invalid JSON (raw control chars inside string fields, trailing commas, missing quotes). extract_verdict layers strict json.loads → json_repair → plain-text VERDICT: regex, so transient model misbehavior never silently drops the verdict label.

Tech lead vs code runner (read this before WebGPT rounds)

When the human says WebGPT is the tech lead and the project agent is the code runner, roles are fixed:

Role	Who	Does	Does not
Tech lead	`$ask webgpt` on a bound tab (`--webgpt-project`)	Adjudicate policy, page contracts, screenshots; author `$test-interactions` manifest JSON; ask clarifying questions	Patch the repo, run shells, run `$test-interactions`, replace deterministic gates
Code runner	Project agent (you)	Implement allowlisted fixes, `cargo check`, `uv run maturin develop`, re-extract/materialize, run gates, write artifacts	Pretend WebGPT output is proof; skip local verification

How the tech lead is contacted (two modes):

Clarifying question — one focused ambiguity (policy, human vs pipeline, closure count semantics). Minimal context: page id, counts, artifact paths.
Review bundle — one concatenated markdown packet for real adjudication. Build /tmp/pdf-oxide-review-bundle.md with request text, gate output, diff summary, and JSON excerpts inlined — not a directory of separate files and not a zip archive.

Example (single file the runtime can inline):

/tmp/pdf-oxide-review-bundle.md
  # Review request (one paragraph)
  ## Gate output (inlined JSON)
  ## Diagnosis / diff summary (inlined)

Invoke WebGPT with that one file path (auto-attached and validated):

./run.sh ask webgpt "Review /tmp/pdf-oxide-review-bundle.md.
Return PASS, NEEDS_CHANGES, HUMAN_REQUIRED, or BLOCKED with specific guidance only.
Do not implement fixes."   --webgpt-project pdf-oxide   --oracle-iterations 1

For bounded background loops with desktop notify, use /collab webgpt --profile … (shells to this same /ask webgpt path; does not change roles).

Not the same lane as code repair:

WebGPT guidance → $ask webgpt / /collab (surf, background tab).
Discrepancy + patch → project agent + optional scillm exec graph (scillm_call + oc-kimi for multimodal review, pi-opencode-kimi for allowlisted edits) or scripts/pdf_lab/exec_two_call_page_repair.py.
Do not describe WebGPT rounds as "two scillm exec oc-kimi calls"; oc-kimi is HTTP chat, not scillm exec, and is for extraction repair—not the ChatGPT tab.

Proof of a WebGPT round is under .ask_artifacts/runs/<ask_id>/ (and oracle_webgpt_call_finished with no_activate: true), not an assistant paraphrase.

Bounded reviewer/executor workflow

Use this pattern when the human wants WebGPT as a reviewer/oracle while the project agent remains the executor:

human defines intent
  -> optional /interview for acceptance criteria
  -> project agent implements or gathers evidence
  -> /ask webgpt reviews the evidence bundle
  -> project agent applies corrections
  -> repeat until gate passes, max rounds is reached, or a human decision is required

The project agent should not expose every internal reviewer turn to the human. Human-facing status should be limited to:

current state
blocker
proposed decision
evidence link or artifact path
what changed since the previous round
whether a human decision is required

Recommended bounded loop:

./run.sh ask webgpt "Review /tmp/review-bundle.md (concatenated evidence file).
Return PASS, NEEDS_CHANGES, or BLOCKED with specific fixes only." \
  --webgpt-project <project-review> \
  --oracle-iterations 1

Then the project agent applies fixes locally, regenerates the evidence bundle, and calls /ask webgpt again on the same --webgpt-project until one of these terminal conditions occurs:

PASS + local PASS (dual agreement): WebGPT returns PASS (or VERDICT: PASS) on the same This round acceptance bullets and deterministic local gates pass. Only then set Goals met this round: YES in COLLABORATION_STATUS.md. Eligible for $plan-iterate close-phase when that phase's contract matches.
NEEDS_CHANGES: WebGPT names specific fixes; project agent patches, refreshes status, re-runs local proof, re-asks WebGPT.
BLOCKED: WebGPT names a missing dependency or unresolved product decision the agent cannot clear → human escalation (see below).
MAX_ROUNDS: configured round cap reached → human or narrower scope.
NEEDS_HUMAN_DECISION: acceptance/product policy, not implementation.

Do not use this loop for unbounded brainstorming, bulk review queues, or as a substitute for deterministic tests. WebGPT review is a gate over concrete evidence, not proof by itself.

Collaboration status file (required for multi-round WebGPT)

Full contract: docs/ASK_COLLABORATION_STATUS_CONTRACT.md
Copy template: docs/templates/COLLABORATION_STATUS.template.md

WebGPT reviews only This round acceptance. It must not invent criteria or declare overall project/phase complete unless both parties recorded PASS on those bullets.

Recommended invocation:

./run.sh ask webgpt "Read /path/to/COLLABORATION_STATUS.md.
Review ONLY 'This round acceptance'. Return VERDICT: PASS | NEEDS_CHANGES | BLOCKED."   --webgpt-project <project>   --ask-id <round-id>   --run-output-root /tmp/ask-webgpt-<project>   --overwrite

Proof of WebGPT review is the ask artifact set (<ask-id>.status.json, etc.), not an assistant summary. Update Agreement in the status file after each round.

Explorer / UI page review loop (WebGPT + test-interactions)

Role split (non-negotiable):

Output	Tech lead (WebGPT)	Code runner (project agent)
Per-page verdict	pass / degraded / fail + reason	Re-check against live gates after fixes
Overall package verdict	PASS / NEEDS_CHANGES / BLOCKED	Do not claim product-ready without local proof
Interaction manifest JSON	Draft `[data-qid]` workflow + assertions	Run `/test-interactions/run.sh run`
Code / predicate fixes	Describe required change only	Patch repo, re-capture, re-run gates
systemd / human policy	Flag HUMAN_REQUIRED	Execute only after human approves

WebGPT never runs ./run.sh, uv run, patches, or $test-interactions. Those are always project-agent steps after the ask artifact lands.

Review bundle (one concatenated markdown file — auto-inlined):

/tmp/<project>-page-review-rN.md
  # Review request (one paragraph)
  ## page-contract-audit.json (inlined)
  ## coverage-health.json (inlined)
  ## page-purpose excerpt (inlined)
  ## prior verdict (optional, inlined)

Ask WebGPT for manifest output in the same round:

navigate to Final Site + tab (if needed)
wait_ready on page-specific root qid
refresh / row drill / keyboard path
deterministic assertions (assert_visible, assert_text, monitor row counts)

The project agent saves that JSON and runs:

/home/graham/.codex/skills/test-interactions/run.sh run   --manifest /tmp/<project>-coverage-manifest.json   --output-dir /tmp/<project>-coverage-ti-rN/

/home/graham/.codex/skills/test-interactions/run.sh review   --captures /tmp/<project>-coverage-ti-rN/   --persona nico-bailon   # or brandon-bailey for compliance surfaces

PASS/FAIL comes from $test-interactions, not from WebGPT. WebGPT comments on evidence; it does not override deterministic verdicts.

Example invoke (tab-bound, review-only wording):

cd ~/.codex/skills/ask
./run.sh ask webgpt "Review /tmp/sparta-page-review-r4/REVIEW_REQUEST.md and attached screenshots.
For each Explorer page: verdict pass|degraded|fail, blockers, and next evidence.
Return overall PASS, NEEDS_CHANGES, or BLOCKED.
Also emit test-interactions-manifest JSON for Coverage (semantic workflow, all [data-qid] selectors).
Do not run commands or edit the repo."   --webgpt-tab-id 837344161   --webgpt-project sparta-explorer-review   --oracle-iterations 1

Prompt vocabulary (avoid ask DAG misfire):

"review", "adjudicate", "verdict", "guidance", "manifest JSON", "evidence bundle"
explicit "Do not run commands or edit the repo"

If DAG mode is required for coding, use --dag-file explicitly—not accidental orchestrate triggers inside WebGPT review prompts.

Proof chain for the human:

.ask_artifacts/runs/<ask_id>/ — request, status, events, WebGPT clean response
Project-agent captures + results.json from $test-interactions
Updated contract audit / monitor health JSON after local fixes

Do not treat accepted plan-iterate phases or WebGPT PASS as product closure until (2) and (3) agree.

Cursor Browser Oracle Backend (within Cursor IDE)

Use --oracle-backend cursor-browser or $ask cursor-browser … when you want ChatGPT inside Cursor's embedded Browser pane, not external Chrome + surf-cli.

Tab identity: Cursor uses viewId (e.g. f53e74) from browser_tabs / MCP — NOT Chrome numeric tab ids from surf tab.list or the Tab ID Viewer extension.

Context	Identifier	How to get it
Chrome + `$ask webgpt`	Chrome tab id	Tab ID Viewer; `surf tab.list`; `--webgpt-project`
Cursor Browser + `$ask cursor-browser`	viewId	MCP `browser_tabs` list; `surf cursor-browser.tab.list`; `--cursor-browser-project`

Prerequisites

Shell automation requires cursor-browser-bridge (one-time install + Cursor window reload). It exposes Cursor Browser to /ask and /surf via HTTP on /tmp/cursor-browser-bridge-port.

Without the bridge, use @Browser in chat (agent MCP) or install the bridge for ./run.sh.

Quick start

# List Cursor Browser tabs (viewId, title, url)
cd ~/.claude/skills/surf
./run.sh cursor-browser.tab.list

# Bind a project to a viewId (recommended for iterative work)
cd ~/.claude/skills/ask
./run.sh cursor-browser-project bind scilla-harness --view-id f53e74 \
  --url "https://chatgpt.com/c/6a15cd46-9768-83ea-b7e7-88b601dab985"

# Ask via Cursor Browser (same artifact contract as webgpt)
./run.sh ask cursor-browser "what is the capital of Texas" \
  --oracle --oracle-backend cursor-browser --once \
  --cursor-browser-project scilla-harness \
  --ask-id texas-cursor --run-output-root /tmp/ask-cursor --overwrite

Tab resolution (fail-closed)

When 0 or >1 ChatGPT tabs exist, /ask refuses to guess — pass --cursor-browser-view-id or bind a project.

Proof artifacts

When to use which path

Goal	Use
Code review / tech lead in Chrome (background)	`$ask webgpt` + `--webgpt-project` or `--webgpt-tab-id`
Prose / writing in Chrome	`$ask webkimi` + `--kimi-tab-id`
Design review in Chrome	`$ask webgemini` + `--gemini-tab-id`
Research (fresh web, one shot)	`$ask webperplexity`
Cursor IDE — ChatGPT in embedded Browser (self-contained)	`$ask cursor-browser` + `--cursor-browser-project` or `--cursor-browser-view-id`
Ad-hoc one question, agent already in chat	`@Browser` on current page (no `./run.sh` artifacts unless you re-run ask)

Do not pass Chrome tab ids to --cursor-browser-view-id. They are different namespaces.

WebGemini Oracle Backend (design)

--oracle-backend webgemini or $ask webgemini … — design work (mockups, UX, visual critique) via an authenticated Gemini tab in Chrome.

Transport: surf gemini.submit --no-activate (surf-cli extension required; /tmp/surf.sock).
Tab resolution: --gemini-tab-id → --gemini-url → auto-resolve when exactly one gemini.google.com tab is open (fail-closed otherwise).
Same bundle rules as WebGPT: concatenate evidence into one .md/.txt; no zip attach on this backend.
Multi-turn: re-invoke on the same controlled tab; no --gemini-project bindings yet (pass tab id explicitly).
Events: oracle_gemini_call_started / _finished / _failed.

./run.sh ask webgemini "Review /tmp/design-review-bundle.md for visual hierarchy and NVIS compliance."   --oracle --gemini-tab-id <id> --once

WebKimi Oracle Backend (prose)

--oracle-backend webkimi or $ask webkimi … — prose work (papers, voice, clarity, long-form writing) via an authenticated Kimi tab in Chrome.

Transport: surf kimi.submit --no-activate.
Tab resolution: --kimi-tab-id → --kimi-url → auto-resolve when exactly one kimi.com tab is open (fail-closed otherwise).
Same bundle rules as WebGPT (concatenated text; no zip).
Multi-turn on the same tab; no project bindings yet.
Events: oracle_kimi_call_started / _finished / _failed.

./run.sh ask webkimi "Review /tmp/paper-section.md for voice and clarity."   --oracle --kimi-tab-id <id> --once

WebPerplexity Oracle Backend (research)

--oracle-backend webperplexity or $ask webperplexity … — research questions needing fresh web synthesis (not multi-round review on a standing tab).

Transport: surf perplexity (one-shot CDP session per query).
No standing tab — each call is independent; do not use for bounded code/design review loops.
Bundle: concatenated .md/.txt only (paths inlined); no zip.
Events: oracle_perplexity_call_started / _finished / _failed.

./run.sh ask webperplexity "What changed in NIST SP 800-171 Rev 3 draft in the last 12 months?"   --oracle --once

Image Generation Mode

./run.sh ask "a precise architecture diagram of ask calling scillm for image generation" \
  --image-generate \
  --image-model gpt-image-2 \
  --image-size 1024x1024 \
  --image-quality high

Zero Cognitive Load for Project Agents

Project agents should just ask — the skill handles all discovery complexity:

# Project agent just asks a question
./run.sh ask "Lisa Feldman Barrett how might we improve our /memory system?" --auto-learn

# What happens automatically:
# 1. Memory check (existing knowledge?)
# 2. /dogpile deep research (Brave, Perplexity, ArXiv, YouTube, GitHub)
# 3. YouTube transcript ingestion (lectures, interviews)
# 4. Web content fetching (blogs, articles)
# 5. QRA extraction + Federated Taxonomy tagging
# 6. Memory storage with persona profile
# 7. Answer synthesis with multi-hop traversal

Human Chat Usage

Agent translation rules:

Preserve the human's natural topic text; do not over-normalize domain terms.
Treat a named persona before the question as --oracle-persona <name>.
Treat multiple named personas with "debate", "roundtable", or "discuss" as --roundtable.
Treat "parallel reviewers", "adversarial reviewers", or "N reviewers" as --parallel-review.
Treat "argue whether", "debate whether", or "make the case for and against" as --argue.
Treat "review then roundtable" as both --parallel-review and --roundtable.
Treat "CAE gap review", "QRA gap review", or "CAE reviewers" as --cae-gap-review.
Treat "generate an image", "image generation", or "make an image" as --image-generate; keep it standalone from memory/oracle/review modes.
Treat "deep review", "comprehensive review", "safe to proceed", or "production readiness" as --deep-review; require or infer a concrete --deep-review-target.
Treat leading model shorthand such as $ask oc kimi ..., $ask opencode qwen ..., $ask chutes kimi ..., $ask oc-kimi ..., or $ask chutes-kimi ... as --oracle --oracle-backend scillm with the resolved provider model.
Treat leading $ask webgpt ... (or $ask chatgpt ...) as --oracle --oracle-backend webgpt. This drives an already-authenticated ChatGPT tab in the user's Chrome via the surf-cli extension; the controlled tab never foregrounds (--no-activate).
Treat multi-round WebGPT review with a bound --webgpt-project as requiring an absolute-path COLLABORATION_STATUS.md (refresh before each round; see docs/ASK_COLLABORATION_STATUS_CONTRACT.md). Dual agreement: local PASS and WebGPT PASS on the same This round acceptance bullets before claiming goals met.
Treat Ask Nico ..., Bring Nico into this conversation ..., and other visible named-subagent conversation requests as /ask-owned persistent tmux-visible subagent sessions using --oracle-backend subagent-runner. Do not satisfy these with direct /subagent-runner, a one-shot oracle answer, or a hidden fallback. The proof artifact must include the tmux attach command, session directory, transcript, status, events, and send-input command.
Treat date-sensitive words (2026, current, latest, today, recent) as --dogpile auto.
Default high-value analytical questions to --oracle --oracle-model gpt-5.5 --oracle-reasoning high.

Human chat prompt	Route
`$ask what do we know about the release checklist?`	Memory-backed ask synthesis
`$ask What is the state of Python packaging in 2026?`	Oracle with auto persona selection and `--dogpile auto`
`$ask What is the state of space-based cybersecurity in 2026?`	SPARTA-scoped oracle: `--scope sparta --oracle`
`$ask oc kimi explain this design tradeoff`	scillm OpenCode Go oracle using live model discovery and capability metadata, currently `opencode-go/kimi-k2.6`
`$ask oc kimi for a $review-design with maximum 3 rounds`	Ask-backed review-design loop using `opencode-go/kimi-k2.6`; capture fresh screenshots, ask Kimi for a verdict, patch locally, re-render, and stop after PASS/blocker/3 rounds
`$ask oc-qwen compare these options`	Hyphenated OpenCode Go shorthand, currently `opencode-go/qwen3.6-plus`
`$ask chutes kimi explain this design tradeoff`	scillm Chutes oracle using configured alias `chutes-kimi`
`$ask chutes-kimi explain this design tradeoff`	Hyphenated Chutes shorthand using configured alias `chutes-kimi`
`$ask webgpt review round 2 using COLLABORATION_STATUS.md`	WebGPT oracle with status file auto-attached; update Agreement after artifact; dual agreement before closure
`$ask webgpt to perform the review on /tmp/review-bundle.md`	WebGPT oracle backed by the user's signed-in ChatGPT tab (via `surf webgpt.submit --no-activate`). File paths in the prompt are auto-attached. Tab id auto-resolves when exactly one chatgpt.com tab is open; otherwise pass `--webgpt-tab-id`.
`$ask webgpt again — refine your answer`	Multi-turn: each `$ask webgpt` call is one round against the same controlled tab. ChatGPT preserves conversation context, so iterations form a coherent dialogue.
`$ask Bring Nico into this conversation to review the mockups`	Persistent visible Nico session. Human attaches with `tmux a -t nico-<project>`; project agent sends follow-up turns through the `/ask` artifact's `send_input_command`.
`$ask Brandon what is the state of space-based cybersecurity in 2016?`	Brandon persona oracle over `--scope sparta`
`$ask Brandon, Margaret, and Jennifer personas to roundtable about the topic: What is the state of cybersecurity in 2026?`	SPARTA-scoped sequential persona roundtable
`$ask Brandon what is the best way to review this API boundary?`	Brandon persona oracle subagent
`$ask Brandon persona about whether this retry design fails closed`	Brandon persona oracle over memory/project context
`$ask Brandon critique this architecture`	Brandon persona critique
`$ask Brandon ask Margaret where are we weak?`	Safe Brandon→Margaret peer deliberation
`$ask Brandon, Margaret, and Jennifer personas to roundtable about the topic: Should this service use retries or queues?`	Sequential protocolized persona roundtable
`$ask run 3 parallel adversarial reviewers on this implementation`	Independent parallel review plus moderator synthesis
`$ask cae gap review AC-2 MFA evidence for the production tenant`	Evidence-case-backed QRA review: `--cae-gap-review --cae-max-rounds 3`
`$ask generate an image of the ask to scillm image route`	Image artifact generation: `--image-generate`
`$ask argue whether we should ship this change`	Two parallel `/scillm` advocates plus sequential judge and verifier
`$ask deep review this implementation --deep-review-target src/ask/ask.py`	Read-only deep review with markdown and JSON artifacts
`$ask review then roundtable with Brandon, Margaret, Jennifer`	Parallel findings first, then sequential persona debate
`$ask oracle should we use subagent-runner here?`	GPT-5.5 high-reasoning oracle
`$ask learn Lisa Feldman Barrett`	Learn/persona-ingest mode
`$ask is memory healthy?`	OS/runtime health mode

Natural persona syntax is supported by the CLI too:

./run.sh ask Brandon what is the best way to review this API boundary?

This is equivalent to:

./run.sh ask "what is the best way to review this API boundary?" \
  --oracle \
  --oracle-backend subagent-runner \
  --oracle-persona Brandon \
  --oracle-model gpt-5.5 \
  --oracle-reasoning high

Quick Start

cd .pi/skills/ask

# Learn about a persona (deep learning)
./run.sh learn "Lisa Feldman Barrett" --scope behavioral --depth deep

# Ask with auto-learn (discovers + ingests if no knowledge exists)
./run.sh ask "What does Sapolsky say about free will?" --scope behavioral --auto-learn

# Interactive mode (asks about learning preferences)
./run.sh learn "Robert Sapolsky" --scope behavioral --interactive

# Check learning progress (includes task-monitor state with ETA)
./run.sh status --scope behavioral

# Run nightly persona update
./run.sh nightly --scope behavioral

Commands

`ask` — Query Accumulated Knowledge

./run.sh ask <question> [options]
./run.sh ask Brandon what is the state of space-based cybersecurity in 2016? --scope sparta

Options:
  --scope <scope>         Memory scope to query (default: "ask")
  --k <n>                 Number of results (default: 5)
  --bridges               Also traverse bridge attributes (multi-hop)
  --auto-learn            Auto-discover and learn if no knowledge found
  --collection <coll>     Taxonomy collection for auto-learn (default: behavioral)
  --consult-personas      Find and suggest relevant personas to consult
  --persona-scope <scope> Scope to search for personas (default: personas)
  --hybrid                Use hybrid RAG+QRA retrieval
  --oracle                Use scillm/Codex for final oracle synthesis
  --oracle-backend <b>    Oracle backend: auto, scillm, subagent-runner, webgpt
  --oracle-model <model>  Oracle synthesis model (default: gpt-5.5)
  --oracle-reasoning <r>  Oracle reasoning effort (default: high; deep-review default: xhigh)
  --oracle-timeout <sec>  Oracle HTTP timeout in seconds (default: 300)
  --oracle-idle-timeout <sec> Subagent silence timeout before stalled recovery (default: 300)
  --oracle-heartbeat-interval <sec> Memory heartbeat write interval (default: 30)
  --oracle-persona <p>    Primary persona/subagent for oracle synthesis
  --oracle-peer <p>       Second persona/subagent for oracle deliberation
  --oracle-persona-model <m> Model for primary persona turns
  --oracle-peer-model <m> Model for peer persona turns
  --oracle-iterations <n> Sequential oracle deliberation calls (default: 1)
  --oracle-image <path> Attach an image to a direct scillm oracle call (repeatable)
  --roundtable          Run sequential protocolized persona deliberation
  --roundtable-personas <p> Comma-separated persona[:protocol_role] participants
  --roundtable-role-preset <p> Role preset (default: adversarial-review)
  --roundtable-rounds <n> Number of full participant rounds (default: 2)
  --roundtable-mode <m> Mode label (default: adversarial)
  --roundtable-persist <summary|full> Persist compact protocol state or full turns
  --argue                Run two parallel /scillm advocates followed by a judge
  --decision-required    Force FOR/AGAINST with uncertainty disclosure
  --tie-breaker <policy> Tie-breaker for --decision-required
  --parallel-review     Run independent parallel adversarial reviewers
  --parallel-reviewers <n> Number of default reviewers (default: 3)
  --parallel-review-personas <p> Comma-separated reviewer persona[:protocol_role] specs
  --parallel-review-focus <f> Comma-separated focus labels for default reviewers
  --parallel-review-role-preset <p> Role preset for parallel reviewers
  --cae-gap-review      Run evidence-case-backed CAE/QRA gap review
  --cae-reviewers <p>   Comma-separated CAE persona:role pairs
  --cae-judge <p>       CAE judge persona label
  --cae-max-rounds <n>  Maximum CAE clarify/reroute rounds
  --deep-review          Run read-only deep review with review.md and review.json artifacts
  --deep-review-target <target> Explicit target: paths, diff, plan, manifest, or artifact
  --deep-review-profile <p> Deep-review profile label (default: max_available)
  --deep-reviewers <n> Reviewer breadth requested for deep review (default: 5)
  --deep-review-focus <f> Comma-separated deep-review focus labels
  --deep-review-fallback-policy <fail_closed|warn> Downgrade behavior
  --deep-review-persist <summary|full> Persist compact metadata or full review state
  --deep-review-output-root <dir> Artifact root (default: .ask_artifacts/deep-review)
  --chain <name|path>   Saved review chain spec (e.g. deep-review-safety)
  --reviewer-spec <name|path> Reviewer role/focus spec (repeatable)
  --orchestrate          Draft and execute an ask DAG from the natural-language request
  --dag-json <json>     Execute an inline ask/scillm-style DAG before synthesis
  --dag-file <path>     Execute an ask/scillm-style DAG JSON file before synthesis
  --dogpile <auto|off|force> Freshness policy for date-sensitive oracle prompts
  --dry-run             Emit execution spec/risk analysis without mutation
  --ask-id <id>          Stable runtime artifact id for this ask call
  --run-output-root <dir> Runtime artifact root (default: .ask_artifacts/runs or ASK_RUN_OUTPUT_DIR)
  --overwrite            Replace an existing run directory for --ask-id
  --resume               Resume a non-terminal existing run directory for --ask-id
  --raw                   Return raw memory results (no synthesis)
  --image-generate        Generate image artifact(s) through scillm
  --image-model <model>   Image generation model (default: gpt-image-2)
  --image-size <size>     Image size, for example auto or 1024x1024
  --image-quality <q>     Image quality, for example auto, medium, or high
  --image-count <n>       Number of images to generate (default: 1)
  --image-output <path>   Output file or directory for generated image(s)
  --image-output-format <fmt> Image file format: png, jpeg, or webp
  --image-timeout <sec>   Image generation timeout in seconds (default: 300)
  --json                  JSON output
  --debug                 Enable debug logging

Runtime artifacts:

.ask_artifacts/runs/<ask_id>/
  <ask_id>.request.json
  <ask_id>.status.json
  <ask_id>.events.jsonl
.ask_artifacts/runs/index.jsonl

Natural-Language DAG Orchestration

Rules:

/ask owns skill orchestration, dependencies, joins, run-state artifacts, and final synthesis.
$scillm remains a skill/backend boundary. /ask may request a model/profile for an ask.oracle node, but it does not expose model-pool, queue, provider-capacity, fallback, or transport choices.
/memory and /dogpile own their own recall/research internals.
/interview is used only when a natural-language request lacks a target skill, output artifact, or acceptance criteria that cannot be safely inferred.
skill.run validation uses runnable sibling skill contracts (SKILL.md plus run.sh) instead of a narrow product-specific skill list.

DAG JSON Mode

Public graph envelope:

ask.dag.v1, the native ask DAG schema.

Compatibility graph envelope:

scillm.exec.graph.v1, normalized at the /ask boundary for React Flow, migration, and diagnostic tooling. This is not the preferred human/project agent input surface; normal callers use natural language or ask.dag.v1.

Supported node types:

memory.recall — run /memory recall and feed returned items into later nodes and synthesis.
dogpile.search — run /dogpile search for fresh research context.
ask.oracle — run a one-shot /scillm chat completion through /ask with the required ask caller headers and scillm metadata.
skill.run — run a runnable sibling skill with a SKILL.md and run.sh, such as create-report, review-code, memory, or dogpile.
scillm.agent_turn — run a standing Codex app-server worker through /v1/scillm/agents/{worker_id}/* (handoff → lease → turn → result). Use this for coding (worktree, declared_write_set, multi-turn, steer). Do not route implementation through /v1/chat/completions or default skill.run code-runner.

Coding DAG defaults:

Natural-language requests with implement/fix/patch/refactor intent draft an scillm.agent_turn node (id: implement, sandbox: workspace-write).
Worker id: --agent-worker, else ASK_AGENT_WORKER_ID, else implementation.
Registry must expose an implementation worker in config/scillm-agents.yaml.

Parallel review implementation:

--implement-with code-runner — handoff artifact only (read-only /ask).
--implement-with scillm-agent — same review path; no --code-runner-dod-command required (worker worktree enforces bounds).

Each node may set allow_failure (boolean, default false). When a node finishes with ok: false:

allow_failure: false (default): the layer is fail-closed. /ask records dag_layer_failed with failed_node_ids and safe_default: stop_before_dependent_nodes, then raises AskDagError before any dependent layer runs.
allow_failure: true: the failed node is recorded in dag/manifest.json and its node artifact, but the DAG continues to dependent nodes that still declare the failed parent in depends_on. Use only for optional probes, shadow checks, or best-effort enrichment that must not block the main workflow.

dag_node_auto_repair_started / dag_node_auto_repair_finished
Current repair: dogpile.search with returncode == -2 (skill timeout) when input.timeout is below 360 seconds — one retry at 360s with reason dogpile_timeout_budget_too_low.
Auto-repair does not bypass the required-node gate. If the repaired attempt still fails and allow_failure is false, the layer still fails closed.

Repair policy is data-driven:

Global policy: config/ask_dag_repair_policy.yaml
Timeout profiles: config/ask_dag_timeout_profiles.yaml
Skill hints: skills/<skill>/config/ask_dag_repair_hints.yaml (for example skills/dogpile/config/ask_dag_repair_hints.yaml)

Supported repair actions:

bump_timeout — retry once with a higher input.timeout when the skill subprocess was killed for budget (returncode == -2).
consume_partial_artifact — when a sibling skill wrote usable partial output (for example dogpile_partial_results.json with final_report), promote that artifact into the node result instead of failing immediately.
True ambiguity (missing target skill, artifact, or acceptance criteria) still routes to /interview via needs_attention, not silent DAG repair.

The live sanity check for this contract is:

./scripts/dag_e2e_sanity.py --output-root /tmp/ask-dag-e2e-proof --ask-id ask-dag-e2e-proof

./scripts/dag_negative_sanity.py --output-root /tmp/ask-dag-negative-proof --ask-id ask-dag-negative-proof

That check runs a required skill.run node with an invalid flag, records dag_layer_failed, leaves the failed node artifact, and does not create the dependent node artifact.

For a costful live model lane, add --include-oracle; the same DAG inserts two concurrent ask.oracle nodes before the final create-report join.

Example native graph:

{
  "schema_version": "ask.dag.v1",
  "max_concurrency": 2,
  "nodes": [
    {
      "id": "memory_first",
      "type": "memory.recall",
      "input": {"query": "prior lessons for this report", "scope": "project"}
    },
    {
      "id": "brandon",
      "type": "ask.oracle",
      "depends_on": ["memory_first"],
      "input": {
        "prompt": "Brandon: read the report and comment against the criteria.",
        "model": "gpt-5.5",
        "reasoning_effort": "high"
      }
    },
    {
      "id": "margaret",
      "type": "ask.oracle",
      "depends_on": ["memory_first"],
      "input": {
        "prompt": "Margaret: read the report and comment against the criteria.",
        "model": "gpt-5.5",
        "reasoning_effort": "high"
      }
    },
    {
      "id": "final_report",
      "type": "skill.run",
      "depends_on": ["brandon", "margaret"],
      "input": {"skill": "create-report", "args": ["--help"]}
    }
  ]
}

Concurrent lanes and sequential joins

RIGHT — Layer 1: two concurrent memory.recall nodes; Layer 2: one sequential skill.run join (matches ./scripts/dag_e2e_sanity.py):

{
  "schema_version": "ask.dag.v1",
  "graph_id": "memory-fanout-report-join",
  "graph_goal": "Recall two memory scopes in parallel, then create one report.",
  "max_concurrency": 2,
  "nodes": [
    {
      "id": "memory_ask_contract",
      "type": "memory.recall",
      "input": {
        "query": "ask DAG scillm metadata oracle review artifacts",
        "scope": "ask",
        "k": 3
      }
    },
    {
      "id": "memory_report_contract",
      "type": "memory.recall",
      "input": {
        "query": "best practices report source-of-truth inventory findings",
        "scope": "ask",
        "k": 3
      }
    },
    {
      "id": "final_report",
      "type": "skill.run",
      "depends_on": ["memory_ask_contract", "memory_report_contract"],
      "input": {
        "skill": "create-report",
        "args": [
          "--title", "Ask DAG Proof Report",
          "--input", "${dag_context_json}",
          "--output", "${dag_node_output}"
        ]
      }
    }
  ]
}

Layer semantics for this graph:

Layer	Runs	Why
1	`memory_ask_contract` and `memory_report_contract` in parallel	No `depends_on`; same topological layer; `max_concurrency: 2` allows both at once.
2	`final_report` after both parents succeed	`depends_on` lists both memory nodes — sequential join with merged `${dag_context_json}`.

WRONG — expecting a join without depends_on:

{
  "nodes": [
    {"id": "memory_a", "type": "memory.recall", "input": {"query": "a"}},
    {"id": "memory_b", "type": "memory.recall", "input": {"query": "b"}},
    {"id": "final_report", "type": "skill.run", "input": {"skill": "create-report", "args": ["--help"]}}
  ]
}

All three nodes are in one concurrent layer. final_report can start before either memory recall finishes and may see empty upstream context.

PHART renderer (preferred)

Terminal charts use PHART — Python Hierarchical ASCII Representation Tool. Source of truth: https://github.com/scottvr/phart.git (PHART 1.5+; PyPI is behind git).

# One-time: install Python 3.14 for uv (if needed)
uv python install 3.14

cd skills/phart-dag-chart
./run.sh validate path/to/plan.dag.json
./run.sh chart path/to/plan.dag.json

/ask dry-run and ./scripts/render_dag_chart.py delegate to $phart-dag-chart when uv + Python 3.14 are available.

ask_dag.ascii_renderer: phart-git | phart-pypi | fallback
phart-git: $phart-dag-chart subprocess (PHART 1.5 git)
phart-pypi: in-process PyPI phart 1.1.x on Python 3.11–3.12 (bracket style)
fallback: built-in renderer if PHART unavailable

./scripts/render_dag_chart.py /tmp/memory-fanout.dag.json

Legacy skills/ask/phart-renderer/ is deprecated.

Terminal ASCII DAG chart

Before a costful run, print the layer plan to the terminal so humans and project agents share the same mental model.

Preferred — dry-run prints the chart automatically:

./run.sh ask "Plan the memory fanout DAG" \
  --dag-file /tmp/memory-fanout-report-join.dag.json \
  --dry-run

DAG decision tree · memory-fanout-report-join
schema=ask.dag.v1 · max_concurrency=2

┌───────────────┐   ┌───────────────┐
│ memory_a      │   │ memory_b      │
│ memory.recall │   │ memory.recall │
│ [req]         │   │ [req]         │
└───────────────┘   └───────────────┘
        │                   │
        ──────────┴──────────
                  │
         ┌────────────────┐
         │ final_report   │
         │ skill.run      │
         │ [req]          │
         └────────────────┘

Trunk + 3-way fanout + join (same mockup shape):

                   ┌───────────────┐
                   │ shape         │
                   └───────────────┘
                           │
        ┬──────────────────┼───────────────────┬
┌──────────────┐   ┌──────────────┐   ┌────────────────┐
│ payload      │   │ exec         │   │ nico           │
└──────────────┘   └──────────────┘   └────────────────┘
        └──────────────────┴───────────────────┘
                            │
                    ┌──────────────┐
                    │ gate         │
                    └──────────────┘

Also valid — JSON only (automation / logs):

./run.sh ask "Plan the DAG" --dag-file /tmp/graph.dag.json --dry-run --json \
  | jq -r '.options.ask_dag.ascii_chart'

After a real run, correlate the chart with events.jsonl (dag_layer_started, dag_layer_finished, dag_layer_failed) and dag/manifest.json.

Required-node failure: who fixes the bug?

Bounded auto-repair (timeout bump, partial artifact consume) handles known deterministic modes. It does not replace debugging.

When a required node fails (allow_failure: false, the default):

/ask records dag_layer_failed, writes the failed node artifact (dag/<node_id>.json), and does not run dependent layers.
The project agent (or a /scillm exec subagent such as scillm exec pi-chutes-kimi / pi-opencode-kimi in read-only sandbox) must inspect evidence, fix the root cause, and re-run:
- dag/<failed_node>.json — error, stderr, returncode, repairs
- events.jsonl — dag_node_failed, dag_layer_failed, repair events
- Upstream skill logs under the run output root
Fixes are usually: correct input.args, skill bug patch, timeout profile, missing artifact path, or auth/routing (for ask.oracle, check /v1/scillm/auth before blaming the DAG).
Re-submit the same DAG with --resume when appropriate, or a corrected --dag-file after the code/config fix lands.

WRONG — treating auto-repair or answered as proof the workflow succeeded:

# dogpile still failed after repair; final_report never ran
jq '.status' .ask_artifacts/runs/<ask-id>/status.json   # may still show failure
grep dag_layer_failed .ask_artifacts/runs/<ask-id>/events.jsonl

WRONG — marking a broken required probe optional instead of fixing it:

{"id": "fresh_context", "type": "dogpile.search", "allow_failure": true}

Use allow_failure: true only for shadow/best-effort enrichment. If the main path needs dogpile output, fix timeout/skill/code — do not hide a required failure.

RIGHT — project agent loop after dag_layer_failed:

1. Read ascii_chart + dag_layer_failed event
2. Open dag/<node>.json and sibling skill stderr
3. Patch skill/CLI/config OR adjust DAG input
4. Re-run: ./run.sh ask "..." --dag-file fixed.dag.json --ask-id <id> --resume
5. Confirm dag/manifest.json shows downstream node artifacts

DAG JSON usage — good vs bad:

Use ask.dag.v1 for project-agent handoffs. Prefer --dag-file when the graph is non-trivial; use --dag-json only for small inline graphs.

RIGHT — layered workflow with explicit dependencies, safe timeouts, and a single final join:

{
  "schema_version": "ask.dag.v1",
  "graph_id": "report-with-fresh-research",
  "max_concurrency": 2,
  "nodes": [
    {
      "id": "memory_first",
      "type": "memory.recall",
      "input": {"query": "prior lessons for this report", "scope": "project", "k": 5}
    },
    {
      "id": "fresh_context",
      "type": "dogpile.search",
      "input": {
        "query": "current state of LLM workflow orchestration boundaries",
        "timeout": 360
      }
    },
    {
      "id": "final_report",
      "type": "skill.run",
      "depends_on": ["memory_first", "fresh_context"],
      "input": {
        "skill": "create-report",
        "args": [
          "--title", "Ask DAG Proof Report",
          "--input", "${dag_context_json}",
          "--output", "${dag_node_output}"
        ]
      }
    }
  ]
}

./run.sh ask "Run the report DAG"   --dag-file /tmp/report-with-fresh-research.dag.json   --ask-id report-dag-proof   --run-output-root /tmp/ask-dag-proof/runs   --overwrite   --json

Why this is good:

schema_version is ask.dag.v1.
Node id values are stable snake_case identifiers.
depends_on expresses the real join order.
dogpile.search gets a realistic timeout (or omits it and uses the profile default).
skill.run targets a runnable sibling skill and passes DAG context placeholders.
Model-pool / queue / provider-capacity fields are not in the graph; only ask.oracle nodes name a model/profile when needed.

RIGHT — optional probe that must not block the main path:

{
  "id": "shadow_dogpile",
  "type": "dogpile.search",
  "allow_failure": true,
  "input": {"query": "best-effort fresh sources", "timeout": 120}
}

Use allow_failure: true only for enrichment probes. Required nodes stay fail-closed (default allow_failure: false).

WRONG — treating the DAG like a scillm transport/routing config:

{
  "schema_version": "ask.dag.v1",
  "nodes": [
    {
      "id": "lane",
      "type": "ask.oracle",
      "input": {
        "prompt": "Review this",
        "model_pool": "gpt-pool",
        "provider_queue": "chutes",
        "fallback_chain": ["gemini", "claude"]
      }
    }
  ]
}

/ask owns orchestration and artifacts. $scillm owns pools, queues, fallback, and telemetry. Do not put transport knobs in project-agent DAG JSON.

WRONG — dependent node with no depends_on (race / empty context):

{
  "schema_version": "ask.dag.v1",
  "nodes": [
    {"id": "memory_first", "type": "memory.recall", "input": {"query": "context"}},
    {"id": "final_report", "type": "skill.run", "input": {"skill": "create-report", "args": ["--help"]}}
  ]
}

Without depends_on, final_report can run in parallel with memory_first and receive empty ${dag_context_json}.

WRONG — required node with an impossible budget, then expecting dependents to run:

{
  "schema_version": "ask.dag.v1",
  "nodes": [
    {
      "id": "fresh_context",
      "type": "dogpile.search",
      "input": {"query": "broad research query", "timeout": 30}
    },
    {
      "id": "final_report",
      "type": "skill.run",
      "depends_on": ["fresh_context"],
      "input": {"skill": "create-report", "args": ["--help"]}
    }
  ]
}

WRONG — inventing node types or skills:

{
  "schema_version": "ask.dag.v1",
  "nodes": [
    {"id": "x", "type": "scillm.batch", "input": {}},
    {"id": "y", "type": "skill.run", "input": {"skill": "made-up-skill", "args": []}}
  ]
}

Allowed node types: memory.recall, dogpile.search, ask.oracle, skill.run. skill.run skills must have a sibling SKILL.md and run.sh.

WRONG — using --orchestrate and --dag-file together without a clear source of truth:

./run.sh ask "Use $memory and $dogpile then $create-report"   --orchestrate   --dag-file /tmp/already-authored.dag.json

Pick one authoring path:

Natural language only: --orchestrate ( /ask drafts the DAG ).
Explicit graph: --dag-file or --dag-json (do not also --orchestrate unless you intend to replace the hand-authored graph).

Inspect a run:

./run.sh status --run <ask_id> --tail-events 25
./run.sh status --run .ask_artifacts/runs/<ask_id> --json
./run.sh status --run <ask_id> --watch --watch-timeout-seconds 300
./run.sh status --run <ask_id> --serve --open
./run.sh status --runs --limit 10
./run.sh status --prune --older-than-days 14 --dry-run

Runtime safety:

Generated run IDs include timestamp, question digest, and random suffix to avoid same-second collisions.
Explicit --ask-id reuse is rejected by default so event logs cannot mix separate runs.
--overwrite is explicit and replaces a prior run directory; --resume is explicit and only allowed for non-terminal runs.
Plain /ask degrades to a no-op runtime state if artifact writes fail; deep review and parallel review fail closed.
status --prune removes only validated direct-child ask.runtime.v1 run directories whose ask_id and artifacts.run_dir match the directory.
status --watch has a bounded timeout and exits nonzero if the run never reaches a terminal state.
status --serve is read-only, localhost-bound, token-gated, and intended for human inspection of run artifacts; it must not mutate answers or retry nodes.
Runtime artifacts are validated by doctor against the deterministic request/status/event schema.
status --runs reads the append-only index.jsonl first, then falls back to directory scanning.

Dry-run preview:

./run.sh ask "safe to proceed?" --deep-review --dry-run --json
./run.sh learn "Lisa Feldman Barrett" --dry-run
./run.sh nightly --dry-run --json
./run.sh os learn --dry-run --json

Saved review specs:

./run.sh ask "review this runtime layer" \
  --chain deep-review-safety \
  --reviewer-spec security \
  --reviewer-spec qa \
  --deep-review-target src/ask/run_state.py

Preflight the runtime:

./run.sh config doctor --profile release --json
./run.sh config init
./run.sh doctor
./run.sh doctor --json
./run.sh doctor --live --json

Oracle Synthesis:

./run.sh ask "What should we do next?" --oracle
./run.sh ask "What should we do next?" --oracle --oracle-backend subagent-runner --oracle-model gpt-5.5 --oracle-reasoning high

./run.sh ask oc-kimi "Brandon, review this Posture page screenshot against the design brief" \
  --oracle-image /tmp/posture-page.png \
  --ask-id posture-page-brandon-review \
  --run-output-root /tmp/ask-runs \
  --overwrite

For deliberation, set --oracle-iterations and optional persona roles. /ask will run sequential subagent-style oracle calls, feeding each turn into the next:

./run.sh ask "What should we do next?" --oracle \
  --oracle-backend subagent-runner \
  --oracle-persona "systems architect" \
  --oracle-peer "skeptical reviewer" \
  --oracle-iterations 3

Natural peer syntax is also supported:

./run.sh ask Brandon ask Margaret where are we weak?

Visible persistent subagents:

Contract:

/ask owns the runtime. Do not manually call /subagent-runner, send tmux keys, or scrape terminal text as a hidden fallback.
The public abstraction is ask_agent(agent_id, message): resolve the named collaborator's durable conversation state, run the next turn, persist transcript/events, and return the collaborator's final_text.
Conversation identity is provider-neutral state. Store agent_id, backend_type, conversation_id, role/cwd, and timestamps in a dedicated memory-backed registry such as codex_subagent_state; do not mix runtime ids into lesson/persona lore.
Codex App Server is one backend adapter. Its conversation_id is the Codex thread.id, and each message uses turn/start against that thread. Normal follow-up turns must not create a new thread unless the user explicitly resets/forks the collaborator.
thread/resume is for reconnecting/reloading a stored thread before the next turn/start; turn/steer is only for appending input to an active in-flight turn.
Proof is the visible conversation text plus artifacts: request/status/events, backend event log, result JSON, thread_id, turn_id, observed App Server methods, and returned final_text.

Example:

./run.sh ask "Bring Nico into this conversation to review the PDF annotator mockups" \
  --ask-id nico-pdf-oxide \
  --run-output-root /tmp/ask-visible-subagents \
  --overwrite

Required proof paths from the resulting /ask artifacts:

<ask-id>.request.json
<ask-id>.status.json
<ask-id>.events.jsonl
scillm/App Server event log path
scillm result JSON path
visible terminal rendering of Nico -> project agent: <final_text>

Optional tmux/TUI views are diagnostic only. They are not the machine interface and are not proof unless backed by the structured event/result artifacts above.

Ask-backed design review loops:

Required behavior:

Capture or accept the current screenshot bundle for the UI surface.
Send the screenshot(s), design constraints, and review-design verdict schema to the resolved reviewer model.
Require a structured verdict: satisfied, needs_changes, or blocked.
If the verdict is needs_changes, the project agent patches the UI, captures a fresh screenshot, and asks the reviewer again.
Stop at the first satisfied verdict, concrete blocker, or requested maximum round count.

The round cap is literal. "maximum 3 rounds" means no more than three reviewer verdict calls. The reviewer must inspect a fresh rendered screenshot before marking the design satisfied.

Peer turns can use any one-shot model supported by /scillm. This lets a Codex subagent converse with DeepSeek V4, MiniMax, Gemini, or other scillm routes:

./run.sh ask "What should we do next?" --oracle \
  --oracle-backend subagent-runner \
  --oracle-model gpt-5.5 \
  --oracle-persona "systems architect" \
  --oracle-peer "DeepSeek V4 critic" \
  --oracle-peer-model opencode-go/deepseek-v4-pro \
  --oracle-iterations 3

If --consult-personas is also set, suggested personas are included as advisory subagent context, and the top suggestion is used as the deliberation peer when --oracle-peer is omitted.

$ASK_ORACLE_MEMORY_RUN recall --q "Persona: Brandon Bailey"
$ASK_ORACLE_SCILLM_RUN warm-check --json
$ASK_ORACLE_DOGPILE_RUN --help

Core tool rules for oracle subagents:

/memory is mandatory for stored facts, actual personas, persona lessons, persona lore, prior lessons, and database state.
If no persona is specified, the oracle should use /memory recall as the primary persona selector because it already combines semantic, BM25, and graph traversal over persona lessons/lore; /monitor-personas is only an optional readiness/ops check.
Persona subagents must assume their persona may have stored lessons/lore and query it before answering from that persona's perspective.
Persona subagents may store a concise /memory learn record when the conversation produces a durable, reusable lesson; do not store transient chatter or secrets.
/scillm is available for one-shot peer model checks, not batch loops.
/dogpile is available for fresh external discovery only; do not use it for private/internal facts.
Subagents must state whether an answer used memory context, scillm peer checks, dogpile discovery, or inference.

Use oracle mode for single high-value questions, not nightly runs or batch ingestion loops.

Roundtable, Argue, Parallel Review, and CAE Gap Review Modes:

/ask supports four distinct review protocols:

--parallel-review: independent reviewers inspect the same artifact/question concurrently, then a neutral moderator synthesizes findings.
--argue: a FOR advocate and AGAINST advocate run in parallel through /scillm; a sequential judge decides or abstains, then a deterministic verifier gates the verdict.
--roundtable: selected personas speak sequentially through a state-machine protocol; each turn must reference prior claims and critiques.
--cae-gap-review: /create-evidence-case builds or loads the QRA/evidence snapshot first; Brandon, Margaret, and Jennifer review policy evidence, technical enforcement, and control mapping; a judge reroutes one missing evidence item per round before halting.

Use both together when you want independent findings first, followed by persona debate over those findings.

Personas and protocol roles are separate:

persona = domain/voice/source-of-judgment
protocol_role = job in the review loop

Explicit role assignment:

./run.sh ask "Formal Methods in large scale aerospace projects in 2026" \
  --roundtable \
  --roundtable-personas "Brandon:failure_mode,Margaret:evidence_auditor,Jennifer:complexity_minimizer" \
  --roundtable-rounds 2 \
  --dogpile auto

Natural roundtable syntax:

./run.sh ask Brandon, Margaret, and Jennifer to debate the relevance of Formal Methods in large scale aerospace projects in 2026

This maps to:

./run.sh ask "the relevance of Formal Methods in large scale aerospace projects in 2026" \
  --roundtable \
  --roundtable-personas "Brandon,Margaret,Jennifer" \
  --roundtable-role-preset adversarial-review \
  --roundtable-rounds 2 \
  --oracle-backend subagent-runner \
  --dogpile auto

Parallel review:

./run.sh ask "Review this implementation" \
  --parallel-review \
  --parallel-reviewers 3 \
  --parallel-review-focus correctness,tests,maintainability

Argue:

./run.sh ask "Should we ship this reversible runtime change?" \
  --argue

Forced binary decision:

./run.sh ask "Should we ship this reversible runtime change?" \
  --argue \
  --decision-required \
  --tie-breaker more-reversible

Review then roundtable:

./run.sh ask "Review this architecture" \
  --parallel-review \
  --roundtable \
  --roundtable-personas "Brandon:failure_mode,Margaret:evidence_auditor,Jennifer:complexity_minimizer"

CAE/QRA gap review:

./run.sh ask "cae gap review AC-2 MFA evidence for the production tenant" \
  --cae-reviewers "Brandon:cae_policy_evidence,Margaret:cae_technical_enforcement,Jennifer:cae_control_mapping" \
  --cae-judge "CAE Gap Judge" \
  --cae-max-rounds 3

Use it in the QRA lifecycle as:

generated QRA
  → candidate QRA
  → CAE gap review
  → human review
  → approve / edit / reject / defer
  → promote to sparta_qra or keep as gap

Protocol rules:

Critiques must anchor to specific claims.
Each participant must add a non-trivial disagreement or justify why none exists.
Critique and synthesis are separate; the moderator performs final synthesis.
Default persistence stores compact state, durable lessons, unresolved issues, and critique summaries.
SPARTA and space-cybersecurity questions use the deterministic SPARTA preflight contract in docs/ASK_SPARTA_PREFLIGHT_CONTRACT.md: preserve the question text, run /extract-entities and /memory recall first, route grounded SPARTA-corpora matches to /create-evidence-case, fail closed with needs_attention and safe_default=do_not_answer_as_grounded when required evidence-case creation is unavailable or fails, and continue normal /ask routing only when no grounded match is found.
SPARTA-corpora match signals are only extractor-grounded resolved control IDs, control metadata for SPARTA/CWE/NIST/CAPEC/ATT&CK, related/crosswalk pairs, taxonomy tags, or SPARTA recall items. Unresolved or fabricated SPARTA-looking references require needs_attention; never fabricate a control, crosswalk, relationship, or compliance status. All CAE/evidence-case outputs default to NEEDS_VERIFICATION and require human review before any status change. structured citations, and what would change the verdict. See docs/ASK_ARGUE_CONTRACT.md.

Deep Review Mode:

Use deep review when the human wants a comprehensive, Web-GPT-style review without copy-paste into the browser. Deep review is not /code-runner; it must produce analysis and artifacts, not patches.

# WRONG: reviewer receives compressed bytes, not readable evidence.
./run.sh ask "deep review this phase" \
  --deep-review \
  --deep-review-target /tmp/phase-review.zip \
  --oracle-backend scillm \
  --oracle-model gpt-5.5 \
  --oracle-reasoning high

# RIGHT: reviewer receives the actual readable phase artifacts.
./run.sh ask "deep review this phase" \
  --deep-review \
  --deep-review-target "/tmp/phase-review/PHASE_STATUS.json /tmp/phase-review/PHASE_REVIEW_REQUEST.md /tmp/phase-review/progress-context.md /tmp/phase-review/plan.dag.json" \
  --oracle-backend scillm \
  --oracle-model gpt-5.5 \
  --oracle-reasoning high

./run.sh ask "deep review this implementation" \
  --deep-review \
  --deep-review-target src/ask/ask.py \
  --deep-reviewers 5 \
  --deep-review-focus boundaries,fail-closed,tests,auditability \
  --oracle-backend subagent-runner \
  --oracle-model gpt-5.5 \
  --oracle-reasoning xhigh

Deep review writes:

.ask_artifacts/deep-review/<timestamp>/review.md
.ask_artifacts/deep-review/<timestamp>/review.json

When deep review runs under an --ask-id, those review.md and review.json paths are also registered in <ask_id>.status.json under artifacts.

Deep review verifier rules:

Reject missing or not_assessed required sections.
Reject SAFE or SAFE_WITH_CONDITIONS without inspected evidence.
Reject major findings that lack evidence, impact, fix, or verification.
Reject unexpected non-artifact file changes from before/after git status.
Treat JSON as an audit gate, not proof of reasoning quality.

Runner-backed oracle calls also have heartbeat/recovery telemetry:

/subagent-runner emits transcript delta and heartbeat events while the PTY session is alive.
/ask follows runner events.jsonl as the primary liveness channel and falls back to status polling only when necessary.
If transcript output is silent for --oracle-idle-timeout, the runner marks the session stalled and terminates the process group.
/ask writes sparse heartbeat snapshots to /memory collection ask_subagent_heartbeat.
Heartbeat records include session id, artifact dir, persona, model, turn number, status, transcript byte count, last_output_age_ms, timeout, and idle timeout.
On stalled, timed_out, failed, or cancelled, /ask returns the terminal status plus the transcript tail so the project agent can recover or retry with a different timeout/model.
E2E validators must fail empty answers, No answer could be synthesized, refusal-style non-answers, missing domain grounding, wrong persona routing, or missing roundtable participants.

Oracle Backend Decision Table:

User intent	Use	Why
One direct high-reasoning answer	`--oracle --oracle-backend scillm`	Fast path through `/scillm`
Browser/session-backed ChatGPT answer or review	`$ask webgpt ...` or `--oracle --oracle-backend webgpt`	Preserves `/ask` artifacts while using `/surf` sentinel proof against the authenticated ChatGPT tab
Focused Codex agent answer	`--oracle --oracle-backend subagent-runner`	Runs a real Codex CLI subagent session
Persona or peer deliberation	`--oracle --oracle-backend auto --oracle-iterations 2+`	`auto` selects `subagent-runner`
N-persona sequential debate	`--roundtable --roundtable-personas ...`	State-machine review protocol with claim anchoring
Two-sided calibrated decision	`--argue`	Two parallel `/scillm` advocates, sequential judge, verifier gate
N independent adversarial reviewers	`--parallel-review --parallel-reviewers N`	Parallel breadth before moderator synthesis
Independent findings then debate	`--parallel-review --roundtable`	Best for high-stakes review
Web-GPT-style deep review	`--deep-review --deep-review-target <target>`	Pass-based review with `review.md` and `review.json`
GPT-5.5 vs DeepSeek/Gemini/MiniMax debate	`--oracle-peer-model <scillm-model>`	Codex turn uses runner; peer turn uses `/scillm`
Batch/nightly ingestion	Do not use oracle	OAuth models and subagents are not batch lanes

How to Prompt an Agent to Use /ask Oracle:

Natural user prompts should map to these commands:

Use $ask oracle on "should subagent-runner replace direct scillm for focused agent calls?"

./run.sh ask "should subagent-runner replace direct scillm for focused agent calls?" \
  --oracle \
  --oracle-backend subagent-runner \
  --oracle-model gpt-5.5 \
  --oracle-reasoning high

Use $ask oracle with 3 rounds: architect vs skeptical reviewer.

./run.sh ask "<question>" \
  --oracle \
  --oracle-backend subagent-runner \
  --oracle-persona "systems architect" \
  --oracle-peer "skeptical reviewer" \
  --oracle-iterations 3

Use $ask oracle and have GPT-5.5 converse with DeepSeek V4.

./run.sh ask "<question>" \
  --oracle \
  --oracle-backend subagent-runner \
  --oracle-model gpt-5.5 \
  --oracle-persona "GPT-5.5 architect" \
  --oracle-peer "DeepSeek V4 critic" \
  --oracle-peer-model opencode-go/deepseek-v4-pro \
  --oracle-iterations 3

Correct Usage Examples:

# Ask memory first, then use GPT-5.5 high reasoning as the final oracle.
./run.sh ask "What is the best architecture for X?" --oracle

# Force direct scillm when you want a single fast oracle call.
./run.sh ask "Summarize the tradeoff in one paragraph." \
  --oracle --oracle-backend scillm

# Force a real Codex subagent session.
./run.sh ask "Analyze this design decision." \
  --oracle --oracle-backend subagent-runner

# Let /ask suggest personas and include them in oracle context.
./run.sh ask "How should we improve visual hierarchy?" \
  --consult-personas --oracle --oracle-iterations 2

Incorrect Usage Examples:

# WRONG: Asking deep-review to inspect a compressed package.
./run.sh ask "deep review this review bundle" \
  --deep-review \
  --deep-review-target /tmp/review-bundle.zip

# RIGHT: Extract or render the bundle, then pass concrete readable files.
./run.sh ask "deep review this review bundle" \
  --deep-review \
  --deep-review-target "/tmp/review-bundle/PHASE_STATUS.json /tmp/review-bundle/review-request.md /tmp/review-bundle/manifest.json"

# WRONG: Using oracle mode for many independent items.
for q in "${questions[@]}"; do ./run.sh ask "$q" --oracle; done

# RIGHT: Use normal /ask or batch-capable /scillm lanes for bulk work.
./run.sh ask "one high-value question" --oracle

# WRONG: Assuming --oracle-peer changes the model.
./run.sh ask "question" --oracle --oracle-peer "DeepSeek critic"

# RIGHT: Set the peer model explicitly for scillm one-shot model turns.
./run.sh ask "question" --oracle \
  --oracle-peer "DeepSeek critic" \
  --oracle-peer-model opencode-go/deepseek-v4-pro \
  --oracle-iterations 2

# WRONG: Asking for persona deliberation with --raw.
./run.sh ask "question" --raw --oracle --oracle-iterations 3

# RIGHT: Oracle synthesis requires synthesized context, so omit --raw.
./run.sh ask "question" --oracle --oracle-iterations 3

# WRONG: Treating /ask as a replacement for fresh web search.
./run.sh ask "latest price/news today" --oracle

# RIGHT: Use /dogpile or a search-capable skill first, then ask the oracle over gathered context.
./run.sh ask "question over learned or retrieved context" --oracle

Persona Routing:

When --consult-personas is enabled, /ask uses Federated Taxonomy bridges to find personas best suited to answer the question:

./run.sh ask "How should we improve visual hierarchy in the UI?" --consult-personas

# Output:
#   Suggested personas to consult:
#     - Paula Scher (Graphic Designer) [Precision]
#     - Don Norman (Cognitive Scientist) [Precision, Loyalty]

This enables cross-persona knowledge queries where Embry can automatically identify that Paula Scher should be consulted for typography questions.

Auto-Learn Flow:

Question → Memory Recall → No results?
  YES → Learn Pipeline (dogpile → YouTube → web → QRA → store)
      → Re-query Memory → Return answer with multi-hop traversal
  NO  → Return answer directly with bridge connections

`learn` — Discover and Ingest Knowledge

./run.sh learn <topic> [options]

Options:
  --scope <scope>         Memory scope (default: "ask")
  --collection <coll>     Taxonomy collection (lore, operational, sparta, behavioral)
  --depth <level>         Learning depth: quick (5-10min), standard (30-60min), deep (hours)
  -i, --interactive       Use /interview to ask about learning preferences
  --youtube <url>         Specific YouTube URL to ingest (repeatable)
  --books-only            Only discover and process books
  --youtube-only          Only process YouTube content
  --max-books <n>         Max books to discover (default: 5)
  --max-videos <n>        Max YouTube videos to process (default: 3)
  --dry-run               Preview what would be ingested without storing
  --ask-id <id>           Stable runtime artifact id for this learn call
  --run-output-root <dir> Runtime artifact root (default: .ask_artifacts/runs or ASK_RUN_OUTPUT_DIR)
  --overwrite             Replace an existing run directory for --ask-id
  --resume                Resume a non-terminal existing run directory for --ask-id
  --debug                 Enable debug logging

Learning Depths:

Depth	Time	Videos	Books	ArXiv	Use Case
`quick`	5-10 min	3	0	0	Quick overview, verify facts
`standard`	30-60 min	5	3	3	Moderate understanding
`deep`	2-6 hours	10+	5	10	Comprehensive persona building

Persona Detection: When the topic looks like a person's name (e.g., "Lisa Feldman Barrett", "Dr. Robert Sapolsky"), the skill automatically:

Stores a persona profile to memory
Searches for additional lectures by the person
Downloads books by/about the person
Creates a queryable knowledge base

`status` — Learning Progress

./run.sh status [options]

Options:
  --scope <scope>         Filter by scope
  --run <id|path>         Show runtime status for an ask id, run directory, or status file
  --tail-events <n>       Include the last N runtime events with --run
  --watch                 Watch runtime status until terminal
  --watch-timeout-seconds <n> Maximum seconds to wait with --watch
  --poll-interval-seconds <n> Polling interval for --watch
  --serve                 Serve a local read-only HTML viewer for --run
  --open                  Open the local HTML viewer in a browser
  --serve-port <n>        Port for --serve; 0 selects a free port
  --serve-ttl-seconds <n> Seconds to keep viewer alive after terminal state
  --runs                  List recent runtime runs
  --limit <n>             Maximum runs to list with --runs
  --prune                 Prune old runtime run directories
  --older-than-days <n>   Age threshold for --prune (default: 14)
  --dry-run               Preview --prune without deleting
  --run-output-root <dir> Runtime artifact root for --run ids
  --json                  JSON output
  --debug                 Enable debug logging

Shows:

Total knowledge items in scope
Persona profiles
Q-R-A pairs count
Last task-monitor state (steps, timing, stats, ETA)

`nightly` — Scheduled Persona Updates

./run.sh nightly [options]

Options:
  --scope <scope>         Memory scope to update (default: ask)
  --persona <name>        Update a single persona by name
  --dry-run               Preview without storing
  --ask-id <id>           Stable runtime artifact id for this nightly call
  --run-output-root <dir> Runtime artifact root (default: .ask_artifacts/runs or ASK_RUN_OUTPUT_DIR)
  --overwrite             Replace an existing run directory for --ask-id
  --resume                Resume a non-terminal existing run directory for --ask-id
  --json                  Output summary as JSON
  --debug                 Enable debug logging

Nightly Update Flow:

Query memory for all stored persona profiles
For each persona, search for new content since last update
Ingest new YouTube videos, papers, news articles
Update persona profile with new sources
Report summary via task-monitor

`os` — Query Embry-OS Internals

# Index OS knowledge (skills, packages, config)
./run.sh os learn --depth quick --dry-run
./run.sh os learn --depth standard
./run.sh os learn --depth quick --dry-run --ask-id os-preview

# Query OS knowledge
./run.sh os ask "what does the /dogpile skill do?"
./run.sh os ask "which skills provide memory?" --json
./run.sh os ask "which skills provide memory?" --ask-id os-memory-query

# Query runtime health
./run.sh os health "is memory healthy?"
./run.sh os health "check workstation" --subsystem workstation
./run.sh os health "is memory healthy?" --ask-id os-memory-health

# Classify query intent
./run.sh intent "how does the memory skill work?"

OS Health dispatches to the relevant monitor/ops skill (e.g., monitor-memory health --json) and combines runtime data with static knowledge from memory.

Intent Classifier routes queries through a 3-stage pipeline:

Memory cache (~1ms) — cached classifications
Rule-based heuristics (<5ms) — regex pattern matching
LLM fallback (1-5s) — scillm classification when uncertain

Architecture

Agent: "Lisa Feldman Barrett how might we improve our memory system?"
                    │
                    ▼
            ┌──────────────┐
            │ Persona Detect│  ← Is this a person?
            └──────┬───────┘
                   │
            ┌──────┴───────┐
            │ Memory Recall │  ← Check what we already know
            └──────┬───────┘
                   │
              Items found?
              ┌────┴────┐
             YES        NO + --auto-learn
              │          │
              │    ┌─────┴──────────────────────────────────┐
              │    │ Learn Loop (multi-source, multi-hour)  │
              │    │                                         │
              │    │ 1. /dogpile (Brave + Perplexity + etc)  │
              │    │ 2. YouTube ingest (lectures, interviews)│
              │    │ 3. Web fetch (blogs, articles)          │
              │    │ 4. discover-books (OpenLibrary)         │
              │    │ 5. extractor --format qra               │
              │    │ 6. memory learn + persona profile       │
              │    │                                         │
              │    │ (tracked via task-monitor with ETA)     │
              │    └─────┬──────────────────────────────────┘
              │          │
              │    ┌─────┴──────┐
              │    │ Re-query   │
              │    └─────┬──────┘
              │          │
              └────┬─────┘
                   │
            ┌──────┴───────┐
            │  Synthesize  │  ← Combine results
            └──────┬───────┘
                   │
            ┌──────┴────────────────┐
            │ Federated Taxonomy    │  ← Multi-hop bridge traversal
            │ (Corruption, Precision,│
            │  Resilience, etc.)     │
            └───────────────────────┘

Task-Monitor Integration

Every learn session registers with /task-monitor:

Registry: ~/.pi/task-monitor/registry.json
State file: .pi/skills/ask/ask_task_state.json (atomic writes)
Steps tracked: memory_check → dogpile → ingest_youtube → fetch_web → extractor_qra → store
Sub-steps: Individual items within each step (e.g., each video URL)
ETA: Estimated time remaining based on depth and progress
Stats: books_discovered, youtube_ingested, web_fetched, qra_extracted, items_stored

# View real-time progress
uv run --project .pi/skills/ask python -m json.tool < .pi/skills/ask/ask_task_state.json

# Example state output:
{
  "completed": 3,
  "total": 6,
  "progress_pct": 55.0,
  "current_item": "ingest_youtube",
  "current_detail": "https://youtube.com/watch?v=...",
  "substep_current": 2,
  "substep_total": 5,
  "eta_seconds": 180.0,
  "eta_display": "~3 min remaining",
  "depth": "standard"
}

# Via task-monitor TUI
cd ~/.pi/skills/task-monitor
uv run python monitor.py tui --filter ask

Federated Taxonomy Integration

Knowledge is tagged with bridge attributes for multi-hop graph traversal:

Bridge	Meaning	Example Topics
Corruption	Moral decay, entropy	Power dynamics, institutional failure
Precision	Exactness, clarity	Scientific method, measurement
Resilience	Recovery, adaptation	Stress response, neuroplasticity
Fragility	Vulnerability	Trauma, system failure
Stealth	Hidden operations	Unconscious processes

Multi-hop Query Example:

Query: "How does stress affect decision-making?"
          │
          ├── Direct hits: Stress research papers
          │
          └── Bridge traversal:
              ├── [Resilience] → Neuroplasticity studies
              ├── [Fragility] → Trauma responses
              └── [Corruption] → Decision biases under stress

Memory Scopes

Scope	Use
`behavioral`	Psychology, neuroscience, behavioral studies
`ask`	General learning (default)
Custom	Any scope name you provide

Environment

Variable	Purpose
`ASK_DEFAULT_SCOPE`	Override default memory scope
`ASK_NIGHTLY_SCOPE`	Scope for nightly updates
`ASK_MAX_BOOKS`	Override default max books to discover
`ASK_MAX_VIDEOS`	Override default max videos
`ASK_ORACLE_MODEL`	Override default oracle model (default: `gpt-5.5`)
`ASK_ORACLE_REASONING`	Override default oracle reasoning effort (default: `high`)
`ASK_ORACLE_TIMEOUT`	Override default oracle timeout seconds (default: `300`)
`ASK_ORACLE_IDLE_TIMEOUT`	Override subagent silence timeout before `stalled` (default: `300`)
`ASK_ORACLE_HEARTBEAT_INTERVAL`	Override heartbeat write interval seconds (default: `30`)
`ASK_ORACLE_BACKEND`	Override oracle backend (default: `auto`)
`ASK_RUN_OUTPUT_DIR`	Override runtime artifact root for request/status/events files
`ASK_SUBAGENT_RUNNER`	Path to subagent-runner `run.sh`
`ASK_SUBAGENT_OUTPUT_DIR`	Output directory for oracle subagent artifacts
`ASK_MEMORY_RUN`	Path to memory `run.sh` exposed to oracle subagents
`ASK_SCILLM_RUN`	Path to scillm `run.sh` exposed to oracle subagents
`ASK_DOGPILE_RUN`	Path to dogpile `run.sh` exposed to oracle subagents
`ASK_MONITOR_PERSONAS_RUN`	Path to monitor-personas `run.sh` exposed to oracle subagents
`SCILLM_BASE_URL`	scillm base URL for oracle synthesis (default: `http://localhost:4001`)
`SCILLM_API_KEY`	scillm bearer token for oracle synthesis (default: `sk-dev-proxy-123`)
`ASK_DEBUG`	Enable debug logging (set to any value)
`TASK_MONITOR_URL`	Task-monitor API URL for remote push

Related Skills

Skill	Relationship
`/dogpile`	Primary discovery engine (Brave, Perplexity, ArXiv, YouTube, GitHub)
`/memory`	Knowledge storage and retrieval
`/surf`	Browser transport and sentinel proof layer for `$ask webgpt`
`/project-knowledge`	Curated human-readable current-state document for `/ask` development
`/discover-books`	Book discovery via OpenLibrary
`/ingest-youtube`	YouTube transcript extraction
`/fetcher`	Web content fetching
`/extractor`	Document extraction with QRA mode
`/taxonomy`	Federated Taxonomy tagging
`/interview`	Interactive preference gathering
`/task-monitor`	Progress tracking with ETA
`/scheduler`	Nightly update scheduling
`/prompt-lab`	Prompt optimization for scillm calls

Persona Profiles

When learning about a person, a persona profile is stored to memory:

{
  "name": "Lisa Feldman Barrett",
  "scope": "behavioral",
  "sources": {
    "dogpile_sections": 5,
    "books": 3,
    "youtube": 8,
    "web": 4
  },
  "stats": {
    "qra_extracted": 45,
    "stored": 46
  },
  "last_updated": "2024-01-15T10:30:00"
}

Query a persona:

./run.sh ask "How does Barrett define emotions?" --scope behavioral --bridges

Nightly Scheduling

To run nightly persona updates automatically, use the /scheduler skill or cron:

# Via scheduler skill
./path/to/scheduler/run.sh add ask-nightly \
  --command ".agent/skills/ask/run.sh nightly --scope behavioral" \
  --schedule "0 3 * * *"  # 3 AM daily

# Via cron
# Add to crontab -e:
# 0 3 * * * /path/to/.agent/skills/ask/run.sh nightly --scope behavioral >> /var/log/ask-nightly.log 2>&1

ask

More from this repository

More from this repository

ask

Literal Runtime Contract

Browser Oracle Backends — routing (read first)

Team defaults (which backend for which work)

Shared limitations (all browser backends)

Backend reference

WebGPT Oracle Backend

Per-project tab bindings

Tech lead vs code runner (read this before WebGPT rounds)

Bounded reviewer/executor workflow

Collaboration status file (required for multi-round WebGPT)

Explorer / UI page review loop (WebGPT + test-interactions)

Cursor Browser Oracle Backend (within Cursor IDE)

Prerequisites

Quick start

Tab resolution (fail-closed)

Proof artifacts

When to use which path

WebGemini Oracle Backend (design)

WebKimi Oracle Backend (prose)

WebPerplexity Oracle Backend (research)

Image Generation Mode

Zero Cognitive Load for Project Agents

Human Chat Usage

Quick Start

Commands

ask — Query Accumulated Knowledge

Natural-Language DAG Orchestration

DAG JSON Mode

Concurrent lanes and sequential joins

PHART renderer (preferred)

Terminal ASCII DAG chart

Required-node failure: who fixes the bug?

learn — Discover and Ingest Knowledge

status — Learning Progress

nightly — Scheduled Persona Updates

os — Query Embry-OS Internals

Architecture

Task-Monitor Integration

Federated Taxonomy Integration

Memory Scopes

Environment

Related Skills

Persona Profiles

Nightly Scheduling

ask

Literal Runtime Contract

Browser Oracle Backends — routing (read first)

Team defaults (which backend for which work)

Shared limitations (all browser backends)

Backend reference

WebGPT Oracle Backend

Per-project tab bindings

Tech lead vs code runner (read this before WebGPT rounds)

Bounded reviewer/executor workflow

Collaboration status file (required for multi-round WebGPT)

Explorer / UI page review loop (WebGPT + test-interactions)

Cursor Browser Oracle Backend (within Cursor IDE)

Prerequisites

Quick start

Tab resolution (fail-closed)

Proof artifacts

When to use which path

WebGemini Oracle Backend (design)

WebKimi Oracle Backend (prose)

WebPerplexity Oracle Backend (research)

Image Generation Mode

Zero Cognitive Load for Project Agents

Human Chat Usage

Quick Start

Commands

ask — Query Accumulated Knowledge

Natural-Language DAG Orchestration

DAG JSON Mode

Concurrent lanes and sequential joins

PHART renderer (preferred)

Terminal ASCII DAG chart

`ask` — Query Accumulated Knowledge

`learn` — Discover and Ingest Knowledge

`status` — Learning Progress

`nightly` — Scheduled Persona Updates

`os` — Query Embry-OS Internals

`ask` — Query Accumulated Knowledge

`learn` — Discover and Ingest Knowledge

`status` — Learning Progress

`nightly` — Scheduled Persona Updates

`os` — Query Embry-OS Internals