| name | openclaw-debugging |
| description | Debug OpenClaw model, provider, tool-surface, code-mode, streaming, and live/Crabbox behavior by choosing the right logs, probes, and proof path before changing code. |
OpenClaw Debugging
Use this skill when OpenClaw behavior differs between local tests, live models,
providers, code mode, Tool Search, Crabbox, or CI, and the next move should be a
debug signal rather than a guess.
Read First
docs/logging.md for log files, openclaw logs, and targeted debug flags.
docs/reference/test.md for local test commands.
docs/reference/code-mode.md for code-mode exec/wait and tool catalog rules.
- Use
$openclaw-testing for choosing test lanes.
- Use
$crabbox for broad, Docker, package, Linux, live-key, or CI-parity proof.
Default Loop
- State the suspected boundary: config, tool construction, provider payload,
fetch, stream/SSE, transcript replay, worker/runtime, package/dist, or CI.
- Add or enable the narrowest signal that proves that boundary.
- Reproduce with the same provider/model/config. Do not randomly switch models
unless the model itself is the variable being tested.
- Compare configured state with actual run activation.
- Patch the root cause.
- Rerun the exact failing probe, then broaden only if the contract requires it.
Model Transport Logs
Use targeted env flags instead of global debug when the model request shape or
stream timing matters:
OPENCLAW_DEBUG_MODEL_TRANSPORT=1 openclaw gateway
OPENCLAW_DEBUG_MODEL_PAYLOAD=tools OPENCLAW_DEBUG_SSE=events openclaw gateway
OPENCLAW_DEBUG_MODEL_PAYLOAD=full-redacted OPENCLAW_DEBUG_SSE=peek openclaw gateway
Useful flags:
OPENCLAW_DEBUG_MODEL_TRANSPORT=1: request start, fetch response, SDK
headers, first SSE event, stream done, and transport errors at info.
OPENCLAW_DEBUG_MODEL_PAYLOAD=summary: bounded payload summary.
OPENCLAW_DEBUG_MODEL_PAYLOAD=tools: all model-facing tool names.
OPENCLAW_DEBUG_MODEL_PAYLOAD=full-redacted: capped, redacted JSON payload.
Use only while debugging; prompts/message text may still appear.
OPENCLAW_DEBUG_SSE=events: first-event and stream-completion timing.
OPENCLAW_DEBUG_SSE=peek: first five redacted SSE events.
OPENCLAW_DEBUG_CODE_MODE=1: code-mode tool-surface diagnostics.
Watch logs with:
openclaw logs --follow
Common Boundaries
- Config vs activation: config can be enabled while the run disables tools,
is raw, has an empty allowlist, or lacks model tool support. Check the actual
visible tools before enforcing provider payload invariants.
- Tool surface: inspect final model-visible tool names, not only the tool
registry or config. Code mode means exactly
exec and wait only after it
actually activates.
- Provider payload: log fields, model id, service tier, reasoning, input
size, metadata keys, prompt-cache key presence, and tool names before SDK
call.
- Fetch vs SSE: fetch response proves HTTP headers arrived; first SSE event
proves provider body progress. A gap here is a stream/body/provider issue, not
tool execution.
- Worker/dist: run
pnpm build when touching workers, dynamic imports,
package exports, lazy runtime boundaries, or published paths.
- Live keys: use the configured secret workflow for missing provider keys
before saying live proof is blocked. Env checks are presence-only; never print
secrets.
Code Pointers
- Model payload + Responses stream:
src/agents/openai-transport-stream.ts
- Guarded fetch/timing:
src/agents/provider-transport-fetch.ts
- OpenAI/Codex provider wrappers:
src/agents/pi-embedded-runner/openai-stream-wrappers.ts
- Tool construction, Tool Search, code-mode activation:
src/agents/pi-embedded-runner/run/attempt.ts
- Code-mode runtime and worker:
src/agents/code-mode.ts
src/agents/code-mode.worker.ts
- Tool Search catalog:
src/agents/tool-search.ts
Proof Choice
- Single helper/payload bug: local targeted Vitest.
- Docs/logging-only:
pnpm check:docs and git diff --check.
- Worker/dist/lazy import/package surface: targeted tests plus
pnpm build.
- Live provider/model behavior: same provider/model with debug flags and a real
key if available.
- Docker/package/Linux/CI-parity:
$crabbox.
- CI failure: exact SHA, relevant job only, logs only after failure/completion.
Output Habit
Report:
- boundary tested
- exact command/env shape, redacted
- observed signal, such as tool names or first SSE event timing
- fix location
- narrow proof and any remaining risk