| name | debug-dashboard |
| description | Diagnose problems in the running pi-agent-dashboard system. Tail ~/.pi/dashboard/server.log, probe /api/health for mode + uptime, check bridge WebSocket connectivity, triage vitest failures via tee→grep, inspect known-issue FAQ entries (Electron Node bin selection, Fastify + bad-Node crashes, stale-port hangs, single-instance lock). Routes UI/visual issues to the browser skill. Use when the server seems hung, a pi session won't connect, tests fail mysteriously, the dashboard shows a blank page, restart loops, port conflicts, or any "why isn't X working" / "the dashboard is doing Y" question. |
Debug Dashboard
System-level debugging for the running pi-agent-dashboard. Three layers:
┌─────────────────────────────────────────────────────────┐
│ Layer 1 — Is the server alive? │
│ npx tsx ./scripts/health-probe.ts │
│ npx tsx ./scripts/tail-server-log.ts │
└─────────────────────────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────┐
│ Layer 2 — Are the bridges connecting? │
│ npx tsx ./scripts/list-sessions.ts │
│ npx tsx ./scripts/tail-server-log.ts --errors│
└─────────────────────────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────┐
│ Layer 3 — Is the UI rendering? │
│ (use the browser skill) │
└─────────────────────────────────────────────────────────┘
First moves — always run these
npx tsx ./scripts/health-probe.ts
npx tsx ./scripts/tail-server-log.ts
npx tsx ./scripts/list-sessions.ts
Scripts are TypeScript (cross-platform). All invocations use npx tsx so they work on Linux, macOS, and Windows. tsx is already a project dep.
If health-probe says "not-running" → server isn't up. Check server.log for the most recent start banner ([bootstrap] ready ...) and what came after. The log appends with timestamped headers per start, so the last banner block is the relevant one.
When the server is up but misbehaving
| Symptom | Likely cause | Where to look |
|---|
| Restart loops | Stale port held by zombie, or restart racing with bridge auto-start | lsof -i :8000, then pi-dashboard stop (kills by port, not just PID) |
EADDRINUSE on start | Concurrent spawn from multiple pi sessions | Harmless — losing process exits silently. Check log. |
| Bridge connects then disconnects | server_restarting broadcast active, or version skew | grep server_restarting in server.log; check /api/health for version |
| Blank page in browser | Vite not running in dev mode (silent fallback to prod build); or auth blocking | Check /api/health.mode; check auth settings |
Cannot connect to dashboard server on Electron boot | launchDashboardServer fell back to process.execPath (Electron GUI binary) | See references/known-issues.md → "Electron Node bin selection" |
| Fastify crashes immediately | Bad Node version (22.0–22.17.x or 24.1–24.2.x per nodejs/node#58515) | node --version — must be ≥ 22.18.0 |
Full known-issue catalogue: references/known-issues.md.
When tests fail mysteriously
Use the tee→grep pattern. Never rerun to inspect — capture once, grep forever:
npx tsx ./scripts/run-tests-triage.ts
npx tsx ./scripts/run-tests-triage.ts packages/server
npx tsx ./scripts/run-tests-triage.ts -t 'my test'
After it finishes, the log lives at:
- Linux/macOS:
/tmp/pi-test.log
- Windows:
%TEMP%\pi-test.log
You can re-grep / re-read with custom patterns; the script prints the absolute path on each run.
Patterns + per-package vitest configs + watch mode are documented in references/test-failure-triage.md.
When the UI is the problem
This skill stops at "the server says X but the UI shows Y". For visual debugging — verifying layouts, screenshotting, hunting console errors, testing responsive breakpoints — switch to the browser skill (shipped by the dashboard bridge extension to every session). Quick pointer: see references/ui-debug.md.
Log file locations
All log + config files. Worth bookmarking. See references/log-locations.md for the full map.
| Path | Contents |
|---|
~/.pi/dashboard/server.log | Daemon stdout/stderr, append mode, timestamped headers per start |
~/.pi/dashboard/server.pid | PID file (may be stale; use lsof -i :8000 to confirm) |
~/.pi/dashboard/config.json | Live config (port, piPort, auth, tunnel, plugins) |
~/.pi/dashboard/zrok.pid | Tunnel PID file |
~/.pi/dashboard/model-proxy.jsonl | Model proxy request log (50 MB rotation) |
~/.pi/dashboard/tool-overrides.json | Tool registry overrides |
/tmp/pi-test.log | Last test run (when you used the tee→grep pattern) |
~/.pi/agent/sessions/ | Per-session pi state |
STOP — docs-first gate
Before answering any "how do I X" / "why does Y happen" question:
grep -ni '<keyword>' docs/faq.md docs/*.md README.md
The FAQ already documents most recurring symptoms (Electron Node bin, Fastify crash range, tunnel watchdog, port conflicts). Reading source before grepping docs wastes tokens and risks wrong answers. Per AGENTS.md "STOP — Docs-First Gate".
Related skills
browser — UI/visual issues, screenshots, console errors, responsive testing (Electron + web)
implement — back to implementing once the bug is identified
pi-dashboard — interact with the dashboard via REST (list sessions, send prompts, abort)
ci-troubleshoot — when the problem only shows up in CI
code-review — review the fix before committing