name	capturing-screenshots
description	Capture or update documentation screenshots for the Quarto website using Playwright. Use when screenshots need refreshing, new screenshots are needed for docs pages, or the user mentions screenshots, screen captures, or visual documentation.
allowed-tools	Bash(node _tools/screenshots/*), Bash(npm.cmd *), Bash(cat *), Bash(playwright-cli *), Bash(oxipng *), Agent

Setup

When this skill loads, run these commands to gather context:

List registered screenshots: bash "${CLAUDE_SKILL_DIR}/scripts/list-screenshots.sh"
Read visual rules: cat _tools/screenshots/CLAUDE.md
Read capture agent reference: cat "${CLAUDE_SKILL_DIR}/capture-agent.md"
Read manifest schema: cat "${CLAUDE_SKILL_DIR}/manifest-schema.md"

Working directory: npm run commands (render, capture, compress) work from any directory — they resolve paths from _tools/screenshots/package.json. Direct node scripts/... calls and playwright-cli must run from _tools/screenshots/ or use absolute paths. Be careful not to double up path segments if you've already cd'd into _tools/screenshots/.
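A minimal sketch of the path rule above, assuming the current directory is either the repository root or _tools/screenshots/ itself. The helper name script_path is hypothetical and not part of the tooling; it only shows how to avoid doubling the _tools/screenshots/ segment.

```shell
# Hypothetical helper: print the path to a script under _tools/screenshots/scripts,
# relative to the directory given as $2 (defaults to the current directory).
# Assumes that directory is either the repo root or _tools/screenshots/.
script_path() {
  script="$1"
  cwd="${2:-$PWD}"
  case "$cwd" in
    # Already inside _tools/screenshots: do not repeat the prefix.
    */_tools/screenshots) printf '%s\n' "scripts/$script" ;;
    # Otherwise assume the repo root and spell out the full relative path.
    *) printf '%s\n' "_tools/screenshots/scripts/$script" ;;
  esac
}
```

For example, `node "$(script_path render.js)" <project-path>` would resolve to the same script from either location.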

Instructions

You are the screenshot orchestrator. The list output shows all registered screenshots, the visual rules define quality standards, and the capture agent reference describes how browser operations work.

If the user wants to UPDATE existing screenshots:

Ask which screenshots to update (or "all")
Process screenshots one at a time, and never batch-capture without confirmation:
  a. Render: node _tools/screenshots/scripts/render.js <project-path> (can batch-render all profiles upfront)
  b. Capture: npm run capture -- --name <name> (handles serve, capture, dark variant, compress)
  c. Show the user the output image(s) using the Read tool
  d. STOP and wait for explicit confirmation before proceeding to the next screenshot
  e. If the user requests adjustments, update manifest and re-capture
  f. Only after confirmation, move to the next screenshot
Show results summary

Critical: Each screenshot requires user visual review and explicit approval. Do not proceed to the next screenshot until the user confirms the current one is acceptable. This applies to both new captures and re-captures of existing screenshots.
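The one-at-a-time gate can be sketched as a shell loop. This is only an illustration of the control flow: in practice the orchestrator shows the image with the Read tool and waits for the user in conversation, and rendering is assumed to have happened already. The function name update_screenshots and the CAPTURE_CMD override (useful for dry runs) are hypothetical.

```shell
# Capture each named screenshot in turn and stop at the first one that is
# not explicitly approved. CAPTURE_CMD is a hypothetical override; by default
# the capture command quoted in the steps above is used.
update_screenshots() {
  for name in "$@"; do
    # Intentionally unquoted so the default command splits into words (sh/bash).
    ${CAPTURE_CMD:-npm run capture -- --name} "$name" || return 1
    printf 'Approve %s? [y/N] ' "$name"
    read -r answer || return 1
    case "$answer" in
      y|Y) ;;  # approved: move on to the next screenshot
      *) echo "Stopped at $name: adjust the manifest and re-capture."; return 1 ;;
    esac
  done
  echo "All screenshots approved."
}
```

Anything other than an explicit "y" halts the loop, which mirrors the rule that silence is not approval.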

If the user wants to CREATE a new screenshot:

Gather these parameters (ask about unknowns, infer from context when obvious):

Parameter	Values / Notes
Source type	`url` (live site), `example` (Quarto project — render then serve)
Source detail	URL or example project path (create minimal project if needed)
Viewport	navbar=1440x400, sidebar=992x600, about=1200x900, full page=1440x900
Zoom	Default 1.0; use 1.15 for about pages or excess internal padding
Element	CSS selector if capturing a specific element; omit for full viewport
Interactions	Clicks, hovers, etc. needed before capture
Trim / Crop	`trim: true` for uniform background edges; `cropBottom`/`maxHeight` when vertical rules prevent trim
Output path	Suggest based on doc location
Doc file	Which .qmd references this image (for manifest `doc.file`)
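As a small illustration of the Viewport row, the category defaults can be written as a lookup that returns a width and height. The function name viewport_for and the category key full-page are hypothetical labels; the sizes are the ones listed in the table.

```shell
# Print "WIDTH HEIGHT" for a screenshot category from the viewport table.
viewport_for() {
  case "$1" in
    navbar)    echo "1440 400" ;;
    sidebar)   echo "992 600" ;;
    about)     echo "1200 900" ;;
    full-page) echo "1440 900" ;;
    *) echo "unknown category: $1" >&2; return 1 ;;
  esac
}
```

The two numbers can then be passed on as the width and height arguments of a resize command.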

Then work through two phases:

Phase A: Visual design (what to capture)

Use playwright-cli to explore the page interactively and nail down the visual. Phase A ends when the user approves the screenshot visual.

Create example project if needed
Render: node _tools/screenshots/scripts/render.js <project-path> (add --profile <name> if needed)
Serve the rendered output directory: node _tools/screenshots/scripts/serve.js <output-dir>
  The serve script takes a directory path; it does not understand --profile. For default renders, the output is _site/ inside the project. For profiled renders, it's docs-<profile>/ (e.g., examples/navbar-basic/docs-reader-mode). Check the render output to confirm the actual path.
Open in headed mode: playwright-cli -s=screenshot open --headed <url> (headed mode shows the browser window so you can see the page)
Discover what to capture:
  a. Take a snapshot (playwright-cli -s=screenshot snapshot) to see page structure
  b. If replacing an existing screenshot, download and read the current image to understand what it looks like (e.g., curl -sL -o "$TMPDIR/existing.png" <url> then Read tool). Note what's included, cropped, and framed; the new screenshot should match unless the doc content has changed.
  c. Read the .qmd doc file to understand what the image should illustrate: check the YAML example above the image, the fig-alt text, and surrounding prose
  d. Determine initial viewport from the category table (navbar=1440x400, sidebar=992x600, about=1200x900, full page=1440x900)
Test and iterate in headed mode:
  a. Resize: playwright-cli -s=screenshot resize <w> <h>
  b. Test cleanup evals if needed (hiding elements, removing banners)
  c. Test interactions (click/hover): take snapshot, find ref, click, verify state
  d. Take a test screenshot: playwright-cli -s=screenshot screenshot --filename="$TMPDIR/test.png"
  e. Show the screenshot to the user: npm run open -- "$TMPDIR/test.png" (cross-platform; do NOT use open or start directly)
  f. Provide review context so the user can judge the screenshot:
    - Which .qmd file and section (line number, heading)
    - The fig-alt text (what the image is supposed to show)
    - The code example shown alongside it in the doc (if any)
    - A link to the live doc page if available (e.g., quarto.org URL)
    - What to specifically check (does navbar match the YAML? Are the right items visible? etc.)
  g. Ask: "Does this capture what the doc needs? Anything to adjust?"
  h. Repeat until the user approves the visual
Encode findings into manifest:
  a. Read manifest-schema.md for the complete field reference
  b. Create the manifest entry based on what was validated interactively
  c. Every field value should come from tested exploration, not guesswork
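The output-directory rule from the serve step can be sketched as a helper that derives the directory to pass to serve.js. The function name rendered_output_dir is hypothetical, and the result is only the expected location under the stated convention; the render output remains the authority on the actual path.

```shell
# Print the expected rendered output directory for a project.
# $1: project path; $2: optional profile name.
rendered_output_dir() {
  project="${1%/}"   # drop a trailing slash, if any
  profile="${2:-}"
  if [ -n "$profile" ]; then
    # Profiled renders are expected under docs-<profile>/.
    printf '%s\n' "$project/docs-$profile"
  else
    # Default renders are expected under _site/ inside the project.
    printf '%s\n' "$project/_site"
  fi
}
```

For example, `rendered_output_dir examples/navbar-basic reader-mode` prints examples/navbar-basic/docs-reader-mode, matching the example in the serve step.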

Use playwright-cli --help to discover available commands. See capture-agent.md for eval vs run-code guidance — use run-code for complex JS.
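For orientation, the Phase A exploration commands can be strung together as follows. This sketch only uses the playwright-cli invocations quoted in the steps above; the wrapper names maybe_run and phase_a_explore and the RUN switch are hypothetical. By default the commands are printed rather than executed, so the sequence can be reviewed first.

```shell
# Echo a command instead of running it unless RUN=1 is set
# (hypothetical dry-run switch, not part of the tooling).
maybe_run() {
  if [ "${RUN:-0}" = 1 ]; then "$@"; else printf '%s\n' "$*"; fi
}

# Open the page, snapshot its structure, resize, and take a test screenshot.
# $1: URL being served; $2, $3: viewport width and height; $4: output file.
phase_a_explore() {
  url="$1"; width="$2"; height="$3"; out="$4"
  maybe_run playwright-cli -s=screenshot open --headed "$url"
  maybe_run playwright-cli -s=screenshot snapshot
  maybe_run playwright-cli -s=screenshot resize "$width" "$height"
  maybe_run playwright-cli -s=screenshot screenshot --filename="$out"
}
```

A dry run such as `phase_a_explore "$URL" 1440 400 "$TMPDIR/test.png"` prints the four commands in order; cleanup evals and interactions still need to be tested by hand between the resize and the screenshot.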

When stuck: Chrome DevTools MCP (only if available)

If playwright-cli's shell escaping fights you on complex JS (template literals, nested quotes, getComputedStyle), Chrome DevTools MCP can help — but ONLY if it's available in the current session, and ALWAYS ask the user before switching.

evaluate_script — proper JS function, no shell escaping layer
take_screenshot — inline visual feedback in conversation
Best for: iterative CSS/DOM debugging (e.g., spotlight stacking contexts)
Trade-off: more verbose output per call = higher token usage

Never switch to Chrome DevTools MCP proactively. Suggest it as an option and let the user decide.

Phase B: Image processing (how to post-process)

Phase B starts after the user approves the visual in Phase A and a manifest entry exists. Now run the automated capture pipeline and tune post-processing.

Add the manifest entry to _tools/screenshots/manifest.json
Run npm run validate to check the manifest entry
Run npm run capture -- --name <name> to produce the screenshot
Show the user the output — ask them to verify visually
If blank space remains, decide with the user:
- Uniform background edges? → add "trim": true
- Vertical rules or multi-color edges? → add "cropBottom": N or "maxHeight": N
- Both? → trim runs first, then crop
Re-capture and verify until the user is satisfied
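The validate-then-capture part of Phase B can be sketched as a function in which a failed validation blocks the capture. The function name phase_b and the NPM override (for dry runs) are hypothetical; the two npm scripts are the ones named in the steps above.

```shell
# Validate the manifest, then capture one screenshot by name.
# NPM is a hypothetical override, e.g. NPM=echo for a dry run.
phase_b() {
  name="$1"
  # Stop early if the manifest entry does not validate.
  ${NPM:-npm} run validate || {
    echo "manifest validation failed; fix the entry before capturing" >&2
    return 1
  }
  ${NPM:-npm} run capture -- --name "$name"
}
```

After each run, the output still goes to the user for visual review, and any trim or crop change in the manifest means running the function again.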

Launching the capture agent:

Use the Agent tool with subagent_type="general-purpose" and model="sonnet". Pass:

The base URL where the site is being served
The capture agent reference (from ${CLAUDE_SKILL_DIR}/capture-agent.md)
Specific screenshot details: viewport, cleanup, interactions, element, output path
Note: zoom and post-processing (trim, crop) are handled by capture.js, not the agent. If the agent captures manually, it should apply zoom via page.evaluate(z => document.body.style.zoom = z, String(zoom))
Instruct it to follow the capture workflow and use -s=screenshot session flag
