Run any Skill in Manus with one click

user-test

Use when validating a cross-machine HUD flow where Butler deploys/runs the full Windows app over SSH+SCP (tailnet default host), then publishes configurable test messages to HUD zones via MCP `publish_to_zone`.

Run Skill in Manus

Overview

Install command

npx skills add https://github.com/Tzeusy/tze-hud --skill user-test

Copy and paste this command into Claude Code to install the skill

Source

Tzeusy/tze-hud

Stars0

Forks0

UpdatedMay 11, 2026 at 16:14

File Explorer

50 files

SKILL.md

readonly

User Test

Run an automation-first cross-machine validation loop:

Deploy a full Windows application .exe from Linux.
Launch it on Windows over SSH/SCP.
Verify MCP HTTP is reachable.
Publish configurable zone test messages.
Publish configurable widget test messages.

Required Inputs

Collect these before executing:

full_app_exe (required): absolute/relative Linux path to the full application .exe to deploy
package (optional fallback): cargo package/bin crate to build only if explicitly requested
target (default: x86_64-pc-windows-gnu)
profile (release or debug, default: release)
win_user (default: hudbot)
win_host (default: tzehouse-windows.parrot-hen.ts.net)
ssh_key_path (default in local environment: ~/.ssh/ecdsa_home)
task_name (default: TzeHudOverlay)
mcp_http_url (default: http://tzehouse-windows.parrot-hen.ts.net:9090)
mcp_psk_env (default: MCP_TEST_PSK)
messages: array of zone publishes

Message shape — content is either a plain string (StreamText) or a typed JSON object:

[
  {
    "zone_name": "alert-banner",
    "content": "Deploy v2.1.0 started",
    "ttl_us": 30000000,
    "namespace": "butler-test"
  },
  {
    "zone_name": "subtitle",
    "content": "Running integration tests...",
    "ttl_us": 60000000
  },
  {
    "zone_name": "status-bar",
    "content": {"type": "status_bar", "entries": {"build": "passing", "agent": "butler", "target": "windows"}},
    "merge_key": "build-status",
    "ttl_us": 120000000,
    "namespace": "butler-test"
  },
  {
    "zone_name": "notification-area",
    "content": {"type": "notification", "text": "Build complete", "icon": "", "urgency": 1},
    "ttl_us": 10000000
  },
  {
    "zone_name": "ambient-background",
    "content": {"type": "solid_color", "r": 0.1, "g": 0.15, "b": 0.4, "a": 0.05},
    "ttl_us": 300000000
  },
  {
    "zone_name": "pip",
    "content": {"type": "solid_color", "r": 0.2, "g": 0.8, "b": 0.2, "a": 0.05},
    "ttl_us": 60000000
  }
]

Content types by zone:

alert-banner, subtitle: plain string (StreamText)
status-bar: {"type":"status_bar","entries":{"key":"value",...}} with merge_key
notification-area: {"type":"notification","text":"...","icon":"","urgency":0-3,"title":"...","actions":[...]} (title and actions optional)
ambient-background, pip: {"type":"solid_color","r":0-1,"g":0-1,"b":0-1,"a":0-1}

merge_key, ttl_us, and namespace are optional per message.

widget_messages: array of widget publishes (optional)

Widget message shape:

[
  {
    "widget_name": "gauge",
    "params": {"level": 0.75, "label": "CPU Usage"},
    "transition_ms": 500,
    "ttl_us": 60000000,
    "namespace": "user-test"
  },
  {
    "action": "clear",
    "widget_name": "gauge",
    "namespace": "user-test"
  }
]

Widget parameter types:

f32: JSON number (e.g. 0.75) — often with min/max range
string: JSON string (e.g. "CPU Usage")
color: JSON object {"r": 0-1, "g": 0-1, "b": 0-1, "a": 0-1}
enum: JSON string from allowed values (e.g. "warning")

transition_ms, ttl_us, namespace, and instance_id are optional per message.

widget_name semantics: instance name, not type name

widget_name in publish_to_widget identifies a widget instance, not a widget type. When the HUD starts, instances are created from [[tabs.widgets]] entries in the config, each with an instance_id. That instance_id is the string you pass as widget_name.

For the production tze_hud_app deployment (see app/tze_hud_app/config/production.toml):

`widget_name`	Widget type	What it shows
`main-gauge`	`gauge`	Vertical fill gauge (level, label, severity)
`main-progress`	`progress-bar`	Horizontal progress bar (progress, label)
`main-status`	`status-indicator`	Status circle with label (online/away/busy/offline)

Use list_widgets to discover available instances:

python3 .claude/skills/user-test/scripts/publish_widget_batch.py \
  --url "$MCP_HTTP_URL" --psk-env MCP_TEST_PSK \
  --messages-file /dev/null --list-widgets

list_widgets returns widget_instances (with instance_name) — use those names as widget_name. If list_widgets returns no instances, the HUD binary is running without a config that declares instances.

Windows Media Ingress Exemplar

Use this lane only with app/tze_hud_app/config/windows-media-ingress.toml, which explicitly enables the one-stream media-pip surface and grants windows-local-media-producer the media_ingress capability.

HUD media-ingress proof uses a self-owned/local synthetic video source. It is video-only and does not route audio:

TZE_HUD_PSK="$PSK" python3 .claude/skills/user-test/scripts/windows_media_ingress_exemplar.py \
  local-producer \
  --target tzehouse-windows.parrot-hen.ts.net:50051 \
  --agent-id windows-local-media-producer \
  --zone-name media-pip \
  --source-label synthetic-color-bars \
  --hold-s 30 \
  --evidence-json build/windows-media-ingress/local-producer-evidence.json

YouTube source evidence is separate from HUD frame-ingress proof. Launch video ID O0FGCxkHM-U through the official embedded-player URL; do not bridge raw YouTube frames into the HUD runtime:

python3 .claude/skills/user-test/scripts/windows_media_ingress_exemplar.py \
  youtube-sidecar \
  --windows-host tzehouse-windows.parrot-hen.ts.net \
  --windows-user tzeus \
  --ssh-key ~/.ssh/ecdsa_home \
  --evidence-json build/windows-media-ingress/youtube-source-evidence.json

Record the policy boundary before validation:

python3 .claude/skills/user-test/scripts/windows_media_ingress_exemplar.py \
  policy-review \
  --evidence-json build/windows-media-ingress/policy-review.json

The media exemplar must not introduce yt-dlp, direct YouTube media URL extraction, downloads, a browser/WebView node inside the compositor, raw YouTube frame bridging, or a YouTube audio route into the HUD runtime.

Workflow

Step 0: SSH Connectivity Gate

Verify key auth for both users (Linux):

ssh -o BatchMode=yes -o IdentitiesOnly=yes -i ~/.ssh/ecdsa_home \
  hudbot@tzehouse-windows.parrot-hen.ts.net "whoami"
ssh -o BatchMode=yes -o IdentitiesOnly=yes -i ~/.ssh/ecdsa_home \
  tzeus@tzehouse-windows.parrot-hen.ts.net "whoami"

Both must succeed. hudbot is used for file deployment (SCP). tzeus is used for process control (kill, scheduled task trigger) because tzeus owns the interactive desktop session.

Step 1: Deploy (SCP via hudbot)

Copy the prebuilt .exe to the Windows host:

# Kill any running instance first (must use tzeus — hudbot can't kill it)
ssh -i ~/.ssh/ecdsa_home -o BatchMode=yes -o StrictHostKeyChecking=no \
  tzeus@tzehouse-windows.parrot-hen.ts.net "taskkill /F /IM tze_hud.exe"
sleep 2

# SCP the exe (via hudbot)
scp -i ~/.ssh/ecdsa_home -o BatchMode=yes -o StrictHostKeyChecking=no \
  /path/to/tze_hud.exe \
  hudbot@tzehouse-windows.parrot-hen.ts.net:C:/tze_hud/tze_hud.exe

Report: file size, checksum (sha256sum), remote path.

Step 2: Register + Launch (via tzeus)

The HUD must be launched via a scheduled task as tzeus with --window-mode overlay. This is critical for transparency — SSH-launched processes cannot access the desktop GPU, and run_hud.ps1 wrappers interfere with window creation.

# Register the overlay task (idempotent — safe to re-run)
ssh -i ~/.ssh/ecdsa_home -o BatchMode=yes -o StrictHostKeyChecking=no \
  tzeus@tzehouse-windows.parrot-hen.ts.net \
  "powershell -NoProfile -Command \"Register-ScheduledTask -TaskName 'TzeHudOverlay' \
    -Action (New-ScheduledTaskAction \
      -Execute 'C:\\tze_hud\\tze_hud.exe' \
      -Argument '--window-mode overlay' \
      -WorkingDirectory 'C:\\tze_hud') \
    -Settings (New-ScheduledTaskSettingsSet -AllowStartIfOnBatteries) \
    -Force\""

# Launch it
ssh -i ~/.ssh/ecdsa_home -o BatchMode=yes -o StrictHostKeyChecking=no \
  tzeus@tzehouse-windows.parrot-hen.ts.net \
  "schtasks /Run /TN TzeHudOverlay"

Transparency requirements (if the window is grey/opaque, one of these is wrong):

--window-mode overlay — fullscreen mode is intentionally opaque
Task runs as tzeus (the user logged into the console desktop)
Exe runs directly (no PowerShell/bat wrapper — wrapper windows break transparency)
NVIDIA driver 595.97+ on the Windows host
Commit must include with_no_redirection_bitmap(true), Vulkan forcing, PreMultiplied alpha

Step 2: MCP Reachability Gate

Require live MCP HTTP reachability before publish.

Default URL: http://tzehouse-windows.parrot-hen.ts.net:9090
If MCP HTTP is unreachable, stop and report launch/runtime mismatch.
Do not treat startup subtitle simulation as a substitute for MCP publish validation.

Step 3: Publish Configurable Zone Messages

Use scripts/publish_zone_batch.py from this skill.

Recommended sequence:

Generate a temporary JSON file with user-provided messages.
List zones first (--list-zones) for visibility.
Publish the full message batch.
Return per-message results (success/failure, ids, and errors).

Example:

python3 .claude/skills/user-test/scripts/publish_zone_batch.py \
  --url "$MCP_HTTP_URL" \
  --psk-env MCP_TEST_PSK \
  --messages-file /tmp/hud-zone-messages.json \
  --list-zones

Step 4: Publish Configurable Widget Messages

Use scripts/publish_widget_batch.py from this skill.

Recommended sequence:

Generate a temporary JSON file with user-provided widget messages.
List widgets first (--list-widgets) to discover available widget types and instances.
Publish the full widget message batch.
Return per-message results (success/failure, applied params, and errors).
For manual user-test runs that touch durable widget instances, clear them at the end with --cleanup-on-exit or the cleanup fixture below. This prevents stale widget state from remaining on the HUD after interrupted or partial tests.

Example:

python3 .claude/skills/user-test/scripts/publish_widget_batch.py \
  --url "$MCP_HTTP_URL" \
  --psk-env MCP_TEST_PSK \
  --messages-file /tmp/hud-widget-messages.json \
  --list-widgets \
  --cleanup-on-exit

If list_widgets returns no instances, skip widget publishing and report that no widgets are registered (the HUD binary may predate widget support).

To clear stale widget state explicitly, run:

python3 .claude/skills/user-test/scripts/publish_widget_batch.py \
  --url "$MCP_HTTP_URL" \
  --psk-env MCP_TEST_PSK \
  --messages-file .claude/skills/user-test/scripts/widget-cleanup.json

Step 5: Widget Reactivity Test (Gauge Cycling)

After the initial widget publish, cycle the gauge through a sequence of values with 3-second delays to verify widget reactivity (re-rasterization on param change).

Use scripts/gauge_cycle_test.json from this skill with --delay-ms 3000:

python3 .claude/skills/user-test/scripts/publish_widget_batch.py \
  --url "$MCP_HTTP_URL" \
  --psk-env MCP_TEST_PSK \
  --messages-file .claude/skills/user-test/scripts/gauge_cycle_test.json \
  --delay-ms 3000

The gauge should visually cycle through: blue 25% "Low" → yellow 50% "Medium" → red 95% "Critical!" → green 42% "Normal". Report per-step success and whether the user confirmed visual updates.

Step 6: Widget Reactivity Test (Status Indicator)

After the gauge cycle, confirm that a status-indicator widget instance is registered before proceeding. Use the --list-widgets output from Step 4 (or re-run it) to verify that a widget named status-indicator appears in the list. If no such instance is present, skip the sub-steps below and report that the status-indicator widget is not deployed.

Run the status-indicator enum cycle to verify discrete color binding and re-rasterization on param change.

Use scripts/status-indicator-enum-cycle-test.json from this skill with --delay-ms 1000:

python3 .claude/skills/user-test/scripts/publish_widget_batch.py \
  --url "$MCP_HTTP_URL" \
  --psk-env MCP_TEST_PSK \
  --messages-file .claude/skills/user-test/scripts/status-indicator-enum-cycle-test.json \
  --delay-ms 1000 \
  --cleanup-on-exit

The status indicator should visually cycle through:

online → green badge (#4FB543)
away → amber badge (#D97706)
busy → red badge (#DC2626)
offline → gray badge (#6B7280)

Each transition is a discrete snap (no interpolation). Require human visual confirmation that both color and glyph change per state.

Next, run the theme cycle to verify all three status-indicator visual themes are separately usable via the theme enum parameter:

python3 .claude/skills/user-test/scripts/publish_widget_batch.py \
  --url "$MCP_HTTP_URL" \
  --psk-env MCP_TEST_PSK \
  --messages-file .claude/skills/user-test/scripts/status-indicator-theme-cycle-test.json \
  --delay-ms 1200 \
  --cleanup-on-exit

Expected progression (same status=online, different theme):

minimal → small quiet dot/glyph treatment
system → bordered micro-badge (ops style)
friendly → softer circular badge (assistant style)

Require human confirmation that only one theme is visible at a time and each is visually distinct.

Next, run the label-update sequence to verify text-content binding:

python3 .claude/skills/user-test/scripts/publish_widget_batch.py \
  --url "$MCP_HTTP_URL" \
  --psk-env MCP_TEST_PSK \
  --messages-file .claude/skills/user-test/scripts/status-indicator-label-update-test.json \
  --delay-ms 1000 \
  --cleanup-on-exit

Expected label progression: "Butler" → "Codex" → (empty). The badge remains online/green. Label changes are primarily visible in the tooltip content (not always-on icon text); verify by hovering long enough to reveal the tooltip.

Finally, run the validation fixture to confirm invalid enum rejection at the MCP surface:

python3 .claude/skills/user-test/scripts/publish_widget_batch.py \
  --url "$MCP_HTTP_URL" \
  --psk-env MCP_TEST_PSK \
  --messages-file .claude/skills/user-test/scripts/status-indicator-validation-test.json

Expected result: MCP returns an error response (WIDGET_PARAMETER_INVALID_VALUE) for status=do-not-disturb. The widget display must not change. Report whether the error response matches expectation.

Step 7: Widget Reactivity Test (Progress Bar)

After the status-indicator tests, confirm that a progress-bar widget instance is registered before proceeding. Use the --list-widgets output from Step 4 (or re-run it) to verify that a widget named progress-bar appears in the list. If no such instance is present, skip the sub-steps below and report that the progress-bar widget is not deployed.

This is the progress-bar-widget user-test scenario. It animates a thin horizontal bar from 0 to 100% and confirms visual quality at each step.

7a: 7-Step Sequence

Run progress-bar-step.json with --delay-ms 1000 so the tester has ~1 second to observe each visual transition:

python3 .claude/skills/user-test/scripts/publish_widget_batch.py \
  --url "$MCP_HTTP_URL" \
  --psk-env MCP_TEST_PSK \
  --messages-file .claude/skills/user-test/scripts/progress-bar-step.json \
  --delay-ms 1000 \
  --cleanup-on-exit

At each step, prompt the tester to confirm the expected visual state:

Step	Published params	What to confirm
1	`progress=0.0, label=""`	Bar is empty (zero width fill); no label text visible
2	`progress=0.25, label="25%"`	Fill animates smoothly to 25%; label reads "25%" centered on bar
3	`progress=0.5, label="50%"`	Fill animates smoothly to 50%; label reads "50%"
4	`progress=0.75, label="75%"`	Fill animates smoothly to 75%; label reads "75%"
5	`progress=1.0, label="100%"`	Fill animates smoothly to full width; label reads "100%"
6	`fill_color={r:0.0, g:0.784, b:0.325, a:1.0}`	Fill color transitions from blue to green (equivalent to RGBA `[0,200,83,255]`) over 300ms; progress/label unchanged
7	clear	Bar resets to empty with no visual artifacts

Human acceptance criteria at each step:

(a) Pill/capsule shape — The bar has visually rounded end-caps on both the track and the fill. No sharp corners.
(b) Smooth fill animation — Each step 2-5 fills with a visible 200ms animation. No jumps or jank.
(c) Centered label — Label text is horizontally and vertically centered on the bar at all non-empty steps.
(d) Correct fill color — Steps 1-5 use the accent blue (#4A9EFF or token override). Step 6 transitions to green.
(e) Clean reset — After the clear action, the bar is completely empty with no residual fill or label artifacts.

7b: Color Sweep (Optional)

Optionally, run the color-sweep fixture to validate color interpolation across the full spectrum with --delay-ms 1000:

python3 .claude/skills/user-test/scripts/publish_widget_batch.py \
  --url "$MCP_HTTP_URL" \
  --psk-env MCP_TEST_PSK \
  --messages-file .claude/skills/user-test/scripts/progress-bar-color-sweep.json \
  --delay-ms 1000 \
  --cleanup-on-exit

The bar cycles through: blue -> green -> yellow -> red -> blue (reset) -> clear (empty). Each transition should produce a visible smooth color animation over 300ms. Confirm that the fill color matches expectations at each step before the next publish fires, and that after the final clear action the bar is fully empty with no residual fill or label.

Report pass/fail per step. A step fails if the tester observes: missing animation, wrong color, misaligned label, missing rounded end-caps, or visible artifacts after the reset/clear-to-empty step.

7c: Rapid-Fire Stream Test (100 publishes / 5 seconds)

Use this fixture to simulate a dense progress-update stream and validate that the HUD stays responsive under frequent widget publishes.

python3 .claude/skills/user-test/scripts/publish_widget_batch.py \
  --url "$MCP_HTTP_URL" \
  --psk-env MCP_TEST_PSK \
  --messages-file .claude/skills/user-test/scripts/progress-bar-rapidfire-100-5s.json \
  --delay-ms 50 \
  --cleanup-on-exit --cleanup-delay-ms 3000

Fixture details (progress-bar-rapidfire-100-5s.json):

100 sequential updates (1% -> 100%)
publish cadence: 50ms between requests (~5s total sequence duration)
per-message transition: 45ms
fixed widget target: main-progress

Expected outcomes:

No MCP transport or validation errors across the 100 publishes.
Progress bar appears continuously animated without freezing/stalling.
Final visible state settles at 100%.
No visual artifacts in label text during rapid updates.

Subtitle Exemplar Scenario

Presence Card Exemplar Scenario

Use scripts/presence_card_exemplar.py to exercise the Presence Card raw-tile resident flow on a live HUD. This scenario uses the resident gRPC session stream, not the MCP zone/widget surface.

It drives the exact operator-visible lifecycle needed for the Presence Card manual proof path:

Start 3 resident sessions (agent-alpha, agent-beta, agent-gamma)
Create 3 stacked bottom-left cards
Wait 30s and rebuild all 3 cards with updated Last active text
Disconnect agent-gamma
Pause for badge/orphan observation
Wait for orphan grace expiry while agent-alpha and agent-beta continue
Finish with 2 remaining cards and a JSON transcript artifact

Implementation note: This scenario now uploads each 32x32 PNG avatar over the resident HudSession stream (ResourceUploadStart), then applies the returned ResourceId in the Presence Card StaticImageNode. The visual proof path therefore covers stacked cards, periodic text updates, disconnect/orphan observation, cleanup, and the real resident image-upload consumer contract.

CLI

python3 .claude/skills/user-test/scripts/presence_card_exemplar.py \
  --target tzehouse-windows.parrot-hen.ts.net:50051 \
  --psk-env TZE_HUD_PSK \
  --tab-height 1080 \
  --transcript-out test_results/presence-card-latest.json

Optional flags:

--update-wait-s (default 30) — first periodic content-update wait
--heartbeat-timeout-s (default 15) — heartbeat-timeout reference for manual observation
--orphan-grace-s (default 30) — orphan grace-period wait
--observe-badge-s (default 1.0) — badge observation pause after disconnect

Output

The script emits one JSON object per step to stdout and writes a transcript file by default to test_results/presence-card-latest.json.

Each step includes:

code — stable step identifier
title — short operator-facing label
action — what the script is doing
expected_visual — what the operator should confirm on screen
status — started or completed

Human Acceptance Criteria

Verify the visible sequence in order:

Step	Expected visual
Create	3 stacked cards visible in the bottom-left corner
Update	All 3 cards show `Last active: 30s ago`
Disconnect	Only `agent-gamma` disconnects
Orphan observe	Disconnect badge appears on `agent-gamma` only
Cleanup	`agent-gamma` disappears after grace expiry
Final state	`agent-alpha` and `agent-beta` remain at original positions with no reflow

This scenario is the repo-native execution surface for docs/exemplar-presence-card-user-test.md.

Use scripts/subtitle_exemplar.py to exercise the subtitle zone on a live HUD. The script validates streaming breakpoint reveal, single-line baseline rendering, multi-line word-wrap, rapid-replacement contention, and TTL auto-clear — all using the exemplar-test namespace.

CLI

python3 .claude/skills/user-test/scripts/subtitle_exemplar.py \
  --url http://tzehouse-windows.parrot-hen.ts.net:9090 \
  --psk-env TZE_HUD_PSK \
  --ttl 10000

Required: --url. Optional: --psk-env (default TZE_HUD_PSK), --ttl (ms, default 10000).

All messages are published to zone_name: "subtitle" with namespace: "exemplar-test".

Phases

Phase	What happens	Pause
1 — Streaming reveal	stream_text with breakpoints at word boundaries; compositor reveals word-by-word	TTL hold (10s default)
2 — Single line	"Hello world — exemplar subtitle test"; baseline rendering	4s
3 — Multi-line	Long text forcing word-wrap and backdrop sizing	4s
4 — Rapid replacement	3 publishes 100ms apart; only the third survives	3s
5 — TTL expiry	Subtitle with fixed 3s TTL; watch auto-clear fade-out	TTL + 0.3s safety + 1.0s margin + 2s confirmation (~6.3s total)
6 — Streaming repeat	Same streaming reveal again for final sign-off	TTL hold (10s default)

Human Acceptance Criteria

Verify each criterion visually during the run:

#	Criterion	Phase
AC1	White text with visible black outline on semi-transparent dark backdrop	2 (single line)
AC2	Text centered horizontally near bottom of screen	2 (single line)
AC3	Multi-line text wraps cleanly within backdrop bounds	3 (multi-line)
AC4	Rapid replacement transitions are smooth (no blank frames)	4 (rapid replace)
AC5	Content disappears after TTL with visible fade-out	5 (TTL expiry)
AC6	Streaming text reveals word-by-word	1 and 6 (streaming)

All six criteria must pass for the subtitle exemplar to be accepted.

Named Test Group: subtitle-full-sequence

subtitle-full-sequence.json is provided as a batch sequence file and can be invoked alongside other zone tests. Use with --delay-ms 4000 so each scenario group has time to render before the next publish fires:

python3 .claude/skills/user-test/scripts/publish_zone_batch.py \
  --url "$MCP_HTTP_URL" \
  --psk-env MCP_TEST_PSK \
  --messages-file .claude/skills/user-test/scripts/subtitle-full-sequence.json \
  --delay-ms 4000 \
  --list-zones

The sequence runs: single line → multi-line → rapid replacement (×3) → TTL expiry → streaming. All messages use namespace: "exemplar-test".

Use --delay-ms 100 when running subtitle-rapid-replace.json alone to exercise contention at a speed that actually triggers the latest-wins logic.

Subtitle payload shape

{"zone_name": "subtitle", "content": "Hello world", "ttl_us": 10000000, "namespace": "exemplar-test"}

For streaming with word-by-word breakpoints:

{
  "zone_name": "subtitle",
  "content": "The quick brown fox jumps over the lazy dog",
  "breakpoints": [3, 9, 15, 19, 25, 30, 34, 38],
  "ttl_us": 10000000,
  "namespace": "exemplar-test"
}

The breakpoints array contains byte offsets of word boundaries in content. The compositor reveals text progressively at each breakpoint at its own frame rate — the agent does not control reveal timing.

Notification Stack Exemplar Scenario

Use scripts/notification_exemplar.py to exercise the notification-area zone on a live HUD. The script simulates 3 agents (alpha, beta, gamma) publishing notifications with mixed urgency levels across 4 phases.

CLI

python3 .claude/skills/user-test/scripts/notification_exemplar.py \
  --url http://tzehouse-windows.parrot-hen.ts.net:9090 \
  --psk-env TZE_HUD_PSK \
  --ttl 8000

Required: --url. Optional: --psk-env (default TZE_HUD_PSK), --ttl (ms, default 8000).

Phases

Phase	What happens	Pause
1 — Initial burst	alpha (urgency 0), beta (urgency 1), gamma (urgency 2) published in order	2s
2 — Stack growth	alpha (urgency 3), beta (urgency 1) — stack reaches max_depth=5	2s
3 — TTL expiry	waits remaining phase-1 TTL plus ~650ms (150ms fade-out + 500ms margin) for phase-1 batch to auto-dismiss	1s
4 — Max depth eviction	6 rapid notifications; 1st is evicted instantly (no fade) when 6th arrives	3s

Visual Checklist (per phase)

Phase 1: Three notifications stacked newest-at-top. gamma (amber), beta (dark blue), alpha (dark gray) backdrops with 1px border and body-font text.

Phase 2: Five notifications. Top two are phase-2 publishes; bottom three are phase-1. All urgency-tinted correctly.

Phase 3: Phase-1 batch (urgency 0/1/2) has faded out; only 2 phase-2 notifications remain (urgency 3 and 1).

Phase 4: Exactly 5 notifications visible. "Burst A1" (oldest, urgency 0) is gone with no fade — evicted instantly. "Burst C6" is at top.

Notification payload shape

{
  "type": "notification",
  "text": "...",
  "icon": "...",
  "urgency": 0,
  "title": "Optional heading",
  "actions": [
    {"label": "Open", "callback_id": "open"},
    {"label": "Dismiss", "callback_id": "dismiss"}
  ]
}

Published via MCP publish_to_zone to notification-area zone with ttl_us derived from --ttl and namespace set to the simulated agent namespace (alpha, beta, or gamma).

Notification Full-Gamut Pass

After running notification_exemplar.py, run this additional batch to validate the full v1 notification visual surface: two-line layout (title + text), long-body containment, and action-button rows.

python3 .claude/skills/user-test/scripts/publish_zone_batch.py \
  --url http://tzehouse-windows.parrot-hen.ts.net:9090 \
  --psk-env TZE_HUD_PSK \
  --messages-file .claude/skills/user-test/scripts/notification-full-gamut.json \
  --delay-ms 250 \
  --list-zones

Coverage in notification-full-gamut.json:

urgency gamut: low (0), normal (1), urgent (2), critical (3)
two-line cards via title on all messages
long-body critical text to verify card-height containment
action rows: 2 actions on urgent card, 3 actions on critical card

Visual checks:

no body text should escape its card backdrop
urgency colors should progress low → normal → urgent → critical
action rows should appear inside the card near the bottom edge
stack ordering should remain newest-at-top under mixed payload shapes

Alert-Banner Exemplar Scenario

Use scripts/alert_banner_exemplar.py to exercise the alert-banner zone on a live HUD. The script publishes 3 alerts at increasing urgency levels with 3-second delays between each, validating urgency-driven visual differentiation and simultaneous multi-alert display.

CLI

python3 .claude/skills/user-test/scripts/alert_banner_exemplar.py \
  --url http://tzehouse-windows.parrot-hen.ts.net:9090 \
  --psk-env TZE_HUD_PSK \
  --ttl 15000

Required: --url. Optional: --psk-env (default TZE_HUD_PSK), --ttl (ms, default 15000).

Sequence

Step	Alert	Urgency	Text	Pause
1	Info	1	"Info: system nominal"	3s
2	Warning	2	"Warning: disk space low"	3s
3	Critical	3	"CRITICAL: security breach detected"	—

Visual Checklist

After all 3 publishes, the alert-banner zone should show all three alerts simultaneously:

Critical (red) at top — "CRITICAL: security breach detected"
Warning (amber) in middle — "Warning: disk space low"
Info (blue) at bottom — "Info: system nominal"

All three remain visible until their TTL elapses. Confirm urgency-derived color tinting is applied correctly at each level: blue for info (urgency=1), amber for warning (urgency=2), red for critical (urgency=3).

Alert payload shape

{"type": "notification", "text": "...", "icon": "", "urgency": 1}

Published via MCP publish_to_zone to alert-banner zone with ttl_us set to --ttl (ms) × 1000 (e.g. --ttl 15000 → ttl_us = 15000000), and namespace set to alert-<level> (e.g. alert-critical).

Status-Bar Exemplar Scenario

Use scripts/status_bar_exemplar.py to exercise the status-bar zone on a live HUD. The script simulates three independent agents (agent-weather, agent-power, agent-clock) publishing merge-keyed entries, validating multi-agent coexistence, key replacement, empty-value removal, and TTL-driven sweep.

CLI

python3 .claude/skills/user-test/scripts/status_bar_exemplar.py \
  --url http://tzehouse-windows.parrot-hen.ts.net:9090 \
  --psk-env TZE_HUD_PSK \
  --battery-ttl 5000

Required: --url. Optional: --psk-env (default TZE_HUD_PSK), --ttl (ms, default 60000 — long TTL for weather/time entries), --battery-ttl (ms, default 15000 — TTL for battery entry; long enough to survive steps 4/6/8 visual checks but expires during step 9).

10-Step Sequence

Step	Agent	Action	Pause
1	agent-weather	publish `weather` → `"72F Sunny"`	—
2	agent-power	publish `battery` → `"85%"` (short TTL)	—
3	agent-clock	publish `time` → `"3:42 PM"`	—
4	—	VISUAL CHECK: all 3 visible	3s
5	agent-weather	update `weather` → `"75F Cloudy"` (key replacement)	—
6	—	VISUAL CHECK: weather updated; battery/time unchanged	3s
7	agent-weather	publish empty value for `weather` (key removal)	—
8	—	VISUAL CHECK: weather gone; battery/time remain	3s
9	—	wait for battery TTL to expire (remaining TTL + 500ms sweep margin)	—
10	—	VISUAL CHECK: battery gone; time remains	3s

Visual Checklist

Step 4: Status bar shows all three key-value pairs in a horizontal row at the bottom edge of the display with a dark opaque backdrop. Entries display in monospace font using secondary text color. Order may vary by insertion time.

Step 6: Status bar still shows three entries. The weather value reads 75F Cloudy (replaced). battery: 85% and time: 3:42 PM are unchanged.

Step 8: Status bar shows two entries. The weather key is no longer visible (empty-value convention suppresses rendering). battery and time remain.

Step 10: Status bar shows one entry. The battery key has been swept by sweep_expired_zone_publications after its TTL elapsed. Only time: 3:42 PM remains visible.

Human Acceptance Criteria

#	Criterion	Step
AC1	Three distinct keys coexist — no key overwrites another	4
AC2	Key replacement updates only the target key's value	6
AC3	Empty-value publish removes that key from the visible display	8
AC4	TTL expiry removes the key without explicit publish	10
AC5	Chrome-layer bar is always visible above content tiles	all visual steps
AC6	Monospace font and dark opaque backdrop visible throughout	all visual steps

All six criteria must pass for the status-bar exemplar to be accepted.

Status-bar payload shape

{
  "zone_name": "status-bar",
  "content": {"type": "status_bar", "entries": {"weather": "72F Sunny"}},
  "merge_key": "weather",
  "ttl_us": 60000000,
  "namespace": "agent-weather"
}

Each agent uses a distinct namespace (agent-weather, agent-power, agent-clock). The merge_key matches the single entry key in entries. Published via MCP publish_to_zone.

Ambient Background Exemplar Scenario

Use scripts/ambient_background_exemplar.py to exercise the ambient-background zone on a live HUD. The script publishes across 4 phases: solid-color fill, latest-wins replacement, static-image placeholder, and rapid-replacement stress.

CLI

python3 .claude/skills/user-test/scripts/ambient_background_exemplar.py \
  --url http://tzehouse-windows.parrot-hen.ts.net:9090 \
  --psk-env TZE_HUD_PSK

Required: --url. Optional: --psk-env (default TZE_HUD_PSK).

Phases

Phase	What happens	Pause
1 — Dark blue	Publish `solid_color` dark navy blue (r=0.05, g=0.05, b=0.2)	3s
2 — Warm amber	Replace with warm amber (r=0.9, g=0.6, b=0.2); latest-wins Replace policy evicts dark blue	3s
3 — Static image	Publish `static_image` content type (64-char hex resource_id); runtime renders warm-gray placeholder in v1	2s
4 — Rapid replace	10 different solid colors in sequence without delay; query `list_zones` to confirm `has_content=true` and visually confirm the final color is bright green	—

Visual Checklist

Phase 1: Entire HUD background should turn dark navy blue. No content tiles are affected — background is behind all content-layer zones.

Phase 2: Background shifts instantly to warm amber. The previous dark blue must be gone (Replace policy: latest-wins, exactly 1 active publication).

Phase 3: Background changes to a warm-gray placeholder quad (v1 behavior — GPU texture upload is deferred). The zone must accept the publication without error.

Phase 4: After all 10 rapid publishes, the background should settle on bright green (last of the 10 colors). No other colors from the burst should bleed through. list_zones must report has_content=true for the ambient-background zone.

Background payload shapes

{"type": "solid_color", "r": 0.05, "g": 0.05, "b": 0.2, "a": 1.0}
{"type": "solid_color", "r": 0.9, "g": 0.6, "b": 0.2, "a": 1.0}
{"type": "static_image", "resource_id": "<64-char-hex-blake3-hash>"}

All published via MCP publish_to_zone to ambient-background zone with namespace set to ambient-test-p<N> per phase. TTL is omitted (defaults to persistent — auto_clear_ms=None on this zone) for phases 1–3.

Text Stream Portals Exemplar Scenario

Status: implementation complete, live user-test exemplar available. The phase-0 raw-tile pilot shipped via epic hud-t98e (see docs/reports/hud-t98e-text-stream-portals.md). All 13 normative requirements in openspec/specs/text-stream-portals/spec.md are covered by integration tests; gen-2 reconciliation (PR #441) confirmed 13/13 coverage. What remains is recorded manual visual sign-off.

Existing automated coverage (do not duplicate)

Integration tests in tests/integration/:

text_stream_portal_surface.rs — raw-tile pilot composition, bounded viewport, local-first scroll, ambient attention
text_stream_portal_adapter.rs — transport-agnostic seam, tmux and non-tmux adapter conformance, external adapter isolation
text_stream_portal_coalescing.rs — retained-window coherence under backpressure
text_stream_portal_governance.rs — redaction, safe-mode, freeze, orphan path, chrome exclusion

Evidence artifact: docs/evidence/text-stream-portals/validation-2026-04-16.md.

Phase-0 pilot shape (recap)

Resident raw-tile pilot — composed from existing text, solid-color, image, and hit-region primitives. No new dedicated node type.
Resident gRPC session — portal traffic rides the existing primary bidirectional HudSession stream. No second long-lived portal stream.
Content-layer surface — portal tile renders below chrome like any other content-layer zone. No chrome-hosted portal affordances.
External adapter, authenticated — local adapter processes pass through existing capability grants; no implicit local trust.

CLI

python3 .claude/skills/user-test/scripts/text_stream_portal_exemplar.py \
  --target tzehouse-windows.parrot-hen.ts.net:50051 \
  --psk-env TZE_HUD_PSK \
  --agent-id agent-alpha \
  --doc docs/exemplar-manual-review-checklist.md \
  --tab-width 1920 \
  --phases baseline,scroll \
  --baseline-hold-s 30 \
  --max-lines 80

By default the script explicitly releases its portal lease before closing the resident session, so normal exit and Ctrl-C cleanup remove the portal tiles without requiring a HUD restart. Use --leave-lease-on-exit only when deliberately testing orphan/grace behavior.

Optional --phases values:

Phase	What it exercises	Operator-visible proof
`baseline`	Two-pane INPUT/OUTPUT portal composition	Portal appears at right edge with header, composer, divider, transcript body, and footer
`scroll`	Transcript Interaction Contract	OUTPUT pane registers scroll, steps through transcript data, preserves mid-scroll window while tail lines append, then returns to latest output
`streaming`	Low-latency text interaction	OUTPUT body grows in ordered chunks
`rapid`	Coalescing coherence smoke	Rapid publish pressure does not collapse the retained window to one latest line
`diagnostic-input`	Live compositor/input path	Uses Windows OS input injection over SSH to click-focus the composer, drag the portal header, and wheel-scroll the OUTPUT pane

For diagnostic-input, --diagnostic-input-connect-timeout-s controls the SSH connect timeout separately from the overall injector timeout so unreachable Windows hosts fail fast.

Live Validation Axes

text_stream_portal_exemplar.py drives the resident raw-tile pilot against a live HUD and produces operator-visible proof for axes that integration tests cannot validate alone:

Axis	Spec requirement	What the operator should see
Streaming reveal	Low-Latency Text Interaction	Output arrives as ordered incremental updates, not snapshot replace
Local-first scroll	Transcript Interaction Contract	Scroll offset visibly updates before any adapter ack
Bounded viewport	Bounded Transcript Viewport	Retained window stays within on-screen bounds as transcript grows
Coalescing coherence	Coherent Transcript Coalescing	Under rapid-publish pressure, retained window never collapses to only latest line
Redaction	Governance, Privacy, and Override Compliance	Portal geometry preserved; transcript content suppressed under viewer policy
Safe mode	Governance, Privacy, and Override Compliance	Portal updates suspend under safe mode like other content surfaces
Orphan path	Governance, Privacy, and Override Compliance	Disconnected portal freezes at last coherent state; grace expiry removes it
Ambient attention	Ambient Portal Attention Defaults	Unread backlog does not auto-escalate interruption class

Out of scope for the live exemplar

Terminal-emulator rendering (ANSI, cursor positioning, PTY control)
Full transcript history storage in the scene graph
Chrome-hosted portal affordances or shell-owned portal controls
Portal-specific transport RPCs outside the primary session stream
Runtime ownership of external process or tmux lifecycle

Human Acceptance Criteria

The portal stays in the content layer and remains below chrome.
The OUTPUT pane text is readable and bounded within the portal.
During scroll, the visible output window advances in steps, appended tail lines do not force an unsolicited jump, and return-to-tail shows the newest output.
During streaming, output arrives incrementally rather than as a single snapshot replace.
During rapid, the pane remains coherent under fast updates.
During diagnostic-input, the JSON transcript includes input:focus-gained, drag:start/drag:end, and scroll:output checkpoints. The injector uses SetCursorPos, mouse events, wheel events, and SendInput Unicode text against the live overlay, so failures are runtime/input-path evidence rather than synthetic transcript success.
Manual review notes and any UX tweaks are recorded in docs/exemplar-manual-review-checklist.md row 11.

Behavior Rules

Use automation-first deploy/launch by default.
Default to full app deployment via SCP + scheduled task.
Use --package / cargo build only when explicitly requested.
Always pass non-interactive SSH/SCP flags (BatchMode, IdentitiesOnly) for automation safety.
Use hudbot for file operations (SCP, mkdir). Use tzeus for process control (taskkill, schtasks).
Always launch via TzeHudOverlay scheduled task with --window-mode overlay. Never launch via run_hud.ps1 or direct SSH exec — both produce grey opaque windows.
Never use Start-Process -RedirectStandardOutput — it sets CREATE_NO_WINDOW which breaks WS_EX_NOREDIRECTIONBITMAP transparency.
Require real MCP HTTP reachability before claiming publish-path success.
Never invent zone names or endpoint values; require user-provided values.
Treat any publish error as actionable; include exact response payload.
Keep messages configurable from the user prompt; do not hardcode content.
Assume Windows host defaults to tzehouse-windows.parrot-hen.ts.net unless user overrides.

name	user-test
description	Use when validating a cross-machine HUD flow where Butler deploys/runs the full Windows app over SSH+SCP (tailnet default host), then publishes configurable test messages to HUD zones via MCP `publish_to_zone`.

User Test

Run an automation-first cross-machine validation loop:

Deploy a full Windows application .exe from Linux.
Launch it on Windows over SSH/SCP.
Verify MCP HTTP is reachable.
Publish configurable zone test messages.
Publish configurable widget test messages.

Required Inputs

Collect these before executing:

full_app_exe (required): absolute/relative Linux path to the full application .exe to deploy
package (optional fallback): cargo package/bin crate to build only if explicitly requested
target (default: x86_64-pc-windows-gnu)
profile (release or debug, default: release)
win_user (default: hudbot)
win_host (default: tzehouse-windows.parrot-hen.ts.net)
ssh_key_path (default in local environment: ~/.ssh/ecdsa_home)
task_name (default: TzeHudOverlay)
mcp_http_url (default: http://tzehouse-windows.parrot-hen.ts.net:9090)
mcp_psk_env (default: MCP_TEST_PSK)
messages: array of zone publishes

Message shape — content is either a plain string (StreamText) or a typed JSON object:

[
  {
    "zone_name": "alert-banner",
    "content": "Deploy v2.1.0 started",
    "ttl_us": 30000000,
    "namespace": "butler-test"
  },
  {
    "zone_name": "subtitle",
    "content": "Running integration tests...",
    "ttl_us": 60000000
  },
  {
    "zone_name": "status-bar",
    "content": {"type": "status_bar", "entries": {"build": "passing", "agent": "butler", "target": "windows"}},
    "merge_key": "build-status",
    "ttl_us": 120000000,
    "namespace": "butler-test"
  },
  {
    "zone_name": "notification-area",
    "content": {"type": "notification", "text": "Build complete", "icon": "", "urgency": 1},
    "ttl_us": 10000000
  },
  {
    "zone_name": "ambient-background",
    "content": {"type": "solid_color", "r": 0.1, "g": 0.15, "b": 0.4, "a": 0.05},
    "ttl_us": 300000000
  },
  {
    "zone_name": "pip",
    "content": {"type": "solid_color", "r": 0.2, "g": 0.8, "b": 0.2, "a": 0.05},
    "ttl_us": 60000000
  }
]

Content types by zone:

alert-banner, subtitle: plain string (StreamText)
status-bar: {"type":"status_bar","entries":{"key":"value",...}} with merge_key
notification-area: {"type":"notification","text":"...","icon":"","urgency":0-3,"title":"...","actions":[...]} (title and actions optional)
ambient-background, pip: {"type":"solid_color","r":0-1,"g":0-1,"b":0-1,"a":0-1}

merge_key, ttl_us, and namespace are optional per message.

widget_messages: array of widget publishes (optional)

Widget message shape:

[
  {
    "widget_name": "gauge",
    "params": {"level": 0.75, "label": "CPU Usage"},
    "transition_ms": 500,
    "ttl_us": 60000000,
    "namespace": "user-test"
  },
  {
    "action": "clear",
    "widget_name": "gauge",
    "namespace": "user-test"
  }
]

Widget parameter types:

f32: JSON number (e.g. 0.75) — often with min/max range
string: JSON string (e.g. "CPU Usage")
color: JSON object {"r": 0-1, "g": 0-1, "b": 0-1, "a": 0-1}
enum: JSON string from allowed values (e.g. "warning")

transition_ms, ttl_us, namespace, and instance_id are optional per message.

widget_name semantics: instance name, not type name

For the production tze_hud_app deployment (see app/tze_hud_app/config/production.toml):

`widget_name`	Widget type	What it shows
`main-gauge`	`gauge`	Vertical fill gauge (level, label, severity)
`main-progress`	`progress-bar`	Horizontal progress bar (progress, label)
`main-status`	`status-indicator`	Status circle with label (online/away/busy/offline)

Use list_widgets to discover available instances:

python3 .claude/skills/user-test/scripts/publish_widget_batch.py \
  --url "$MCP_HTTP_URL" --psk-env MCP_TEST_PSK \
  --messages-file /dev/null --list-widgets

Windows Media Ingress Exemplar

HUD media-ingress proof uses a self-owned/local synthetic video source. It is video-only and does not route audio:

TZE_HUD_PSK="$PSK" python3 .claude/skills/user-test/scripts/windows_media_ingress_exemplar.py \
  local-producer \
  --target tzehouse-windows.parrot-hen.ts.net:50051 \
  --agent-id windows-local-media-producer \
  --zone-name media-pip \
  --source-label synthetic-color-bars \
  --hold-s 30 \
  --evidence-json build/windows-media-ingress/local-producer-evidence.json

YouTube source evidence is separate from HUD frame-ingress proof. Launch video ID O0FGCxkHM-U through the official embedded-player URL; do not bridge raw YouTube frames into the HUD runtime:

python3 .claude/skills/user-test/scripts/windows_media_ingress_exemplar.py \
  youtube-sidecar \
  --windows-host tzehouse-windows.parrot-hen.ts.net \
  --windows-user tzeus \
  --ssh-key ~/.ssh/ecdsa_home \
  --evidence-json build/windows-media-ingress/youtube-source-evidence.json

Record the policy boundary before validation:

python3 .claude/skills/user-test/scripts/windows_media_ingress_exemplar.py \
  policy-review \
  --evidence-json build/windows-media-ingress/policy-review.json

Workflow

Step 0: SSH Connectivity Gate

Verify key auth for both users (Linux):

ssh -o BatchMode=yes -o IdentitiesOnly=yes -i ~/.ssh/ecdsa_home \
  hudbot@tzehouse-windows.parrot-hen.ts.net "whoami"
ssh -o BatchMode=yes -o IdentitiesOnly=yes -i ~/.ssh/ecdsa_home \
  tzeus@tzehouse-windows.parrot-hen.ts.net "whoami"

Both must succeed. hudbot is used for file deployment (SCP). tzeus is used for process control (kill, scheduled task trigger) because tzeus owns the interactive desktop session.

Step 1: Deploy (SCP via hudbot)

Copy the prebuilt .exe to the Windows host:

# Kill any running instance first (must use tzeus — hudbot can't kill it)
ssh -i ~/.ssh/ecdsa_home -o BatchMode=yes -o StrictHostKeyChecking=no \
  tzeus@tzehouse-windows.parrot-hen.ts.net "taskkill /F /IM tze_hud.exe"
sleep 2

# SCP the exe (via hudbot)
scp -i ~/.ssh/ecdsa_home -o BatchMode=yes -o StrictHostKeyChecking=no \
  /path/to/tze_hud.exe \
  hudbot@tzehouse-windows.parrot-hen.ts.net:C:/tze_hud/tze_hud.exe

Report: file size, checksum (sha256sum), remote path.

Step 2: Register + Launch (via tzeus)

# Register the overlay task (idempotent — safe to re-run)
ssh -i ~/.ssh/ecdsa_home -o BatchMode=yes -o StrictHostKeyChecking=no \
  tzeus@tzehouse-windows.parrot-hen.ts.net \
  "powershell -NoProfile -Command \"Register-ScheduledTask -TaskName 'TzeHudOverlay' \
    -Action (New-ScheduledTaskAction \
      -Execute 'C:\\tze_hud\\tze_hud.exe' \
      -Argument '--window-mode overlay' \
      -WorkingDirectory 'C:\\tze_hud') \
    -Settings (New-ScheduledTaskSettingsSet -AllowStartIfOnBatteries) \
    -Force\""

# Launch it
ssh -i ~/.ssh/ecdsa_home -o BatchMode=yes -o StrictHostKeyChecking=no \
  tzeus@tzehouse-windows.parrot-hen.ts.net \
  "schtasks /Run /TN TzeHudOverlay"

Transparency requirements (if the window is grey/opaque, one of these is wrong):

--window-mode overlay — fullscreen mode is intentionally opaque
Task runs as tzeus (the user logged into the console desktop)
Exe runs directly (no PowerShell/bat wrapper — wrapper windows break transparency)
NVIDIA driver 595.97+ on the Windows host
Commit must include with_no_redirection_bitmap(true), Vulkan forcing, PreMultiplied alpha

Step 2: MCP Reachability Gate

Require live MCP HTTP reachability before publish.

Default URL: http://tzehouse-windows.parrot-hen.ts.net:9090
If MCP HTTP is unreachable, stop and report launch/runtime mismatch.
Do not treat startup subtitle simulation as a substitute for MCP publish validation.

Step 3: Publish Configurable Zone Messages

Use scripts/publish_zone_batch.py from this skill.

Recommended sequence:

Generate a temporary JSON file with user-provided messages.
List zones first (--list-zones) for visibility.
Publish the full message batch.
Return per-message results (success/failure, ids, and errors).

Example:

python3 .claude/skills/user-test/scripts/publish_zone_batch.py \
  --url "$MCP_HTTP_URL" \
  --psk-env MCP_TEST_PSK \
  --messages-file /tmp/hud-zone-messages.json \
  --list-zones

Step 4: Publish Configurable Widget Messages

Use scripts/publish_widget_batch.py from this skill.

Recommended sequence:

Generate a temporary JSON file with user-provided widget messages.
List widgets first (--list-widgets) to discover available widget types and instances.
Publish the full widget message batch.
Return per-message results (success/failure, applied params, and errors).
For manual user-test runs that touch durable widget instances, clear them at the end with --cleanup-on-exit or the cleanup fixture below. This prevents stale widget state from remaining on the HUD after interrupted or partial tests.

Example:

python3 .claude/skills/user-test/scripts/publish_widget_batch.py \
  --url "$MCP_HTTP_URL" \
  --psk-env MCP_TEST_PSK \
  --messages-file /tmp/hud-widget-messages.json \
  --list-widgets \
  --cleanup-on-exit

If list_widgets returns no instances, skip widget publishing and report that no widgets are registered (the HUD binary may predate widget support).

To clear stale widget state explicitly, run:

python3 .claude/skills/user-test/scripts/publish_widget_batch.py \
  --url "$MCP_HTTP_URL" \
  --psk-env MCP_TEST_PSK \
  --messages-file .claude/skills/user-test/scripts/widget-cleanup.json

Step 5: Widget Reactivity Test (Gauge Cycling)

After the initial widget publish, cycle the gauge through a sequence of values with 3-second delays to verify widget reactivity (re-rasterization on param change).

Use scripts/gauge_cycle_test.json from this skill with --delay-ms 3000:

python3 .claude/skills/user-test/scripts/publish_widget_batch.py \
  --url "$MCP_HTTP_URL" \
  --psk-env MCP_TEST_PSK \
  --messages-file .claude/skills/user-test/scripts/gauge_cycle_test.json \
  --delay-ms 3000

The gauge should visually cycle through: blue 25% "Low" → yellow 50% "Medium" → red 95% "Critical!" → green 42% "Normal". Report per-step success and whether the user confirmed visual updates.

Step 6: Widget Reactivity Test (Status Indicator)

Run the status-indicator enum cycle to verify discrete color binding and re-rasterization on param change.

Use scripts/status-indicator-enum-cycle-test.json from this skill with --delay-ms 1000:

python3 .claude/skills/user-test/scripts/publish_widget_batch.py \
  --url "$MCP_HTTP_URL" \
  --psk-env MCP_TEST_PSK \
  --messages-file .claude/skills/user-test/scripts/status-indicator-enum-cycle-test.json \
  --delay-ms 1000 \
  --cleanup-on-exit

The status indicator should visually cycle through:

online → green badge (#4FB543)
away → amber badge (#D97706)
busy → red badge (#DC2626)
offline → gray badge (#6B7280)

Each transition is a discrete snap (no interpolation). Require human visual confirmation that both color and glyph change per state.

Next, run the theme cycle to verify all three status-indicator visual themes are separately usable via the theme enum parameter:

python3 .claude/skills/user-test/scripts/publish_widget_batch.py \
  --url "$MCP_HTTP_URL" \
  --psk-env MCP_TEST_PSK \
  --messages-file .claude/skills/user-test/scripts/status-indicator-theme-cycle-test.json \
  --delay-ms 1200 \
  --cleanup-on-exit

Expected progression (same status=online, different theme):

minimal → small quiet dot/glyph treatment
system → bordered micro-badge (ops style)
friendly → softer circular badge (assistant style)

Require human confirmation that only one theme is visible at a time and each is visually distinct.

Next, run the label-update sequence to verify text-content binding:

python3 .claude/skills/user-test/scripts/publish_widget_batch.py \
  --url "$MCP_HTTP_URL" \
  --psk-env MCP_TEST_PSK \
  --messages-file .claude/skills/user-test/scripts/status-indicator-label-update-test.json \
  --delay-ms 1000 \
  --cleanup-on-exit

Finally, run the validation fixture to confirm invalid enum rejection at the MCP surface:

python3 .claude/skills/user-test/scripts/publish_widget_batch.py \
  --url "$MCP_HTTP_URL" \
  --psk-env MCP_TEST_PSK \
  --messages-file .claude/skills/user-test/scripts/status-indicator-validation-test.json

Step 7: Widget Reactivity Test (Progress Bar)

This is the progress-bar-widget user-test scenario. It animates a thin horizontal bar from 0 to 100% and confirms visual quality at each step.

7a: 7-Step Sequence

Run progress-bar-step.json with --delay-ms 1000 so the tester has ~1 second to observe each visual transition:

python3 .claude/skills/user-test/scripts/publish_widget_batch.py \
  --url "$MCP_HTTP_URL" \
  --psk-env MCP_TEST_PSK \
  --messages-file .claude/skills/user-test/scripts/progress-bar-step.json \
  --delay-ms 1000 \
  --cleanup-on-exit

At each step, prompt the tester to confirm the expected visual state:

Step	Published params	What to confirm
1	`progress=0.0, label=""`	Bar is empty (zero width fill); no label text visible
2	`progress=0.25, label="25%"`	Fill animates smoothly to 25%; label reads "25%" centered on bar
3	`progress=0.5, label="50%"`	Fill animates smoothly to 50%; label reads "50%"
4	`progress=0.75, label="75%"`	Fill animates smoothly to 75%; label reads "75%"
5	`progress=1.0, label="100%"`	Fill animates smoothly to full width; label reads "100%"
6	`fill_color={r:0.0, g:0.784, b:0.325, a:1.0}`	Fill color transitions from blue to green (equivalent to RGBA `[0,200,83,255]`) over 300ms; progress/label unchanged
7	clear	Bar resets to empty with no visual artifacts

Human acceptance criteria at each step:

(a) Pill/capsule shape — The bar has visually rounded end-caps on both the track and the fill. No sharp corners.
(b) Smooth fill animation — Each step 2-5 fills with a visible 200ms animation. No jumps or jank.
(c) Centered label — Label text is horizontally and vertically centered on the bar at all non-empty steps.
(d) Correct fill color — Steps 1-5 use the accent blue (#4A9EFF or token override). Step 6 transitions to green.
(e) Clean reset — After the clear action, the bar is completely empty with no residual fill or label artifacts.

7b: Color Sweep (Optional)

Optionally, run the color-sweep fixture to validate color interpolation across the full spectrum with --delay-ms 1000:

python3 .claude/skills/user-test/scripts/publish_widget_batch.py \
  --url "$MCP_HTTP_URL" \
  --psk-env MCP_TEST_PSK \
  --messages-file .claude/skills/user-test/scripts/progress-bar-color-sweep.json \
  --delay-ms 1000 \
  --cleanup-on-exit

Report pass/fail per step. A step fails if the tester observes: missing animation, wrong color, misaligned label, missing rounded end-caps, or visible artifacts after the reset/clear-to-empty step.

7c: Rapid-Fire Stream Test (100 publishes / 5 seconds)

Use this fixture to simulate a dense progress-update stream and validate that the HUD stays responsive under frequent widget publishes.

python3 .claude/skills/user-test/scripts/publish_widget_batch.py \
  --url "$MCP_HTTP_URL" \
  --psk-env MCP_TEST_PSK \
  --messages-file .claude/skills/user-test/scripts/progress-bar-rapidfire-100-5s.json \
  --delay-ms 50 \
  --cleanup-on-exit --cleanup-delay-ms 3000

Fixture details (progress-bar-rapidfire-100-5s.json):

100 sequential updates (1% -> 100%)
publish cadence: 50ms between requests (~5s total sequence duration)
per-message transition: 45ms
fixed widget target: main-progress

Expected outcomes:

No MCP transport or validation errors across the 100 publishes.
Progress bar appears continuously animated without freezing/stalling.
Final visible state settles at 100%.
No visual artifacts in label text during rapid updates.

Subtitle Exemplar Scenario

Presence Card Exemplar Scenario

Use scripts/presence_card_exemplar.py to exercise the Presence Card raw-tile resident flow on a live HUD. This scenario uses the resident gRPC session stream, not the MCP zone/widget surface.

It drives the exact operator-visible lifecycle needed for the Presence Card manual proof path:

Start 3 resident sessions (agent-alpha, agent-beta, agent-gamma)
Create 3 stacked bottom-left cards
Wait 30s and rebuild all 3 cards with updated Last active text
Disconnect agent-gamma
Pause for badge/orphan observation
Wait for orphan grace expiry while agent-alpha and agent-beta continue
Finish with 2 remaining cards and a JSON transcript artifact

CLI

python3 .claude/skills/user-test/scripts/presence_card_exemplar.py \
  --target tzehouse-windows.parrot-hen.ts.net:50051 \
  --psk-env TZE_HUD_PSK \
  --tab-height 1080 \
  --transcript-out test_results/presence-card-latest.json

Optional flags:

--update-wait-s (default 30) — first periodic content-update wait
--heartbeat-timeout-s (default 15) — heartbeat-timeout reference for manual observation
--orphan-grace-s (default 30) — orphan grace-period wait
--observe-badge-s (default 1.0) — badge observation pause after disconnect

Output

The script emits one JSON object per step to stdout and writes a transcript file by default to test_results/presence-card-latest.json.

Each step includes:

code — stable step identifier
title — short operator-facing label
action — what the script is doing
expected_visual — what the operator should confirm on screen
status — started or completed

Human Acceptance Criteria

Verify the visible sequence in order:

Step	Expected visual
Create	3 stacked cards visible in the bottom-left corner
Update	All 3 cards show `Last active: 30s ago`
Disconnect	Only `agent-gamma` disconnects
Orphan observe	Disconnect badge appears on `agent-gamma` only
Cleanup	`agent-gamma` disappears after grace expiry
Final state	`agent-alpha` and `agent-beta` remain at original positions with no reflow

This scenario is the repo-native execution surface for docs/exemplar-presence-card-user-test.md.

CLI

python3 .claude/skills/user-test/scripts/subtitle_exemplar.py \
  --url http://tzehouse-windows.parrot-hen.ts.net:9090 \
  --psk-env TZE_HUD_PSK \
  --ttl 10000

Required: --url. Optional: --psk-env (default TZE_HUD_PSK), --ttl (ms, default 10000).

All messages are published to zone_name: "subtitle" with namespace: "exemplar-test".

Phases

Phase	What happens	Pause
1 — Streaming reveal	stream_text with breakpoints at word boundaries; compositor reveals word-by-word	TTL hold (10s default)
2 — Single line	"Hello world — exemplar subtitle test"; baseline rendering	4s
3 — Multi-line	Long text forcing word-wrap and backdrop sizing	4s
4 — Rapid replacement	3 publishes 100ms apart; only the third survives	3s
5 — TTL expiry	Subtitle with fixed 3s TTL; watch auto-clear fade-out	TTL + 0.3s safety + 1.0s margin + 2s confirmation (~6.3s total)
6 — Streaming repeat	Same streaming reveal again for final sign-off	TTL hold (10s default)

Human Acceptance Criteria

Verify each criterion visually during the run:

#	Criterion	Phase
AC1	White text with visible black outline on semi-transparent dark backdrop	2 (single line)
AC2	Text centered horizontally near bottom of screen	2 (single line)
AC3	Multi-line text wraps cleanly within backdrop bounds	3 (multi-line)
AC4	Rapid replacement transitions are smooth (no blank frames)	4 (rapid replace)
AC5	Content disappears after TTL with visible fade-out	5 (TTL expiry)
AC6	Streaming text reveals word-by-word	1 and 6 (streaming)

All six criteria must pass for the subtitle exemplar to be accepted.

Named Test Group: subtitle-full-sequence

python3 .claude/skills/user-test/scripts/publish_zone_batch.py \
  --url "$MCP_HTTP_URL" \
  --psk-env MCP_TEST_PSK \
  --messages-file .claude/skills/user-test/scripts/subtitle-full-sequence.json \
  --delay-ms 4000 \
  --list-zones

The sequence runs: single line → multi-line → rapid replacement (×3) → TTL expiry → streaming. All messages use namespace: "exemplar-test".

Use --delay-ms 100 when running subtitle-rapid-replace.json alone to exercise contention at a speed that actually triggers the latest-wins logic.

Subtitle payload shape

{"zone_name": "subtitle", "content": "Hello world", "ttl_us": 10000000, "namespace": "exemplar-test"}

For streaming with word-by-word breakpoints:

{
  "zone_name": "subtitle",
  "content": "The quick brown fox jumps over the lazy dog",
  "breakpoints": [3, 9, 15, 19, 25, 30, 34, 38],
  "ttl_us": 10000000,
  "namespace": "exemplar-test"
}

Notification Stack Exemplar Scenario

CLI

python3 .claude/skills/user-test/scripts/notification_exemplar.py \
  --url http://tzehouse-windows.parrot-hen.ts.net:9090 \
  --psk-env TZE_HUD_PSK \
  --ttl 8000

Required: --url. Optional: --psk-env (default TZE_HUD_PSK), --ttl (ms, default 8000).

Phases

Phase	What happens	Pause
1 — Initial burst	alpha (urgency 0), beta (urgency 1), gamma (urgency 2) published in order	2s
2 — Stack growth	alpha (urgency 3), beta (urgency 1) — stack reaches max_depth=5	2s
3 — TTL expiry	waits remaining phase-1 TTL plus ~650ms (150ms fade-out + 500ms margin) for phase-1 batch to auto-dismiss	1s
4 — Max depth eviction	6 rapid notifications; 1st is evicted instantly (no fade) when 6th arrives	3s

Visual Checklist (per phase)

Phase 1: Three notifications stacked newest-at-top. gamma (amber), beta (dark blue), alpha (dark gray) backdrops with 1px border and body-font text.

Phase 2: Five notifications. Top two are phase-2 publishes; bottom three are phase-1. All urgency-tinted correctly.

Phase 3: Phase-1 batch (urgency 0/1/2) has faded out; only 2 phase-2 notifications remain (urgency 3 and 1).

Phase 4: Exactly 5 notifications visible. "Burst A1" (oldest, urgency 0) is gone with no fade — evicted instantly. "Burst C6" is at top.

Notification payload shape

{
  "type": "notification",
  "text": "...",
  "icon": "...",
  "urgency": 0,
  "title": "Optional heading",
  "actions": [
    {"label": "Open", "callback_id": "open"},
    {"label": "Dismiss", "callback_id": "dismiss"}
  ]
}

Published via MCP publish_to_zone to notification-area zone with ttl_us derived from --ttl and namespace set to the simulated agent namespace (alpha, beta, or gamma).

Notification Full-Gamut Pass

python3 .claude/skills/user-test/scripts/publish_zone_batch.py \
  --url http://tzehouse-windows.parrot-hen.ts.net:9090 \
  --psk-env TZE_HUD_PSK \
  --messages-file .claude/skills/user-test/scripts/notification-full-gamut.json \
  --delay-ms 250 \
  --list-zones

Coverage in notification-full-gamut.json:

urgency gamut: low (0), normal (1), urgent (2), critical (3)
two-line cards via title on all messages
long-body critical text to verify card-height containment
action rows: 2 actions on urgent card, 3 actions on critical card

Visual checks:

no body text should escape its card backdrop
urgency colors should progress low → normal → urgent → critical
action rows should appear inside the card near the bottom edge
stack ordering should remain newest-at-top under mixed payload shapes

Alert-Banner Exemplar Scenario

CLI

python3 .claude/skills/user-test/scripts/alert_banner_exemplar.py \
  --url http://tzehouse-windows.parrot-hen.ts.net:9090 \
  --psk-env TZE_HUD_PSK \
  --ttl 15000

Required: --url. Optional: --psk-env (default TZE_HUD_PSK), --ttl (ms, default 15000).

Sequence

Step	Alert	Urgency	Text	Pause
1	Info	1	"Info: system nominal"	3s
2	Warning	2	"Warning: disk space low"	3s
3	Critical	3	"CRITICAL: security breach detected"	—

Visual Checklist

After all 3 publishes, the alert-banner zone should show all three alerts simultaneously:

Critical (red) at top — "CRITICAL: security breach detected"
Warning (amber) in middle — "Warning: disk space low"
Info (blue) at bottom — "Info: system nominal"

Alert payload shape

{"type": "notification", "text": "...", "icon": "", "urgency": 1}

Status-Bar Exemplar Scenario

CLI

python3 .claude/skills/user-test/scripts/status_bar_exemplar.py \
  --url http://tzehouse-windows.parrot-hen.ts.net:9090 \
  --psk-env TZE_HUD_PSK \
  --battery-ttl 5000

10-Step Sequence

Step	Agent	Action	Pause
1	agent-weather	publish `weather` → `"72F Sunny"`	—
2	agent-power	publish `battery` → `"85%"` (short TTL)	—
3	agent-clock	publish `time` → `"3:42 PM"`	—
4	—	VISUAL CHECK: all 3 visible	3s
5	agent-weather	update `weather` → `"75F Cloudy"` (key replacement)	—
6	—	VISUAL CHECK: weather updated; battery/time unchanged	3s
7	agent-weather	publish empty value for `weather` (key removal)	—
8	—	VISUAL CHECK: weather gone; battery/time remain	3s
9	—	wait for battery TTL to expire (remaining TTL + 500ms sweep margin)	—
10	—	VISUAL CHECK: battery gone; time remains	3s

Visual Checklist

Step 6: Status bar still shows three entries. The weather value reads 75F Cloudy (replaced). battery: 85% and time: 3:42 PM are unchanged.

Step 8: Status bar shows two entries. The weather key is no longer visible (empty-value convention suppresses rendering). battery and time remain.

Step 10: Status bar shows one entry. The battery key has been swept by sweep_expired_zone_publications after its TTL elapsed. Only time: 3:42 PM remains visible.

Human Acceptance Criteria

#	Criterion	Step
AC1	Three distinct keys coexist — no key overwrites another	4
AC2	Key replacement updates only the target key's value	6
AC3	Empty-value publish removes that key from the visible display	8
AC4	TTL expiry removes the key without explicit publish	10
AC5	Chrome-layer bar is always visible above content tiles	all visual steps
AC6	Monospace font and dark opaque backdrop visible throughout	all visual steps

All six criteria must pass for the status-bar exemplar to be accepted.

Status-bar payload shape

{
  "zone_name": "status-bar",
  "content": {"type": "status_bar", "entries": {"weather": "72F Sunny"}},
  "merge_key": "weather",
  "ttl_us": 60000000,
  "namespace": "agent-weather"
}

Each agent uses a distinct namespace (agent-weather, agent-power, agent-clock). The merge_key matches the single entry key in entries. Published via MCP publish_to_zone.

Ambient Background Exemplar Scenario

CLI

python3 .claude/skills/user-test/scripts/ambient_background_exemplar.py \
  --url http://tzehouse-windows.parrot-hen.ts.net:9090 \
  --psk-env TZE_HUD_PSK

Required: --url. Optional: --psk-env (default TZE_HUD_PSK).

Phases

Phase	What happens	Pause
1 — Dark blue	Publish `solid_color` dark navy blue (r=0.05, g=0.05, b=0.2)	3s
2 — Warm amber	Replace with warm amber (r=0.9, g=0.6, b=0.2); latest-wins Replace policy evicts dark blue	3s
3 — Static image	Publish `static_image` content type (64-char hex resource_id); runtime renders warm-gray placeholder in v1	2s
4 — Rapid replace	10 different solid colors in sequence without delay; query `list_zones` to confirm `has_content=true` and visually confirm the final color is bright green	—

Visual Checklist

Phase 1: Entire HUD background should turn dark navy blue. No content tiles are affected — background is behind all content-layer zones.

Phase 2: Background shifts instantly to warm amber. The previous dark blue must be gone (Replace policy: latest-wins, exactly 1 active publication).

Phase 3: Background changes to a warm-gray placeholder quad (v1 behavior — GPU texture upload is deferred). The zone must accept the publication without error.

Background payload shapes

{"type": "solid_color", "r": 0.05, "g": 0.05, "b": 0.2, "a": 1.0}
{"type": "solid_color", "r": 0.9, "g": 0.6, "b": 0.2, "a": 1.0}
{"type": "static_image", "resource_id": "<64-char-hex-blake3-hash>"}

Text Stream Portals Exemplar Scenario

Existing automated coverage (do not duplicate)

Integration tests in tests/integration/:

text_stream_portal_surface.rs — raw-tile pilot composition, bounded viewport, local-first scroll, ambient attention
text_stream_portal_adapter.rs — transport-agnostic seam, tmux and non-tmux adapter conformance, external adapter isolation
text_stream_portal_coalescing.rs — retained-window coherence under backpressure
text_stream_portal_governance.rs — redaction, safe-mode, freeze, orphan path, chrome exclusion

Evidence artifact: docs/evidence/text-stream-portals/validation-2026-04-16.md.

Phase-0 pilot shape (recap)

Resident raw-tile pilot — composed from existing text, solid-color, image, and hit-region primitives. No new dedicated node type.
Resident gRPC session — portal traffic rides the existing primary bidirectional HudSession stream. No second long-lived portal stream.
Content-layer surface — portal tile renders below chrome like any other content-layer zone. No chrome-hosted portal affordances.
External adapter, authenticated — local adapter processes pass through existing capability grants; no implicit local trust.

CLI

python3 .claude/skills/user-test/scripts/text_stream_portal_exemplar.py \
  --target tzehouse-windows.parrot-hen.ts.net:50051 \
  --psk-env TZE_HUD_PSK \
  --agent-id agent-alpha \
  --doc docs/exemplar-manual-review-checklist.md \
  --tab-width 1920 \
  --phases baseline,scroll \
  --baseline-hold-s 30 \
  --max-lines 80

Optional --phases values:

Phase	What it exercises	Operator-visible proof
`baseline`	Two-pane INPUT/OUTPUT portal composition	Portal appears at right edge with header, composer, divider, transcript body, and footer
`scroll`	Transcript Interaction Contract	OUTPUT pane registers scroll, steps through transcript data, preserves mid-scroll window while tail lines append, then returns to latest output
`streaming`	Low-latency text interaction	OUTPUT body grows in ordered chunks
`rapid`	Coalescing coherence smoke	Rapid publish pressure does not collapse the retained window to one latest line
`diagnostic-input`	Live compositor/input path	Uses Windows OS input injection over SSH to click-focus the composer, drag the portal header, and wheel-scroll the OUTPUT pane

For diagnostic-input, --diagnostic-input-connect-timeout-s controls the SSH connect timeout separately from the overall injector timeout so unreachable Windows hosts fail fast.

Live Validation Axes

text_stream_portal_exemplar.py drives the resident raw-tile pilot against a live HUD and produces operator-visible proof for axes that integration tests cannot validate alone:

Axis	Spec requirement	What the operator should see
Streaming reveal	Low-Latency Text Interaction	Output arrives as ordered incremental updates, not snapshot replace
Local-first scroll	Transcript Interaction Contract	Scroll offset visibly updates before any adapter ack
Bounded viewport	Bounded Transcript Viewport	Retained window stays within on-screen bounds as transcript grows
Coalescing coherence	Coherent Transcript Coalescing	Under rapid-publish pressure, retained window never collapses to only latest line
Redaction	Governance, Privacy, and Override Compliance	Portal geometry preserved; transcript content suppressed under viewer policy
Safe mode	Governance, Privacy, and Override Compliance	Portal updates suspend under safe mode like other content surfaces
Orphan path	Governance, Privacy, and Override Compliance	Disconnected portal freezes at last coherent state; grace expiry removes it
Ambient attention	Ambient Portal Attention Defaults	Unread backlog does not auto-escalate interruption class

Out of scope for the live exemplar

Terminal-emulator rendering (ANSI, cursor positioning, PTY control)
Full transcript history storage in the scene graph
Chrome-hosted portal affordances or shell-owned portal controls
Portal-specific transport RPCs outside the primary session stream
Runtime ownership of external process or tmux lifecycle

Human Acceptance Criteria

The portal stays in the content layer and remains below chrome.
The OUTPUT pane text is readable and bounded within the portal.
During scroll, the visible output window advances in steps, appended tail lines do not force an unsolicited jump, and return-to-tail shows the newest output.
During streaming, output arrives incrementally rather than as a single snapshot replace.
During rapid, the pane remains coherent under fast updates.
During diagnostic-input, the JSON transcript includes input:focus-gained, drag:start/drag:end, and scroll:output checkpoints. The injector uses SetCursorPos, mouse events, wheel events, and SendInput Unicode text against the live overlay, so failures are runtime/input-path evidence rather than synthetic transcript success.
Manual review notes and any UX tweaks are recorded in docs/exemplar-manual-review-checklist.md row 11.

Behavior Rules

Use automation-first deploy/launch by default.
Default to full app deployment via SCP + scheduled task.
Use --package / cargo build only when explicitly requested.
Always pass non-interactive SSH/SCP flags (BatchMode, IdentitiesOnly) for automation safety.
Use hudbot for file operations (SCP, mkdir). Use tzeus for process control (taskkill, schtasks).
Always launch via TzeHudOverlay scheduled task with --window-mode overlay. Never launch via run_hud.ps1 or direct SSH exec — both produce grey opaque windows.
Never use Start-Process -RedirectStandardOutput — it sets CREATE_NO_WINDOW which breaks WS_EX_NOREDIRECTIONBITMAP transparency.
Require real MCP HTTP reachability before claiming publish-path success.
Never invent zone names or endpoint values; require user-provided values.
Treat any publish error as actionable; include exact response payload.
Keep messages configurable from the user prompt; do not hardcode content.
Assume Windows host defaults to tzehouse-windows.parrot-hen.ts.net unless user overrides.

user-test

More from this repository

User Test

Required Inputs

Windows Media Ingress Exemplar

Workflow

Step 0: SSH Connectivity Gate

Step 1: Deploy (SCP via hudbot)

Step 2: Register + Launch (via tzeus)

Step 2: MCP Reachability Gate

Step 3: Publish Configurable Zone Messages

Step 4: Publish Configurable Widget Messages

Step 5: Widget Reactivity Test (Gauge Cycling)

Step 6: Widget Reactivity Test (Status Indicator)

Step 7: Widget Reactivity Test (Progress Bar)

7a: 7-Step Sequence

7b: Color Sweep (Optional)

7c: Rapid-Fire Stream Test (100 publishes / 5 seconds)

Subtitle Exemplar Scenario

Presence Card Exemplar Scenario

CLI

Output

Human Acceptance Criteria

CLI

Phases

Human Acceptance Criteria

Named Test Group: subtitle-full-sequence

Subtitle payload shape

Notification Stack Exemplar Scenario

CLI

Phases

Visual Checklist (per phase)

Notification payload shape

Notification Full-Gamut Pass

Alert-Banner Exemplar Scenario

CLI

Sequence

Visual Checklist

Alert payload shape

Status-Bar Exemplar Scenario

CLI

10-Step Sequence

Visual Checklist

Human Acceptance Criteria

Status-bar payload shape

Ambient Background Exemplar Scenario

CLI

Phases

Visual Checklist

Background payload shapes

Text Stream Portals Exemplar Scenario

Existing automated coverage (do not duplicate)

Phase-0 pilot shape (recap)

CLI

Live Validation Axes

Out of scope for the live exemplar

Human Acceptance Criteria

Behavior Rules

User Test

Required Inputs

Windows Media Ingress Exemplar

Workflow

Step 0: SSH Connectivity Gate

Step 1: Deploy (SCP via hudbot)

Step 2: Register + Launch (via tzeus)

Step 2: MCP Reachability Gate

Step 3: Publish Configurable Zone Messages

Step 4: Publish Configurable Widget Messages

Step 5: Widget Reactivity Test (Gauge Cycling)

Step 6: Widget Reactivity Test (Status Indicator)

Step 7: Widget Reactivity Test (Progress Bar)

7a: 7-Step Sequence

7b: Color Sweep (Optional)

7c: Rapid-Fire Stream Test (100 publishes / 5 seconds)

Subtitle Exemplar Scenario

Presence Card Exemplar Scenario

CLI

Output

Human Acceptance Criteria

CLI