Name: Kane Cli
Author: LambdaTest

// Browser automation via kane-cli — run objectives, parse NDJSON output, inspect logs, report bugs. Use for any task requiring a real browser (navigate, click, fill forms, test web UI, take screenshots).

stars:212

forks:17

updated:May 14, 2026 at 10:33

package.json

"author": "LambdaTest"

"repository": "LambdaTest/kane-cli"

name	kane-cli
description	Browser automation via kane-cli — run objectives, parse NDJSON output, inspect logs, report bugs. Use for any task requiring a real browser (navigate, click, fill forms, test web UI, take screenshots).

Kane CLI — Browser Automation Skill

Use kane-cli for any task that requires a real browser: navigating websites, clicking elements, filling forms, searching, testing web UI, taking screenshots, or verifying deployments.

Do NOT use Playwright, Puppeteer, or Selenium directly. kane-cli manages Chrome, auth, and the AI automation agent.

Always run with --agent flag. This gives structured NDJSON output that you parse and present to the user with rich formatting.

1. Decision Tree

When the user's request involves a browser, follow this flow:

Is kane-cli installed?
├─ Unknown → Check with kane-cli --version
├─ No → npm install -g @testmuai/kane-cli, then §2
└─ Yes ↓

Is kane-cli set up?
├─ Unknown → Run kane-cli whoami to check auth status
├─ No → Go to §2 (Pre-flight Setup)
└─ Yes ↓

What does the user want?
├─ Single browser task → Build one kane-cli run --agent command (§3, §4)
├─ Test/verify something → Same, but use assertion objectives (§4)
├─ Extract data from a page → Same, but use "store as" extraction pattern (§4)
├─ Save / re-run / commit the test → Use kane-cli testmd (§7)
├─ Multiple independent tasks → Decompose into sub-objectives, run in parallel via Agent tool (§9)
├─ Debug a failed run → Inspect logs (§8)
└─ Configure kane-cli → Run config commands (§10)

After every run:

Parse the NDJSON output (§5)
Present rich results with emojis (§6)
If failed, inspect logs and diagnose (§8)

2. Pre-flight Setup

Before first use, verify installation and auth.

Install

npm install -g @testmuai/kane-cli

Check Auth Status

kane-cli whoami

If this shows "not configured" or errors, run login:

Login (Basic Auth)

kane-cli login --username <user> --access-key <key>

This creates the default profile with basic auth, auto-selects the KaneAI project, and marks setup complete. Credentials come from the user's TestmuAI dashboard (Settings → Keys).

Optional flag:

--profile <name> — profile name (default: the last selected profile; check it with kane-cli config show)

Login (OAuth)

kane-cli login --oauth

This opens the browser for OAuth consent and waits for the callback. Works in both TTY and non-TTY (agent) mode.

Login (Interactive — TTY only)

In a terminal, run kane-cli login with no flags for the interactive wizard (auth method → project picker → folder picker). If the user needs this, ask them to run it directly:

Please run ! kane-cli login and complete the sign-in.

Verify

kane-cli whoami          # Auth status
kane-cli config show     # Current configuration

3. Building the Command

Every run uses this pattern:

kane-cli run "<objective>" --agent [options]

--agent is mandatory — it outputs structured NDJSON that you parse and present to the user.

Flags

Flag	Purpose	Default
`--headless`	No visible browser window	Off (browser visible)
`--max-steps <n>`	Limit agent reasoning steps	30
`--timeout <s>`	Kill run after N seconds	No limit
`--variables <json>`	Inline variables JSON	None
`--variables-file <path>`	Load variables from a JSON file	None
`--global-context <file>`	Override global agent context markdown	`~/.testmuai/kaneai/global-memory.md`
`--local-context <file>`	Override local project context markdown	`.testmuai/context.md`
`--ws-endpoint <url>`	Remote browser via WebSocket (e.g. LambdaTest grid)	Local Chrome
`--cdp-endpoint <url>`	Connect to existing Chrome via CDP	Auto-launch Chrome
`--code-export`	Generate code export after upload	Off

Exit Codes

Code	Meaning
0	✅ Passed
1	❌ Failed
2	⚠️ Error (auth, setup, infra)
3	⏱️ Timeout or cancelled
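A minimal sketch of turning the exit code into a plain-language label, assuming the caller already has the integer exit status of a finished run. The names `EXIT_MEANINGS` and `describe_exit` are hypothetical and only illustrate the table above.

```python
# Meanings copied from the exit-code table above.
EXIT_MEANINGS = {
    0: "passed",
    1: "failed",
    2: "error (auth, setup, infra)",
    3: "timeout or cancelled",
}


def describe_exit(code: int) -> str:
    """Return a plain-language label for a kane-cli exit code."""
    # Codes outside the documented table are reported as unknown rather than guessed.
    return EXIT_MEANINGS.get(code, f"unknown exit code {code}")
```

Treat the exit code as a coarse signal only; the details of what happened still come from the parsed output (§5).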

Variables

Variables parameterize objectives with reusable values and secrets. Use {{key}} syntax in objectives.

Format:

{
  "username": { "value": "alice", "secret": false },
  "password": { "value": "s3cret!", "secret": true }
}

secret: true masks the value in logs and routes it to TestmuAI's secrets store instead of syncing it as a plain TMS variable.

Loading order (later wins):

~/.testmuai/kaneai/variables/*.json (global, alphabetical)
{cwd}/.testmuai/variables/*.json (local project overrides)
--variables-file <path>
--variables '{...}' (inline JSON)
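The "later wins" rule can be modeled as a simple layered merge. This is a sketch under an assumption: it replaces a whole variable entry when a later layer defines the same key, and the source does not say whether kane-cli merges at that granularity. `merge_variable_layers` is a hypothetical helper, not part of the CLI.

```python
def merge_variable_layers(*layers: dict) -> dict:
    """Merge variable dicts in load order; later layers override earlier keys."""
    merged: dict = {}
    for layer in layers:
        # Assumption: a later layer replaces the entire entry for a key.
        merged.update(layer)
    return merged


# Layers listed in the documented order: global, local, then inline.
global_vars = {"username": {"value": "alice", "secret": False}}
local_vars = {"username": {"value": "bob", "secret": False}}
inline_vars = {"password": {"value": "s3cret!", "secret": True}}

effective = merge_variable_layers(global_vars, local_vars, inline_vars)
```

Here the local project file overrides the global `username`, and the inline `password` is added on top.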

Always parameterize: credentials, API keys, tokens, environment-specific URLs. OK to hardcode: one-off URLs, static UI text, navigation paths.

Context Files

Context files provide additional instructions to the agent:

Global: ~/.testmuai/kaneai/global-memory.md — shared across all runs
Local: .testmuai/context.md in cwd — project-specific

Override per-run with --global-context / --local-context flags.

Examples

# Simple browser task
kane-cli run "Go to https://www.amazon.in and search for 'laptop'" --agent

# Headless with timeout
kane-cli run "Go to https://app.example.com and verify login page loads" --agent --headless --timeout 60

# With variables
kane-cli run "Go to https://app.example.com and login with {{username}} and {{password}}" --agent \
  --variables '{"username": {"value": "alice"}, "password": {"value": "secret123", "secret": true}}'

# Remote browser (LambdaTest grid)
kane-cli run "Go to https://shop.example.com and add item to cart" --agent \
  --ws-endpoint "wss://cdp.lambdatest.com/playwright?capabilities=..."

# With variables file
kane-cli run "Go to https://staging.myapp.com, login and verify dashboard" --agent \
  --variables-file ./test-creds.json --headless --timeout 120
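When the command is assembled programmatically, building an argument list avoids shell-quoting mistakes in the objective and the variables JSON. The sketch below uses only flags from the table above; `build_run_command` is a hypothetical helper and nothing here executes the CLI.

```python
import json
import shlex


def build_run_command(objective, *, headless=False, timeout=None, variables=None):
    """Build an argv list for `kane-cli run`, always including --agent."""
    argv = ["kane-cli", "run", objective, "--agent"]
    if headless:
        argv.append("--headless")
    if timeout is not None:
        argv += ["--timeout", str(timeout)]
    if variables:
        # Inline variables are passed as a single JSON argument.
        argv += ["--variables", json.dumps(variables)]
    return argv


argv = build_run_command(
    "Go to https://app.example.com and login with {{username}} and {{password}}",
    headless=True,
    timeout=60,
    variables={
        "username": {"value": "alice"},
        "password": {"value": "secret123", "secret": True},
    },
)
# shlex.join shows the equivalent, safely quoted shell command line.
print(shlex.join(argv))
```

The list form can be handed to a process launcher as-is, so the objective never needs manual escaping.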

4. Writing Objectives

The objective string is the most important input. How you phrase it determines what the agent does.

Three Patterns

Pattern	Trigger Phrases	Agent Behavior
🎯 Action	"go to", "click", "type", "search", "fill", "scroll"	Performs browser actions
✅ Assertion	"assert", "verify", "confirm", "check that"	Validates a condition (pass/fail)
📦 Extraction	"store X as 'name'"	Reads a value from the page and persists it in structured output

Extraction: The "store as" Pattern

Critical. Vague phrasing like "read", "report", or "tell me" does NOT reliably extract data. The agent may observe the value visually but won't persist it in structured output.

❌ Bad — agent looks but doesn't capture:

"go to example.com and read the page title"
"go to example.com and tell me the price"

✅ Good — agent extracts and persists in final_state:

"go to example.com, store the page title as 'page_title'"
"go to example.com, store the price of the first item as 'price'"

Stored values appear in the run_end event's final_state and context.memory fields.

Combining Patterns

Chain action → extraction → assertion in a single objective:

"go to {{app_url}}/dashboard,
 store the welcome message as 'welcome_text',
 store the user role in the sidebar as 'role',
 assert the role is 'Admin'"

Assertion Specificity

Type	Example
Exact match	`"assert the cart total shows '$29.99'"`
Flexible match	`"assert a price is displayed for each product"`
State	`"assert the Submit button is disabled until all fields are filled"`
Conditional	`"if a cookie banner appears, dismiss it, then assert the homepage loads"`
Negative	`"assert no error message or red banner is visible"`
Positional	`"assert 'Settings' appears in the left sidebar navigation"`

Dos and Don'ts

✅ Do	❌ Don't
Use imperative verbs: "go to", "click", "store as"	Use vague verbs: "check out", "look at", "explore"
Be specific: "click the 'Add to Cart' button"	Be vague: "add the item"
Name extractions: "store X as 'price'"	Hope for values: "tell me the price"
Use `{{variables}}` for credentials/URLs	Hardcode secrets in the objective
Include starting URL in the objective: "Go to https://..."	Assume the agent knows where to start
Split mega-objectives (>15 steps) into multiple runs	Cram everything into one massive objective

5. Parsing Output (--agent mode)

Internal reference only. Everything in this section (field names, event types, JSON structure) is for you to parse programmatically. Never expose these internal terms to the user. The user should see plain-language summaries, not run_end, final_state, bifurcation, NDJSON, session_dir, or any raw JSON fields.

With --agent, kane-cli outputs one JSON object per line to stdout. Progress UI renders to stderr.

Event Types

Progress events (bulk of the output — one per step):

{"step": 1, "status": "passed", "remark": "Navigated to amazon.in"}
{"step": 2, "status": "passed", "remark": "Typed 'laptop' in search box"}
{"step": 3, "status": "failed", "remark": "Could not find Add to Cart button"}

Field	Type	Description
`step`	number	Step index (1-based)
`status`	string	`"passed"` or `"failed"`
`remark`	string	What the agent did or why it failed

These are untyped — they have no type field. Do not key on event.type === 'step_start' or 'step_end'; those event types are not emitted.

Flow events:

Event (`type` field)	Key Fields	Purpose
`bifurcation`	`flows[]`, `count`	Agent split objective into sub-flows
`child_agent_start`	`child_id`, `objective`, `parent_step`	Child agent spawned
`child_agent_end`	`child_id`, `success`, `steps_taken`, `summary`	Child agent finished
`ask_user`	`question`, `step_index`, `options?`	Agent needs user input
`error`	`message`	Error occurred

Note: There is no run_start event — the first line is either a bifurcation or a progress object.

Note: ask_user is auto-disabled when stdin is not a TTY. Since agents typically run kane-cli as a subprocess, ask_user events will not be emitted. Write objectives that don't require interactive input.

Parsing Strategy

Since progress events lack a type field, distinguish them from typed events like this:

for each line of NDJSON:
  if obj.type === "run_end"    → terminal event, stop parsing
  if obj.type === "bifurcation" → flow split
  if obj.type exists           → other typed event
  if obj.step exists           → progress event (step/status/remark)

Build automation on run_end — it is the only event guaranteed to have a stable schema across versions. Use progress events for live status display only.
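The parsing strategy above can be sketched in Python as follows. The function names are hypothetical, and skipping lines that are not valid JSON is an assumption made for robustness, not documented kane-cli behavior.

```python
import json


def classify_event(obj: dict) -> str:
    """Classify one parsed NDJSON object using the rules above."""
    kind = obj.get("type")
    if kind == "run_end":
        return "run_end"
    if kind == "bifurcation":
        return "bifurcation"
    if kind is not None:
        return "typed"
    if "step" in obj:
        return "progress"
    return "unknown"


def parse_stream(lines):
    """Return (progress_events, typed_events, run_end_or_None)."""
    progress, typed, run_end = [], [], None
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            obj = json.loads(line)
        except json.JSONDecodeError:
            continue  # assumption: ignore anything that is not JSON
        if not isinstance(obj, dict):
            continue
        kind = classify_event(obj)
        if kind == "run_end":
            run_end = obj
            break  # terminal event, stop parsing
        if kind == "progress":
            progress.append(obj)
        elif kind in ("bifurcation", "typed"):
            typed.append(obj)
    return progress, typed, run_end


sample = [
    '{"step": 1, "status": "passed", "remark": "Navigated to amazon.in"}',
    '{"type": "error", "message": "example"}',
    '{"type": "run_end", "status": "passed", "final_state": {"price": "$29.99"}}',
]
progress, typed, run_end = parse_stream(sample)
```

Checking `type` before `step` mirrors the order in the pseudocode, so a typed event is never mistaken for a progress event.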

Terminal event (always the last line):

{
  "type": "run_end",
  "status": "passed",
  "summary": "Searched for laptop and added first result to cart",
  "one_liner": "Searched for laptop on Amazon and added to cart",
  "reason": "Objective completed",
  "duration": 45.2,
  "credits": 12,
  "final_state": {
    "price": "$29.99",
    "product_name": "Wireless Headphones"
  },
  "context": {
    "memory": {},
    "variables": {},
    "pointer": "(passed) Searched for laptop and added first result to cart"
  },
  "session_dir": "~/.testmuai/kaneai/sessions/a1b2c3d4-e5f6-7890-abcd-ef1234567890",
  "run_dir": "~/.testmuai/kaneai/sessions/a1b2c3d4-e5f6-7890-abcd-ef1234567890/runs/0",
  "test_url": "https://test-manager.lambdatest.com/projects/123/test-cases/456"
}

Key run_end fields:

status — "passed" or "failed"
summary — what the agent did
one_liner — short summary for display
reason — why it stopped
credits — credits consumed by the run (when reported)
final_state — extracted values from "store as" objectives
test_url — link to KaneAI dashboard (if upload succeeded)
session_dir / run_dir — paths to log files

Responding to `ask_user` (if stdin is a TTY)

{"type": "user_response", "answer": "Medium size"}

To cancel a run:

{"type": "cancel"}

6. Presenting Results to the User

Golden rule: The user should feel like they're watching a browser task happen, not reading a log file. Use plain language, never expose internal field names, JSON keys, file paths, or technical jargon. Translate everything into what the user cares about.

📢 Live Progress (During the Run)

Do not stay silent while kane-cli runs. As the command executes, keep the user informed:

Before starting — Tell the user what you're about to do:

Starting browser task: searching for 'laptop' on Amazon...

As steps complete — Relay each step's outcome in plain language as it happens. Parse the progress events from stdout and narrate them:

Step 1: Opened Amazon homepage
Step 2: Typed 'laptop' in the search bar
Step 3: Clicked the search button
Step 4: Search results loaded — found product listings

If something goes wrong mid-run — Flag it immediately, don't wait for the final result:

Step 5: Could not find the 'Add to Cart' button — the agent is retrying...

This keeps the user engaged and lets them intervene early if the task is going in the wrong direction.

📋 Results Summary (After the Run)

After every run, present a clear summary. Never just say "it passed" — show the full picture in a user-friendly format.

Successful run:


🟢 Result	Passed
🎯 Task	Search for 'laptop' on Amazon
⏱️ Duration	45.2s
👣 Steps taken	7
📝 What happened	Opened Amazon, typed 'laptop' in search, clicked search, results loaded with 48 products
🔗 View details	Open in KaneAI Dashboard

If data was extracted (from "store as" objectives), show it as a clean results table:

📦 What was found	Value
Top repository	freeCodeCamp/freeCodeCamp
Star count	413k
Price	$29.99

If assertions were checked, show pass/fail for each:

✅ Check	Result
Dashboard shows welcome message	🟢 Passed
User role is Admin	🔴 Failed
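One way to build such a summary from the terminal event is sketched below. `render_summary` is a hypothetical helper; it assumes the step count was tallied from progress events and that `duration` is in seconds, as the example values suggest. Emoji and table styling are left to the presentation layer.

```python
def render_summary(task: str, run_end: dict, steps_taken: int) -> str:
    """Turn a run_end event into plain-language lines with no internal field names."""
    passed = run_end.get("status") == "passed"
    rows = [
        ("Result", "Passed" if passed else "Failed"),
        ("Task", task),
    ]
    if "duration" in run_end:
        rows.append(("Duration", f"{run_end['duration']}s"))
    rows.append(("Steps taken", str(steps_taken)))
    rows.append(("What happened", run_end.get("summary", "")))
    # Extracted values: convert keys such as product_name into readable labels.
    for name, value in run_end.get("final_state", {}).items():
        rows.append((name.replace("_", " ").capitalize(), str(value)))
    return "\n".join(f"{label}: {value}" for label, value in rows)


example = render_summary(
    "Search for 'laptop' on Amazon",
    {
        "status": "passed",
        "duration": 45.2,
        "summary": "Searched for laptop and added first result to cart",
        "final_state": {"product_name": "Wireless Headphones"},
    },
    steps_taken=7,
)
```

The output carries labels like "Product name" instead of raw keys, which keeps internal terms away from the user.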

❌ When Things Go Wrong

For failed runs, explain what went wrong in plain language:

🔍 What failed — describe the step that failed and why, in the user's terms (not "step_003.json shows dom_action error")
📸 Screenshot — if a screenshot exists, read and show it so the user can see what the browser looked like at the point of failure
💡 Why it likely failed — your diagnosis: was the element missing? Did the page not load? Was the objective ambiguous?
🔧 Suggested fix — a concrete next step: rephrase the objective, increase timeout, check auth, etc.

Example of a good failure report:

🔴 Failed at step 5 of 9 (after 25s)

What happened: The agent clicked "Proceed to Checkout" but the payment form never appeared. The page showed a loading spinner for 15 seconds before the agent timed out.

Likely cause: The checkout page may require authentication, or the site's payment service was slow/down.

Suggested fix: Try adding an explicit login step before checkout, or increase the timeout to 120s.

🐛 Suggesting a Bug Report

If the failure looks like a kane-cli bug (not auth, timeout, or a vague objective), offer to file a report:

This looks like it might be a bug in kane-cli. Want me to file a report?

File at: https://github.com/LambdaTest/kane-cli/issues. Gather the details automatically — don't ask the user to dig through log files.

Do NOT suggest bug reports for: auth issues, low timeouts, vague objectives, or website errors (500s, CAPTCHAs).

7. Saving & Replaying Tests (`testmd`)

The §3 run command is the primary mode — one-shot, ephemeral. testmd is the secondary mode: tests live as _test.md files on disk, each step is cached on the first run, and every later run replays from cache with no LLM cost.

Use testmd whenever the user wants the test to persist. The decision is binary — once a test exists as a file, every later invocation is testmd run, never run.

When to switch from `run` to `testmd`

User says	Use
"save this test", "commit this", "keep this", "add this to the suite"	`testmd`
"regression test", "smoke test", "make this replayable"	`testmd`
"this is a test", "test the X flow end-to-end" (suite-shaped)	`testmd`
"run this once", "check if X works right now", "try X"	`run` (§3)
"search for", "click", "fill", "verify" (one-shot)	`run` (§3)

If unclear, ask: "Do you want me to save this test so you can re-run it later?"

Quick start

Write the file (any path; filename must end in _test.md):

---
mode: testing
max_steps: 30
---

# Amazon search

## Open Amazon
Open https://www.amazon.com.

## Search for headphones
Type "wireless headphones" into the search box and submit.
Verify at least one product result is visible.

Run it:

kane-cli testmd run amazon_test.md --agent

File format

Four parts in order:

YAML frontmatter — between --- ... --- at the very top.
# Title — decorative; everything before the first ## is ignored.
## H2 step headings — one per step. The agent reads the step body, not the heading.
Step body — either prose or a single @import <path> line. Never both.

Per-step yaml overrides go immediately under the heading, in a fenced block:

## Submit the form
```yaml
timeout: 90
optional: true
```
Click submit and verify the confirmation banner.

Frontmatter keys to use:

Key	Scope	Description
`mode`	root	`action` (halts on auth walls) or `testing` (default — pushes through so negative-test assertions can fire)
`max_steps`	root + step	Max agent reasoning steps. Default `30`.
`timeout`	root + step	Hard kill per step in seconds.
`headless`	root	No browser window.
`variables`	root + step	`{{name}}` params, same shape as §3, with `secret: true` for credentials
`global_context` / `local_context`	root + step	Inline Markdown or path
`code_export` / `code_language`	root + step	Generate Playwright after the run; language `python` or `javascript`

Files ending in _test.md are tests (valid entry points). Any other .md is a helper — reachable only via @import.
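Before running a freshly written file, a quick structural check can catch the most common mistakes. This is a rough sketch, not kane-cli's parser: `check_test_md` is hypothetical, it treats frontmatter as optional (an assumption), and it does not validate the YAML itself.

```python
FENCE = "`" * 3  # marker that opens and closes a fenced block


def check_test_md(filename: str, text: str) -> list[str]:
    """Return a list of structural problems; an empty list means none were found."""
    problems = []
    if not filename.endswith("_test.md"):
        problems.append("filename must end in _test.md")

    lines = text.splitlines()
    start = 0
    if lines and lines[0].strip() == "---":
        closing = next(
            (i for i in range(1, len(lines)) if lines[i].strip() == "---"), None
        )
        if closing is None:
            problems.append("frontmatter is missing closing '---'")
            return problems
        start = closing + 1

    steps = []  # list of (heading, body lines outside fenced blocks)
    in_fence = False
    for line in lines[start:]:
        if line.startswith(FENCE):
            in_fence = not in_fence  # skip per-step yaml blocks
            continue
        if in_fence:
            continue
        if line.startswith("## "):
            steps.append((line[3:].strip(), []))
        elif steps and line.strip():
            steps[-1][1].append(line.strip())

    if not steps:
        problems.append("no '## ' step headings found")
    for heading, body in steps:
        has_import = any(item.startswith("@import") for item in body)
        if has_import and len(body) != 1:
            problems.append(f"step '{heading}': body mixes @import with other lines")
    return problems
```

Anything before the first `## ` heading is ignored, matching the rule that the title is decorative.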

The replay & cascade rule (CRITICAL)

On the first run of a test, the agent authors each step and saves a recording. On every later run, each step replays from its recording — no agent, no LLM cost, much faster.

A step replays only if all of these hold:

A recording for that step exists,
Its prose is unchanged since the recording,
Its yaml block is unchanged,
No earlier step in the file invalidated it.

Editing step N re-authors step N AND every step after it in the same file. Each step starts where the previous step left off (URL, login, tabs). When step 3 changes, step 4 cannot safely replay against state that no longer exists.

Consequences when editing tests:

A one-line tweak at the top of a 20-step test re-authors all 20 steps on the next run.
To re-record only one step, edit only that step (or steps after it).
--author forces full authoring for one run (debugging only).
rm -rf output-<stem>/ wipes the cache entirely.
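The cascade can be modeled in a few lines. This sketch is a mental model, not kane-cli's implementation; `steps_to_reauthor` is a hypothetical name, and it assumes that a step without a recording invalidates later steps in the same way an edited step does.

```python
def steps_to_reauthor(total_steps: int, edited: set, recorded: set) -> list:
    """Return the 1-based steps that will be re-authored on the next run.

    `edited` holds steps whose prose or yaml block changed since recording;
    `recorded` holds steps that already have a recording.
    """
    for step in range(1, total_steps + 1):
        if step in edited or step not in recorded:
            # The first step that cannot replay invalidates everything after it.
            return list(range(step, total_steps + 1))
    return []
```

For example, editing step 4 of a fully recorded 5-step test gives `[4, 5]`, while editing step 1 of a 20-step test gives all 20 steps.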

`@import` for reusing flows

Extract a repeating flow (login, setup, cookie banner dismissal) into a helper file:

## Sign in
@import ./helpers/login.md

Rules:

Helper filename must not end in _test.md.
Path resolves relative to the importing file, not the shell's cwd.
The step body must be exactly @import <path> — no mixed prose, no extra lines.
The step's yaml block may contain only optional. Other keys are rejected.
optional: true on @import is allowed only at the root file, not on a nested import.

Variables and context propagate into helpers. Chrome / mode / auth do not (root-only).

Editing a helper re-authors that step in every test that imports it, plus everything after the import in those tests. Same cascade rule.

Commands

Command	Use
`kane-cli testmd run <path> --agent [flags]`	Run a test
`kane-cli testmd list`	List `*_test.md` files under cwd (NDJSON when non-TTY)
`kane-cli testmd status <path>`	Test Manager identity + local-sync state
`kane-cli testmd export <path> [--code-language python|javascript]`	Regenerate code export from existing recordings (no browser launch)
`kane-cli testmd delete <path>`	Local-only delete: removes source + `output-<stem>/`. Does NOT delete from Test Manager.

Flags on testmd run that don't exist on §3 run:

Flag	Default	Description
`--name <name>`	none	Persist the run under this name. Regex `[a-zA-Z0-9_-]+`.
`--on-lock-conflict <readonly|fail|wait>`	none	Behavior when another user holds the test's edit lock. `readonly` = replay-only / no upload, `fail` = exit 2, `wait` = block until released
`--retry`	off	On replay failure, restart with a shrinking replay window
`--retry-count <n>`	`3`	Max retry restarts before falling back to full re-author
`--author`	off	Force authoring every step (skip replay decision)

All §3 run flags also apply (--agent, --headless, --max-steps, --timeout, --variables, etc.).

Flag wins over frontmatter for everything except variables — the file owns variables; you can add new keys via flags but cannot override file-defined ones.
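The precedence rule can be sketched as a two-part merge. `effective_settings` is a hypothetical helper, and both inputs are assumed to be already-parsed dictionaries keyed by the frontmatter names.

```python
def effective_settings(frontmatter: dict, flags: dict) -> dict:
    """Combine frontmatter and CLI flags using the precedence described above."""
    # For ordinary settings the flag wins.
    merged = {k: v for k, v in frontmatter.items() if k != "variables"}
    merged.update({k: v for k, v in flags.items() if k != "variables"})
    # For variables the file wins: flags may add keys but not override them.
    variables = dict(flags.get("variables", {}))
    variables.update(frontmatter.get("variables", {}))
    merged["variables"] = variables
    return merged


settings = effective_settings(
    {"timeout": 90, "variables": {"user": {"value": "alice"}}},
    {"timeout": 120, "variables": {"user": {"value": "bob"}, "env": {"value": "staging"}}},
)
```

In this example the flag's `timeout` of 120 applies, `user` stays `alice` because the file defines it, and `env` is added from the flag.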

Output: `output-<stem>/` and `Result.md`

After a run:

amazon_test.md
output-amazon/
  Result.md                      # human-readable run report
  .internal/                     # cached recordings — do not edit
  playwright-python-code/        # only if code_export enabled

output-<stem>/ is commit-safe and should be committed to git. That's how teammates and CI replay the same recordings.

For tests using @import, helper recordings land next to the helper file in helper-output-<helper>-<root>-<step>/ directories. Also commit-safe.

Result.md opens in any Markdown viewer. It contains:

Frontmatter — status, started, duration_s, session_id
One entry per root step with one of ✓ passed, ✗ failed, ⏭ skipped, optionally suffixed (optional) when a soft-failing step failed but the run continued
For @import steps that failed, a path to the failing sub-step inside the helper

When the user asks "did the test pass?" or "where did it fail?" for a previously-run test, read Result.md rather than re-running the test.

Recording a `_test.md` from a live session

If the user runs an ad-hoc objective with §3 run and decides to keep it:

kane-cli run "Search for noise-cancelling headphones on amazon.com" --name amazon-search

On exit, kane-cli writes <cwd>/.testmuai/tests/amazon-search_test.md. Move that file into the user's repo and re-run it with testmd run. Without --name, an ad-hoc run is ephemeral and nothing is written.

CI invocation

kane-cli testmd run ./tests/checkout_test.md \
  --agent \
  --headless \
  --on-lock-conflict wait \
  --retry

--agent — NDJSON to stdout (auto-enabled when stdin is not a TTY; pass explicitly anyway).
--headless — no window.
--on-lock-conflict wait — block instead of failing if a teammate is editing the same test.
--retry — automatically recover transient replay failures.

Exit codes follow §3, with two codes covering additional cases:

2 now includes parse errors and --on-lock-conflict fail
3 now includes --on-lock-conflict wait timeout

Parse errors (when writing a `_test.md`)

Parse errors abort before any browser launch with exit 2. Common ones and the fix:

Message	Fix
`frontmatter is missing closing '---'`	Add the trailing `---`
`invalid YAML in frontmatter`	Re-validate the YAML block
`step body must be exactly one of prose / @import`	Split into two steps
`step config on @import may only contain 'optional'`	Remove other keys from the yaml block
`cannot @import a test file`	Imports may only reference helpers (not ending in `_test.md`)
`cyclic reference`	Restructure helpers to break the loop
`chrome config is global-only`	Move Chrome key to root frontmatter
`'<key>' is run-level and cannot be set per-step`	Move `mode` / `on_lock_conflict` to root frontmatter
`unknown config key`	Remove or fix the key
`auth/identity keys are CLI-only`	Pass `username` / `access_key` as CLI flags, not in frontmatter

When the user reports a parse error, fix the file before retrying — don't loop on the same error.

8. Failure Handling & Log Inspection

When a run fails, diagnose before suggesting fixes.

Log Locations

The run_end event provides session_dir and run_dir paths. Use those directly.

{session_dir}/
├── session.json               # Session metadata, run list, upload status
├── tui.log                    # Timeline: session start, run start/end, errors
└── runs/{n}/
    └── run-test/
        └── actions.ndjson     # Step-by-step record of agent actions

Debugging Flow

Parse the run_end event from stdout — it has status, reason, and summary plus the session_dir / run_dir paths.
Read actions.ndjson in {run_dir}/run-test/ — each line is one agent action with its intent and outcome.
Check tui.log in {session_dir}/ — for session-level issues (Chrome launch, auth, upload).
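A small sketch for step 2 of the flow above: loading the action records from the run directory. `read_actions` is a hypothetical helper. The per-line fields are not documented here, so the code returns each line as a plain dictionary without assuming any keys.

```python
import json
from pathlib import Path


def read_actions(run_dir: str) -> list:
    """Load every record from {run_dir}/run-test/actions.ndjson."""
    # expanduser handles paths reported with a leading "~".
    path = Path(run_dir).expanduser() / "run-test" / "actions.ndjson"
    actions = []
    with path.open(encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:  # tolerate blank lines
                actions.append(json.loads(line))
    return actions
```

Reading the last few records is usually enough to see what the agent was attempting when the run stopped.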

Common Failure Patterns

Symptom	Likely Cause	Fix
🔄 Agent repeats same action	Stuck in a loop / page didn't change	Rephrase objective, add explicit wait or assertion
🎯 Agent clicks wrong element	Ambiguous UI, multiple similar elements	Be more specific: "click the blue 'Submit' button in the checkout form"
👁️ Agent says done but didn't finish	Objective too vague	Add explicit assertions: "assert the confirmation page shows order number"
💀 Exit code 2, no steps	Auth or Chrome failure	Check `kane-cli whoami`, verify Chrome is available
⏱️ Exit code 3	Timeout or cancelled	Increase `--timeout` or `--max-steps`, or split into smaller objectives
🚫 "CDP endpoint not reachable"	Chrome not running	Let kane-cli manage Chrome (remove `--cdp-endpoint`)

9. Parallel Execution

For multiple independent browser tasks, decompose and run in parallel using the Agent tool.

When to Split

>15 steps — long runs drift and get stuck
Independent flows — login test and search test don't depend on each other
Different pages/features — settings vs checkout vs admin
Different user roles — admin flow vs regular user flow

How to Split

Each sub-objective must be self-contained: navigates to its own URL, authenticates independently, asserts its own outcomes. No sub-objective depends on another having run first.

Execution Pattern

Decompose the user's request into N independent sub-objectives

Spawn N Agent tool calls in a single message — each runs:

kane-cli run "Go to <url> and <sub-objective>" --agent --headless --timeout 120

Each agent parses the NDJSON output, waits for run_end, returns: status, steps, duration, summary, session path
After ALL agents complete, format the batch summary

Agent Prompt Template

Run this kane-cli browser test and report results:

    kane-cli run "Go to <url> and <objective>" --agent --headless --timeout 120

After the command completes:
1. Capture the exit code
2. Parse the run_end NDJSON event from stdout
3. If failed, read the failing step's screenshot from run_dir
4. Return: {status, steps, duration, summary, session_dir, failure_step, screenshot_path}

Batch Summary Format

## 🧪 Test Suite: <suite name>

| # | Test | Status | Steps | Time | What happened |
|---|------|--------|-------|------|---------|
| 1 | Login + dashboard | ✅ | 5 | 12s | Welcome banner visible |
| 2 | Product search | ✅ | 7 | 18s | 3 results for 'shoes' |
| 3 | Checkout flow | ❌ | 9 | 25s | Payment form did not load |
| 4 | Admin CSV export | ✅ | 6 | 15s | CSV downloaded (42 rows) |

### 📊 Overall
- **Pass rate:** 3/4 (75%)
- **Total steps:** 27 · **Total time:** 1m10s

### ❌ Failures
**#3 Checkout flow** — Payment form did not load after clicking "Credit Card".
📸 [screenshot of the failure shown inline]

Status icons: ✅ passed · ❌ failed · ⚠️ stuck/timeout

Do not show raw file paths (like ~/.testmuai/kaneai/sessions/...) in the summary. Instead, read the screenshot and show it inline, or offer to inspect logs only if the user asks.
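The "Overall" figures in the batch summary can be computed from the per-test results. A minimal sketch, assuming each sub-agent returns a dictionary with `status`, `steps`, and `duration` in seconds; `summarize_batch` and `format_duration` are hypothetical names.

```python
def format_duration(seconds) -> str:
    """Format whole seconds as e.g. '1m10s' or '45s'."""
    minutes, secs = divmod(int(seconds), 60)
    return f"{minutes}m{secs:02d}s" if minutes else f"{secs}s"


def summarize_batch(results: list) -> dict:
    """Aggregate pass rate, total steps, and total time across runs."""
    total = len(results)
    passed = sum(1 for r in results if r["status"] == "passed")
    rate = f"{passed}/{total} ({round(100 * passed / total)}%)" if total else "0/0"
    return {
        "pass_rate": rate,
        "total_steps": sum(r["steps"] for r in results),
        "total_time": format_duration(sum(r["duration"] for r in results)),
    }


# Values taken from the example table above.
results = [
    {"status": "passed", "steps": 5, "duration": 12},
    {"status": "passed", "steps": 7, "duration": 18},
    {"status": "failed", "steps": 9, "duration": 25},
    {"status": "passed", "steps": 6, "duration": 15},
]
overall = summarize_batch(results)
```

With the four example rows this yields a pass rate of 3/4 (75%), 27 total steps, and a total time of 1m10s. The total here is the sum of run durations, not wall-clock time for the parallel batch.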

10. Configuration & Reference

Config Commands

kane-cli config show                          # Show all current settings
kane-cli config set-window <W>x<H>           # Browser window size (e.g. 1920x1080)
kane-cli config chrome-profile <path>         # Chrome profile path (or interactive picker in TTY)
kane-cli config project <project-id>          # TMS project ID (or interactive picker in TTY)
kane-cli config folder <folder-id>            # TMS folder ID (or interactive picker in TTY)

Feedback

Submit feedback on a completed test run:

kane-cli feedback --test-id <id> --feedback-type <positive|negative> --details "..."

Directory Structure

~/.testmuai/kaneai/
├── tui-config.json              # Persistent CLI settings
├── config.json                  # Shared auth configuration
├── global-memory.md             # Global agent context
├── chrome-profile/              # Default Chrome user profile
├── profiles/                    # Stored credentials
│   └── {profile}/{env}/
│       └── credentials
├── sessions/                    # Session history
│   └── {session-id}/
│       ├── session.json         # Metadata, run list, upload status
│       ├── tui.log              # Session event log
│       ├── runs/{n}/
│       │   └── run-test/
│       │       └── actions.ndjson   # Step-by-step record of agent actions
│       └── code-export/         # (when --code-export) generated code files
└── variables/                   # Global variable files
    └── *.json

# Project-local overrides (in cwd):
.testmuai/
├── context.md                   # Project-specific agent context
└── variables/
    └── *.json                   # Project-specific variables

Chrome Management

kane-cli auto-launches Chrome with CDP (Chrome DevTools Protocol) enabled on a port in the range 9222–9230. Chrome runs as a detached process and outlives the CLI.

--headless — runs Chrome in headless mode (no visible window)
--cdp-endpoint <url> — connect to an already-running Chrome instance
--ws-endpoint <url> — connect to a remote browser (LambdaTest grid)

If Chrome fails to launch, ensure Google Chrome is installed and no other process is using CDP ports 9222–9230.
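To see whether something is already listening on those ports, a quick local probe can help. This is a diagnostic sketch only: `busy_ports` is a hypothetical helper, it checks the loopback address by default, and a listening port is not necessarily Chrome.

```python
import socket


def busy_ports(ports, host="127.0.0.1"):
    """Return the ports on `host` that currently accept TCP connections."""
    busy = []
    for port in ports:
        with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
            sock.settimeout(0.2)
            # connect_ex returns 0 when something is listening on the port.
            if sock.connect_ex((host, port)) == 0:
                busy.append(port)
    return busy


# Probe the documented CDP port range.
print(busy_ports(range(9222, 9231)))
```

An empty list means none of the ports answered, so a launch failure is more likely caused by a missing Chrome installation than by a port conflict.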

name	kane-cli
description	Browser automation via kane-cli — run objectives, parse NDJSON output, inspect logs, report bugs. Use for any task requiring a real browser (navigate, click, fill forms, test web UI, take screenshots).

Kane CLI — Browser Automation Skill

Use kane-cli for any task that requires a real browser: navigating websites, clicking elements, filling forms, searching, testing web UI, taking screenshots, or verifying deployments.

Do NOT use Playwright, Puppeteer, or Selenium directly. kane-cli manages Chrome, auth, and the AI automation agent.

Always run with --agent flag. This gives structured NDJSON output that you parse and present to the user with rich formatting.

1. Decision Tree

When the user's request involves a browser, follow this flow:

Is kane-cli installed? ├─ Unknown → Check with kane-cli --version ├─ No → npm install -g @testmuai/kane-cli then §2 └─ Yes ↓

Is kane-cli set up? ├─ Unknown → Run kane-cli whoami to check auth status ├─ No → Go to §2 (Pre-flight Setup) └─ Yes ↓

After every run:

Parse the NDJSON output (§5)
Present rich results with emojis (§6)
If failed, inspect logs and diagnose (§8)

2. Pre-flight Setup

Before first use, verify installation and auth.

Install

npm install -g @testmuai/kane-cli

Check Auth Status

kane-cli whoami

If this shows "not configured" or errors, run login:

Login (Basic Auth)

kane-cli login --username <user> --access-key <key>

This creates the default profile with basic auth, auto-selects the KaneAI project, and marks setup complete. Credentials come from the user's TestmuAI dashboard (Settings → Keys).

Optional flag:

--profile <name> — profile name (default: last selected profile check using config show)

Login (OAuth)

kane-cli login --oauth

This opens the browser for OAuth consent and waits for the callback. Works in both TTY and non-TTY (agent) mode.

Login (Interactive — TTY only)

In a terminal, run kane-cli login with no flags for the interactive wizard (auth method → project picker → folder picker). If the user needs this, ask them to run it directly:

Please run ! kane-cli login and complete the sign-in.

Verify

kane-cli whoami          # Auth status
kane-cli config show     # Current configuration

3. Building the Command

Every run uses this pattern:

kane-cli run "<objective>" --agent [options]

--agent is mandatory — it outputs structured NDJSON that you parse and present to the user.

Flags

Flag	Purpose	Default
`--headless`	No visible browser window	Off (browser visible)
`--max-steps <n>`	Limit agent reasoning steps	30
`--timeout <s>`	Kill run after N seconds	No limit
`--variables <json>`	Inline variables JSON	None
`--variables-file <path>`	Load variables from a JSON file	None
`--global-context <file>`	Override global agent context markdown	`~/.testmuai/kaneai/global-memory.md`
`--local-context <file>`	Override local project context markdown	`.testmuai/context.md`
`--ws-endpoint <url>`	Remote browser via WebSocket (e.g. LambdaTest grid)	Local Chrome
`--cdp-endpoint <url>`	Connect to existing Chrome via CDP	Auto-launch Chrome
`--code-export`	Generate code export after upload	Off

Exit Codes

Code	Meaning
0	✅ Passed
1	❌ Failed
2	⚠️ Error (auth, setup, infra)
3	⏱️ Timeout or cancelled

Variables

Variables parameterize objectives with reusable values and secrets. Use {{key}} syntax in objectives.

Format:

{
  "username": { "value": "alice", "secret": false },
  "password": { "value": "s3cret!", "secret": true }
}

secret: true masks the value in logs and routes it to TestmuAI's secrets store instead of syncing it as a plain TMS variable.

Loading order (later wins):

~/.testmuai/kaneai/variables/*.json (global, alphabetical)
{cwd}/.testmuai/variables/*.json (local project overrides)
--variables-file <path>
--variables '{...}' (inline JSON)
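
To make "later wins" concrete, here is a small Python sketch of the merge. It is an illustration only: `load_layer` and `resolve_variables` are hypothetical names, and it assumes the override happens per top-level key (the source does not say whether nested fields are merged). Sorting files alphabetically is stated only for the global directory; applying it to other directories is an assumption.

```python
import json
from pathlib import Path


def load_layer(directory: Path) -> dict:
    """Merge every *.json file in one directory, in alphabetical order."""
    merged: dict = {}
    for path in sorted(directory.glob("*.json")):
        merged.update(json.loads(path.read_text()))
    return merged


def resolve_variables(layers: list[dict]) -> dict:
    """Merge layers in loading order; a later layer replaces earlier keys."""
    resolved: dict = {}
    for layer in layers:
        resolved.update(layer)
    return resolved


# Stand-ins for the global, local, and inline layers (values are made up).
global_vars = {"username": {"value": "alice", "secret": False}}
local_vars = {"username": {"value": "bob", "secret": False}}
inline_vars = {"password": {"value": "s3cret!", "secret": True}}

resolved = resolve_variables([global_vars, local_vars, inline_vars])
print(resolved["username"]["value"])  # bob
```

The local project value replaces the global one, and the inline layer adds a key that neither file defined.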

Always parameterize: credentials, API keys, tokens, environment-specific URLs. OK to hardcode: one-off URLs, static UI text, navigation paths.

Context Files

Context files provide additional instructions to the agent:

Global: ~/.testmuai/kaneai/global-memory.md — shared across all runs
Local: .testmuai/context.md in cwd — project-specific

Override per-run with --global-context / --local-context flags.

Examples

# Simple browser task
kane-cli run "Go to https://www.amazon.in and search for 'laptop'" --agent

# Headless with timeout
kane-cli run "Go to https://app.example.com and verify login page loads" --agent --headless --timeout 60

# With variables
kane-cli run "Go to https://app.example.com and login with {{username}} and {{password}}" --agent \
  --variables '{"username": {"value": "alice"}, "password": {"value": "secret123", "secret": true}}'

# Remote browser (LambdaTest grid)
kane-cli run "Go to https://shop.example.com and add item to cart" --agent \
  --ws-endpoint "wss://cdp.lambdatest.com/playwright?capabilities=..."

# With variables file
kane-cli run "Go to https://staging.myapp.com, login and verify dashboard" --agent \
  --variables-file ./test-creds.json --headless --timeout 120
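
When the command is launched from a script rather than typed into a shell, building it as an argument list avoids quoting problems with objectives and inline JSON. The sketch below is one way to do that in Python; `build_run_command` and `EXIT_MEANINGS` are hypothetical names, and only flags and exit codes listed in this section are used.

```python
import json


def build_run_command(objective, *, headless=False, timeout=None, variables=None):
    """Return the argv list for one `kane-cli run` invocation."""
    argv = ["kane-cli", "run", objective, "--agent"]  # --agent is mandatory
    if headless:
        argv.append("--headless")
    if timeout is not None:
        argv += ["--timeout", str(timeout)]
    if variables:
        # Inline variables are passed as a single JSON string.
        argv += ["--variables", json.dumps(variables)]
    return argv


# Exit code meanings, copied from the table above.
EXIT_MEANINGS = {
    0: "passed",
    1: "failed",
    2: "error (auth, setup, infra)",
    3: "timeout or cancelled",
}

argv = build_run_command(
    "Go to https://app.example.com and verify login page loads",
    headless=True,
    timeout=60,
)
print(argv)
```

The list can be handed to `subprocess.run(argv)`, and the resulting return code looked up in `EXIT_MEANINGS`.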

4. Writing Objectives

The objective string is the most important input. How you phrase it determines what the agent does.

Three Patterns

Pattern	Trigger Phrases	Agent Behavior
🎯 Action	"go to", "click", "type", "search", "fill", "scroll"	Performs browser actions
✅ Assertion	"assert", "verify", "confirm", "check that"	Validates a condition (pass/fail)
📦 Extraction	"store X as 'name'"	Reads a value from the page and persists it in structured output

Extraction: The "store as" Pattern

Critical. Vague phrasing like "read", "report", or "tell me" does NOT reliably extract data. The agent may observe the value visually but won't persist it in structured output.

❌ Bad — agent looks but doesn't capture:

"go to example.com and read the page title"
"go to example.com and tell me the price"

✅ Good — agent extracts and persists in final_state:

"go to example.com, store the page title as 'page_title'"
"go to example.com, store the price of the first item as 'price'"

Stored values appear in the run_end event's final_state and context.memory fields.

Combining Patterns

Chain action → extraction → assertion in a single objective:

"go to {{app_url}}/dashboard,
 store the welcome message as 'welcome_text',
 store the user role in the sidebar as 'role',
 assert the role is 'Admin'"
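
If objectives are generated programmatically, the same chain can be assembled from its parts. This is a sketch under the assumption that a comma-separated sentence, as in the example above, is an acceptable form; `build_objective` is a hypothetical helper, not a kane-cli API.

```python
def build_objective(start_url, extractions, assertions):
    """Chain action -> extraction -> assertion into one objective string.

    extractions maps the stored name to a description of what to read.
    """
    parts = [f"go to {start_url}"]
    parts += [f"store {what} as '{name}'" for name, what in extractions.items()]
    parts += [f"assert {condition}" for condition in assertions]
    return ", ".join(parts)


objective = build_objective(
    "{{app_url}}/dashboard",
    {
        "welcome_text": "the welcome message",
        "role": "the user role in the sidebar",
    },
    ["the role is 'Admin'"],
)
print(objective)
```

Every extraction gets an explicit name, which is what makes the value show up in the structured output.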

Assertion Specificity

Type	Example
Exact match	`"assert the cart total shows '$29.99'"`
Flexible match	`"assert a price is displayed for each product"`
State	`"assert the Submit button is disabled until all fields are filled"`
Conditional	`"if a cookie banner appears, dismiss it, then assert the homepage loads"`
Negative	`"assert no error message or red banner is visible"`
Positional	`"assert 'Settings' appears in the left sidebar navigation"`

Dos and Don'ts

✅ Do	❌ Don't
Use imperative verbs: "go to", "click", "store as"	Use vague verbs: "check out", "look at", "explore"
Be specific: "click the 'Add to Cart' button"	Be vague: "add the item"
Name extractions: "store X as 'price'"	Hope for values: "tell me the price"
Use `{{variables}}` for credentials/URLs	Hardcode secrets in the objective
Include starting URL in the objective: "Go to https://..."	Assume the agent knows where to start
Split mega-objectives (>15 steps) into multiple runs	Cram everything into one massive objective

5. Parsing Output (--agent mode)

Internal reference only. Everything in this section (field names, event types, JSON structure) is for you to parse programmatically. Never expose these internal terms to the user. The user should see plain-language summaries, not run_end, final_state, bifurcation, NDJSON, session_dir, or any raw JSON fields.

With --agent, kane-cli outputs one JSON object per line to stdout. Progress UI renders to stderr.

Event Types

Progress events (bulk of the output — one per step):

{"step": 1, "status": "passed", "remark": "Navigated to amazon.in"}
{"step": 2, "status": "passed", "remark": "Typed 'laptop' in search box"}
{"step": 3, "status": "failed", "remark": "Could not find Add to Cart button"}

Field	Type	Description
`step`	number	Step index (1-based)
`status`	string	`"passed"` or `"failed"`
`remark`	string	What the agent did or why it failed

These are untyped — they have no type field. Do not key on event.type === 'step_start' or 'step_end'; those event types are not emitted.

Flow events:

Event (`type` field)	Key Fields	Purpose
`bifurcation`	`flows[]`, `count`	Agent split objective into sub-flows
`child_agent_start`	`child_id`, `objective`, `parent_step`	Child agent spawned
`child_agent_end`	`child_id`, `success`, `steps_taken`, `summary`	Child agent finished
`ask_user`	`question`, `step_index`, `options?`	Agent needs user input
`error`	`message`	Error occurred

Note: There is no run_start event — the first line is either a bifurcation or a progress object.

Parsing Strategy

Since progress events lack a type field, distinguish them from typed events like this:

for each line of NDJSON:
  if obj.type === "run_end"    → terminal event, stop parsing
  if obj.type === "bifurcation" → flow split
  if obj.type exists           → other typed event
  if obj.step exists           → progress event (step/status/remark)

Build automation on run_end — it is the only event guaranteed to have a stable schema across versions. Use progress events for live status display only.
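
The strategy above can be sketched in Python as follows. This is a minimal illustration, not an official client: `classify` and `parse_stream` are hypothetical names, and the sample lines are made up to match the event shapes documented in this section.

```python
import json


def classify(event: dict) -> str:
    """Label one parsed NDJSON object following the strategy above."""
    event_type = event.get("type")
    if event_type == "run_end":
        return "run_end"
    if event_type == "bifurcation":
        return "bifurcation"
    if event_type is not None:
        return "typed"
    if "step" in event:
        return "progress"  # untyped step/status/remark object
    return "unknown"


def parse_stream(lines):
    """Collect progress events and return them with the terminal run_end event."""
    progress, run_end = [], None
    for line in lines:
        line = line.strip()
        if not line:
            continue
        event = json.loads(line)
        kind = classify(event)
        if kind == "progress":
            progress.append(event)
        elif kind == "run_end":
            run_end = event
            break  # terminal event, stop parsing
    return progress, run_end


sample = [
    '{"step": 1, "status": "passed", "remark": "Navigated to amazon.in"}',
    '{"type": "bifurcation", "flows": [], "count": 0}',
    '{"type": "run_end", "status": "passed", "final_state": {"price": "$29.99"}}',
]
progress, run_end = parse_stream(sample)
print(len(progress), run_end["final_state"]["price"])  # 1 $29.99
```

In a live run, `lines` would be the stdout of the kane-cli process read line by line, so progress can be narrated while the run is still going.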

Terminal event (always the last line):

{
  "type": "run_end",
  "status": "passed",
  "summary": "Searched for laptop and added first result to cart",
  "one_liner": "Searched for laptop on Amazon and added to cart",
  "reason": "Objective completed",
  "duration": 45.2,
  "credits": 12,
  "final_state": {
    "price": "$29.99",
    "product_name": "Wireless Headphones"
  },
  "context": {
    "memory": {},
    "variables": {},
    "pointer": "(passed) Searched for laptop and added first result to cart"
  },
  "session_dir": "~/.testmuai/kaneai/sessions/a1b2c3d4-e5f6-7890-abcd-ef1234567890",
  "run_dir": "~/.testmuai/kaneai/sessions/a1b2c3d4-e5f6-7890-abcd-ef1234567890/runs/0",
  "test_url": "https://test-manager.lambdatest.com/projects/123/test-cases/456"
}

Key run_end fields:

status — "passed" or "failed"
summary — what the agent did
one_liner — short summary for display
reason — why it stopped
credits — credits consumed by the run (when reported)
final_state — extracted values from "store as" objectives
test_url — link to KaneAI dashboard (if upload succeeded)
session_dir / run_dir — paths to log files

Responding to `ask_user` (if stdin is a TTY)

{"type": "user_response", "answer": "Medium size"}

To cancel a run:

{"type": "cancel"}

6. Presenting Results to the User

Golden rule: The user should feel like they're watching a browser task happen, not reading a log file. Use plain language, never expose internal field names, JSON keys, file paths, or technical jargon. Translate everything into what the user cares about.

📢 Live Progress (During the Run)

Do not stay silent while kane-cli runs. As the command executes, keep the user informed:

Before starting, tell the user what you're about to do:

Starting browser task: searching for 'laptop' on Amazon...

As steps complete, relay each step's outcome in plain language as it happens. Parse the progress events from stdout and narrate them:

Step 1: Opened Amazon homepage
Step 2: Typed 'laptop' in the search bar
Step 3: Clicked the search button
Step 4: Search results loaded, found product listings

If something goes wrong mid-run, flag it immediately; don't wait for the final result:

Step 5: Could not find the 'Add to Cart' button. The agent is retrying...

This keeps the user engaged and lets them intervene early if the task is going in the wrong direction.

📋 Results Summary (After the Run)

After every run, present a clear summary. Never just say "it passed" — show the full picture in a user-friendly format.

Successful run:


🟢 Result	Passed
🎯 Task	Search for 'laptop' on Amazon
⏱️ Duration	45.2s
👣 Steps taken	7
📝 What happened	Opened Amazon, typed 'laptop' in search, clicked search, results loaded with 48 products
🔗 View details	Open in KaneAI Dashboard

If data was extracted (from "store as" objectives), show it as a clean results table:

📦 What was found	Value
Top repository	freeCodeCamp/freeCodeCamp
Star count	413k
Price	$29.99

If assertions were checked, show pass/fail for each:

✅ Check	Result
Dashboard shows welcome message	🟢 Passed
User role is Admin	🔴 Failed
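
One way to turn the terminal event into the plain-language summary above is sketched below. The helper names `humanize` and `format_summary` are hypothetical, and the step count is assumed to come from counting progress events, since the terminal event shown in §5 does not carry it.

```python
def humanize(key: str) -> str:
    """Turn an internal name such as 'product_name' into 'Product name'."""
    return key.replace("_", " ").capitalize()


def format_summary(task: str, run_end: dict, steps_taken: int) -> str:
    """Build a user-facing summary without exposing raw field names."""
    passed = run_end.get("status") == "passed"
    lines = [
        f"{'🟢' if passed else '🔴'} Result: {'Passed' if passed else 'Failed'}",
        f"🎯 Task: {task}",
    ]
    if "duration" in run_end:
        lines.append(f"⏱️ Duration: {run_end['duration']}s")
    lines.append(f"👣 Steps taken: {steps_taken}")
    if run_end.get("summary"):
        lines.append(f"📝 What happened: {run_end['summary']}")
    # Extracted values, shown with readable labels instead of raw keys.
    for key, value in run_end.get("final_state", {}).items():
        lines.append(f"📦 {humanize(key)}: {value}")
    return "\n".join(lines)


summary = format_summary(
    "Search for 'laptop' on Amazon",
    {
        "status": "passed",
        "duration": 45.2,
        "summary": "Searched for laptop and added first result to cart",
        "final_state": {"price": "$29.99", "product_name": "Wireless Headphones"},
    },
    steps_taken=7,
)
print(summary)
```

The dashboard link and per-assertion results would be added the same way when they are available.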

❌ When Things Go Wrong

For failed runs, explain what went wrong in plain language:

🔍 What failed — describe the step that failed and why, in the user's terms (not "step_003.json shows dom_action error")
📸 Screenshot — if a screenshot exists, read and show it so the user can see what the browser looked like at the point of failure
💡 Why it likely failed — your diagnosis: was the element missing? Did the page not load? Was the objective ambiguous?
🔧 Suggested fix — a concrete next step: rephrase the objective, increase timeout, check auth, etc.

Example of a good failure report:

🔴 Failed at step 5 of 9 (after 25s)

What happened: The agent clicked "Proceed to Checkout" but the payment form never appeared. The page showed a loading spinner for 15 seconds before the agent timed out.

Likely cause: The checkout page may require authentication, or the site's payment service was slow/down.

Suggested fix: Try adding an explicit login step before checkout, or increase the timeout to 120s.

🐛 Suggesting a Bug Report

If the failure looks like a kane-cli bug (not auth, timeout, or a vague objective), offer to file a report:

This looks like it might be a bug in kane-cli. Want me to file a report?

File at: https://github.com/LambdaTest/kane-cli/issues. Gather the details automatically — don't ask the user to dig through log files.

Do NOT suggest bug reports for: auth issues, low timeouts, vague objectives, or website errors (500s, CAPTCHAs).

7. Saving & Replaying Tests (`testmd`)

Use testmd whenever the user wants the test to persist. The decision is binary — once a test exists as a file, every later invocation is testmd run, never run.

When to switch from `run` to `testmd`

User says	Use
"save this test", "commit this", "keep this", "add this to the suite"	`testmd`
"regression test", "smoke test", "make this replayable"	`testmd`
"this is a test", "test the X flow end-to-end" (suite-shaped)	`testmd`
"run this once", "check if X works right now", "try X"	`run` (§3)
"search for", "click", "fill", "verify" (one-shot)	`run` (§3)

If unclear, ask: "Do you want me to save this test so you can re-run it later?"

Quick start

Write the file (any path; filename must end in _test.md):

---
mode: testing
max_steps: 30
---

# Amazon search

## Open Amazon
Open https://www.amazon.com.

## Search for headphones
Type "wireless headphones" into the search box and submit.
Verify at least one product result is visible.

Run it:

kane-cli testmd run amazon_test.md --agent

File format

Four parts in order:

YAML frontmatter — between --- ... --- at the very top.
# Title — decorative; everything before the first ## is ignored.
## H2 step headings — one per step. The agent reads the step body, not the heading.
Step body — either prose or a single @import <path> line. Never both.

Per-step yaml overrides go immediately under the heading, in a fenced block:

## Submit the form
```yaml
timeout: 90
optional: true
```
Click submit and verify the confirmation banner.

Frontmatter keys to use:

Key	Scope	Description
`mode`	root	`action` (halts on auth walls) or `testing` (default — pushes through so negative-test assertions can fire)
`max_steps`	root + step	Max agent reasoning steps. Default `30`.
`timeout`	root + step	Hard kill per step in seconds.
`headless`	root	No browser window.
`variables`	root + step	`{{name}}` params, same shape as §3, with `secret: true` for credentials
`global_context` / `local_context`	root + step	Inline Markdown or path
`code_export` / `code_language`	root + step	Generate Playwright after the run; language `python` or `javascript`

Files ending in _test.md are tests (valid entry points). Any other .md is a helper — reachable only via @import.

The replay & cascade rule (CRITICAL)

On the first run of a test, the agent authors each step and saves a recording. On every later run, each step replays from its recording — no agent, no LLM cost, much faster.

A step replays only if all of these hold:

A recording for that step exists,
Its prose is unchanged since the recording,
Its yaml block is unchanged,
No earlier step in the file invalidated it.

Consequences when editing tests:

A one-line tweak at the top of a 20-step test re-authors all 20 steps on the next run.
To re-record only one step, edit only that step (or steps after it).
--author forces full authoring for one run (debugging only).
rm -rf output-<stem>/ wipes the cache entirely.
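
The cascade can be modelled in a few lines of Python. This is a sketch of the rule as stated here, not kane-cli's implementation: how the tool detects changes is not described, so the `Step` flags below are hypothetical inputs, and treating a missing recording as invalidating later steps is an assumption.

```python
from dataclasses import dataclass


@dataclass
class Step:
    name: str
    has_recording: bool = True
    prose_changed: bool = False
    yaml_changed: bool = False


def plan_run(steps):
    """Return (step name, 'replay' | 'author') pairs for the next run."""
    plan, invalidated = [], False
    for step in steps:
        unchanged = (
            step.has_recording
            and not step.prose_changed
            and not step.yaml_changed
        )
        if invalidated or not unchanged:
            invalidated = True  # every later step is re-authored as well
            plan.append((step.name, "author"))
        else:
            plan.append((step.name, "replay"))
    return plan


steps = [
    Step("Open Amazon"),
    Step("Search for headphones", prose_changed=True),
    Step("Add to cart"),
]
print(plan_run(steps))
# [('Open Amazon', 'replay'), ('Search for headphones', 'author'), ('Add to cart', 'author')]
```

Editing the second step keeps the first step's recording but forces authoring from the edit onward, which is why edits near the top of a long test are expensive.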

`@import` for reusing flows

Extract a repeating flow (login, setup, cookie banner dismissal) into a helper file:

## Sign in
@import ./helpers/login.md

Rules:

Helper filename must not end in _test.md.
Path resolves relative to the importing file, not the shell's cwd.
The step body must be exactly @import <path> — no mixed prose, no extra lines.
The step's yaml block may contain only optional. Other keys are rejected.
optional: true on @import is allowed only at the root file, not on a nested import.
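
The first two rules can be illustrated with a short Python sketch. `resolve_import` is a hypothetical helper written for this example; it mirrors the documented behaviour (helpers must not end in _test.md, paths resolve against the importing file) and reuses the parse-error wording listed in §7.

```python
from pathlib import Path


def resolve_import(importing_file: str, import_path: str) -> Path:
    """Resolve an @import target relative to the file that contains it."""
    if import_path.endswith("_test.md"):
        # Test files are entry points, not helpers.
        raise ValueError("cannot @import a test file")
    # Relative to the importing file's directory, not the shell's cwd.
    return Path(importing_file).parent / import_path


target = resolve_import("tests/checkout_test.md", "./helpers/login.md")
print(target.as_posix())  # tests/helpers/login.md
```

Running the test from a different working directory does not change where the helper is looked up.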

Variables and context propagate into helpers. Chrome / mode / auth do not (root-only).

Editing a helper re-authors that step in every test that imports it, plus everything after the import in those tests. Same cascade rule.

Commands

Command	Use
`kane-cli testmd run <path> --agent [flags]`	Run a test
`kane-cli testmd list`	List `*_test.md` files under cwd (NDJSON when non-TTY)
`kane-cli testmd status <path>`	Test Manager identity + local-sync state
`kane-cli testmd export <path> [--code-language python|javascript]`	Regenerate code export from existing recordings (no browser launch)
`kane-cli testmd delete <path>`	Local-only delete: removes source + `output-<stem>/`. Does NOT delete from Test Manager.

Flags on testmd run that don't exist on §3 run:

Flag	Default	Description
`--name <name>`	none	Persist the run under this name. Regex `[a-zA-Z0-9_-]+`.
`--on-lock-conflict <readonly|fail|wait>`	none	Behavior when another user holds the test's edit lock. `readonly` = replay-only / no upload, `fail` = exit 2, `wait` = block until released
`--retry`	off	On replay failure, restart with a shrinking replay window
`--retry-count <n>`	`3`	Max retry restarts before falling back to full re-author
`--author`	off	Force authoring every step (skip replay decision)

All §3 run flags also apply (--agent, --headless, --max-steps, --timeout, --variables, etc.).

Flag wins over frontmatter for everything except variables — the file owns variables; you can add new keys via flags but cannot override file-defined ones.

Output: `output-<stem>/` and `Result.md`

After a run:

amazon_test.md
output-amazon/
  Result.md                      # human-readable run report
  .internal/                     # cached recordings — do not edit
  playwright-python-code/        # only if code_export enabled

output-<stem>/ is commit-safe and should be committed to git. That's how teammates and CI replay the same recordings.

For tests using @import, helper recordings land next to the helper file in helper-output-<helper>-<root>-<step>/ directories. Also commit-safe.

Result.md opens in any Markdown viewer. It contains:

Frontmatter — status, started, duration_s, session_id
One entry per root step, marked ✓ passed, ✗ failed, or ⏭ skipped, with an (optional) suffix when a soft-failing step failed but the run continued
For @import steps that failed, a path to the failing sub-step inside the helper

When the user asks "did the test pass?" or "where did it fail?" for a previously-run test, read Result.md rather than re-running the test.

Recording a `_test.md` from a live session

If the user runs an ad-hoc objective with §3 run and decides to keep it:

kane-cli run "Search for noise-cancelling headphones on amazon.com" --name amazon-search

CI invocation

kane-cli testmd run ./tests/checkout_test.md \
  --agent \
  --headless \
  --on-lock-conflict wait \
  --retry

--agent — NDJSON to stdout (auto-enabled when stdin is not a TTY; pass explicitly anyway).
--headless — no window.
--on-lock-conflict wait — block instead of failing if a teammate is editing the same test.
--retry — automatically recover transient replay failures.

Exit codes follow §3, with two codes covering additional cases:

2 also covers parse errors and --on-lock-conflict fail
3 also covers an --on-lock-conflict wait timeout

Parse errors (when writing a `_test.md`)

Parse errors abort before any browser launch with exit 2. Common ones and the fix:

Message	Fix
`frontmatter is missing closing '---'`	Add the trailing `---`
`invalid YAML in frontmatter`	Re-validate the YAML block
`step body must be exactly one of prose / @import`	Split into two steps
`step config on @import may only contain 'optional'`	Remove other keys from the yaml block
`cannot @import a test file`	Imports may only reference helpers (not ending in `_test.md`)
`cyclic reference`	Restructure helpers to break the loop
`chrome config is global-only`	Move Chrome key to root frontmatter
`'<key>' is run-level and cannot be set per-step`	Move `mode` / `on_lock_conflict` to root frontmatter
`unknown config key`	Remove or fix the key
`auth/identity keys are CLI-only`	Pass `username` / `access_key` as CLI flags, not in frontmatter

When the user reports a parse error, fix the file before retrying — don't loop on the same error.

8. Failure Handling & Log Inspection

When a run fails, diagnose before suggesting fixes.

Log Locations

The run_end event provides session_dir and run_dir paths. Use those directly.

{session_dir}/
├── session.json               # Session metadata, run list, upload status
├── tui.log                    # Timeline: session start, run start/end, errors
└── runs/{n}/
    └── run-test/
        └── actions.ndjson     # Step-by-step record of agent actions

Debugging Flow

Parse the run_end event from stdout — it has status, reason, and summary plus the session_dir / run_dir paths.
Read actions.ndjson in {run_dir}/run-test/ — each line is one agent action with its intent and outcome.
Check tui.log in {session_dir}/ — for session-level issues (Chrome launch, auth, upload).
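
Reading the step-by-step record can be sketched as below. `load_actions` is a hypothetical helper; the file location comes from the log layout above, while the fields inside each action record are not documented here, so the records are returned as plain dictionaries.

```python
import json
from pathlib import Path


def load_actions(run_dir: str) -> list[dict]:
    """Read {run_dir}/run-test/actions.ndjson into a list of dicts."""
    # run_dir may start with "~", as in the run_end example, so expand it.
    path = Path(run_dir).expanduser() / "run-test" / "actions.ndjson"
    with path.open(encoding="utf-8") as handle:
        return [json.loads(line) for line in handle if line.strip()]
```

A typical use is `load_actions(run_end["run_dir"])[-1]` to look at the last action before the failure.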

Common Failure Patterns

Symptom	Likely Cause	Fix
🔄 Agent repeats same action	Stuck in a loop / page didn't change	Rephrase objective, add explicit wait or assertion
🎯 Agent clicks wrong element	Ambiguous UI, multiple similar elements	Be more specific: "click the blue 'Submit' button in the checkout form"
👁️ Agent says done but didn't finish	Objective too vague	Add explicit assertions: "assert the confirmation page shows order number"
💀 Exit code 2, no steps	Auth or Chrome failure	Check `kane-cli whoami`, verify Chrome is available
⏱️ Exit code 3	Timeout or cancelled	Increase `--timeout` or `--max-steps`, or split into smaller objectives
🚫 "CDP endpoint not reachable"	Chrome not running	Let kane-cli manage Chrome (remove `--cdp-endpoint`)

9. Parallel Execution

For multiple independent browser tasks, decompose and run in parallel using the Agent tool.

When to Split

>15 steps — long runs drift and get stuck
Independent flows — login test and search test don't depend on each other
Different pages/features — settings vs checkout vs admin
Different user roles — admin flow vs regular user flow

How to Split

Each sub-objective must be self-contained: navigates to its own URL, authenticates independently, asserts its own outcomes. No sub-objective depends on another having run first.

Execution Pattern

Decompose the user's request into N independent sub-objectives

Spawn N Agent tool calls in a single message — each runs:

kane-cli run "Go to <url> and <sub-objective>" --agent --headless --timeout 120

Each agent parses the NDJSON output, waits for run_end, returns: status, steps, duration, summary, session path
After ALL agents complete, format the batch summary

Agent Prompt Template

Run this kane-cli browser test and report results:

    kane-cli run "Go to <url> and <objective>" --agent --headless --timeout 120

After the command completes:
1. Capture the exit code
2. Parse the run_end NDJSON event from stdout
3. If failed, read the failing step's screenshot from run_dir
4. Return: {status, steps, duration, summary, session_dir, failure_step, screenshot_path}

Batch Summary Format

## 🧪 Test Suite: <suite name>

| # | Test | Status | Steps | Time | What happened |
|---|------|--------|-------|------|---------|
| 1 | Login + dashboard | ✅ | 5 | 12s | Welcome banner visible |
| 2 | Product search | ✅ | 7 | 18s | 3 results for 'shoes' |
| 3 | Checkout flow | ❌ | 9 | 25s | Payment form did not load |
| 4 | Admin CSV export | ✅ | 6 | 15s | CSV downloaded (42 rows) |

### 📊 Overall
- **Pass rate:** 3/4 (75%)
- **Total steps:** 27 · **Total time:** 1m10s

### ❌ Failures
**#3 Checkout flow** — Payment form did not load after clicking "Credit Card".
📸 [screenshot of the failure shown inline]

Status icons: ✅ passed · ❌ failed · ⚠️ stuck/timeout

Do not show raw file paths (like ~/.testmuai/kaneai/sessions/...) in the summary. Instead, read the screenshot and show it inline, or offer to inspect logs only if the user asks.
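
The overall figures in the batch summary are simple aggregates over the per-agent results. The sketch below assumes each agent returned a dict with `status`, `steps`, and `duration` in seconds, as in the prompt template; `summarize` is a hypothetical helper.

```python
def summarize(results):
    """Compute the pass rate, total steps, and total time for a batch."""
    passed = sum(1 for result in results if result["status"] == "passed")
    total = len(results)
    steps = sum(result["steps"] for result in results)
    seconds = round(sum(result["duration"] for result in results))
    minutes, rest = divmod(seconds, 60)
    elapsed = f"{minutes}m{rest:02d}s" if minutes else f"{rest}s"
    rate = round(100 * passed / total) if total else 0
    return (
        f"Pass rate: {passed}/{total} ({rate}%) · "
        f"Total steps: {steps} · Total time: {elapsed}"
    )


# The four runs from the example table above.
results = [
    {"status": "passed", "steps": 5, "duration": 12},
    {"status": "passed", "steps": 7, "duration": 18},
    {"status": "failed", "steps": 9, "duration": 25},
    {"status": "passed", "steps": 6, "duration": 15},
]
print(summarize(results))
# Pass rate: 3/4 (75%) · Total steps: 27 · Total time: 1m10s
```

The total time here is the sum of run durations; for runs executed in parallel the wall-clock time will be shorter.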

10. Configuration & Reference

Config Commands

kane-cli config show                          # Show all current settings
kane-cli config set-window <W>x<H>           # Browser window size (e.g. 1920x1080)
kane-cli config chrome-profile <path>         # Chrome profile path (or interactive picker in TTY)
kane-cli config project <project-id>          # TMS project ID (or interactive picker in TTY)
kane-cli config folder <folder-id>            # TMS folder ID (or interactive picker in TTY)

Feedback

Submit feedback on a completed test run:

kane-cli feedback --test-id <id> --feedback-type <positive|negative> --details "..."

Directory Structure

~/.testmuai/kaneai/
├── tui-config.json              # Persistent CLI settings
├── config.json                  # Shared auth configuration
├── global-memory.md             # Global agent context
├── chrome-profile/              # Default Chrome user profile
├── profiles/                    # Stored credentials
│   └── {profile}/{env}/
│       └── credentials
├── sessions/                    # Session history
│   └── {session-id}/
│       ├── session.json         # Metadata, run list, upload status
│       ├── tui.log              # Session event log
│       ├── runs/{n}/
│       │   └── run-test/
│       │       └── actions.ndjson   # Step-by-step record of agent actions
│       └── code-export/         # (when --code-export) generated code files
└── variables/                   # Global variable files
    └── *.json

# Project-local overrides (in cwd):
.testmuai/
├── context.md                   # Project-specific agent context
└── variables/
    └── *.json                   # Project-specific variables

Chrome Management

kane-cli auto-launches Chrome with the Chrome DevTools Protocol (CDP) enabled on ports 9222–9230. Chrome runs as a detached process and outlives the CLI.

--headless — runs Chrome in headless mode (no visible window)
--cdp-endpoint <url> — connect to an already-running Chrome instance
--ws-endpoint <url> — connect to a remote browser (LambdaTest grid)

If Chrome fails to launch, ensure Google Chrome is installed and no other process is using CDP ports 9222–9230.
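
A quick way to see whether something is already listening on the CDP port range is sketched below. This is a generic socket probe, not a kane-cli feature, and `port_in_use` is a hypothetical helper. A busy port is not necessarily a conflict: because Chrome outlives the CLI, it may be a Chrome instance that kane-cli launched earlier.

```python
import socket


def port_in_use(port: int, host: str = "127.0.0.1") -> bool:
    """Return True if a TCP listener accepts connections on host:port."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as probe:
        probe.settimeout(0.5)
        return probe.connect_ex((host, port)) == 0


# Ports 9222 through 9230 inclusive, the range named above.
busy = [port for port in range(9222, 9231) if port_in_use(port)]
print("CDP ports already in use:", busy or "none")
```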
