| name | agent-browser |
| description | Advanced browser automation tool for web testing, form filling, screenshots, data extraction, and structured information retrieval. Supports complex workflows including authentication, dynamic content handling, and formatted table output. Essential for web scraping, testing, and data collection tasks. |
Browser Automation with agent-browser
Quick start
agent-browser open <url>
agent-browser snapshot -i
agent-browser click @e1
agent-browser fill @e2 "text"
agent-browser close
Core workflow
- Navigate:
agent-browser open <url>
- Snapshot:
agent-browser snapshot -i (returns elements with refs like @e1, @e2)
- Interact using refs from the snapshot
- Re-snapshot after navigation or significant DOM changes
Data Extraction Workflow
For extracting structured data from web pages:
- Navigate to target page
- Take snapshot to identify data containers
- Extract data using
get text or other commands
- Format extracted data into tables for presentation
Commands
Navigation
agent-browser open <url>
agent-browser back
agent-browser forward
agent-browser reload
agent-browser close
Snapshot (page analysis)
agent-browser snapshot
agent-browser snapshot -i
agent-browser snapshot -c
agent-browser snapshot -d 3
Interactions (use @refs from snapshot)
agent-browser click @e1
agent-browser dblclick @e1
agent-browser fill @e2 "text"
agent-browser type @e2 "text"
agent-browser press Enter
agent-browser press Control+a
agent-browser hover @e1
agent-browser check @e1
agent-browser uncheck @e1
agent-browser select @e1 "value"
agent-browser scroll down 500
agent-browser scrollintoview @e1
Get information
agent-browser get text @e1
agent-browser get value @e1
agent-browser get title
agent-browser get url
Screenshots
agent-browser screenshot
agent-browser screenshot path.png
agent-browser screenshot --full
Wait
agent-browser wait @e1
agent-browser wait 2000
agent-browser wait --text "Success"
agent-browser wait --load networkidle
Semantic locators (alternative to refs)
agent-browser find role button click --name "Submit"
agent-browser find text "Sign In" click
agent-browser find label "Email" fill "user@test.com"
Example: Form submission
agent-browser open https://example.com/form
agent-browser snapshot -i
agent-browser fill @e1 "user@example.com"
agent-browser fill @e2 "password123"
agent-browser click @e3
agent-browser wait --load networkidle
agent-browser snapshot -i
Example: Authentication with saved state
agent-browser open https://app.example.com/login
agent-browser snapshot -i
agent-browser fill @e1 "username"
agent-browser fill @e2 "password"
agent-browser click @e3
agent-browser wait --url "**/dashboard"
agent-browser state save auth.json
agent-browser state load auth.json
agent-browser open https://app.example.com/dashboard
Example: Data extraction and table formatting
agent-browser open https://example.com/products
agent-browser snapshot -i
agent-browser get text @product1
agent-browser get text @price1
Sessions (parallel browsers)
agent-browser --session test1 open site-a.com
agent-browser --session test2 open site-b.com
agent-browser session list
JSON output (for parsing)
Add --json for machine-readable output:
agent-browser snapshot -i --json
agent-browser get text @e1 --json
Output Formatting
Table Rendering (REQUIRED)
When presenting structured data such as lists, rankings, or tabular information extracted from web pages, output MUST be rendered in markdown table format for better readability.
Requirements:
- Use markdown tables for any tabular data
- Include appropriate headers for columns
- Ensure data is properly aligned and formatted
- Use consistent formatting across similar data types
- Include relevant metadata (URLs, dates, prices, etc.) as table columns
Example Table Output:
| 主播名称 | 用户名 | 平台 | 收费 | 到期日期 | 直播链接 |
|---|
| 谪止 | fanxiaofeng2020 | 抖音 | ¥40.00 | 2026-01-08 | 链接 |
| 小烦o3o | yml01515 | 抖音 | ¥30.00 | 2026-01-08 | 链接 |
Data Extraction Best Practices
- When extracting multiple similar items, organize them into tables
- Use descriptive column headers in Chinese when appropriate
- Sort data logically when possible (e.g., by date, relevance, or priority)
- Include clickable links using markdown format:
[text](url)
- Ensure all monetary values include currency symbols
- Format dates consistently (YYYY-MM-DD)
Debugging
agent-browser open example.com --headed
agent-browser console
agent-browser errors