ワンクリックで
agent-browser
Advanced browser automation for AI agents with snapshot-ref interaction pattern - navigate, snapshot interactive elements with refs, click/fill/select by refs, manage sessions, and extract structured data
メニュー
Advanced browser automation for AI agents with snapshot-ref interaction pattern - navigate, snapshot interactive elements with refs, click/fill/select by refs, manage sessions, and extract structured data
Generate and compare multiple implementation variants with benchmarking and scoring
Design RESTful APIs with best practices, conventions, and OpenAPI specs
Generate API documentation from code - produce OpenAPI/Swagger specs, Markdown API references, request/response examples, and interactive documentation from source code analysis
Universal API gateway - connect to 100+ APIs with a unified interface, manage API keys, and chain API calls
API测试技能 - IPC处理器发现、测试生成、健康检查、回归测试
双阶段架构模式 - 先规划后编辑,分离推理与代码修改,提升多文件编辑准确率
| name | agent-browser |
| display-name | Agent Browser |
| description | Advanced browser automation for AI agents with snapshot-ref interaction pattern - navigate, snapshot interactive elements with refs, click/fill/select by refs, manage sessions, and extract structured data |
| version | 1.0.0 |
| category | automation |
| user-invocable | true |
| tags | ["browser","agent","automation","snapshot","form","screenshot","session","ref"] |
| capabilities | ["snapshot-navigation","ref-interaction","session-management","form-automation","data-extraction","screenshot-capture","state-persistence"] |
| handler | ./handler.js |
| os | ["win32","darwin","linux"] |
| tools | ["agent-browse","agent-snapshot","agent-click","agent-fill","agent-screenshot","agent-extract","agent-wait","agent-session"] |
| dependencies | ["browser-automation","computer-use"] |
| instructions | Use this skill for advanced browser automation with the snapshot-ref pattern: 1. Navigate to URL 2. Snapshot to get element refs (@e1, @e2...) 3. Interact using refs 4. Re-snapshot after DOM changes. Supports session persistence, form filling, data extraction, and screenshots. Integrates with the built-in browser engine and Computer Use agent. |
| examples | [{"input":"open https://example.com and snapshot","action":"open"},{"input":"fill @e1 with user@example.com","action":"fill"},{"input":"click @e3 and wait for navigation","action":"click"},{"input":"screenshot the current page","action":"screenshot"}] |
| author | ChainlessChain |
| license | MIT |
Advanced browser automation using the snapshot-ref interaction pattern.
Every browser automation follows this pattern:
open <url>@e1, @e2, ...)/agent-browser open https://example.com
/agent-browser snapshot
# Output: @e1 [input email], @e2 [input password], @e3 [button] "Login"
/agent-browser fill @e1 "user@example.com"
/agent-browser fill @e2 "secret"
/agent-browser click @e3
/agent-browser snapshot # Check result
| Command | Description | Example |
|---|---|---|
open | Navigate to URL | open https://example.com |
snapshot | Get interactive elements with refs | snapshot |
click | Click element by ref | click @e1 |
fill | Clear and type into element | fill @e2 "text" |
type | Type without clearing | type @e2 "text" |
select | Select dropdown option | select @e3 "option" |
screenshot | Capture page screenshot | screenshot |
extract | Extract text/data from element | extract @e5 |
wait | Wait for element or condition | wait @e1 or wait 2000 |
/agent-browser session-save auth.json # Save cookies/state
/agent-browser session-load auth.json # Restore state
open https://example.com/signup → snapshot → fill fields → click submit → snapshot
open <url> → snapshot → extract @e5 → extract body
open <url> → snapshot → click @link → wait → snapshot → repeat
@e1, @e2 are assigned by snapshot to interactive elementsclick "Login button"