원클릭으로
browserwing-executor
// Control browser automation through HTTP API. Supports page navigation, element interaction (click, type, select), data extraction, accessibility snapshot analysis, screenshot, JavaScript execution, and batch operations.
// Control browser automation through HTTP API. Supports page navigation, element interaction (click, type, select), data extraction, accessibility snapshot analysis, screenshot, JavaScript execution, and batch operations.
Persistent local memory for OpenClaw agents. Use when users say: - "install memos" - "install MemOS" - "setup memory" - "add memory plugin" - "openclaw memory" - "memos onboarding" - "memory not working" - "configure memory" - "enable memory" - "upgrade MemOS" - "update memory plugin"
Use the MemOS Local memory system to search and use the user's past conversations. Use this skill whenever the user refers to past chats, their own preferences or history, or when you need to answer from prior context. When auto-recall returns nothing (long or unclear user query), generate your own short search query and call memory_search. Available tools: memory_search, memory_get, memory_write_public, memory_share, memory_unshare, task_summary, skill_get, skill_search, skill_install, skill_publish, skill_unpublish, network_memory_detail, network_skill_pull, network_team_info, memory_timeline, memory_viewer.
Ask users questions via the UI. Use when you need clarification, user preferences, or confirmation before proceeding. The user CANNOT see CLI output - this tool is the ONLY way to communicate with them.
Browser automation with persistent page state. Use when users ask to navigate websites, fill forms, take screenshots, extract web data, test web apps, or automate browser workflows. Trigger phrases include "go to [url]", "click on", "fill out the form", "take a screenshot", "scrape", "automate", "test the website", "log into", or any browser interaction request.
Manage and operate BrowserWing — an intelligent browser automation platform. Install dependencies, configure LLM, create/manage/execute automation scripts, use AI-driven exploration to generate scripts, browse the script marketplace, and troubleshoot issues.
| name | browserwing-executor |
| description | Control browser automation through HTTP API. Supports page navigation, element interaction (click, type, select), data extraction, accessibility snapshot analysis, screenshot, JavaScript execution, and batch operations. |
BrowserWing Executor provides comprehensive browser automation capabilities through HTTP APIs. You can control browser navigation, interact with page elements, extract data, and analyze page structure.
API Base URL: http://localhost:8080/api/v1/executor
Authentication: Use X-BrowserWing-Key: <api-key> header or Authorization: Bearer <token>
IMPORTANT: Always call this endpoint first to see all available commands and their parameters.
curl -X GET 'http://localhost:8080/api/v1/executor/help'
Response: Returns complete list of all commands with parameters, examples, and usage guidelines.
Query specific command:
curl -X GET 'http://localhost:8080/api/v1/executor/help?command=extract'
CRITICAL: Always call this after navigation to understand page structure and get element RefIDs.
curl -X GET 'http://localhost:8080/api/v1/executor/snapshot'
Response Example:
{
"success": true,
"snapshot_text": "Clickable Elements:\n @e1 Login (role: button)\n @e2 Sign Up (role: link)\n\nInput Elements:\n @e3 Email (role: textbox) [placeholder: your@email.com]\n @e4 Password (role: textbox)"
}
Use Cases:
curl -X POST 'http://localhost:8080/api/v1/executor/navigate' \
-H 'Content-Type: application/json' \
-d '{"url": "https://example.com"}'
curl -X POST 'http://localhost:8080/api/v1/executor/click' \
-H 'Content-Type: application/json' \
-d '{"identifier": "@e1"}'
Identifier formats:
@e1, @e2 (from snapshot)#button-id, .class-name//button[@type='submit']Login (text content)curl -X POST 'http://localhost:8080/api/v1/executor/type' \
-H 'Content-Type: application/json' \
-d '{"identifier": "@e3", "text": "user@example.com"}'
curl -X POST 'http://localhost:8080/api/v1/executor/extract' \
-H 'Content-Type: application/json' \
-d '{
"selector": ".product-item",
"fields": ["text", "href"],
"multiple": true
}'
curl -X POST 'http://localhost:8080/api/v1/executor/wait' \
-H 'Content-Type: application/json' \
-d '{"identifier": ".loading", "state": "hidden", "timeout": 10}'
curl -X POST 'http://localhost:8080/api/v1/executor/batch' \
-H 'Content-Type: application/json' \
-d '{
"operations": [
{"type": "navigate", "params": {"url": "https://example.com"}, "stop_on_error": true},
{"type": "click", "params": {"identifier": "@e1"}, "stop_on_error": true},
{"type": "type", "params": {"identifier": "@e3", "text": "query"}, "stop_on_error": true}
]
}'
Step-by-step workflow:
Discover commands: Call GET /help to see all available operations and their parameters (do this first if unsure).
Navigate: Use POST /navigate to open the target webpage.
Analyze page: Call GET /snapshot to understand page structure and get element RefIDs.
Interact: Use element RefIDs (like @e1, @e2) or CSS selectors to:
POST /clickPOST /typePOST /selectPOST /waitExtract data: Use POST /extract to get information from the page.
Present results: Format and show extracted data to the user.
User Request: "Search for 'laptop' on example.com and get the first 5 results"
Your Actions:
curl -X POST 'http://localhost:8080/api/v1/executor/navigate' \
-H 'Content-Type: application/json' \
-d '{"url": "https://example.com/search"}'
curl -X GET 'http://localhost:8080/api/v1/executor/snapshot'
Response shows: @e3 Search (role: textbox) [placeholder: Search...]
curl -X POST 'http://localhost:8080/api/v1/executor/type' \
-H 'Content-Type: application/json' \
-d '{"identifier": "@e3", "text": "laptop"}'
curl -X POST 'http://localhost:8080/api/v1/executor/press-key' \
-H 'Content-Type: application/json' \
-d '{"key": "Enter"}'
curl -X POST 'http://localhost:8080/api/v1/executor/wait' \
-H 'Content-Type: application/json' \
-d '{"identifier": ".search-results", "state": "visible", "timeout": 10}'
curl -X POST 'http://localhost:8080/api/v1/executor/extract' \
-H 'Content-Type: application/json' \
-d '{
"selector": ".result-item",
"fields": ["text", "href"],
"multiple": true
}'
Found 15 results for 'laptop':
1. Gaming Laptop - $1299 (https://...)
2. Business Laptop - $899 (https://...)
...
POST /navigate - Navigate to URLPOST /go-back - Go back in historyPOST /go-forward - Go forward in historyPOST /reload - Reload current pagePOST /click - Click element (supports: RefID @e1, CSS selector, XPath, text content)POST /type - Type text into input (supports: RefID @e3, CSS selector, XPath)POST /select - Select dropdown optionPOST /hover - Hover over elementPOST /wait - Wait for element state (visible, hidden, enabled)POST /press-key - Press keyboard key (Enter, Tab, Ctrl+S, etc.)POST /extract - Extract data from elements (supports multiple elements, custom fields)POST /get-text - Get element text contentPOST /get-value - Get input element valueGET /page-info - Get page URL and titleGET /page-text - Get all page textGET /page-content - Get full HTMLGET /snapshot - Get accessibility snapshot (⭐ ALWAYS call after navigation)GET /clickable-elements - Get all clickable elementsGET /input-elements - Get all input elementsPOST /screenshot - Take page screenshot (base64 encoded)POST /evaluate - Execute JavaScript codePOST /batch - Execute multiple operations in sequencePOST /scroll-to-bottom - Scroll to page bottomPOST /resize - Resize browser windowPOST /tabs - Manage browser tabs (list, new, switch, close)POST /fill-form - Intelligently fill multiple form fields at onceGET /console-messages - Get browser console messages (logs, warnings, errors)GET /network-requests - Get network requests made by the pagePOST /handle-dialog - Configure JavaScript dialog (alert, confirm, prompt) handlingPOST /file-upload - Upload files to input elementsPOST /drag - Drag and drop elementsPOST /close-page - Close the current page/tabYou can identify elements using:
RefID (Recommended): @e1, @e2, @e3
/snapshot endpoint"identifier": "@e1"CSS Selector: #id, .class, button[type="submit"]
"identifier": "#login-button"XPath: //button[@id='login'], //a[contains(text(), 'Submit')]
"identifier": "//button[@id='login']"Text Content: Login, Sign Up, Submit
"identifier": "Login"ARIA Label: Elements with aria-label attribute
Before starting:
GET /help if you're unsure about available commands or their parametersDuring automation:
/snapshot after navigation to get page structure and RefIDs@e1) over CSS selectors for reliability and stability/wait for dynamic content that loads asynchronously/batch for multiple sequential operations to improve efficiencyError handling:
/snapshot again to refresh page structureData extraction:
fields parameter to specify what to extract: ["text", "href", "src"]multiple: true to extract from multiple elementsScenario: User wants to login to a website
User: "Please log in to example.com with username 'john' and password 'secret123'"
Your Actions:
Step 1: Navigate to login page
POST http://localhost:8080/api/v1/executor/navigate
{"url": "https://example.com/login"}
Step 2: Get page structure
GET http://localhost:8080/api/v1/executor/snapshot
Response:
Clickable Elements:
@e1 Login (role: button)
Input Elements:
@e2 Username (role: textbox)
@e3 Password (role: textbox)
Step 3: Enter username
POST http://localhost:8080/api/v1/executor/type
{"identifier": "@e2", "text": "john"}
Step 4: Enter password
POST http://localhost:8080/api/v1/executor/type
{"identifier": "@e3", "text": "secret123"}
Step 5: Click login button
POST http://localhost:8080/api/v1/executor/click
{"identifier": "@e1"}
Step 6: Wait for login success (optional)
POST http://localhost:8080/api/v1/executor/wait
{"identifier": ".welcome-message", "state": "visible", "timeout": 10}
Step 7: Inform user
"Successfully logged in to example.com!"
Scenario: Fill out a form with multiple fields
Instead of making 5 separate API calls, use one batch operation:
curl -X POST 'http://localhost:8080/api/v1/executor/batch' \
-H 'Content-Type: application/json' \
-d '{
"operations": [
{
"type": "navigate",
"params": {"url": "https://example.com/form"},
"stop_on_error": true
},
{
"type": "type",
"params": {"identifier": "#name", "text": "John Doe"},
"stop_on_error": true
},
{
"type": "type",
"params": {"identifier": "#email", "text": "john@example.com"},
"stop_on_error": true
},
{
"type": "select",
"params": {"identifier": "#country", "value": "United States"},
"stop_on_error": true
},
{
"type": "click",
"params": {"identifier": "#submit"},
"stop_on_error": true
}
]
}'
/help or /help?command=<name> to learn about commands/snapshot after navigation to understand the page/wait before interacting with elements that load asynchronously/batch for multiple sequential operations/type for each field: @e1, @e2, etc./select for dropdowns/wait/extract with CSS selector and multiple: true["text", "href", "src"]@e2@e3@e1wait_visible: true (default) for reliable element interactionlocalhost:8080 with actual API host addressX-BrowserWing-Key header or JWT tokenElement not found:
/snapshot to see available elementsTimeout errors:
/wait with appropriate state before interactionExtraction returns empty:
/wait first)# Discover commands
GET localhost:8080/api/v1/executor/help
# Navigate
POST localhost:8080/api/v1/executor/navigate {"url": "..."}
# Get page structure
GET localhost:8080/api/v1/executor/snapshot
# Click element
POST localhost:8080/api/v1/executor/click {"identifier": "@e1"}
# Type text
POST localhost:8080/api/v1/executor/type {"identifier": "@e3", "text": "..."}
# Extract data
POST localhost:8080/api/v1/executor/extract {"selector": "...", "fields": [...], "multiple": true}
All operations return:
{
"success": true,
"message": "Operation description",
"timestamp": "2026-01-15T10:30:00Z",
"data": {
// Operation-specific data
}
}
Error response:
{
"error": "error.operationFailed",
"detail": "Detailed error message"
}