browser-execute

Name: Browser Execute
Author: browser-use

// Use ONLY when calling the `browser_execute` tool or driving a real browser via the Chrome DevTools Protocol. Required reading before the first `browser_execute` call in a session. Covers the three connection methods (local Chrome with remote debugging, isolated debug-port profile, Browser Use cloud), the in-process `session` / `console` snippet model, attaching to a page target, common CDP commands, the per-project `.bcode/agent-workspace/` for reusable scripts, and screenshot auto-attachment.

تشغيل في Manus

$ git log --oneline --stat

stars:١١٠

forks:١٧

updated:١٦ مايو ٢٠٢٦ في ٠٤:٤٦

SKILL.md

readonly

package.json

"author": "browser-use"

"repository": "browser-use/browsercode"

فتح مستودع GitHub عرض مستودعات المنشئ

$ install --global

$ download --local

تشغيل في Manus

$ useful --forSOC

مديرو الشبكات وأنظمة الحاسوبمهن الحاسوب والرياضيات15-1244L4

# Linux google-chrome --remote-debugging-port=9222 --user-data-dir=./.bcode/chrome-data-dir # macOS "/Applications/Google Chrome.app/Contents/MacOS/Google Chrome" \ --remote-debugging-port=9222 --user-data-dir=./.bcode/chrome-data-dir # Windows (cmd.exe) "C:\Program Files\Google\Chrome\Application\chrome.exe" ^ --remote-debugging-port=9222 --user-data-dir=.\.bcode\chrome-data-dir # Windows (PowerShell) & "C:\Program Files\Google\Chrome\Application\chrome.exe" ` --remote-debugging-port=9222 --user-data-dir=.\.bcode\chrome-data-dir

// Resolve the live WebSocket URL via `/json/version` and connect: const ver = await fetch("http://127.0.0.1:9222/json/version").then(r => r.json()) await session.connect({ wsUrl: ver.webSocketDebuggerUrl })

// Provision and connect to a cloud browser const r = await fetch("https://api.browser-use.com/api/v3/browsers", { method: "POST", headers: { "X-Browser-Use-API-Key": process.env.BROWSER_USE_API_KEY, "Content-Type": "application/json" }, body: "{}", }) // Additional options: fetch https://docs.browser-use.com/cloud/api-v3/browsers/create-browser-session const { id, cdpUrl, liveUrl } = await r.json() const ver = await fetch(`${cdpUrl}/json/version`).then(r => r.json()) await session.connect({ wsUrl: ver.webSocketDebuggerUrl }) console.log("liveUrl for the user to watch:", liveUrl)

// Browser Use cloud will eventually close idle browsers. An explicit stop frees the slot: await fetch(`https://api.browser-use.com/api/v3/browsers/${id}`, { method: "PATCH", headers: { "X-Browser-Use-API-Key": process.env.BROWSER_USE_API_KEY, "Content-Type": "application/json" }, body: JSON.stringify({ action: "stop" }), })

const targets = (await session.Target.getTargets({})).targetInfos // Pick the first non-internal tab if none was specified. const page = targets.find(t => t.type === "page" && !t.url.startsWith("chrome://")) await session.use(page.targetId)

// Navigate. await session.Page.enable() await session.Page.navigate({ url: "https://example.com" }) await session.waitFor("Page.loadEventFired") // Evaluate JS in the page. const r = await session.Runtime.evaluate({ expression: "document.title", returnByValue: true, }) console.log(r.result.value) // Click by coordinates. const x = 200, y = 300 await session.Input.dispatchMouseEvent({ type: "mouseMoved", x, y }) await session.Input.dispatchMouseEvent({ type: "mousePressed", x, y, button: "left", clickCount: 1 }) await session.Input.dispatchMouseEvent({ type: "mouseReleased", x, y, button: "left", clickCount: 1 }) // Type text. await session.Input.insertText({ text: "hello" }) // Screenshot. await session.Page.captureScreenshot({ format: "png" }) // You see the image inline on the next turn — `browser_execute` automatically // attaches every `Page.captureScreenshot` result. No need to decode, save, or // `read` the bytes back. The base64 is still in `data` (via the return value) // for the rare case you want to process it programmatically.

// ./.bcode/agent-workspace/scrape_titles.ts (you write this with the `write` tool) export async function scrapeTitles(session: any, urls: string[]) { const titles: string[] = [] await session.Page.enable() for (const url of urls) { await session.Page.navigate({ url }) await session.waitFor("Page.loadEventFired") const r = await session.Runtime.evaluate({ expression: "document.title", returnByValue: true }) titles.push(r.result.value) } return titles }

// later snippet const path = process.cwd() + "/.bcode/agent-workspace/scrape_titles.ts" // Cache-bust (`?t=${Date.now()}`) is your responsibility: without it, edits to the file won't be picked up. const m = await import(`${path}?t=${Date.now()}`) const titles = await m.scrapeTitles(session, ["https://example.com", "https://example.org"]) console.log(JSON.stringify(titles))

browser-execute

Connecting

Way 1: connect to the user's running Chrome or Chromium-based browser (real profile, popup-gated).

Way 2: connect to a Chrome or Chromium-based browser launched with a debug port (isolated profile, no popups).

Way 3: provision and connect to a Browser Use cloud browser.

Way 4: user-preconfigured endpoint

Attaching to a target

Driving a page

Reusing code

Guardrails

Console