Run any Skill in Manus with one click

$pwd:

browser-to-api

Name: Browser To Api
Author: browserbase

// Turn a website's observable HTTP traffic into a best-effort OpenAPI 3.1 spec by analyzing a `browser-trace` capture. Use when the user wants to discover/extract API endpoints from a browser session, build an OpenAPI doc from network traffic, or document a third-party site's XHR/fetch surface for client integration.

Run Skill in Manus

$ git log --oneline --stat

stars:3,409

forks:225

updated:May 17, 2026 at 08:05

File Explorer

14 files

SKILL.md

readonly

related-skills.json

same repository

autobrowse.md

from "browserbase/skills"

Self-improving browser automation via the auto-research loop. Iteratively runs a browsing task, reads the trace, and improves the navigation skill (strategy.md) until it reliably passes. Supports parallel runs across multiple tasks using sub-agents. Use when you want to build or improve browser automation skills for specific website tasks.

2026-05-183.4k

browser.md

from "browserbase/skills"

Automate web browser interactions using natural language via CLI commands. Use when the user asks to browse websites, navigate web pages, extract data from websites, take screenshots, fill forms, click buttons, or interact with web applications. Supports remote Browserbase sessions with Browserbase Identity, Verified browsers, automatic CAPTCHA solving, and residential proxies — ideal for protected websites and JavaScript-heavy pages.

2026-05-183.4k

browser-trace.md

from "browserbase/skills"

Capture a full DevTools-protocol trace of any browser automation — CDP firehose, screenshots, and DOM dumps — then bisect the stream into per-page searchable buckets. Use when the user wants to debug a failed run, audit network/console/DOM activity, attach a trace to an in-progress session, or feed structured per-page summaries back into an agent loop so its next iteration learns from the last one.

2026-05-183.4k

cookie-sync.md

from "browserbase/skills"

Sync cookies from local Chrome to a Browserbase persistent context so the browse CLI can access authenticated sites. Use when the user wants to browse as themselves, sync cookies, or log into sites via Browserbase.

2026-05-183.4k

fetch.md

from "browserbase/skills"

Use this skill when the user wants to retrieve a URL without a full browser session: fetch HTML or JSON from static pages, inspect status codes or headers, follow redirects, or get page source for simple scraping. Prefer it over a browser when JavaScript rendering and page interaction are not needed. Supports proxies and redirect control.

2026-05-183.4k

ui-test.md

from "browserbase/skills"

AI-powered adversarial UI testing via the browse CLI. Analyzes git diffs to test only what changed, or explores the full app to find bugs. Tests functional correctness, accessibility, responsive layout, and UX heuristics. Use when the user asks to test UI changes, QA a pull request, audit accessibility, or run exploratory testing. Supports local browser (localhost) and remote Browserbase (deployed sites).

2026-05-183.4k

package.json

"author": "browserbase"

"repository": "browserbase/skills"

View GitHub Repository View Creator Repositories

$ install --global

$ download --local

Run Skill in Manus

$ useful --forSOC

Software DevelopersComputer and Mathematical Occupations15-1252L4

name	browser-to-api
description	Turn a website's observable HTTP traffic into a best-effort OpenAPI 3.1 spec by analyzing a `browser-trace` capture. Use when the user wants to discover/extract API endpoints from a browser session, build an OpenAPI doc from network traffic, or document a third-party site's XHR/fetch surface for client integration.
compatibility	Requires Node 18+ and a `browser-trace` run directory (`.o11y/<run>/`) produced by the sibling `browser-trace` skill. The scripts use only the Node standard library — no `npm install` step. `jq` is referenced in docs for ad-hoc querying but is not required by the scripts.
license	MIT
allowed-tools	Bash, Read, Grep

Browser to API

Replay-driven API discovery. Consume a browser-trace capture, pair its CDP request / response events, templatize observed URLs, infer JSON schemas from samples, and emit an OpenAPI 3.1 document plus a human-readable coverage report.

This skill does not capture traffic. It is purely offline post-processing on top of browser-trace's cdp/network/*.jsonl buckets. The two skills compose:

browser-trace    →  .o11y/<run>/cdp/network/{requests,responses}.jsonl
browser-to-api   →  .o11y/<run>/api-spec/index.html + openapi.yaml + client.mjs

When to use

The user wants an OpenAPI document for a third-party or undocumented website API.
The user has a browser-trace run and wants endpoints + schemas extracted from it.
The user is building a client/SDK against a site that doesn't publish a spec.
The user wants a coverage report showing which flows would broaden the spec.

If the user wants to capture traffic, send them to browser-trace first.

Two-step workflow

1. Capture with `browser-trace` (and optionally bodies via `browse network on`)

# Local example against an existing debuggable Chrome target
TARGET=9222

node ../browser-trace/scripts/start-capture.mjs "$TARGET" my-site
browse open about:blank --cdp "$TARGET"
browse network on                                    # capture request/response bodies
browse open https://example.com
# ...drive whatever flows you want covered...

# Snapshot the bodies dir BEFORE turning capture off (the temp dir is shared
# per-session, so subsequent `browse network on` runs would mix your bodies
# with whatever a future capture writes if you skip this step).
cp -r "$(browse network path | jq -r .path)" .o11y/my-site/cdp/network/bodies/
browse network off

node ../browser-trace/scripts/stop-capture.mjs my-site
node ../browser-trace/scripts/bisect-cdp.mjs my-site

browse network on is optional but strongly recommended — without it, the spec has no response-body schemas (the CDP firehose used by browse cdp does not embed bodies). With it, both request bodies (already captured by CDP) and response bodies are joined into the trace by CDP requestId.

2. Generate the spec

node scripts/discover.mjs --run .o11y/my-site
# → .o11y/my-site/api-spec/index.html          ← open this
#   .o11y/my-site/api-spec/client.mjs
#   .o11y/my-site/api-spec/openapi.yaml
#   .o11y/my-site/api-spec/openapi.json
#   .o11y/my-site/api-spec/report.md
#   .o11y/my-site/api-spec/confidence.json
#   .o11y/my-site/api-spec/samples/*.json
#   .o11y/my-site/api-spec/intermediate/*.jsonl

discover.mjs auto-detects <run>/cdp/network/bodies/. To use a body capture from elsewhere (e.g. didn't snapshot, want the live browse network dir), pass --bodies <path> explicitly.

3. Open the HTML report

After discover.mjs finishes, always open the generated HTML report:

open .o11y/my-site/api-spec/index.html

The report is a self-contained HTML file (no server needed) that shows each discovered operation as an expandable card with variables, client usage, request/response examples, and a generated client.mjs snippet at the bottom. This is the primary deliverable — always open it for the user.

CLI flags

Flag	Required	Meaning
`--run <path>`	yes	Path to a `browser-trace` run directory
`--out <path>`	no	Output dir; default `<run>/api-spec/`
`--bodies <path>`	no	`browse network` capture dir to join into the trace (auto-detected from `<run>/cdp/network/bodies/` when present)
`--include <regex>`	no	Only include URLs matching regex (repeatable)
`--exclude <regex>`	no	Exclude URLs matching regex (repeatable; in addition to defaults)
`--origins <list>`	no	Comma-separated origin allow-list (e.g. `api.example.com,example.com`)
`--format <yaml\|json\|both>`	no	Output format. Default `both`
`--title <string>`	no	OpenAPI `info.title`. Default derived from primary origin
`--redact <list>`	no	Extra header names / JSON keys to redact (comma-separated)
`--min-samples <n>`	no	Minimum samples per endpoint to include. Default `1`
`--stage <name>`	no	Run only one stage: `load`, `filter`, `normalize`, `infer`, `emit`

Output layout

<run>/api-spec/
├── index.html                visual report — open this (self-contained, no server)
├── client.mjs                zero-dep fetch client with typed functions per operation
├── openapi.yaml              machine-readable spec
├── openapi.json              mirror
├── report.md                 markdown summary + curl examples
├── confidence.json           per-endpoint confidence + normalization flags
├── samples/                  redacted request/response examples
│   └── <method>__<path-hash>.json
└── intermediate/             pipeline byproducts (paired/filtered/endpoints jsonl)

What you get from `browse cdp` and `browse network`

Two complementary capture sources:

Source	Provides	Limitation
`browse cdp` (used by `browser-trace`)	request method/URL/headers/`postData`, response status/headers/mimeType, full event timing	Does not embed response bodies. Bodies must be pulled with `Network.getResponseBody`, which the firehose doesn't do.
`browse network on` (separate command)	request bodies AND response bodies on disk, keyed by CDP `requestId`	Capture dir is shared per `browse` session; snapshot before another `browse network on` overwrites it.

discover.mjs will pull bodies from a browse network dir if you pass --bodies <path> (or stash them under <run>/cdp/network/bodies/, which is auto-detected). The matching is by requestId — browse network writes that into each request.json as id, and we join directly.

What changes when bodies are present:

✅ Path templating, query-param schemas, status codes, content-types — same either way.
✅ Request-body schemas — postData from CDP is enough; bodies dir is a nice-to-have for non-postData cases.
✅ Response-body schemas — fully inferred from real samples. Without bodies you get { description, content: <mimeType> } skeletons.

The report flags every endpoint that has no response-body sample.

Automatic noise filtering

The normalize stage automatically classifies and drops infrastructure noise:

Tracking / analytics — paths containing /track, /pixel, /beacon, /impression, /pageview, /dag/v*
Bot defense — Akamai (/akam/), fingerprint payloads (sensor_data), obfuscated multi-segment paths
Session plumbing — /session, /authenticate/start, cookie consent, A/B experiment endpoints
HTML page renders — GET requests returning text/html (the rendered page, not the API)

This typically drops 60-80% of captured traffic. The --include flag can rescue a false positive.

GraphQL / multiplexed endpoint decomposition

When a single endpoint (like /dapi/fe/gql) is called with different operationName values, the skill automatically splits it into separate logical operations. Each gets its own:

OpenAPI path entry (e.g. /dapi/fe/gql [Autocomplete])
Request/response schema inferred from only that operation's samples
Curl example and variables table in the report

Detection works on body fields (operationName, method, action) and query params (opname, op). This covers GraphQL (APQ and inline), JSON-RPC, and similar dispatch patterns.

Limitations

Coverage is bounded by the captured flow. Endpoints not exercised in the trace will not appear. The skill cannot prove completeness.
Schemas are inductive, not contractual. A field might be optional on the server even if every sample contained it.
Auth is observed, not specified. The skill records auth-shaped headers in an x-observed-auth extension but won't claim a security scheme.
Path templating is heuristic. Numeric / UUID / hex / slug patterns are detected per segment. Ambiguous URLs are flagged in confidence.json.
Redaction is best-effort. Default redactions cover common credentials, but app-specific secrets may slip through; use --redact for known custom headers/keys.

Best practices

Drive the flows you want documented. The richer the browser-trace, the richer the spec.
Use --origins for noisy sites. A marketing page hits dozens of analytics hosts; restrict to the API origin you care about.
Inspect report.md first. It has curl-ready examples and response samples for every discovered operation.
Bump --min-samples to 2+ when you want only confidently-shaped endpoints in the final doc — drop the long tail.
Pair with browse network on when response-body schemas matter. The CDP firehose alone has request bodies but not response bodies.

For pipeline internals and the file format reference, see REFERENCE.md.

browser-to-api

More from this repository

More from this repository

Browser to API

When to use

Two-step workflow

1. Capture with browser-trace (and optionally bodies via browse network on)

2. Generate the spec

3. Open the HTML report

CLI flags

Output layout

What you get from browse cdp and browse network

Automatic noise filtering

GraphQL / multiplexed endpoint decomposition

Limitations

Best practices

Browser to API

When to use

Two-step workflow

1. Capture with browser-trace (and optionally bodies via browse network on)

2. Generate the spec

3. Open the HTML report

CLI flags

Output layout

What you get from browse cdp and browse network

Automatic noise filtering

GraphQL / multiplexed endpoint decomposition

Limitations

Best practices

1. Capture with `browser-trace` (and optionally bodies via `browse network on`)

What you get from `browse cdp` and `browse network`

1. Capture with `browser-trace` (and optionally bodies via `browse network on`)

What you get from `browse cdp` and `browse network`