원클릭으로
web-scraping
scrape websites, extract data, crawl pages, and browse the web at scale using kernel cloud browsers with stealth mode and proxies.
Codex 또는 Claude로 설치 이 Prompt를 복사해 Codex, Claude 또는 다른 어시스턴트에 붙여 넣으면 Skill 페이지를 검토하고 설치를 진행할 수 있습니다.
메뉴
scrape websites, extract data, crawl pages, and browse the web at scale using kernel cloud browsers with stealth mode and proxies.
Codex 또는 Claude로 설치 이 Prompt를 복사해 Codex, Claude 또는 다른 어시스턴트에 붙여 넣으면 Skill 페이지를 검토하고 설치를 진행할 수 있습니다.
SOC 직업 분류 기준
| name | web-scraping |
| description | scrape websites, extract data, crawl pages, and browse the web at scale using kernel cloud browsers with stealth mode and proxies. |
kernel cloud browsers let you scrape at scale with stealth mode, residential proxies, and up to 72-hour sessions. no charges for idle time.
stealth mode automatically adds a recaptcha solver and residential proxy to your browsers. use it for any site with bot detection.
const browser = await kernel.browsers.create({ stealth: true });
proxy quality for bot detection avoidance, best to worst:
await page.goto("https://example.com/products");
await page.waitForSelector(".product-card");
const products = await page.$$eval(".product-card", cards =>
cards.map(card => ({
name: card.querySelector("h2")?.textContent?.trim(),
price: card.querySelector(".price")?.textContent?.trim(),
url: card.querySelector("a")?.href,
}))
);
return products;
const allItems = [];
let hasNext = true;
while (hasNext) {
const items = await page.$$eval(".item", els =>
els.map(e => e.textContent?.trim())
);
allItems.push(...items);
const nextBtn = await page.$("a.next-page");
if (nextBtn) {
await nextBtn.click();
await page.waitForLoadState("networkidle");
} else {
hasNext = false;
}
}
return allItems;
let previousHeight = 0;
while (true) {
const currentHeight = await page.evaluate(() => document.body.scrollHeight);
if (currentHeight === previousHeight) break;
previousHeight = currentHeight;
await page.evaluate(() => window.scrollTo(0, document.body.scrollHeight));
await page.waitForTimeout(2000);
}
return await page.$$eval(".item", els => els.map(e => e.textContent));
launch multiple browsers for parallel scraping:
const urls = ["https://site.com/1", "https://site.com/2", "https://site.com/3"];
const results = await Promise.all(
urls.map(async (url) => {
const browser = await kernel.browsers.create({ stealth: true });
const pw = await chromium.connectOverCDP(browser.cdp_ws_url);
const page = pw.contexts()[0].pages()[0];
try {
await page.goto(url);
return await page.title();
} finally {
await pw.close();
await kernel.browsers.deleteByID(browser.session_id);
}
})
);
pre-warm a pool of browsers for instant acquisition. no cold-start latency.
const pool = await kernel.pools.create({
name: "scraper-pool",
size: 10,
stealth: true,
proxy_id: "residential-proxy-id",
});
// acquire, scrape, release — browsers recycle instantly
for (const url of urls) {
const browser = await kernel.pools.acquire(pool.id);
// scrape...
await kernel.pools.release(pool.id, browser.session_id);
}
kernel supports sessions up to 72 hours. no charges for idle time — you can pause overnight and resume the next day from the same point.
const browser = await kernel.browsers.create({
stealth: true,
timeout_seconds: 259200, // 72 hours
});
browse websites, open a browser, automate a website, navigate to a URL, click buttons, fill forms, and scrape pages using kernel cloud browsers with playwright and CDP.
manage cloud browsers, take screenshots, run playwright scripts, and manage browser profiles using kernel's MCP tools. use when the kernel MCP server is connected.
build browser automation scripts using the kernel python sdk with playwright and remote browser management. use when writing python code that creates browsers, runs playwright, or automates websites with kernel.
build browser automation scripts using the kernel typescript sdk with playwright, CDP, and remote browser management. use when writing typescript code that creates browsers, runs playwright, or automates websites with kernel.
manage authentication for AI agents — log in and stay logged in across websites. use when setting up login credentials, authentication flows, or persistent authenticated sessions with kernel.