Jeden Skill in Manus ausführen
mit einem Klick

Jeden Skill in Manus mit einem Klick ausführen

Loslegen

kernel-python-sdk

Sterne7

Forks1

Aktualisiert11. Februar 2026 um 17:48

Build browser automation scripts using the Kernel Python SDK with Playwright and remote browser management.

Installation

Mit Codex oder Claude installieren Kopieren Sie diesen Prompt, fügen Sie ihn in Codex, Claude oder einen anderen Assistant ein und lassen Sie die Skill-Seite prüfen und installieren.

In Manus ausführen

Quelle

kernel

kernel/skills

GitHub-Repository öffnen Creator-Repositorys ansehen

Download

In Manus ausführen

Verwandte BerufeSOC

Basierend auf der SOC-Berufsklassifikation

SoftwareentwicklerInformatik- und Mathematikberufe·SOC 15-1252

Datei-Explorer

2 Dateien

SKILL.md

readonly

name	kernel-python-sdk
description	Build browser automation scripts using the Kernel Python SDK with Playwright and remote browser management.
context	fork

When to Use This Skill

Use the Kernel Python SDK when you need to:

Build browser automation scripts - Create Python programs that control remote browsers
Execute server-side automation - Run Playwright code directly in the browser VM without local dependencies
Manage browser sessions programmatically - Create, configure, and control browsers from code
Build scalable scraping/testing tools - Use browser pools and profiles for high-volume automation
Deploy automation as actions - Package scripts as Kernel actions for invocation via API

When NOT to use:

For CLI commands (e.g., kernel browsers create), use the kernel-cli skill instead
For quick one-off tasks, the CLI may be simpler than writing code

Core Concepts

SDK Architecture

The SDK is organized into resource-based modules:

kernel.browsers - Browser session management (create, list, delete)
kernel.browsers.playwright - Server-side Playwright execution
kernel.browsers.computer - OS-level controls (mouse, keyboard, screenshots)
kernel.browser_pools - Pre-warmed browser pool management
kernel.profiles - Persistent browser profiles (auth state)
kernel.auth.connections - Managed auth (create, login, submit, follow, retrieve, delete)
kernel.credential_providers - External credential providers (1Password)
kernel.proxies - Proxy configuration
kernel.extensions - Chrome extension management
kernel.deployments - App deployment
kernel.invocations - Action invocation

Two Automation Approaches

1. Server-side Execution (RECOMMENDED)

Execute Playwright code directly in browser VM using kernel.browsers.playwright.execute(session_id, code="...")
session_id must be passed as a positional argument (first parameter), not as id= keyword
Response accessed via response.result - MUST use return in code to get data back
Best for: Most use cases, production automation, parallel execution, actions

2. CDP Connection (Client-side)

Connect Playwright to browser via CDP WebSocket URL
Code runs locally, browser runs remotely; requires local Playwright installation
Best for: Complex debugging, specific local development needs

Patterns Reference

Import Patterns

Standard: from kernel import Kernel
For actions: import kernel and from kernel import Kernel
For typed payloads: from typing import TypedDict
For CDP: from playwright.async_api import async_playwright

SDK Initialization

client = Kernel() reads KERNEL_API_KEY from environment automatically

Action Handler Pattern

from typing import TypedDict
from kernel import Kernel

app = kernel.App("app-name")

class TaskInput(TypedDict):
    task: str

@app.action("action-name")
async def my_action(ctx: kernel.KernelContext, input_data: TaskInput):
    # Access input: input_data["task"] or input_data.get("task")
    ...

CDP Connection Pattern (Client-side)

async with async_playwright() as playwright:
    browser = await playwright.chromium.connect_over_cdp(kernel_browser.cdp_ws_url)
    context = browser.contexts[0] if browser.contexts else await browser.new_context()
    page = context.pages[0] if context.pages else await context.new_page()

Binary Data Handling

Binary data (screenshots, PDFs) returns as Node.js Buffer: {'data': [byte_array], 'type': 'Buffer'}

# Follow canonical pattern above, then:
if response.success and response.result:
    data = bytes(response.result['data'])
    with open("output.png", "wb") as f:
        f.write(data)

Installation

uv pip install kernel or pip install kernel
For CDP: uv pip install playwright

References

Kernel Documentation: https://www.kernel.sh/docs
API Reference: https://www.kernel.sh/docs/api-reference/
Templates: https://www.kernel.sh/docs/reference/cli/create#available-templates
Quickstart Guide: https://www.kernel.sh/docs/quickstart
Examples: examples

Mehr aus diesem Repository

gleiches Repository

debug-browser-session

kernel/skills

Systematically debug a Kernel cloud browser session — VM issues, network errors, Chrome crashes, page-load failures, and live-view problems. Use when a browser session misbehaves (e.g. ERR_HTTP2_PROTOCOL_ERROR, "browser not responding", blank/error pages, captcha or "checking your browser" blocks, live view not loading) and you have the session ID. Drives the Kernel CLI to inspect session status, screenshots, page state, VM logs, and network connectivity.

2026-06-267

generate-video

kernel/skills

Generate crisp, perfectly smooth MP4 videos from a web page or animated visualization by driving headless Chromium over the Chrome DevTools Protocol (CDP) with deterministic frame-stepping, then encoding with ffmpeg and (if remote) sharing via a cloudflared tunnel. Use when asked to make/generate a demo video, explainer clip, launch/marketing animation, social teaser, or any short rendered video from a web scene — especially animated stat/timeline/diagram visualizations — and when you need to iterate on it (tweak, re-render, compare). Solves the common "the recording has hitches / judder / dropped frames" problem.

2026-06-137

kernel-browser-harness

kernel/skills

Best practices for using browser-use's open-source browser-harness with Kernel cloud browsers over CDP. Use when driving a Kernel browser from browser-harness, extracting a CDP URL from `kernel browsers create`, or running multi-step or parallel harness sessions against Kernel.

2026-06-087

kernel-cli

kernel/skills

Complete guide to Kernel CLI - cloud browser platform with automation, deployment, and management

2026-05-067

diff-profile-archives

kernel/skills

Compare two Kernel profile archives to investigate behavioral differences. Use when an issue (e.g. login failure, captcha, broken automation, vendor mismatch) reproduces on one profile but not another, or when a "good" vs "bad" profile needs to be diffed. Takes two profile IDs/names plus an issue description, downloads both archives via the Kernel CLI, and surfaces differences in cookies, storage, preferences, extensions, and login state that could explain the issue.

2026-04-287

profile-website-bot-detection

kernel/skills

Profile a website for bot detection vendors using stealth vs non-stealth Kernel browsers. Use when analyzing bot detection on a website, comparing stealth effectiveness, identifying anti-bot vendors and products, or detecting challenge types.

2026-03-037

from typing import TypedDict from kernel import Kernel app = kernel.App("app-name") class TaskInput(TypedDict): task: str @app.action("action-name") async def my_action(ctx: kernel.KernelContext, input_data: TaskInput): # Access input: input_data["task"] or input_data.get("task") ...

async with async_playwright() as playwright: browser = await playwright.chromium.connect_over_cdp(kernel_browser.cdp_ws_url) context = browser.contexts[0] if browser.contexts else await browser.new_context() page = context.pages[0] if context.pages else await context.new_page()