Jeden Skill in Manus ausführen
mit einem Klick

Jeden Skill in Manus mit einem Klick ausführen

$pwd:

data-connector

Name: Data Connector
Author: vana-com

// Build new Vana DataConnect data connectors that export user data from web platforms. Use when asked to "create a connector", "add a connector", "build a connector", "new data connector", "write a connector for X", or when working on the data-connectors repository and the task involves adding or modifying a Playwright-based connector.

In Manus ausführen

$ git log --oneline --stat

stars:15

forks:9

updated:31. März 2026 um 13:00

Datei-Explorer

6 Dateien

SKILL.md

readonly

name	data-connector
description	Build new Vana DataConnect data connectors that export user data from web platforms. Use when asked to "create a connector", "add a connector", "build a connector", "new data connector", "write a connector for X", or when working on the data-connectors repository and the task involves adding or modifying a Playwright-based connector.

Data Connector Builder

Build Playwright-based data connectors for the Vana DataConnect ecosystem. Connectors export a user's personal data from web platforms (LinkedIn, ChatGPT, Spotify, etc.) using browser automation. Credentials never leave the device.

Repository Layout

data-connectors/
├── registry.json                    # Central manifest with checksums
├── test-connector.cjs               # Standalone test runner
├── types/connector.d.ts             # TypeScript type definitions
├── schemas/                         # JSON Schema files (one per scope)
│   └── <platform>.<scope>.json
├── icons/                           # SVG icons for UI
├── scripts/                         # Validation & helper scripts
│   └── validate-connector.cjs
└── connectors/                      # All connector folders live here
    └── <company>/                   # One folder per company
        ├── <name>-playwright.js     # Connector script
        └── <name>-playwright.json   # Metadata

Workflow

Step 1 — Research the target platform

Before writing code, investigate the platform:

Check for REST/JSON APIs accessible from a logged-in browser session. Open DevTools Network tab, browse the platform, look for XHR/fetch calls returning JSON. This is the preferred extraction method.
Check for GraphQL endpoints — many modern platforms use these.
If no API, plan DOM scraping as a fallback. Identify stable selectors (ARIA roles, data attributes, semantic HTML). Never rely on obfuscated CSS class names.
Identify the login flow — what URL, what selectors prove the user is logged in, are there challenges/2FA/captchas to handle.
Define scopes — what data categories to export (e.g., platform.profile, platform.posts).

Step 2 — Create the metadata file

Create connectors/<company>/<name>-playwright.json. See templates/connector-metadata.json.

Required fields: id, version, name, company, description, connectURL, connectSelector, runtime (always "playwright").

The connectSelector is critical — it's how DataConnect detects the user is logged in. Pick a CSS selector only visible post-login (e.g., a feed element, profile avatar, nav item).

Step 3 — Write the connector script

Create connectors/<company>/<name>-playwright.js. See templates/connector-script.js.

All connectors follow the two-phase pattern:

Phase 1 — Login (visible browser)

Check if already logged in via persistent session
If not, call page.showBrowser(loginUrl) so the user can log in manually
Call page.promptUser() to wait until login is complete

Phase 2 — Data collection (headless)

Call page.goHeadless() — browser disappears
Fetch data via API calls, network capture, or DOM scraping
Report progress via page.setProgress()
Build scoped result object and call page.setData('result', result)

For the full page API reference, see PAGE-API.md. For extraction pattern examples, see PATTERNS.md.

Step 4 — Create JSON schemas

Create schemas/<platform>.<scope>.json for each scope. See templates/schema.json.

Step 5 — Update the registry

Generate checksums and add entry to registry.json:

shasum -a 256 connectors/<company>/<name>-playwright.js | awk '{print "sha256:" $1}'
shasum -a 256 connectors/<company>/<name>-playwright.json | awk '{print "sha256:" $1}'

Add to the connectors array in registry.json and update lastUpdated.

Step 6 — Test

node test-connector.cjs ./connectors/<company>/<name>-playwright.js           # headed (visible)
node test-connector.cjs ./connectors/<company>/<name>-playwright.js --headless # headless

Scoped Result Format

The result object uses platform.scope keys. The frontend auto-detects scoped keys (any key containing . that isn't metadata) and POSTs each to POST /v1/data/{scope}.

const result = {
  'platform.profile': { /* profile data */ },
  'platform.posts':   { /* posts data */ },
  exportSummary: { count: 42, label: 'items', details: '1 profile, 41 posts' },
  timestamp: new Date().toISOString(),
  version: '1.0.0-playwright',
  platform: 'platform-name',
};
await page.setData('result', result);

exportSummary is required — the UI displays it. Metadata keys (exportSummary, timestamp, version, platform) are not treated as scopes.

Guidelines

Credentials stay on-device. Never send tokens or passwords to external servers.
Prefer API fetch over DOM scraping. APIs are more stable.
Avoid CSS class names for selectors — platforms obfuscate them. Use structural selectors, ARIA roles, data attributes, semantic HTML.
Use page.setProgress() for long exports so users see what's happening.
Handle errors gracefully. Use page.setData('error', message) with clear messages.
Rate-limit API calls. Add page.sleep() between requests to avoid 429s.
Test pagination edge cases — empty results, single page, large datasets.
All page.evaluate() calls take a JS string, not a function. Variables from the connector scope must be interpolated via JSON.stringify().

Reference Connectors

Connector	Pattern	Best example of
`connectors/linkedin/linkedin-playwright.js`	REST API	API fetch with CSRF, parallel calls, clean error handling
`connectors/openai/chatgpt-playwright.js`	REST API + Network capture	Auth token extraction, parallel pagination, hybrid approach
`connectors/github/github-playwright.js`	DOM scraping	Structural selectors, pagination, text parsing
`connectors/meta/instagram-playwright.js`	Network capture	`captureNetwork()` usage, GraphQL interception
`connectors/spotify/spotify-playwright.js`	GraphQL + custom auth	Complex auth (TOTP), dynamic query hashes

related-skills.json

gleiches Repository

vana-connect.md

from "vana-com/data-connectors"

Connect personal data from any web platform using browser automation. Use when: (1) user wants to connect a data source like ChatGPT, Instagram, Spotify, or any platform, (2) user says "connect my [platform]", (3) user wants to generate or update their profile from connected data. Also triggers on: "create a connector for [platform]".

2026-04-1415

auto-create-connector.md

from "vana-com/data-connectors"

Autonomously create, test, and validate a data connector for any web platform — end to end. Use when asked to "auto-create a connector", "automatically build a connector", or when a connector needs to be created from scratch and tested without manual guidance. Triggers on: "auto-create", "auto connector", "create and test connector", "build connector end to end", "generate connector".

2026-03-3115

auto-create-connector.md

from "vana-com/data-connectors"

2026-03-3115

package.json

"author": "vana-com"

"repository": "vana-com/data-connectors"

GitHub-Repository öffnen Creator-Repositorys ansehen

$ install --global

$ download --local

In Manus ausführen

$ useful --forSOC

SoftwareentwicklerInformatik- und Mathematikberufe15-1252L4

name	data-connector
description	Build new Vana DataConnect data connectors that export user data from web platforms. Use when asked to "create a connector", "add a connector", "build a connector", "new data connector", "write a connector for X", or when working on the data-connectors repository and the task involves adding or modifying a Playwright-based connector.

Data Connector Builder

Repository Layout

data-connectors/
├── registry.json                    # Central manifest with checksums
├── test-connector.cjs               # Standalone test runner
├── types/connector.d.ts             # TypeScript type definitions
├── schemas/                         # JSON Schema files (one per scope)
│   └── <platform>.<scope>.json
├── icons/                           # SVG icons for UI
├── scripts/                         # Validation & helper scripts
│   └── validate-connector.cjs
└── connectors/                      # All connector folders live here
    └── <company>/                   # One folder per company
        ├── <name>-playwright.js     # Connector script
        └── <name>-playwright.json   # Metadata

Workflow

Step 1 — Research the target platform

Before writing code, investigate the platform:

Check for REST/JSON APIs accessible from a logged-in browser session. Open DevTools Network tab, browse the platform, look for XHR/fetch calls returning JSON. This is the preferred extraction method.
Check for GraphQL endpoints — many modern platforms use these.
If no API, plan DOM scraping as a fallback. Identify stable selectors (ARIA roles, data attributes, semantic HTML). Never rely on obfuscated CSS class names.
Identify the login flow — what URL, what selectors prove the user is logged in, are there challenges/2FA/captchas to handle.
Define scopes — what data categories to export (e.g., platform.profile, platform.posts).

Step 2 — Create the metadata file

Create connectors/<company>/<name>-playwright.json. See templates/connector-metadata.json.

Required fields: id, version, name, company, description, connectURL, connectSelector, runtime (always "playwright").

The connectSelector is critical — it's how DataConnect detects the user is logged in. Pick a CSS selector only visible post-login (e.g., a feed element, profile avatar, nav item).

Step 3 — Write the connector script

Create connectors/<company>/<name>-playwright.js. See templates/connector-script.js.

All connectors follow the two-phase pattern:

Phase 1 — Login (visible browser)

Check if already logged in via persistent session
If not, call page.showBrowser(loginUrl) so the user can log in manually
Call page.promptUser() to wait until login is complete

Phase 2 — Data collection (headless)

Call page.goHeadless() — browser disappears
Fetch data via API calls, network capture, or DOM scraping
Report progress via page.setProgress()
Build scoped result object and call page.setData('result', result)

For the full page API reference, see PAGE-API.md. For extraction pattern examples, see PATTERNS.md.

Step 4 — Create JSON schemas

Create schemas/<platform>.<scope>.json for each scope. See templates/schema.json.

Step 5 — Update the registry

Generate checksums and add entry to registry.json:

shasum -a 256 connectors/<company>/<name>-playwright.js | awk '{print "sha256:" $1}'
shasum -a 256 connectors/<company>/<name>-playwright.json | awk '{print "sha256:" $1}'

Add to the connectors array in registry.json and update lastUpdated.

Step 6 — Test

node test-connector.cjs ./connectors/<company>/<name>-playwright.js           # headed (visible)
node test-connector.cjs ./connectors/<company>/<name>-playwright.js --headless # headless

Scoped Result Format

The result object uses platform.scope keys. The frontend auto-detects scoped keys (any key containing . that isn't metadata) and POSTs each to POST /v1/data/{scope}.

const result = {
  'platform.profile': { /* profile data */ },
  'platform.posts':   { /* posts data */ },
  exportSummary: { count: 42, label: 'items', details: '1 profile, 41 posts' },
  timestamp: new Date().toISOString(),
  version: '1.0.0-playwright',
  platform: 'platform-name',
};
await page.setData('result', result);

exportSummary is required — the UI displays it. Metadata keys (exportSummary, timestamp, version, platform) are not treated as scopes.

Guidelines

Credentials stay on-device. Never send tokens or passwords to external servers.
Prefer API fetch over DOM scraping. APIs are more stable.
Avoid CSS class names for selectors — platforms obfuscate them. Use structural selectors, ARIA roles, data attributes, semantic HTML.
Use page.setProgress() for long exports so users see what's happening.
Handle errors gracefully. Use page.setData('error', message) with clear messages.
Rate-limit API calls. Add page.sleep() between requests to avoid 429s.
Test pagination edge cases — empty results, single page, large datasets.
All page.evaluate() calls take a JS string, not a function. Variables from the connector scope must be interpolated via JSON.stringify().

Reference Connectors

Connector	Pattern	Best example of
`connectors/linkedin/linkedin-playwright.js`	REST API	API fetch with CSRF, parallel calls, clean error handling
`connectors/openai/chatgpt-playwright.js`	REST API + Network capture	Auth token extraction, parallel pagination, hybrid approach
`connectors/github/github-playwright.js`	DOM scraping	Structural selectors, pagination, text parsing
`connectors/meta/instagram-playwright.js`	Network capture	`captureNetwork()` usage, GraphQL interception
`connectors/spotify/spotify-playwright.js`	GraphQL + custom auth	Complex auth (TOTP), dynamic query hashes

data-connector

Data Connector Builder

Repository Layout

Workflow

Step 1 — Research the target platform

Step 2 — Create the metadata file

Step 3 — Write the connector script

Step 4 — Create JSON schemas

Step 5 — Update the registry

Step 6 — Test

Scoped Result Format

Guidelines

Reference Connectors

Mehr aus diesem Repository

Mehr aus diesem Repository

Data Connector Builder

Repository Layout

Workflow

Step 1 — Research the target platform

Step 2 — Create the metadata file

Step 3 — Write the connector script

Step 4 — Create JSON schemas

Step 5 — Update the registry

Step 6 — Test

Scoped Result Format

Guidelines

Reference Connectors