readability-extractor
Strip ads, navigation, and sidebars from any article URL or HTML. Use when agents need clean reading content for LLM context, article summarization, or RAG pipelines. Returns title, clean text, excerpt, and word count.
Analyze YouTube channels, track competitor activity, and discover content opportunities. Use when users need to monitor YouTube channels, analyze a niche or category for content gaps, identify viral patterns, or plan content strategy. Handles competitive intelligence, market discovery, and trend analysis for YouTube creators.
Analyze SERP competition for any keyword. Use when you need to check if a keyword is worth targeting. Scores competition across 5 dimensions, identifies authority sites, assesses content depth, and provides go/differentiate/avoid recommendation.
Full-site keyword research with three-phase funnel. Use when you need to find SEO keyword opportunities for a website. Analyzes site topics, expands 30+ candidate keywords, runs SERP competition analysis on top 10, and recommends top 3 keywords with actionable landing page suggestions.
Generate SEO-optimized page structures from keywords. Use when you need to create landing pages, guide pages, comparison pages, or hub pages from trending keywords. Handles search intent classification, template selection, meta tag generation, schema markup, and SERP competitor analysis.
Audit web pages for P0 SEO compliance. Use when developers need to validate server-side metadata, image alt tags, keyword density, FAQ sections, and breadcrumb navigation before publishing. Handles Next.js App Router pages and returns actionable fix recommendations.
Translate text between 50+ languages with auto-detect and batch mode. Use when agents need multilingual content pipelines, global communication, or document translation. Handles Chinese, Japanese, Korean, Arabic, Hindi, and more.
| Field | Value |
|---|---|
| name | Readability Extractor |
| slug | readability-extractor |
| description | Strip ads, navigation, and sidebars from any article URL or HTML. Use when agents need clean reading content for LLM context, article summarization, or RAG pipelines. Returns title, clean text, excerpt, and word count. |
| category | Content |
| tags | ["readability","article","content-extraction","rag","clean-text"] |
| price_per_call | 0 |
| input_schema | {"type":"object","properties":{"url":{"type":"string","description":"URL to fetch and extract content from"},"html":{"type":"string","description":"Raw HTML to extract content from"}},"required":[]} |
| output_schema | {"type":"object","properties":{"title":{"type":"string"},"content":{"type":"string"},"excerpt":{"type":"string"},"wordCount":{"type":"number"}}} |
| metadata | {"requires":{"env":["CLAW0X_API_KEY"]}} |
Extracts clean, readable content from a webpage URL or from raw HTML. It strips ads, navigation, sidebars, scripts, and other noise, then locates the main article content using heuristic scoring based on the article/main tags.
FREE. No charge per call.
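The extraction described above can be illustrated with a minimal sketch. This is not the skill's actual implementation, which is not documented here. It assumes a simplified heuristic: text inside `article` or `main` is preferred, and a hypothetical `NOISE_TAGS` list stands in for the ad, navigation, and sidebar filtering. The names `ArticleParser` and `extract` are illustrative only.

```python
from html.parser import HTMLParser

# Assumed noise list; the real skill's filter rules are not documented.
NOISE_TAGS = {"script", "style", "nav", "aside", "header", "footer", "form", "noscript"}
# Tags treated as the main content region, per the article/main heuristic.
MAIN_TAGS = {"article", "main"}


class ArticleParser(HTMLParser):
    """Collects the page title and text, skipping anything inside noise tags."""

    def __init__(self):
        super().__init__()
        self.noise_depth = 0  # greater than 0 while inside a noise tag
        self.main_depth = 0   # greater than 0 while inside <article> or <main>
        self.in_title = False
        self.title = ""
        self.main_text = []   # text found inside article/main
        self.all_text = []    # fallback: every non-noise text chunk

    def handle_starttag(self, tag, attrs):
        if tag in NOISE_TAGS:
            self.noise_depth += 1
        elif tag in MAIN_TAGS:
            self.main_depth += 1
        elif tag == "title":
            self.in_title = True

    def handle_endtag(self, tag):
        if tag in NOISE_TAGS and self.noise_depth:
            self.noise_depth -= 1
        elif tag in MAIN_TAGS and self.main_depth:
            self.main_depth -= 1
        elif tag == "title":
            self.in_title = False

    def handle_data(self, data):
        text = data.strip()
        if not text:
            return
        if self.in_title:
            self.title += text
            return
        if self.noise_depth:
            return
        self.all_text.append(text)
        if self.main_depth:
            self.main_text.append(text)


def extract(html: str, excerpt_words: int = 30) -> dict:
    """Return a dict shaped like the documented output schema."""
    parser = ArticleParser()
    parser.feed(html)
    parser.close()
    # Prefer article/main text; fall back to all non-noise text.
    content = " ".join(parser.main_text or parser.all_text)
    words = content.split()
    return {
        "title": parser.title,
        "content": content,
        "excerpt": " ".join(words[:excerpt_words]),
        "wordCount": len(words),
    }
```

A production extractor would likely score candidate nodes by text density and link ratio as well; this sketch only shows the tag-based part of the idea.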
Input:
{ "url": "https://example.com/blog/article" }
Output:
{
  "title": "Article Title",
  "content": "Clean article text without ads or navigation...",
  "excerpt": "Clean article text without ads...",
  "wordCount": 450,
  "source": "https://example.com/blog/article"
}
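As a rough sketch of how a client might send the input above, the following Python builds the request body and posts it as JSON. The page does not state the endpoint URL or the authentication header format, so `endpoint` is a caller-supplied parameter and the `Authorization: Bearer` header is an assumption. Only the `CLAW0X_API_KEY` environment variable and the `url`/`html` fields come from the metadata. The function names are hypothetical.

```python
import json
import os
import urllib.request


def build_payload(url=None, html=None):
    """Build the request body; at least one of url or html must be given."""
    if not url and not html:
        # Mirrors the documented 400 case for a missing url/html.
        raise ValueError("provide url or html")
    payload = {}
    if url:
        payload["url"] = url
    if html:
        payload["html"] = html
    return payload


def call_skill(endpoint, payload, api_key=None):
    """POST the payload as JSON and return the decoded response.

    `endpoint` is supplied by the caller because the real URL is not
    documented here; the Bearer scheme below is an assumption.
    """
    api_key = api_key or os.environ["CLAW0X_API_KEY"]
    request = urllib.request.Request(
        endpoint,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
        },
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

Validating the payload locally before sending avoids a round trip that would only return the 400 error.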
| Code | Meaning |
|---|---|
| 400 | Missing url/html, or URL fetch failed |
| 401 | Missing or invalid API key |
| 500 | Extraction failed (not billed) |
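One way a client might act on the codes in this table is sketched below. The meanings are copied from the table. The split into caller-fixable and server-side errors follows the usual HTTP 4xx/5xx convention and is an assumption, not documented behavior of the skill. `describe_error` and `is_caller_error` are hypothetical helper names.

```python
# Meanings copied from the error table.
ERROR_MEANINGS = {
    400: "Missing url/html, or URL fetch failed",
    401: "Missing or invalid API key",
    500: "Extraction failed (not billed)",
}


def describe_error(status: int) -> str:
    """Return the documented meaning, or a generic message for other codes."""
    return ERROR_MEANINGS.get(status, f"Unexpected status {status}")


def is_caller_error(status: int) -> bool:
    """True for 4xx codes, which the caller can usually fix (assumed convention)."""
    return 400 <= status < 500
```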
Claw0x is the native skills layer for AI agents.