readability-extractor

Name: Readability Extractor
Author: kennyzir

Strip ads, navigation, and sidebars from any article URL or HTML. Use when agents need clean reading content for LLM context, article summarization, or RAG pipelines. Returns title, clean text, excerpt, and word count.

stars:1

forks:0

updated:2026-03-31 01:23

SKILL.md

name	Readability Extractor
slug	readability-extractor
description	Strip ads, navigation, and sidebars from any article URL or HTML. Use when agents need clean reading content for LLM context, article summarization, or RAG pipelines. Returns title, clean text, excerpt, and word count.
category	Content
tags	["readability","article","content-extraction","rag","clean-text"]
price_per_call	0
input_schema	{"type":"object","properties":{"url":{"type":"string","description":"URL to fetch and extract content from"},"html":{"type":"string","description":"Raw HTML to extract content from"}},"required":[]}
output_schema	{"type":"object","properties":{"title":{"type":"string"},"content":{"type":"string"},"excerpt":{"type":"string"},"wordCount":{"type":"number"}}}
metadata	{"requires":{"env":["CLAW0X_API_KEY"]}}

Readability Extractor

Extract clean, readable content from any webpage or HTML. Strips ads, navigation, sidebars, scripts, and noise. Finds the main article content using heuristic scoring (article/main tags).
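
The description above mentions preferring article/main tags and stripping noise. A minimal sketch of that idea, using only Python's standard `html.parser`, might look like the following. It is not the skill's actual implementation: the `NOISE_TAGS` set, the class name `MainTextParser`, and the fallback to all non-noise text are assumptions made for illustration.

```python
from html.parser import HTMLParser

# Tags whose contents are treated as noise (an assumption for this sketch).
NOISE_TAGS = {"script", "style", "nav", "aside", "header", "footer", "form"}
# Containers preferred as the main content, per the description above.
MAIN_TAGS = {"article", "main"}


class MainTextParser(HTMLParser):
    """Collects text and tracks whether it sits inside article/main or a noise tag."""

    def __init__(self):
        super().__init__()
        self.noise_depth = 0   # >0 while inside any noise tag
        self.main_depth = 0    # >0 while inside <article> or <main>
        self.main_text = []    # text found inside article/main
        self.all_text = []     # all non-noise text, used as a fallback

    def handle_starttag(self, tag, attrs):
        if tag in NOISE_TAGS:
            self.noise_depth += 1
        elif tag in MAIN_TAGS:
            self.main_depth += 1

    def handle_endtag(self, tag):
        if tag in NOISE_TAGS and self.noise_depth > 0:
            self.noise_depth -= 1
        elif tag in MAIN_TAGS and self.main_depth > 0:
            self.main_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if not text or self.noise_depth > 0:
            return  # skip whitespace and anything inside a noise tag
        self.all_text.append(text)
        if self.main_depth > 0:
            self.main_text.append(text)


def extract_main_text(html: str) -> str:
    """Return article/main text if present, otherwise all non-noise text."""
    parser = MainTextParser()
    parser.feed(html)
    parser.close()
    chunks = parser.main_text or parser.all_text
    return " ".join(chunks)
```

A real extractor would likely score candidate blocks by text density and link ratio as well; this sketch only shows the tag-preference step.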

Use Cases

LLM context preparation (clean text for prompts)
Article summarization preprocessing
RAG pipeline content extraction
Content archiving (strip noise, keep substance)

Prerequisites

Sign up at claw0x.com
Create API key in Dashboard

Pricing

FREE. No charge per call.

Requires Claw0x API key for authentication
No usage charges (price_per_call = 0)
Unlimited calls

Example

Input:

{ "url": "https://example.com/blog/article" }

Output:

{
  "title": "Article Title",
  "content": "Clean article text without ads or navigation...",
  "excerpt": "Clean article text without ads...",
  "wordCount": 450,
  "source": "https://example.com/blog/article"
}
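
As a rough illustration of how a client might prepare the input above and read the output, here is a sketch in Python. The endpoint URL and the `Bearer` authorization header are assumptions: the source names neither, so `ENDPOINT` is a placeholder and must be replaced with the real address from the Claw0x dashboard or docs. The helper names (`build_payload`, `build_request`, `parse_result`) are hypothetical. Note that `input_schema` lists no required fields, but the 400 error implies at least one of `url` or `html` must be present, so the sketch checks for that locally.

```python
import json
import urllib.request

# Hypothetical placeholder: the source does not document the real endpoint.
ENDPOINT = "https://api.example.invalid/readability-extractor"


def build_payload(url=None, html=None):
    """Return the input object; at least one of url/html must be given."""
    payload = {}
    if url:
        payload["url"] = url
    if html:
        payload["html"] = html
    if not payload:
        raise ValueError("provide 'url' or 'html'")
    return payload


def build_request(payload, api_key):
    """Build (but do not send) a POST request. The Bearer scheme is an assumption."""
    return urllib.request.Request(
        ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def parse_result(body: str) -> dict:
    """Keep the fields named in output_schema; missing ones become None."""
    data = json.loads(body)
    return {key: data.get(key) for key in ("title", "content", "excerpt", "wordCount")}
```

In use, the key would typically come from the `CLAW0X_API_KEY` environment variable, and the request would be sent with `urllib.request.urlopen(build_request(payload, key))` once the real endpoint is known.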

Error Codes

Code	Meaning
400	Missing url/html, or URL fetch failed
401	Missing or invalid API key
500	Extraction failed (not billed)
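
One way a caller might act on the codes in this table is sketched below. The meanings are copied from the table; the `retryable` flags and the function name `describe_status` are assumptions for illustration, not documented behavior of the service.

```python
# Meanings are taken from the error table; the retry flags are assumptions.
ERROR_TABLE = {
    400: ("Missing url/html, or URL fetch failed", False),
    401: ("Missing or invalid API key", False),
    500: ("Extraction failed (not billed)", True),
}


def describe_status(status: int) -> dict:
    """Map an HTTP status code to a meaning and a suggested retry decision."""
    if 200 <= status < 300:
        return {"ok": True, "meaning": "Success", "retryable": False}
    meaning, retryable = ERROR_TABLE.get(status, ("Undocumented status code", False))
    return {"ok": False, "meaning": meaning, "retryable": retryable}
```

Treating only 500 as retryable reflects that a 400 or 401 will not succeed until the input or the API key is corrected.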

About Claw0x

Claw0x is the native skills layer for AI agents.

GitHub: github.com/kennyzir/readability-extractor

related-skills.json

Same repository

youtube-intel.md

from "kennyzir/Claw0X_skills"

Analyze YouTube channels, track competitor activity, and discover content opportunities. Use when users need to monitor YouTube channels, analyze a niche or category for content gaps, identify viral patterns, or plan content strategy. Handles competitive intelligence, market discovery, and trend analysis for YouTube creators.

2026-04-27 · 1

keyword-competition-analyzer.md

from "kennyzir/Claw0X_skills"

Analyze SERP competition for any keyword. Use when you need to check if a keyword is worth targeting. Scores competition across 5 dimensions, identifies authority sites, assesses content depth, and provides go/differentiate/avoid recommendation.

2026-04-11 · 1

keyword-research.md

from "kennyzir/Claw0X_skills"

Full-site keyword research with three-phase funnel. Use when you need to find SEO keyword opportunities for a website. Analyzes site topics, expands 30+ candidate keywords, runs SERP competition analysis on top 10, and recommends top 3 keywords with actionable landing page suggestions.

2026-04-11 · 1

seo-page-generator.md

from "kennyzir/Claw0X_skills"

Generate SEO-optimized page structures from keywords. Use when you need to create landing pages, guide pages, comparison pages, or hub pages from trending keywords. Handles search intent classification, template selection, meta tag generation, schema markup, and SERP competitor analysis.

2026-04-11 · 1

seo-audit.md

from "kennyzir/Claw0X_skills"

Audit web pages for P0 SEO compliance. Use when developers need to validate server-side metadata, image alt tags, keyword density, FAQ sections, and breadcrumb navigation before publishing. Handles Next.js App Router pages and returns actionable fix recommendations.

2026-04-02 · 1

translation-pro.md

from "kennyzir/Claw0X_skills"

Translate text between 50+ languages with auto-detect and batch mode. Use when agents need multilingual content pipelines, global communication, or document translation. Handles Chinese, Japanese, Korean, Arabic, Hindi, and more.

2026-03-31 · 1

package.json

"author": "kennyzir"

"repository": "kennyzir/Claw0X_skills"

Useful for SOC: Software Developers (Computer and Mathematical Occupations), 15-1252, L4
