一键在 Manus 中运行任何 Skill

$pwd:

read

Name: Read
Author: tw93

// Fetches URLs and PDFs as clean Markdown for reading, quoting, citation, and downstream work, including paywalls, JS-heavy pages, X/Twitter, and Chinese platforms. Use when users ask 看这个链接/读一下/抓取网页/read this/check this URL/fetch this page. Not for local text files already in the repo.

在 Manus 中运行

$ git log --oneline --stat

stars:5,013

forks:301

updated:2026年5月22日 14:41

文件资源管理器

7 个文件

SKILL.md

readonly

name	read
description	Fetches URLs and PDFs as clean Markdown for reading, quoting, citation, and downstream work, including paywalls, JS-heavy pages, X/Twitter, and Chinese platforms. Use when users ask 看这个链接/读一下/抓取网页/read this/check this URL/fetch this page. Not for local text files already in the repo.
when_to_use	any URL or PDF to fetch, 看这个链接, 读一下, 看看这个网页, 抓取网页, read this, check this URL, fetch this page
dispatch_intent	Any URL or PDF to fetch, read this, fetch this page

Read: Fetch Any URL or PDF as Markdown

Prefix your first line with 🥷 inline, not as its own paragraph.

Convert any URL or local PDF to clean Markdown. No analysis, no summary, no discussion of the content unless explicitly asked after the fetch.

Routing

Input	Method
`feishu.cn`, `larksuite.com`	Feishu API script
`mp.weixin.qq.com`	Proxy cascade first, built-in WeChat article script only if the proxies fail
`.pdf` URL or local PDF path	PDF extraction
GitHub URLs (`github.com`, `raw.githubusercontent.com`)	Prefer raw content or `gh` first. Use the proxy cascade only as fallback.
`x.com`, `twitter.com`	Proxy cascade (r.jina.ai keeps image URLs). Do not try WebFetch; it 402s.
Everything else	Proxy cascade

After routing, load references/read-methods.md and run the commands for the chosen method.

Privacy and Fetch Tiers

scripts/fetch.sh is privacy-first. The cascade depends on whether the user opts into proxy services.

Default (fetch.sh URL): local extractor only. The URL never leaves the machine. Best quality requires pip install --user readability-lxml html2text; without those, falls back to a stdlib HTML stripper (works but messier output).
Opt-in (fetch.sh --use-proxy URL): local first, then defuddle.md, then r.jina.ai. Those third-party services receive the URL and may cache or log it. Reserve --use-proxy for JS-heavy pages (X/Twitter), paywalls, or anything the local extractor cannot reach.

Every tier emits a structured stderr line: [fetch] tier=<name> status=<ok|fail> reason="...". Read the stderr if a fetch fails; it names the specific tier and reason.

Hard rule: do not pass authenticated, internal, or otherwise sensitive URLs to --use-proxy. Default mode is safe; proxy mode is not.

Output Format

Title:  {title}
Author: {author} (if available)
Source: {platform}
URL:    {original url}

Content
{full Markdown, truncated at 200 lines if long}

Saving

Default: display only. Show the converted Markdown inline. Do not create a file.

Save to ~/Downloads/{title}.md with YAML frontmatter when any of these are true:

User explicitly asks: "save", "download", "保存", "下载", "keep this"
Called from within /learn (Phase 1 expects a file to move)
User says "save" or "保存" after seeing the output (use conversation content, do not re-fetch)

When saving:

If the file already exists, append -1, -2, etc. Never overwrite without confirmation.
Tell the user the saved path.

When not saving:

Do not mention that a file was not saved. Just show the content.

Images

By default only save Markdown. Download images only when the user explicitly asks: "download images", "save images", "带图", "下载图片", or similar.

When asked, after saving the Markdown:

Extract image URLs: grep -oE 'https?://[^ )"]+\.(jpg|jpeg|png|webp|gif)' {md_path} | sort -u
Create ~/Downloads/{title}-images/ and curl each URL in parallel (& + wait). Use the same proxy env vars as the fetch step.
Report the count and folder path. If any download fails, list the failed URLs.

Hard Rules

Do not summarize or analyze the content. Your job is conversion and storage, not interpretation.
Never overwrite without confirmation. If the target filename already exists, use an auto-incremented suffix.
Stop after the save report. Do not suggest follow-up actions ("Would you like me to summarize?", "Next, you could...") unless the user asks.

Gotchas

What happened	Rule
Fetched a paywalled article and returned a login page as Markdown	Inspect the first 10 lines for paywall signals ("Subscribe", "Sign in", "Continue reading"). If found, stop and warn the user. Do not save the login page.
User said "read this" but meant "summarize and act on it"	Deliver the Markdown first, then ask what to do next. Do not save unless asked.
URL returned empty page or paywall with no content	Report the failure clearly: what was tried, what failed. Do not fabricate or guess the content.
Local extractor returned a few lines of menu junk	Install `readability-lxml` + `html2text` (`pip install --user readability-lxml html2text`) for a real article extractor.
Default fetch failed and the page is clearly public	Re-run with `--use-proxy` to send the URL through defuddle.md / r.jina.ai. Only do this for public, non-sensitive URLs.
Network failures	Prepend local proxy env vars if available and retry once.
Long content	Preview with `head -n 200` first; mention truncation when reporting the save.
Local fallback tools returned JSON	Extract the Markdown-bearing field. Raw JSON is not a valid final output for `/read`.
All methods failed	Stop and tell the user what was tried and what failed. Suggest opening the URL in a browser or providing an alternative. Do not silently return empty or partial results.

Content Extraction for Restyling

Activate when: "extract content", "reformat this document", or user hands over a document to restyle

Extract and tag:

Headings: H1/H2/H3 hierarchy
Body paragraphs: Plain text, no styling
Lists: Bullet vs numbered, nesting level
Metrics/data: Numbers, dates, quantifiable claims
Images/diagrams: Descriptions, captions

Output: Clean, tagged content ready to feed into kami or other typesetting tools.

related-skills.json

同仓库

check.md

from "tw93/Waza"

Reviews code diffs, PRs, issue queues, release readiness, commits, pushes, publishing, and project audits. Use when users ask review/看看代码/合并前/看看issue/PR/release/push or to implement an approved plan, with safety gates for dirty and untracked worktrees. Not for exploring ideas, debugging root causes, or prose review.

2026-05-225.0k

design.md

from "tw93/Waza"

Produces distinctive, production-grade UI for pages, components, visual interfaces, typography, and screenshot-driven polish. Use when users ask 设计/做页面/做组件/UI/前端/截图 or say a screen is ugly, unclear, inconsistent, or visually wrong. Not for backend logic or data pipelines.

2026-05-225.0k

health.md

from "tw93/Waza"

Runs a budget-aware Agent Health audit for Codex, Claude Code, agent instructions, hooks/MCP, verifier surfaces, and AI maintainability. Use when users ask 检查claude/检查codex/配置检查/健康度 or report agents ignoring instructions, missing validation, or code becoming hard to maintain. Not for debugging code or reviewing PRs.

2026-05-225.0k

hunt.md

from "tw93/Waza"

Finds root cause before applying fixes for errors, crashes, regressions, failing tests, broken behavior, and screenshot-reported defects. Use when users ask 排查/报错/崩溃/不工作/回归/判断为什么报错, or say something used to work and now fails. Not for code review or new features.

2026-05-225.0k

learn.md

from "tw93/Waza"

Runs a six-phase research workflow that turns unfamiliar domains, source bundles, or collected material into publish-ready output. Use when users ask 学习一下/深入研究/研究一下/整理成文章/deep dive/compile sources or need one coherent reference from many inputs. Not for quick lookups or single-file reads.

2026-05-225.0k

think.md

from "tw93/Waza"

Turns rough ideas into approved, decision-complete plans with validated structure before coding. Use when users ask 出方案/给方案/深入分析/怎么设计/有没有必要/值不值得/plan this/how should I/should we keep this for features, architecture, or value judgments. Not for bug fixes or small edits.

2026-05-225.0k

package.json

"author": "tw93"

"repository": "tw93/Waza"

打开 GitHub 仓库查看创作者相关仓库

$ install --global

$ download --local

在 Manus 中运行

$ useful --forSOC

软件开发工程师计算机与数学类职业15-1252L4

name	read
description	Fetches URLs and PDFs as clean Markdown for reading, quoting, citation, and downstream work, including paywalls, JS-heavy pages, X/Twitter, and Chinese platforms. Use when users ask 看这个链接/读一下/抓取网页/read this/check this URL/fetch this page. Not for local text files already in the repo.
when_to_use	any URL or PDF to fetch, 看这个链接, 读一下, 看看这个网页, 抓取网页, read this, check this URL, fetch this page
dispatch_intent	Any URL or PDF to fetch, read this, fetch this page

Read: Fetch Any URL or PDF as Markdown

Prefix your first line with 🥷 inline, not as its own paragraph.

Convert any URL or local PDF to clean Markdown. No analysis, no summary, no discussion of the content unless explicitly asked after the fetch.

Routing

Input	Method
`feishu.cn`, `larksuite.com`	Feishu API script
`mp.weixin.qq.com`	Proxy cascade first, built-in WeChat article script only if the proxies fail
`.pdf` URL or local PDF path	PDF extraction
GitHub URLs (`github.com`, `raw.githubusercontent.com`)	Prefer raw content or `gh` first. Use the proxy cascade only as fallback.
`x.com`, `twitter.com`	Proxy cascade (r.jina.ai keeps image URLs). Do not try WebFetch; it 402s.
Everything else	Proxy cascade

After routing, load references/read-methods.md and run the commands for the chosen method.

Privacy and Fetch Tiers

scripts/fetch.sh is privacy-first. The cascade depends on whether the user opts into proxy services.

Default (fetch.sh URL): local extractor only. The URL never leaves the machine. Best quality requires pip install --user readability-lxml html2text; without those, falls back to a stdlib HTML stripper (works but messier output).
Opt-in (fetch.sh --use-proxy URL): local first, then defuddle.md, then r.jina.ai. Those third-party services receive the URL and may cache or log it. Reserve --use-proxy for JS-heavy pages (X/Twitter), paywalls, or anything the local extractor cannot reach.

Every tier emits a structured stderr line: [fetch] tier=<name> status=<ok|fail> reason="...". Read the stderr if a fetch fails; it names the specific tier and reason.

Hard rule: do not pass authenticated, internal, or otherwise sensitive URLs to --use-proxy. Default mode is safe; proxy mode is not.

Output Format

Title:  {title}
Author: {author} (if available)
Source: {platform}
URL:    {original url}

Content
{full Markdown, truncated at 200 lines if long}

Saving

Default: display only. Show the converted Markdown inline. Do not create a file.

Save to ~/Downloads/{title}.md with YAML frontmatter when any of these are true:

User explicitly asks: "save", "download", "保存", "下载", "keep this"
Called from within /learn (Phase 1 expects a file to move)
User says "save" or "保存" after seeing the output (use conversation content, do not re-fetch)

When saving:

If the file already exists, append -1, -2, etc. Never overwrite without confirmation.
Tell the user the saved path.

When not saving:

Do not mention that a file was not saved. Just show the content.

Images

By default only save Markdown. Download images only when the user explicitly asks: "download images", "save images", "带图", "下载图片", or similar.

When asked, after saving the Markdown:

Extract image URLs: grep -oE 'https?://[^ )"]+\.(jpg|jpeg|png|webp|gif)' {md_path} | sort -u
Create ~/Downloads/{title}-images/ and curl each URL in parallel (& + wait). Use the same proxy env vars as the fetch step.
Report the count and folder path. If any download fails, list the failed URLs.

Hard Rules

Do not summarize or analyze the content. Your job is conversion and storage, not interpretation.
Never overwrite without confirmation. If the target filename already exists, use an auto-incremented suffix.
Stop after the save report. Do not suggest follow-up actions ("Would you like me to summarize?", "Next, you could...") unless the user asks.

Gotchas

What happened	Rule
Fetched a paywalled article and returned a login page as Markdown	Inspect the first 10 lines for paywall signals ("Subscribe", "Sign in", "Continue reading"). If found, stop and warn the user. Do not save the login page.
User said "read this" but meant "summarize and act on it"	Deliver the Markdown first, then ask what to do next. Do not save unless asked.
URL returned empty page or paywall with no content	Report the failure clearly: what was tried, what failed. Do not fabricate or guess the content.
Local extractor returned a few lines of menu junk	Install `readability-lxml` + `html2text` (`pip install --user readability-lxml html2text`) for a real article extractor.
Default fetch failed and the page is clearly public	Re-run with `--use-proxy` to send the URL through defuddle.md / r.jina.ai. Only do this for public, non-sensitive URLs.
Network failures	Prepend local proxy env vars if available and retry once.
Long content	Preview with `head -n 200` first; mention truncation when reporting the save.
Local fallback tools returned JSON	Extract the Markdown-bearing field. Raw JSON is not a valid final output for `/read`.
All methods failed	Stop and tell the user what was tried and what failed. Suggest opening the URL in a browser or providing an alternative. Do not silently return empty or partial results.

Content Extraction for Restyling

Activate when: "extract content", "reformat this document", or user hands over a document to restyle

Extract and tag:

Headings: H1/H2/H3 hierarchy
Body paragraphs: Plain text, no styling
Lists: Bullet vs numbered, nesting level
Metrics/data: Numbers, dates, quantifiable claims
Images/diagrams: Descriptions, captions

Output: Clean, tagged content ready to feed into kami or other typesetting tools.

read

Read: Fetch Any URL or PDF as Markdown

Routing

Privacy and Fetch Tiers

Output Format

Saving

Images

Hard Rules

Gotchas

Content Extraction for Restyling

同仓库更多 Skills

同仓库更多 Skills

Read: Fetch Any URL or PDF as Markdown

Routing

Privacy and Fetch Tiers

Output Format

Saving

Images

Hard Rules

Gotchas

Content Extraction for Restyling