
defuddle

Name: Defuddle
Author: AgriciDaniel

Strip clutter from web pages before ingesting into the wiki. Removes ads, navigation, headers, footers, and boilerplate, leaving clean, readable markdown that saves 40-60% of tokens. Triggers on: defuddle, clean this page, strip this url, fetch and clean, clean web content before ingesting, strip ads, remove clutter, clean URL content, readable markdown from URL.


stars: 5,252

forks: 599

updated: April 10, 2026, 13:28

SKILL.md


name	defuddle
description	Strip clutter from web pages before ingesting into the wiki. Removes ads, navigation, headers, footers, and boilerplate: leaving clean readable markdown that saves 40-60% tokens. Triggers on: defuddle, clean this page, strip this url, fetch and clean, clean web content before ingesting, strip ads, remove clutter, clean URL content, readable markdown from URL.
allowed-tools	Read Bash

defuddle: Web Page Cleaner

Defuddle extracts the meaningful content from a web page and drops everything else: ads, cookie banners, nav bars, related articles, footers, social sharing buttons. What remains is the article body as clean markdown.

Use this before any URL ingestion. It is optional but strongly recommended. It cuts token usage by 40-60% on typical web articles and produces cleaner wiki pages.

Substrate note (v1.7+): Unlike obsidian-markdown / obsidian-bases / json-canvas (where we defer to kepano/obsidian-skills as upstream), the defuddle skill is original to claude-obsidian — kepano's marketplace does not ship a defuddle skill. This is the canonical version. The underlying defuddle-cli is independent of either marketplace and lives at github.com/kepano/defuddle.

Install

npm install -g defuddle-cli

Verify: defuddle --version

Usage

Clean a URL directly

defuddle "https://example.com/article"

Outputs clean markdown to stdout.

Save to .raw/

defuddle "https://example.com/article" > ".raw/articles/article-slug-$(date +%Y-%m-%d).md"
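The `article-slug` part of the filename has to come from somewhere. A minimal sketch of one way to derive it from the URL follows. The helper names `slug_from_url` and `raw_path_for` are hypothetical, not part of defuddle-cli or this skill, and the slug rule (last path segment, lowercased, non-alphanumerics collapsed to hyphens) is an assumption.

```shell
# Hypothetical helper: derive a filename slug from the last path segment of a URL.
slug_from_url() {
  local url="$1"
  local path="${url%%[?#]*}"   # drop the query string and fragment
  path="${path%/}"             # drop one trailing slash
  local last="${path##*/}"     # keep only the last path segment
  # Lowercase, collapse runs of non-alphanumerics into one hyphen, trim edge hyphens.
  printf '%s\n' "$last" \
    | tr '[:upper:]' '[:lower:]' \
    | sed -e 's/[^a-z0-9]\{1,\}/-/g' -e 's/^-//' -e 's/-$//'
}

# Hypothetical helper: build the dated .raw/ path in the <slug>-<date>.md shape.
# The second argument is optional and defaults to today's date.
raw_path_for() {
  local url="$1" day="${2:-$(date +%Y-%m-%d)}"
  printf '.raw/articles/%s-%s.md\n' "$(slug_from_url "$url")" "$day"
}
```

With these defined, `defuddle "$URL" > "$(raw_path_for "$URL")"` would write to a path in the same shape as the command above, provided `.raw/articles/` already exists.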

Add frontmatter header after saving

To record the source URL and fetch date, write the frontmatter and the defuddle output to the file in a single pass:

SLUG="article-slug-$(date +%Y-%m-%d)"
{ echo "---"; echo "source_url: https://example.com/article"; echo "fetched: $(date +%Y-%m-%d)"; echo "---"; echo ""; defuddle "https://example.com/article"; } > ".raw/articles/$SLUG.md"
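The same step can be wrapped in a function so the URL is written once and every expansion is quoted. This is a sketch under assumptions: `frontmatter_for` and `save_clean` are hypothetical names, and the choice to abort before writing when extraction fails is an addition that the original one-liner does not make.

```shell
# Hypothetical helper: print the frontmatter block for a fetched page.
# The second argument is optional and defaults to today's date.
frontmatter_for() {
  local url="$1" day="${2:-$(date +%Y-%m-%d)}"
  printf '%s\n' '---' "source_url: $url" "fetched: $day" '---' ''
}

# Hypothetical helper: fetch, clean, and save a page with frontmatter.
# Usage: save_clean <url> <slug>
save_clean() {
  local url="$1" slug="$2" out body
  out=".raw/articles/${slug}-$(date +%Y-%m-%d).md"
  mkdir -p .raw/articles || return 1
  # Capture the output first so a failed extraction does not leave
  # a file that contains only frontmatter.
  body="$(defuddle "$url")" || return 1
  { frontmatter_for "$url"; printf '%s\n' "$body"; } > "$out"
}
```

Usage would look like `save_clean "https://example.com/article" "article-slug"`.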

Clean a local HTML file

defuddle page.html

When to Use

Use defuddle when:

Ingesting a news article, blog post, or documentation page from a URL
The page has a lot of surrounding content (most web pages do)
You want to stay within token budget on a long article

Skip defuddle when:

The source is already a clean markdown or PDF file
The page is a dashboard, app, or structured data (defuddle expects article-style content)
defuddle is not installed and the article is short enough to process raw
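The file-type part of these rules can be sketched as a small classifier. The function name `defuddle_decision` is hypothetical, and matching on the extension is an assumption: it cannot tell a dashboard or app from an article, so that judgment still has to be made by looking at the page.

```shell
# Hypothetical classifier: prints "skip" or "clean" for a source path or URL.
# Order matters: the markdown/PDF check runs before the URL check, so a URL
# that points at a PDF is still skipped.
defuddle_decision() {
  case "$1" in
    *.md|*.markdown|*.pdf)           echo "skip"  ;;  # already clean markdown or PDF
    http://*|https://*|*.html|*.htm) echo "clean" ;;  # web page or local HTML file
    *)                               echo "skip"  ;;  # unknown source: leave as-is
  esac
}
```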

Fallback

To check whether defuddle is installed:

which defuddle 2>/dev/null || echo "not installed"

If it is not installed, use WebFetch directly. The content will be less clean but still workable.
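The check can also be written with `command -v`, the POSIX way to test whether a command is available, which sets an exit status that a script can branch on. In this sketch the function name `choose_extractor` is hypothetical, and the string `webfetch` is only a label for the WebFetch fallback, not a real shell command.

```shell
# Prints the extractor to use: the given command if it is on PATH,
# otherwise the label "webfetch". The command name defaults to defuddle.
choose_extractor() {
  local cmd="${1:-defuddle}"
  if command -v "$cmd" >/dev/null 2>&1; then
    echo "$cmd"
  else
    echo "webfetch"   # placeholder label for the WebFetch fallback
  fi
}
```

Taking the command name as an optional argument keeps the branch easy to exercise on a machine where defuddle is not installed.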

Integration with /wiki-ingest

The /wiki-ingest skill checks for defuddle automatically when a URL is passed. You do not need to run defuddle manually before ingesting a URL. The ingest skill will call it if available.

To manually clean a page and save before ingesting:

Run the save command above
Then: ingest .raw/articles/[slug].md

How to think (10-principle mapping)

When working on this skill, apply the 10-principle loop. See skills/think/SKILL.md for the canonical framework.

#	Principle	Application here
1	OBSERVE (ext)	Which URL? What's actually on the page? Don't assume the title matches the content.
2	OBSERVE (int)	Am I assuming the page has the content the user expects? Verify before extracting.
3	LISTEN	Did the user say "the article" (main content only) or "the link" (everything visible)?
4	THINK	Strip boilerplate, preserve structure, capture metadata. Quote URLs in shell to avoid injection.
5	CONNECT (lat)	How does this domain typically render? Some sites mangle defuddle's heuristics; track those.
6	CONNECT (sys)	Shells out to defuddle-cli (kepano); output lands in `.raw/` for wiki-ingest pickup.
7	FEEL	Clean markdown that reads like the original, not boilerplate residue.
8	ACCEPT	Some pages don't extract well. Flag and move on; don't force when the heuristic loses.
9	CREATE	Markdown to stdout, redirected to `.raw/articles/<slug>-<date>.md`.
10	GROW	Extraction failures suggest defuddle-cli upgrade or alternative extractor — track them as backlog.
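Principle 4 asks for URLs to be quoted in the shell. Quoting stops word splitting, but a value that starts with a dash could still be read as an option, so a small guard in front of the call may help. This is a sketch: the name `is_http_url` and the exact acceptance rule (http or https scheme, no whitespace) are assumptions, not part of the skill.

```shell
# Hypothetical guard: succeeds only for http(s) URLs without whitespace.
is_http_url() {
  case "$1" in
    *[[:space:]]*)        return 1 ;;  # reject embedded spaces, tabs, newlines
    http://?*|https://?*) return 0 ;;  # scheme plus at least one more character
    *)                    return 1 ;;  # anything else, including leading dashes
  esac
}

# Usage: only call the extractor on input that passes the guard,
# and always pass the URL as a single quoted argument.
clean_if_safe() {
  if is_http_url "$1"; then
    defuddle "$1"
  else
    echo "refusing to fetch: not a plain http(s) URL" >&2
    return 1
  fi
}
```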

package.json

"author": "AgriciDaniel"

"repository": "AgriciDaniel/claude-obsidian"
