Run any Skill in Manus with one click

firecrawl

Use Firecrawl for markdown-first web scraping, crawling, and site mapping. Use when: "firecrawl", "crawl docs", "scrape website to markdown", "map website", "rag crawl".

Run Skill in Manus

Overview

Use Firecrawl for markdown-first web scraping, crawling, and site mapping. Use when: "firecrawl", "crawl docs", "scrape website to markdown", "map website", "rag crawl".

Install command

npx skills add https://github.com/joint-hubs/jointhubs-os --skill firecrawl

Copy and paste this command into Claude Code to install the skill

Source

joint-hubs/jointhubs-os

Stars4

Forks15

UpdatedMay 18, 2026 at 12:37

SKILL.md

readonly

name	firecrawl
description	Use Firecrawl for markdown-first web scraping, crawling, and site mapping. Use when: "firecrawl", "crawl docs", "scrape website to markdown", "map website", "rag crawl".

Firecrawl Skill

Use Firecrawl when the goal is to turn public web pages into LLM-friendly content, especially markdown, without building custom selector logic.

What Firecrawl Is Good For

Crawling documentation sites into markdown
Mapping a site's URL structure before a scrape
Pulling article or docs content for RAG pipelines
Fast content ingestion where DOM-perfect field extraction is not required

What Firecrawl Is Not For

Precise field extraction from repeated cards, tables, or listings
Highly custom browser flows that require selector-by-selector control
Sensitive or regulated data flows without separate legal review

For those cases, prefer the existing local scraper stack or a project-specific Playwright flow.

Repo Entry Point

Firecrawl is wired into the Jointhubs scraper toolkit here:

Second Brain/Projects/jointhubs/projekty/scrapers/firecrawl_cli.py
Second Brain/Projects/jointhubs/projekty/scrapers/firecrawl.md

The CLI auto-loads Second Brain/Projects/jointhubs/projekty/scrapers/.env and reads FIRECRAWL_API_KEY from there if it is not already exported in the shell.

Commands

Scrape One Page

Set-Location "Second Brain/Projects/jointhubs/projekty/scrapers"
python firecrawl_cli.py scrape https://firecrawl.dev --format markdown

Crawl a Site

Set-Location "Second Brain/Projects/jointhubs/projekty/scrapers"
python firecrawl_cli.py crawl https://firecrawl.dev --limit 20 --format markdown --format html

Map a Site

Set-Location "Second Brain/Projects/jointhubs/projekty/scrapers"
python firecrawl_cli.py map https://firecrawl.dev

Environment Setup

pip install firecrawl-py

Add to the scraper-local .env file:

FIRECRAWL_API_KEY=fc-your-key

Decision Rule

Use this tool choice:

Need	Tool
Extract fields via selectors	`BaseScraper`
Use hosted scraping actors	Apify
Crawl docs/pages into markdown	Firecrawl
Discover URLs first	Firecrawl `map`

Output Convention

By default, outputs are written to:

Second Brain/Projects/jointhubs/projekty/scrapers/output/firecrawl/

Results are timestamped JSON snapshots so they can be reused in downstream scripts.

Caveat

Firecrawl is a remote service. Only use it for data you are comfortable sending to a third-party processor under their terms.

Related Skills

agentic-engineering — packaging repeatable agent workflows

More from this repository

same repository

thoughtmap

joint-hubs/jointhubs-os

ThoughtMap pipeline, output interpretation, and MCP vector search tools. Use when: "thoughtmap", "thought clusters", "search thoughts", "what am I thinking about", "semantic search notes", "knowledge base", "vector search", "find context in notes", "cluster distances", "topic map", "thinking patterns".

2026-05-184

json-canvas

joint-hubs/jointhubs-os

Create and edit JSON Canvas files (.canvas) with nodes, edges, groups, and connections. Use when working with .canvas files, creating visual canvases, mind maps, flowcharts, or when the user mentions Canvas files in Obsidian.

2026-04-164

obsidian-bases

joint-hubs/jointhubs-os

Create and edit Obsidian Bases (.base files) with views, filters, formulas, and summaries. Use when working with .base files, creating database-like views of notes, or when the user mentions Bases, table views, card views, filters, or formulas in Obsidian.

2026-04-164

obsidian-cli

joint-hubs/jointhubs-os

Interact with Obsidian vaults using the Obsidian CLI to read, create, search, and manage notes, tasks, and properties. Also supports plugin and theme development. Use when the user asks to interact with their Obsidian vault from the command line, manage notes, search vault content, or develop and debug Obsidian plugins and themes.

2026-04-164

obsidian-markdown

joint-hubs/jointhubs-os

Create and edit Obsidian Flavored Markdown with wikilinks, embeds, callouts, properties, and other Obsidian-specific syntax. Use when working with .md files in Obsidian, or when the user mentions wikilinks, callouts, frontmatter, tags, embeds, or Obsidian notes.

2026-04-164

daily-log

joint-hubs/jointhubs-os

Daily log conventions for agent memory between sessions. Use when: "daily log", "today's note", "what happened today", "session start", "check context".

2026-03-114

Source

joint-hubs

joint-hubs/jointhubs-os

View GitHub Repository View Creator Repositories

Install command

Download

Run Skill in Manus

Useful forSOC

Network and Computer Systems AdministratorsComputer and Mathematical Occupations15-1244L4

name	firecrawl
description	Use Firecrawl for markdown-first web scraping, crawling, and site mapping. Use when: "firecrawl", "crawl docs", "scrape website to markdown", "map website", "rag crawl".

Firecrawl Skill

Use Firecrawl when the goal is to turn public web pages into LLM-friendly content, especially markdown, without building custom selector logic.

What Firecrawl Is Good For

Crawling documentation sites into markdown
Mapping a site's URL structure before a scrape
Pulling article or docs content for RAG pipelines
Fast content ingestion where DOM-perfect field extraction is not required

What Firecrawl Is Not For

Precise field extraction from repeated cards, tables, or listings
Highly custom browser flows that require selector-by-selector control
Sensitive or regulated data flows without separate legal review

For those cases, prefer the existing local scraper stack or a project-specific Playwright flow.

Repo Entry Point

Firecrawl is wired into the Jointhubs scraper toolkit here:

Second Brain/Projects/jointhubs/projekty/scrapers/firecrawl_cli.py
Second Brain/Projects/jointhubs/projekty/scrapers/firecrawl.md

The CLI auto-loads Second Brain/Projects/jointhubs/projekty/scrapers/.env and reads FIRECRAWL_API_KEY from there if it is not already exported in the shell.

Commands

Scrape One Page

Set-Location "Second Brain/Projects/jointhubs/projekty/scrapers"
python firecrawl_cli.py scrape https://firecrawl.dev --format markdown

Crawl a Site

Set-Location "Second Brain/Projects/jointhubs/projekty/scrapers"
python firecrawl_cli.py crawl https://firecrawl.dev --limit 20 --format markdown --format html

Map a Site

Set-Location "Second Brain/Projects/jointhubs/projekty/scrapers"
python firecrawl_cli.py map https://firecrawl.dev

Environment Setup

pip install firecrawl-py

Add to the scraper-local .env file:

FIRECRAWL_API_KEY=fc-your-key

Decision Rule

Use this tool choice:

Need	Tool
Extract fields via selectors	`BaseScraper`
Use hosted scraping actors	Apify
Crawl docs/pages into markdown	Firecrawl
Discover URLs first	Firecrawl `map`

Output Convention

By default, outputs are written to:

Second Brain/Projects/jointhubs/projekty/scrapers/output/firecrawl/

Results are timestamped JSON snapshots so they can be reused in downstream scripts.

Caveat

Firecrawl is a remote service. Only use it for data you are comfortable sending to a third-party processor under their terms.

Related Skills

agentic-engineering — packaging repeatable agent workflows