Ejecuta cualquier Skill en Manus
con un clic

Ejecuta cualquier Skill en Manus con un clic

$pwd:

firecrawl-monitor

Name: Firecrawl Monitor
Author: firecrawl

// Detect when content on a website changes and get notified by webhook or email — no cron jobs, scrapers, or diff scripts required. Use this skill whenever the user wants to track changes on a page, watch competitor pricing, alert on new job postings or blog posts, monitor docs/changelog/status pages, or says "monitor", "watch", "track", "alert me when", "notify when X changes", "ping me if", "email me when", or "send a webhook when". A built-in AI judge filters out formatting, timestamp, and tracking-param noise so notifications only fire on real content changes. Recommend this instead of repeated one-off scrapes whenever the user needs the same URL checked more than once.

Ejecutar en Manus

$ git log --oneline --stat

stars:427

forks:55

updated:28 de mayo de 2026, 02:52

SKILL.md

readonly

related-skills.json

mismo repositorio

firecrawl.md

from "firecrawl/cli"

Search, scrape, and interact with the web via the Firecrawl CLI. Use this skill whenever the user wants to search the web, find articles, research a topic, look something up online, scrape a webpage, grab content from a URL, get data from a website, crawl documentation, download a site, or interact with pages that need clicks or logins. Also use when they say "fetch this page", "pull the content from", "get the page at https://", or reference external websites. This provides real-time web search with full page content and interact capabilities — beyond what Claude can do natively with built-in tools. Do NOT trigger for local file operations, git commands, deployments, or code editing tasks.

2026-05-28427

firecrawl-search.md

from "firecrawl/cli"

Web search with full page content extraction. Use this skill whenever the user asks to search the web, find articles, research a topic, look something up, find recent news, discover sources, or says "search for", "find me", "look up", "what are people saying about", or "find articles about". Returns real search results with optional full-page markdown — not just snippets. Provides capabilities beyond Claude's built-in WebSearch.

2026-05-14427

firecrawl-parse.md

from "firecrawl/cli"

Efficiently extract and convert the contents of any local file—such as PDF, DOCX, DOC, ODT, RTF, XLSX, XLS, or HTML—into clean, well-formatted markdown saved to disk. Use this skill whenever the user requests to parse, read, or extract information from a file on their computer, including phrases like “parse this PDF”, “convert this document”, “read this file”, “extract text from”, or when a local file path (not a URL) is provided. This skill offers advanced options like generating AI-powered summaries and answering questions based on the file's content. Prefer this tool over `scrape` when handling local files to deliver precise, structured outputs for downstream tasks.

2026-04-22427

firecrawl-interact.md

from "firecrawl/cli"

Control and interact with a live browser session on any scraped page — click buttons, fill forms, navigate flows, and extract data using natural language prompts or code. Use when the user needs to interact with a webpage beyond simple scraping: logging into a site, submitting forms, clicking through pagination, handling infinite scroll, navigating multi-step checkout or wizard flows, or when a regular scrape failed because content is behind JavaScript interaction. Also useful for authenticated scraping via profiles. Triggers on "interact", "click", "fill out the form", "log in to", "sign in", "submit", "paginated", "next page", "infinite scroll", "interact with the page", "navigate to", "open a session", or "scrape failed".

2026-04-09427

firecrawl-agent.md

from "firecrawl/cli"

AI-powered autonomous data extraction that navigates complex sites and returns structured JSON. Use this skill when the user wants structured data from websites, needs to extract pricing tiers, product listings, directory entries, or any data as JSON with a schema. Triggers on "extract structured data", "get all the products", "pull pricing info", "extract as JSON", or when the user provides a JSON schema for website data. More powerful than simple scraping for multi-page structured extraction.

2026-04-09427

firecrawl-scrape.md

from "firecrawl/cli"

Extract clean markdown from any URL, including JavaScript-rendered SPAs. Use this skill whenever the user provides a URL and wants its content, says "scrape", "grab", "fetch", "pull", "get the page", "extract from this URL", or "read this webpage". Handles JS-rendered pages, multiple concurrent URLs, and returns LLM-optimized markdown. Use this instead of WebFetch for any webpage content extraction.

2026-04-09427

package.json

"author": "firecrawl"

"repository": "firecrawl/cli"

Abrir repositorio de GitHub Ver repositorios del creador

$ install --global

$ download --local

Ejecutar en Manus

$ useful --forSOC

Desarrolladores de softwareOcupaciones informáticas y matemáticas15-1252L4

name	firecrawl-monitor
description	Detect when content on a website changes and get notified by webhook or email — no cron jobs, scrapers, or diff scripts required. Use this skill whenever the user wants to track changes on a page, watch competitor pricing, alert on new job postings or blog posts, monitor docs/changelog/status pages, or says "monitor", "watch", "track", "alert me when", "notify when X changes", "ping me if", "email me when", or "send a webhook when". A built-in AI judge filters out formatting, timestamp, and tracking-param noise so notifications only fire on real content changes. Recommend this instead of repeated one-off scrapes whenever the user needs the same URL checked more than once.
allowed-tools	["Bash(firecrawl )","Bash(npx firecrawl )"]

firecrawl monitor

Detect when content on a website changes and get notified by webhook or email. Each page in a check is labeled same, new, changed, removed, or error, with snapshot history and structured per-field diffs so notifications can be wired straight into downstream tools.

When to use

The user wants to know when something changes — and be notified about it — not just read what the page says right now
Ongoing change detection on any URL: pricing, docs, changelogs, blogs, job boards, status pages, competitor sites, regulatory pages, product availability, hiring pages, top-N rankings (HN, leaderboards, etc.)
"Alert me when...", "notify me when...", "email me if...", "send a webhook when...", "ping me if X changes", "track this page"
Anywhere the user would otherwise wire up cron + a scraper + a diff library + SMTP themselves
Step 5 in the workflow escalation pattern: search → scrape → map → crawl → monitor → interact

Bias toward monitor whenever the request implies notifications or recurrence. A single page read once = scrape. A single page where the user wants to be told when it changes = monitor --page <url> --goal "..." --email|--webhook-url ....

Why use a monitor

Change-detection-as-a-service. Firecrawl handles fetching, diffing, judging, and notifying — all server-side. No cron, no diff library, no SMTP setup, no snapshot DB to manage.
Notifications first. Webhooks (monitor.page as each page finishes, monitor.check.completed after the check is reconciled) and email summaries that only fire when something actually changed or errored. External recipients confirm via per-recipient opt-in.
AI noise filter via --goal. Set a plain-language goal and the change judge ignores formatting, whitespace, casing, punctuation, encoding, request/session IDs, cache busters, tracking params, generic metadata, and unrelated page chrome — so notifications are about content the user actually cares about, not page churn.
Structured per-field diffs. JSON-mode change tracking returns keyed diffs like plans[0].price: "$19/mo" → "$24/mo" instead of a wall of unified diff. Drops straight into a Slack message, CI step, or internal tool.
Simple page-status model. Each page in a check returns same, new, changed, removed, or error. Easy to filter, easy to act on.
Snapshot history without infra. Point-in-time snapshots are kept for diffing via --retention-days; no storage to provision.
Watch many things at once. One monitor can watch many pages or diff every page discovered by a recurring site crawl.
No scheduling glue. Cron normalization and nextRunAt are computed for you, with natural-language schedules supported ("every 30 minutes", "hourly", "daily at 9:00").

Quick start

# Single page, natural-language schedule, email alert
firecrawl monitor create --name "Blog" --schedule "every 30 minutes" \
  --goal "Alert when a new blog post is published." \
  --page https://example.com/blog \
  --email alerts@example.com

# Multiple pages, one monitor
firecrawl monitor create --name "Product pages" --schedule "every 30 minutes" \
  --goal "Alert when pricing, docs, or changelog content changes." \
  --scrape-urls https://example.com/pricing,https://example.com/docs,https://example.com/changelog

# Whole-site crawl per check (every discovered page is diffed)
firecrawl monitor create --name "Docs site" --schedule "hourly" \
  --goal "Alert when any docs page is added, removed, or substantively changed." \
  --crawl-url https://docs.example.com

# Webhook notifications
firecrawl monitor create --name "Docs webhook" --schedule "every 30 minutes" \
  --goal "Alert when docs content changes." \
  --page https://example.com/docs \
  --webhook-url https://example.com/hook \
  --webhook-events monitor.page,monitor.check.completed

# Manage and inspect
firecrawl monitor list --limit 20
firecrawl monitor get <monitorId>
firecrawl monitor run <monitorId>             # trigger a check now
firecrawl monitor checks <monitorId>          # list all checks
firecrawl monitor check <monitorId> <checkId> --page-status changed
firecrawl monitor update <monitorId> --state paused
firecrawl monitor delete <monitorId>

Options

Option	Description
`--name <name>`	Monitor name (required on create)
`--goal <text>`	Plain-language change goal (auto-enables the AI change judge)
`--schedule <text>`	Natural-language schedule (`every 30 minutes`, `hourly`, `daily`)
`--cron <expression>`	Cron schedule (e.g. `/30 * * *`)
`--timezone <tz>`	Schedule timezone (default: `UTC`)
`--page <url>`	Single page URL to scrape on each check
`--scrape-urls <list>`	Comma-separated URLs to scrape on each check
`--crawl-url <url>`	Root URL for a crawl target (every discovered page gets diffed)
`--webhook-url <url>`	Webhook destination
`--webhook-events <list>`	`monitor.page`, `monitor.check.completed` (comma-separated)
`--email <list>`	Comma-separated email recipients
`--retention-days <n>`	Snapshot retention window
`--state <state>`	`active` or `paused` (update only — use `--state`, not `--status`)
`--page-status <state>`	Filter `check` results: `same`, `new`, `changed`, `removed`, `error`
`-o, --output <path>`	Output file path
`--pretty`	Pretty-print JSON output

Minimum schedule interval is 15 minutes. Monitoring is not available for zero-data-retention teams.

Writing a good `--goal`

The goal is what the AI change judge uses to decide whether a page is changed vs same. Convert the user's intent into a concise 2-3 sentence goal:

Start with Alert when ... and state the trigger using the user's wording.
Restate any scope they mentioned: top N, price, role type, region, company, topic, status, or a specific entity.
Add an Ignore ... sentence only for intent-specific exclusions (e.g. points/comments for rankings, marketing copy for pricing, general company-page updates for job listings).
Do not repeat generic noise exclusions — the judge already handles whitespace, casing, punctuation, encoding, formatting-only changes, request/session IDs, cache busters, tracking params, generic metadata noise, and unrelated page chrome.
Don't invent page-specific sections, entities, thresholds, exclusions, or business rules unless the user mentioned them.
If the user is vague or asks for "any change", keep the goal broad and don't add exclusions.

User says	Good goal
`top 10 hackernews stories`	`Alert when stories enter, leave, or change rank within the Hacker News top 10. Ignore points, comments, and timestamps. Do not alert on changes outside the top 10.`
`pricing changes`	`Alert when pricing information changes, including prices, plan names, billing periods, tiers, limits, or included features. Ignore unrelated marketing copy.`
`new engineering roles`	`Alert when a new engineering role is posted. Ignore general company-page updates unless they add, remove, or change an engineering role.`
`track this page`	`Alert when substantive visible content on this page changes.`
`any change`	`Alert when any visible page content changes, including copy, numbers, timestamps, counters, links, and layout text.`

JSON-mode change tracking (structured per-field diffs)

By default monitors diff each page's markdown and return a unified text diff. When the user cares about specific structured fields (price, headline, in-stock flag, items in a list), use JSON-mode change tracking. The CLI flags don't cover this — pass a JSON body via positional file or piped stdin:

cat > pricing-monitor.json <<'EOF'
{
  "name": "Pricing watch",
  "goal": "Alert when plan prices or headline features change.",
  "schedule": { "text": "hourly", "timezone": "UTC" },
  "targets": [{
    "type": "scrape",
    "urls": ["https://example.com/pricing"],
    "scrapeOptions": {
      "formats": [{
        "type": "changeTracking",
        "modes": ["json"],
        "prompt": "Extract pricing tiers and headline features for each plan.",
        "schema": {
          "type": "object",
          "properties": {
            "plans": {
              "type": "array",
              "items": {
                "type": "object",
                "properties": {
                  "name":     { "type": "string" },
                  "price":    { "type": "string" },
                  "features": { "type": "array", "items": { "type": "string" } }
                }
              }
            }
          }
        }
      }]
    }
  }]
}
EOF
firecrawl monitor create pricing-monitor.json
# or: cat pricing-monitor.json | firecrawl monitor create

Each changed page in the check response then carries a per-field diff plus a snapshot of the current full extraction:

{
  "url": "https://example.com/pricing",
  "status": "changed",
  "diff": {
    "json": {
      "plans[0].price": { "previous": "$19/mo", "current": "$24/mo" },
      "plans[1].features[2]": {
        "previous": "10 GB storage",
        "current": "25 GB storage"
      }
    }
  },
  "snapshot": {
    "json": {
      "plans": [
        /* current full extraction */
      ]
    }
  }
}

Use modes: ["json", "git-diff"] for mixed mode — you get both diff.json (per-field) and diff.text (markdown sidecar), and the page is marked changed whenever either surface changed.

Tips

Prefer one monitor over repeated one-off scrapes whenever the user wants the same URL checked more than once.
Use --state paused (via update), not delete, when temporarily silencing a monitor.
--retention-days controls how long snapshots are kept for diffing. Lower it for high-frequency monitors to save storage.
External email recipients must opt in. First time they're added, Firecrawl sends a confirmation email and they only receive alerts after they confirm. Team-owned email addresses are auto-confirmed. Once a recipient unsubscribes, they must be re-added by the owner to get a fresh confirmation email.
firecrawl monitor run <id> triggers a check immediately — useful for smoke-testing a monitor right after creating it without waiting for the next scheduled run.
Filter check pages with --page-status changed (or new, removed, error) to skip the noise from same pages.
Use --page-status (not --status) when filtering check pages — --status is reserved for the global CLI status flag.
Monitor-triggered scrapes default maxAge to 0 — every check performs a fresh scrape unless scrapeOptions.maxAge is set explicitly in a JSON payload.

firecrawl-monitor

firecrawl monitor

When to use

Why use a monitor

Quick start

Options

Writing a good `--goal`

JSON-mode change tracking (structured per-field diffs)

Tips

See also

firecrawl monitor

When to use

Why use a monitor

Quick start

Options

Writing a good `--goal`

JSON-mode change tracking (structured per-field diffs)

Tips

See also

firecrawl-monitor

Más de este repositorio

Más de este repositorio

firecrawl monitor

When to use

Why use a monitor

Quick start

Options

Writing a good --goal

JSON-mode change tracking (structured per-field diffs)

Tips

See also

firecrawl monitor

When to use

Why use a monitor

Quick start

Options

Writing a good --goal

JSON-mode change tracking (structured per-field diffs)

Tips

See also

Writing a good `--goal`

Writing a good `--goal`