Exécutez n'importe quel Skill dans Manus
en un clic

Exécutez n'importe quel Skill dans Manus en un clic

osint-research

OSINT recon skill — passive entity discovery for domain / IP / email / person / company / github user. Builds a hybrid findings/dossier/graph report from free public sources (Whois, DNS, crt.sh, Wayback, Shodan InternetDB, GitHub code search, Tavily/Firecrawl/Exa/Perplexity dorks, optional theHarvester/subfinder). No subscription dependencies. Disclaimer is non-removable; secrets are redacted; blocklisted dump sites are filtered at outbound and inbound levels.

Exécuter dans Manus

Aperçu

Commande d'installation

npx skills add https://github.com/hint-shu/deep-research --skill osint-research

Copiez et collez cette commande dans Claude Code pour installer le skill

Source

hint-shu/deep-research

Étoiles2

Forks0

Mis à jour29 avril 2026 à 10:48

Explorateur de fichiers

16 fichiers

SKILL.md

readonly

name	osint-research
description	OSINT recon skill — passive entity discovery for domain / IP / email / person / company / github user. Builds a hybrid findings/dossier/graph report from free public sources (Whois, DNS, crt.sh, Wayback, Shodan InternetDB, GitHub code search, Tavily/Firecrawl/Exa/Perplexity dorks, optional theHarvester/subfinder). No subscription dependencies. Disclaimer is non-removable; secrets are redacted; blocklisted dump sites are filtered at outbound and inbound levels.
user_invocable	true

OSINT Research — Entity Discovery (Specialized Skill)

Parallel to L0–L5 research ladder. Single entity per invocation.

When to Use

User passes a target as the argument:

example.com → domain recon
8.8.8.8 → IP recon
user@example.com → email recon
"John Doe" → person recon
--company "Anthropic" → company recon
github:anthropics → github org/user recon

Goal: hybrid OSINT report (findings → dossier → mermaid graph → raw artifacts → audit trail) in .firecrawl/osint/<slug>/.

Reference spec: docs/superpowers/specs/2026-04-26-osint-research-design.md

Budget

Time: ~10–15 min
Credits: ~30–50 across Tavily/Firecrawl/Exa/Perplexity (comparable to L2)
Money: $0/mo guaranteed (no subscriptions)

Hard Security Rules (do NOT bypass)

These are enforced at helper level too — but Claude must understand and respect them:

Never write a raw channel response to disk. Not to artifacts, not to /tmp/, not anywhere.
Always pipe channel outputs through inbound-filter.sh then secret-redactor.sh before any disk write or further processing.
Never query a blocklisted dump site directly. dorks.sh enforces this; you must not work around it.
Never copy detected secrets into the report or artifacts in plaintext. Type + location + truncated match only.
Never aggregate PII fields into a profile of a private individual. No joining of name + email + home address.

If any rule conflicts with what you think the user wants — stop and ask. Defaults are not overridable in a single invocation.

Pipeline

Classify entity:

ENTITY_TYPE=$(bash skills/osint-research/lib/entity-classifier.sh [--company] "$TARGET")

Build slug: <sanitized_target>-<YYYYMMDD>-<HHMM> (e.g. example-corp-com-20260426-1432). Create .firecrawl/osint/<slug>/.
Run channels in parallel (Phase 1). Each channel emits NDJSON on stdout. For each channel:
- Pipe its stdout through lib/inbound-filter.sh → lib/secret-redactor.sh.
- Append filtered/redacted lines to <slug>/raw/<channel>.ndjson.
- Record channel status (OK / SKIPPED / ERRORED) for the status block.
Bash-callable channels:
- channels/whois-dns.sh (domain/ip/company)
- channels/crtsh.sh (domain/company)
- channels/wayback.sh (domain/company)
- channels/shodan-idb.sh (after DNS resolves an IP)
- channels/github-leaks.sh (all entity types)
- channels/theharvester-wrap.sh (domain/company; skip if missing)
- channels/subfinder-wrap.sh (domain/company; skip if missing)
- channels/dorks.sh (all entity types — emits queries; you feed them to Tavily MCP)
MCP-callable channels (you call directly via tools):
- Tavily — for dork queries from dorks.sh. Pipe each result body through inbound-filter.sh and secret-redactor.sh before persisting.
- Firecrawl — for top-N URLs from Tavily. Before scraping, pass the URL through inbound-filter.sh (single-line NDJSON form). If filter drops it, do not scrape.
- Exa — for related entities (person/company only).
- Perplexity — for context enrichment (person/company only).
Phase 2 dependent channels:
- For each IP from whois-dns DNS resolves → shodan-idb.sh.
- For each top-URL from Tavily → Firecrawl scrape (after inbound filter check).

Findings extraction:

cat <slug>/raw/*.ndjson | bash skills/osint-research/lib/findings-extractor.sh > <slug>/findings.ndjson

Synthesize:
- Read findings.ndjson. Group by priority (CRITICAL/HIGH/MEDIUM/LOW).
- Read <slug>/raw/*.ndjson. Group by record_type → fill dossier sections.
- Build mermaid graph from entities + edges (cap at 30 nodes; rest go to CSVs only).
- Fill templates/osint-report.md.tpl placeholders.
- Write <slug>/osint-report.md, <slug>/graph.mmd, plus <slug>/subdomains.csv, <slug>/emails.csv, <slug>/ips.csv, <slug>/dorks-results.md, <slug>/tech-stack.md, <slug>/sources.md.
Audit trail (sources.md): for every claim in the report, record channel + timestamp. Also record filtered-out URLs (host only, no content) under a Filtered (security policy) heading.
Channel status block in the report header: which channels ran OK, which were SKIPPED (with install tip if optional CLI missing), which ERRORED.

Error Handling

Required channels (whois, dns) failure → abort with clear message.
Recommended channels (crt.sh, Wayback, Shodan-IDB, GitHub, Tavily dorks, Perplexity) failure → mark SKIPPED, continue.
Optional channels (theHarvester, subfinder) missing → silent skip with install tip.
Tavily/Firecrawl/Exa/Perplexity rate limit → exponential backoff x3, then PARTIAL.
Malformed channel response → log only structured metadata (channel/status/size/category/timestamp) to sources.md. Never write raw body anywhere.

Output Format

Hybrid (per spec §5):

1. Disclaimer (non-removable)
2. Findings Summary (CRITICAL → LOW)
3. Entity Dossier (Identity / Infrastructure / People / History / Leaks / Tech Stack / Related)
4. Relationship Graph (mermaid)
5. Raw Artifacts (links to CSVs/MD files)
6. Sources (audit trail with timestamps + Filtered list)

Done Criteria

The skill completes successfully when:

<slug>/osint-report.md exists, contains all required sections.
All channel outputs are stored in <slug>/raw/*.ndjson (filtered + redacted).
<slug>/sources.md has at least one entry per channel that ran.
No plaintext secret appears in any file under <slug>/.
No blocklisted-domain URL or content appears in any file under <slug>/.

Plus depuis ce dépôt

même dépôt

ultra-research

hint-shu/deep-research

Ultra research (L5) — maximum depth. Runs /academic-research (L4) then adds peer-review simulation, recursive exploration until knowledge saturation, full agent crew (7+), and builds a complete knowledge base with executive summary, glossary, timeline, playbooks, counter-arguments, and open questions. 1+ hour, 150+ sources, 10000+ word main report + full vault structure. Auto-syncs to auto-memory. Use when you want to become an expert on a topic.

2026-04-182

quick-research

hint-shu/deep-research

Quick web research for simple questions — fast answer with 3-5 cited sources in ~1 minute. Use for fact-checks, simple lookups, "what is X", "latest version of Y", "who made Z". For anything deeper, use /research (L1) or /deep-research (L2).

2026-04-172

deep-research

hint-shu/deep-research

Advanced deep research (L2) — runs /research (L1) first, then adds reflection loop, contradiction detection, and tree depth 2 for follow-up questions. ~12 min, 20-30 sources, ~2000 word report. Use for serious questions, technology choices, non-trivial investigations.

2026-04-172

expert-research

hint-shu/deep-research

Expert-level research (L3) — runs /deep-research (L2) then adds critic agent, fact-checking pass, multi-perspective search, and human-in-the-loop plan approval. ~20 min, 40-60 sources, 3000+ word report with executive summary. Use for strategic decisions, technology migrations, important investigations.

2026-04-172

academic-research

hint-shu/deep-research

Academic-grade research (L4) — runs /expert-research (L3) then adds academic sources (arXiv, Google Scholar), full multi-agent crew (Planner + 2x Researchers + Critic + Editor), timeline analysis, methodology section, and annotated bibliography with source quality ratings. ~40 min, 80-120 sources, 5000+ word report. Use for scientific overviews, research-grade investigations.

2026-04-172

research

hint-shu/deep-research

Standard deep research (L1) with planner decomposition and per-source summarization. ~5 min, 10-15 sources, structured ~1000 word report. Default choice for "расскажи про X", "как работает Y", "что нового в Z". Use this when /quick-research is too shallow and /deep-research is overkill.

2026-04-172

Source

hint-shu

hint-shu/deep-research

Ouvrir le dépôt GitHub Voir les dépôts du créateur

Commande d'installation

Téléchargement

Exécuter dans Manus

Utile pourSOC

Analystes en sécurité de l'informationProfessions informatiques et mathématiques15-1212L4

name	osint-research
description	OSINT recon skill — passive entity discovery for domain / IP / email / person / company / github user. Builds a hybrid findings/dossier/graph report from free public sources (Whois, DNS, crt.sh, Wayback, Shodan InternetDB, GitHub code search, Tavily/Firecrawl/Exa/Perplexity dorks, optional theHarvester/subfinder). No subscription dependencies. Disclaimer is non-removable; secrets are redacted; blocklisted dump sites are filtered at outbound and inbound levels.
user_invocable	true

OSINT Research — Entity Discovery (Specialized Skill)

Parallel to L0–L5 research ladder. Single entity per invocation.

When to Use

User passes a target as the argument:

example.com → domain recon
8.8.8.8 → IP recon
user@example.com → email recon
"John Doe" → person recon
--company "Anthropic" → company recon
github:anthropics → github org/user recon

Goal: hybrid OSINT report (findings → dossier → mermaid graph → raw artifacts → audit trail) in .firecrawl/osint/<slug>/.

Reference spec: docs/superpowers/specs/2026-04-26-osint-research-design.md

Budget

Time: ~10–15 min
Credits: ~30–50 across Tavily/Firecrawl/Exa/Perplexity (comparable to L2)
Money: $0/mo guaranteed (no subscriptions)

Hard Security Rules (do NOT bypass)

These are enforced at helper level too — but Claude must understand and respect them:

Never write a raw channel response to disk. Not to artifacts, not to /tmp/, not anywhere.
Always pipe channel outputs through inbound-filter.sh then secret-redactor.sh before any disk write or further processing.
Never query a blocklisted dump site directly. dorks.sh enforces this; you must not work around it.
Never copy detected secrets into the report or artifacts in plaintext. Type + location + truncated match only.
Never aggregate PII fields into a profile of a private individual. No joining of name + email + home address.

If any rule conflicts with what you think the user wants — stop and ask. Defaults are not overridable in a single invocation.

Pipeline

Classify entity:

ENTITY_TYPE=$(bash skills/osint-research/lib/entity-classifier.sh [--company] "$TARGET")

Build slug: <sanitized_target>-<YYYYMMDD>-<HHMM> (e.g. example-corp-com-20260426-1432). Create .firecrawl/osint/<slug>/.
Run channels in parallel (Phase 1). Each channel emits NDJSON on stdout. For each channel:
- Pipe its stdout through lib/inbound-filter.sh → lib/secret-redactor.sh.
- Append filtered/redacted lines to <slug>/raw/<channel>.ndjson.
- Record channel status (OK / SKIPPED / ERRORED) for the status block.
Bash-callable channels:
- channels/whois-dns.sh (domain/ip/company)
- channels/crtsh.sh (domain/company)
- channels/wayback.sh (domain/company)
- channels/shodan-idb.sh (after DNS resolves an IP)
- channels/github-leaks.sh (all entity types)
- channels/theharvester-wrap.sh (domain/company; skip if missing)
- channels/subfinder-wrap.sh (domain/company; skip if missing)
- channels/dorks.sh (all entity types — emits queries; you feed them to Tavily MCP)
MCP-callable channels (you call directly via tools):
- Tavily — for dork queries from dorks.sh. Pipe each result body through inbound-filter.sh and secret-redactor.sh before persisting.
- Firecrawl — for top-N URLs from Tavily. Before scraping, pass the URL through inbound-filter.sh (single-line NDJSON form). If filter drops it, do not scrape.
- Exa — for related entities (person/company only).
- Perplexity — for context enrichment (person/company only).
Phase 2 dependent channels:
- For each IP from whois-dns DNS resolves → shodan-idb.sh.
- For each top-URL from Tavily → Firecrawl scrape (after inbound filter check).

Findings extraction:

cat <slug>/raw/*.ndjson | bash skills/osint-research/lib/findings-extractor.sh > <slug>/findings.ndjson

Synthesize:
- Read findings.ndjson. Group by priority (CRITICAL/HIGH/MEDIUM/LOW).
- Read <slug>/raw/*.ndjson. Group by record_type → fill dossier sections.
- Build mermaid graph from entities + edges (cap at 30 nodes; rest go to CSVs only).
- Fill templates/osint-report.md.tpl placeholders.
- Write <slug>/osint-report.md, <slug>/graph.mmd, plus <slug>/subdomains.csv, <slug>/emails.csv, <slug>/ips.csv, <slug>/dorks-results.md, <slug>/tech-stack.md, <slug>/sources.md.
Audit trail (sources.md): for every claim in the report, record channel + timestamp. Also record filtered-out URLs (host only, no content) under a Filtered (security policy) heading.
Channel status block in the report header: which channels ran OK, which were SKIPPED (with install tip if optional CLI missing), which ERRORED.

Error Handling

Required channels (whois, dns) failure → abort with clear message.
Recommended channels (crt.sh, Wayback, Shodan-IDB, GitHub, Tavily dorks, Perplexity) failure → mark SKIPPED, continue.
Optional channels (theHarvester, subfinder) missing → silent skip with install tip.
Tavily/Firecrawl/Exa/Perplexity rate limit → exponential backoff x3, then PARTIAL.
Malformed channel response → log only structured metadata (channel/status/size/category/timestamp) to sources.md. Never write raw body anywhere.

Output Format

Hybrid (per spec §5):

1. Disclaimer (non-removable)
2. Findings Summary (CRITICAL → LOW)
3. Entity Dossier (Identity / Infrastructure / People / History / Leaks / Tech Stack / Related)
4. Relationship Graph (mermaid)
5. Raw Artifacts (links to CSVs/MD files)
6. Sources (audit trail with timestamps + Filtered list)

Done Criteria

The skill completes successfully when:

<slug>/osint-report.md exists, contains all required sections.
All channel outputs are stored in <slug>/raw/*.ndjson (filtered + redacted).
<slug>/sources.md has at least one entry per channel that ran.
No plaintext secret appears in any file under <slug>/.
No blocklisted-domain URL or content appears in any file under <slug>/.