benchmark-due-diligence

Name: Benchmark Due Diligence
Author: daymade

Adversarial due-diligence on a benchmark you envy: a founder, KOL, company, or product whose claimed success you suspect is inflated. Inline four-phase orchestration: fan-out collection, adversarial verification grading every claim L1-L4 to split marketing bubble from real signal, attribution weighting (product vs timing vs IP vs luck, what's replicable), then mapping the validated playbook onto the user's own resources. Use whenever the user wants to 尽调/对标/拆解 (due-diligence / benchmark against / tear down) a competitor or role-model, 抄/偷师 (copy / learn from) someone's playbook, suspects 水分/泡沫 (inflation / bubble) in their claims (Product Hunt

$ git log --oneline --stat

stars:1130

forks:173

updated: May 30, 2026, 11:16

File explorer

5 files

SKILL.md

readonly

related-skills.json

same repository

bigdata-skill.md

from "daymade/claude-code-skills"

Pull Bigdata.com (RavenPack) financial and news data through the official `bigdata-client` SDK and its public `/v1/*` REST endpoints when the Bigdata MCP server returns only pre-synthesized tearsheets but you need the machine-readable substrate underneath. MCP search returns prose chunks (text + relevance only: no per-chunk sentiment, no entity spans); its tearsheets give only aggregate values, not computable time series or per-field JSON. This skill bundles a verified, cost-guarded toolkit over the official REST API: annotated chunk search, entity/ISIN resolution, analyst estimates, calendar/surprise/ratings/targets, financial statements, TTM metrics & ratios, prices, dividends, revenue segments, a daily entity-sentiment series, co-mention graph, screener, and batch search. Use it whenever the user mentions Bigdata.com, RavenPack, a `bd_v2_` key, the bigdata MCP, rp_entity_id, chunk/query_unit cost, or wants structured financials, fundamentals, prices, sentiment, or annotated news.

2026-05-30 · 1.1k

cloudflare-troubleshooting.md

from "daymade/claude-code-skills"

Investigate and resolve Cloudflare configuration issues using API-driven evidence gathering. Use when troubleshooting ERR_TOO_MANY_REDIRECTS, SSL errors, DNS issues, or any Cloudflare-related problems. Focus on systematic investigation using Cloudflare API to examine actual configuration rather than making assumptions.

2026-05-30 · 1.1k

debugging-network-issues.md

from "daymade/claude-code-skills"

Evidence-driven investigation for network, streaming, and protocol-layer bugs. Use when debugging connection resets (ECONNRESET, HTTP/2 RST_STREAM, INTERNAL_ERROR), SSE or long-polling stalls, fixed-time connection drops, CDN/proxy/CGNAT idle timeouts, or any incident where symptoms do not match the obvious cause. Applies falsification-first methodology — layered isolation experiments to pin down the responsible network layer, env-gated runtime instrumentation for non-invasive observation, and counter-review agent teams to challenge single-cause assumptions. Strongly trigger on "socket closed unexpectedly", "stream interrupted", "ECONNRESET", "HTTP/2 INTERNAL_ERROR", "fails after N seconds", "works sometimes but not always", "upstream silent for X seconds", or any scenario where the investigator might jump to conclusions before evidence. Generalizes to any multi-layer system investigation where assumption-first thinking is the failure mode.

2026-05-30 · 1.1k

windows-remote-desktop-connection-doctor.md

from "daymade/claude-code-skills"

Diagnose Windows App (Microsoft Remote Desktop / Azure Virtual Desktop / W365) connection quality issues on macOS. Analyze transport protocol selection (UDP Shortpath vs WebSocket), detect VPN/proxy interference with STUN/TURN negotiation, and parse Windows App logs for Shortpath failures. This skill should be used when VDI connections are slow, when transport shows WebSocket instead of UDP, when RDP Shortpath fails to establish, or when RTT is unexpectedly high.

2026-05-30 · 1.1k

pdf-creator.md

from "daymade/claude-code-skills"

Convert markdown files to professional PDF documents with proper Chinese font support, theme system, and visual self-check. Use whenever the user asks to create PDFs, convert markdown to PDF, generate printable documents, or needs documents formatted for print or mobile reading. This skill MUST be used instead of manual pandoc/Chrome invocations — it handles CJK typography, Chrome header/footer suppression, and mandatory visual verification that manual approaches miss. **Scope: markdown → PDF only.** For Word (.docx) output use `minimax-docx`; this skill does not produce docx and the two pipelines are intentionally orthogonal.

2026-05-30 · 1.1k

skill-creator.md

from "daymade/claude-code-skills"

Create new skills, modify and improve existing skills, and measure skill performance. Use when users want to create a skill from scratch, edit, or optimize an existing skill, run evals to test a skill, benchmark skill performance with variance analysis, or optimize a skill's description for better triggering accuracy.

2026-05-30 · 1.1k

package.json

"author": "daymade"

"repository": "daymade/claude-code-skills"

Useful for (SOC): Market Research Analysts and Marketing Specialists · Business and Financial Operations · 13-1161 · L4

Benchmark Due Diligence

Take a benchmark the user envies — a founder, KOL, company, or product whose success looks suspiciously shiny — and produce a teardown that ends in "what this means for ME", not a neutral report. The deliverable answers three questions a balanced briefing never does: How much of this success is real vs marketing bubble? How much is replicable method vs luck/timing? And what, specifically, can the commissioner do with it?

This is the adversarial, decision-oriented cousin of deep-research. Where deep-research builds a trustworthy picture of the world, this skill assumes the picture is inflated until proven otherwise and converts the survivors into the commissioner's own moves.

CRITICAL: run inline, never `context: fork`

This skill is an orchestrator: it spawns parallel collection and verification agents (via the Workflow tool, or Task agents) and may invoke other skills (deep-research, osint-investigate, qcc). Subagents cannot spawn subagents or call skills, so setting context: fork, which would make this skill run as a subagent itself, would silently break the entire fan-out. Do not add a context field. (osint-investigate documents the same constraint; it is a hard runtime rule, not a preference.)

The one rule that protects the commissioner: two injection channels

Everything the agents see flows through exactly two channels. Keeping them separate is the single most important discipline in this skill:

Channel	Content	Injected into
FACTS	Already-verified public facts about the benchmark (relationships, who-owns-what, the headline claim flagged `⚠️ to-verify`)	Every agent — collection, verification, synthesis
COMMISSIONER_CONTEXT	The commissioner's private reality — real resources, client names, strategic intent, what they can actually leverage	Only the final mapping agent (Phase 4)

Why this split is non-negotiable: collection and verification agents take their input and run external WebSearch on it. If the commissioner's client names or strategy leak into those prompts, they get searched on the open web, which is a privacy breach. The mapping phase genuinely needs "who is the commissioner"; the collection phase must never see it. Encode this in the orchestration itself (see references/workflow_orchestration_template.md) rather than relying on remembering it mid-run.
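One way to encode the split in code rather than in memory is sketched below. This is a minimal illustration, not the template from references/workflow_orchestration_template.md: the `Channels` class, the phase names, and the `assert_no_leak` guard are all hypothetical names introduced here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Channels:
    facts: str                 # FACTS: public, already-verified facts
    commissioner_context: str  # COMMISSIONER_CONTEXT: private reality


# Assumption: phases are identified by plain strings; only the Phase 4
# mapping agent is allowed to see the private channel.
PRIVATE_PHASES = {"mapping"}


def build_prompt(phase: str, task: str, channels: Channels) -> str:
    """Assemble an agent prompt, adding private context only for Phase 4."""
    parts = [f"TASK: {task}", f"FACTS:\n{channels.facts}"]
    if phase in PRIVATE_PHASES:
        parts.append(f"COMMISSIONER_CONTEXT:\n{channels.commissioner_context}")
    return "\n\n".join(parts)


def assert_no_leak(prompt: str, private_terms: list) -> None:
    """Raise if any private term appears in a prompt bound for web search."""
    leaked = [t for t in private_terms if t.lower() in prompt.lower()]
    if leaked:
        raise ValueError(f"private terms leaked into prompt: {leaked}")
```

Calling `assert_no_leak` on every collection and verification prompt before dispatch turns the privacy rule into a hard failure instead of a convention.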

Phase 0 — nail the foundation by evidence, not appearance (do this BEFORE any agent)

The fastest way to waste a 12-agent fan-out is to build it on a foundation you inferred from appearances. Two failure modes recur and both have burned real runs:

Inferring relationships between entities from names/domains. "Their content lives at academy.example.com, and they're the founder, so they must own that community" — when in reality they were just an invited guest. A shared domain, a similar name, or co-occurrence is an observation, not ownership. Verify with an authoritative source before treating any A↔B relationship as fact.
Treating the commissioner's client as the commissioner's asset. If the commissioner does service work for an accelerator/brand, that accelerator is the client's asset — the commissioner can't leverage its audience or capital. Mapping the benchmark's playbook onto resources the commissioner doesn't actually control produces castles in the air.

So before fanning out, establish by evidence (not vibes):

The benchmark's real entity graph — who owns whom, who merely partners/guests. Don't reason from names.
The headline-claim attribution — the benchmark's whole narrative usually rests on one trophy stat ("took product X from 0 → 1M users"). Are they the founder, or the departed growth lead? This is the #1 to-verify target; write it into FACTS with a ⚠️.
What the commissioner truly controls — separate owned assets from client/partner assets.

Write the results into FACTS (public half) and COMMISSIONER_CONTEXT (private half). A shaky foundation makes every downstream agent confidently wrong.
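The "evidence, not vibes" gate can be sketched as a small data structure: a relationship only enters FACTS if it carries an authoritative source, and the headline claim is always written with the ⚠️ flag. The `Relationship` class and `build_facts` helper are hypothetical names for illustration, and the example relation labels are assumptions.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Relationship:
    subject: str
    relation: str                     # e.g. "owns", "guest_of", "partner_of"
    target: str
    source_url: Optional[str] = None  # authoritative source, if one was found

    @property
    def verified(self) -> bool:
        # An observation without a source stays an observation.
        return bool(self.source_url)


def build_facts(
    relationships: List[Relationship], headline_claim: str
) -> Tuple[str, List[Relationship]]:
    """Return the FACTS text plus the relationships still needing evidence."""
    lines = [
        f"{r.subject} {r.relation} {r.target} (source: {r.source_url})"
        for r in relationships
        if r.verified
    ]
    # The headline claim is the #1 to-verify target, so it is always flagged.
    lines.append(f"⚠️ to-verify: {headline_claim}")
    unverified = [r for r in relationships if not r.verified]
    return "\n".join(lines), unverified
```

If the returned `unverified` list is non-empty, that is the signal to keep digging before any agent is launched.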

The four-phase orchestration

Use the Workflow tool (preferred — deterministic fan-out, see the ready-to-fill template in references/workflow_orchestration_template.md) or Task agents. Scale agent count to how thorough the user wants (a few dimensions for a quick read, 6+ with multi-vote verification for a deep audit).

Phase 1 + 2 — collect → verify, per dimension, as a pipeline (each dimension verifies the moment its collection finishes; no global barrier):

Collection agent: objective stance. Every finding carries a source URL and a source_kind: 对象自述/营销 (the subject's own claims or marketing), 第三方独立信源 (independent third-party source), or 混合 (mixed). Anything not found goes in gaps and is never filled by guessing.
Verification agent: adversarial, default-skeptical stance. Grade every claim L1–L4 and rule it 坐实 (confirmed) / 大体可信 (largely credible) / 存疑 (doubtful) / 证伪-水分 (falsified or inflated). The job is to actively hunt for falsifying evidence, especially for the headline claims (the trophy stat, "#1 ranking", funding amount, user counts). bubble_summary names the biggest inflation (水分, literally "water") in that dimension.

Grading rubric, source_kind, verdicts, and both JSON schemas → references/evidence_grading_rubric.md.
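The "no global barrier" pipeline described above can be sketched with asyncio: each dimension chains its own collect and verify steps, and dimensions run concurrently. The `collect` and `verify` coroutines here are stand-ins for real agent calls, and the returned dictionaries are placeholders rather than the schemas in references/evidence_grading_rubric.md.

```python
import asyncio
from typing import Dict, List


async def collect(dimension: str) -> Dict:
    # Placeholder for a collection-agent call (hypothetical).
    await asyncio.sleep(0)
    return {"dimension": dimension, "findings": [], "gaps": []}


async def verify(collected: Dict) -> Dict:
    # Placeholder for a verification-agent call (hypothetical).
    await asyncio.sleep(0)
    return {
        "dimension": collected["dimension"],
        "verdicts": [],
        "bubble_summary": "",
    }


async def run_dimension(dimension: str) -> Dict:
    # Verification starts the moment this dimension's collection finishes,
    # without waiting for the other dimensions.
    collected = await collect(dimension)
    return await verify(collected)


async def run_all(dimensions: List[str]) -> List[Dict]:
    # gather runs every per-dimension pipeline concurrently and returns
    # results in the same order as the input.
    return await asyncio.gather(*(run_dimension(d) for d in dimensions))
```

Phase 3 would then consume the list returned by `asyncio.run(run_all([...]))`.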

Typical dimensions (tailor to the benchmark type — person / company / product):

Subject background + headline-claim attribution (the #1 bubble target)
Corporate base — entity, founding, funding/valuation
Core product/business real metrics — user counts, revenue, rankings, awards, cross-verified against third parties
Playbook teardown — platform matrix, persona, content types, how they borrow other people's audiences, how personal IP funnels to the product
Comparison sample — a structurally-similar peer or parallel path
Sector + how this class of playbook usually wins and usually fails

Phase 3 — synthesis: due-diligence conclusion (single agent, consumes all verdicts):

Real relationship map (correcting the common misreadings from Phase 0)
Bubble-busting table: claim | evidence level | verdict | one-line basis, sorted most-inflated ("most water") first
Playbook teardown — concrete, copyable actions
Attribution breakdown (the core) — what share of the success is product vs market-timing vs personal-IP-marketing vs operations? Give % ranges with reasons, and explicitly split replicable method from luck / timing / non-transferable endowment.
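As a rough sketch of the two Phase 3 artifacts above: the table can be sorted by verdict, and the attribution ranges can be sanity-checked so that they could plausibly sum to 100%. The verdict ordering in `VERDICT_ORDER` is an assumption about what "most water first" means, and the function names and row format are hypothetical.

```python
from typing import Dict, List, Tuple

# Assumed ordering, most inflated first.
VERDICT_ORDER = ["证伪-水分", "存疑", "大体可信", "坐实"]


def bubble_table(rows: List[Dict[str, str]]) -> str:
    """Render claim rows as pipe-separated lines, most inflated first."""
    rank = {verdict: i for i, verdict in enumerate(VERDICT_ORDER)}
    ordered = sorted(rows, key=lambda r: rank[r["verdict"]])
    lines = ["claim | evidence level | verdict | basis"]
    lines += [
        f'{r["claim"]} | {r["level"]} | {r["verdict"]} | {r["basis"]}'
        for r in ordered
    ]
    return "\n".join(lines)


def ranges_are_consistent(ranges: Dict[str, Tuple[int, int]]) -> bool:
    """Check that the % ranges can add up to 100 for some choice of values."""
    low = sum(lo for lo, _ in ranges.values())
    high = sum(hi for _, hi in ranges.values())
    return low <= 100 <= high
```

A breakdown whose lower bounds already exceed 100, or whose upper bounds fall short of it, signals that the shares were estimated independently and need reconciling.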

Phase 4 — synthesis: what this means for the commissioner (single agent; consumes Phase 3 + COMMISSIONER_CONTEXT):

Resource-mapping table — benchmark's playbook elements × the commissioner's real resources; tag each cell ✅ borrow-able / ⚠️ not-replicable (luck/timing) / 🔄 already-doing / 🚫 bubble-don't-copy, one line each
Landing points — exactly how the commissioner uses it (their to-B service / their own IP / their tooling)
Action list + open questions (what's still unconfirmed)

Attribution weighting and the four-tag mapping framework → references/attribution_and_resource_mapping.md.
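The four-tag resource-mapping table lends itself to a small enum, which keeps tags from drifting into free text. This is an illustrative sketch only; the `Tag` enum, the tuple layout, and the helper names are assumptions, not the framework in references/attribution_and_resource_mapping.md.

```python
from enum import Enum
from typing import List, Tuple


class Tag(Enum):
    BORROWABLE = "✅ borrow-able"
    NOT_REPLICABLE = "⚠️ not-replicable (luck/timing)"
    ALREADY_DOING = "🔄 already-doing"
    BUBBLE = "🚫 bubble-don't-copy"


# One cell: (playbook element, commissioner resource, tag, one-line note)
Cell = Tuple[str, str, Tag, str]


def mapping_table(cells: List[Cell]) -> str:
    """Render the mapping as pipe-separated lines, one per cell."""
    header = "playbook element | commissioner resource | tag | note"
    rows = [f"{e} | {r} | {t.value} | {n}" for e, r, t, n in cells]
    return "\n".join([header] + rows)


def borrowable(cells: List[Cell]) -> List[Cell]:
    """Keep only the cells that can feed the action list directly."""
    return [c for c in cells if c[2] is Tag.BORROWABLE]
```

Filtering on `Tag.BORROWABLE` gives a starting point for the action list, while the other three tags document why the remaining elements were set aside.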

Don't rebuild what already exists

This skill's edge is the adversarial bubble-busting + attribution + commissioner-mapping layers. The plumbing underneath is not novel — reuse it:

Fan-out collection / source governance — borrow the lead-agent + subagent pattern from deep-research. (What's unique here is the skeptical verification stance and the L1–L4 bubble grading, not the parallelism.)
Person-subject identity / footprint checks — invoke osint-investigate (ACH hypothesis matrix, Bellingcat-style pivots) rather than re-deriving identity attribution.
Mainland-China corporate registration / funding: invoke the qcc family of skills for 工商 (business-registration) data.
Social-platform playbook data: the agent-reach CLI covers Bilibili (B站), Xiaohongshu (小红书), Douyin (抖音), YouTube, and X.

Read before you run

references/evidence_discipline_traps.md — the recurring traps (inferring relationships from appearances, headline-claim attribution, client-vs-asset, foundation-before-fan-out, grade-don't-binary, privacy leak) with real teardown war-stories. Read this first; it's where runs actually break.
references/evidence_grading_rubric.md — L1–L4, source_kind, verdicts, collection/verification schemas.
references/attribution_and_resource_mapping.md — attribution weighting + four-tag mapping + landing-point framework.
references/workflow_orchestration_template.md — a ready-to-fill Workflow script with the FACTS / COMMISSIONER_CONTEXT injection split already wired in.

Next Step

After the due-diligence conclusion is ready, suggest the natural follow-on (opt-in, never auto-run):

Due-diligence teardown is done.

Options:
A) Render it as a shareable PDF report — pdf-creator (Recommended if this goes to a partner/team)
B) One dimension needs deeper neutral background — deep-research on that sub-topic
C) No thanks — the markdown teardown is enough
