一键在 Manus 中运行任何 Skill

fact-check

Lightweight single-claim adversarial verdict — supported / refuted / inconclusive with cited evidence. Use when one factual claim needs checking mid-conversation — the host agent gathers a little evidence and runs the same adversarial quorum as deep-research, returning a verdict (not a report), using the host's own WebSearch/WebFetch + LLM with zero API-key setup.

在 Manus 中运行

概览

安装命令

npx skills add https://github.com/kouko/monkey-skills --skill fact-check

复制此命令并粘贴到 Claude Code 中以安装该技能

来源

kouko/monkey-skills

星标4

分支0

更新时间2026年6月3日 04:55

文件资源管理器

9 个文件

SKILL.md

readonly

同仓库更多 Skills

同仓库

dbt-model-style

kouko/monkey-skills

Enforces a dbt + Redshift model **writing-style & structure** contract — CTE roles, the zero-logic `final` CTE, naming, the two-block YAML header, column comments/tags, and Redshift syntax. Use when authoring, editing, or reviewing a dbt model (`.sql`), or when asked whether one 「符合規範嗎」. Adoptable template: project-specific items are tagged `(adapt)`; model comments & frontmatter values stay in the user's working language. Do NOT use for calculation logic, business rules, metric formulas, or layer-dependency design — style & structure ONLY. dbt 撰寫風格・CTE 結構・命名・註解・排版。dbt スタイル・命名規則。

2026-06-044

dogfood-skill-testing

kouko/monkey-skills

Behavioral black-box dogfood of a skill-IN-DEVELOPMENT — a raw SKILL.md in the working tree that is NOT yet installed. Use when the user wants to gut-check how a drafted/edited skill actually behaves before trusting it: does it FIRE when it should and NOT over-fire, and does its workflow produce output that meets its own declared contract on real input. A fresh blind subagent that does NOT know the author's intent probes the triggers and the workflow and reports what breaks with reproducible transcript evidence. Tests BOTH dimensions co-equally: triggering (trigger-miss / over-trigger) AND output quality (workflow drift / gate bypass / valid-but-wrong), on working-tree files — no install, emitting a fix-actionable report. Triggers — zh-TW:「dogfood 這個 skill」「測試我的 skill 會不會觸發」; ja:「スキルをドッグフード」「発火するか試す」; en: "dogfood this skill", "behavioral / blind-test my skill", "will this skill fire before I ship". Do NOT use for: static design scoring of a SKILL.md (use dev-workflow:skill-judge — 8-dimension rubric, reads t

2026-06-034

deep-read

kouko/monkey-skills

Deeply understand ONE large document or book — build a structured understanding (sections, claims, methodology, caveats, argument-structure) of a single source, depth-on-one-source vs deep-research's breadth-across-many. Use when the user wants to thoroughly comprehend one long document, paper, or book, run inside any coding agent host using the host's own tools (zero API-key setup).

2026-06-034

daily-brief

kouko/monkey-skills

把散在 Gmail / Slack / Notion / Asana / Drive / Calendar / GitHub 的事整理成一份可信、每列可點處理的晨報,外加一張零省略行動表;每天累積、跨日延續(已結/仍在等你/新發生)。每天開工、想知道今天有什麼要處理、怕漏掉待回覆時用——before starting your workday。觸發詞:每日簡報、daily brief、morning brief、晨報、今天要做什麼、待回覆、要回的訊息、standup、今日まとめ、朝のブリーフィング。 Do NOT use for 績效回顧/自評/專案盤點(那是 performance-evidence-audit,同機制反方向時間軸);Do NOT use 來寫回官方系統、代送或自動回覆(本 skill 唯讀 + 只寫本機草稿)。

2026-06-034

cite-check

kouko/monkey-skills

Audit an existing document's cited claims — fetch each cited source and check it actually supports the claim; flag unsupported / misattributed / dead-link citations. Use when the user wants to verify that a document's citations hold up, run inside any coding agent host using the host's own LLM + web tools (zero API-key setup).

2026-06-034

init

kouko/monkey-skills

First-time setup for dbt-wiki: scaffold .dbt-wiki/ knowledge base from target/manifest.json (model / source / macro / seed / snapshot / test / exposure metadata, ref/source dependencies, schema.yml columns and tests), plus target/compiled/<project>/**/*.sql parsed via sqlglot for column-level lineage, plus dbt/models/**/*.sql raw files parsed via regex for inline SQL/jinja comments. Generates one markdown page per resource, plus index.md (grouped by tier / materialization / tag / group), lineage.md (ASCII DAG + adjacency list), log.md, SCHEMA.md, and an idempotent CLAUDE.md drop-in. Re-runnable: refreshes manifest-derived fields, archives orphans, preserves user-owned body sections. Pre-condition: dbt parse && dbt compile must be run first (init checks for target/manifest.json and target/compiled/), and sqlglot must be installed (pip install sqlglot). Triggers on "init dbt-wiki", "set up dbt-wiki", "scaffold dbt knowledge base", "seed dbt model wiki", "build dbt-wiki from manifest", "first-time dbt knowledge"

2026-06-034

来源

kouko

kouko/monkey-skills

打开 GitHub 仓库查看创作者相关仓库

安装命令

下载

在 Manus 中运行

name	fact-check
description	Lightweight single-claim adversarial verdict — supported / refuted / inconclusive with cited evidence. Use when one factual claim needs checking mid-conversation — the host agent gathers a little evidence and runs the same adversarial quorum as deep-research, returning a verdict (not a report), using the host's own WebSearch/WebFetch + LLM with zero API-key setup.
version	0.1.0

fact-check

A lightweight, single-claim adversarial verifier. You hand it one factual claim; it gathers a little evidence, runs the same adversarial quorum deep-research uses, and returns a verdict — supported / refuted / inconclusive with cited evidence and a confidence — not a multi-page report.

This is the point-check counterpart to deep-research. Where deep-research fans out across 3–6 angles and dozens of sources to synthesize a breadth report, fact-check spends a small budget on one claim and stops at the verdict. It reuses deep-research's verify + quorum primitives directly: schemas.py, rank.py, prompts.py, dedup.py in this skill's scripts/ are byte-identical copies of deep-research's (kept in sync by a repo-level MD5 drift check). Only factcheck.py — the Stage-C verdict mapper — is new.

Executor model — who does what

You (the agent running this skill) are the executor. You supply the LLM reasoning, the web tools, and the per-voter fan-out:

LLM reasoning — you rank search hits, extract supporting/contradicting quotes, and cast the adversarial votes yourself, emitting JSON that conforms to a bundled schema.
Web search — your host WebSearch tool.
Web fetch — your host WebFetch tool.
Fan-out — you dispatch the parallel verifier voters (see below).

The bundled scripts/*.py supply only deterministic logic — prompt text, JSON schemas, URL-dedup, ranking, quorum, the verdict mapping. They make no network calls and read no API keys. They are stdlib-only and run with plain python.

No API key is required. This skill borrows the host agent's own LLM + web tools (your existing subscription) — there is no key to set, no separate program to install, no per-call API cost. The reasoning and I/O ride on the agent you are already in.

Run all python scripts/… commands from this skill's own scripts/ directory (paths below are relative to it).

Portable fan-out convention

Stage B casts VOTES_PER_CLAIM = 3 independent verifier votes. Do these in parallel by dispatching 3 subagents, per code-toolkit:dispatching-parallel-agents: one fresh subagent per voter_idx, dispatched in a single assistant message with multiple agent calls so the harness runs them concurrently.

Describe and dispatch this work abstractly as "dispatch N subagents" — do not hard-code the Claude Code Workflow tool. Stated abstractly, the fan-out maps onto whatever concurrent-subagent primitive the host agent provides (Claude Code, Codex, Cursor, …); binding to one harness's workflow primitive would break agent-portability. The three voters are independent (same claim, disjoint voter_idx, no shared files), exactly the case the fan-out is for.

Stage A — Gather evidence

Spend a small budget confirming and disconfirming the one claim.

Run host WebSearch once or twice: one query phrased to confirm the claim, one phrased to disconfirm it (e.g. "<claim>" and "<claim> debunked OR false OR contradicted"). Keep it to 1–2 queries — this is a point-check, not a breadth sweep.
Rank/filter the raw hits into structured results. Treat the claim as a single search angle. Get the ranking prompt:
```
python scripts/prompts.py search --angle '{"label":"claim","query":"<q>"}' --question "<the claim>"
```
Reason over it and emit results conforming to:
```
python scripts/schemas.py search
```
Shape: {results: [...]}, each result carrying at least url and a relevance rank.
Dedup and cap to a small fetch budget — ≤6 sources for a point-check (well under deep-research's MAX_FETCH = 15). The dedup script normalizes URLs (strips www., trailing /, lowercases host+path) so www.X.com/a/ and x.com/a collapse to one:
```
echo '{"results": [...], "seen": {}, "fetch_slots": 6}' | python scripts/dedup.py
```
stdin {results, seen, fetch_slots} → stdout {novel, seen, slots}: novel is the deduped, budget-capped sources to fetch. High-relevance hits are never budget-dropped, so a strongly-sourced claim can exceed the slot count slightly — that is fine for a point-check.
For each novel source, fetch it with host WebFetch and extract a supporting or contradicting quote bearing on the claim. Get the extraction prompt:
```
python scripts/prompts.py fetch --source '{"url":"<url>","title":"<title>"}' --label "claim" --question "<the claim>"
```
Reason over the fetched content and emit an extraction object conforming to:
```
python scripts/schemas.py extract
```
Shape: {sourceQuality, publishDate, claims} — tag each extracted item with its sourceQuality and importance. Collect the per-source quote, URL, and quality tag into one small evidence pool for the claim.

If no evidence is found (every search empty, every fetch paywalled/dead), skip to Stage C with an empty verdict list — the mapper returns inconclusive.

Stage B — Verify (adversarial quorum)

The one claim faces VOTES_PER_CLAIM = 3 independent adversarial voters whose job is to refute it, each grounded in the Stage-A evidence. Fan out one subagent per voter_idx (0, 1, 2) per the convention above.

For each voter_idx:

Get that voter's prompt (the per-voter --voter-idx diversifies the three votes so they do not echo):

python scripts/prompts.py verify --claim '{"claim":"<the claim>","sourceUrl":"<url>","sourceQuality":"<tag>","quote":"<quote>"}' --voter-idx <i> --question "<the claim>"

The voter reasons (it may run its own WebSearch/WebFetch to hunt counter-evidence) and emits a verdict conforming to:
```
python scripts/schemas.py verdict
```
Shape: {refuted: bool, evidence, confidence, counterSource?}. A voter that fails or returns nothing is an abstention — record its vote as null, not as a non-refutation.
Collect the three votes into an array (abstentions as null).

Stage C — Verdict

Map the three votes to the 3-way taxonomy with factcheck.py:

echo '[<verdict>, <verdict-or-null>, <verdict>]' | python scripts/factcheck.py verdict

stdin: the verdicts array → stdout a JSON object {verdict, confidence, valid_count, refuted_count}:

supported — survives the quorum (rank.quorum_survives: ≥2 valid votes AND fewer than REFUTATIONS_REQUIRED = 2 of them refute it).
refuted — ≥REFUTATIONS_REQUIRED valid votes carry refuted: true.
inconclusive — anything else. This covers the three weak cases: all-abstain ([null, null, null]), fewer than 2 valid votes, and empty input ([] — no evidence was found in Stage A). The valid-count check gates first, so an all-abstain claim is never falsely "supported" on a refuted-count of 0.

confidence is the strongest confidence among the non-refuting valid votes (else low).

Return the verdict to the user as a short answer: the claim, the supported/refuted/inconclusive label, the confidence, and the cited quotes + source URLs from Stage A that back it. Do not synthesize a full report — that is deep-research's job.

Script-invocation quick reference

Stage	Command	stdin → stdout
A	`prompts.py search --angle A --question Q`	— → search prompt
A	`schemas.py search`	— → search schema
A	`dedup.py`	`{results, seen, fetch_slots}` → `{novel, seen, slots}`
A	`prompts.py fetch --source S --label L --question Q`	— → fetch prompt
A	`schemas.py extract`	— → extract schema
B	`prompts.py verify --claim C --voter-idx I --question Q`	— → verify prompt
B	`schemas.py verdict`	— → verdict schema
C	`factcheck.py verdict`	verdicts array → `{verdict, confidence, valid_count, refuted_count}`