Name: Silent Failure Detection
Author: Sandeeprdy1729

Updated: 26 May 2026 at 16:49

Repository: Sandeeprdy1729/claude-design-skill


name	silent-failure-detection
description	Silent failure detection skill. Identifies when Claude is being confidently wrong — surfacing hidden assumptions, exposing overconfident claims, probing calibration, and catching hallucination before it causes damage. Uses specific questioning techniques to force the model to reveal its actual uncertainty rather than presenting plausible-sounding answers as facts. Turns overconfidence into a detectable signal. Use when user says: are you sure about this, check your confidence, how do you know that, what are you assuming, could you be wrong, probe this, test your answer, are you hallucinating, what's your actual confidence, stress test your answer, expose your assumptions, calibrate this, fact check yourself, where could you be wrong, question your own answer, is this actually true, what if you're wrong, what evidence would change your answer. Do NOT activate for: creative tasks where factual accuracy is not the concern, outputs the user explicitly wants without critique. First response: "Silent Failure Detection active. Paste the output or claim you want probed. I'll run calibration interrogation and surface what I'm less certain about than I appeared."
license	Apache 2.0

Silent Failure Detection

The most dangerous Claude output is the one that sounds exactly right and is wrong.

Claude is a language model. It predicts plausible next tokens. Plausibility is not accuracy. The output that sounds most confident — specific dates, exact numbers, named causal mechanisms — is often the output most worth probing. Confidence is a stylistic property, not an epistemic one.

This skill operationalizes the interrogation techniques that expose the gap between apparent confidence and actual calibration. It makes overconfidence visible before it causes damage.

SLASH COMMANDS

Command	Action
`/probe <output>`	Run the full interrogation protocol on an output
`/assumptions`	List every assumption embedded in a given output
`/confidence-audit`	Re-score every claim by actual evidence quality
`/falsify <claim>`	Find the condition under which a claim would be false
`/source-check`	For every specific claim, ask: where does this come from?
`/invert <claim>`	Argue the opposite of the claim — what's the case against it?
`/boundary <claim>`	Find the conditions under which the claim stops being true
`/specificity-trap`	Probe all specific numbers, dates, names for hallucination risk
`/mechanism-check <claim>`	Demand the causal mechanism — not just the conclusion
`/training-vs-source`	Distinguish what comes from training data vs. the provided context
`/calibrate`	Output every uncertain claim with an explicit confidence percentage
`/contrast`	List claims the output could have made but didn't — why not?

HIGH-LEVEL WORKFLOW

User provides output or claim to interrogate
    │
    ├─ Phase 1: Claim Extraction
    │     Identify every factual claim in the output
    │
    ├─ Phase 2: Confidence Audit
    │     Score each claim by evidence quality, not apparent confidence
    │
    ├─ Phase 3: Assumption Exposure
    │     Surface hidden assumptions behind each high-confidence claim
    │
    ├─ Phase 4: Specificity Interrogation
    │     Probe specific numbers, dates, names (highest hallucination risk)
    │
    ├─ Phase 5: Falsification Pass
    │     For each claim: under what conditions is this false?
    │
    └─ Phase 6: Calibrated Output
          Rewrite with explicit confidence qualifiers where warranted

PHASE 1 — CLAIM EXTRACTION

Extract every factual claim from the output. Classify each:

Claim types

Type	Description	Hallucination risk
Specific fact	Exact number, date, name, statistic	Very high
Causal claim	"X causes Y"	High
Categorical claim	"All X are Y" / "No X is Y"	High
Mechanism claim	"It works because..."	High
Comparative claim	"X is better than Y because..."	Medium
General claim	"X is common in Y contexts"	Medium
Definitional claim	"X means Y"	Low (but drifts in specialized domains)
Procedural claim	"To do X, you do Y"	Medium (API versions, deprecated syntax)

Claim extraction format

CLAIM INVENTORY

C1: "[exact quote]"  — Type: [claim type] — Risk: [high/medium/low]
C2: "[exact quote]"  — Type: [claim type] — Risk: [high/medium/low]
...

Total claims: [N]
High-risk claims: [N] — [list]
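
The inventory above maps onto a small data structure. The following is a minimal Python sketch and not part of the skill itself: the names `Claim`, `RISK_BY_TYPE`, and `summarize` are hypothetical, the sample claims are made up, and the risk levels are copied from the claim types table with the parenthetical notes dropped.

```python
from dataclasses import dataclass

# Risk levels copied from the "Claim types" table; the dict name is illustrative.
RISK_BY_TYPE = {
    "Specific fact": "very high",
    "Causal claim": "high",
    "Categorical claim": "high",
    "Mechanism claim": "high",
    "Comparative claim": "medium",
    "General claim": "medium",
    "Definitional claim": "low",
    "Procedural claim": "medium",
}


@dataclass
class Claim:
    quote: str       # exact quote from the output under review
    claim_type: str  # one of the keys in RISK_BY_TYPE

    @property
    def risk(self) -> str:
        return RISK_BY_TYPE[self.claim_type]


def summarize(claims):
    """Return the total count and the IDs (C1, C2, ...) of high-risk claims."""
    high = [
        f"C{i}"
        for i, claim in enumerate(claims, start=1)
        if claim.risk in ("high", "very high")
    ]
    return {"total": len(claims), "high_risk": high}


# Made-up example claims, used only to exercise the sketch.
claims = [
    Claim("73% of users prefer dark mode", "Specific fact"),
    Claim("Caching causes the speedup", "Causal claim"),
    Claim("Latency means response delay", "Definitional claim"),
]
print(summarize(claims))
```

Keeping the risk lookup in one table means a claim's risk can never disagree with its type, which is the main thing the inventory format relies on.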

PHASE 2 — CONFIDENCE AUDIT

Re-score every claim by evidence quality, not by how confident the output sounds.

Evidence quality taxonomy

Level	What it means	Example
Grounded	Claim is directly supported by context the user provided	Document in the prompt states this explicitly
Strong prior	Claim is well-established consensus in the domain	Water boils at 100°C at sea level
Moderate prior	Claim is generally accepted but has meaningful exceptions	Startups should find product-market fit before scaling
Weak prior	Claim is plausible but not well-established	This architecture pattern will scale to 10M users
Training artifact	Claim sounds specific but may be confabulated	Specific API endpoint, exact version number, named study
Hallucination candidate	Specific enough to be checkable; likely pulled from pattern-matching	Named author, exact quote, precise statistic, specific date

Confidence audit format

CONFIDENCE AUDIT

C1: "[claim]"
  Evidence level:   [Grounded / Strong prior / Moderate prior / Weak prior / Training artifact / Hallucination candidate]
  Apparent confidence: [high / medium / low — how confidently was this stated]
  Actual confidence:   [high / medium / low — warranted by evidence]
  Gap:                 [calibrated / overconfident / underconfident]
  Note:               [1 sentence: why the gap exists, if present]

C2: ...

SUMMARY
  Claims at risk of overconfidence: [list C_N, ...]
  Highest-priority to verify: [C_N — why]
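
The "Gap" field in the audit format can be computed mechanically once apparent and actual confidence are on the same scale. Below is a minimal Python sketch under stated assumptions: the function names are hypothetical, and the mapping from evidence level to warranted confidence (`ACTUAL_BY_EVIDENCE`) is an illustrative guess, since the skill text does not define one.

```python
# Ordinal scale for the three confidence labels used in the audit format.
LEVELS = {"low": 0, "medium": 1, "high": 2}

# ASSUMPTION: this mapping is illustrative only; the skill does not specify
# how an evidence level translates into warranted confidence.
ACTUAL_BY_EVIDENCE = {
    "Grounded": "high",
    "Strong prior": "high",
    "Moderate prior": "medium",
    "Weak prior": "low",
    "Training artifact": "low",
    "Hallucination candidate": "low",
}


def confidence_gap(apparent, actual):
    """Compare how confident a claim sounded with what the evidence warrants."""
    if LEVELS[apparent] > LEVELS[actual]:
        return "overconfident"
    if LEVELS[apparent] < LEVELS[actual]:
        return "underconfident"
    return "calibrated"


def audit(apparent, evidence_level):
    """Fill in the 'Actual confidence' and 'Gap' fields for one claim."""
    actual = ACTUAL_BY_EVIDENCE[evidence_level]
    return {"actual": actual, "gap": confidence_gap(apparent, actual)}


print(audit("high", "Training artifact"))
```

A claim stated with high apparent confidence but classified as a training artifact comes out as overconfident, which is the case the summary section asks to prioritize.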

PHASE 3 — ASSUMPTION EXPOSURE

Every confident claim rests on unstated assumptions. Surfacing them is the primary defense against silent failure.

Assumption interrogation questions

For any claim that sounds confident, ask:

Scope assumption: "Is this true in all contexts, or only in [specific context]?"
Temporal assumption: "Is this still true? When was this established?"
Source assumption: "What is the source of this? Is it training data, common knowledge, or the provided context?"
Causal assumption: "Does X actually cause Y, or do they just correlate? What's the mechanism?"
Counter-evidence assumption: "What evidence would make this false? Is that evidence available?"
Domain specificity assumption: "Is this true in the user's specific domain, or only in the domain I trained on?"

Assumption exposure format

ASSUMPTION AUDIT

Claim: "[C_N]"

Unstated assumptions:
  A1: [assumption] — Confidence this assumption holds: [%]
  A2: [assumption] — Confidence this assumption holds: [%]

If A1 is false, the claim: [fails entirely / changes significantly / still mostly holds]
If A2 is false, the claim: [fails entirely / changes significantly / still mostly holds]

Most fragile assumption: [A_N] — This is fragile because: [reason]
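
One way to pick the "most fragile assumption" is to weigh how likely each assumption is to be false against how badly the claim breaks if it is. The Python sketch below is an assumption-laden illustration, not the skill's own rule: the numeric weights in `IMPACT_WEIGHT` and the scoring formula are hypothetical choices.

```python
# ASSUMPTION: numeric weights for the three impact labels are illustrative.
IMPACT_WEIGHT = {
    "fails entirely": 1.0,
    "changes significantly": 0.5,
    "still mostly holds": 0.1,
}


def fragility(confidence_pct, impact):
    """Chance the assumption is false, scaled by how badly the claim breaks."""
    return (1 - confidence_pct / 100) * IMPACT_WEIGHT[impact]


def most_fragile(assumptions):
    """assumptions: list of (label, confidence_pct, impact) tuples."""
    return max(assumptions, key=lambda a: fragility(a[1], a[2]))[0]


# Made-up example values, used only to exercise the sketch.
assumptions = [
    ("A1", 90, "fails entirely"),         # 0.10 * 1.0
    ("A2", 60, "changes significantly"),  # 0.40 * 0.5
    ("A3", 30, "still mostly holds"),     # 0.70 * 0.1
]
print(most_fragile(assumptions))
```

Under these weights the least certain assumption (A3) is not the most fragile one, because the claim still mostly holds without it; A2 scores highest.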

PHASE 4 — SPECIFICITY INTERROGATION

Specific claims carry the highest risk. Numbers, names, dates, and quotes are where language model confabulation concentrates.

Specificity traps to probe

Specificity type	Example	Probe
Exact statistics	"73% of users..."	"What study? What year? What population?"
Named studies/papers	"According to [Author] (Year)..."	"Does this paper exist? What were its actual findings?"
Exact version numbers	"This works in Python 3.11.2"	"Is this version-specific? What's the behavior in other versions?"
API / function specifics	"Call `model.fit(X, epochs=10)`"	"Is this the current API? Has this signature changed?"
Named quotes	"As [Person] said, '...'"	"Did this person actually say this? Exact wording?"
Causal mechanisms	"This works because the transformer attention..."	"Is this mechanism actually established, or plausibly synthesized?"
Historical facts	"This was established in 1987 by..."	"Can this be verified? Is the detail confabulated onto a real event?"

Specificity probe output format

SPECIFICITY PROBE

High-specificity claim: "[C_N]"

Self-interrogation:
  Q: Where does this specific value / name / date come from?
  A: [Training data pattern / Provided context / Unclear origin]

  Q: Could I have confabulated a plausible-sounding specific value?
  A: [Yes — high risk / Possibly / Unlikely]

  Q: If someone fact-checked this, what would they find?
  A: [Likely accurate / Might be off / Should verify before using]

Revised claim with calibration:
  "[Claim rewritten with appropriate qualifier: 'approximately', 'around', 
  'as of [year]', 'verify before using', or 'I'm not certain of the exact figure']"
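
Some of the specificity types in the table can be flagged automatically before the self-interrogation starts. The following Python sketch uses a few regular expressions as rough heuristics; the pattern set, the names `PATTERNS` and `find_specifics`, and the sample sentence are all assumptions made for illustration, and they cover only percentages, three-part version numbers, and four-digit years.

```python
import re

# ASSUMPTION: these patterns are illustrative heuristics, not an exhaustive
# or authoritative list of high-specificity claim shapes.
PATTERNS = {
    "percentage": re.compile(r"\d+(?:\.\d+)?%"),
    "version": re.compile(r"\b\d+\.\d+\.\d+\b"),
    "year": re.compile(r"\b(?:19|20)\d{2}\b"),
}


def find_specifics(text):
    """Return (kind, matched_text) pairs for spans worth probing."""
    hits = []
    for kind, pattern in PATTERNS.items():
        hits.extend((kind, match.group()) for match in pattern.finditer(text))
    return hits


# Made-up sentence, used only to exercise the sketch.
sample = "A 1987 survey found 73% adoption; it runs on Python 3.11.2 today."
for kind, value in find_specifics(sample):
    print(f"{kind}: {value}")
```

A flagged span is only a candidate for probing. Whether the value is accurate still has to be answered through the self-interrogation questions above or by an external check.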

PHASE 5 — FALSIFICATION PASS

Every claim should have a falsification condition. If no evidence could change the claim, it's not a fact — it's a belief held with too much confidence.

Falsification format

FALSIFICATION TEST

Claim: "[C_N]"

Falsification condition:
  "This claim would be false if: [specific condition]"

Is this falsification condition verifiable?
  [Yes — can be checked / Requires specific data / Cannot be verified with available information]

Evidence that would change this claim:
  · [Evidence type 1]
  · [Evidence type 2]

If the user has access to [evidence type], they should verify this claim before using it.

PHASE 6 — CALIBRATED OUTPUT

Rewrite the original output with explicit confidence qualifiers on all flagged claims.

Calibration qualifier vocabulary

Confidence	Qualifier
Grounded in context (very high)	[no qualifier needed — state directly]
Well-established consensus (high)	[state directly; add "generally" for broad claims]
Generally true with exceptions (medium-high)	"typically," "in most cases," "often"
Plausible but uncertain (medium)	"likely," "tends to," "suggests"
Specific claim with confabulation risk (uncertain)	"approximately," "around," "as of [timeframe]"
Possible hallucination (low)	"I'm not certain of the exact figure — verify before using"
Should not be trusted unchecked (very low)	"This specific detail should be independently verified"
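
The vocabulary table can be treated as a lookup from confidence level to suggested wording. This is a minimal Python sketch, assuming short level keys that are hypothetical labels introduced here; the qualifier strings are copied from the table, and the two lowest levels are represented as a verification flag instead of quoted text.

```python
# Qualifiers copied from the vocabulary table. The short level keys are
# hypothetical labels introduced for this sketch.
QUALIFIERS = {
    "very high": [],            # grounded in context: state directly
    "high": ["generally"],      # only for broad claims
    "medium-high": ["typically", "in most cases", "often"],
    "medium": ["likely", "tends to", "suggests"],
    "uncertain": ["approximately", "around", "as of [timeframe]"],
}

# Levels where the table asks for an explicit verification note.
VERIFY_LEVELS = {"low", "very low"}


def calibrate(claim, level):
    """Return the claim with qualifier suggestions and a verification flag."""
    return {
        "claim": claim,
        "qualifiers": QUALIFIERS.get(level, []),
        "verify_before_use": level in VERIFY_LEVELS,
    }


print(calibrate("The endpoint accepts batch requests", "medium"))
print(calibrate("The limit is 500 requests per minute", "low"))
```

The sketch only suggests wording; choosing which qualifier reads naturally in a given sentence is left to the rewrite step.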

Output format

CALIBRATED REVISION

[Original output rewritten with confidence qualifiers on all flagged claims]

─── UNCERTAINTY LOG ────────────────────────────────────────
Claims I'm most uncertain about and why:
1. "[claim]" — reason for uncertainty
2. "[claim]" — reason for uncertainty

Claims most important to verify before acting on:
1. [claim] — consequence of being wrong: [impact]
2. [claim] — consequence of being wrong: [impact]

INTERROGATION TECHNIQUES REFERENCE

When probing any output for silent failures, use these forcing questions:

Technique	Question	What it exposes
Inversion	"What's the strongest case against this claim?"	Overconfident conclusions
Source demand	"Where exactly does this come from?"	Training artifact vs. grounded claim
Boundary probe	"When does this stop being true?"	Scope overreach
Mechanism demand	"What's the causal mechanism?"	Correlation-causation confusion
Specificity trap	"What's the exact number / date / name?"	Confabulated specifics
Alternative demand	"What's an equally plausible alternative explanation?"	Premature closure
Pre-mortem	"If this answer is wrong, what's the most likely way it's wrong?"	Hidden assumptions
Precision reduction	"Say that with less certainty — what's the honest version?"	Confidence inflation
Context lock	"Is this based on what I gave you, or what you already knew?"	Prior contamination
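
Several of the forcing questions line up by name with slash commands listed earlier. The Python sketch below pairs them in a lookup table. The pairing is an assumption inferred from the names alone, the skill text does not state it, and `forcing_question` is a hypothetical helper.

```python
# ASSUMPTION: pairing each slash command with a technique is inferred from
# the names only. The questions are copied from the reference table.
QUESTION_BY_COMMAND = {
    "/invert": "What's the strongest case against this claim?",
    "/source-check": "Where exactly does this come from?",
    "/boundary": "When does this stop being true?",
    "/mechanism-check": "What's the causal mechanism?",
    "/specificity-trap": "What's the exact number / date / name?",
}


def forcing_question(command, claim):
    """Build the probe text for one claim, or fail loudly on unmapped commands."""
    question = QUESTION_BY_COMMAND.get(command)
    if question is None:
        raise ValueError(f"no forcing question mapped for {command!r}")
    return f'Claim: "{claim}"\nProbe: {question}'


print(forcing_question("/boundary", "Indexes always speed up queries"))
```

Raising on an unmapped command, instead of returning a generic question, keeps a typo from silently producing a weaker probe.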
