Ejecuta cualquier Skill en Manus
con un clic

Ejecuta cualquier Skill en Manus con un clic

$pwd:

constitutional-reasoning

Name: Constitutional Reasoning
Author: Sandeeprdy1729

// Self-critique and Constitutional AI reasoning skill. Makes Claude evaluate its own outputs against a set of user-defined or auto-generated principles, then revise until the output satisfies all of them. Reduces hallucination, over-confidence, and sycophancy by forcing Claude to argue against its own answer before finalising. Generates a principle set from the user's domain, runs critique passes, surfaces violations, revises, and repeats until no principles are violated or the user accepts the output. Use when user says: critique your own answer, check yourself, apply your principles, constitutional AI, self-review, fact-check this, argue against your own output, steelman the opposite, what are you getting wrong, is this actually correct, audit your answer, find your own mistakes, what assumptions are you making, reduce hallucination, double-check yourself, run a critique pass, apply a rubric. Do NOT activate for: creative work where principles would suppress quality, requests that explicitly want a single con

Ejecutar en Manus

$ git log --oneline --stat

stars:2

forks:0

updated:26 de mayo de 2026, 16:49

SKILL.md

readonly

name	constitutional-reasoning
description	Self-critique and Constitutional AI reasoning skill. Makes Claude evaluate its own outputs against a set of user-defined or auto-generated principles, then revise until the output satisfies all of them. Reduces hallucination, over-confidence, and sycophancy by forcing Claude to argue against its own answer before finalising. Generates a principle set from the user's domain, runs critique passes, surfaces violations, revises, and repeats until no principles are violated or the user accepts the output. Use when user says: critique your own answer, check yourself, apply your principles, constitutional AI, self-review, fact-check this, argue against your own output, steelman the opposite, what are you getting wrong, is this actually correct, audit your answer, find your own mistakes, what assumptions are you making, reduce hallucination, double-check yourself, run a critique pass, apply a rubric. Do NOT activate for: creative work where principles would suppress quality, requests that explicitly want a single confident answer without review. First response: "Constitutional Reasoning active. Paste your question or the output you want critiqued. I'll define the principle set, then run critique passes until violations are resolved."
license	Apache 2.0

Constitutional Reasoning

Claude gives confident answers that are wrong. Not because it's lying — because it's predicting what a correct answer looks like. It pattern-matches to plausible outputs. The first draft is optimized for coherence, not accuracy.

Constitutional AI solves this by making the model evaluate its output against explicit principles before finalizing. This skill operationalises that loop: define principles, run critique, surface violations, revise, repeat.

SLASH COMMANDS

Command	Action
`/constitution <domain>`	Auto-generate a principle set for a specific domain
`/add-principle <principle>`	Add a custom principle to the active constitution
`/critique <output>`	Run one critique pass on an output and list violations
`/revise`	Revise the output to fix all listed violations
`/loop <n>`	Run n critique-revise cycles automatically
`/show-constitution`	Display the active principle set
`/violations`	List all unresolved violations from the last critique pass
`/diff`	Show exactly what changed between original and revised output
`/certify`	Declare the output clean — list any principles still soft-violated
`/trust-score`	Score the output's reliability 0–10 with a calibration justification
`/reset`	Clear the constitution and start fresh

HIGH-LEVEL WORKFLOW

User provides output or question to evaluate
    │
    ├─ Phase 1: Constitution Generation
    │     Define principles relevant to the domain and task
    │
    ├─ Phase 2: Initial Output
    │     Generate or receive the answer to be evaluated
    │
    ├─ Phase 3: Critique Pass
    │     Test the output against every principle; list violations
    │
    ├─ Phase 4: Revision
    │     Rewrite to fix violations without introducing new ones
    │
    ├─ Phase 5: Re-critique
    │     Re-run the critique pass on the revised output
    │
    └─ Phase 6: Certification
          Declare the output clean or flag residual soft violations

PHASE 1 — CONSTITUTION GENERATION

A constitution is a list of principles the output must satisfy. Principles are falsifiable — each one can be checked against the output with a PASS or FAIL.

Domain-specific principle templates

Factual / research output:

P1: Every specific claim is either (a) sourced from the provided context,
    or (b) explicitly flagged as inference with a confidence qualifier.
P2: No number, date, name, or statistic is stated with more certainty than
    the evidence supports.
P3: Uncertainty is expressed in calibrated terms ("likely", "unclear",
    "insufficient data") — not suppressed.
P4: The output does not contradict any claim made in the source material.
P5: Alternative interpretations of ambiguous evidence are surfaced.

Code output:

P1: Every function does what its name and docstring claim.
P2: No assumption about input type, range, or availability is unstated.
P3: Error paths are handled or explicitly noted as out of scope.
P4: No variable is used before it is assigned.
P5: No library or API is called with parameters that contradict its specification.

Strategic / recommendation output:

P1: Every recommendation is tied to a stated constraint or goal.
P2: Tradeoffs of the recommended approach are explicitly listed.
P3: The output does not recommend an action whose prerequisites are unverified.
P4: No single point of failure is introduced without being named.
P5: The confidence level of the recommendation matches the evidence quality.

Summary / synthesis output:

P1: No information present in the source is misrepresented in the summary.
P2: No information absent from the source is added to the summary.
P3: Proportionality is preserved — major points are not buried; minor points
    are not elevated.
P4: Conflicting information in the source is reflected as conflict, not resolved.
P5: The summary does not add causal claims the source does not make.

Custom principle rules

Must be falsifiable: "This is good" is not a principle. "No claim is unsupported" is.
Must be checkable against the output alone (not against external knowledge)
Phrase as constraints, not goals: "No X" or "Every Y" not "Be accurate"
Maximum 8 principles per constitution — more dilutes attention

PHASE 2 — CRITIQUE PASS

Run every principle against the output. For each principle, verdict: PASS or FAIL. For every FAIL, provide:

The violated principle
The exact sentence or section that violates it
Why it violates the principle
What would make it pass

Critique pass format

CRITIQUE PASS [N]

P1: [principle text]
  Verdict: PASS

P2: [principle text]
  Verdict: FAIL
  Violating text: "[exact quote from output]"
  Violation: [one sentence: why this fails the principle]
  Fix: [one sentence: what the output should say instead]

P3: [principle text]
  Verdict: PASS

SUMMARY
  Violations: [N]
  Critical (blocks certification): [list]
  Soft (note but can certify): [list]

Critique rules

Quote the exact violating text. Do not paraphrase.
Do not soften violations. "This could be interpreted as..." = FAIL, not soft-PASS.
A claim that cannot be verified from provided context = FAIL for factual principles, unless flagged as inference.
Sycophantic language triggers P-confidence violation ("clearly", "obviously", "undoubtedly" without evidence).
If the output omits a required element, that is a violation even if nothing explicitly wrong is stated.

PHASE 3 — REVISION

Rewrite the output to fix all FAIL verdicts. Rules:

Fix in order of severity (critical violations first)
Do not introduce new violations while fixing old ones
Do not remove accurate content to avoid failing a principle — rewrite it instead
Where a claim cannot be verified, add a calibration qualifier, do not delete the claim
After revision, explicitly re-check each previously-failed principle

Calibration qualifier vocabulary

Confidence level	Qualifier to use
Very high (near-certain)	"Evidence strongly indicates..."
High	"This is likely because..."
Medium	"This suggests, though isn't confirmed..."
Low	"One interpretation is..., though this is uncertain"
Very low	"Insufficient data to determine — this is speculative"

PHASE 4 — CERTIFICATION

After all critique-revise cycles, certify the output.

CERTIFICATION

Constitution applied: [N principles]
Critique cycles run: [N]
Violations resolved: [N]

Status: CERTIFIED CLEAN
  — All [N] principles satisfied in final revision.

OR

Status: CERTIFIED WITH NOTES
  — [N] soft violations remain:
    · [principle]: [note — why it's soft, not critical]

Trust score: [X]/10
  [2 sentences: what drives the score up, what limits it]

Trust score rubric

Score	Meaning
9–10	Every claim sourced or qualified; no unchecked assumptions; all tradeoffs surfaced
7–8	Minor unqualified claims; tradeoffs mostly surfaced; no critical violations
5–6	Some claims lack support; some tradeoffs missing; 1–2 soft violations
3–4	Significant unsupported claims; important tradeoffs absent; overconfident language
1–2	Multiple critical violations; claims contradict source; hallucination likely

related-skills.json

mismo repositorio

constraint-solver.md

from "Sandeeprdy1729/claude-design-skill"

Constraint-based problem solving skill. Finds elegant solutions when the design space is heavily restricted — budget caps, time limits, technology mandates, team size, ethical bounds, regulatory requirements. Treats constraints as the design material, not the obstacle. Surfaces constraint violations early, ranks constraints by hardness, and finds solutions that satisfy all hard constraints while optimizing within soft ones. The tighter the constraints, the sharper the solution. Use when user says: very limited budget, we can't use X, strict deadline, only N people, regulatory constraint, tech stack is fixed, solve with these restrictions, must work within these limits, strong constraints, can't change the requirements, work within the box, solve this but we can only, tiny budget, no budget, works with these limitations, solve under constraint. Do NOT activate for: open-ended problems with no constraints, problems where the first step is to challenge the constraints themselves (use pre-mortem instead). First r

2026-05-262

cross-domain-synthesis.md

from "Sandeeprdy1729/claude-design-skill"

Cross-domain synthesis skill. Combines knowledge from completely different fields to generate novel, practical insights — not analogies for their own sake, but structural borrowings where the mechanism in domain A directly solves the problem in domain B. Activates when the user wants unconventional approaches, is stuck in their domain's standard thinking, or explicitly wants to borrow from another field. Finds the structural isomorphism — not just a surface metaphor. Use when user says: cross-domain, think outside my field, what would X think, borrow from another discipline, combine these fields, unconventional approach, think like a biologist / game designer / architect, apply X to Y, what does another field say about this, novel approach, think differently, synthesize across domains, first principles from another field. Do NOT activate for: domain-specific questions where conventional expertise is what the user actually needs, requests that are already cross-domain framed. First response: "Cross-Domain Synt

2026-05-262

iterative-refinement.md

from "Sandeeprdy1729/claude-design-skill"

Rapid iterative refinement skill. Runs 4–6 structured improvement cycles on any output — writing, code, strategy, design — without losing coherence, regressing previous improvements, or drifting from the original intent. Each cycle targets a specific refinement dimension, tracks deltas explicitly, and terminates when the output reaches a defined quality threshold. Prevents the common failure where "make it better" produces sidewise movement instead of genuine improvement. Use when user says: refine this, iterate, make it better, run refinement cycles, improve this over multiple passes, keep improving, cycle through this, refine until good, multiple passes, polish this, iterate until done, run the loop, keep going until it's great, another pass, refine again. Do NOT activate for: first-draft generation, simple one-shot edits, structural rewrites that require rethinking from scratch. First response: "Iterative Refinement active. Paste the output to refine and describe what 'done' looks like — quality bar, inten

2026-05-262

large-doc-mastery.md

from "Sandeeprdy1729/claude-design-skill"

Large document and codebase synthesis skill. Activates when the user provides 50k–200k+ tokens of context — codebases, PDFs, legal documents, transcripts, research corpora — and needs synthesis, cross-referencing, or insight extraction with perfect recall across the entire context. Not a summarizer. A precision instrument for finding patterns, contradictions, dependencies, and insights that only emerge when you hold the whole thing at once. Use when user says: analyze this codebase, read this whole document, find all references to X, synthesize across sections, what does this say about Y, find contradictions, extract all decisions, map the dependencies, read the whole thing, cross-reference, audit the entire document, what changed between sections, find every mention of, trace this through the code, what's missing. Do NOT activate for: small documents where a direct answer is more useful, tasks that don't require cross-document reasoning. First response: "Large Document Mode active. Paste or attach your docum

2026-05-262

precision-prompt-chaining.md

from "Sandeeprdy1729/claude-design-skill"

Multi-step prompt chaining skill where each step's output becomes the next step's input, with explicit rules, constraints, and reference tokens between steps. Activates when the user needs to build a complex pipeline of dependent Claude calls — research → outline → draft → edit → publish — where downstream steps must be constrained by upstream outputs without hallucination or drift. Designs chains with named output variables, explicit pass rules, validation gates between steps, and failure recovery conditions. Use when user says: build a chain, multi-step prompt, pipeline of prompts, chain these steps, step by step with outputs, connect prompts, feed output into next prompt, automate a workflow, prompt pipeline, chained calls, sequential reasoning, dependent steps, chain of thought at scale, compound prompt. Do NOT activate for: single-turn questions, simple back-and-forth conversation, cases where one prompt is sufficient for the task. First response: "Prompt Chain Builder active. Describe the full workflow

2026-05-262

red-teaming.md

from "Sandeeprdy1729/claude-design-skill"

Adversarial red-teaming skill for code, systems, strategies, and plans. Activates when the user wants their work attacked: finding security holes, edge cases, failure modes, logical flaws, incorrect assumptions, and risks they haven't considered. Different from pre-mortem (which focuses on pre-mortems for plans/proposals) — this skill covers technical systems, code correctness, API contracts, business logic, and strategies by explicitly playing the attacker, the adversarial user, or the skeptical engineer. Surfaces the most dangerous findings first. Use when user says: red team this, find the holes, attack this code, what could an attacker do, find the edge cases, break this, where does this fail, security review, find the bugs, what am I missing, adversarial review, how would you break this API, stress test, abuse cases, find the failure modes, exploit this, what's the worst that could happen, find the vulnerabilities, think like an attacker. Do NOT activate for: requests for improvements or feature suggesti

2026-05-262

package.json

"author": "Sandeeprdy1729"

"repository": "Sandeeprdy1729/claude-design-skill"

Abrir repositorio de GitHub Ver repositorios del creador

$ install --global

$ download --local

Ejecutar en Manus

$ useful --forSOC

Oficiales de cumplimientoOperaciones empresariales y financieras13-1041L4

name	constitutional-reasoning
description	Self-critique and Constitutional AI reasoning skill. Makes Claude evaluate its own outputs against a set of user-defined or auto-generated principles, then revise until the output satisfies all of them. Reduces hallucination, over-confidence, and sycophancy by forcing Claude to argue against its own answer before finalising. Generates a principle set from the user's domain, runs critique passes, surfaces violations, revises, and repeats until no principles are violated or the user accepts the output. Use when user says: critique your own answer, check yourself, apply your principles, constitutional AI, self-review, fact-check this, argue against your own output, steelman the opposite, what are you getting wrong, is this actually correct, audit your answer, find your own mistakes, what assumptions are you making, reduce hallucination, double-check yourself, run a critique pass, apply a rubric. Do NOT activate for: creative work where principles would suppress quality, requests that explicitly want a single confident answer without review. First response: "Constitutional Reasoning active. Paste your question or the output you want critiqued. I'll define the principle set, then run critique passes until violations are resolved."
license	Apache 2.0

Constitutional Reasoning

SLASH COMMANDS

Command	Action
`/constitution <domain>`	Auto-generate a principle set for a specific domain
`/add-principle <principle>`	Add a custom principle to the active constitution
`/critique <output>`	Run one critique pass on an output and list violations
`/revise`	Revise the output to fix all listed violations
`/loop <n>`	Run n critique-revise cycles automatically
`/show-constitution`	Display the active principle set
`/violations`	List all unresolved violations from the last critique pass
`/diff`	Show exactly what changed between original and revised output
`/certify`	Declare the output clean — list any principles still soft-violated
`/trust-score`	Score the output's reliability 0–10 with a calibration justification
`/reset`	Clear the constitution and start fresh

HIGH-LEVEL WORKFLOW

User provides output or question to evaluate
    │
    ├─ Phase 1: Constitution Generation
    │     Define principles relevant to the domain and task
    │
    ├─ Phase 2: Initial Output
    │     Generate or receive the answer to be evaluated
    │
    ├─ Phase 3: Critique Pass
    │     Test the output against every principle; list violations
    │
    ├─ Phase 4: Revision
    │     Rewrite to fix violations without introducing new ones
    │
    ├─ Phase 5: Re-critique
    │     Re-run the critique pass on the revised output
    │
    └─ Phase 6: Certification
          Declare the output clean or flag residual soft violations

PHASE 1 — CONSTITUTION GENERATION

A constitution is a list of principles the output must satisfy. Principles are falsifiable — each one can be checked against the output with a PASS or FAIL.

Domain-specific principle templates

Factual / research output:

P1: Every specific claim is either (a) sourced from the provided context,
    or (b) explicitly flagged as inference with a confidence qualifier.
P2: No number, date, name, or statistic is stated with more certainty than
    the evidence supports.
P3: Uncertainty is expressed in calibrated terms ("likely", "unclear",
    "insufficient data") — not suppressed.
P4: The output does not contradict any claim made in the source material.
P5: Alternative interpretations of ambiguous evidence are surfaced.

Code output:

P1: Every function does what its name and docstring claim.
P2: No assumption about input type, range, or availability is unstated.
P3: Error paths are handled or explicitly noted as out of scope.
P4: No variable is used before it is assigned.
P5: No library or API is called with parameters that contradict its specification.

Strategic / recommendation output:

P1: Every recommendation is tied to a stated constraint or goal.
P2: Tradeoffs of the recommended approach are explicitly listed.
P3: The output does not recommend an action whose prerequisites are unverified.
P4: No single point of failure is introduced without being named.
P5: The confidence level of the recommendation matches the evidence quality.

Summary / synthesis output:

P1: No information present in the source is misrepresented in the summary.
P2: No information absent from the source is added to the summary.
P3: Proportionality is preserved — major points are not buried; minor points
    are not elevated.
P4: Conflicting information in the source is reflected as conflict, not resolved.
P5: The summary does not add causal claims the source does not make.

Custom principle rules

Must be falsifiable: "This is good" is not a principle. "No claim is unsupported" is.
Must be checkable against the output alone (not against external knowledge)
Phrase as constraints, not goals: "No X" or "Every Y" not "Be accurate"
Maximum 8 principles per constitution — more dilutes attention

PHASE 2 — CRITIQUE PASS

Run every principle against the output. For each principle, verdict: PASS or FAIL. For every FAIL, provide:

The violated principle
The exact sentence or section that violates it
Why it violates the principle
What would make it pass

Critique pass format

CRITIQUE PASS [N]

P1: [principle text]
  Verdict: PASS

P2: [principle text]
  Verdict: FAIL
  Violating text: "[exact quote from output]"
  Violation: [one sentence: why this fails the principle]
  Fix: [one sentence: what the output should say instead]

P3: [principle text]
  Verdict: PASS

SUMMARY
  Violations: [N]
  Critical (blocks certification): [list]
  Soft (note but can certify): [list]

Critique rules

Quote the exact violating text. Do not paraphrase.
Do not soften violations. "This could be interpreted as..." = FAIL, not soft-PASS.
A claim that cannot be verified from provided context = FAIL for factual principles, unless flagged as inference.
Sycophantic language triggers P-confidence violation ("clearly", "obviously", "undoubtedly" without evidence).
If the output omits a required element, that is a violation even if nothing explicitly wrong is stated.

PHASE 3 — REVISION

Rewrite the output to fix all FAIL verdicts. Rules:

Fix in order of severity (critical violations first)
Do not introduce new violations while fixing old ones
Do not remove accurate content to avoid failing a principle — rewrite it instead
Where a claim cannot be verified, add a calibration qualifier, do not delete the claim
After revision, explicitly re-check each previously-failed principle

Calibration qualifier vocabulary

Confidence level	Qualifier to use
Very high (near-certain)	"Evidence strongly indicates..."
High	"This is likely because..."
Medium	"This suggests, though isn't confirmed..."
Low	"One interpretation is..., though this is uncertain"
Very low	"Insufficient data to determine — this is speculative"

PHASE 4 — CERTIFICATION

After all critique-revise cycles, certify the output.

CERTIFICATION

Constitution applied: [N principles]
Critique cycles run: [N]
Violations resolved: [N]

Status: CERTIFIED CLEAN
  — All [N] principles satisfied in final revision.

OR

Status: CERTIFIED WITH NOTES
  — [N] soft violations remain:
    · [principle]: [note — why it's soft, not critical]

Trust score: [X]/10
  [2 sentences: what drives the score up, what limits it]

Trust score rubric

Score	Meaning
9–10	Every claim sourced or qualified; no unchecked assumptions; all tradeoffs surfaced
7–8	Minor unqualified claims; tradeoffs mostly surfaced; no critical violations
5–6	Some claims lack support; some tradeoffs missing; 1–2 soft violations
3–4	Significant unsupported claims; important tradeoffs absent; overconfident language
1–2	Multiple critical violations; claims contradict source; hallucination likely

constitutional-reasoning

Constitutional Reasoning

SLASH COMMANDS

HIGH-LEVEL WORKFLOW

PHASE 1 — CONSTITUTION GENERATION

Domain-specific principle templates

Custom principle rules

PHASE 2 — CRITIQUE PASS

Critique pass format

Critique rules

PHASE 3 — REVISION

Calibration qualifier vocabulary

PHASE 4 — CERTIFICATION

Trust score rubric

Más de este repositorio

Más de este repositorio

Constitutional Reasoning

SLASH COMMANDS

HIGH-LEVEL WORKFLOW

PHASE 1 — CONSTITUTION GENERATION

Domain-specific principle templates

Custom principle rules

PHASE 2 — CRITIQUE PASS

Critique pass format

Critique rules

PHASE 3 — REVISION

Calibration qualifier vocabulary

PHASE 4 — CERTIFICATION

Trust score rubric