llm-council

Name: Llm Council
Author: DNYoussef

// Multi-model consensus using Karpathy LLM Council pattern for critical decisions

stars:31

forks:6

updated: January 11, 2026, 17:48

SKILL.md

name	llm-council
description	Multi-model consensus using Karpathy LLM Council pattern for critical decisions
allowed-tools	Bash, Read, Write, TodoWrite

LLM Council Skill

LIBRARY-FIRST PROTOCOL (MANDATORY)

Before writing ANY code, you MUST check:

Step 1: Library Catalog

Location: .claude/library/catalog.json
If match >70%: REUSE or ADAPT

Step 2: Patterns Guide

Location: .claude/docs/inventories/LIBRARY-PATTERNS-GUIDE.md
If pattern exists: FOLLOW documented approach

Step 3: Existing Projects

Location: D:\Projects\*
If found: EXTRACT and adapt

Decision Matrix

Match	Action
Library >90%	REUSE directly
Library 70-90%	ADAPT minimally
Pattern exists	FOLLOW pattern
In project	EXTRACT
No match	BUILD (add to library after)
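
The matrix above can be read as an ordered set of checks. The following is a minimal sketch, not part of the skill itself: `chooseAction` and its input fields (`libraryMatch`, `patternExists`, `inProject`) are hypothetical names, and how a match percentage is computed is not specified by the source.

```javascript
// Hypothetical helper that maps the Decision Matrix rows to an action.
// `libraryMatch` is a similarity score in the range 0-1; the skill does not
// say how it is measured, so the caller is assumed to supply it.
function chooseAction({ libraryMatch = 0, patternExists = false, inProject = false } = {}) {
  if (libraryMatch > 0.9) return "REUSE";   // Library >90%
  if (libraryMatch >= 0.7) return "ADAPT";  // Library 70-90%
  if (patternExists) return "FOLLOW";       // Pattern exists
  if (inProject) return "EXTRACT";          // Found in an existing project
  return "BUILD";                           // No match: build, then add to the library
}

console.log(chooseAction({ libraryMatch: 0.95 }));  // REUSE
console.log(chooseAction({ libraryMatch: 0.75 }));  // ADAPT
console.log(chooseAction({ patternExists: true })); // FOLLOW
console.log(chooseAction({}));                      // BUILD
```

The checks run top to bottom, so a strong library match wins over a documented pattern or an existing project.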

Purpose

Run 3-stage multi-model consensus for critical decisions where:

Single-model hallucination risk is unacceptable
Multiple perspectives improve decision quality
High-stakes choices need validation

Architecture (Karpathy Pattern)

STAGE 1: COLLECT
        +---> Claude ---> Response A
        |
Query --+---> Gemini ---> Response B
        |
        +---> Codex ----> Response C

STAGE 2: RANK
  Each model reviews others (anonymized)
  Produces rankings with rationale

STAGE 3: SYNTHESIZE
  Chairman aggregates rankings
  Produces final answer with consensus score
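
To make the anonymize-then-rank flow concrete, here is a rough sketch with stubbed data. It is an illustration only: the function names are hypothetical, and the Borda-count aggregation is an assumption, since the skill does not say how the chairman combines rankings or derives the consensus score.

```javascript
// Stage 1 output (stubbed): one response per model.
const responses = {
  claude: "Use a monolith.",
  gemini: "Use microservices.",
  codex: "Start with a monolith.",
};

// Stage 2 prep: hide model names behind labels A, B, C so reviewers
// judge the content rather than the author.
function anonymize(byModel) {
  const labeled = {};
  const labelToModel = {};
  Object.keys(byModel).forEach((model, i) => {
    const label = String.fromCharCode(65 + i); // 65 is "A"
    labeled[label] = byModel[model];
    labelToModel[label] = model;
  });
  return { labeled, labelToModel };
}

// Stage 3 input: combine the reviewers' ordered label lists (best first).
// ASSUMPTION: a Borda count is used here purely for illustration.
function aggregate(rankings) {
  const points = {};
  for (const order of rankings) {
    order.forEach((label, pos) => {
      points[label] = (points[label] || 0) + (order.length - pos);
    });
  }
  const winner = Object.keys(points).sort((a, b) => points[b] - points[a])[0];
  return { winner, points };
}

const { labeled, labelToModel } = anonymize(responses);
// Stubbed Stage 2 output: each model ranks the two responses it did not write.
const rankings = [["C", "B"], ["C", "A"], ["A", "B"]];
const { winner, points } = aggregate(rankings);
console.log(winner, labelToModel[winner], points);
```

The label-to-model map stays with the orchestrator, so the real model names are only restored after the rankings are in.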

When to Use

Perfect For:

Architecture decisions
Technology selection
Critical bug triage
Security assessment
High-risk deployments
Contentious design choices

Don't Use When:

Simple, low-risk decisions
Time-critical responses
Single correct answer exists
Cost is a concern (3x API usage)

Usage

Basic Council

/llm-council "Should we use microservices or monolith for this system?"

With Threshold

/llm-council "Which auth approach is best?" --threshold 0.75

With Chairman Override

/llm-council "Architecture decision" --chairman gemini

Command Pattern

bash scripts/multi-model/llm-council.sh "<query>" "<threshold>" "<chairman>"

Configuration

Parameter	Default	Description
threshold	0.67	Minimum consensus score
chairman	claude	Model that synthesizes final answer
models	[claude, gemini, codex]	Participating models
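
As a sketch of how these defaults might be applied and passed to the script shown under Command Pattern: `resolveConfig` and `toScriptArgs` are hypothetical helpers, and both validation rules (threshold within 0 to 1, chairman drawn from the participating models) are assumptions not stated by the skill.

```javascript
// Defaults copied from the Configuration table.
const DEFAULTS = Object.freeze({
  threshold: 0.67,
  chairman: "claude",
  models: ["claude", "gemini", "codex"],
});

// Hypothetical helper: merge caller overrides onto the defaults.
function resolveConfig(overrides = {}) {
  const config = { ...DEFAULTS, ...overrides };
  // ASSUMPTION: scores are fractions, so the threshold must lie in [0, 1].
  if (!(config.threshold >= 0 && config.threshold <= 1)) {
    throw new RangeError("threshold must be between 0 and 1");
  }
  // ASSUMPTION: the chairman is one of the participating models.
  if (!config.models.includes(config.chairman)) {
    throw new Error(`chairman "${config.chairman}" is not a participating model`);
  }
  return config;
}

// Positional arguments in the order shown under Command Pattern:
// "<query>" "<threshold>" "<chairman>".
function toScriptArgs(query, config) {
  return ["scripts/multi-model/llm-council.sh", query, String(config.threshold), config.chairman];
}

console.log(toScriptArgs("Which auth approach is best?", resolveConfig({ threshold: 0.75 })));
```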

Consensus Scoring

>0.80: Strong consensus - proceed with confidence
0.67-0.80: Moderate consensus - consider minority views
<0.67: Weak consensus - escalate to human review
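
The three bands translate directly into a small classifier. This is a sketch; `classifyConsensus` is a hypothetical name, and a score of exactly 0.80 is treated as moderate because the strong band is written as strictly greater than 0.80.

```javascript
// Maps a consensus score to the bands listed above.
function classifyConsensus(score) {
  if (score > 0.8) return "strong";     // proceed with confidence
  if (score >= 0.67) return "moderate"; // consider minority views
  return "weak";                        // escalate to human review
}

console.log(classifyConsensus(0.85)); // strong
console.log(classifyConsensus(0.8));  // moderate
console.log(classifyConsensus(0.5));  // weak
```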

Memory Integration

Results stored to Memory-MCP:

Key: multi-model/council/decisions/{query_id}
Tags: WHO=llm-council, WHY=consensus-decision
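
A record following this key and tag scheme could be assembled as below. This is only a sketch: `buildMemoryRecord` is a hypothetical helper, the shape of the record is an assumption, and the actual Memory-MCP write call is left out because the source does not describe it. How `query_id` is generated is also unspecified, so the caller supplies it here.

```javascript
// Hypothetical helper: builds the key and tags described above.
// The store call itself is intentionally omitted.
function buildMemoryRecord(queryId, result) {
  return {
    key: `multi-model/council/decisions/${queryId}`,
    tags: { WHO: "llm-council", WHY: "consensus-decision" },
    value: result,
  };
}

const record = buildMemoryRecord("example-id", { consensus_score: 0.85 });
console.log(record.key); // multi-model/council/decisions/example-id
```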

Output Format

{
  "query": "Original question",
  "final_answer": {
    "synthesis": "Combined answer...",
    "chairman": "claude"
  },
  "consensus_score": 0.85,
  "responses": {
    "claude": "...",
    "gemini": "...",
    "codex": "..."
  },
  "rankings": [
    {"model": "A", "rank": 1, "rationale": "..."}
  ]
}

Failure Modes

Deadlock (No Consensus)

All models disagree
Consensus < threshold
Action: Store for human review

Model Unavailable

One model times out
Action: Continue with 2 models (2/3 quorum)

Chairman Failure

Synthesis fails
Action: Fall back to highest-ranked response
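
The three failure modes can be handled in one post-processing step. The sketch below is an assumption-laden illustration: `resolveOutcome`, its input fields, and the status strings are hypothetical, and the `no-quorum` branch (fewer than two models answering) is not covered by the source.

```javascript
// Hypothetical post-processing of a single council run.
// `responses` maps model name -> answer text, or null if that model timed out.
function resolveOutcome({ responses, consensusScore, threshold, synthesis, topRanked }) {
  const models = Object.keys(responses).filter((m) => responses[m] !== null);

  // Model Unavailable: continue with a 2/3 quorum.
  // ASSUMPTION: below two models the run is abandoned.
  if (models.length < 2) return { status: "no-quorum", models };

  // Deadlock: consensus below threshold goes to human review.
  if (consensusScore < threshold) return { status: "human-review", models };

  // Chairman Failure: fall back to the highest-ranked response.
  if (synthesis == null) return { status: "fallback", answer: responses[topRanked], models };

  return { status: "ok", answer: synthesis, models };
}

const outcome = resolveOutcome({
  responses: { claude: "a", gemini: null, codex: "c" },
  consensusScore: 0.9,
  threshold: 0.67,
  synthesis: null,   // simulate a failed synthesis
  topRanked: "codex",
});
console.log(outcome.status, outcome.answer); // fallback c
```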

Integration Examples

Architecture Decision

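// runCouncil, proceed, and escalateToHuman are placeholders; the skill does not define them.
const threshold = 0.75;
const decision = await runCouncil(
  "Microservices vs Monolith for our scale?",
  { threshold }
);

// Proceed only when the council reaches the requested consensus.
if (decision.consensus_score >= threshold) {
  proceed(decision.final_answer);
} else {
  escalateToHuman(decision);
}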

Security Assessment

// Use a higher threshold for security decisions.
const assessment = await runCouncil(
  "Is this authentication approach secure?",
  { threshold: 0.80 }
);

Sources

related-skills.json

same repository

browser-automation.md

from "DNYoussef/context-cascade"

Complex browser automation workflow using claude-in-chrome MCP with mandatory sequential-thinking planning. Use when automating multi-step web interactions, form filling, navigation sequences, or web scraping.

2026-01-13, 31 stars

e2e-test.md

from "DNYoussef/context-cascade"

End-to-end testing workflow for validating complete user journeys through web applications using claude-in-chrome MCP. Specializes in test assertions, suite organization, evidence collection, and pass/fail reporting.

2026-01-13, 31 stars

visual-testing.md

from "DNYoussef/context-cascade"

Screenshot-based visual comparison and regression testing using claude-in-chrome MCP. Captures, compares, and validates UI states to detect layout shifts, visual bugs, and design regressions across viewports.

2026-01-13, 31 stars

web-scraping.md

from "DNYoussef/context-cascade"

Structured data extraction from web pages using claude-in-chrome MCP with sequential-thinking planning. Focus on READ operations, data transformation, and pagination handling for multi-page extraction.

2026-01-13, 31 stars

when-reviewing-pull-request-orchestrate-comprehensive-code-revie.md

from "DNYoussef/context-cascade"

Use when conducting comprehensive code review for pull requests across multiple quality dimensions. Orchestrates 12-15 specialized reviewer agents across 4 phases using star topology coordination. Covers automated checks, parallel specialized reviews (quality, security, performance, architecture, documentation), integration analysis, and final merge recommendation in a 4-hour workflow.

2026-01-11, 31 stars

hook-creator.md

from "DNYoussef/context-cascade"

Create Claude Code hooks with proper schemas, RBAC integration, and performance requirements. Use when implementing PreToolUse, PostToolUse, SessionStart, or any of the 10 hook event types for automation, validation, or security enforcement.

2026-01-11, 31 stars

package.json

"author": "DNYoussef"

"repository": "DNYoussef/context-cascade"
