Ejecuta cualquier Skill en Manus
con un clic

Ejecuta cualquier Skill en Manus con un clic

$pwd:

karpathy-guidelines

Name: Karpathy Guidelines
Author: elihuvillaraus

// Behavioral guidelines to reduce common LLM coding mistakes, derived from Andrej Karpathy's observations on LLM pitfalls. Use when writing, reviewing, or refactoring code to avoid overcomplication, make surgical changes, surface assumptions, and define verifiable success criteria. Universal — applies to all agents and all AI tools. Triggers on "karpathy", "think before coding", "don't assume", "simplify", "surgical changes", "success criteria".

Ejecutar en Manus

$ git log --oneline --stat

stars:2

forks:0

updated:16 de abril de 2026, 14:48

SKILL.md

readonly

related-skills.json

mismo repositorio

architect.md

from "elihuvillaraus/skills"

System Architect that creates parallelizable PRDs with junior-proof technical specs. Use when planning features, designing implementations, or when the user says 'plan', 'architect', 'design', or 'PRD'. Outputs PRDs organized in Priority groups where tasks within each group can be executed in parallel by independent dev subagents (ralph). Each user story includes file ownership, technical specs, and acceptance criteria detailed enough for a Sonnet-class model to implement without clarification.

2026-05-042

orchestrator.md

from "elihuvillaraus/skills"

Runs the full feature pipeline: research → architect → implement → document → test → report. Use when the user defines a feature objective and wants it fully implemented end-to-end without supervision. Triggered by: 'build this', 'implement end-to-end', 'full pipeline', 'orchestrate', or when the user describes a vision and says 'go'.

2026-05-042

ralph.md

from "elihuvillaraus/skills"

Autonomous dev subagent that implements a single user story from a PRD. Use when you need parallel, independent implementation of tasks. Designed to run as a subagent alongside other ralph instances. Receives a specific task ID and PRD path (e.g., 'Implement US003 from docs/tasks/PRD-feature.md'). Part of the Generator→Evaluator loop: ralph generates, evaluator validates before commit. Does NOT commit or modify the PRD — those are handled by the documenter. Includes: Sprint Contract before coding, AGENTS.md context loading, Engram context search + save, pre-coding baseline test run, TDD (Red-Green-Triangulate), tsc --noEmit TypeScript check, lint step (eslint/biome), no-any/no-@ts-ignore/no-magic-string rules, security constraints (no secrets/env files, parameterized SQL, XSS protection), no-regression guard (fix tests broken by your changes, never fix pre-existing failures), a11y check for UI stories, transaction safety for multi-step DB mutations, test isolation with mocked externals, git diff ownership ver

2026-05-042

tester.md

from "elihuvillaraus/skills"

QA specialist that PROVES the app works by actually running it. Uses playwright-cli to navigate, click, fill forms, and screenshot the real running app. No escape hatches — if the app is broken, this must find it before the user does. Use after ralph implements a feature. Triggered by: 'run tests', 'write tests', 'validate', 'E2E', 'tester', 'QA'.

2026-04-212

code-reviewer.md

from "elihuvillaraus/skills"

Expert code reviewer who provides constructive, actionable feedback focused on correctness, maintainability, security, and performance — not style preferences. Reviews code like a mentor, not a gatekeeper. Every comment teaches something. Activar cuando se necesite un Code Reviewer en el equipo o pipeline.

2026-04-162

guardian-angel.md

from "elihuvillaraus/skills"

Pre-commit code validator (GGA — Guardian Angel). Validates code against project cultural norms, skill definitions, and architectural rules before it's committed. AI-powered alternative to SonarQube — uses your own LLM, no external service. Works locally and in GitHub Actions CI/CD. Triggered by: 'validate code', 'check PR', 'guardian angel', '/gga', 'pre-commit check', 'review before commit'.

2026-04-162

package.json

"author": "elihuvillaraus"

"repository": "elihuvillaraus/skills"

Abrir repositorio de GitHub Ver repositorios del creador

$ install --global

$ download --local

Ejecutar en Manus

$ useful --forSOC

Desarrolladores de softwareOcupaciones informáticas y matemáticas15-1252L4

name	karpathy-guidelines
description	Behavioral guidelines to reduce common LLM coding mistakes, derived from Andrej Karpathy's observations on LLM pitfalls. Use when writing, reviewing, or refactoring code to avoid overcomplication, make surgical changes, surface assumptions, and define verifiable success criteria. Universal — applies to all agents and all AI tools. Triggers on "karpathy", "think before coding", "don't assume", "simplify", "surgical changes", "success criteria".
license	MIT
source	https://github.com/forrestchang/andrej-karpathy-skills

Karpathy Guidelines

Behavioral guidelines to reduce common LLM coding mistakes, derived from Andrej Karpathy's observations on LLM coding pitfalls.

"The models make wrong assumptions on your behalf and just run along with them without checking. They don't manage their confusion, don't seek clarifications, don't surface inconsistencies, don't present tradeoffs, don't push back when they should."

"LLMs are exceptionally good at looping until they meet specific goals... Don't tell it what to do, give it success criteria and watch it go."

Tradeoff: These guidelines bias toward caution over speed. For trivial tasks (obvious one-liners, typo fixes), use judgment. For non-trivial work — anything that could cause regressions or require rework — apply these fully.

1. Think Before Coding

Don't assume. Don't hide confusion. Surface tradeoffs.

Before implementing:

State your assumptions explicitly. If uncertain, ask.
If multiple valid interpretations exist, present them — don't pick silently.
If a simpler approach exists, say so. Push back when warranted.
If something is unclear, stop. Name what's confusing. Ask.

Banned behaviors:

Picking a silent interpretation of an ambiguous request and running with it
Hiding confusion behind "I'll figure it out as I go"
Implementing before surfacing a tradeoff that would change the decision

2. Simplicity First

Minimum code that solves the problem. Nothing speculative.

No features beyond what was asked.
No abstractions for single-use code.
No "flexibility" or "configurability" that wasn't requested.
No error handling for impossible scenarios.
If you write 200 lines and it could be 50, rewrite it.

The test: Would a senior engineer say this is overcomplicated? If yes, simplify.

Banned patterns:

Generic base classes for a problem with 1 concrete case
Config objects with 15 fields when 3 are ever used
"For future flexibility" arguments that add complexity now
Wrapper functions that do nothing but call another function

3. Surgical Changes

Touch only what you must. Clean up only your own mess.

When editing existing code:

Don't "improve" adjacent code, comments, or formatting.
Don't refactor things that aren't broken.
Match existing style, even if you'd do it differently.
If you notice unrelated dead code, mention it — don't delete it.

When your changes create orphans:

Remove imports/variables/functions that YOUR changes made unused.
Don't remove pre-existing dead code unless explicitly asked.

The test: Every changed line should trace directly to the user's request. If you can't explain why a line changed, undo it.

4. Goal-Driven Execution

Define success criteria. Loop until verified.

Transform imperative tasks into verifiable goals:

Instead of...	Transform to...
"Add validation"	"Write tests for invalid inputs, then make them pass"
"Fix the bug"	"Write a test that reproduces it, then make it pass"
"Refactor X"	"Ensure tests pass before and after"
"Improve performance"	"Benchmark current time, implement change, verify X% improvement"

For multi-step tasks, state a brief plan with explicit verification:

1. [Step] → verify: [check]
2. [Step] → verify: [check]
3. [Step] → verify: [check]

Strong success criteria let the agent loop independently. Weak criteria ("make it work") require constant clarification and usually produce wrong results.

How to Know It's Working

These guidelines are working when you see:

Fewer unnecessary changes in diffs — Only requested changes appear
Fewer rewrites due to overcomplication — Code is simple the first time
Clarifying questions come before implementation — Not after mistakes
Clean, minimal PRs — No drive-by refactoring, no "improvements"

AGENTS.md / CLAUDE.md Snippet

Add this to any project's AGENTS.md, CLAUDE.md, or .github/copilot-instructions.md to make all AI tools follow these rules on that project:

## Coding Behavior Guidelines (Karpathy)

1. **Think Before Coding** — State assumptions explicitly. If multiple interpretations exist, ask. Push back when a simpler approach exists.
2. **Simplicity First** — Minimum code that solves the problem. No speculative features, no single-use abstractions.
3. **Surgical Changes** — Touch only what you must. Match existing style. Mention unrelated dead code, don't delete it.
4. **Goal-Driven Execution** — Transform tasks into verifiable goals. Loop until success criteria are met.

Pipeline Integration

These rules are already embedded in key pipeline agents — this skill makes them explicit and invocable on demand:

Agent	Where Karpathy rules appear
`ralph`	Sprint Contract (Think Before Coding), TDD (Goal-Driven), output enforcement
`guardian-angel`	Simplicity First check, Surgical Changes diff review
`spec-writer`	Think Before Coding — defines what exists before ralph codes
`code-reviewer`	All 4 principles in the review checklist

Use /karpathy-guidelines when:

Onboarding a new project (add AGENTS.md snippet)
A PR has unexpected drive-by changes
A implementation came back overengineered
You want to re-center an agent that's going in circles

Copilot CLI Operations

Signal

Invoked as a reminder/overlay: KARPATHY_DONE: [principle applied]
No blocking signal — this is a behavioral overlay, not a pipeline step

karpathy-guidelines

Más de este repositorio

Más de este repositorio

Karpathy Guidelines

1. Think Before Coding

2. Simplicity First

3. Surgical Changes

4. Goal-Driven Execution

How to Know It's Working

AGENTS.md / CLAUDE.md Snippet

Pipeline Integration

Copilot CLI Operations

Signal

Karpathy Guidelines

1. Think Before Coding

2. Simplicity First

3. Surgical Changes

4. Goal-Driven Execution

How to Know It's Working

AGENTS.md / CLAUDE.md Snippet

Pipeline Integration

Copilot CLI Operations

Signal