Ejecuta cualquier Skill en Manus
con un clic

Ejecuta cualquier Skill en Manus con un clic

prompt-engineering

Estrellas1

Forks0

Actualizado1 de junio de 2026, 04:10

Use when developing, aligning, calibrating, or improving ANY prompt, read-trigger description, agent instruction, or output style. Two modes — (1) alignment: fit a prompt to example input→output pairs and test generalization; (2) self-improvement: turn user corrections into a growing test suite and iterate via isolated subagents. Trigger on "align this prompt", "the prompt gave wrong output", "improve/calibrate this prompt", "make this prompt self-improving", "tune the trigger description", or running a prompt improvement cycle. NOT for general copywriting or prose editing — use the writing skill for that.

Instalación

Instalar con Codex o Claude Copia este prompt, pégalo en Codex, Claude u otro asistente, y deja que revise la página de la skill y la instale por ti.

Ejecutar en Manus

Fuente

mshuffett

mshuffett/dotfiles

Abrir repositorio de GitHub Ver repositorios del creador

Descarga

Ejecutar en Manus

Ocupaciones relacionadasSOC

Basado en la clasificación ocupacional SOC

Desarrolladores de softwareOcupaciones informáticas y matemáticas·SOC 15-1252

Explorador de archivos

5 archivos

SKILL.md

readonly

name

prompt-engineering

description

Prompt Engineering

Two complementary workflows for making prompts reliable. Pick the mode that fits the situation:

Mode	Use when	Reference
Alignment	Building or tuning a prompt from scratch; you have (or can write) example X→Y pairs and want it to fit them and generalize to unseen cases	references/alignment.md
Self-improvement	A deployed prompt produces occasionally-wrong output; you want each correction to become a permanent test case	references/self-improvement.md

They compose: use alignment for initial development, then self-improvement for ongoing calibration after deployment.

Shared principles

These hold in both modes — read the why, not just the rule:

Test steering power in isolation. Run the prompt where the tester cannot see the expected answer (the orchestrator/subagent pattern in self-improvement). If the agent can see the right answer, it produces it regardless of whether the prompt actually steers there — so isolation is what makes the test meaningful.
Smallest effective change. Propose the minimal edit that fixes the widest set of failures, then re-test for regressions before accepting. Good improvements often make the prompt shorter, not longer.
Approve before writing. Present the diff + rationale + test results; don't write prompt files until explicitly approved.

Live testing (Claude CLI)

time claude -p --print --output-format text \
  --system-prompt "$(cat .claude/debug/sample-prompt.md)" \
  "ping"

Codex alternative, the fit-to-generalize rubric, and read-trigger-description patterns: see references/alignment.md. Full session structure, subagent execution, and the improvement-log template: see references/protocol.md.

eval-triage (productivity plugin) — an automated LLM-as-judge implementation of the self-improvement loop for Todoist classification.
mistake-tracking — for tracking Claude's own operational mistakes; this skill tracks prompt output quality.

Más de este repositorio

mismo repositorio

coach

mshuffett/dotfiles

Michael's operating + emotional coach. Operational mode — daily startup/shutdown, weekly review, pomodoro, inbox capture, daily notes (auto-loads in ~/ws/notes). Emotional/decision mode (Joe Hudson style) — use on "coach me"/"joe coach", stuck/looping/overthinking, harsh self- or other-judgment, a binary either/or decision that won't resolve, or fear, shame, loneliness, anxiety, burnout, or grief, when Michael wants to be met in a feeling rather than handed advice. Not for clinical crises (refer out).

2026-06-261

todoist

mshuffett/dotfiles

Use when creating or processing Todoist tasks, triaging inbox items, doing daily task review, calibrating Todoist triage behavior, or turning corrections into reusable preferences. Routes to operations (CLI actions) vs calibrated triage (policy, context recovery, preference memory, evals). Trigger this whenever the user asks what to do with Todoist items, wants better task triage, or is refining how Todoist decisions should work.

2026-06-231

deep-research-fanout

mshuffett/dotfiles

Run real Deep Research across ChatGPT, Claude, and Gemini in parallel via the user's own logged-in browser (Chrome extension, zero API cost), save each original report to Notion, then synthesize. Use whenever the user wants a "deep dive", "deep research", a thorough multi-source investigation, or to research a topic across the models and compare what each finds. Drives the paid subscription products, NOT the API. NOT for single-fact lookups or ordinary web search — use web-search for those.

2026-06-221

harness-engineering

mshuffett/dotfiles

Use when setting up, auditing, or improving AI agent infrastructure in a repo — AGENTS.md/CLAUDE.md files, linters, architectural constraints, feedback loops, context tiering, agent specialization, or entropy management. Also triggers on "harness engineering", "agent-friendly repo", "make my repo work well with coding agents", "set up my repo for agents", or "why is my agent struggling".

2026-06-221

adaptive-triage

mshuffett/dotfiles

Interactive Todoist triage with preference learning. Use when the user says "triage", "process my inbox", "clean up tasks", "triage my todoist", "file these captures", or mentions inbox zero. Also use when the user has a batch of raw items (voice notes, links, ideas) that need classifying and routing to Todoist projects or Obsidian. Runs an interactive confirm/correct loop that learns your routing preferences over time.

2026-06-161

session-save

mshuffett/dotfiles

Use when the user asks to save session context, identify the current session or thread, create a resumable handoff, or prepare a Todoist/note summary that must include the working directory and session id. Works across Codex and Claude Code by detecting runtime-specific session identifiers and normalizing them into one summary.

2026-06-161

name

prompt-engineering

description

Prompt Engineering

Two complementary workflows for making prompts reliable. Pick the mode that fits the situation:

Mode	Use when	Reference
Alignment	Building or tuning a prompt from scratch; you have (or can write) example X→Y pairs and want it to fit them and generalize to unseen cases	references/alignment.md
Self-improvement	A deployed prompt produces occasionally-wrong output; you want each correction to become a permanent test case	references/self-improvement.md

They compose: use alignment for initial development, then self-improvement for ongoing calibration after deployment.

Shared principles

These hold in both modes — read the why, not just the rule:

Test steering power in isolation. Run the prompt where the tester cannot see the expected answer (the orchestrator/subagent pattern in self-improvement). If the agent can see the right answer, it produces it regardless of whether the prompt actually steers there — so isolation is what makes the test meaningful.
Smallest effective change. Propose the minimal edit that fixes the widest set of failures, then re-test for regressions before accepting. Good improvements often make the prompt shorter, not longer.
Approve before writing. Present the diff + rationale + test results; don't write prompt files until explicitly approved.

Live testing (Claude CLI)

time claude -p --print --output-format text \
  --system-prompt "$(cat .claude/debug/sample-prompt.md)" \
  "ping"

eval-triage (productivity plugin) — an automated LLM-as-judge implementation of the self-improvement loop for Todoist classification.
mistake-tracking — for tracking Claude's own operational mistakes; this skill tracks prompt output quality.

prompt-engineering

Prompt Engineering

Shared principles

Live testing (Claude CLI)

Related

Prompt Engineering

Shared principles

Live testing (Claude CLI)

Related