con un clic
ai-agent-design
Design autonomous AI agents that reason, plan, and execute tasks
Instalar con Codex o Claude Copia este prompt, pégalo en Codex, Claude u otro asistente, y deja que revise la página de la skill y la instale por ti.
Menú
Design autonomous AI agents that reason, plan, and execute tasks
Instalar con Codex o Claude Copia este prompt, pégalo en Codex, Claude u otro asistente, y deja que revise la página de la skill y la instale por ti.
Basado en la clasificación ocupacional SOC
Create and maintain ASCII visual dashboards for project tracking with parallel lane progress bars
Store and manage voice samples for TTS cloning — portable, version-controlled audio references
Clear documentation through visual excellence
AI music generation via Replicate — 5 models for background tracks, lyrics, and sound design
Practitioner methodology for longitudinal case study research, evidence-based documentation, and publication-ready academic writing in AI-assisted development.
First impressions matter. Set projects up for success.
| name | ai-agent-design |
| description | Design autonomous AI agents that reason, plan, and execute tasks |
| tier | standard |
| applyTo | **/*agent*,**/*ai-agent*,**/*orchestrat*,**/*multi-agent* |
Patterns for designing AI agents—autonomous systems that use LLMs to reason, plan, and execute multi-step tasks.
| Aspect | Chatbot | Workflow | Agent |
|---|---|---|---|
| Autonomy | Low | None | High |
| Planning | None | Predefined | Dynamic |
| Tool Use | Limited | Fixed | Flexible |
| Memory | Session | None | Persistent |
| Error Recovery | Retry | Fail | Reason & adapt |
1. Thought: Reason about the task
2. Action: Choose and execute a tool
3. Observation: Process tool output
4. Repeat until complete
Example:
Thought: Need Seattle weather to answer umbrella question
Action: weather_api(location="Seattle")
Observation: {"temp": 52, "condition": "rain", "precipitation": 80%}
Thought: Raining with 80% precipitation. Recommend umbrella.
For complex multi-step tasks:
Use when order matters and partial failures need recovery.
Self-improvement through reflection:
Central coordinator delegates to specialists:
Supervisor
/ | \
Research Writer Reviewer
Nested supervisors for complex organizations:
Top Supervisor
/ \
Research Lead Writing Lead
/ \ / \
Web Paper Draft Edit
Multiple agents argue to reduce hallucination:
Agent A (Pro) <--argue--> Agent B (Con)
\ | /
Judge
{
"name": "search_database",
"description": "Search products. Use for availability/pricing queries.",
"parameters": {
"query": { "type": "string", "description": "Search terms" },
"max_results": { "type": "integer", "default": 10 }
}
}
Principles:
| Tools | Strategy |
|---|---|
| < 10 | Direct selection |
| 10-50 | Categorize first |
| 50+ | Embed and retrieve |
Working Memory → Current context (in prompt)
Short-Term Memory → Session state (key-value)
Long-Term Memory → Facts, history (vector DB + graph)
| Type | Storage | Use Case |
|---|---|---|
| Episodic | Vector DB | Past conversations |
| Semantic | Graph DB | Facts, relationships |
| Procedural | Code/prompts | How to do tasks |
| Working | Prompt | Current task |
def detect_loop(history, window=5, threshold=0.8):
recent = history[-window:]
previous = history[-window*2:-window]
return similarity(recent, previous) > threshold
Recovery: reflection prompt, force tool change, replan, escalate.
Require approval for high-risk actions:
Log: LLM calls, tool calls, state transitions, errors, recovery attempts.
| Strategy | Implementation |
|---|---|
| Token budgets | Max tokens per task |
| Step limits | Max N actions |
| Tiered models | GPT-4 plan, 3.5 execute |
| Caching | Cache tool/LLM results |
| Early termination | Stop when good enough |
| Framework | Best For |
|---|---|
| LangChain | Rapid prototyping |
| LangGraph | Complex multi-agent |
| AutoGen | Research, code gen |
| CrewAI | Business workflows |
| Semantic Kernel | Microsoft stack |
✅ Good: Open-ended research, multi-step workflows, tool orchestration ❌ Poor: Simple Q&A (use RAG), deterministic flows (use code), no human oversight