一键导入
ai-agent-design
Design autonomous AI agents that reason, plan, and execute tasks
用 Codex 或 Claude 帮你安装 复制这段 Prompt,粘贴到 Codex、Claude 或其他助手里,让它检查 Skill 页面并帮你完成安装。
菜单
Design autonomous AI agents that reason, plan, and execute tasks
用 Codex 或 Claude 帮你安装 复制这段 Prompt,粘贴到 Codex、Claude 或其他助手里,让它检查 Skill 页面并帮你完成安装。
基于 SOC 职业分类
| name | ai-agent-design |
| description | Design autonomous AI agents that reason, plan, and execute tasks |
| tier | standard |
| applyTo | **/*agent*,**/*ai-agent*,**/*orchestrat*,**/*multi-agent* |
Patterns for designing AI agents—autonomous systems that use LLMs to reason, plan, and execute multi-step tasks.
| Aspect | Chatbot | Workflow | Agent |
|---|---|---|---|
| Autonomy | Low | None | High |
| Planning | None | Predefined | Dynamic |
| Tool Use | Limited | Fixed | Flexible |
| Memory | Session | None | Persistent |
| Error Recovery | Retry | Fail | Reason & adapt |
1. Thought: Reason about the task
2. Action: Choose and execute a tool
3. Observation: Process tool output
4. Repeat until complete
Example:
Thought: Need Seattle weather to answer umbrella question
Action: weather_api(location="Seattle")
Observation: {"temp": 52, "condition": "rain", "precipitation": 80%}
Thought: Raining with 80% precipitation. Recommend umbrella.
For complex multi-step tasks:
Use when order matters and partial failures need recovery.
Self-improvement through reflection:
Central coordinator delegates to specialists:
Supervisor
/ | \
Research Writer Reviewer
Nested supervisors for complex organizations:
Top Supervisor
/ \
Research Lead Writing Lead
/ \ / \
Web Paper Draft Edit
Multiple agents argue to reduce hallucination:
Agent A (Pro) <--argue--> Agent B (Con)
\ | /
Judge
{
"name": "search_database",
"description": "Search products. Use for availability/pricing queries.",
"parameters": {
"query": { "type": "string", "description": "Search terms" },
"max_results": { "type": "integer", "default": 10 }
}
}
Principles:
| Tools | Strategy |
|---|---|
| < 10 | Direct selection |
| 10-50 | Categorize first |
| 50+ | Embed and retrieve |
Working Memory → Current context (in prompt)
Short-Term Memory → Session state (key-value)
Long-Term Memory → Facts, history (vector DB + graph)
| Type | Storage | Use Case |
|---|---|---|
| Episodic | Vector DB | Past conversations |
| Semantic | Graph DB | Facts, relationships |
| Procedural | Code/prompts | How to do tasks |
| Working | Prompt | Current task |
def detect_loop(history, window=5, threshold=0.8):
recent = history[-window:]
previous = history[-window*2:-window]
return similarity(recent, previous) > threshold
Recovery: reflection prompt, force tool change, replan, escalate.
Require approval for high-risk actions:
Log: LLM calls, tool calls, state transitions, errors, recovery attempts.
| Strategy | Implementation |
|---|---|
| Token budgets | Max tokens per task |
| Step limits | Max N actions |
| Tiered models | GPT-4 plan, 3.5 execute |
| Caching | Cache tool/LLM results |
| Early termination | Stop when good enough |
| Framework | Best For |
|---|---|
| LangChain | Rapid prototyping |
| LangGraph | Complex multi-agent |
| AutoGen | Research, code gen |
| CrewAI | Business workflows |
| Semantic Kernel | Microsoft stack |
✅ Good: Open-ended research, multi-step workflows, tool orchestration ❌ Poor: Simple Q&A (use RAG), deterministic flows (use code), no human oversight
Create and maintain ASCII visual dashboards for project tracking with parallel lane progress bars
Store and manage voice samples for TTS cloning — portable, version-controlled audio references
Clear documentation through visual excellence
AI music generation via Replicate — 5 models for background tracks, lyrics, and sound design
Practitioner methodology for longitudinal case study research, evidence-based documentation, and publication-ready academic writing in AI-assisted development.
First impressions matter. Set projects up for success.