model-routing

Guide for configuring and debugging the model routing layer in Superagent. TRIGGER when: adding a new model provider, configuring fallback chains, tuning cost/latency routing strategy, debugging "model not found" errors, or when the user asks "how does model routing work", "模型路由怎么配置", "add a new LLM provider". DO NOT TRIGGER when: writing agent YAML (use agent-yaml-authoring), implementing tool logic, or working on the UI.

Install command

npx skills add https://github.com/Colin4k1024/superagent-base --skill model-routing

Copy this command and paste it into Claude Code to install the skill.

Source

Colin4k1024/superagent-base

Stars: 3

Forks: 0

Updated: May 20, 2026, 03:13

SKILL.md

name	model-routing
description	Guide for configuring and debugging the model routing layer in Superagent. TRIGGER when: adding a new model provider, configuring fallback chains, tuning cost/latency routing strategy, debugging "model not found" errors, or when the user asks "how does model routing work", "模型路由怎么配置", "add a new LLM provider". DO NOT TRIGGER when: writing agent YAML (use agent-yaml-authoring), implementing tool logic, or working on the UI.
origin	learned
tags	["model","routing","llm","provider","fallback","cost","latency"]

Model Routing

Source: backend/pkg/modelrouter/ and configs/models/routing-rules.yaml.

Routing Strategies

Strategy	When to use
`capability`	Route by required feature (vision, function calling, long context)
`cost`	Pick the cheapest model that meets the task's requirements
`latency`	Pick the fastest model, measured by time to first streamed token
`fallback`	Try the primary model; on error or timeout, move to the next model in the chain
`round_robin`	Distribute load evenly across providers
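To make the strategy table concrete, here is a minimal Python sketch of how a router might choose among candidates. It is an illustration under stated assumptions, not the code in backend/pkg/modelrouter/: the `Model` class and `select_model` function are hypothetical, capability filtering is assumed to apply before every strategy, and models without a latency figure are assumed to sort last. The cost and latency values are the ones from the sample config in this guide.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Model:
    # Field names mirror the keys in routing-rules.yaml; the class is hypothetical.
    id: str
    capabilities: frozenset
    cost_per_1k_tokens: float
    avg_latency_ms: Optional[int] = None  # not every model declares a latency


def select_model(models, strategy, required=frozenset()):
    """Pick one model for a request (illustrative, not the router's real code)."""
    # Assumption: a model lacking a required capability is never a candidate.
    candidates = [m for m in models if required <= m.capabilities]
    if not candidates:
        raise LookupError(f"model not found for capabilities {sorted(required)}")
    if strategy == "cost":
        return min(candidates, key=lambda m: m.cost_per_1k_tokens)
    if strategy == "latency":
        # Assumption: models without a latency figure sort last.
        return min(
            candidates,
            key=lambda m: m.avg_latency_ms if m.avg_latency_ms is not None else float("inf"),
        )
    if strategy == "capability":
        return candidates[0]  # first declared model that satisfies the requirement
    raise ValueError(f"strategy not covered by this sketch: {strategy}")


MODELS = [
    Model("gpt-4o", frozenset({"vision", "function_calling"}), 0.005, 800),
    Model("claude-sonnet-4-6", frozenset({"function_calling", "long_context"}), 0.003),
]

# Cheapest model that supports function calling.
print(select_model(MODELS, "cost", frozenset({"function_calling"})).id)
```

With the sample data, the `cost` strategy picks claude-sonnet-4-6 for a function-calling request, but gpt-4o once vision is required, because the cheaper model no longer qualifies.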

routing-rules.yaml Structure

strategies:
  default: fallback

providers:
  openai:
    base_url: https://api.openai.com/v1
    api_key: ${OPENAI_API_KEY}
    models:
      - id: gpt-4o
        capabilities: [vision, function_calling]
        cost_per_1k_tokens: 0.005
        avg_latency_ms: 800

  anthropic:
    base_url: https://api.anthropic.com
    api_key: ${ANTHROPIC_API_KEY}
    models:
      - id: claude-sonnet-4-6
        capabilities: [function_calling, long_context]
        cost_per_1k_tokens: 0.003

fallback_chains:
  default:                 # tried in order, top to bottom
    - gpt-4o
    - claude-sonnet-4-6
    - gpt-4o-mini          # cheap fallback; not declared under providers above
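The sample config lists gpt-4o-mini in the default chain without declaring it under any provider. Whether the router tolerates that is not stated here, so a conservative lint such as the Python sketch below may help when chasing "model not found" errors. The `RULES` dict is a hand-written stand-in for the parsed YAML, and `undeclared_chain_models` is a hypothetical helper, not part of Superagent.

```python
# Hand-written stand-in for the parsed routing-rules.yaml (only the keys
# this check needs); in practice the dict would come from a YAML loader.
RULES = {
    "providers": {
        "openai": {"models": [{"id": "gpt-4o"}]},
        "anthropic": {"models": [{"id": "claude-sonnet-4-6"}]},
    },
    "fallback_chains": {
        "default": ["gpt-4o", "claude-sonnet-4-6", "gpt-4o-mini"],
    },
}


def undeclared_chain_models(rules):
    """Return {chain_name: [model ids not declared under any provider]}."""
    declared = {
        model["id"]
        for provider in rules.get("providers", {}).values()
        for model in provider.get("models", [])
    }
    missing = {}
    for name, chain in rules.get("fallback_chains", {}).items():
        unknown = [model_id for model_id in chain if model_id not in declared]
        if unknown:
            missing[name] = unknown
    return missing


print(undeclared_chain_models(RULES))  # → {'default': ['gpt-4o-mini']}
```

An empty result means every chain entry maps to a declared model.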

Agent-Level Override

spec:
  # Option A: pin an explicit model
  model: gpt-4o
  # Option B (instead of model): let the router pick the cheapest model
  # that offers every listed capability
  model_strategy: cost
  model_capabilities:
    - vision
    - function_calling

Fallback Behavior

The primary model is called first; on error or timeout, the router moves to the next model in the chain.
If every model in the chain fails, the last error is returned to the caller.
The per-model timeout is configured at the provider level (timeout_ms).
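The fallback rules above can be sketched in a few lines of Python. This is an illustration of the described behavior rather than the router's implementation: `call_with_fallback`, `ModelTimeout`, and `fake_call` are hypothetical names, and the per-model timeout is assumed to be enforced inside the caller-supplied `call_model` function.

```python
class ModelTimeout(Exception):
    """Hypothetical error type standing in for a per-model timeout."""


def call_with_fallback(chain, call_model):
    """Try each model id in order; return (model_id, result) on first success.

    call_model is a caller-supplied function that raises on error or timeout.
    If every model fails, the last error is re-raised to the caller.
    """
    if not chain:
        raise ValueError("fallback chain is empty")
    last_error = None
    for model_id in chain:
        try:
            return model_id, call_model(model_id)
        except Exception as err:  # error or timeout: move to the next model
            last_error = err
    raise last_error


def fake_call(model_id):
    # Simulated provider: the primary model times out, later models answer.
    if model_id == "gpt-4o":
        raise ModelTimeout("gpt-4o timed out")
    return f"reply from {model_id}"


chain = ["gpt-4o", "claude-sonnet-4-6", "gpt-4o-mini"]
print(call_with_fallback(chain, fake_call))
```

Returning the model id alongside the result makes it easy to log which model in the chain actually served the request.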

Adding a New Provider

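Add a provider block to routing-rules.yaml with base_url, api_key, and a models list.
Make sure each model ID exactly matches the identifier the provider's API expects.
Restart the server, or rely on hot reload (the router watches the config file).
Send a test request (prefix the path with your server's address): curl -X POST /api/v1/chat -H 'Content-Type: application/json' -d '{"model":"new-model","messages":[...]}'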

Debugging

# Check which model was selected for a request (log level DEBUG)
APP_LOG_LEVEL=debug make dev-server

# Grep for routing decisions
grep "model_router" logs/app.log
