一键导入
utility
Scores agent actions by expected gain, cost, uncertainty, and redundancy. Use when deciding whether to dispatch an agent or invoke a tool.
用 Codex 或 Claude 帮你安装 复制这段 Prompt,粘贴到 Codex、Claude 或其他助手里,让它检查 Skill 页面并帮你完成安装。
菜单
Scores agent actions by expected gain, cost, uncertainty, and redundancy. Use when deciding whether to dispatch an agent or invoke a tool.
用 Codex 或 Claude 帮你安装 复制这段 Prompt,粘贴到 Codex、Claude 或其他助手里,让它检查 Skill 页面并帮你完成安装。
基于 SOC 职业分类
Detects AI-generated writing patterns in prose. Use when reviewing docs for slop, vague language, or identity leaks before publishing.
Audits Rust code for unsafe blocks, ownership issues, and Cargo dependency risks. Use when reviewing Rust code or before merging Rust changes.
Recommends context compression strategies for bloated or quota-heavy sessions. Use when context feels sluggish or quota burns faster than expected.
Guide minimal code via a decision ladder with full safety, edge, and negative-case coverage. Use when adding code, choosing a dependency, or auditing a diff.
Optimizes context window via MECW principles and memory tiering. Use when context exceeds 30% or before long multi-step tasks.
Generates or remediates documentation with human-quality writing. Use when creating new docs, rewriting AI-generated content, or applying style profiles.
| name | utility |
| description | Scores agent actions by expected gain, cost, uncertainty, and redundancy. Use when deciding whether to dispatch an agent or invoke a tool. |
| alwaysApply | false |
| category | infrastructure |
| tags | ["orchestration","cost-control","decision-making","agent-dispatch"] |
| dependencies | [] |
| provides | {"infrastructure":["utility-scoring","action-selection","termination-control"],"patterns":["gain-estimation","cost-computation","redundancy-detection"]} |
| usage_patterns | ["agent-dispatch-gating","tool-call-decisions","continuation-decisions","model-tier-selection"] |
| complexity | intermediate |
| model_hint | standard |
| estimated_tokens | 600 |
| progressive_loading | true |
| modules | ["modules/state-builder.md","modules/gain.md","modules/step-cost.md","modules/uncertainty.md","modules/redundancy.md","modules/action-selector.md","modules/integration.md"] |
A decision framework for agent orchestration based on Liu et al., "Utility-Guided Agent Orchestration for Efficient LLM Tool Use" (arXiv:2603.19896). Each candidate action is scored by subtracting weighted costs from expected gain, producing a single utility value that guides action selection. The framework prevents over-calling tools and premature stopping by making both errors costly. Utility range is [-2.3, 1.0].
A = {respond, retrieve, tool_call, verify, delegate, stop}
| Action | Description |
|---|---|
| respond | Emit a final answer from current context |
| retrieve | Fetch additional information (search, read, lookup) |
| tool_call | Execute a tool (code runner, API, file write) |
| verify | Check a prior result for correctness or completeness |
| delegate | Spawn a sub-agent or hand off to a specialist |
| stop | Terminate the loop and return current state |
U(a | s_t) = Gain(a | s_t)
- λ₁ · StepCost(a | s_t)
- λ₂ · Uncertainty(a | s_t)
- λ₃ · Redundancy(a | s_t)
| Parameter | Default | Rationale |
|---|---|---|
| λ₁ | 1.0 | Cost baseline; all other weights relative to this |
| λ₂ | 0.5 | Weak empirical correlation with outcome (r=0.0131) |
| λ₃ | 0.8 | Redundancy pruning yields ~10% token savings |
Utility range: [-2.3, 1.0]. Positive values indicate the action is worth taking. Values below the floor (-0.5 default) indicate the action should be skipped.
Stop the loop when any of the following is true:
stopstop actions score below the floor (default: -0.5)High-gain override: If Gain >= 0.7 for any action, condition
(c) may be overridden.
Document the override and the gain value in your reasoning trace.
Minimal 4-step advisory pattern:
modules/state-builder.mdA per
modules/action-selector.mdU(a | s_t), subject to termination conditionsmodules/state-builder.md, how to
populate s_t from task contextmodules/gain.md, estimating expected information
or progress gainmodules/step-cost.md, token, latency, and
monetary cost tablesmodules/uncertainty.md, confidence
estimation and calibrationmodules/redundancy.md, detecting duplicate
or low-delta actionsmodules/action-selector.md, scoring
loop and tie-breaking rulesmodules/integration.md, wiring utility
scoring into existing orchestration loops