cost-optimization

Review and optimize LLM API costs in the orchestrator. Analyze model routing, budget tracking, prompt caching, and retry logic for cost efficiency.

Install command

npx skills add https://github.com/pjcau/claude-kit --skill cost-optimization

Copy this command and paste it into Claude Code to install the skill.

Source

pjcau/claude-kit

Stars: 0

Forks: 0

Updated: March 24, 2026 at 10:41

SKILL.md


name	cost-optimization
description	Review and optimize LLM API costs in the orchestrator. Analyze model routing, budget tracking, prompt caching, and retry logic for cost efficiency.
allowed-tools	Read, Grep, Glob, Bash

Cost Optimization — LLM Pipeline Efficiency

Patterns for controlling LLM API costs. Adapted from flatrick/everything-claude-code cost-aware-llm-pipeline.

When to Use

Reviewing or optimizing provider cost configuration
Adding new models whose pricing must be set correctly
Investigating unexpected cost spikes
Planning batch processing jobs

Review Checklist

1. Model Pricing Accuracy

# Check all model pricing in provider catalogs
grep -n "input_cost\|output_cost" src/agent_orchestrator/providers/openrouter.py

Verify that the catalog prices match the rates currently published by OpenRouter or the underlying provider.
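To see why a stale price matters, the per-request cost can be sketched as below. This is a minimal illustration, not the orchestrator's actual code: the `ModelPricing` class and `estimate_cost` function are hypothetical, and the field names `input_cost` and `output_cost` are borrowed from the grep pattern above on the assumption that they hold USD per million tokens.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelPricing:
    """Hypothetical catalog entry; units are an assumption (USD per 1M tokens)."""
    input_cost: float
    output_cost: float


def estimate_cost(pricing: ModelPricing, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request."""
    total = input_tokens * pricing.input_cost + output_tokens * pricing.output_cost
    return total / 1_000_000


# Example: 1M input tokens at $1/M plus 500k output tokens at $2/M.
example_cost = estimate_cost(ModelPricing(input_cost=1.0, output_cost=2.0), 1_000_000, 500_000)
```

If the catalog entry lags behind the provider's real rate, every budget and routing decision derived from `estimate_cost` is off by the same factor.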

2. Router Configuration

# Check complexity classifier thresholds
grep -n "THRESHOLD\|_KEYWORDS\|_PATTERNS" src/agent_orchestrator/core/router.py

Ensure:

Low-complexity tasks route to cheap/free models
Only high-complexity tasks use expensive models
Medium tasks use mid-tier pricing
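The routing rules above can be sketched as a small keyword classifier. Everything here is an assumption made for illustration: the keyword lists, the tier names, and the placeholder model names are hypothetical and do not come from `router.py`.

```python
# Hypothetical keyword lists; the real classifier may use thresholds or patterns.
HIGH_KEYWORDS = ("architecture", "design", "refactor")
MEDIUM_KEYWORDS = ("implement", "debug", "fix")

# Placeholder model names, cheapest first.
MODEL_BY_TIER = {
    "low": "cheap-model",
    "medium": "mid-model",
    "high": "premium-model",
}


def classify(task: str) -> str:
    """Return 'high', 'medium', or 'low' based on keywords in the task text."""
    text = task.lower()
    if any(keyword in text for keyword in HIGH_KEYWORDS):
        return "high"
    if any(keyword in text for keyword in MEDIUM_KEYWORDS):
        return "medium"
    return "low"  # default to the cheapest tier


def route(task: str) -> str:
    """Pick the model for a task according to its complexity tier."""
    return MODEL_BY_TIER[classify(task)]
```

Defaulting to the low tier means a task has to show evidence of complexity before it reaches an expensive model.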

3. Budget Enforcement

# Check budget settings
grep -rn "budget\|cost_budget" src/agent_orchestrator/core/

Verify:

Per-task budget limits are set
Session budget limits are set
Exceeding the budget triggers a graceful stop, not a crash
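A minimal sketch of the three checks above, assuming a hypothetical `BudgetTracker` with per-task and session limits in USD. None of these names are taken from the orchestrator. The caller catches the budget error and returns a partial result instead of letting it propagate.

```python
from dataclasses import dataclass


class BudgetExceeded(Exception):
    """Raised when a charge would push spending past a limit."""


@dataclass
class BudgetTracker:
    task_limit: float
    session_limit: float
    task_spent: float = 0.0
    session_spent: float = 0.0

    def start_task(self) -> None:
        """Reset the per-task counter; session spending keeps accumulating."""
        self.task_spent = 0.0

    def charge(self, cost: float) -> None:
        """Record a cost, refusing it if either limit would be exceeded."""
        if self.task_spent + cost > self.task_limit:
            raise BudgetExceeded("task budget exceeded")
        if self.session_spent + cost > self.session_limit:
            raise BudgetExceeded("session budget exceeded")
        self.task_spent += cost
        self.session_spent += cost


def run_steps(tracker: BudgetTracker, step_costs: list[float]) -> dict:
    """Run steps until done or out of budget; stop gracefully, never crash."""
    completed = 0
    for cost in step_costs:
        try:
            tracker.charge(cost)
        except BudgetExceeded as exc:
            return {"status": "stopped", "completed": completed, "reason": str(exc)}
        completed += 1
    return {"status": "ok", "completed": completed, "reason": ""}
```

Checking the limit before recording the charge keeps the tracker consistent: a refused step never shows up as spent money.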

4. Retry Cost Control

# Check retry configuration
grep -rn "max_retries\|retry\|fallback" src/agent_orchestrator/

Ensure:

Retries only on transient errors (429, 500, connection)
No retries on 400, 401, 403 (wastes budget on permanent failures)
Fallback chain prefers cheaper models
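The first two rules above can be sketched as a retry wrapper that inspects the error before spending more budget. The `ProviderError` class and `call_with_retry` helper are hypothetical stand-ins for whatever the provider client raises; only the status codes come from the checklist.

```python
import time

# Status codes from the checklist: retry these, never the permanent ones.
TRANSIENT_STATUS = {429, 500}
PERMANENT_STATUS = {400, 401, 403}


class ProviderError(Exception):
    """Hypothetical error carrying the HTTP status returned by a provider."""

    def __init__(self, status: int):
        super().__init__(f"provider returned {status}")
        self.status = status


def is_retryable(exc: Exception) -> bool:
    """Connection failures and transient statuses are worth another attempt."""
    if isinstance(exc, ConnectionError):
        return True
    return isinstance(exc, ProviderError) and exc.status in TRANSIENT_STATUS


def call_with_retry(call, max_retries: int = 3, base_delay: float = 0.5, sleep=time.sleep):
    """Invoke call(), retrying transient failures with exponential backoff."""
    for attempt in range(max_retries + 1):
        try:
            return call()
        except Exception as exc:
            # Permanent failures and exhausted retries propagate immediately.
            if not is_retryable(exc) or attempt == max_retries:
                raise
            sleep(base_delay * 2 ** attempt)
```

The `sleep` parameter is injectable so the backoff can be skipped in tests. A 401 fails after one call, while a 429 gets up to `max_retries` further attempts.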

5. Token Usage

# Check max_tokens defaults
grep -rn "max_tokens" src/agent_orchestrator/

Ensure max_tokens is not unnecessarily high for simple tasks.

Cost Tiers Reference

Tier	Use Case	Target Cost
Free	Simple lookups, formatting	$0/M tokens
Low	Summaries, translations	< $1/M output
Mid	Feature work, debugging	$1-5/M output
High	Architecture, deep analysis	$5+/M output
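The table can be turned into a small lookup for auditing a catalog. This is a sketch under one stated assumption: the table lists $5 in both the Mid and High rows, and the function below treats exactly $5/M as High. The function name is hypothetical.

```python
def cost_tier(output_cost_per_million: float) -> str:
    """Map an output price in USD per 1M tokens to a tier from the table."""
    if output_cost_per_million < 0:
        raise ValueError("price cannot be negative")
    if output_cost_per_million == 0:
        return "Free"
    if output_cost_per_million < 1:
        return "Low"
    if output_cost_per_million < 5:
        return "Mid"
    return "High"  # assumption: the $5 boundary counts as High
```

Running this over every catalog entry shows quickly whether a model sits in the tier the router expects.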

Anti-Patterns

Using expensive models for all requests regardless of complexity
Retrying on all errors (wastes budget on permanent failures)
No budget limits on batch operations
Hardcoded model names scattered through code (use catalog)
Ignoring prompt caching for repetitive system prompts
