Jeden Skill in Manus ausführen
mit einem Klick

Jeden Skill in Manus mit einem Klick ausführen

ai-tools-platform-stinger

Sterne66

Forks24

Aktualisiert23. Juni 2026 um 16:32

The vibe coder's AI toolbox — AI gateways (Portkey, OpenRouter), cloud providers (Bedrock, Vertex AI), frontier model selection (Claude, GPT, Gemini), cheap-fallback routes (Haiku, Mini, Flash), local LLMs (Ollama, LM Studio), GPU cloud (Runpod, Modal, Together, Fireworks), and must-have MCPs and IDE plugins. Use when the user says "which AI provider should I use", "set up Portkey", "Ollama for local dev", "Runpod vs Modal", "which MCP servers do I need", or asks to optimize AI spend. Do NOT use for cognitive-layer architecture (mind-worker-bee), API key security (security-worker-bee), or PRD authorship (library-worker-bee).

Installation

Mit Codex oder Claude installieren Kopieren Sie diesen Prompt, fügen Sie ihn in Codex, Claude oder einen anderen Assistant ein und lassen Sie die Skill-Seite prüfen und installieren.

In Manus ausführen

Quelle

legioncodeinc

legioncodeinc/that-git-life

GitHub-Repository öffnen Creator-Repositorys ansehen

Download

In Manus ausführen

Datei-Explorer

25 Dateien

SKILL.md

readonly

name

ai-tools-platform-stinger

description

ai-tools-platform Stinger

You are the playbook for ai-tools-platform-worker-bee. Every invocation produces one concrete artifact: a recommendation, a comparison matrix, a configuration snippet, or a setup guide. Every claim is backed by the research in research/.

Invocation modes (routing table)

Read the user's request and match to one mode. Most requests match one primary mode with one supporting mode.

Mode	Trigger phrases	Primary guide
`gateway-setup`	"set up Portkey", "configure OpenRouter", "AI gateway", "virtual keys", "budget cap on LLM spend"	`guides/01-ai-gateways.md`
`provider-selection`	"Bedrock vs Vertex", "which cloud AI provider", "Azure OpenAI", "enterprise AI", "private VPC AI"	`guides/02-cloud-providers.md`
`model-selection`	"which model should I use", "Claude vs GPT vs Gemini", "best model for code", "context window comparison"	`guides/03-model-selection.md`
`cost-optimization`	"LLM spend too high", "prompt caching", "batch API", "cheap model fallback", "token cost"	`guides/04-cost-optimization.md`
`local-llm-workflow`	"Ollama", "LM Studio", "local LLM", "offline dev", "privacy-first AI", "llama.cpp"	`guides/05-local-llms.md`
`gpu-cloud-selection`	"Runpod", "Modal", "Together AI", "Fireworks", "Groq", "GPU inference", "serverless GPU"	`guides/06-gpu-cloud.md`
`mcp-plugin-setup`	"MCP server", "which MCPs", "IDE plugin", "Cursor plugin", "tool use setup", "agent toolbox"	`guides/07-mcp-and-ide-plugins.md`

First action on every invocation

Read guides/00-principles.md — the non-negotiables that govern every output.
Match the request to the routing table above.
Open the relevant guide(s) before producing any output.

Folder layout

ai-tools-platform-stinger/
├── SKILL.md                         (this file — master index)
├── guides/
│   ├── 00-principles.md             (non-negotiables: pricing, privacy, fallback discipline)
│   ├── 01-ai-gateways.md            (Portkey vs OpenRouter vs LiteLLM; virtual keys; fallback chains)
│   ├── 02-cloud-providers.md        (Bedrock vs Vertex AI vs Azure OpenAI vs direct; when to use each)
│   ├── 03-model-selection.md        (2026 frontier landscape; capability tiers; cheap-fallback table)
│   ├── 04-cost-optimization.md      (prompt caching; batch API; tiering strategy; spend telemetry)
│   ├── 05-local-llms.md             (Ollama; LM Studio; llama.cpp; model selection; OpenAI-compat wiring)
│   ├── 06-gpu-cloud.md              (Runpod vs Modal vs Together vs Fireworks vs Groq; price table)
│   └── 07-mcp-and-ide-plugins.md    (must-have MCPs; Cursor plugin setup; IDE extension picks)
├── examples/
│   ├── gateway-setup-portkey.md     (Portkey virtual keys + fallback + budget cap end-to-end)
│   ├── model-selection-matrix.md    (filled-in comparison for a SaaS product)
│   └── local-llm-vibe-coding-workflow.md  (Ollama + Cursor offline workflow)
├── templates/
│   ├── provider-comparison.md       (canonical comparison table skeleton)
│   └── cost-estimate.md             (monthly cost estimate sheet)
├── reports/
│   └── README.md                    (describes how past recommendation reports accumulate)
└── research/
    ├── research-plan.md
    ├── research-summary.md
    ├── index.md
    ├── internal/
    │   └── command-brief-notes.md
    └── external/
        ├── portkey-openrouter-gateways.md
        ├── aws-bedrock-vertex-azure-comparison.md
        ├── frontier-model-landscape-2026.md
        ├── gpu-cloud-inference-vendors.md
        ├── ollama-local-llm-workflows.md
        └── mcp-servers-ide-plugins-2026.md

Canonical stack defaults

These are the recommended defaults. Deviating requires explicit rationale.

Decision	Recommended default	Rationale
AI gateway	Portkey	Unified virtual keys, budget caps, fallback routing, observability; OpenRouter preferred when pure model routing with no ops overhead needed
Primary frontier model (capability)	Claude 3.7 Sonnet / Opus or GPT-4.1	Top-tier reasoning, long context; choose by use case (see `guides/03-model-selection.md`)
Cheap fallback (cloud)	Claude Haiku 3.5 or Gemini 2.0 Flash	Sub-cent per 1K tokens; fast; adequate for classification, summarization, simple generation
Local LLM runtime	Ollama	Easiest setup; OpenAI-compatible REST; cross-platform; large model library
Local model (8B class)	Llama 3.1 8B / 3.2 3B or Gemma 3 9B	Best quality-per-GB in the 4-bit quantized range
GPU cloud (serverless)	Modal	Best developer experience; container caching; Python-native; pay-per-second
GPU cloud (persistent)	Runpod	Lowest price-per-GPU-hour; good for always-on inference
Fast inference (Llama)	Groq	Sub-100ms latency for Llama 3.1 70B; free tier available
MCP toolbox	See `guides/07-mcp-and-ide-plugins.md`	Context-dependent; filesystem + Supabase + GitHub are near-universal

Severity rubric

Used to classify findings when auditing an existing AI tooling stack.

Must-fix: No fallback model configured (single point of failure); API keys committed to code; no spend cap on gateway; PII sent to a provider without a DPA.
Should-refactor: Using a frontier model for tasks a cheap model handles adequately; no prompt caching on repeated system prompts; local-capable workloads running on expensive cloud inference.
Style / nice-to-have: Observability dashboard not configured; no cost attribution per feature; MCP server count excessive for the project size.

Cross-Bee handoffs

Surface these explicitly rather than attempting them inline:

security-worker-bee — for API key vault strategy, PII audit in prompts, DPA compliance verification, model provider's data-retention policies.
mind-worker-bee — for cognitive-layer architecture: RAG pipeline design, prompt cascade, three-tier memory, evaluation, coach routing. This Bee picks the providers; mind-worker-bee decides how to use them architecturally.
devops-worker-bee — for Docker container setup for GPU cloud deploys, CI/CD wiring for model inference services, secret injection from environment.
library-worker-bee — for PRD authorship when a new AI tooling decision needs to be documented as a feature requirement.

Mehr aus diesem Repository

gleiches Repository

adr-writing-stinger

legioncodeinc/that-git-life

Architecture Decision Records specialist covering Nygard format (Context / Decision / Consequences), MADR extended template, Y-statement framing, supersession and deprecation lifecycle, Log4brains and adr-tools CLI integration, and the "decisions, not docs" philosophy. Use when authoring a new ADR, superseding an existing decision, auditing the ADR log, setting up Log4brains, or onboarding a team to ADR practice. Do NOT use for general knowledge-base authoring (library-worker-bee), code entity extraction (wiki-worker-bee), or security review of the decisions themselves (security-worker-bee).

2026-06-2366

affiliate-referral-program-stinger

legioncodeinc/that-git-life

Affiliate and referral program specialist for SaaS products -- platform selection (Rewardful, FirstPromoter, Tolt, PartnerStack, Impact, Refersion), the affiliate-vs-referral distinction, cookie-based and server-side attribution (post-ITP, post-3PC era), payout automation, fraud detection (self-referral, cookie stuffing, velocity fraud), and EPC/LTV program economics. Invoke when the user says "set up an affiliate program", "which affiliate platform should I use", "Rewardful vs FirstPromoter", "my attribution is broken in Safari", "referral program fraud", "EPC or LTV for our program", "20% recurring commission", "postback tracking setup", or "PartnerStack vs FirstPromoter". Do NOT invoke for Stripe subscription billing mechanics (payments-worker-bee), API key secret management (security-worker-bee), custom attribution DB schema (db-worker-bee), or outbound partner recruitment campaigns (cold-outreach-worker-bee).

2026-06-2366

agile-scrum-stinger

legioncodeinc/that-git-life

Scrum methodology specialist — sprints, ceremonies (Sprint Planning, Daily Scrum, Sprint Review, Retrospective, Backlog Refinement), roles (Scrum Master, PO, Developers), estimation (Fibonacci, Planning Poker,

2026-06-2366

ai-coding-tools-stinger

legioncodeinc/that-git-life

The vibe-coder's AI coding tool advisor — Cursor, Claude Code, Aider, Cline, Windsurf/Cascade, Continue.dev, Replit Agent, Devin, and Bolt. Covers tool-tier classification, selection rubric (autonomy, context, budget, language), SWE-bench benchmark data, model-routing patterns (including Aider's 3-5x cost-reducing architect/editor split), per-tool prompt and context discipline, and a documented footgun catalog. Use when the user says "which AI coding tool should I use", "Cursor vs Claude Code vs Aider", "set up Aider", "Cline keeps breaking", "is Devin worth it", "which tool for autonomous tasks", "how do I reduce AI coding costs", "prompt discipline for Aider/Claude Code/Cline", or any question comparing or configuring AI-assisted development tools. Cross-links to cursor-ide-worker-bee for deep Cursor IDE config and ai-tools-platform-worker-bee for LLM provider/gateway decisions. Do NOT use for Cursor IDE configuration (cursor-ide-worker-bee owns that), CI/CD pipelines that run agents (devops-worker-bee), or

2026-06-2366

alt-ads-platforms-stinger

legioncodeinc/that-git-life

Paid acquisition specialist for alternative ad platforms beyond Meta and Google Search — LinkedIn Ads (B2B), TikTok Ads + CAPI, Reddit Ads, Microsoft/Bing Ads with LinkedIn Profile Targeting, Pinterest Ads, Quora Ads, YouTube standalone video, Spotify Ad Studio + podcast advertising, and the channel-fit-by-ICP heuristic that selects among them. Invoke when the user says "which ad platform should I use besides Meta/Google", "set up LinkedIn Ads for B2B SaaS", "TikTok CAPI setup", "Reddit Ads for technical buyers", "Microsoft Ads LinkedIn targeting", "podcast advertising Spotify Ad Studio", "channel diversification paid acquisition", or "our CPL on Meta is too high, what else should we try". Do NOT invoke for Meta/Facebook/Instagram Ads (no peer Bee — handle inline), Google Search Ads (no peer Bee — handle inline), or organic social strategy (route to social-media-marketing-organic-worker-bee).

2026-06-2366

api-docs-stinger

legioncodeinc/that-git-life

API documentation authority — Swagger UI / Redoc / Scalar / Mintlify / Stoplight / Bump.sh tool selection, OpenAPI spec enrichment with JSON examples, self-hosted and managed hosting, SDK generation (TypeScript / Python / Go via openapi-generator-cli and Fern/Speakeasy), and changelog discipline. Invoke when the user says "set up API docs", "which docs renderer should I use", "generate an SDK from my spec", "deploy my OpenAPI docs", "write an API changelog", "compare Redoc vs Scalar", or "publish API reference to GitHub Pages". Do NOT invoke for general documentation sites (library-worker-bee), API security scheme audits (security-worker-bee), or backend route design (python-worker-bee / react-worker-bee).

2026-06-2366

name

ai-tools-platform-stinger

description

ai-tools-platform Stinger

Invocation modes (routing table)

Read the user's request and match to one mode. Most requests match one primary mode with one supporting mode.

Mode	Trigger phrases	Primary guide
`gateway-setup`	"set up Portkey", "configure OpenRouter", "AI gateway", "virtual keys", "budget cap on LLM spend"	`guides/01-ai-gateways.md`
`provider-selection`	"Bedrock vs Vertex", "which cloud AI provider", "Azure OpenAI", "enterprise AI", "private VPC AI"	`guides/02-cloud-providers.md`
`model-selection`	"which model should I use", "Claude vs GPT vs Gemini", "best model for code", "context window comparison"	`guides/03-model-selection.md`
`cost-optimization`	"LLM spend too high", "prompt caching", "batch API", "cheap model fallback", "token cost"	`guides/04-cost-optimization.md`
`local-llm-workflow`	"Ollama", "LM Studio", "local LLM", "offline dev", "privacy-first AI", "llama.cpp"	`guides/05-local-llms.md`
`gpu-cloud-selection`	"Runpod", "Modal", "Together AI", "Fireworks", "Groq", "GPU inference", "serverless GPU"	`guides/06-gpu-cloud.md`
`mcp-plugin-setup`	"MCP server", "which MCPs", "IDE plugin", "Cursor plugin", "tool use setup", "agent toolbox"	`guides/07-mcp-and-ide-plugins.md`

First action on every invocation

Read guides/00-principles.md — the non-negotiables that govern every output.
Match the request to the routing table above.
Open the relevant guide(s) before producing any output.

Folder layout

ai-tools-platform-stinger/
├── SKILL.md                         (this file — master index)
├── guides/
│   ├── 00-principles.md             (non-negotiables: pricing, privacy, fallback discipline)
│   ├── 01-ai-gateways.md            (Portkey vs OpenRouter vs LiteLLM; virtual keys; fallback chains)
│   ├── 02-cloud-providers.md        (Bedrock vs Vertex AI vs Azure OpenAI vs direct; when to use each)
│   ├── 03-model-selection.md        (2026 frontier landscape; capability tiers; cheap-fallback table)
│   ├── 04-cost-optimization.md      (prompt caching; batch API; tiering strategy; spend telemetry)
│   ├── 05-local-llms.md             (Ollama; LM Studio; llama.cpp; model selection; OpenAI-compat wiring)
│   ├── 06-gpu-cloud.md              (Runpod vs Modal vs Together vs Fireworks vs Groq; price table)
│   └── 07-mcp-and-ide-plugins.md    (must-have MCPs; Cursor plugin setup; IDE extension picks)
├── examples/
│   ├── gateway-setup-portkey.md     (Portkey virtual keys + fallback + budget cap end-to-end)
│   ├── model-selection-matrix.md    (filled-in comparison for a SaaS product)
│   └── local-llm-vibe-coding-workflow.md  (Ollama + Cursor offline workflow)
├── templates/
│   ├── provider-comparison.md       (canonical comparison table skeleton)
│   └── cost-estimate.md             (monthly cost estimate sheet)
├── reports/
│   └── README.md                    (describes how past recommendation reports accumulate)
└── research/
    ├── research-plan.md
    ├── research-summary.md
    ├── index.md
    ├── internal/
    │   └── command-brief-notes.md
    └── external/
        ├── portkey-openrouter-gateways.md
        ├── aws-bedrock-vertex-azure-comparison.md
        ├── frontier-model-landscape-2026.md
        ├── gpu-cloud-inference-vendors.md
        ├── ollama-local-llm-workflows.md
        └── mcp-servers-ide-plugins-2026.md

Canonical stack defaults

These are the recommended defaults. Deviating requires explicit rationale.

Decision	Recommended default	Rationale
AI gateway	Portkey	Unified virtual keys, budget caps, fallback routing, observability; OpenRouter preferred when pure model routing with no ops overhead needed
Primary frontier model (capability)	Claude 3.7 Sonnet / Opus or GPT-4.1	Top-tier reasoning, long context; choose by use case (see `guides/03-model-selection.md`)
Cheap fallback (cloud)	Claude Haiku 3.5 or Gemini 2.0 Flash	Sub-cent per 1K tokens; fast; adequate for classification, summarization, simple generation
Local LLM runtime	Ollama	Easiest setup; OpenAI-compatible REST; cross-platform; large model library
Local model (8B class)	Llama 3.1 8B / 3.2 3B or Gemma 3 9B	Best quality-per-GB in the 4-bit quantized range
GPU cloud (serverless)	Modal	Best developer experience; container caching; Python-native; pay-per-second
GPU cloud (persistent)	Runpod	Lowest price-per-GPU-hour; good for always-on inference
Fast inference (Llama)	Groq	Sub-100ms latency for Llama 3.1 70B; free tier available
MCP toolbox	See `guides/07-mcp-and-ide-plugins.md`	Context-dependent; filesystem + Supabase + GitHub are near-universal

Severity rubric

Used to classify findings when auditing an existing AI tooling stack.

Must-fix: No fallback model configured (single point of failure); API keys committed to code; no spend cap on gateway; PII sent to a provider without a DPA.
Should-refactor: Using a frontier model for tasks a cheap model handles adequately; no prompt caching on repeated system prompts; local-capable workloads running on expensive cloud inference.
Style / nice-to-have: Observability dashboard not configured; no cost attribution per feature; MCP server count excessive for the project size.

Cross-Bee handoffs

Surface these explicitly rather than attempting them inline:

security-worker-bee — for API key vault strategy, PII audit in prompts, DPA compliance verification, model provider's data-retention policies.
mind-worker-bee — for cognitive-layer architecture: RAG pipeline design, prompt cascade, three-tier memory, evaluation, coach routing. This Bee picks the providers; mind-worker-bee decides how to use them architecturally.
devops-worker-bee — for Docker container setup for GPU cloud deploys, CI/CD wiring for model inference services, secret injection from environment.
library-worker-bee — for PRD authorship when a new AI tooling decision needs to be documented as a feature requirement.