Exécutez n'importe quel Skill dans Manus
en un clic

Exécutez n'importe quel Skill dans Manus en un clic

ai-architecture-spec

Generate the AI Architecture Specification: RAG vs fine-tune vs agent decisions, model gateway, vector store, eval harness, observability, security boundaries, and the SaaS-specific multi-tenant AI plane that the generic HLD does not capture.

Exécuter dans Manus

Aperçu

Commande d'installation

npx skills add https://github.com/peterbamuhigire/srs-skills --skill ai-architecture-spec

Copiez et collez cette commande dans Claude Code pour installer le skill

Source

peterbamuhigire/srs-skills

Étoiles3

Forks1

Mis à jour11 mai 2026 à 15:29

Explorateur de fichiers

6 fichiers

SKILL.md

readonly

name	ai-architecture-spec
description	Generate the AI Architecture Specification: RAG vs fine-tune vs agent decisions, model gateway, vector store, eval harness, observability, security boundaries, and the SaaS-specific multi-tenant AI plane that the generic HLD does not capture.
metadata	{"use_when":"Use when one or more AI features ship in a SaaS product. Required alongside or after the generic HLD.","do_not_use_when":"Do not use for projects with no AI features.","required_inputs":"HLD.md, Multi_Tenancy_Architecture_Spec.md, AI_Feature_PRD_Spec.md, AI_Data_And_Knowledge_Base_Spec.md, tech_stack.md.","workflow":"Read inputs, declare the AI plane decomposition, map each feature to a pattern (RAG / agent / fine-tune / direct call), spec the model gateway, vector store, eval harness, observability, security boundaries, emit ADR seeds, write the AI_Architecture_Spec.md.","quality_standards":"Every AI feature shall map to a pattern with explicit drivers. The model gateway shall be the sole egress for model calls. Every cross-tenant boundary shall name its enforcement mechanism.","anti_patterns":"Do not let individual services call model providers directly. Do not store conversation logs in the same store as customer documents without isolation. Do not omit the eval harness from the architecture.","outputs":"AI_Architecture_Spec.md plus ADR seeds in adr-seeds/.","references":"Use references/ai-architecture-spec-template.md and references/ai-architecture-patterns.md."}

AI Architecture Spec Skill

Overview

The AI-distinctive architecture artefact. Sits alongside the multi-tenancy spec and the generic HLD. Captures the model gateway, vector store, eval harness, prompt registry, observability bus, and the multi-tenant AI security boundaries.

Core Instructions

Step 1: Read context

Read HLD, multi-tenancy spec, AI feature PRD spec, AI data spec. Identify in-scope AI features, models, patterns, and the tenant boundaries.

Step 2: Declare the AI plane

The AI plane is a sub-set of the application plane plus a small set of dedicated control-plane services:

Model Gateway (control plane) — single egress for model-provider calls; carries auth, tenant-id propagation, per-tenant rate limit, cost meter, request/response log, content-filter, fallback routing.
Prompt Registry (control plane) — versioned prompts, change-control, regression-test attachment.
Vector Store (application plane, per-tenant or namespaced) — embedding-backed retrieval.
Eval Harness Runner (control plane) — runs eval suites against new prompt/model versions in CI.
Observability Bus — token use, latency, fallback rate, abstention, citation rate, judge-LLM score, cost per tenant.

Diagram with Mermaid; place every AI service.

Step 3: Map each AI feature to a pattern

For each AI feature select the pattern:

Pattern	When	Components
Direct LLM call	input is self-contained, no external data	gateway + prompt + model
RAG	grounding in customer data	gateway + retrieval + reranker + prompt + model + citation post-processor
Agent	multi-step, tool-using, planned	gateway + planner + tool catalogue + executor + audit log + per-step approval UI
Fine-tune	repetitive narrow task, cost reduction	training pipeline + model artefact + eval suite + rollback artefact
Classical ML	structured prediction	feature store + model artefact + monitoring

State the verdict per feature with rejected alternatives.

Step 4: Specify the Model Gateway

The gateway is the sole egress to model providers. Capture:

Supported providers and models (primary + fallback per feature).
Authentication and credential rotation.
Tenant-id propagation as a guarded claim.
Per-tenant and per-feature rate limit and cost ceiling.
Request/response log retention.
Content-filter chain (input and output).
Fallback routing rule (model-down, cost-overrun, latency-overrun, content-filter-trip).
Idempotency keys for retries.

Step 5: Specify the Vector Store

For each retrieval index: store technology, partitioning model (per-tenant index / namespace / metadata-filter), embedding model + version, dimensions, ANN parameters, freshness, encryption posture, key management.

Step 6: Specify the Prompt Registry

Versioned, tagged, changes proposed via PR with regression eval attached. State the registry source-of-truth, deploy pipeline, rollback procedure.

Step 7: Specify the Eval Harness in architecture terms

The eval harness is a first-class production system, not a notebook. State: dataset store, judge-LLM, CI gate hook, scheduled regression, alerting on score drop.

Step 8: Specify observability

AI-specific signals: tokens in/out per request, model latency per provider, fallback rate, abstention rate, citation rate, judge-LLM score, cost per tenant per feature, content-filter trips, red-team alerts.

Step 9: Specify security boundaries

Prompt injection surface (untrusted text in retrieved docs, in user input, in tool outputs).
Sandboxing of tool execution.
Egress allow-list at the gateway.
Secrets handling: never in prompts; tool-side fetch via tenant-id claim.
Cross-tenant retrieval prohibition enforced at the gateway.

Step 10: Emit ADR seeds

ADR seeds: model choice per feature, RAG-vs-fine-tune, vector store choice, eval threshold, abstain policy, content filter, fallback policy.

Step 11: Write the spec

AI_Architecture_Spec.md sections: 1) AI Plane Diagram, 2) Feature-to-Pattern Map, 3) Model Gateway, 4) Vector Store, 5) Prompt Registry, 6) Eval Harness, 7) Observability, 8) Security Boundaries, 9) ADR Seed Index, 10) Traceability.

Standards

AWS Well-Architected ML/AI Lens
OWASP LLM Top 10
NIST AI RMF MAP / MEASURE
ISO/IEC 42001

Plus depuis ce dépôt

même dépôt

05-ux-specification

peterbamuhigire/srs-skills

Generate a comprehensive UX specification document covering information architecture, wireframing standards, design system documentation, usability testing protocols, and design handoff specs per ISO 9241-210 and ISO 25010.

2026-05-273

ai-agent-strategy-doc

peterbamuhigire/srs-skills

Generate the AI Agent Strategy Doc: when to use an agent vs a workflow or a single LLM call, agent capability ladder by pricing tier, autonomy-level taxonomy (suggest / approve-each / approve-batch / autonomous), proprietary action catalogue and tool-telemetry moat, and the agent-feature sequencing roadmap.

2026-05-113

ai-feature-prd-spec

peterbamuhigire/srs-skills

Generate the AI-Feature PRD Spec: IEEE 830-form requirements for every AI-powered feature, with hallucination tolerance, latency budget, $/call ceiling, abstain criteria, citation policy, consent and training-data exclusion clauses, and acceptance tests anchored to the eval harness.

2026-05-113

ai-agent-feature-prd-spec

peterbamuhigire/srs-skills

Generate the AI Agent Feature PRD Spec: IEEE 830-form requirements for every agentic feature, with task scope, autonomy level, action-catalogue summary, intervention triggers, success metrics, max-step / max-cost / wallclock budgets, abstain criteria, and irreversible-action gates anchored to the agent eval and red-team registries.

2026-05-113

ai-agent-action-catalogue-spec

peterbamuhigire/srs-skills

Generate the Action Catalogue Spec: the enumerated, schema-bound set of tools an agent may call. Every tool declares input/output schema, side-effect class, reversibility class, per-tier availability, audit fields, rate-limit class, and kill-switch behaviour. This is the contract between the planner, the dispatcher, and the operator.

2026-05-113

embedded-accounting-engine-srs

peterbamuhigire/srs-skills

Generate the SRS subsection for any system that handles money, inventory value, payroll, tax, grants, fees, payments, receivables, payables, fixed assets, or financial reporting. Specifies embedded accounting engine requirements: chart of accounts, mapping layer, LedgerPostingService, append-only journals, subledgers, accounting periods, reversals, reports, audit trail, IFRS/IFRS for SMEs/local tax context, and integrity invariants.

2026-05-113

Source

peterbamuhigire

peterbamuhigire/srs-skills

Ouvrir le dépôt GitHub Voir les dépôts du créateur

Commande d'installation

Téléchargement

Exécuter dans Manus

Utile pourSOC

Analystes des systèmes informatiquesProfessions informatiques et mathématiques15-1211L4

name	ai-architecture-spec
description	Generate the AI Architecture Specification: RAG vs fine-tune vs agent decisions, model gateway, vector store, eval harness, observability, security boundaries, and the SaaS-specific multi-tenant AI plane that the generic HLD does not capture.
metadata	{"use_when":"Use when one or more AI features ship in a SaaS product. Required alongside or after the generic HLD.","do_not_use_when":"Do not use for projects with no AI features.","required_inputs":"HLD.md, Multi_Tenancy_Architecture_Spec.md, AI_Feature_PRD_Spec.md, AI_Data_And_Knowledge_Base_Spec.md, tech_stack.md.","workflow":"Read inputs, declare the AI plane decomposition, map each feature to a pattern (RAG / agent / fine-tune / direct call), spec the model gateway, vector store, eval harness, observability, security boundaries, emit ADR seeds, write the AI_Architecture_Spec.md.","quality_standards":"Every AI feature shall map to a pattern with explicit drivers. The model gateway shall be the sole egress for model calls. Every cross-tenant boundary shall name its enforcement mechanism.","anti_patterns":"Do not let individual services call model providers directly. Do not store conversation logs in the same store as customer documents without isolation. Do not omit the eval harness from the architecture.","outputs":"AI_Architecture_Spec.md plus ADR seeds in adr-seeds/.","references":"Use references/ai-architecture-spec-template.md and references/ai-architecture-patterns.md."}

AI Architecture Spec Skill

Overview

Core Instructions

Step 1: Read context

Read HLD, multi-tenancy spec, AI feature PRD spec, AI data spec. Identify in-scope AI features, models, patterns, and the tenant boundaries.

Step 2: Declare the AI plane

The AI plane is a sub-set of the application plane plus a small set of dedicated control-plane services:

Model Gateway (control plane) — single egress for model-provider calls; carries auth, tenant-id propagation, per-tenant rate limit, cost meter, request/response log, content-filter, fallback routing.
Prompt Registry (control plane) — versioned prompts, change-control, regression-test attachment.
Vector Store (application plane, per-tenant or namespaced) — embedding-backed retrieval.
Eval Harness Runner (control plane) — runs eval suites against new prompt/model versions in CI.
Observability Bus — token use, latency, fallback rate, abstention, citation rate, judge-LLM score, cost per tenant.

Diagram with Mermaid; place every AI service.

Step 3: Map each AI feature to a pattern

For each AI feature select the pattern:

Pattern	When	Components
Direct LLM call	input is self-contained, no external data	gateway + prompt + model
RAG	grounding in customer data	gateway + retrieval + reranker + prompt + model + citation post-processor
Agent	multi-step, tool-using, planned	gateway + planner + tool catalogue + executor + audit log + per-step approval UI
Fine-tune	repetitive narrow task, cost reduction	training pipeline + model artefact + eval suite + rollback artefact
Classical ML	structured prediction	feature store + model artefact + monitoring

State the verdict per feature with rejected alternatives.

Step 4: Specify the Model Gateway

The gateway is the sole egress to model providers. Capture:

Supported providers and models (primary + fallback per feature).
Authentication and credential rotation.
Tenant-id propagation as a guarded claim.
Per-tenant and per-feature rate limit and cost ceiling.
Request/response log retention.
Content-filter chain (input and output).
Fallback routing rule (model-down, cost-overrun, latency-overrun, content-filter-trip).
Idempotency keys for retries.

Step 5: Specify the Vector Store

Step 6: Specify the Prompt Registry

Versioned, tagged, changes proposed via PR with regression eval attached. State the registry source-of-truth, deploy pipeline, rollback procedure.

Step 7: Specify the Eval Harness in architecture terms

The eval harness is a first-class production system, not a notebook. State: dataset store, judge-LLM, CI gate hook, scheduled regression, alerting on score drop.

Step 8: Specify observability

Step 9: Specify security boundaries

Prompt injection surface (untrusted text in retrieved docs, in user input, in tool outputs).
Sandboxing of tool execution.
Egress allow-list at the gateway.
Secrets handling: never in prompts; tool-side fetch via tenant-id claim.
Cross-tenant retrieval prohibition enforced at the gateway.

Step 10: Emit ADR seeds

ADR seeds: model choice per feature, RAG-vs-fine-tune, vector store choice, eval threshold, abstain policy, content filter, fallback policy.

Step 11: Write the spec

Standards

AWS Well-Architected ML/AI Lens
OWASP LLM Top 10
NIST AI RMF MAP / MEASURE
ISO/IEC 42001