com um clique
aiq-deploy
Use when asked to install, deploy, run, validate, troubleshoot, or stop NVIDIA AI-Q Blueprint infrastructure.
Menu
Use when asked to install, deploy, run, validate, troubleshoot, or stop NVIDIA AI-Q Blueprint infrastructure.
Comprehensive chezmoi dotfiles management expertise including templates, cross-platform configuration, file naming conventions, and troubleshooting. Covers source directory management, reproducible environment setup, and chezmoi templating with Go templates. Use when user mentions chezmoi, dotfiles, cross-platform config, chezmoi apply, chezmoi diff, .chezmoidata, or managing configuration files across machines.
Spin up a working session by surfacing the project's open threads. Pulls the taskwarrior queue for the inferred project, the unchecked Todos from the most recent FVH daily note when work is FVH-scoped, and recent git state (current branch, uncommitted changes, unpushed commits, open PRs). Read-only — does not write to taskwarrior or the vault. Use when the user says "spin up", "session start", "pick up where I left off", "what was I doing", or invokes /session-spinup. May also be triggered by a SessionStart hook nudge — when an FVH-scoped session opens with pending threads, the hook injects a one-time reminder to offer this skill.
Use when asked to run deep research or AI-Q research through a reachable NVIDIA AI-Q Blueprint backend.
Modern Neovim configuration expertise including Lua scripting, plugin management with lazy.nvim, LSP setup with Mason, AI integration with CodeCompanion, and workflow optimization. Covers keymaps, autocommands, and treesitter configuration. Use when user mentions Neovim, nvim, lazy.nvim, Mason, init.lua, Lua config, nvim plugins, or Neovim customization.
Obsidian Bases database feature for YAML-based interactive note views. Use when creating .base files, writing filter queries, building formulas, configuring table/card views, or working with Obsidian properties and frontmatter databases.
Wrap up a working session by capturing loose threads. Updates taskwarrior tasks for the current project (done / annotate / add) and appends genuine follow-ups to the FVH Obsidian daily note when the work is FVH-scoped. Filters aggressively for signal — only logs items that won't resurface on their own. Use when the user says "wrap up", "session wrap", "wrap the session", "I'm done for now", or invokes /session-wrap. May also be triggered by a Stop-hook nudge — when the user signals wind-down on an FVH-scoped session, the hook injects a one-time reminder to offer this skill.
| name | aiq-deploy |
| description | Use when asked to install, deploy, run, validate, troubleshoot, or stop NVIDIA AI-Q Blueprint infrastructure. |
| license | Apache-2.0 |
| compatibility | Designed for Claude Code, OpenCode, Codex, and Agent Skills-compatible tools. Requires Git, network access to GitHub, and one selected runtime path: Docker Compose v2 for the default local deployment, Python 3.11+ and uv for local process or CLI mode, Node.js 20+ and npm for local web UI mode, or kubectl 1.28+ and Helm 3.12+ for Kubernetes and Helm mode. |
| metadata | {"version":"2.1.0","author":"NVIDIA AI-Q Blueprint Team <aiq-blueprint@nvidia.com>","github-url":"https://github.com/NVIDIA-AI-Blueprints/aiq","tags":["nvidia","aiq","blueprint","deploy","operations","agent-skills"]} |
| allowed-tools | Read Bash |
Use this skill to get a local or self-hosted NVIDIA AI-Q Blueprint server running and verified for use by
aiq-research.
This skill owns setup, deployment, operational checks, troubleshooting, and shutdown. It does not run deep
research itself. After deployment is healthy, hand off the verified server URL to aiq-research.
The workflow stays explicit so deployment validation and handoff are repeatable across supported agent clients.
Users need:
https://github.com/NVIDIA-AI-Blueprints/aiq.uv for local process or CLI mode.npm for local browser UI development mode.kubectl 1.28+, Helm 3.12+, and access to a Kubernetes cluster for Helm mode.NVIDIA_API_KEY; web research requires at least
one supported search provider key such as TAVILY_API_KEY, SERPER_API_KEY, or EXA_API_KEY.3000. Self-hosted model or RAG deployments may require GPU resources.Before writing secrets, verify deploy/.env is ignored:
git check-ignore deploy/.env
Expected output: deploy/.env or a matching ignore rule. If it is not ignored, stop and fix the ignore rule before
placing credentials in the file.
deploy/.env without overwriting user secrets.AIQ_SERVER_URL for aiq-research.If no AI-Q checkout exists, read references/locate-or-clone.md before cloning. In an existing checkout, confirm the
required files:
pwd
test -f pyproject.toml
test -f deploy/.env.example
test -d configs
Expected output: pwd prints the AI-Q repository path; the test commands exit with status 0 and no output.
If the user asks to install, deploy, set up, or run AI-Q without naming a mode, ask:
How do you want to run AI-Q?
1. Skill backend - backend-only service for aiq-research w/o browser UI.
2. CLI - interactive terminal AI-Q.
3. UI - browser AI-Q app with backend and frontend.
4. Custom - choose an existing AI-Q config or review advanced customization docs before deployment.
Wait for the user's answer before starting services.
Do not ask this question when the user already specified a mode, such as Docker Compose, Helm, UI, CLI, or Agent Skill
backend. Do not ask the full mode question when aiq-research routed here because a deep research request needs a
backend. In that case, prefer Agent Skill backend and ask only for permission to start it if needed.
Read references/env-and-secrets.md before changing deploy/.env.
if [ ! -f deploy/.env ]; then
cp deploy/.env.example deploy/.env
echo "created deploy/.env from deploy/.env.example"
fi
Expected output when the file is missing: created deploy/.env from deploy/.env.example. Expected output when the file
already exists: no output, and the existing file is preserved.
Never print secret values. If credentials are missing, ask the user to update deploy/.env; do not ask them to paste
secret values into chat.
Match the user request, then read the referenced file before acting:
| User Intent | Reference |
|---|---|
| No AI-Q checkout exists, install AIQ, clone AIQ, locate repo | references/locate-or-clone.md |
Configure environment, check API keys, inspect .env | references/env-and-secrets.md |
Choose an AI-Q workflow config, understand config files, set BACKEND_CONFIG or CONFIG_FILE | references/configs.md |
Backend-only local server for aiq-research, AIQ as an Agent Skill | references/skill-backend.md |
| Terminal assistant, CLI-only run, no web UI | references/terminal-cli.md |
| Quick local development run, start UI/backend without containers | references/local-web.md |
| Default durable local deployment, Docker Compose, containers, PostgreSQL | references/docker-compose.md |
| Kubernetes, Helm, cluster deployment | references/kubernetes-helm.md |
| Foundational RAG / FRAG integration | references/frag.md |
Basic health checks, shallow smoke checks, handoff to aiq-research | references/validation.md |
| Optional deep research completion validation | references/end-to-end-validation.md |
| Logs, unhealthy services, port conflicts, config failures | references/troubleshooting.md |
| Stop services, restart, rebuild, safe cleanup | references/shutdown.md |
After startup, read references/validation.md and run the appropriate checks for the selected mode. For the default
local backend, verify health:
curl -sf http://localhost:8000/health
Expected output: a successful JSON health response or an empty successful response depending on the server build. If the
command fails, read references/troubleshooting.md and diagnose before claiming the backend is ready.
aiq-research needs a reachable AI-Q server URL. If the backend is on the default port, no extra configuration is
needed:
AIQ_SERVER_URL=http://localhost:8000
If the backend runs elsewhere, tell the user to set:
export AIQ_SERVER_URL="http://localhost:<PORT>"
Do not continue into deep research or deep research completion validation unless the user asks for it or confirms the post-deploy validation prompt. This skill's success criterion is a deployed and basically validated server, not report generation quality.
IMPORTANT: This skill is designed for NVIDIA AI-Q Blueprint version 2.1.0.
Semantic Versioning Compatibility Rules:
Skill version: X.Y.Z
Blueprint version: A.B.C
Compatible IF:
1. A == X (Major versions MUST match)
2. B >= Y (Minor version must be equal or greater)
3. C can be anything (Patch version does not affect compatibility)
Examples:
If your Blueprint version is not compatible:
deploy/.env or environment variables, not in chat transcripts, shell history, committed files,
or example commands.deploy/.env when it already exists.down -v.RAG_SERVER_URL and RAG_INGEST_URL are configured and reachable.test -f deploy/.env || cp deploy/.env.example deploy/.env
git check-ignore deploy/.env
cd deploy/compose
BUILD_TARGET=release docker compose --env-file ../.env -f docker-compose.yaml config --quiet
BUILD_TARGET=release docker compose --env-file ../.env -f docker-compose.yaml up -d --build aiq-agent
curl -sf http://localhost:8000/health
Expected output:
deploy/.env
<docker compose starts aiq-agent and dependencies>
<health endpoint returns a successful response>
If Docker, ports, credentials, or health checks fail, read references/troubleshooting.md before retrying.
export AIQ_SERVER_URL="http://localhost:8100"
curl -sf "$AIQ_SERVER_URL/health"
Expected output: a successful health response. Then tell the user to keep AIQ_SERVER_URL set before invoking
aiq-research.
| Topic | Documentation |
|---|---|
| Locate or clone AI-Q | references/locate-or-clone.md |
| Environment and secrets | references/env-and-secrets.md |
| Workflow configs | references/configs.md |
| Agent Skill backend | references/skill-backend.md |
| CLI deployment | references/terminal-cli.md |
| Local web deployment | references/local-web.md |
| Docker Compose deployment | references/docker-compose.md |
| Kubernetes and Helm deployment | references/kubernetes-helm.md |
| FRAG integration | references/frag.md |
| Basic validation | references/validation.md |
| End-to-end validation | references/end-to-end-validation.md |
| Troubleshooting | references/troubleshooting.md |
| Shutdown and cleanup | references/shutdown.md |
Symptoms:
8000.curl -sf http://localhost:8000/health reaches an unexpected service or fails.Causes:
PORT in deploy/.env conflicts with an existing process.Solutions:
lsof -nP -iTCP:8000 -sTCP:LISTEN
deploy/.env, such as
PORT=8100.curl -sf http://localhost:8100/health
Symptoms:
Causes:
NVIDIA_API_KEY is missing or empty.Solutions:
references/env-and-secrets.md.deploy/.env; do not ask them to paste secrets into chat.references/validation.md after the user updates credentials.Symptoms:
/health succeeds, but /chat or /v1/jobs/async/agents fails.aiq-research reports that async agents are unavailable.Causes:
BACKEND_CONFIG or CONFIG_FILE points at the wrong AI-Q config.Solutions:
references/configs.md and confirm the selected config is API-enabled.configs/config_web_default_llamaindex.yml.references/validation.md.Symptoms:
docker compose down -v.Causes:
down -v removes Docker volumes.Solutions:
references/shutdown.md.