Jeden Skill in Manus ausführen
mit einem Klick

Jeden Skill in Manus mit einem Klick ausführen

deploy

Sterne0

Forks0

Aktualisiert11. Februar 2026 um 00:09

Deploy agent-services to the infra VM. Use when shipping new code, restarting services, or rolling back a broken deploy. Covers pre-deploy snapshot, pull/build/restart, smoke tests, and rollback.

Installation

Mit Codex oder Claude installieren Kopieren Sie diesen Prompt, fügen Sie ihn in Codex, Claude oder einen anderen Assistant ein und lassen Sie die Skill-Seite prüfen und installieren.

In Manus ausführen

Quelle

hdresearch

hdresearch/vers-agent-services

GitHub-Repository öffnen Creator-Repositorys ansehen

Download

In Manus ausführen

Verwandte BerufeSOC

Basierend auf der SOC-Berufsklassifikation

Netzwerk- und ComputersystemadministratorenInformatik- und Mathematikberufe·SOC 15-1244

SKILL.md

readonly

name	deploy
description	Deploy agent-services to the infra VM. Use when shipping new code, restarting services, or rolling back a broken deploy. Covers pre-deploy snapshot, pull/build/restart, smoke tests, and rollback.

Deploy Runbook

Step-by-step procedure for deploying agent-services to the infra VM. Follow this exactly — skipping steps has caused real incidents (unauthenticated servers, unrecoverable state).

When to Use

Shipping new features — after a PR merges to main
Hotfixing a broken service — pull the fix, rebuild, restart
Rolling back — restore from pre-deploy snapshot
Restarting after a crash — same process minus the git pull

Prerequisites

You need:

The infra VM ID (check the registry or ask the orchestrator)
The VERS_AUTH_TOKEN value (check with the orchestrator — if lost, all agents need reconfiguration)
SSH access to the VM (root@{VM_ID}.vm.vers.sh)

Step 1: Snapshot the Infra VM

Do this FIRST. Every time. No exceptions.

Before touching anything, snapshot the current state so you can roll back:

# From the orchestrator or any agent with vers tools
vers_vm_commit --vmId {INFRA_VM_ID}
# → returns a commit ID, e.g., "commit-abc123"

Record the commit in the ledger (if the service is still running):

curl -X POST "http://{INFRA_VM_ID}.vm.vers.sh:3000/commits" \
  -H "Authorization: Bearer $VERS_AUTH_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "commitId": "commit-abc123",
    "vmId": "{INFRA_VM_ID}",
    "label": "pre-deploy-backup",
    "agent": "your-agent-name",
    "tags": ["backup", "infra", "pre-deploy"]
  }'

Save the commit ID. You will need it if the deploy goes wrong.

Step 2: SSH into the Infra VM

ssh -o StrictHostKeyChecking=no root@{INFRA_VM_ID}.vm.vers.sh

Step 3: Pull Latest Code

cd /root/workspace/vers-agent-services
git fetch origin
git checkout main
git reset --hard origin/main

Using reset --hard ensures a clean state — no merge conflicts, no stale local changes.

Step 4: Install and Build

npm install
npm run build

Watch for errors. If npm run build (TypeScript compilation) fails, do not proceed — the deploy will serve stale code from the old dist/.

Step 5: Restart the Server

Kill the old process and start a new one with the required env vars:

# Stop the running server
pkill -f 'node dist/server.js' || true
sleep 1

# Start with required env vars
export VERS_AUTH_TOKEN=<token>
nohup env VERS_AUTH_TOKEN=$VERS_AUTH_TOKEN node dist/server.js > /tmp/agent-services.log 2>&1 &

# Verify it started
sleep 2
cat /tmp/agent-services.log

You should see: vers-agent-services running on :3000

Step 6: Smoke Test

Health check (no auth required)

curl -s http://localhost:3000/health

Expected: {"status":"ok","uptime":...}

Spot-check endpoints (auth required)

# Board
curl -s -H "Authorization: Bearer $VERS_AUTH_TOKEN" http://localhost:3000/board/tasks | head -c 200

# Feed
curl -s -H "Authorization: Bearer $VERS_AUTH_TOKEN" http://localhost:3000/feed/events | head -c 200

# Registry
curl -s -H "Authorization: Bearer $VERS_AUTH_TOKEN" http://localhost:3000/registry/vms | head -c 200

# Commits
curl -s -H "Authorization: Bearer $VERS_AUTH_TOKEN" http://localhost:3000/commits | head -c 200

Each should return JSON with the expected structure. If any returns 401, the auth token is correct (middleware is working). If any returns nothing or errors, check the logs:

tail -50 /tmp/agent-services.log

Verify from outside the VM

From the orchestrator or another agent, test the external URL:

curl -s -H "Authorization: Bearer $VERS_AUTH_TOKEN" \
  http://{INFRA_VM_ID}.vm.vers.sh:3000/health

Step 7: Post-Deploy

Record the successful deploy in the feed
Snapshot the new working state (another vers_vm_commit) and record it with a post-deploy tag
Verify agents can still reach the board, feed, and registry

Rollback Procedure

If the deploy is broken:

Option A: Restore from Snapshot (clean rollback)

# From the orchestrator (not from the broken VM)
vers_vm_restore --commitId {PRE_DEPLOY_COMMIT_ID}
# → creates a new VM with the old state
# Update DNS/references to point to the new VM ID

This gives you a brand new VM with the exact pre-deploy state. The old broken VM can be deleted.

Option B: Git Revert (if the VM is still accessible)

ssh -o StrictHostKeyChecking=no root@{INFRA_VM_ID}.vm.vers.sh
cd /root/workspace/vers-agent-services
git log --oneline -5  # find the last known good commit
git reset --hard {GOOD_COMMIT_SHA}
npm install && npm run build
pkill -f 'node dist/server.js' || true
sleep 1
nohup env VERS_AUTH_TOKEN=$VERS_AUTH_TOKEN node dist/server.js > /tmp/agent-services.log 2>&1 &

Common Pitfalls

❌ Forgetting VERS_AUTH_TOKEN

What happens: The server starts but with no auth — all endpoints are open to anyone. You'll see this warning in the logs:

⚠️  VERS_AUTH_TOKEN is not set — all endpoints are unauthenticated.

Fix: Kill the server, set the env var, restart.

❌ Forgetting to Snapshot First

What happens: If the deploy breaks, you have no rollback point. You're stuck debugging a broken VM under pressure.

Fix: Always snapshot before deploying. Make it muscle memory. The commit ledger exists precisely for this.

❌ Stale dist/ Files

What happens: npm run build fails but you restart anyway. The server runs old compiled code from dist/, which may not match the new source. Subtle bugs ensue.

Fix: Never restart after a failed build. Fix the build error first, or roll back.

❌ Port Already in Use

What happens: pkill didn't fully kill the old process. The new server fails to bind to port 3000.

Fix:

# Find what's using port 3000
lsof -i :3000
# Force kill
kill -9 $(lsof -t -i :3000)
sleep 1
# Restart

❌ Deploying Without Testing

What happens: You push untested code and deploy it. The build passes but runtime errors break endpoints.

Fix: Always run npm test locally (or on a branch VM) before deploying. The infra VM is shared — breaking it affects all agents.

❌ Data File Permissions / Missing Directories

What happens: The server crashes on first write because data/ doesn't exist or has wrong permissions.

Fix: The stores auto-create directories, but if you see file permission errors:

mkdir -p /root/workspace/vers-agent-services/data
chmod 755 /root/workspace/vers-agent-services/data

Mehr aus diesem Repository

gleiches Repository

commit-discipline

hdresearch/vers-agent-services

Always register VM snapshots in the commit ledger. Use when committing/snapshotting VMs, before destroying VMs, during deploys, or when creating golden images. Ensures no snapshot goes untracked.

2026-02-130

magic-link

hdresearch/vers-agent-services

Generate a magic login link for the agent-services dashboard UI. Use when the user wants to access the dashboard, needs a login link, says "magic link," or asks for UI access.

2026-02-130

board

hdresearch/vers-agent-services

Shared task board for coordinating work across agents. Use when creating, assigning, tracking, or querying tasks in a swarm.

2026-02-120

preview-deploy

hdresearch/vers-agent-services

Deploy a branch to a preview VM for zero-risk change review. Snapshot infra, clone it, deploy the branch on the clone, share the link. Production untouched. Use for ALL UI changes, feature demos, and PR reviews.

2026-02-120

commits

hdresearch/vers-agent-services

Commit ledger for tracking VM snapshots. Use when recording, querying, or managing Vers VM commit history — golden images, infra snapshots, rollback points.

2026-02-110

log

hdresearch/vers-agent-services

Append-only work log for agent sessions. Use when recording what happened, what was decided, and what's next — the log is for the next session's orchestrator.

2026-02-110

name	deploy
description	Deploy agent-services to the infra VM. Use when shipping new code, restarting services, or rolling back a broken deploy. Covers pre-deploy snapshot, pull/build/restart, smoke tests, and rollback.

Deploy Runbook

Step-by-step procedure for deploying agent-services to the infra VM. Follow this exactly — skipping steps has caused real incidents (unauthenticated servers, unrecoverable state).

When to Use

Shipping new features — after a PR merges to main
Hotfixing a broken service — pull the fix, rebuild, restart
Rolling back — restore from pre-deploy snapshot
Restarting after a crash — same process minus the git pull

Prerequisites

You need:

The infra VM ID (check the registry or ask the orchestrator)
The VERS_AUTH_TOKEN value (check with the orchestrator — if lost, all agents need reconfiguration)
SSH access to the VM (root@{VM_ID}.vm.vers.sh)

Step 1: Snapshot the Infra VM

Do this FIRST. Every time. No exceptions.

Before touching anything, snapshot the current state so you can roll back:

# From the orchestrator or any agent with vers tools
vers_vm_commit --vmId {INFRA_VM_ID}
# → returns a commit ID, e.g., "commit-abc123"

Record the commit in the ledger (if the service is still running):

curl -X POST "http://{INFRA_VM_ID}.vm.vers.sh:3000/commits" \
  -H "Authorization: Bearer $VERS_AUTH_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "commitId": "commit-abc123",
    "vmId": "{INFRA_VM_ID}",
    "label": "pre-deploy-backup",
    "agent": "your-agent-name",
    "tags": ["backup", "infra", "pre-deploy"]
  }'

Save the commit ID. You will need it if the deploy goes wrong.

Step 2: SSH into the Infra VM

ssh -o StrictHostKeyChecking=no root@{INFRA_VM_ID}.vm.vers.sh

Step 3: Pull Latest Code

cd /root/workspace/vers-agent-services
git fetch origin
git checkout main
git reset --hard origin/main

Using reset --hard ensures a clean state — no merge conflicts, no stale local changes.

Step 4: Install and Build

npm install
npm run build

Watch for errors. If npm run build (TypeScript compilation) fails, do not proceed — the deploy will serve stale code from the old dist/.

Step 5: Restart the Server

Kill the old process and start a new one with the required env vars:

# Stop the running server
pkill -f 'node dist/server.js' || true
sleep 1

# Start with required env vars
export VERS_AUTH_TOKEN=<token>
nohup env VERS_AUTH_TOKEN=$VERS_AUTH_TOKEN node dist/server.js > /tmp/agent-services.log 2>&1 &

# Verify it started
sleep 2
cat /tmp/agent-services.log

You should see: vers-agent-services running on :3000

Step 6: Smoke Test

Health check (no auth required)

curl -s http://localhost:3000/health

Expected: {"status":"ok","uptime":...}

Spot-check endpoints (auth required)

# Board
curl -s -H "Authorization: Bearer $VERS_AUTH_TOKEN" http://localhost:3000/board/tasks | head -c 200

# Feed
curl -s -H "Authorization: Bearer $VERS_AUTH_TOKEN" http://localhost:3000/feed/events | head -c 200

# Registry
curl -s -H "Authorization: Bearer $VERS_AUTH_TOKEN" http://localhost:3000/registry/vms | head -c 200

# Commits
curl -s -H "Authorization: Bearer $VERS_AUTH_TOKEN" http://localhost:3000/commits | head -c 200

Each should return JSON with the expected structure. If any returns 401, the auth token is correct (middleware is working). If any returns nothing or errors, check the logs:

tail -50 /tmp/agent-services.log

Verify from outside the VM

From the orchestrator or another agent, test the external URL:

curl -s -H "Authorization: Bearer $VERS_AUTH_TOKEN" \
  http://{INFRA_VM_ID}.vm.vers.sh:3000/health

Step 7: Post-Deploy

Record the successful deploy in the feed
Snapshot the new working state (another vers_vm_commit) and record it with a post-deploy tag
Verify agents can still reach the board, feed, and registry

Rollback Procedure

If the deploy is broken:

Option A: Restore from Snapshot (clean rollback)

# From the orchestrator (not from the broken VM)
vers_vm_restore --commitId {PRE_DEPLOY_COMMIT_ID}
# → creates a new VM with the old state
# Update DNS/references to point to the new VM ID

This gives you a brand new VM with the exact pre-deploy state. The old broken VM can be deleted.

Option B: Git Revert (if the VM is still accessible)

ssh -o StrictHostKeyChecking=no root@{INFRA_VM_ID}.vm.vers.sh
cd /root/workspace/vers-agent-services
git log --oneline -5  # find the last known good commit
git reset --hard {GOOD_COMMIT_SHA}
npm install && npm run build
pkill -f 'node dist/server.js' || true
sleep 1
nohup env VERS_AUTH_TOKEN=$VERS_AUTH_TOKEN node dist/server.js > /tmp/agent-services.log 2>&1 &

Common Pitfalls

❌ Forgetting VERS_AUTH_TOKEN

What happens: The server starts but with no auth — all endpoints are open to anyone. You'll see this warning in the logs:

⚠️  VERS_AUTH_TOKEN is not set — all endpoints are unauthenticated.

Fix: Kill the server, set the env var, restart.

❌ Forgetting to Snapshot First

What happens: If the deploy breaks, you have no rollback point. You're stuck debugging a broken VM under pressure.

Fix: Always snapshot before deploying. Make it muscle memory. The commit ledger exists precisely for this.

❌ Stale dist/ Files

What happens: npm run build fails but you restart anyway. The server runs old compiled code from dist/, which may not match the new source. Subtle bugs ensue.

Fix: Never restart after a failed build. Fix the build error first, or roll back.

❌ Port Already in Use

What happens: pkill didn't fully kill the old process. The new server fails to bind to port 3000.

Fix:

# Find what's using port 3000
lsof -i :3000
# Force kill
kill -9 $(lsof -t -i :3000)
sleep 1
# Restart

❌ Deploying Without Testing

What happens: You push untested code and deploy it. The build passes but runtime errors break endpoints.

Fix: Always run npm test locally (or on a branch VM) before deploying. The infra VM is shared — breaking it affects all agents.

❌ Data File Permissions / Missing Directories

What happens: The server crashes on first write because data/ doesn't exist or has wrong permissions.

Fix: The stores auto-create directories, but if you see file permission errors:

mkdir -p /root/workspace/vers-agent-services/data
chmod 755 /root/workspace/vers-agent-services/data