qa-cycle-kc

Updated: April 30, 2026, 23:42

Run an autonomous QA cycle against the Keycloak dev stack — dispatches QA, Product, Dev, and Infra subagents in a loop until the lifecycle scenario passes end-to-end. Uses dev ports (3000/8080/8443/8180) with real Keycloak authentication. Usage - /qa-cycle-kc <scenario-file> [gap-report] [--resume]

Source

rakheen-dama

rakheen-dama/b2b-strawman

QA Cycle (Keycloak Dev Stack) — In-Session Orchestration

Run all QA cycle agent turns directly in this session against the Keycloak dev stack (not the E2E mock-auth stack). Each agent role (QA, Product, Dev, Infra) is dispatched as a subagent via the Agent tool. You (the orchestrator) inspect results between turns and adapt when things go wrong.

Environment — Keycloak Dev Stack (Local Services)

Services run locally in the background (not Docker). Only infrastructure runs via Docker Compose. Use compose/scripts/svc.sh to manage services:

bash compose/scripts/svc.sh start all              # Start backend, gateway, frontend, portal
bash compose/scripts/svc.sh restart backend         # Restart after Java changes
bash compose/scripts/svc.sh stop frontend portal    # Stop specific services
bash compose/scripts/svc.sh status                  # Health check all services
bash compose/scripts/svc.sh logs backend            # Last 50 lines of log

Service	How to Start	URL
Infra (Postgres, LocalStack, Mailpit, Keycloak)	`bash compose/scripts/dev-up.sh`	various
Backend	`svc.sh start backend` (or `SPRING_PROFILES_ACTIVE=local,keycloak ./mvnw spring-boot:run`)	http://localhost:8080
Frontend	`svc.sh start frontend` (or `NEXT_PUBLIC_AUTH_MODE=keycloak pnpm dev`)	http://localhost:3000
Gateway	`svc.sh start gateway` (or `./mvnw spring-boot:run` in gateway/)	http://localhost:8443
Portal	`svc.sh start portal` (or `pnpm dev` in portal/)	http://localhost:3002
Keycloak Bootstrap	`bash compose/scripts/keycloak-bootstrap.sh` (run once after first start)	—

Stop infra: bash compose/scripts/dev-down.sh
Stop services: bash compose/scripts/svc.sh stop all
Restart after code changes: bash compose/scripts/svc.sh restart backend (Java changes need a restart; frontend/portal use HMR)
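The first-time start order this skill describes (Docker infra, wait for Keycloak, bootstrap, then local services) could be scripted roughly as below. This is only a sketch: the `wait_for` and `first_start` function names and the retry budget of 30 attempts, 2 seconds apart, are assumptions made for illustration. The script paths and the realm URL are the ones quoted in this document.

```shell
# Retry a command until it succeeds or the attempts run out.
#   wait_for <max_attempts> <sleep_seconds> <command...>
wait_for() {
  local attempts="$1" delay="$2"
  shift 2
  local i
  for ((i = 1; i <= attempts; i++)); do
    if "$@"; then
      return 0
    fi
    # Do not sleep after the final failed attempt.
    if (( i < attempts )); then
      sleep "$delay"
    fi
  done
  return 1
}

# Hypothetical first-start wrapper (assumed name and retry budget).
first_start() {
  bash compose/scripts/dev-up.sh || return 1
  # Keycloak is treated as ready once the realm endpoint answers.
  wait_for 30 2 curl -sf -o /dev/null http://localhost:8180/realms/docteams || return 1
  bash compose/scripts/keycloak-bootstrap.sh || return 1
  bash compose/scripts/svc.sh start all
}
```

Calling `first_start` from the repository root would run the four steps in order and stop at the first one that fails.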

Key URLs

Service	URL
Frontend	http://localhost:3000
Backend	http://localhost:8080
Gateway (BFF)	http://localhost:8443
Keycloak Admin	http://localhost:8180 (admin/admin)
Mailpit UI	http://localhost:8025
Mailpit API	http://localhost:8025/api/v1/

Platform Admin (pre-created by keycloak-bootstrap.sh)

User	Email	Password	Role
Platform Admin	padmin@docteams.local	password	platform-admin

Other users (org owners, members) are created through the product's onboarding flow — not pre-seeded.

Keycloak Login Flow (Playwright)

Authentication uses a multi-redirect OIDC flow. The keycloak-auth.ts fixture at frontend/e2e/fixtures/keycloak-auth.ts provides helper functions:

loginAs(page, email, password) — navigates to /dashboard, follows redirect to Keycloak login, fills form, waits for redirect back
loginAsPlatformAdmin(page) — shortcut for padmin
registerFromInvite(page, inviteLink, firstName, lastName, password) — follows KC invite link, fills registration form

Keycloak form selectors are centralized in frontend/e2e/fixtures/keycloak-selectors.ts (based on Keycloakify theme). If the theme changes, update that one file.

Onboarding Flow (How Orgs Are Created)

New user clicks "Get Started" → /request-access
Fills form: email, name, org name, country, industry → submits
Receives OTP via email (Mailpit) → enters OTP → request goes to PENDING
Platform admin logs in → /platform-admin/access-requests → approves
Approval triggers: Keycloak org creation → tenant schema provisioning → invite email
User clicks invite link from Mailpit → Keycloak registration page → sets password
User logs in → backend JIT syncs member → user becomes org owner
Owner invites members via Teams page (needs plan upgrade for >2 members)

Email Integration (Mailpit API)

The mailpit.ts helper at frontend/e2e/helpers/mailpit.ts provides:

clearMailbox() — delete all emails (call before test runs)
waitForEmail(recipient, { subject?, timeout? }) — polls until email arrives
extractOtp(email) — extracts 6-digit code from email body
extractInviteLink(email) — extracts Keycloak invite/registration URL
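To make the OTP step concrete, the sketch below pulls the first standalone 6-digit run out of a block of text. It is written in bash rather than the TypeScript of the real helper, and the name `extract_otp` is an assumption for illustration; the actual logic in mailpit.ts may differ.

```shell
# Hypothetical sketch: print the first standalone 6-digit code in a text body.
# This is not the code in frontend/e2e/helpers/mailpit.ts.
extract_otp() {
  local body="$1"
  # Six digits that are not part of a longer digit sequence.
  local re='(^|[^0-9])([0-9]{6})([^0-9]|$)'
  if [[ "$body" =~ $re ]]; then
    printf '%s\n' "${BASH_REMATCH[2]}"
  else
    return 1
  fi
}
```

For example, `extract_otp "Your verification code is 482913."` prints `482913`, while a body containing only a longer number such as `12345678` makes the function return non-zero.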

Running E2E Tests

# Run Keycloak onboarding + member invite tests
cd frontend && E2E_AUTH_MODE=keycloak npx playwright test keycloak/ --config e2e/playwright.config.ts --reporter=list

# Debug with headed browser
cd frontend && E2E_AUTH_MODE=keycloak npx playwright test keycloak/onboarding --config e2e/playwright.config.ts --headed

The E2E_AUTH_MODE=keycloak env var:

Sets Playwright to 60s timeout, 1 worker (serial)
Allows port 3000 navigation in the PreToolUse hook

Why In-Session (not bash script)

A bash script terminates at the first failure and cannot adapt. In-session orchestration gives you:

Error recovery: Inspect failures, adjust specs, retry
Adaptive flow: Skip non-blocking items, reorder priorities
Context preservation: You see all agent outputs and can carry lessons forward
No nesting issues: Agent tool works cleanly, no claude -p inside Claude

Arguments

<scenario-file> — path to the lifecycle script (e.g., tasks/phase47-lifecycle-script.md)
[gap-report] — optional path to pre-existing gap report
[--resume] — resume an existing cycle (skip branch/dir creation)
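The argument shape can be illustrated with a small parser. The skill itself is invoked as a slash command, so this bash sketch is purely illustrative; the function name `parse_args` and the variables `SCENARIO_FILE`, `GAP_REPORT`, and `RESUME` are assumptions.

```shell
# Hypothetical sketch of: /qa-cycle-kc <scenario-file> [gap-report] [--resume]
parse_args() {
  SCENARIO_FILE=""
  GAP_REPORT=""
  RESUME=0
  local arg
  for arg in "$@"; do
    case "$arg" in
      --resume) RESUME=1 ;;
      -*) echo "unknown option: $arg" >&2; return 2 ;;
      *)
        # First positional is the scenario file, second the gap report.
        if [ -z "$SCENARIO_FILE" ]; then
          SCENARIO_FILE="$arg"
        elif [ -z "$GAP_REPORT" ]; then
          GAP_REPORT="$arg"
        else
          echo "unexpected argument: $arg" >&2
          return 2
        fi
        ;;
    esac
  done
  if [ -z "$SCENARIO_FILE" ]; then
    echo "usage: /qa-cycle-kc <scenario-file> [gap-report] [--resume]" >&2
    return 2
  fi
}
```

After `parse_args tasks/phase47-lifecycle-script.md --resume`, `SCENARIO_FILE` holds the script path, `GAP_REPORT` is empty, and `RESUME` is 1.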

State Files

All cycle state lives in qa_cycle/ on the parent branch:

File	Purpose
`qa_cycle/status.md`	Shared tracker — all agents read/write this
`qa_cycle/fix-specs/{GAP_ID}.md`	Product writes, Dev reads
`qa_cycle/checkpoint-results/day-{NN}.md`	QA writes test results
`qa_cycle/error-log.md`	Docker log errors (manual check)

Orchestrator Rules

Stay lean: Do NOT read the scenario file, ARCHITECTURE.md, or the CLAUDE.md files in subdirectories. Subagents do that.
Read status.md between every turn: This is your decision input.
One agent at a time: Each agent turn is a blocking subagent call. No parallel agent turns within the same cycle.
Max 3 retries per fix: If a Dev fix fails 3 times, mark as STUCK in status.md and move on.
Max 20 cycles: If not ALL_DAYS_COMPLETE after 20 cycles, stop and summarize.
Commit between turns: Each agent should commit and push its changes before returning.

Step 0 — Setup (First Run Only, skip if --resume)

# Verify branch
BRANCH="bugfix_cycle_$(date +%Y-%m-%d)"
git checkout "$BRANCH" 2>/dev/null || git checkout -b "$BRANCH"

# Create directories
mkdir -p qa_cycle/fix-specs qa_cycle/checkpoint-results

# Verify status.md exists (must be pre-seeded or created by user)
test -f qa_cycle/status.md || echo "ERROR: qa_cycle/status.md not found"

If gap-report argument was provided, initialize status.md from it (extract gaps into tracker table). If status.md already exists, skip.

Step 1 — Decide Next Action

Read qa_cycle/status.md and determine the next action:

IF Dev Stack = "Not running" AND OPEN blockers tagged "Infra":
  → Infra Agent (seed fix + start stack)

ELIF NEEDS_REBUILD flag set:
  → Infra Agent (rebuild)

ELIF any SPEC_READY items exist:
  → Dev Agent (fix first SPEC_READY item)

ELIF any OPEN/REOPENED items exist AND QA is blocked:
  → Product Agent (triage OPEN items into SPEC_READY)

ELSE:
  → QA Agent (execute next day/checkpoint)

After each agent returns, go back to Step 1 (read status.md again, decide next action).
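The precedence in the decision table can be checked with a small function. This is a sketch only: the orchestrator makes this decision in-session, not in bash, and the function name, argument order, and returned labels are assumptions. How the inputs are derived from status.md is not shown.

```shell
# Hypothetical sketch of the Step 1 decision order.
#   $1 dev stack state: "running" or "not-running"
#   $2 number of OPEN blockers tagged Infra
#   $3 NEEDS_REBUILD flag: 0 or 1
#   $4 number of SPEC_READY items
#   $5 number of OPEN/REOPENED items
#   $6 QA blocked: 0 or 1
decide_next_action() {
  local stack="$1" infra_blockers="$2" needs_rebuild="$3"
  local spec_ready="$4" open_items="$5" qa_blocked="$6"

  if [ "$stack" = "not-running" ] && [ "$infra_blockers" -gt 0 ]; then
    echo "infra-seed"
  elif [ "$needs_rebuild" -eq 1 ]; then
    echo "infra-rebuild"
  elif [ "$spec_ready" -gt 0 ]; then
    echo "dev"
  elif [ "$open_items" -gt 0 ] && [ "$qa_blocked" -eq 1 ]; then
    echo "product"
  else
    echo "qa"
  fi
}
```

Because the branches are evaluated top to bottom, a pending rebuild wins over SPEC_READY items, and open items that do not block QA fall through to the QA Agent.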

Step 2 — Agent Dispatches

Infra Agent (Seed Fix / Rebuild)

Launch a blocking general-purpose subagent:

You are the **Infra Agent** for the QA cycle on branch `{BRANCH}`.

## Context
{IF seed fix: Read the infra-seed prompt at scripts/qa-cycle/prompts/infra-seed.md (if it exists)}
{IF rebuild: Rebuild the dev stack after Dev fixes have been merged.}

## Your Job
{IF first start: Start the Keycloak dev stack infra and verify local services are running.}
{IF rebuild: Restart specific local services after Dev fixes.}

## Service Management
Use `compose/scripts/svc.sh` to manage local services (background, PID-tracked, with health waits):

```bash
bash compose/scripts/svc.sh status              # Check health of all services
bash compose/scripts/svc.sh start all            # Start backend, gateway, frontend, portal
bash compose/scripts/svc.sh restart backend      # Restart after Java changes
bash compose/scripts/svc.sh stop frontend portal # Stop specific services
bash compose/scripts/svc.sh logs backend         # Last 50 lines of log
```

Service	Port	Health Check
Backend	8080	/actuator/health
Gateway	8443	/actuator/health
Frontend	3000	/
Portal	3002	/

Docker infra (Postgres, Keycloak, Mailpit, LocalStack) is managed separately via bash compose/scripts/dev-up.sh.

Prerequisites Check

Verify Docker infra is running: bash compose/scripts/dev-up.sh
Check service health: bash compose/scripts/svc.sh status
Start any services that are down: bash compose/scripts/svc.sh start all

Starting the Stack (first time)

Start Docker infra: bash compose/scripts/dev-up.sh
Wait for Keycloak to be ready: curl -sf http://localhost:8180/realms/docteams
Run Keycloak bootstrap (creates platform admin): bash compose/scripts/keycloak-bootstrap.sh
Start local services: bash compose/scripts/svc.sh start all
If any service fails to start, check logs: bash compose/scripts/svc.sh logs {service}

NOTE: Org/user data is NOT pre-seeded. The QA lifecycle script's Day 0 exercises the real onboarding flow (access request → admin approval → Keycloak registration).

Rebuilding (after Dev fixes)

Restart the affected service: bash compose/scripts/svc.sh restart backend (or gateway/frontend/portal)
svc.sh will stop, restart, and wait for health check automatically.
If Docker infra changed: bash compose/scripts/dev-rebuild.sh {service}
Clear NEEDS_REBUILD from status.md.
When to restart: Backend/Gateway need restart after Java source changes. Frontend/Portal use HMR (auto-reload).

State File

Read and update: qa_cycle/status.md

Guard Rails

Commit directly to {BRANCH} (infra changes, not feature PRs)
Run backend tests if you change seeder code
Read backend/CLAUDE.md before making backend changes
If rebuild fails after 2 attempts, report the error and exit


### QA Agent

Launch a **blocking** `general-purpose` subagent:

You are the QA Agent for the QA cycle on branch {BRANCH}.

Your Job

Execute the lifecycle script via Playwright MCP against the Keycloak dev stack (http://localhost:3000). Record pass/fail for each checkpoint. Stop when you hit a blocker.

Before You Start

Read qa_cycle/status.md — check "QA Position" for where to resume.
Read the scenario file: {SCENARIO_FILE}
Skip to the day/checkpoint in QA Position.
Check which gaps are FIXED — verify those first.

Keycloak Authentication via Playwright

To log in as a user (e.g., Alice):

Navigate to http://localhost:3000/dashboard (or any protected route)
You will be redirected through the gateway to the Keycloak login page
Wait for the Keycloak login form to appear (look for #username or input[name="username"])
Fill in username: alice@example.com
Fill in password: password
Click the login button (#kc-login or input[type="submit"])
Wait for redirect back to the frontend — you should land on /org/acme-corp/dashboard or similar
Verify you see the authenticated UI (sidebar, user avatar, etc.)

To switch users:

Log out first (if the app has a sign-out button, click it; otherwise clear cookies)
Follow the login steps above with the new user's credentials

Available users:

User	Email	Password	Role
Alice	alice@example.com	password	owner
Bob	bob@example.com	password	admin
Carol	carol@example.com	password	member

Organization slug: acme-corp

Execution Rules

One day at a time. Complete all checkpoints before moving to next day.
Record every checkpoint: ID, Result (PASS/FAIL/PARTIAL), Evidence
On blocker: Stop. Log it. Exit. Do NOT skip ahead.
On non-cascading bug: Log it and continue.
Check console errors after each page navigation.
Take screenshots on failures for evidence.

Verifying Fixes

When resuming after Dev fixes:

Re-run the blocked checkpoint.
PASS → mark gap VERIFIED in status.md.
FAIL → mark gap REOPENED with new evidence.
Continue forward.

Writing Results

Write to qa_cycle/checkpoint-results/day-{NN}.md with checkpoint ID, result, evidence, gap ID.

Updating Status

Update "QA Position" to next unexecuted checkpoint.
New blockers → add row to Tracker (OPEN, severity, owner).
Verified fixes → FIXED → VERIFIED.
Reopened fixes → FIXED → REOPENED.
If all days complete → add ALL_DAYS_COMPLETE.
Add log entries.

Commit

Commit checkpoint results + status.md to {BRANCH} and push. Message: qa: Day {N} checkpoint results (cycle {CYCLE})

Do NOT fix issues yourself. Test and document only.

CRITICAL: No SQL Shortcuts

Do NOT use direct SQL queries (INSERT, UPDATE, DELETE) to create or modify data. ALL operations must go through the REST API or browser UI:

Customer creation: POST /api/customers
Lifecycle transitions: POST /api/customers/{id}/transition
Checklist completion: POST /api/checklists/{id}/items/{itemId}/complete
Document uploads: POST /api/projects/{id}/documents/upload-init → PUT to S3 → POST /api/documents/{id}/confirm
Time entries: POST /api/time-entries
Invoices: POST /api/invoices
Member management: POST /internal/members/sync

If an API step fails, log it as a gap; do NOT work around it with SQL.


### Product Agent

Launch a **blocking** `general-purpose` subagent:

You are the Product Agent for the QA cycle on branch {BRANCH}.

Your Job

Triage all OPEN/REOPENED items in qa_cycle/status.md. Write fix specifications that Dev agents can implement. Determine if bugs are cascading (escalate to blocker).

Before You Start

Read qa_cycle/status.md — focus on OPEN and REOPENED items.
Read qa_cycle/error-log.md for backend errors.
Read latest checkpoint results in qa_cycle/checkpoint-results/.
Read {GAP_REPORT} for background context (if provided).

Triage Rules

Blocker: QA cannot proceed. Next checkpoint depends on this.
Bug: Wrong but QA can work around it.
Cascading bug → blocker: Bug causes 2+ downstream failures. Escalate.
WONT_FIX: Requires new infra or days of work. Out of scope for this cycle.
Mark an item SPEC_READY only if it is fixable in under 2 hours of dev work.

Prioritize by QA Position

Fix blockers at the CURRENT QA day first. Don't spec Day 90 fixes when QA is stuck on Day 0.

Fix Spec Format

Write one file per item to qa_cycle/fix-specs/{GAP_ID}.md:

# Fix Spec: {GAP_ID} — {Summary}
## Problem
{2-3 sentences with evidence from QA checkpoint results}
## Root Cause (hypothesis)
{File paths, class names, method names — use grep to confirm}
## Fix
{Step-by-step: "Add X to Y", "Change Z from A to B". Include file paths.}
## Scope
Backend / Frontend / Both / Seed / Docker
Files to modify: {list}
Files to create: {list}
Migration needed: yes/no
## Verification
{Which checkpoint to re-run}
## Estimated Effort
S (< 30 min) / M (30 min - 2 hr) / L (> 2 hr)

Updating Status

Change triaged items: OPEN → SPEC_READY.
Escalate cascading bugs to blocker.
Add log entries.
Commit and push to {BRANCH}.

Key: Search the codebase before writing specs

Use grep/glob to confirm root cause hypotheses. Include actual file paths and line numbers.


### Dev Agent

Launch a **blocking** `general-purpose` subagent with `isolation: "worktree"`:

You are the Dev Agent for the QA cycle on branch {BRANCH}.

Your Fix

Read the fix spec at: qa_cycle/fix-specs/{GAP_ID}.md

Before You Start

Read the fix spec — it has problem, root cause, fix steps, file paths.
Read relevant CLAUDE.md (backend/CLAUDE.md and/or frontend/CLAUDE.md).
Check qa_cycle/status.md for context.

Workflow

1. Create Fix Branch

git checkout {BRANCH}
git pull origin {BRANCH}
git checkout -b fix/{GAP_ID}

2. Implement

Follow the fix spec steps exactly. Read files before editing. Keep changes minimal.

3. Reproduce-before-fix (CLAUDE.md §4 — mandatory)

Before writing any fix, you must reproduce the bug locally. Run the failing scenario / open the failing page / hit the failing endpoint, observe the actual broken behaviour, save evidence (screenshot, log line, payload). Diagnostic-by-spec ("the spec says line 88, change it") is forbidden — bugs have shipped from the wrong subtree more than once. If you can't reproduce, the spec is wrong; report up, don't fix-and-pray.

4. Build & Verify (CLAUDE.md §1 — full verify is mandatory)

Targeted tests are for inner-loop iteration. The merge bar is a clean full verify. Don't ship without it.

Backend (if in scope):

cd backend
./mvnw spotless:apply 2>&1 | tail -3
./mvnw compile test-compile -q > /tmp/mvn-compile.log 2>&1     # quick gate
./mvnw test -Dtest='<your-targeted-class>' > /tmp/mvn-targeted.log 2>&1  # iterate
# THEN before PR:
./mvnw verify > /tmp/mvn-verify.log 2>&1                        # MANDATORY before PR
# If verify is green:
cat > .claude/markers/verify-backend.json <<EOF
{"commit":"$(git rev-parse --short HEAD)","command":"./mvnw verify","exit":0,"ts":"$(date -u +%Y-%m-%dT%H:%M:%SZ)","summary":"<test count from log>"}
EOF

Frontend (if in scope):

cd frontend
NODE_OPTIONS="" /opt/homebrew/bin/pnpm install > /dev/null 2>&1
NODE_OPTIONS="" /opt/homebrew/bin/pnpm run lint > /tmp/lint-fix.log 2>&1   # full lint
NODE_OPTIONS="" /opt/homebrew/bin/pnpm run build > /tmp/build-fix.log 2>&1 # full build
NODE_OPTIONS="" /opt/homebrew/bin/pnpm test > /tmp/test-fix.log 2>&1       # full vitest, NOT narrowed
# If green:
cat > .claude/markers/verify-frontend.json <<EOF
{"commit":"$(git rev-parse --short HEAD)","command":"pnpm run lint && pnpm run build && pnpm test","exit":0,"ts":"$(date -u +%Y-%m-%dT%H:%M:%SZ)","summary":"<test count>"}
EOF

Portal (if in scope): same pattern as Frontend, write verify-portal.json.

If any step fails: fix it, re-run from the top. Max 3 attempts before marking STUCK and exiting. Do NOT write a marker for a failing run.

5. Commit & Push

git add <specific files>                # ONLY the files for this fix — no scope creep
git commit -m "fix({GAP_ID}): {short description}"
git push -u origin fix/{GAP_ID}

6. Create PR

gh pr create --base {BRANCH} --title "Fix {GAP_ID}: {summary}" --body "..."

PR body MUST include:

Summary of the bug (with reproduction evidence: file:line, screenshot, log).
Root cause (verified, not hypothesized).
Files changed and why each one.
Verification results (mvnw verify test counts, lint/build/test outcomes).
Out-of-scope items (anything you noticed but did not fix).

7. Review (CLAUDE.md §2 — mandatory for agent-authored PRs)

Self-review is not enough. Either:

(a) Wait for CodeRabbit (if configured on the repo), or
(b) Dispatch a superpowers:code-reviewer subagent on the PR with framing "find the slop", or
(c) Stop and ask the user to review before merge.

Do NOT merge an agent-authored PR without an independent review pass.

8. Merge (gated by `.claude/hooks/pre-pr-merge-gate.sh`)

The merge-gate hook blocks gh pr merge when both of these hold:

The verify marker for any touched area (backend / frontend / portal) is missing, stale (>24h), or records exit != 0.
The PR is not documentation-only.

If the hook blocks you, that means a marker is missing — fix the verify, write the marker, retry. Do NOT bypass with --admin or by editing the hook.

gh pr merge {PR_NUMBER} --squash --delete-branch
git checkout {BRANCH} && git pull origin {BRANCH}

9. Update Status (post-merge)

Set gap status to FIXED in qa_cycle/status.md (NOT VERIFIED; that comes after QA re-runs the scenario).
If backend/gateway changed: run bash compose/scripts/svc.sh restart backend (or gateway).
If frontend/portal changed: HMR picks up changes automatically (no restart needed).
Add log entry.
Commit and push to {BRANCH}.

Use MERGED-AWAITING-VERIFY if behaviour was not end-to-end verified post-merge. Don't claim VERIFIED without observing the fix work in browser/Mailpit/DB.

Guard Rails (CLAUDE.md §1–§10)

These are NOT advice. Loopholes are forbidden. If a rule blocks you, raise it; don't bypass.

One fix per PR. Same-bug-class clusters (e.g. 3 dialogs with identical defect) only with explicit authorization.
Reproduce before fix. No diagnostic-by-spec. If you can't reproduce, the spec is wrong; report up.
Full verify is mandatory before PR, NOT targeted tests. The .claude/markers/verify-*.json files must exist and be current. The pre-merge hook will block merge without them.
Don't touch code outside the spec's scope. Scope expansion = halt and re-spec, not "while I was here."
Max 3 build attempts. Report failure (specific error) and exit STUCK if still broken. Don't band-aid.
If spec is wrong, exit STUCK with notes. Don't silently change scope or invent a different fix.
PASS means observed end-to-end (browser/log/Mailpit/DB). Inferred PASS is forbidden. Use DEFERRED or MERGED-AWAITING-VERIFY when behaviour is unverified.
Status reports are drafts. Write what you actually did. "Stream timed out, here's what got done" is correct. Inflated PASS claims are dishonest.
Pride and quality. Slow correct fix > fast broken fix. This is not a race.

Environment

Postgres host: b2mash.local:5432
LocalStack host: b2mash.local:4566
pnpm: /opt/homebrew/bin/pnpm
NODE_OPTIONS="" needed before pnpm commands
SHELL=/bin/bash prefix for docker build


**IMPORTANT**: If the Dev agent is dispatched with `isolation: "worktree"`, it already has an isolated copy. Adjust the branch/merge commands accordingly — the agent creates the fix branch from the worktree's HEAD, and the PR targets `{BRANCH}`.

If NOT using worktree isolation (e.g., for seed/infra fixes that commit directly to the parent branch), omit the `isolation` parameter.

## Step 3 — Error Recovery

After each agent returns, inspect the result:

| Situation | Action |
|-----------|--------|
| Agent succeeded | Read status.md, go to Step 1 |
| Dev build failed 3x | Mark gap as STUCK in status.md, move to next SPEC_READY item |
| QA found new blocker | Product Agent will triage it next cycle |
| Fix spec was wrong | Re-dispatch Product Agent to rewrite the spec |
| Infra rebuild failed | Check Docker logs manually, fix, retry once |
| Agent ran out of context | Resume with fresh subagent, pass status.md state as context |
| REOPENED after Dev fix | Increment retry counter; if 3rd reopen, mark STUCK |
| Keycloak login failed | Check Keycloak health, check `/etc/hosts`, check gateway session store |

### Retry Tracking

Keep a mental counter (or a note in the status.md log) of retries per gap.
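If a written record is preferred over a mental counter, one possible approach is an append-only log with one line per failed attempt. The sketch below is an assumption throughout: the `qa_cycle/retries.log` path, the function names, and the gap ID format are hypothetical and not part of this skill. Only the limit of 3 comes from the rules above.

```shell
# Hypothetical sketch: track retries per gap in a plain-text log.
RETRY_LOG="${RETRY_LOG:-qa_cycle/retries.log}"   # assumed path
MAX_RETRIES=3                                    # limit stated in this skill

# Record one failed attempt for a gap and print the new total.
record_retry() {
  local gap_id="$1"
  mkdir -p "$(dirname "$RETRY_LOG")"
  printf '%s\n' "$gap_id" >> "$RETRY_LOG"
  grep -cxF -- "$gap_id" "$RETRY_LOG"
}

# Exit 0 once a gap has reached the retry limit, non-zero otherwise.
is_stuck() {
  local gap_id="$1" count
  count=$(grep -cxF -- "$gap_id" "$RETRY_LOG" 2>/dev/null || true)
  [ "${count:-0}" -ge "$MAX_RETRIES" ]
}
```

A caller would run `record_retry <gap-id>` after each failed or reopened fix and mark the gap STUCK in status.md once `is_stuck <gap-id>` succeeds.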


## Step 4 — Cycle Summary

After each full cycle (QA → Product → Dev → optional Infra), log a summary:

Cycle {N} complete:

QA position: Day {X}, Checkpoint {Y}
Items fixed this cycle: {list}
Items stuck: {list}
Items remaining: {count}
Next action: {what Step 1 will dispatch}


## Step 5 — Completion

When `ALL_DAYS_COMPLETE` appears in status.md OR max cycles reached:

1. Read final status.md
2. Count: VERIFIED, FIXED, OPEN, STUCK, WONT_FIX
3. Report summary to user
4. If all days complete: suggest merging the bugfix branch to main
5. If max cycles: list remaining blockers and recommend next steps
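
The tally in step 2 could be produced with a helper like the one below. The layout of status.md is not specified here, so this sketch assumes each gap is a Markdown table row with the status in a cell of its own; the function name `count_status` is also an assumption.

```shell
# Hypothetical sketch: count tracker rows whose status cell matches exactly.
# Assumes rows look like "| GAP-ID | ... | STATUS |" in a Markdown table.
count_status() {
  local status="$1" file="${2:-qa_cycle/status.md}"
  # grep -c prints 0 and exits 1 when nothing matches, so mask the exit code.
  grep -cE "^\|.*\|[[:space:]]*${status}[[:space:]]*\|" "$file" || true
}
```

Running it once per status (VERIFIED, FIXED, OPEN, STUCK, WONT_FIX) gives the counts for the final report.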

## Differences from /qa-cycle (E2E Mock-Auth)

| Aspect | /qa-cycle (E2E) | /qa-cycle-kc (Keycloak) |
|--------|-----------------|-------------------------|
| Frontend | http://localhost:3001 (Docker) | http://localhost:3000 (local pnpm dev) |
| Backend | http://localhost:8081 (Docker) | http://localhost:8080 (local mvnw) |
| Auth | Mock IDP (port 8090) | Keycloak (port 8180) via Gateway BFF (8443) |
| Login flow | Navigate to `/mock-login`, click Sign In | OIDC redirect → Keycloak login form → fill email/password |
| Services | All in Docker | Infra in Docker, services run locally in the background (`svc.sh`) |
| Start | `e2e-up.sh` | `dev-up.sh` + `svc.sh start all` |
| Seed data | Docker seed container (automatic) | `keycloak-bootstrap.sh` (platform admin only). Orgs created through UI. |
| Postgres | localhost:5433, db: app | localhost:5432, db: docteams |
| Prerequisite | None | None (gateway runs locally, uses localhost:8180) |
| E2E fixtures | `e2e/fixtures/auth.ts` (mock) | `e2e/fixtures/keycloak-auth.ts` |
| Email helper | None | `e2e/helpers/mailpit.ts` (OTP + invite links) |

## Guardrails

- **Orchestrator stays lean**: Never read the scenario file, ARCHITECTURE.md, or the CLAUDE.md files in subdirectories
- **State is in status.md**: All decisions derive from reading this file
- **Sequential agent turns**: One agent at a time, inspect result, then decide next
- **Dev uses worktree isolation**: Prevents polluting the parent branch with broken code
- **Infra commits directly**: Seed/rebuild changes go straight to the parent branch
- **No blind retries**: If something fails, diagnose WHY before retrying
- **Commit after every turn**: Each agent commits its state changes before returning
- **Keycloak session awareness**: If QA agent reports auth errors, check gateway health and Keycloak status before retrying
- **NEVER use direct SQL to bypass steps**: QA agents must use REST APIs or browser UI for ALL operations — customer creation, lifecycle transitions, checklist completion, document uploads, time entries, invoices, member management. If an API step fails, log it as a gap. Do NOT work around it with SQL INSERT/UPDATE. Document uploads use the presigned-URL flow: `POST /api/projects/{id}/documents/upload-init` → `PUT` to S3 presigned URL → `POST /api/documents/{id}/confirm`. SQL shortcuts mask real bugs and defeat the purpose of QA.

name	qa-cycle-kc
description	Run an autonomous QA cycle against the Keycloak dev stack — dispatches QA, Product, Dev, and Infra subagents in a loop until the lifecycle scenario passes end-to-end. Uses dev ports (3000/8080/8443/8180) with real Keycloak authentication. Usage - /qa-cycle-kc <scenario-file> [gap-report] [--resume]

QA Cycle (Keycloak Dev Stack) — In-Session Orchestration

Environment — Keycloak Dev Stack (Local Services)

Services run locally in the background (not Docker). Only infrastructure runs via Docker Compose. Use compose/scripts/svc.sh to manage services:

bash compose/scripts/svc.sh start all              # Start backend, gateway, frontend, portal
bash compose/scripts/svc.sh restart backend         # Restart after Java changes
bash compose/scripts/svc.sh stop frontend portal    # Stop specific services
bash compose/scripts/svc.sh status                  # Health check all services
bash compose/scripts/svc.sh logs backend            # Last 50 lines of log

Service	How to Start	URL
Infra (Postgres, LocalStack, Mailpit, Keycloak)	`bash compose/scripts/dev-up.sh`	various
Backend	`svc.sh start backend` (or `SPRING_PROFILES_ACTIVE=local,keycloak ./mvnw spring-boot:run`)	http://localhost:8080
Frontend	`svc.sh start frontend` (or `NEXT_PUBLIC_AUTH_MODE=keycloak pnpm dev`)	http://localhost:3000
Gateway	`svc.sh start gateway` (or `./mvnw spring-boot:run` in gateway/)	http://localhost:8443
Portal	`svc.sh start portal` (or `pnpm dev` in portal/)	http://localhost:3002
Keycloak Bootstrap	`bash compose/scripts/keycloak-bootstrap.sh` (run once after first start)	—

Key URLs

Service	URL
Frontend	http://localhost:3000
Backend	http://localhost:8080
Gateway (BFF)	http://localhost:8443
Keycloak Admin	http://localhost:8180 (admin/admin)
Mailpit UI	http://localhost:8025
Mailpit API	http://localhost:8025/api/v1/

Platform Admin (pre-created by keycloak-bootstrap.sh)

User	Email	Password	Role
Platform Admin	padmin@docteams.local	password	platform-admin

Other users (org owners, members) are created through the product's onboarding flow — not pre-seeded.

Keycloak Login Flow (Playwright)

Authentication uses a multi-redirect OIDC flow. The keycloak-auth.ts fixture at frontend/e2e/fixtures/keycloak-auth.ts provides helper functions:

loginAs(page, email, password) — navigates to /dashboard, follows redirect to Keycloak login, fills form, waits for redirect back
loginAsPlatformAdmin(page) — shortcut for padmin
registerFromInvite(page, inviteLink, firstName, lastName, password) — follows KC invite link, fills registration form

Keycloak form selectors are centralized in frontend/e2e/fixtures/keycloak-selectors.ts (based on Keycloakify theme). If the theme changes, update that one file.

Onboarding Flow (How Orgs Are Created)

New user clicks "Get Started" → /request-access
Fills form: email, name, org name, country, industry → submits
Receives OTP via email (Mailpit) → enters OTP → request goes to PENDING
Platform admin logs in → /platform-admin/access-requests → approves
Approval triggers: Keycloak org creation → tenant schema provisioning → invite email
User clicks invite link from Mailpit → Keycloak registration page → sets password
User logs in → backend JIT syncs member → user becomes org owner
Owner invites members via Teams page (needs plan upgrade for >2 members)

Email Integration (Mailpit API)

The mailpit.ts helper at frontend/e2e/helpers/mailpit.ts provides:

clearMailbox() — delete all emails (call before test runs)
waitForEmail(recipient, { subject?, timeout? }) — polls until email arrives
extractOtp(email) — extracts 6-digit code from email body
extractInviteLink(email) — extracts Keycloak invite/registration URL

Running E2E Tests

# Run Keycloak onboarding + member invite tests
cd frontend && E2E_AUTH_MODE=keycloak npx playwright test keycloak/ --config e2e/playwright.config.ts --reporter=list

# Debug with headed browser
cd frontend && E2E_AUTH_MODE=keycloak npx playwright test keycloak/onboarding --config e2e/playwright.config.ts --headed

The E2E_AUTH_MODE=keycloak env var:

Sets Playwright to 60s timeout, 1 worker (serial)
Allows port 3000 navigation in the PreToolUse hook

Why In-Session (not bash script)

The bash script approach is terminal on failure. In-session orchestration gives you:

Error recovery: Inspect failures, adjust specs, retry
Adaptive flow: Skip non-blocking items, reorder priorities
Context preservation: You see all agent outputs and can carry lessons forward
No nesting issues: Agent tool works cleanly, no claude -p inside Claude

Arguments

<scenario-file> — path to the lifecycle script (e.g., tasks/phase47-lifecycle-script.md)
[gap-report] — optional path to pre-existing gap report
[--resume] — resume an existing cycle (skip branch/dir creation)

State Files

All cycle state lives in qa_cycle/ on the parent branch:

File	Purpose
`qa_cycle/status.md`	Shared tracker — all agents read/write this
`qa_cycle/fix-specs/{GAP_ID}.md`	Product writes, Dev reads
`qa_cycle/checkpoint-results/day-{NN}.md`	QA writes test results
`qa_cycle/error-log.md`	Docker log errors (manual check)

Orchestrator Rules

Stay lean: Do NOT read the scenario file, ARCHITECTURE.md, or CLAUDE.md subdirectory files. Subagents do that.
Read status.md between every turn: This is your decision input.
One agent at a time: Each agent turn is a blocking subagent call. No parallel agent turns within the same cycle.
Max 3 retries per fix: If a Dev fix fails 3 times, mark as STUCK in status.md and move on.
Max 20 cycles: If not ALL_DAYS_COMPLETE after 20 cycles, stop and summarize.
Commit between turns: Each agent should commit and push its changes before returning.

Step 0 — Setup (First Run Only, skip if --resume)

# Verify branch
BRANCH="bugfix_cycle_$(date +%Y-%m-%d)"
git checkout "$BRANCH" 2>/dev/null || git checkout -b "$BRANCH"

# Create directories
mkdir -p qa_cycle/fix-specs qa_cycle/checkpoint-results

# Verify status.md exists (must be pre-seeded or created by user)
test -f qa_cycle/status.md || echo "ERROR: qa_cycle/status.md not found"

If gap-report argument was provided, initialize status.md from it (extract gaps into tracker table). If status.md already exists, skip.

Step 1 — Decide Next Action

Read qa_cycle/status.md and determine the next action:

IF Dev Stack = "Not running" AND OPEN blockers tagged "Infra":
  → Infra Agent (seed fix + start stack)

ELIF NEEDS_REBUILD flag set:
  → Infra Agent (rebuild)

ELIF any SPEC_READY items exist:
  → Dev Agent (fix first SPEC_READY item)

ELIF any OPEN/REOPENED items exist AND QA is blocked:
  → Product Agent (triage OPEN items into SPEC_READY)

ELSE:
  → QA Agent (execute next day/checkpoint)

After each agent returns, go back to Step 1 (read status.md again, decide next action).
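
The decision order above can be sketched as a small shell function. This is only an illustration: the five yes/no inputs and the returned labels are hypothetical, and how each flag is derived from status.md is deliberately left out because the tracker's exact layout is not specified here.

```bash
#!/usr/bin/env bash
# Sketch of the Step 1 decision order. All five inputs are hypothetical
# "yes"/"no" flags that the orchestrator would derive from status.md.
decide_next_action() {
  local stack_down_with_infra_blocker="$1"  # Dev Stack "Not running" AND an OPEN Infra blocker
  local needs_rebuild="$2"                  # NEEDS_REBUILD flag set
  local has_spec_ready="$3"                 # at least one SPEC_READY item
  local has_open_or_reopened="$4"           # at least one OPEN/REOPENED item
  local qa_blocked="$5"                     # QA cannot proceed

  if [ "$stack_down_with_infra_blocker" = "yes" ]; then
    echo "infra-seed"
  elif [ "$needs_rebuild" = "yes" ]; then
    echo "infra-rebuild"
  elif [ "$has_spec_ready" = "yes" ]; then
    echo "dev"
  elif [ "$has_open_or_reopened" = "yes" ] && [ "$qa_blocked" = "yes" ]; then
    echo "product"
  else
    echo "qa"
  fi
}
```

Because the branches are checked in order, a pending SPEC_READY item is dispatched to Dev even when other items are still OPEN; Product only runs once nothing is ready for Dev and QA is blocked.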

Step 2 — Agent Dispatches

Infra Agent (Seed Fix / Rebuild)

Launch a blocking general-purpose subagent:

You are the **Infra Agent** for the QA cycle on branch `{BRANCH}`.

## Context
{IF seed fix: Read the infra-seed prompt at scripts/qa-cycle/prompts/infra-seed.md (if it exists)}
{IF rebuild: Rebuild the dev stack after Dev fixes have been merged.}

## Your Job
{IF first start: Start the Keycloak dev stack infra and verify local services are running.}
{IF rebuild: Restart specific local services after Dev fixes.}

## Service Management
Use `compose/scripts/svc.sh` to manage local services (background, PID-tracked, with health waits):

```bash
bash compose/scripts/svc.sh status              # Check health of all services
bash compose/scripts/svc.sh start all            # Start backend, gateway, frontend, portal
bash compose/scripts/svc.sh restart backend      # Restart after Java changes
bash compose/scripts/svc.sh stop frontend portal # Stop specific services
bash compose/scripts/svc.sh logs backend         # Last 50 lines of log
```

| Service | Port | Health Check |
|---------|------|--------------|
| Backend | 8080 | /actuator/health |
| Gateway | 8443 | /actuator/health |
| Frontend | 3000 | / |
| Portal | 3002 | / |

Docker infra (Postgres, Keycloak, Mailpit, LocalStack) is managed separately via bash compose/scripts/dev-up.sh.

Prerequisites Check

Verify Docker infra is running: bash compose/scripts/dev-up.sh
Check service health: bash compose/scripts/svc.sh status
Start any services that are down: bash compose/scripts/svc.sh start all

Starting the Stack (first time)

Start Docker infra: bash compose/scripts/dev-up.sh
Wait for Keycloak to be ready: curl -sf http://localhost:8180/realms/docteams
Run Keycloak bootstrap (creates platform admin): bash compose/scripts/keycloak-bootstrap.sh
Start local services: bash compose/scripts/svc.sh start all
If any service fails to start, check logs: bash compose/scripts/svc.sh logs {service}
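
The "wait for Keycloak to be ready" step can be sketched as a bounded polling loop. The helper name `wait_until` and the attempt and delay values are assumptions for illustration; only the curl probe in the usage comment comes from the steps above.

```bash
#!/usr/bin/env bash
# Hypothetical helper: re-run a command until it succeeds or attempts run out.
# Usage: wait_until <max_attempts> <delay_seconds> <command> [args...]
wait_until() {
  local max_attempts="$1" delay_seconds="$2"
  shift 2
  local attempt=1
  while [ "$attempt" -le "$max_attempts" ]; do
    # Discard the probe's output; only its exit status matters.
    if "$@" >/dev/null 2>&1; then
      return 0
    fi
    # Sleep between attempts, but not after the final one.
    if [ "$attempt" -lt "$max_attempts" ]; then
      sleep "$delay_seconds"
    fi
    attempt=$((attempt + 1))
  done
  return 1
}

# Example (not executed here): poll the realm endpoint for up to about a minute.
# wait_until 30 2 curl -sf http://localhost:8180/realms/docteams
```

A bounded loop fails loudly when Keycloak never comes up, so the bootstrap script is not run against a half-started container.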

NOTE: Org/user data is NOT pre-seeded. The QA lifecycle script's Day 0 exercises the real onboarding flow (access request → admin approval → Keycloak registration).

Rebuilding (after Dev fixes)

Restart the affected service: bash compose/scripts/svc.sh restart backend (or gateway/frontend/portal)
svc.sh will stop, restart, and wait for health check automatically.
If Docker infra changed: bash compose/scripts/dev-rebuild.sh {service}
Clear NEEDS_REBUILD from status.md.
When to restart: Backend/Gateway need restart after Java source changes. Frontend/Portal use HMR (auto-reload).

State File

Read and update: qa_cycle/status.md

Guard Rails

Commit directly to {BRANCH} (infra changes, not feature PRs)
Run backend tests if you change seeder code
Read backend/CLAUDE.md before making backend changes
If rebuild fails after 2 attempts, report the error and exit


### QA Agent

Launch a **blocking** `general-purpose` subagent:

You are the QA Agent for the QA cycle on branch {BRANCH}.

Your Job

Execute the lifecycle script via Playwright MCP against the Keycloak dev stack (http://localhost:3000). Record pass/fail for each checkpoint. Stop when you hit a blocker.

Before You Start

Read qa_cycle/status.md — check "QA Position" for where to resume.
Read the scenario file: {SCENARIO_FILE}
Skip to the day/checkpoint in QA Position.
Check which gaps are FIXED — verify those first.

Keycloak Authentication via Playwright

To log in as a user (e.g., Alice):

Navigate to http://localhost:3000/dashboard (or any protected route)
You will be redirected through the gateway to the Keycloak login page
Wait for the Keycloak login form to appear (look for #username or input[name="username"])
Fill in username: alice@example.com
Fill in password: password
Click the login button (#kc-login or input[type="submit"])
Wait for redirect back to the frontend — you should land on /org/acme-corp/dashboard or similar
Verify you see the authenticated UI (sidebar, user avatar, etc.)

To switch users:

Log out first (if the app has a sign-out button, click it; otherwise clear cookies)
Follow the login steps above with the new user's credentials

Available users:

| User | Email | Password | Role |
|------|-------|----------|------|
| Alice | alice@example.com | password | owner |
| Bob | bob@example.com | password | admin |
| Carol | carol@example.com | password | member |

Organization slug: acme-corp

Execution Rules

One day at a time. Complete all checkpoints before moving to next day.
Record every checkpoint: ID, Result (PASS/FAIL/PARTIAL), Evidence
On blocker: Stop. Log it. Exit. Do NOT skip ahead.
On non-cascading bug: Log it and continue.
Check console errors after each page navigation.
Take screenshots on failures for evidence.

Verifying Fixes

When resuming after Dev fixes:

Re-run the blocked checkpoint.
PASS → mark gap VERIFIED in status.md.
FAIL → mark gap REOPENED with new evidence.
Continue forward.

Writing Results

Write to qa_cycle/checkpoint-results/day-{NN}.md with checkpoint ID, result, evidence, gap ID.
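
As a rough sketch of the results file, the helpers below build the per-day path and append one row per checkpoint. The two-digit zero padding for `{NN}`, the pipe-separated row layout, the `-` placeholder for a missing gap ID, and both function names are assumptions, not a format this skill prescribes.

```bash
#!/usr/bin/env bash
# Hypothetical helper: build the results path for a given day number.
checkpoint_file() {
  # 10# forces base 10 so inputs such as "08" are not parsed as octal.
  printf 'qa_cycle/checkpoint-results/day-%02d.md\n' "$((10#$1))"
}

# Hypothetical helper: append one checkpoint row to a results file.
# Usage: record_checkpoint <file> <checkpoint-id> <PASS|FAIL|PARTIAL> <evidence> [gap-id]
record_checkpoint() {
  local file="$1" id="$2" result="$3" evidence="$4" gap_id="${5:--}"
  case "$result" in
    PASS|FAIL|PARTIAL) ;;
    *) echo "invalid result: $result" >&2; return 1 ;;
  esac
  mkdir -p "$(dirname "$file")"
  printf '| %s | %s | %s | %s |\n' "$id" "$result" "$evidence" "$gap_id" >> "$file"
}
```

Rejecting anything other than PASS, FAIL, or PARTIAL keeps the results file consistent with the execution rules listed earlier.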

Updating Status

Update "QA Position" to next unexecuted checkpoint.
New blockers → add row to Tracker (OPEN, severity, owner).
Verified fixes → FIXED → VERIFIED.
Reopened fixes → FIXED → REOPENED.
If all days complete → add ALL_DAYS_COMPLETE.
Add log entries.

Commit

Commit checkpoint results + status.md to {BRANCH} and push. Message: qa: Day {N} checkpoint results (cycle {CYCLE})

Do NOT fix issues yourself. Test and document only.

CRITICAL: No SQL Shortcuts

Do NOT use direct SQL queries (INSERT, UPDATE, DELETE) to create or modify data. ALL operations must go through the REST API or browser UI:

Customer creation: POST /api/customers
Lifecycle transitions: POST /api/customers/{id}/transition
Checklist completion: POST /api/checklists/{id}/items/{itemId}/complete
Document uploads: POST /api/projects/{id}/documents/upload-init → PUT to S3 → POST /api/documents/{id}/confirm
Time entries: POST /api/time-entries
Invoices: POST /api/invoices
Member management: POST /internal/members/sync

If an API step fails, log it as a gap; do NOT work around it with SQL.


### Product Agent

Launch a **blocking** `general-purpose` subagent:

You are the Product Agent for the QA cycle on branch {BRANCH}.

Your Job

Triage all OPEN/REOPENED items in qa_cycle/status.md. Write fix specifications that Dev agents can implement. Determine if bugs are cascading (escalate to blocker).

Before You Start

Read qa_cycle/status.md — focus on OPEN and REOPENED items.
Read qa_cycle/error-log.md for backend errors.
Read latest checkpoint results in qa_cycle/checkpoint-results/.
Read {GAP_REPORT} for background context (if provided).

Triage Rules

Blocker: QA cannot proceed. Next checkpoint depends on this.
Bug: Wrong but QA can work around it.
Cascading bug → blocker: Bug causes 2+ downstream failures. Escalate.
WONT_FIX: Requires new infra or days of work. Out of scope for this cycle.
Mark an item SPEC_READY only if it is fixable in < 2 hours of dev work.

Prioritize by QA Position

Fix blockers at the CURRENT QA day first. Don't spec Day 90 fixes when QA is stuck on Day 0.

Fix Spec Format

Write one file per item to qa_cycle/fix-specs/{GAP_ID}.md:

# Fix Spec: {GAP_ID} — {Summary}
## Problem
{2-3 sentences with evidence from QA checkpoint results}
## Root Cause (hypothesis)
{File paths, class names, method names — use grep to confirm}
## Fix
{Step-by-step: "Add X to Y", "Change Z from A to B". Include file paths.}
## Scope
Backend / Frontend / Both / Seed / Docker
Files to modify: {list}
Files to create: {list}
Migration needed: yes/no
## Verification
{Which checkpoint to re-run}
## Estimated Effort
S (< 30 min) / M (30 min - 2 hr) / L (> 2 hr)

Updating Status

Change triaged items: OPEN → SPEC_READY.
Escalate cascading bugs to blocker.
Add log entries.
Commit and push to {BRANCH}.

Key: Search the codebase before writing specs

Use grep/glob to confirm root cause hypotheses. Include actual file paths and line numbers.


### Dev Agent

Launch a **blocking** `general-purpose` subagent with `isolation: "worktree"`:

You are the Dev Agent for the QA cycle on branch {BRANCH}.

Your Fix

Read the fix spec at: qa_cycle/fix-specs/{GAP_ID}.md

Before You Start

Read the fix spec — it has problem, root cause, fix steps, file paths.
Read relevant CLAUDE.md (backend/CLAUDE.md and/or frontend/CLAUDE.md).
Check qa_cycle/status.md for context.

Workflow

1. Create Fix Branch

git checkout {BRANCH}
git pull origin {BRANCH}
git checkout -b fix/{GAP_ID}

2. Implement

Follow the fix spec steps exactly. Read files before editing. Keep changes minimal.

3. Reproduce-before-fix (CLAUDE.md §4 — mandatory)

4. Build & Verify (CLAUDE.md §1 — full verify is mandatory)

Targeted tests are for inner-loop iteration. The merge bar is a clean full verify. Don't ship without it.

Backend (if in scope):

cd backend
./mvnw spotless:apply 2>&1 | tail -3
./mvnw compile test-compile -q > /tmp/mvn-compile.log 2>&1     # quick gate
./mvnw test -Dtest='<your-targeted-class>' > /tmp/mvn-targeted.log 2>&1  # iterate
# THEN before PR:
./mvnw verify > /tmp/mvn-verify.log 2>&1                        # MANDATORY before PR
# If verify is green:
cat > .claude/markers/verify-backend.json <<EOF
{"commit":"$(git rev-parse --short HEAD)","command":"./mvnw verify","exit":0,"ts":"$(date -u +%Y-%m-%dT%H:%M:%SZ)","summary":"<test count from log>"}
EOF

Frontend (if in scope):

cd frontend
NODE_OPTIONS="" /opt/homebrew/bin/pnpm install > /dev/null 2>&1
NODE_OPTIONS="" /opt/homebrew/bin/pnpm run lint > /tmp/lint-fix.log 2>&1   # full lint
NODE_OPTIONS="" /opt/homebrew/bin/pnpm run build > /tmp/build-fix.log 2>&1 # full build
NODE_OPTIONS="" /opt/homebrew/bin/pnpm test > /tmp/test-fix.log 2>&1       # full vitest, NOT narrowed
# If green:
cat > .claude/markers/verify-frontend.json <<EOF
{"commit":"$(git rev-parse --short HEAD)","command":"pnpm run lint && pnpm run build && pnpm test","exit":0,"ts":"$(date -u +%Y-%m-%dT%H:%M:%SZ)","summary":"<test count>"}
EOF

Portal (if in scope): same pattern as Frontend, write verify-portal.json.

If any step fails: fix it, re-run from the top. Max 3 attempts before marking STUCK and exiting. Do NOT write a marker for a failing run.

5. Commit & Push

git add <specific files>                # ONLY the files for this fix — no scope creep
git commit -m "fix({GAP_ID}): {short description}"
git push -u origin fix/{GAP_ID}

6. Create PR

gh pr create --base {BRANCH} --title "Fix {GAP_ID}: {summary}" --body "..."

PR body MUST include:

Summary of the bug (with reproduction evidence: file:line, screenshot, log).
Root cause (verified, not hypothesized).
Files changed and why each one.
Verification results (mvnw verify test counts, lint/build/test outcomes).
Out-of-scope items (anything you noticed but did not fix).

7. Review (CLAUDE.md §2 — mandatory for agent-authored PRs)

Self-review is not enough. Either:

(a) Wait for CodeRabbit (if configured on the repo), or
(b) Dispatch a superpowers:code-reviewer subagent on the PR with framing "find the slop", or
(c) Stop and ask the user to review before merge.

Do NOT merge an agent-authored PR without an independent review pass.

8. Merge (gated by `.claude/hooks/pre-pr-merge-gate.sh`)

The merge-gate hook will block gh pr merge if:

The verify marker for any touched area (backend / frontend / portal) is missing or stale (>24h) or exit != 0.
The PR is not documentation-only.

If the hook blocks you, that means a marker is missing, stale, or records a failing run. Fix the verify, write the marker, and retry. Do NOT bypass with --admin or by editing the hook.

gh pr merge {PR_NUMBER} --squash --delete-branch
git checkout {BRANCH} && git pull origin {BRANCH}
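
Before calling `gh pr merge`, an agent could pre-check a marker along the lines of the gate conditions described above. This is a sketch only: the real `.claude/hooks/pre-pr-merge-gate.sh` is not shown in this document, the function name is hypothetical, and the sketch uses the file's modification time as a stand-in for the `ts` field and does not compare the recorded commit.

```bash
#!/usr/bin/env bash
# Hypothetical pre-check: succeeds only if the marker exists, records
# exit 0, and was modified within the last 24 hours.
marker_is_fresh() {
  local marker="$1"
  [ -f "$marker" ] || return 1
  # The recorded exit code must be exactly 0.
  grep -q '"exit":0[,}]' "$marker" || return 1
  # find prints the path only when the file is older than 1440 minutes,
  # so empty output means the marker is recent enough.
  [ -z "$(find "$marker" -mmin +1440 2>/dev/null)" ]
}

# Example (not executed here):
# marker_is_fresh .claude/markers/verify-backend.json || echo "re-run ./mvnw verify"
```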

9. Update Status (post-merge)

Use MERGED-AWAITING-VERIFY if behaviour was not end-to-end verified post-merge. Don't claim VERIFIED without observing the fix work in browser/Mailpit/DB.

Guard Rails (CLAUDE.md §1–§10)

These are NOT advice. Loopholes are forbidden. If a rule blocks you, raise it; don't bypass.

One fix per PR. Same-bug-class clusters (e.g. 3 dialogs with identical defect) only with explicit authorization.
Reproduce before fix. No diagnostic-by-spec. If you can't reproduce, the spec is wrong; report up.
Full verify is mandatory before PR, NOT targeted tests. The .claude/markers/verify-*.json files must exist and be current. The pre-merge hook will block merge without them.
Don't touch code outside the spec's scope. Scope expansion = halt and re-spec, not "while I was here."
Max 3 build attempts. Report failure (specific error) and exit STUCK if still broken. Don't band-aid.
If spec is wrong, exit STUCK with notes. Don't silently change scope or invent a different fix.
PASS means observed end-to-end (browser/log/Mailpit/DB). Inferred PASS is forbidden. Use DEFERRED or MERGED-AWAITING-VERIFY when behaviour is unverified.
Status reports are drafts. Write what you actually did. "Stream timed out, here's what got done" is correct. Inflated PASS claims are dishonest.
Pride and quality. Slow correct fix > fast broken fix. This is not a race.

Environment

Postgres host: b2mash.local:5432
LocalStack host: b2mash.local:4566
pnpm: /opt/homebrew/bin/pnpm
NODE_OPTIONS="" needed before pnpm commands
SHELL=/bin/bash prefix for docker build


**IMPORTANT**: If the Dev agent is dispatched with `isolation: "worktree"`, it already has an isolated copy. Adjust the branch/merge commands accordingly — the agent creates the fix branch from the worktree's HEAD, and the PR targets `{BRANCH}`.

If NOT using worktree isolation (e.g., for seed/infra fixes that commit directly to the parent branch), omit the `isolation` parameter.

## Step 3 — Error Recovery

After each agent returns, inspect the result:

| Situation | Action |
|-----------|--------|
| Agent succeeded | Read status.md, go to Step 1 |
| Dev build failed 3x | Mark gap as STUCK in status.md, move to next SPEC_READY item |
| QA found new blocker | Product Agent will triage it next cycle |
| Fix spec was wrong | Re-dispatch Product Agent to rewrite the spec |
| Infra rebuild failed | Check Docker logs manually, fix, retry once |
| Agent ran out of context | Resume with fresh subagent, pass status.md state as context |
| REOPENED after Dev fix | Increment retry counter; if 3rd reopen, mark STUCK |
| Keycloak login failed | Check Keycloak health, check `/etc/hosts`, check gateway session store |

### Retry Tracking

Keep a mental counter (or a note in the status.md log) of retries per gap.
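
One possible way to make the counter explicit is a plain append-only log with one line per retry. The `RETRY <gap-id>` line format, the separate log file, and the helper names are all assumptions for illustration; the threshold of 3 comes from the retry limit stated in the Orchestrator Rules.

```bash
#!/usr/bin/env bash
# Hypothetical helpers for per-gap retry tracking in an append-only log.

# Append one retry entry for a gap.
record_retry() {
  local gap_id="$1" log_file="$2"
  printf 'RETRY %s\n' "$gap_id" >> "$log_file"
}

# Print how many retries have been recorded for a gap (0 if none).
retry_count() {
  local gap_id="$1" log_file="$2"
  if [ ! -f "$log_file" ]; then
    echo 0
    return 0
  fi
  # -F -x matches the whole line literally, so GAP-01 does not match GAP-012.
  # grep -c exits non-zero on zero matches but still prints 0.
  grep -c -F -x "RETRY ${gap_id}" "$log_file" || true
}

# Succeeds once a gap has reached the 3-retry limit.
is_stuck() {
  [ "$(retry_count "$1" "$2")" -ge 3 ]
}
```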


## Step 4 — Cycle Summary

After each full cycle (QA → Product → Dev → optional Infra), log a summary:

Cycle {N} complete:

QA position: Day {X}, Checkpoint {Y}
Items fixed this cycle: {list}
Items stuck: {list}
Items remaining: {count}
Next action: {what Step 1 will dispatch}


## Step 5 — Completion

When `ALL_DAYS_COMPLETE` appears in status.md OR max cycles reached:

1. Read final status.md
2. Count: VERIFIED, FIXED, OPEN, STUCK, WONT_FIX
3. Report summary to user
4. If all days complete: suggest merging the bugfix branch to main
5. If max cycles: list remaining blockers and recommend next steps
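
The count in step 2 could be sketched with `grep`, assuming the tracker in status.md is a pipe-delimited table whose status values sit alone in a cell. That layout and the function names are assumptions; adjust the pattern to whatever the real tracker looks like.

```bash
#!/usr/bin/env bash
# Hypothetical helper: count tracker rows that have a cell equal to a status.
count_status() {
  local status="$1" status_file="$2"
  # Match "| STATUS |" with optional padding. Requiring the cell borders
  # keeps OPEN from matching REOPENED. grep -c still prints 0 on no match.
  grep -c -E "\|[[:space:]]*${status}[[:space:]]*\|" "$status_file" || true
}

# Print one "STATUS: count" line per status of interest.
summarize_statuses() {
  local status_file="$1" s
  for s in VERIFIED FIXED OPEN STUCK WONT_FIX; do
    printf '%s: %s\n' "$s" "$(count_status "$s" "$status_file")"
  done
}
```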

## Differences from /qa-cycle (E2E Mock-Auth)

| Aspect | /qa-cycle (E2E) | /qa-cycle-kc (Keycloak) |
|--------|-----------------|-------------------------|
| Frontend | http://localhost:3001 (Docker) | http://localhost:3000 (local pnpm dev) |
| Backend | http://localhost:8081 (Docker) | http://localhost:8080 (local mvnw) |
| Auth | Mock IDP (port 8090) | Keycloak (port 8180) via Gateway BFF (8443) |
| Login flow | Navigate to `/mock-login`, click Sign In | OIDC redirect → Keycloak login form → fill email/password |
| Services | All in Docker | Infra in Docker, services run locally in the background via `svc.sh` |
| Start | `e2e-up.sh` | `dev-up.sh` + `svc.sh start all` |
| Seed data | Docker seed container (automatic) | `keycloak-bootstrap.sh` (platform admin only). Orgs created through UI. |
| Postgres | localhost:5433, db: app | localhost:5432, db: docteams |
| Prerequisite | None | None (gateway runs locally, uses localhost:8180) |
| E2E fixtures | `e2e/fixtures/auth.ts` (mock) | `e2e/fixtures/keycloak-auth.ts` |
| Email helper | None | `e2e/helpers/mailpit.ts` (OTP + invite links) |

## Guardrails

- **Orchestrator stays lean**: Never read the scenario file, ARCHITECTURE.md, or CLAUDE.md subdirectories
- **State is in status.md**: All decisions derive from reading this file
- **Sequential agent turns**: One agent at a time, inspect result, then decide next
- **Dev uses worktree isolation**: Prevents polluting the parent branch with broken code
- **Infra commits directly**: Seed/rebuild changes go straight to the parent branch
- **No blind retries**: If something fails, diagnose WHY before retrying
- **Commit after every turn**: Each agent commits its state changes before returning
- **Keycloak session awareness**: If QA agent reports auth errors, check gateway health and Keycloak status before retrying
- **NEVER use direct SQL to bypass steps**: QA agents must use REST APIs or browser UI for ALL operations — customer creation, lifecycle transitions, checklist completion, document uploads, time entries, invoices, member management. If an API step fails, log it as a gap. Do NOT work around it with SQL INSERT/UPDATE. Document uploads use the presigned-URL flow: `POST /api/projects/{id}/documents/upload-init` → `PUT` to S3 presigned URL → `POST /api/documents/{id}/confirm`. SQL shortcuts mask real bugs and defeat the purpose of QA.
