
agent-verify-workflows

Stars: 3

Forks: 1

Last updated: 16 April 2026 at 22:22

Explicit protocol for a management agent to verify web workflows after a subagent completes a task, without writing Playwright or browser automation. Trigger only when the user explicitly asks to verify workflows, run a named flow, or mentions agent-verify / verify-workflows / page-objects-for-agents. Do NOT trigger on general web app work, testing discussions, or code review.


Source

robert-chiniquy

robert-chiniquy/dotfiles




Agent Verify Workflows

A contract for a management agent to verify web workflows owned by an app, without writing browser automation. The app ships a manifest of named flows and a single runner command. The manager discovers, invokes, and interprets.

When to use

Invoke explicitly when:

A subagent (Squire, Haiku, etc.) reports a task complete and the tracked issue lists flows to verify.
The user asks to verify a specific named flow.
A deployment needs confirmation that it still passes its declared flows.

Do not invoke:

For general "does the code work" questions.
As a substitute for unit or integration tests.
When no manifest exists in the target repo.

Pattern shape

The app being verified owns three things:

1. .verify/workflows.yaml: a manifest listing named flows.
2. A runner command template that takes a flow name as input.
3. Whatever machinery actually executes flows (Playwright, HTTP, curl).

The manager only ever reads (1), executes (2), and parses the JSON that (3) prints to stdout.

Manifest schema

.verify/workflows.yaml:

version: 1

# Command template the manager invokes. {flow} is replaced with the flow name.
# Must print a single JSON object to stdout and exit 0 on pass / non-zero on fail.
runner_command: "npm run verify -- {flow}"

# Optional default base URL. Runner may also read VERIFY_BASE_URL from env.
base_url: "http://localhost:3000"

flows:
  - name: checkout-happy-path
    description: Anonymous user adds an item to cart and reaches checkout.
    timeout_s: 60
    tags: [critical, checkout]

  - name: auth-login
    description: Valid credentials route to the dashboard.
    timeout_s: 30
    tags: [critical, auth]
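
A manager could sanity-check the manifest before trusting it, along the lines of the sketch below. It assumes the YAML has already been parsed into a plain dict by whatever YAML library is available. The helper name `validate_manifest` and the exact checks are illustrative and not part of the contract; in particular, treating timeout_s as optional is an assumption.

```python
from typing import Any


def validate_manifest(manifest: dict[str, Any]) -> list[str]:
    """Return a list of problems found in a parsed manifest (empty if none).

    Hypothetical helper: the checks mirror the schema shown above.
    """
    problems: list[str] = []

    if manifest.get("version") != 1:
        problems.append("unsupported or missing version")

    # The template must contain {flow} so a flow name can be substituted in.
    command = manifest.get("runner_command")
    if not isinstance(command, str) or "{flow}" not in command:
        problems.append("runner_command must be a string containing {flow}")

    seen: set[str] = set()
    for flow in manifest.get("flows") or []:
        name = flow.get("name")
        if not name:
            problems.append("flow without a name")
        elif name in seen:
            problems.append(f"duplicate flow name: {name}")
        else:
            seen.add(name)
        # Assumption: timeout_s may be omitted, but if present must be positive.
        timeout = flow.get("timeout_s")
        if timeout is not None and (not isinstance(timeout, int) or timeout <= 0):
            problems.append(f"{name}: timeout_s must be a positive integer")

    return problems
```

An empty return value means the manifest has the expected shape. It says nothing about whether the flows are implemented, which is the runner's job to report.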

Discovery via `--list`

The runner is the authoritative source for available flows. The manager substitutes --list in place of {flow} in runner_command and invokes:

npm run verify -- --list

Stdout:

{
  "flows": [
    {
      "name": "checkout-happy-path",
      "description": "Anonymous user adds an item to cart and reaches checkout.",
      "timeout_s": 60,
      "tags": ["critical", "checkout"],
      "implemented": true
    }
  ]
}

implemented: false means the flow is declared in the manifest but has no backing code. Surface this as a setup error — do not attempt to run.

Prefer --list over parsing the manifest directly. The manifest YAML is a declaration; the runner is the interface.
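
The substitution and the filtering on implemented can be sketched as follows. Both function names are hypothetical. Splitting the template into arguments before substituting is a design choice of this sketch, made so that a flow name can never be re-split into extra arguments.

```python
import json
import shlex


def build_command(runner_command: str, flow_arg: str) -> list[str]:
    """Split the template into argv, then substitute {flow} in each part."""
    return [part.replace("{flow}", flow_arg) for part in shlex.split(runner_command)]


def runnable_flows(list_stdout: str) -> dict[str, dict]:
    """Map flow name -> flow entry for flows reporting implemented: true."""
    listing = json.loads(list_stdout)
    return {
        flow["name"]: flow
        for flow in listing["flows"]
        if flow.get("implemented") is True
    }
```

With the manifest above, `build_command("npm run verify -- {flow}", "--list")` yields the argv for the discovery call, and any flow missing from the `runnable_flows` result should be surfaced as a setup error rather than run.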

Runner output contract

The command specified by runner_command MUST print one JSON object to stdout. Logs, progress, and browser chatter go to stderr.

{
  "flow": "checkout-happy-path",
  "status": "fail",
  "duration_ms": 4320,
  "steps": [
    { "name": "navigate-home",       "status": "pass", "duration_ms": 820 },
    { "name": "add-to-cart",         "status": "pass", "duration_ms": 1200 },
    { "name": "assert-cart-count-1", "status": "fail", "duration_ms": 50,
      "error": "expected 1, got 0" }
  ],
  "artifacts": {
    "screenshot": "/tmp/verify/checkout-happy-path.png",
    "url_at_failure": "http://localhost:3000/cart"
  }
}

Exit 0 = pass, non-zero = fail.
Exit code and status must agree. If they disagree, treat as failure.
If stdout is not parseable as JSON, treat as failure and report raw stderr.
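
These three rules fit in one small function. The sketch below is one possible reading of them; the `Verdict` type and the `interpret` name are hypothetical. A missing or unknown status is treated as a disagreement with the exit code, which is an assumption this sketch makes, not something the contract states.

```python
import json
from dataclasses import dataclass
from typing import Optional


@dataclass
class Verdict:
    passed: bool
    reason: str
    first_failed_step: Optional[dict] = None


def interpret(stdout: str, stderr: str, exit_code: int) -> Verdict:
    """Apply the contract: parse stdout, then require exit code and status to agree."""
    try:
        result = json.loads(stdout)
    except json.JSONDecodeError:
        # Unparseable (or empty) stdout is a failure; report raw stderr.
        return Verdict(False, f"stdout is not JSON; stderr was: {stderr.strip()}")
    if not isinstance(result, dict):
        return Verdict(False, "stdout JSON is not an object")

    status = result.get("status")
    failed = next((s for s in result.get("steps", []) if s.get("status") == "fail"), None)

    if exit_code == 0 and status == "pass":
        return Verdict(True, "pass")
    if exit_code != 0 and status == "fail":
        return Verdict(False, "fail", failed)
    # Any other combination (including a missing status) counts as failure.
    return Verdict(False, f"exit code {exit_code} disagrees with status {status!r}", failed)
```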

Manager procedure

Given a target repo directory $DIR and a flow name $FLOW:

Read $DIR/.verify/workflows.yaml to obtain runner_command. If the manifest is missing, abort — this skill does not apply to the target.
Discover flows: substitute --list for {flow} in runner_command, execute with cwd=$DIR, parse stdout JSON.
Confirm $FLOW exists in the listed flows AND has implemented: true.
Substitute {flow} in runner_command with $FLOW.
Execute with cwd=$DIR. Capture stdout, stderr, and exit code. Enforce the flow's timeout_s.
Parse stdout as JSON.
Report pass/fail plus the first failing step's error to the user or the upstream issue.

Do not layer logic on top of the runner. If the runner's output is malformed, surface that as the failure — do not attempt to infer success from partial output.
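
Put together, the procedure might look like the sketch below. It assumes runner_command has already been read from the manifest, so YAML parsing is left out. The function names, the 30-second limit on the --list call, and the 60-second fallback when a flow has no timeout_s are all assumptions of this sketch.

```python
import json
import shlex
import subprocess


def run_runner(directory: str, runner_command: str, flow_arg: str, timeout_s: float):
    """Run the runner once in the target repo; return (exit_code, stdout, stderr)."""
    argv = [part.replace("{flow}", flow_arg) for part in shlex.split(runner_command)]
    proc = subprocess.run(
        argv, cwd=directory, capture_output=True, text=True, timeout=timeout_s
    )
    return proc.returncode, proc.stdout, proc.stderr


def verify_flow(directory: str, runner_command: str, flow: str) -> dict:
    """Discover, check, run, and interpret a single flow."""
    # Discover flows via --list (assumed 30 s limit for the listing call).
    try:
        _, out, err = run_runner(directory, runner_command, "--list", 30)
        listed = {f["name"]: f for f in json.loads(out)["flows"]}
    except subprocess.TimeoutExpired:
        return {"passed": False, "reason": "--list timed out"}
    except (ValueError, KeyError, TypeError):
        return {"passed": False, "reason": "unparseable --list output", "stderr": err}

    # The flow must be listed and have implemented: true.
    entry = listed.get(flow)
    if entry is None or entry.get("implemented") is not True:
        return {"passed": False, "reason": "setup error: flow missing or not implemented"}

    # Run the flow under its own timeout (assumed 60 s fallback).
    try:
        code, out, err = run_runner(directory, runner_command, flow, entry.get("timeout_s", 60))
    except subprocess.TimeoutExpired:
        return {"passed": False, "reason": "timed out"}

    # Malformed output is itself the failure; nothing is inferred from it.
    try:
        result = json.loads(out)
        status = result["status"]
    except (ValueError, KeyError, TypeError):
        return {"passed": False, "reason": "malformed runner output", "stderr": err}

    # Pass only when the exit code and the status field both say pass.
    failed = next((s for s in result.get("steps", []) if s.get("status") == "fail"), None)
    return {
        "passed": code == 0 and status == "pass",
        "reason": status,
        "first_failed_step": failed,
    }
```

The returned dict carries only the verdict and the first failing step, which is all the manager is meant to report.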

Issue-tracker integration

An issue's description may declare which flows to verify after the issue is resolved. Recommended convention, readable by any tracker:

...task description...

Verify:
- checkout-happy-path
- auth-login

When the subagent reports task complete:

Read the issue (bd show <id>, or equivalent).
Parse the Verify: section for flow names.
For each flow, run the manager procedure above.
If any flow fails, post the failure JSON as a comment and leave the issue open. If all pass, close the issue.
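
Extracting flow names from an issue description that follows this convention could be as simple as the sketch below. The function name is hypothetical, and ending the section at the first non-blank line that is not a bullet is an assumption, since the convention does not say how the section terminates.

```python
def parse_verify_section(description: str) -> list[str]:
    """Collect flow names from the bullet list under a 'Verify:' line."""
    flows: list[str] = []
    in_section = False
    for raw in description.splitlines():
        line = raw.strip()
        if line == "Verify:":
            in_section = True
        elif in_section and line.startswith("- "):
            flows.append(line[2:].strip())
        elif in_section and line:
            # Assumption: any other non-blank line ends the section.
            break
    return flows
```

An empty result means the issue declares no flows, in which case there is nothing for this skill to verify.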

Page Objects (reference implementation)

Adopters may implement the runner in any language. The reference in reference/ uses TypeScript + Playwright with the classic Page Object pattern to keep flows readable and selectors stable.

.verify/
├── workflows.yaml
├── package.json
└── src/
    ├── runner.ts          # dispatch by flow name, emit JSON
    ├── pages/             # one class per page/screen; selectors live here
    │   └── CheckoutPage.ts
    └── flows/             # one module per flow; drives page objects
        └── checkout-happy-path.ts

The page object pattern matters because the manager agent never sees selectors or DOM. It only sees named steps. Selectors live in page objects, intent lives in flows, and the manager sees results.
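
To show how named steps become the JSON the manager sees, here is a language-neutral sketch of the runner side, written in Python rather than the reference's TypeScript. It is not the reference implementation: `run_flow` is a hypothetical helper, each step is a plain callable that would normally call into a page object, and stopping at the first failing step is an assumption.

```python
import time
from typing import Callable


def run_flow(flow: str, steps: list[tuple[str, Callable[[], None]]]) -> dict:
    """Run named steps in order and build a result in the runner output shape."""
    started = time.monotonic()
    results = []
    status = "pass"
    for name, action in steps:
        step_started = time.monotonic()
        entry = {"name": name, "status": "pass"}
        try:
            action()
        except Exception as exc:  # a failed assertion or a page error
            entry["status"] = "fail"
            entry["error"] = str(exc)
            status = "fail"
        entry["duration_ms"] = round((time.monotonic() - step_started) * 1000)
        results.append(entry)
        if status == "fail":
            break  # assumption: later steps are skipped after a failure
    return {
        "flow": flow,
        "status": status,
        "duration_ms": round((time.monotonic() - started) * 1000),
        "steps": results,
    }
```

A real runner would print this dict as JSON to stdout, send everything else to stderr, and exit non-zero when status is "fail".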

Common Mistakes

Writing Playwright in the manager's context. The manager never spawns a browser. It invokes the runner and parses JSON. If you catch yourself driving a DOM from the manager, stop — the runner is the seam.
Auto-triggering this skill. The description is narrow by design. If the user did not explicitly ask for workflow verification, do not invoke.
Inferring success from logs. The runner's exit code and JSON status field are the only sources of truth. Treat everything else as noise.
Adding assertions in the manager. Assertions belong inside flows, in the runner. The manager's only judgment is pass/fail + which step failed.
Synthesizing flows on the fly. Flows are additions to the app repo, not manager-time inventions. If the user wants to verify something not in the manifest, ask them to add a flow — do not improvise one.
Mixing stdout and stderr. JSON to stdout, everything human to stderr. Mixing breaks the parse.
Trusting status without checking exit code. A runner that crashes before printing a result can leave stdout blank with a non-zero exit. Check both; either signals failure.
