Run any Skill in Manus with one click

agent-work-auditor

Stars54

Forks3

UpdatedMay 28, 2026 at 20:01

Unified auditing skill for AI agent workflows. Provides change-type-aware, artifact-adaptive auditing with self-fix capabilities. Works standalone or with spec-driven extensions. Three-layer architecture: core (always active), modules (per-type), extensions (auto-detected).

Installation

Install with Codex or Claude Copy this prompt, paste it into Codex, Claude, or another assistant, and let it review the skill page and install it for you.

Run Skill in Manus

Source

lindoelio

lindoelio/spec-driven-steroids

View GitHub Repository View Creator Repositories

Download

Run Skill in Manus

Related occupationsSOC

Based on SOC occupation classification

Software Quality Assurance Analysts and TestersComputer and Mathematical Occupations·SOC 15-1253

File Explorer

35 files

SKILL.md

readonly

More from this repository

same repository

spec-driven-shared-protocol

lindoelio/spec-driven-steroids

Shared protocols and templates for Spec-Driven phases. Auto-loaded by phase skills via references.

2026-06-0654

spec-driven-task-decomposer

lindoelio/spec-driven-steroids

Use this skill when approved requirements and design need to be decomposed into tasks.md for Phase 3 of a Spec-Driven change. It creates atomic, traceable implementation and testing tasks, validates the plan, and should not be used to design architecture or write implementation code.

2026-06-0654

spec-driven-task-implementer

lindoelio/spec-driven-steroids

Use this skill when an approved Spec-Driven change has reached Phase 4 and the user wants a task, phase, or full feature implemented from .specs/changes/<slug>. It executes tasks in order, updates task status immediately, verifies each task before completion, and should not be used before implementation approval.

2026-06-0654

spec-driven-requirements-writer

lindoelio/spec-driven-steroids

Use this skill when the user wants to start Phase 1 of a Spec-Driven change, define behavior, or turn a feature idea into a requirements.md file. It writes EARS-format requirements, validates them, and should not be used for design, task breakdown, or implementation.

2026-05-2854

spec-driven-technical-designer

lindoelio/spec-driven-steroids

Use this skill when approved requirements need to be translated into a design.md file for Phase 2 of a Spec-Driven change. It creates a traceable technical design with validator-safe Mermaid diagrams and should not be used for task breakdown or implementation.

2026-05-2854

contextual-stewardship

lindoelio/spec-driven-steroids

Use this skill when the user makes a technical decision, establishes a new pattern, defines business rules, or explicitly asks to remember or save a guideline. Also use this skill when you are about to implement a feature, write code, plan an architecture, or make a technical decision - you MUST retrieve contextual memory first to follow established patterns. Acts as a Staff Engineer to extract, curate, and persist architectural decisions, business rules, and workflows into long-term memory using a JSON graph store.

2026-05-2854

name	agent-work-auditor
description	Unified auditing skill for AI agent workflows. Provides change-type-aware, artifact-adaptive auditing with self-fix capabilities. Works standalone or with spec-driven extensions. Three-layer architecture: core (always active), modules (per-type), extensions (auto-detected).

agent-work-auditor

A unified, modular auditing framework for AI agent workflows. Provides adversarial audits designed to find gaps and block premature approval. Every audit must switch the agent from author mode to critic mode and force specific, named weaknesses to be surfaced.

Activation

Load this skill when:

User asks to audit code, a PR, or a change set
Spec-driven Code Review requires auditing
User mentions "are you sure?" after producing work
User wants verification of completeness or correctness

Three-Layer Architecture

Layer 1: Core (Always Active)

Universal dimensions evaluated for every audit:

Dimension	Question	Auto-Fix
Completeness	Is everything present?	Yes
Correctness	Is it right?	Yes
Consistency	Does it fit patterns?	Yes
Traceability	Can we trace it?	Partial
Safety	Could it cause harm?	No
Maintainability	Will future devs thank us?	Partial
Rigorous Against Prompt/Spec	Does it match the original intent?	Partial

Layer 2: Change-Type Modules (Per-Invocation)

Loaded based on detected change type:

Type	Posture	Focus
feat	Thorough	Design, scalability
fix	Focused	Bug repro, verification
hotfix	Compressed	Scope min, correctness
refactor	Rigorous	Behavioral parity
migrate	Cautious	Backward compat, schema
docs	Lightweight	Accuracy

Layer 3: Context Extensions (Auto-Detected)

Activated when relevant context is detected:

spec-driven — When .specs/ or spec artifacts found
(Extensible for future context types)

Invocation

Input Parameters

Parameter	Required	Default	Description
`artifact`	Yes	—	Path(s) to audit
`changeType`	No	auto-detect	feat, fix, hotfix, refactor, migrate, docs, general
`mode`	No	standard	quick, standard, thorough
`extensions`	No	auto	Array of extension names

Invocation Patterns

# Explicit type
"Audit this PR using agent-work-auditor, type:feat"

# Quick check
"Quick audit on these changes"

# Spec-driven (auto-loads spec-driven extension)
"Validate the spec implementation"

# Combined types
"Audit this refactor+fix"

Audit Workflow

1. DETECT change type
   └─ Explicit tag → Branch scan → Commit scan → Heuristic → Fallback

2. DETECT artifact type
   └─ code | specification | design | tasks | mixed

3. GATHER context
   └─ Style guide, conventions, project docs

4. ACTIVATE layers
   └─ Core (always) → Type module → Extensions (if detected)

5. EVALUATE dimensions
   └─ Universal (7) → Type-specific → Extension-specific

6. CLASSIFY findings
   └─ Severity (blocking, warning, info)
   └─ Fixability (direct-fix, author-required, informational)

7. SELF-FIX LOOP
   └─ Pass 1: Apply direct-fix → Re-review
   └─ Pass 2: Apply direct-fix → Re-review
   └─ Escalate remaining to author-required

8. OUTPUT
   └─ Markdown report + JSON (machine consumption)

Finding Classification

Severity Axis

Level	Label	Blocks Approval?
Blocking	Must Fix	Yes
Warning	Should Fix	No
Info	FYI	No

Fixability Axis

Level	Label	Action
direct-fix	Auto-Fix	Agent applies immediately
author-required	Human Decision	Author must resolve
informational	Mentoring	FYI only

Prefixes

No prefix for blocking findings
Nit: for non-blocking polish suggestions
Mentoring: for educational comments

Self-Fix Loop

For each direct-fix finding:
- Apply the fix autonomously
- Re-review to verify no new issues
- If new blocking issues: rollback, mark author-required
Max 3 passes to prevent infinite loops
After max passes, escalate remaining direct-fix to author-required

Output Format

Markdown Report

# Audit Report — {change-type}

**Artifact:** {path}
**Timestamp:** {ISO-8601}
**Verdict:** Approve | Request Changes | Approval with Notes

## Summary
{2-3 sentence assessment}

## Direct Fixes Applied
| Finding | Fix Applied | Status |
|---------|-------------|--------|
| ... | ... | fixed |

## Blocking Findings
| Finding | Decision Needed | Status |
|---------|---------------|--------|
| ... | ... | pending |

## Nit Findings
| Finding | Suggestion | Status |
|---------|------------|--------|
| ... | ... | fixed |

## Traceability Matrix (spec-driven only)
| REQ | DES | TASK | Coverage |
|-----|-----|------|----------|

JSON Output

{
  "artifact": "{path}",
  "changeType": "{type}",
  "timestamp": "{ISO-8601}",
  "verdict": "Approve | Request Changes | Approval with Notes",
  "dimensions": {
    "{dimension}": {
      "score": 1-5,
      "findings": []
    }
  },
  "findings": [
    {
      "severity": "blocking | warning | info",
      "fixability": "direct-fix | author-required | informational",
      "title": "...",
      "description": "...",
      "fix": "...",
      "status": "fixed | pending | escalated"
    }
  ],
  "summary": "..."
}

Verdict Rules

Approve — No blocking findings remain.
Approval with Notes — No blocking findings, non-blocking concerns documented.
Request Changes — Blocking findings remain.

Context Detection

Detection happens in priority order:

Explicit config (auditor.json) — apply configured defaults
Directory scan (.specs/) — auto-load spec-driven extension
File scan (requirements.md, design.md, tasks.md) — auto-load spec-driven
Package scan (package.json, Cargo.toml) — domain inference
Fallback — general-purpose audit

Portability

Zero external dependencies
Works standalone in any project
Optional auditor.json for configuration

Reference Loading

Load reference files on demand:

File	When
`dimensions/*.md`	During dimension evaluation
`modules/{type}.md`	After type detection
`extensions/spec-driven.md`	When `.specs/` detected
`references/migration/*`	When migrate type detected
`references/finding-severity.md`	During finding classification
`references/output-format.md`	During output generation

Migration Auditing

When migrate type is detected:

Phase 0 (Gating): Codebase Inventory must complete before requirements
Eight Dimensions: DIM-1 through DIM-8 with 1-5 scoring
Prove-It Challenges: Agent must demonstrate comprehension
Approval Threshold: ALL 8 dimensions must score 5+ (any <5 blocks)

See references/migration/ for detailed migration auditing.