// "Comprehensive truth scoring, code quality verification, and automatic rollback system with 0.95 accuracy threshold for ensuring high-quality agent outputs and codebase reliability."
| name | Verification & Quality Assurance |
| description | Comprehensive truth scoring, code quality verification, and automatic rollback system with 0.95 accuracy threshold for ensuring high-quality agent outputs and codebase reliability. |
| version | 2.0.0 |
| category | quality-assurance |
| tags | ["verification","truth-scoring","quality","rollback","metrics","ci-cd"] |
This skill provides a comprehensive verification and quality assurance system that ensures code quality and correctness through:
claude-flow)# View current truth scores
claude-flow truth
# Run verification check
claude-flow verify check
# Verify specific file with custom threshold
claude-flow verify check --file src/app.js --threshold 0.98
# Rollback last failed verification
claude-flow verify rollback --last-good
Display comprehensive quality and reliability metrics for your codebase and agent tasks.
Basic Usage:
# View current truth scores (default: table format)
claude-flow truth
# View scores for specific time period
claude-flow truth --period 7d
# View scores for specific agent
claude-flow truth --agent coder --period 24h
# Find files/tasks below threshold
claude-flow truth --threshold 0.8
Output Formats:
# Table format (default)
claude-flow truth --format table
# JSON for programmatic access
claude-flow truth --format json
# CSV for spreadsheet analysis
claude-flow truth --format csv
# HTML report with visualizations
claude-flow truth --format html --export report.html
Real-time Monitoring:
# Watch mode with live updates
claude-flow truth --watch
# Export metrics automatically
claude-flow truth --export .claude-flow/metrics/truth-$(date +%Y%m%d).json
Example dashboard output:
š Truth Metrics Dashboard
āāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāāā
Overall Truth Score: 0.947 ā
Trend: āļø +2.3% (7d)
Top Performers:
verification-agent 0.982 ā
code-analyzer 0.971 ā
test-generator 0.958 ā
Needs Attention:
refactor-agent 0.821 ā ļø
docs-generator 0.794 ā ļø
Recent Tasks:
task-456 0.991 ā
"Implement auth"
task-455 0.967 ā
"Add tests"
task-454 0.743 ā "Refactor API"
Truth Scores (0.0-1.0):
1.0-0.95: Excellent ā (production-ready)0.94-0.85: Good ā
(acceptable quality)0.84-0.75: Warning ā ļø (needs attention)<0.75: Critical ā (requires immediate action)Trend Indicators:
Statistics:
Execute comprehensive verification checks on code, tasks, or agent outputs.
File Verification:
# Verify single file
claude-flow verify check --file src/app.js
# Verify directory recursively
claude-flow verify check --directory src/
# Verify with auto-fix enabled
claude-flow verify check --file src/utils.js --auto-fix
# Verify current working directory
claude-flow verify check
Task Verification:
# Verify specific task output
claude-flow verify check --task task-123
# Verify with custom threshold
claude-flow verify check --task task-456 --threshold 0.99
# Verbose output for debugging
claude-flow verify check --task task-789 --verbose
Batch Verification:
# Verify multiple files in parallel
claude-flow verify batch --files "*.js" --parallel
# Verify with pattern matching
claude-flow verify batch --pattern "src/**/*.ts"
# Integration test suite
claude-flow verify integration --test-suite full
The verification system evaluates:
Code Correctness
Best Practices
Security
Performance
Documentation
# Get structured JSON output
claude-flow verify check --json > verification.json
# Example JSON structure:
{
"overallScore": 0.947,
"passed": true,
"threshold": 0.95,
"checks": [
{
"name": "code-correctness",
"score": 0.98,
"passed": true
},
{
"name": "security",
"score": 0.91,
"passed": false,
"issues": [...]
}
]
}
Automatically revert changes that fail verification checks.
Basic Rollback:
# Rollback to last known good state
claude-flow verify rollback --last-good
# Rollback to specific commit
claude-flow verify rollback --to-commit abc123
# Interactive rollback with preview
claude-flow verify rollback --interactive
Smart Rollback:
# Rollback only failed files (preserve good changes)
claude-flow verify rollback --selective
# Rollback with automatic backup
claude-flow verify rollback --backup-first
# Dry-run mode (preview without executing)
claude-flow verify rollback --dry-run
Rollback Performance:
Create detailed verification reports with metrics and visualizations.
Report Formats:
# JSON report
claude-flow verify report --format json
# HTML report with charts
claude-flow verify report --export metrics.html --format html
# CSV for data analysis
claude-flow verify report --format csv --export metrics.csv
# Markdown summary
claude-flow verify report --format markdown
Time-based Reports:
# Last 24 hours
claude-flow verify report --period 24h
# Last 7 days
claude-flow verify report --period 7d
# Last 30 days with trends
claude-flow verify report --period 30d --include-trends
# Custom date range
claude-flow verify report --from 2025-01-01 --to 2025-01-31
Report Content:
Run interactive web-based verification dashboard with real-time updates.
# Launch dashboard on default port (3000)
claude-flow verify dashboard
# Custom port
claude-flow verify dashboard --port 8080
# Export dashboard data
claude-flow verify dashboard --export
# Dashboard with auto-refresh
claude-flow verify dashboard --refresh 5s
Dashboard Features:
Set verification preferences in .claude-flow/config.json:
{
"verification": {
"threshold": 0.95,
"autoRollback": true,
"gitIntegration": true,
"hooks": {
"preCommit": true,
"preTask": true,
"postEdit": true
},
"checks": {
"codeCorrectness": true,
"security": true,
"performance": true,
"documentation": true,
"bestPractices": true
}
},
"truth": {
"defaultFormat": "table",
"defaultPeriod": "24h",
"warningThreshold": 0.85,
"criticalThreshold": 0.75,
"autoExport": {
"enabled": true,
"path": ".claude-flow/metrics/truth-daily.json"
}
}
}
Adjust verification strictness:
# Strict mode (99% accuracy required)
claude-flow verify check --threshold 0.99
# Lenient mode (90% acceptable)
claude-flow verify check --threshold 0.90
# Set default threshold
claude-flow config set verification.threshold 0.98
Per-environment thresholds:
{
"verification": {
"thresholds": {
"production": 0.99,
"staging": 0.95,
"development": 0.90
}
}
}
GitHub Actions:
name: Quality Verification
on: [push, pull_request]
jobs:
verify:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v3
- name: Install Dependencies
run: npm install
- name: Run Verification
run: |
claude-flow verify check --json > verification.json
- name: Check Truth Score
run: |
score=$(jq '.overallScore' verification.json)
if (( $(echo "$score < 0.95" | bc -l) )); then
echo "Truth score too low: $score"
exit 1
fi
- name: Upload Report
uses: actions/upload-artifact@v3
with:
name: verification-report
path: verification.json
GitLab CI:
verify:
stage: test
script:
- claude-flow verify check --threshold 0.95 --json > verification.json
- |
score=$(jq '.overallScore' verification.json)
if [ $(echo "$score < 0.95" | bc) -eq 1 ]; then
echo "Verification failed with score: $score"
exit 1
fi
artifacts:
paths:
- verification.json
reports:
junit: verification.json
Run verification automatically during swarm operations:
# Swarm with verification enabled
claude-flow swarm --verify --threshold 0.98
# Hive Mind with auto-rollback
claude-flow hive-mind --verify --rollback-on-fail
# Training pipeline with verification
claude-flow train --verify --threshold 0.99
Enable real-time verification during collaborative development:
# Pair with verification
claude-flow pair --verify --real-time
# Pair with custom threshold
claude-flow pair --verify --threshold 0.97 --auto-fix
Monitor codebase continuously during development:
# Watch directory for changes
claude-flow verify watch --directory src/
# Watch with auto-fix
claude-flow verify watch --directory src/ --auto-fix
# Watch with notifications
claude-flow verify watch --notify --threshold 0.95
Send metrics to external monitoring systems:
# Export to Prometheus
claude-flow truth --format json | \
curl -X POST https://pushgateway.example.com/metrics/job/claude-flow \
-d @-
# Send to DataDog
claude-flow verify report --format json | \
curl -X POST "https://api.datadoghq.com/api/v1/series?api_key=${DD_API_KEY}" \
-H "Content-Type: application/json" \
-d @-
# Custom webhook
claude-flow truth --format json | \
curl -X POST https://metrics.example.com/api/truth \
-H "Content-Type: application/json" \
-d @-
Automatically verify before commits:
# Install pre-commit hook
claude-flow verify install-hook --pre-commit
# .git/hooks/pre-commit example:
#!/bin/bash
claude-flow verify check --threshold 0.95 --json > /tmp/verify.json
score=$(jq '.overallScore' /tmp/verify.json)
if (( $(echo "$score < 0.95" | bc -l) )); then
echo "ā Verification failed with score: $score"
echo "Run 'claude-flow verify check --verbose' for details"
exit 1
fi
echo "ā
Verification passed with score: $score"
Verification Speed:
Rollback Speed:
Dashboard Performance:
Low Truth Scores:
# Get detailed breakdown
claude-flow truth --verbose --threshold 0.0
# Check specific criteria
claude-flow verify check --verbose
# View agent-specific issues
claude-flow truth --agent <agent-name> --format json
Rollback Failures:
# Check git status
git status
# View rollback history
claude-flow verify rollback --history
# Manual rollback
git reset --hard HEAD~1
Verification Timeouts:
# Increase timeout
claude-flow verify check --timeout 60s
# Verify in batches
claude-flow verify batch --batch-size 10
Verification commands return standard exit codes:
0: Verification passed (score ā„ threshold)1: Verification failed (score < threshold)2: Error during verification (invalid input, system error)claude-flow pair - Collaborative development with verificationclaude-flow train - Training with verification feedbackclaude-flow swarm - Multi-agent coordination with quality checksclaude-flow report - Generate comprehensive project reports/docs/truth-scoring.md/docs/verification-criteria.md/examples/verification//docs/api/verification.md