Jeden Skill in Manus ausführen
mit einem Klick

Jeden Skill in Manus mit einem Klick ausführen

ultrareview

Sterne2

Forks2

Aktualisiert16. April 2026 um 16:57

Deep validation protocol that examines preceding context for errors, unvalidated assumptions, alignment issues, gaps, and improvement opportunities. Produces a machine-parseable summary. Use when validating plans, code changes, configurations, or any work product before proceeding.

Installation

Mit Codex oder Claude installieren Kopieren Sie diesen Prompt, fügen Sie ihn in Codex, Claude oder einen anderen Assistant ein und lassen Sie die Skill-Seite prüfen und installieren.

In Manus ausführen

Quelle

AeyeOps

AeyeOps/aeo-skill-marketplace

GitHub-Repository öffnen Creator-Repositorys ansehen

Download

In Manus ausführen

Verwandte BerufeSOC

Basierend auf der SOC-Berufsklassifikation

Softwarequalitätssicherungsanalysten und -testerInformatik- und Mathematikberufe·SOC 15-1253

SKILL.md

readonly

name	ultrareview
description	Deep validation protocol that examines preceding context for errors, unvalidated assumptions, alignment issues, gaps, and improvement opportunities. Produces a machine-parseable summary. Use when validating plans, code changes, configurations, or any work product before proceeding.
allowed-tools	Read, Glob, Grep, Bash(git status ), Bash(git diff ), Bash(find *)

Ultra-Validation Protocol

Evaluate each validation dimension systematically. Question every assumption. Cross-reference against actual codebase artifacts.

Focus Area

$ARGUMENTS

If no focus specified, validate the entire preceding context (plan, code changes, discussion, or proposal).

Validation Steps

Step 0: Deliverable Existence Check

<deliverable_check> Based on the focus area, identify:

What concrete deliverable was requested? (code, config, documentation, etc.)
Does this deliverable exist?

If the requested deliverable does not exist, this is automatically a critical finding. Do not proceed to validate planning artifacts as a substitute. Status is NEEDS_ACTION until the deliverable exists. </deliverable_check>

<context_detection> Identify what you're validating:

Plan/Proposal: Architecture design, implementation approach, technical spec
Code Changes: Diff, new files, refactored code, PR
Discussion: Requirements gathering, debugging session, design conversation
Configuration: Environment setup, infrastructure, deployment config

Adapt your validation approach accordingly. </context_detection>

Step 0.5: Source Reading

<source_reading> Before evaluating, re-read every primary source file in scope from disk — even files that appear in the conversation context. Files may have been edited since they were last read, and stale context produces false findings. Trust what the file contains now, not what an earlier Read result showed.

When the focus area references a command template that calls a script, that script is a primary source — read it, because the template's description of what the script does may be incomplete or outdated. The same applies to configs, schemas, and any file referenced by another file you've read.

Follow reference chains, because a script that calls another script makes both relevant to your findings: if file A calls file B which reads file C, all three are in scope.

Build a file inventory as you go. Any finding that references a file you haven't read belongs in NEEDS_VALIDATION, not ERRORS — because you're reasoning from description rather than source. </source_reading>

Step 1: Assumption Inventory

List every assumption in the preceding context. For each:

VALIDATED: Confirmed by examining actual code/config/docs (cite file:line)
UNVALIDATED: Not yet verified against codebase
CONTRADICTED: Evidence suggests assumption is wrong

Step 2: Error & Risk Scan

Examine for issues appropriate to the context type:

For Code:

Logic errors, null handling, type mismatches
Missing error handlers, unhandled promises
Race conditions, async timing issues
Security vulnerabilities, exposed secrets
Performance issues, N+1 queries, memory leaks

For Plans/Proposals:

Unstated dependencies or prerequisites
Scope gaps or undefined edge cases
Integration risks with existing systems

For Configuration:

Missing environment variables
Security misconfigurations
Incompatible version constraints

Step 3: Omission Detection

Identify what's missing:

Incomplete implementations or undefined behaviors
Missing error handling for edge cases
Undocumented assumptions

Testing: Only flag missing e2e tests that run the real system with real data. Never flag absent unit tests, mocks, fakes, or synthetic-data tests.

Step 4: Codebase Alignment

Compare against existing patterns:

Does approach match existing code structure and conventions?
Are we violating established patterns?
Will changes integrate cleanly?
Are we introducing inconsistencies?

Step 5: Enhancement Opportunities

Can we reduce complexity?
Are there safer, faster, or cleaner approaches?
Can we consolidate duplicate logic?

Output Format

CRITICAL (Resolve before proceeding)

Location: [file:line, section, or concept]
Evidence: [direct observation — what you read in the source that confirms this]
Risk: [why critical]
Action: [specific next step]

If your evidence is an inference about behavior in a file you haven't read, this belongs in NEEDS_VALIDATION until you read that file.

ERRORS FOUND (Severity: HIGH/MEDIUM/LOW)

Location: [file:line, section, or concept]
Evidence: [direct observation — what you read in the source that confirms this]
Impact: [what breaks or fails]
Fix: [concrete solution]

If your evidence is an inference about behavior in a file you haven't read, this belongs in NEEDS_VALIDATION until you read that file.

ALIGNMENT ISSUES (Conflicts with codebase or conventions)

Current: [what exists]
Proposed: [what conflicts]
Resolution: [how to align]

MISSING (Gaps needing attention)

IMPROVEMENTS (Better alternatives with expected benefit)

VALIDATED (Confirmed with citations)

NEEDS VALIDATION (Default category for unverified concerns)

Use this for any concern where:

You identified a potential issue but haven't read the implementation files to confirm
The evidence comes from documentation/comments rather than source code
You're reasoning about behavior across components without verifying the integration point

Promote to ERRORS only after reading the relevant source and confirming the problem exists.

Artifact Inventory

List every file you read during this review, because this allows the reader to verify your coverage and identify files you may have missed.

path/to/file.py — relevant to: [what aspect of the review it informed]

Scorecard

After all findings sections, output this human-readable scorecard table:

## Scorecard

| Category           | Count | Action needed? |
|--------------------|-------|----------------|
| Critical           |     X | YES            |
| Errors             |     X | YES            |
| Alignment issues   |     X | YES            |
| Missing            |     X | YES            |
| Needs validation   |     X | YES            |
| Improvements       |     X | no             |
| Validated          |     X | no             |
| **Status**         |       | **PASS / NEEDS_ACTION** |

Rules for Action needed column: Critical, Errors, Alignment, Missing, Needs validation = YES when count > 0. Improvements and Validated are always "no".

Machine-Parseable Summary

Immediately after the scorecard, output this exact summary block (parsed by automation hooks):

<ultrareview_summary>
status: [PASS|NEEDS_ACTION]
critical: [count]
errors: [count]
alignment: [count]
missing: [count]
improvements: [count]
needs_validation: [count]
validated: [count]
</ultrareview_summary>

Rules:

Requirements stated in the focus area are required, not optional
status: PASS only if critical=0 AND errors=0 AND alignment=0 AND missing=0 AND needs_validation=0
status: NEEDS_ACTION if any actionable findings exist
Count each distinct finding, not each bullet point
The scorecard and summary block appear at the very end of your response, in that order

Mehr aus diesem Repository

gleiches Repository

cowork-migrate

AeyeOps/aeo-skill-marketplace

Migrate a Claude Cowork session from one Windows machine to another with full history, working file links, and no truncated-transcript rendering bug. Use this whenever the user mentions moving, importing, copying, or migrating a Cowork session/conversation/project between machines, or troubleshoots symptoms of a broken import like "session shows blank", "only the latest messages show", "scratchpad files don't open", "can't scroll past the last compaction", or "Loaded N messages (truncated via tail/compaction)" in the Cowork log. Covers orphan sessions on Windows to Windows under the same Cowork account. Handles the undocumented two-layer compact_boundary truncation filter in app.asar that silently clips imported transcripts. Does not handle Cowork Spaces/Projects, Linux/macOS, or cross-account migration.

2026-06-242

skill-creator

AeyeOps/aeo-skill-marketplace

Create new skills, modify and improve existing skills, and measure skill performance. Use when users want to create a skill from scratch, update or optimize an existing skill, run evals to test a skill, benchmark skill performance with variance analysis, or optimize a skill's description for better triggering accuracy.

2026-06-242

tailscale-macos-headscale

AeyeOps/aeo-skill-marketplace

Onboard a macOS host (Tahoe / macOS 26 and later) as a Tailscale client of a self-hosted headscale control plane. Covers Tailscale.app installation via Homebrew Cask, the NetworkExtension permission grants required for the daemon to start, the conflict that arises if the brew formula `tailscale` is also installed alongside the cask, how to use `tailscale up --login-server` with a headscale preauth key, the deep-link fallback flow when the CLI cannot reach the daemon, the headscale-specific gotcha that `headscale preauthkeys create --user <N>` expects a numeric user ID rather than a username on recent builds, and bidirectional reach verification once joined. Use when adding a macOS host to a headscale-controlled mesh, troubleshooting symptoms like "failed to connect to local tailscale service", Tailscale.app stuck on "Starting...", `tailscale up` hanging on "joining <coordinator>", a blank menu-bar icon after a fresh install, deciding between the Homebrew cask and formula distributions, or recovering from a st

2026-05-242

glinet-slate7

AeyeOps/aeo-skill-marketplace

Comprehensive reference for the GL-iNet Slate 7 travel router (model GL-BE3600, Wi-Fi 7). Covers hardware specs, 2.5G ports, touchscreen interface, full admin panel menu structure, VPN client setup (WireGuard/OpenVPN; NordVPN, Mullvad, Surfshark, and 30+ providers), WireGuard/OpenVPN server setup, AdGuard Home, Tor, Tailscale, DDNS, network modes (Router/AP/Extender/WDS/Drop-in Gateway), SSH/CLI access with command reference, factory reset, firmware update, and U-Boot bricked-device recovery. Also covers the JSON-RPC admin API at /rpc (challenge/response auth, module/method discovery, reusable bash helper), programmatic WireGuard server provisioning via the wg-server module (add_peer, generate_peer, settings, leak verification, local-only Endpoint pattern), and Linux client-side WG with overlay-VPN stacking — including the two leak modes that appear when running Tailscale on top of a full-tunnel WG client (fwmark 0x80000 bypass and wg-quick catch-all shadowing the tailnet routes) plus the wg-quick PostUp/PreD

2026-05-242

mlx-serving

AeyeOps/aeo-skill-marketplace

This skill should be used when the user asks about "MLX serving", "mlx_lm.server", "oMLX", "Apple Silicon LLM serving", or "local LLM on Mac" — and when troubleshooting symptoms like model fails to load, OOM during load or inference, server hangs or crashes at batch>1, tool calls returning as plaintext content, throughput regression, or choosing between mlx-lm and oMLX. Also applies to oMLX feature-flag tuning ("turboquant_kv", "dflash", "MTP", "specprefill", "thinking_budget", "max-concurrent-requests", "force_sampling"), OptiQ proxy for models exceeding RAM, Llama-4 ChunkedKVCache batch handling, Llama-3 tool-call JSON format ("name"/"parameters"), and bench-driven validation of serving configs. For Apple Silicon (M-series) only — not for cloud LLM hosting (Bedrock, OpenAI API, Anthropic API), not for non-MLX backends (llama.cpp, Ollama, vLLM), not for model training.

2026-05-092

lima-vm-operations

AeyeOps/aeo-skill-marketplace

This skill should be used when the user asks about "Lima", "limactl", "lima.yaml", "lima start", "lima shell", "creating a Linux VM on Mac", "running Linux on Apple Silicon", "macOS Linux VM", "Apple Silicon VM", or wants to "install Lima", "configure a Lima VM", "edit lima config", "spin up an Ubuntu VM on my Mac", or "use Lima to run Docker on macOS". Also applies for "lima vmType vz", "lima vz vs qemu", "host.lima.internal", "socket_vmnet", "lima networking", "lima shared network", "lima bridged network", "virtiofs mount", "9p mount", "lima port forward", "lima mount writable", "limactl edit", "limactl validate", "limactl template", "lima Rosetta", "running x86 in lima", "lima debug startup", or any task involving spinning up, configuring, troubleshooting, or shelling into a Lima VM on an Apple Silicon Mac. Use this skill whenever Lima is mentioned even if the user doesn't explicitly ask for "help" — the right configuration choices (vz vs qemu, mount type, network mode) are non-obvious and easy to get wron

2026-05-092

name	ultrareview
description	Deep validation protocol that examines preceding context for errors, unvalidated assumptions, alignment issues, gaps, and improvement opportunities. Produces a machine-parseable summary. Use when validating plans, code changes, configurations, or any work product before proceeding.
allowed-tools	Read, Glob, Grep, Bash(git status ), Bash(git diff ), Bash(find *)

Ultra-Validation Protocol

Evaluate each validation dimension systematically. Question every assumption. Cross-reference against actual codebase artifacts.

Focus Area

$ARGUMENTS

If no focus specified, validate the entire preceding context (plan, code changes, discussion, or proposal).

Validation Steps

Step 0: Deliverable Existence Check

<deliverable_check> Based on the focus area, identify:

What concrete deliverable was requested? (code, config, documentation, etc.)
Does this deliverable exist?

<context_detection> Identify what you're validating:

Plan/Proposal: Architecture design, implementation approach, technical spec
Code Changes: Diff, new files, refactored code, PR
Discussion: Requirements gathering, debugging session, design conversation
Configuration: Environment setup, infrastructure, deployment config

Adapt your validation approach accordingly. </context_detection>

Step 0.5: Source Reading

Follow reference chains, because a script that calls another script makes both relevant to your findings: if file A calls file B which reads file C, all three are in scope.

Step 1: Assumption Inventory

List every assumption in the preceding context. For each:

VALIDATED: Confirmed by examining actual code/config/docs (cite file:line)
UNVALIDATED: Not yet verified against codebase
CONTRADICTED: Evidence suggests assumption is wrong

Step 2: Error & Risk Scan

Examine for issues appropriate to the context type:

For Code:

Logic errors, null handling, type mismatches
Missing error handlers, unhandled promises
Race conditions, async timing issues
Security vulnerabilities, exposed secrets
Performance issues, N+1 queries, memory leaks

For Plans/Proposals:

Unstated dependencies or prerequisites
Scope gaps or undefined edge cases
Integration risks with existing systems

For Configuration:

Missing environment variables
Security misconfigurations
Incompatible version constraints

Step 3: Omission Detection

Identify what's missing:

Incomplete implementations or undefined behaviors
Missing error handling for edge cases
Undocumented assumptions

Testing: Only flag missing e2e tests that run the real system with real data. Never flag absent unit tests, mocks, fakes, or synthetic-data tests.

Step 4: Codebase Alignment

Compare against existing patterns:

Does approach match existing code structure and conventions?
Are we violating established patterns?
Will changes integrate cleanly?
Are we introducing inconsistencies?

Step 5: Enhancement Opportunities

Can we reduce complexity?
Are there safer, faster, or cleaner approaches?
Can we consolidate duplicate logic?

Output Format

CRITICAL (Resolve before proceeding)

Location: [file:line, section, or concept]
Evidence: [direct observation — what you read in the source that confirms this]
Risk: [why critical]
Action: [specific next step]

If your evidence is an inference about behavior in a file you haven't read, this belongs in NEEDS_VALIDATION until you read that file.

ERRORS FOUND (Severity: HIGH/MEDIUM/LOW)

Location: [file:line, section, or concept]
Evidence: [direct observation — what you read in the source that confirms this]
Impact: [what breaks or fails]
Fix: [concrete solution]

If your evidence is an inference about behavior in a file you haven't read, this belongs in NEEDS_VALIDATION until you read that file.

ALIGNMENT ISSUES (Conflicts with codebase or conventions)

Current: [what exists]
Proposed: [what conflicts]
Resolution: [how to align]

MISSING (Gaps needing attention)

IMPROVEMENTS (Better alternatives with expected benefit)

VALIDATED (Confirmed with citations)

NEEDS VALIDATION (Default category for unverified concerns)

Use this for any concern where:

You identified a potential issue but haven't read the implementation files to confirm
The evidence comes from documentation/comments rather than source code
You're reasoning about behavior across components without verifying the integration point

Promote to ERRORS only after reading the relevant source and confirming the problem exists.

Artifact Inventory

List every file you read during this review, because this allows the reader to verify your coverage and identify files you may have missed.

path/to/file.py — relevant to: [what aspect of the review it informed]

Scorecard

After all findings sections, output this human-readable scorecard table:

## Scorecard

| Category           | Count | Action needed? |
|--------------------|-------|----------------|
| Critical           |     X | YES            |
| Errors             |     X | YES            |
| Alignment issues   |     X | YES            |
| Missing            |     X | YES            |
| Needs validation   |     X | YES            |
| Improvements       |     X | no             |
| Validated          |     X | no             |
| **Status**         |       | **PASS / NEEDS_ACTION** |

Rules for Action needed column: Critical, Errors, Alignment, Missing, Needs validation = YES when count > 0. Improvements and Validated are always "no".

Machine-Parseable Summary

Immediately after the scorecard, output this exact summary block (parsed by automation hooks):

<ultrareview_summary>
status: [PASS|NEEDS_ACTION]
critical: [count]
errors: [count]
alignment: [count]
missing: [count]
improvements: [count]
needs_validation: [count]
validated: [count]
</ultrareview_summary>

Rules:

Requirements stated in the focus area are required, not optional
status: PASS only if critical=0 AND errors=0 AND alignment=0 AND missing=0 AND needs_validation=0
status: NEEDS_ACTION if any actionable findings exist
Count each distinct finding, not each bullet point
The scorecard and summary block appear at the very end of your response, in that order