Jeden Skill in Manus ausführen
mit einem Klick

Jeden Skill in Manus mit einem Klick ausführen

enforcing-skill-rules

Use when improving an existing skill, creating a new one, or when a skill feels weak, rules are ignored, ineffective, or you want to prove a skill works with data. Use when you need to compress a skill without regression.

In Manus ausführen

Überblick

Installationsbefehl

npx skills add https://github.com/rbaumier/skills --skill enforcing-skill-rules

Kopieren Sie diesen Befehl und fügen Sie ihn in Claude Code ein, um den Skill zu installieren

Quelle

rbaumier/skills

Sterne4

Forks0

Aktualisiert1. Juni 2026 um 09:12

Datei-Explorer

4 Dateien

SKILL.md

readonly

Mehr aus diesem Repository

gleiches Repository

language-rust

rbaumier/skills

Use when writing, reviewing, or refactoring Rust — ownership, lifetimes, type-driven design, async, error handling, FFI, and performance.

2026-06-014

swift

rbaumier/skills

Modern Swift 6+ & SwiftUI — strict concurrency, Observation, migration strategies

2026-06-014

testing

rbaumier/skills

Use when writing tests, choosing test strategies, or setting up test infrastructure — TDD, unit tests, E2E, Vitest, Playwright, coverage.

2026-06-014

coding-standards

rbaumier/skills

Use when writing, reviewing, or refactoring code in any language. Use for architecture decisions, system design, component boundaries, and code quality judgment. Always relevant when touching source code.

2026-06-014

coding-standards-style

rbaumier/skills

Use when writing or reviewing comments, docstrings, names, control flow, or file organization. Use when evaluating readability, choosing identifiers, splitting files, or applying naming conventions. Covers the visible surface of code.

2026-06-014

make-interfaces-feel-better

rbaumier/skills

Design engineering principles for making interfaces feel polished. Use when building UI components, reviewing frontend code, implementing animations, hover states, shadows, borders, typography, micro-interactions, enter/exit animations, or any visual detail work. Triggers on UI polish, design details, "make it feel better", "feels off", stagger animations, border radius, optical alignment, font smoothing, tabular numbers, image outlines, box shadows.

2026-06-014

Quelle

rbaumier

rbaumier/skills

GitHub-Repository öffnen Creator-Repositorys ansehen

Installationsbefehl

Download

In Manus ausführen

Nützlich fürSOC

SoftwareentwicklerInformatik- und Mathematikberufe15-1252L4

Classification

Signal

Eval strategy

Cat A — Unique philosophy

Opinionated patterns the model wouldn't follow naturally (specific commenting style, custom error paradigm, proprietary architecture)

Standard "fix all issues" with Level 1 traps works. The skill's unique patterns ARE the discriminating factor.

Cat B — Standard best practices

Well-known patterns any senior dev knows (OWASP, React perf, Rust idioms)

"Fix all issues" won't discriminate — model already knows these. Use Level 2 traps or accept low delta.

{ "skill_name": "my-skill", "evals": [{ "id": 1, "name": "full-sweep", "prompt": "Fix all issues:\n\n```typescript\n...\n```\n\nOutput fixed code only.", "assertions": [ { "id": "channels-over-mutex", "trap": "process_batch uses Arc<Mutex<Vec>> to collect results", "description": "Use mpsc channels, not Arc<Mutex<Vec>>" } ] }] }

Level 1 (too easy)

Level 2 (discriminating)

exec(command)

bcrypt(10) instead of bcrypt(12+)

SELECT * ... '${id}'

HS256 on public API (HS256 is fine internally)

useMemo(() => "string")

React.memo() on cheap component with stable props

Arc<Mutex<Vec>>

HashMap<u64, _> instead of FxHashMap

user-scalable=no

3 font weights instead of max 2

| ID | PASS/FAIL | Evidence (quote from code) | |---|---|---| | channels-over-mutex | PASS | "let (tx, rx) = mpsc::channel(items.len())" | | cancellation-safety | FAIL | No Drop impl shown, no cancellation docs |

Failure pattern

Fix

Rule is abstract ("prefer X")

Add review checklist: "if you see Y, flag it"

Model does a workaround

Explicitly forbid it ("ternary is NOT a fix")

Silent fallback instead of reject

Add "never ?? default — throw on invalid"

Rule buried in middle of section

Move to top, bold it

Model doesn't mention pattern by name

Add "always recommend X by name in reviews"

Rule lacks concrete example

Add specific before/after or valid/invalid values

Rule states what but not why

Add rationale — models internalize rules better when they understand the reasoning

Rule duplicated across sections, ignored in both

Consolidate into one authoritative location or reinforce with cross-reference

# Benchmark — YYYY-MM-DD ## Overall | | With Skill | Without Skill | Delta | |---|---|---|---| | Pass rate | 49/49 (100%) | 27/49 (55%) | +45% | ## Per-section ... ## Progression | Iteration | Score | Key fix | ...

Classification

Signal

Eval strategy

Cat A — Unique philosophy

Opinionated patterns the model wouldn't follow naturally (specific commenting style, custom error paradigm, proprietary architecture)

Standard "fix all issues" with Level 1 traps works. The skill's unique patterns ARE the discriminating factor.

Cat B — Standard best practices

Well-known patterns any senior dev knows (OWASP, React perf, Rust idioms)

"Fix all issues" won't discriminate — model already knows these. Use Level 2 traps or accept low delta.

Level 1 (too easy)

Level 2 (discriminating)

exec(command)

bcrypt(10) instead of bcrypt(12+)

SELECT * ... '${id}'

HS256 on public API (HS256 is fine internally)

useMemo(() => "string")

React.memo() on cheap component with stable props

Arc<Mutex<Vec>>

HashMap<u64, _> instead of FxHashMap

user-scalable=no

3 font weights instead of max 2

Failure pattern

Fix

Rule is abstract ("prefer X")

Add review checklist: "if you see Y, flag it"

Model does a workaround

Explicitly forbid it ("ternary is NOT a fix")

Silent fallback instead of reject

Add "never ?? default — throw on invalid"

Rule buried in middle of section

Move to top, bold it

Model doesn't mention pattern by name

Add "always recommend X by name in reviews"

Rule lacks concrete example

Add specific before/after or valid/invalid values

Rule states what but not why

Add rationale — models internalize rules better when they understand the reasoning

Rule duplicated across sections, ignored in both

Consolidate into one authoritative location or reinforce with cross-reference

enforcing-skill-rules

Overview

When to Use

The Loop

Step 0: Classify the Skill

Step 1: Extract Assertions

Step 2: Write Trap Prompts

Trap Difficulty Levels

Step 3: Run Baseline

Step 4: Grade with Cross-Model Grading

When Delta is 0% or Negative

Step 5: Fix Failed Assertions

Step 6: Compress

Step 7: Save Benchmarks

Pre-Deployment Checklist

Red Flags

Key Learnings

Overview

When to Use

The Loop

Step 0: Classify the Skill

Step 1: Extract Assertions

Step 2: Write Trap Prompts

Trap Difficulty Levels

Step 3: Run Baseline

Step 4: Grade with Cross-Model Grading

When Delta is 0% or Negative

Step 5: Fix Failed Assertions

Step 6: Compress

Step 7: Save Benchmarks

Pre-Deployment Checklist

Red Flags

Key Learnings

name	enforcing-skill-rules
description	Use when improving an existing skill, creating a new one, or when a skill feels weak, rules are ignored, ineffective, or you want to prove a skill works with data. Use when you need to compress a skill without regression.