
skill-creator

Name: Skill Creator
Author: damionrashford

// Use when building a skill, creating a SKILL.md, packaging a workflow, making a slash command, or asked "how do I make a skill". Scaffolds the folder, generates SKILL.md from a template, validates against spec. Produces a complete ready-to-deploy skill folder: scripts, references, assets. Also use to review or improve an existing skill.


stars: 2

forks: 0

updated: April 9, 2026 at 01:21



package.json

"author": "damionrashford"

"repository": "damionrashford/mlx"


name: skill-creator
description: Use when building a skill, creating a SKILL.md, packaging a workflow, making a slash command, or asked "how do I make a skill". Scaffolds the folder, generates SKILL.md from a template, validates against spec. Produces a complete ready-to-deploy skill folder: scripts, references, assets. Also use to review or improve an existing skill.
argument-hint: [what the skill should do — describe the workflow or task]

Skill Creator

What you're building: $ARGUMENTS

Available scripts

scripts/scaffold.py — Creates skill folder + SKILL.md from template. Run this first.
scripts/validate.py — Validates a skill against spec requirements. Run after writing.
scripts/init-evals.py — Scaffolds evals/ with evals.json + trigger-queries.json templates.
scripts/test-triggers.py — Runs trigger queries through claude CLI, reports trigger rates.
scripts/run-eval.py — Runs a single eval case (with-skill or without) via claude CLI.
scripts/grade.py — Grades assertions against outputs, writes grading.json + benchmark.json.

Phase 1 — Gather what you need

Ask the user these questions up front (all at once):

What task or workflow should the skill handle? What does the user type that should trigger it?
Who invokes it — you, Claude, or both?
- User-only (deploy, commit, send slack) → --disable-model-invocation
- Claude-auto only → no scaffold flag; set user-invocable: false in the SKILL.md frontmatter after scaffolding
- Both → default, no flag
Personal or project-scoped?
- Personal (all your projects): ~/.claude/skills
- Project-only: .claude/skills
Does it need scripts? Reusable logic, validation, commands too complex to get right by language alone → --with-scripts
Does it need reference docs? API guides, schemas, style guides, gotchas too long for SKILL.md → --with-references
Does it depend on an MCP server or external tool?

If $ARGUMENTS already covers these, confirm and proceed.

Phase 2 — Scaffold the folder

Run scaffold.py to create the structure. Build the command from answers in Phase 1:

uv run ${CLAUDE_SKILL_DIR}/scripts/scaffold.py \
  --name <kebab-case-name> \
  --output <~/.claude/skills or .claude/skills> \
  [--description "What it does. Use when the user asks to X, Y, or Z."] \
  [--disable-model-invocation] \
  [--with-scripts] \
  [--with-references] \
  [--with-assets] \
  [--argument-hint "[arg]"] \
  [--context fork] \
  [--agent Explore]
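
The mapping from Phase 1 answers to scaffold flags can be sketched in Python. This is only an illustration, not part of the skill: the helper name `build_scaffold_args`, the answer keys (`user_only`, `needs_scripts`, and so on), and the example skill name `deploy-preview` are all hypothetical. Only the flag names come from the command above.

```python
def build_scaffold_args(name, output, answers):
    """Translate Phase 1 answers into scaffold.py flags (hypothetical helper)."""
    args = ["--name", name, "--output", output]
    if answers.get("description"):
        args += ["--description", answers["description"]]
    if answers.get("user_only"):
        # User-only skills (deploy, commit, send slack) disable model invocation.
        args.append("--disable-model-invocation")
    if answers.get("needs_scripts"):
        args.append("--with-scripts")
    if answers.get("needs_references"):
        args.append("--with-references")
    if answers.get("argument_hint"):
        args += ["--argument-hint", answers["argument_hint"]]
    return args


# Hypothetical answers for a project-scoped, user-only skill that needs scripts.
example = build_scaffold_args(
    "deploy-preview",
    ".claude/skills",
    {"user_only": True, "needs_scripts": True},
)
print(" ".join(example))
```

The returned list is meant to be appended to the `uv run .../scaffold.py` invocation shown above; keeping it as a list avoids shell-quoting problems in the description string.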

The scaffold creates:

<name>/SKILL.md — template with frontmatter pre-filled
<name>/scripts/process.py — placeholder with --help, argparse, PEP 723 deps (if --with-scripts)
<name>/references/guide.md — placeholder reference doc (if --with-references)
<name>/assets/ — empty dir (if --with-assets)

For the description flag: structure it as "What it does. Use when the user asks to X, Y, or Z." — max 1024 chars, no XML brackets. Read references/description-optimization.md for optimization guidance.
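
The description constraints above can be checked before scaffolding. The following is a rough sketch, not the logic of validate.py (which is not shown here); the function name `check_description` and the message wording are assumptions.

```python
import re

MAX_DESCRIPTION_CHARS = 1024  # limit stated in the guidance above


def check_description(description):
    """Return a list of problems; an empty list means the description passes."""
    problems = []
    if len(description) > MAX_DESCRIPTION_CHARS:
        problems.append(
            f"too long: {len(description)} > {MAX_DESCRIPTION_CHARS} chars"
        )
    if re.search(r"[<>]", description):
        problems.append("contains XML brackets")
    if "use when" not in description.lower():
        problems.append('missing a "Use when ..." trigger clause')
    return problems


print(check_description(
    "Formats changelogs. Use when the user asks to draft release notes."
))
```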

Phase 3 — Write the SKILL.md body

Open the scaffolded SKILL.md. The template at assets/skill-template.md shows the full canonical structure.

Write the body following this structure:

# Skill Name

**Context:** $ARGUMENTS ← keep if the skill takes arguments

## Quick start

3 bullet points → "Most common task: → Step N"

## When to use

Bullet list of scenarios this covers.

## Step 1 — [Action]

Specific, actionable instructions. Not "validate properly" — write exactly what to do.
Include exact commands, field names, expected outputs.

## Gotchas ← HIGHEST VALUE SECTION — never skip this

- [Non-obvious fact that the agent would get wrong without it]
- Add every correction you've had to make manually

## Examples

Input → Steps → Result

## Troubleshooting

Error message → Cause → Solution

Key rules — read references/best-practices.md for full guidance:

Under 500 lines. Move detailed docs to references/ and link with "Read references/X.md when [condition]"
Add only what the agent wouldn't know on its own — skip general knowledge
Prescriptive when operations are fragile; give freedom when approaches are flexible
Defaults not menus: "Use pdfplumber. For scanned PDFs, use pdf2image instead."
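
Two of these rules are mechanical enough to check with a few lines of Python. This is a minimal sketch under assumptions: the function name `lint_body` is hypothetical, "under 500 lines" is read as at most 499 lines, and the Gotchas heading is assumed to be written as `## Gotchas` as in the structure above.

```python
def lint_body(body, max_lines=500):
    """Return a list of findings for a SKILL.md body; empty means no findings."""
    lines = body.splitlines()
    findings = []
    if len(lines) >= max_lines:
        findings.append(
            f"body has {len(lines)} lines; move detail to references/"
        )
    # Only second-level headings are inspected, matching the template structure.
    headings = [line.strip() for line in lines if line.startswith("## ")]
    if not any(heading.startswith("## Gotchas") for heading in headings):
        findings.append("no '## Gotchas' section")
    return findings


print(lint_body("# Demo\n\n## Quick start\n\n## Gotchas\n- Uses UTC, not local time\n"))
```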

If scripts/ was created:

The placeholder scripts/process.py already has --help, argparse, PEP 723 inline deps, and proper stdout/stderr separation. Replace the TODO section with real logic. Never add interactive prompts (input()) — agents can't respond.
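
For orientation, a script with those properties might look like the sketch below. It is not the actual placeholder generated by scaffold.py; the kebab-case check is a stand-in for real logic, and the Python version in the PEP 723 header is an assumption.

```python
# /// script
# requires-python = ">=3.10"
# dependencies = []
# ///
"""Illustrative skeleton: check that a skill name is kebab-case."""
import argparse
import re
import sys

KEBAB_CASE = re.compile(r"[a-z0-9]+(-[a-z0-9]+)*")


def main(argv=None):
    # argparse supplies --help; every input arrives as a flag or argument,
    # never through an interactive prompt.
    parser = argparse.ArgumentParser(
        description="Check that a skill name is kebab-case."
    )
    parser.add_argument("name", help="skill name to check")
    args = parser.parse_args(argv)

    if not KEBAB_CASE.fullmatch(args.name):
        # Diagnostics go to stderr so stdout stays clean for the caller.
        print(f"error: {args.name!r} is not kebab-case", file=sys.stderr)
        return 1

    # Results go to stdout only.
    print(args.name)
    return 0
```

In a standalone file, the script would end with the usual `if __name__ == "__main__":` guard calling `sys.exit(main())` so the return value becomes the process exit code.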

If references/ was created:

Fill references/guide.md with content too large for SKILL.md. Link it from the body: Read references/guide.md if [condition] — not a generic "see references/".

Phase 4 — Validate

Run validate.py against the scaffolded skill:

uv run ${CLAUDE_SKILL_DIR}/scripts/validate.py <path-to-skill>

Expected output: all checks PASS, zero errors.

Common issues and fixes:

name-matches-folder warning → either rename the folder or fix the name field
description-has-trigger warning → add "Use when the user asks to..." to the description
body-size warning → move reference material to references/
script-no-interactive warning → replace input() with argparse flags

Run until exit code 0 (all pass) or 2 (warnings only — acceptable). Exit code 1 (errors) means spec violations that will prevent the skill from loading correctly.
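
When the validator runs from automation, the three exit codes can be interpreted as follows. This is a sketch under assumptions: the helper names `interpret_exit_code` and `run_validator` are hypothetical, and any code other than 0, 1, or 2 is treated as a failure because the text above does not define it.

```python
import subprocess
import sys

# Meanings taken from the exit codes described above.
EXIT_MEANINGS = {
    0: (True, "all checks passed"),
    2: (True, "warnings only, acceptable"),
    1: (False, "errors: spec violations"),
}


def interpret_exit_code(code):
    """Map an exit code to (acceptable, meaning)."""
    return EXIT_MEANINGS.get(code, (False, f"unexpected exit code {code}"))


def run_validator(command):
    """Run a validator command (list of argv strings) and interpret the result."""
    completed = subprocess.run(command, capture_output=True, text=True)
    ok, meaning = interpret_exit_code(completed.returncode)
    return ok, meaning, completed.stdout


# Stand-in command that exits with code 2, to show the warnings-only path.
print(run_validator([sys.executable, "-c", "raise SystemExit(2)"])[:2])
```

In real use, the `command` list would hold the same words as the `uv run` validate command above.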

Phase 5 — Test triggering

Quick check: Ask Claude: "When would you use the <skill-name> skill?" — Claude quotes the description back. Adjust if the answer doesn't match user intent.

Structured test with script:

# 1. Scaffold the evals directory
uv run ${CLAUDE_SKILL_DIR}/scripts/init-evals.py --skill <path-to-skill>

# 2. Fill in evals/trigger-queries.json — replace REPLACE placeholders with:
#    - 5-10 "should trigger" queries (vary phrasing, formality, explicitness)
#    - 5-10 "should NOT trigger" near-misses (share keywords, need something different)

# 3. Run trigger tests (3 runs per query for nondeterminism)
uv run ${CLAUDE_SKILL_DIR}/scripts/test-triggers.py --skill <path-to-skill> --runs 3 --save

Read the results: the script reports a trigger rate per query. Target: should-trigger average ≥ 80%, should-not-trigger average ≤ 20%.

If failing: revise description in SKILL.md and re-run. Read references/description-optimization.md for the full optimization loop. Use --train-only to reserve 40% of queries for final validation.
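
The pass/fail arithmetic behind those targets is sketched below. The record layout (`should_trigger`, `triggered`) is an assumption made for illustration; the actual format saved by test-triggers.py is not shown here, and `trigger_summary` is a hypothetical name.

```python
from statistics import mean


def trigger_summary(results, min_should=0.80, max_should_not=0.20):
    """Average per-query trigger rates and compare them with the targets."""
    rates = {True: [], False: []}
    for record in results:
        runs = record["triggered"]  # one boolean per run of this query
        rates[record["should_trigger"]].append(sum(runs) / len(runs))
    # Both groups must be non-empty; mean() raises on an empty list.
    should_avg = mean(rates[True])
    should_not_avg = mean(rates[False])
    return {
        "should_trigger_avg": should_avg,
        "should_not_trigger_avg": should_not_avg,
        "passed": should_avg >= min_should and should_not_avg <= max_should_not,
    }


# Hypothetical results: two should-trigger queries, two near-misses, 3 runs each.
sample = [
    {"should_trigger": True, "triggered": [True, True, True]},
    {"should_trigger": True, "triggered": [True, True, False]},
    {"should_trigger": False, "triggered": [False, False, False]},
    {"should_trigger": False, "triggered": [True, False, False]},
]
summary = trigger_summary(sample)
print(summary["passed"])
```

Averaging per-query rates first, rather than pooling all runs, keeps a single noisy query from being hidden by many stable ones.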

Phase 6 — Install and smoke test

The skill is already in place from Phase 2 (--output ~/.claude/skills or --output .claude/skills).

/skill-name          ← direct invocation (if user-invocable)

Or say something that matches the description to test auto-triggering.

Phase 7 — Run evals (measure output quality)

Use this when you need to measure whether the skill actually improves output quality, not just triggering.

# 1. Fill in evals/evals.json — add realistic prompts + assertions per eval case
#    Put input files in evals/files/

# 2. Run each eval WITH skill (iteration 1)
uv run ${CLAUDE_SKILL_DIR}/scripts/run-eval.py --skill <path> --all --iteration 1

# 3. Run same evals WITHOUT skill (baseline)
uv run ${CLAUDE_SKILL_DIR}/scripts/run-eval.py --skill <path> --all --iteration 1 --no-skill

# 4. Grade assertions — uses claude CLI to grade each claim against actual output
uv run ${CLAUDE_SKILL_DIR}/scripts/grade.py --skill <path> --iteration 1

# Or grade manually (human-in-the-loop):
uv run ${CLAUDE_SKILL_DIR}/scripts/grade.py --skill <path> --iteration 1 --human

Read benchmark.json: it reports pass rates with the skill versus without it, the delta between them, and the token and time cost.
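
The with-skill versus without-skill comparison reduces to simple arithmetic, sketched here. The input shape (one boolean per graded assertion) and the function names are assumptions; the real schema of grading.json and benchmark.json is not shown in this document.

```python
def pass_rate(assertion_results):
    """Fraction of assertions that passed; the input is a list of booleans."""
    return sum(assertion_results) / len(assertion_results)


def compare(with_skill, without_skill):
    """Return both pass rates and their difference (with minus without)."""
    with_rate = pass_rate(with_skill)
    without_rate = pass_rate(without_skill)
    return {
        "with_skill": with_rate,
        "without_skill": without_rate,
        "delta": with_rate - without_rate,
    }


# Hypothetical grades for four assertions under each condition.
report = compare([True, True, True, False], [True, False, False, False])
print(report)
```

A positive delta means the skill helped on these assertions; a delta near zero or below suggests the skill adds noise rather than signal.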

Phase 8 — Iterate

After grading, three signal sources tell you what to fix:

Failed assertions → specific gaps in instructions, missing steps, unhandled cases
Human feedback → broader quality issues the assertions didn't catch
Execution transcripts → WHY things went wrong (agent tried multiple approaches = instructions too vague)

Fix SKILL.md, then re-run in a new iteration:

# Increment iteration to keep results separate
uv run ${CLAUDE_SKILL_DIR}/scripts/run-eval.py --skill <path> --all --iteration 2
uv run ${CLAUDE_SKILL_DIR}/scripts/run-eval.py --skill <path> --all --iteration 2 --no-skill
uv run ${CLAUDE_SKILL_DIR}/scripts/grade.py --skill <path> --iteration 2

Stop when: pass rate plateaus, human feedback is consistently empty, or delta is satisfactory.
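
"Pass rate plateaus" can be made concrete with a small check across iterations. The thresholds below (a gain under 0.02 for two consecutive iterations) are illustrative assumptions, not values from this skill, and `has_plateaued` is a hypothetical name.

```python
def has_plateaued(pass_rates, min_gain=0.02, window=2):
    """True when each of the last `window` iterations improved by less than min_gain."""
    gains = [later - earlier for earlier, later in zip(pass_rates, pass_rates[1:])]
    if len(gains) < window:
        return False  # not enough iterations to judge
    return all(gain < min_gain for gain in gains[-window:])


# Hypothetical with-skill pass rates for iterations 1 to 4.
print(has_plateaued([0.50, 0.70, 0.71, 0.715]))
```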

| Signal | Cause | Fix |
| --- | --- | --- |
| Skill never triggers | Description too narrow | Add trigger phrases, re-run test-triggers.py |
| Triggers too often | Description too broad | Add negative scope or `disable-model-invocation: true` |
| Instructions not followed | Critical rules buried | Move to top, use CRITICAL headers, add to Gotchas |
| Inconsistent results across runs | Instructions ambiguous | Add deterministic validation script |
| Pass rate low vs baseline | Skill adding noise not signal | Trim instructions to what agent actually lacks |
| Skill slow / large context | SKILL.md too big | Move content to `references/` |
