Ejecuta cualquier Skill en Manus
con un clic

Ejecuta cualquier Skill en Manus con un clic

$pwd:

skill-quality-scorer

Name: Skill Quality Scorer
Author: aristoteleo

// Evaluate the quality of a Codex skill from multiple dimensions and produce a structured score, verdict, and revision guidance. Use when reviewing a `SKILL.md`, scoring a skill folder, comparing two skills, performing cross-review on a newly created skill, or checking whether a skill is triggerable, executable, concise, and maintainable without loading unnecessary context.

Ejecutar en Manus

$ git log --oneline --stat

stars:9

forks:0

updated:19 de marzo de 2026, 03:45

Explorador de archivos

6 archivos

SKILL.md

readonly

name	skill-quality-scorer
description	Evaluate the quality of a Codex skill from multiple dimensions and produce a structured score, verdict, and revision guidance. Use when reviewing a `SKILL.md`, scoring a skill folder, comparing two skills, performing cross-review on a newly created skill, or checking whether a skill is triggerable, executable, concise, and maintainable without loading unnecessary context.

Skill Quality Scorer

Goal

Score a skill on quality, not domain sophistication. Review whether another agent could trigger and use the skill reliably with minimal ambiguity and minimal context waste.

Quick Workflow

Read the target SKILL.md.
If the skill documents a real callable or CLI, inspect the live interface before trusting notebook-derived claims.
For data workflows, try to follow the skill on representative data before final scoring.
Read only the referenced files needed to judge execution quality, resource partitioning, validation quality, source grounding, and empirical executability.
Score each required dimension using references/rubric.md.
Apply hard gates before computing the final verdict.
Return a compact scorecard plus the highest-signal revision guidance.
For every dimension score, include an explicit rationale directly under that score in the report.
Write a human-readable Markdown score report to the current working directory instead of hiding it inside the generated skill folder.

Scope

Use this skill to review:

a skill folder
a standalone SKILL.md
a proposed skill diff
two competing skill designs
a notebook-derived skill after conversion

Do not score:

the scientific correctness of the underlying domain unless the skill itself makes unsupported domain claims
code quality outside the skill's own bundled scripts and instructions
repository-wide documentation unrelated to the skill

Review Rules

Default to reading SKILL.md first.
Read references/ only when the main skill explicitly depends on them or when resource partitioning is being scored.
Read scripts/ only when the skill relies on them for core execution or validation.
When the skill claims concrete function or CLI behavior, verify that behavior against source, help(...), or -h/--help before treating the skill as complete.
For notebook-derived data workflows, do not stop at interface inspection if representative data can be constructed or loaded reasonably.
Treat missing branch coverage for method, backend, mode, provider, or similar parameters as a real usability risk, not a minor omission.
Do not reward verbosity.
Do not reward domain difficulty.
Do not assume missing validation or compatibility notes are present elsewhere.
Penalize trigger ambiguity, execution ambiguity, and hidden dependencies aggressively.

Hard Gates

Apply these gates before issuing a passing verdict:

Trigger Precision must be at least 3/5.
Execution Clarity must be at least 3/5.
Validation Strength must be at least 3/5.
For data workflows, Empirical Executability must be at least 3/5.

If any hard-gate dimension is below 3, the skill cannot receive a pass verdict even if the weighted score is otherwise high.

Output Format

Return results in this order:

Overall verdict: pass, revise, or fail
Weighted score out of 100
Dimension-by-dimension scores, with a short reason under each score
Top issues blocking a higher score
Targeted revision actions

Also write a Markdown report file in the current working directory with a strict name like:

<skill-name>-score-report-YYYY-MM-DD.md

Do not place this report inside the generated skill folder unless the user explicitly asks for that. The filename pattern replaces older loose names such as score-report.md.

The report should include concrete evidence, not only the final score:

files reviewed
commands actually run
test and validation outputs
interface inspection evidence
reviewer-run empirical execution evidence on real or synthetic data when applicable
dimension-by-dimension scoring rationale, written directly under each dimension score instead of only in a separate summary section
residual risks

Keep the review concise. Prefer high-signal findings over a long narrative.

Comparison Mode

When comparing two skills:

score both using the same rubric
keep the same standard across both reviews
explain which one is more triggerable, more executable, or more maintainable
identify whether one is shorter but underspecified or richer but too heavy

Cross-Review Guidance

When reviewing a skill that was just created in the same repository:

treat the scoring as independent evaluation
do not import the creator's unstated intentions
score only what is discoverable from the skill and its referenced resources
if a needed behavior exists only in the source notebook or in the author's head, score that as missing
if a notebook-derived skill documents only one observed method path but the source has more branches, score the missing branches as an execution and compatibility gap

Resource Map

Read references/rubric.md for scoring dimensions, weights, and verdict rules.
Read references/output-template.md when you need a consistent review format.
Read references/source-grounding-audit.md when judging whether a skill is grounded in real function signatures, docstrings, or branch behavior.
Run scripts/inspect_python_interface.py when you need a quick source-grounded view of a Python callable before scoring completeness.

related-skills.json

mismo repositorio

skill-authoring.md

from "aristoteleo/awesome-skill-generate"

Create or update a Codex skill that packages reusable workflows, references, scripts, and assets for repeated tasks. Use when turning notebooks, tutorials, analyses, or domain procedures into a triggerable local skill for other agents, or when deciding whether a notebook subset or branch should update an existing skill instead of creating a duplicate one.

2026-03-209

dynamo-preprocess.md

from "aristoteleo/awesome-skill-generate"

Run or adapt dynamo preprocessing with `dynamo.preprocessing.Preprocessor`, including the `recipe` branches `monocle`, `seurat`, `sctransform`, `pearson_residuals`, and `monocle_pearson_residuals`. Use when converting or reproducing `docs/tutorials/notebooks/100_tutorial_preprocess.ipynb`, preprocessing an `AnnData` object for downstream dynamo analysis, customizing preprocessing kwargs, or translating notebook-level preprocessing into a reusable agent workflow.

2026-03-199

package.json

"author": "aristoteleo"

"repository": "aristoteleo/awesome-skill-generate"

Abrir repositorio de GitHub Ver repositorios del creador

$ install --global

$ download --local

Ejecutar en Manus

$ useful --forSOC

Desarrolladores de softwareOcupaciones informáticas y matemáticas15-1252L4

name	skill-quality-scorer
description	Evaluate the quality of a Codex skill from multiple dimensions and produce a structured score, verdict, and revision guidance. Use when reviewing a `SKILL.md`, scoring a skill folder, comparing two skills, performing cross-review on a newly created skill, or checking whether a skill is triggerable, executable, concise, and maintainable without loading unnecessary context.

Skill Quality Scorer

Goal

Score a skill on quality, not domain sophistication. Review whether another agent could trigger and use the skill reliably with minimal ambiguity and minimal context waste.

Quick Workflow

Read the target SKILL.md.
If the skill documents a real callable or CLI, inspect the live interface before trusting notebook-derived claims.
For data workflows, try to follow the skill on representative data before final scoring.
Read only the referenced files needed to judge execution quality, resource partitioning, validation quality, source grounding, and empirical executability.
Score each required dimension using references/rubric.md.
Apply hard gates before computing the final verdict.
Return a compact scorecard plus the highest-signal revision guidance.
For every dimension score, include an explicit rationale directly under that score in the report.
Write a human-readable Markdown score report to the current working directory instead of hiding it inside the generated skill folder.

Scope

Use this skill to review:

a skill folder
a standalone SKILL.md
a proposed skill diff
two competing skill designs
a notebook-derived skill after conversion

Do not score:

the scientific correctness of the underlying domain unless the skill itself makes unsupported domain claims
code quality outside the skill's own bundled scripts and instructions
repository-wide documentation unrelated to the skill

Review Rules

Default to reading SKILL.md first.
Read references/ only when the main skill explicitly depends on them or when resource partitioning is being scored.
Read scripts/ only when the skill relies on them for core execution or validation.
When the skill claims concrete function or CLI behavior, verify that behavior against source, help(...), or -h/--help before treating the skill as complete.
For notebook-derived data workflows, do not stop at interface inspection if representative data can be constructed or loaded reasonably.
Treat missing branch coverage for method, backend, mode, provider, or similar parameters as a real usability risk, not a minor omission.
Do not reward verbosity.
Do not reward domain difficulty.
Do not assume missing validation or compatibility notes are present elsewhere.
Penalize trigger ambiguity, execution ambiguity, and hidden dependencies aggressively.

Hard Gates

Apply these gates before issuing a passing verdict:

Trigger Precision must be at least 3/5.
Execution Clarity must be at least 3/5.
Validation Strength must be at least 3/5.
For data workflows, Empirical Executability must be at least 3/5.

If any hard-gate dimension is below 3, the skill cannot receive a pass verdict even if the weighted score is otherwise high.

Output Format

Return results in this order:

Overall verdict: pass, revise, or fail
Weighted score out of 100
Dimension-by-dimension scores, with a short reason under each score
Top issues blocking a higher score
Targeted revision actions

Also write a Markdown report file in the current working directory with a strict name like:

<skill-name>-score-report-YYYY-MM-DD.md

Do not place this report inside the generated skill folder unless the user explicitly asks for that. The filename pattern replaces older loose names such as score-report.md.

The report should include concrete evidence, not only the final score:

files reviewed
commands actually run
test and validation outputs
interface inspection evidence
reviewer-run empirical execution evidence on real or synthetic data when applicable
dimension-by-dimension scoring rationale, written directly under each dimension score instead of only in a separate summary section
residual risks

Keep the review concise. Prefer high-signal findings over a long narrative.

Comparison Mode

When comparing two skills:

score both using the same rubric
keep the same standard across both reviews
explain which one is more triggerable, more executable, or more maintainable
identify whether one is shorter but underspecified or richer but too heavy

Cross-Review Guidance

When reviewing a skill that was just created in the same repository:

treat the scoring as independent evaluation
do not import the creator's unstated intentions
score only what is discoverable from the skill and its referenced resources
if a needed behavior exists only in the source notebook or in the author's head, score that as missing
if a notebook-derived skill documents only one observed method path but the source has more branches, score the missing branches as an execution and compatibility gap

Resource Map

Read references/rubric.md for scoring dimensions, weights, and verdict rules.
Read references/output-template.md when you need a consistent review format.
Read references/source-grounding-audit.md when judging whether a skill is grounded in real function signatures, docstrings, or branch behavior.
Run scripts/inspect_python_interface.py when you need a quick source-grounded view of a Python callable before scoring completeness.

skill-quality-scorer

Skill Quality Scorer

Goal

Quick Workflow

Scope

Review Rules

Hard Gates

Output Format

Comparison Mode

Cross-Review Guidance

Resource Map

Más de este repositorio

Más de este repositorio

Skill Quality Scorer

Goal

Quick Workflow

Scope

Review Rules

Hard Gates

Output Format

Comparison Mode

Cross-Review Guidance

Resource Map