Ejecuta cualquier Skill en Manus
con un clic

Ejecuta cualquier Skill en Manus con un clic

$pwd:

idea-evaluator

Name: Idea Evaluator
Author: HKUSTDial

// Evaluates a preliminary research idea against a five-dimension framework (Higher, Faster, Stronger, Cheaper, Broader) plus idea-lifecycle and student-capability matching, paradigm-shift probing, and a fatal-flaws audit. Returns a reviewer-style verdict. Use when the user has a draft research idea and asks whether it is worth pursuing, asks to 'evaluate this idea', 'score this idea', 'assess feasibility', 'novelty check', 'is this a good research direction', or before committing to a paper scope.

Ejecutar en Manus

$ git log --oneline --stat

stars:1162

forks:81

updated:22 de abril de 2026, 12:33

Explorador de archivos

11 archivos

SKILL.md

readonly

related-skills.json

mismo repositorio

figure-designer.md

from "HKUSTDial/Supervisor-Skills"

Advises on the design of the three core figures in a technical paper: the Motivated Example (Figure 1), the Solution Overview (Methodology), and the Experimental Results figures. Recommends the right design paradigm, layout, labelling, and tool for each figure type, then runs a quality-control audit. Use when the user asks to 'design a figure', 'draw Figure 1', 'plot experiment results', 'choose the right chart type', 'which figure tool to use', or 'figure looks unprofessional'.

2026-04-221.2k

intro-drafter.md

from "HKUSTDial/Supervisor-Skills"

Drafts a 6-paragraph Introduction outline for a technical paper from a structured Flowchart: background and running example, existing limitations, problem essence and goal, key challenges, solution overview, contributions. Positions the paper as Technique or New Problem/Setting and aligns contributions with challenges. Use when the user asks to 'draft the Introduction', 'outline the Introduction', 'intro logic needs clarifying', 'help structure the paper story', or before writing any Introduction prose.

2026-04-221.2k

pre-submission-reviewer.md

from "HKUSTDial/Supervisor-Skills"

Runs a pre-submission review of a technical paper across five dimensions: macro logic, writing details, English grammar, LaTeX formatting, and figure quality. Uses a reviewer-style severity taxonomy (CRITICAL / MAJOR / MINOR) and flags banned AI-tone vocabulary and em-dash misuse. Use when the user asks to 'review this paper', 'audit before submission', 'check the draft', 'find issues', 'proofread', or within one week of a submission deadline.

2026-04-221.2k

tech-paper-template.md

from "HKUSTDial/Supervisor-Skills"

Structures a technical paper's full logical skeleton using a thinking-template table (research background, limitations, key idea or goal, challenges, methodology modules, contributions), positions the paper as Technique or New Problem/Setting, and runs a four-point self-consistency check. Use when the user is brainstorming a paper, discussing progress with an advisor, or planning the paper before drafting. Also use for 'paper skeleton', 'paper logic chain', 'thinking template', 'paper-structure planning'.

2026-04-221.2k

vibe-research-workflow.md

from "HKUSTDial/Supervisor-Skills"

Guides AI-assisted research across three sub-flows, Vibe Coding, Vibe Figure, and Vibe Writing, with behavioural rules that keep the user in charge of academic judgment while delegating mechanical work to AI. Recommends the right tool (Cursor, Claude Code, Codex, Figma, Gemini) for the current stage. Use when the user asks 'how to use AI for research', 'Vibe Coding tips', 'AI-assisted writing workflow', 'which AI tool for this', or starts an AI-assisted work session.

2026-04-221.2k

benchmark-paper-template.md

from "HKUSTDial/Supervisor-Skills"

Structures Benchmark and Evaluation papers using the five-pillar framework (Research Gap, Construction Pipeline, Evaluation Framework, Empirical Findings, optional Companion Method). Returns a completeness audit, a six-part Introduction logic chain, a Section 2-7 skeleton, and a pre-submission checklist. Use when writing a benchmark paper, structuring a benchmark paper, checking whether a benchmark idea is substantive, drafting a benchmark Introduction, or planning the data-construction pipeline or experiments.

2026-04-191.2k

package.json

"author": "HKUSTDial"

"repository": "HKUSTDial/Supervisor-Skills"

Abrir repositorio de GitHub Ver repositorios del creador

$ install --global

$ download --local

Ejecutar en Manus

$ useful --forSOC

Bioquímicos y biofísicosCiencias de la vida, físicas y sociales19-1021L4

name	idea-evaluator
description	Evaluates a preliminary research idea against a five-dimension framework (Higher, Faster, Stronger, Cheaper, Broader) plus idea-lifecycle and student-capability matching, paradigm-shift probing, and a fatal-flaws audit. Returns a reviewer-style verdict. Use when the user has a draft research idea and asks whether it is worth pursuing, asks to 'evaluate this idea', 'score this idea', 'assess feasibility', 'novelty check', 'is this a good research direction', or before committing to a paper scope.
license	CC-BY-4.0

Idea Evaluator

Overview

This skill evaluates a preliminary research idea from the combined perspective of a top-venue reviewer and an experienced advisor. It scores the idea against five improvement dimensions from the idea-generation guide (Higher, Faster, Stronger, Cheaper, Broader), matches the idea's lifecycle against the user's actual capability and available hours per week, probes whether the idea has paradigm-shift potential, flags fatal flaws, and returns one of three verdicts: Strong Accept, Accept with Revisions, or Reject and Pivot.

The goal is to kill weak ideas before the student invests months, and to shape promising-but-underdeveloped ideas into stronger forms before writing begins.

When to use this skill

The user has a draft idea and asks whether it is worth pursuing.
The user asks for novelty check, feasibility assessment, or scoring.
Before the user commits to a paper scope or starts implementation.
The user is comparing two or three candidate ideas and needs a structured trade-off.
The user suspects scope creep and wants an external check.
The user mentions 'evaluate this idea', 'score this idea', 'assess feasibility', or 'is this a good research direction'.

When NOT to use this skill

The user has already implemented the idea and is writing the paper. Use intro-drafter, tech-paper-template, or benchmark-paper-template (separate plugin) instead.
The user explicitly wants brainstorming of new ideas from scratch. Use plain conversation; see handbook 2.3 for a disruptive-innovation playbook.
The user asks for review of an existing manuscript. Use pre-submission-reviewer.
The user asks to evaluate a benchmark contribution specifically. Use benchmark-paper-template (separate plugin) in targeted mode.

Core procedure

Step 1: First impression and paper-type positioning

Read the user's idea description. In one paragraph, state whether the idea reads as Novel Problem, Novel Method, or New Setting. Is the story compelling in one sentence? If you cannot write that sentence, the idea itself is probably not yet clear enough for evaluation; ask the user to restate.

Step 2: Fatal-flaws audit (early gate)

See: references/fatal-flaws.md for the ten canonical fatal flaws, each with a detection rule and a defense strategy.

Run the fatal-flaws audit before the scoring steps rather than after them. Identify at most two fatal flaws. For each, state the flaw, cite the detection rule, and recommend a concrete defense.

Short-circuit rule. If any fatal flaw is tagged CRITICAL in the severity taxonomy (single-handedly causes rejection, unfixable within the lifecycle), stop here and emit the verdict directly:

Verdict: Reject and Pivot.
Output sections 1 (First impression), 2 (Fatal flaws with the CRITICAL flaw), and 8 (Verdict with the flaw-driven rationale) only.
Do not run the five-dimension scoring, paradigm-shift probe, feasibility check, or integrity gate. Those would be decoration on a rejection.

If no CRITICAL flaw is found, continue to Step 3.

Step 3: Lifecycle and capability matching

See: references/lifecycle-capability-matching.md for the six-category lifecycle matrix, capability self-assessment rubric, and mismatch recovery strategies.

Map the idea onto one of six categories (Application, Foundational Theory, Cross-Disciplinary, Frontier Exploration, Data-Intensive, Innovative Technique). Match against the user's declared capability (effective hours per week, skill depth, theoretical versus applied strength). Output a mismatch flag if lifecycle is shorter than the user's realistic execution window.

Step 4: Five-dimension scoring

See: references/five-dimensions.md for each dimension's entry strategies, scoring rubric, and worked examples.

Score the idea on each of:

Higher: effectiveness and accuracy gains.
Faster: efficiency and cost reduction.
Stronger: robustness, noise tolerance, generalisation.
Cheaper: data, annotation, or solution cost reduction.
Broader: cross-domain transplantation or unification.

Score each 1-10 with explicit evidence from the user's stated contribution. Identify the two or three dimensions where the idea has the highest ceiling and recommend emphasising those in the paper.

Step 5: Paradigm-shift probe

See: references/paradigm-shift-probe.md for the four probing principles (First Principles, Elephant in the Room, Technology Cycle, Hamming's Rule) and the cross-reference to handbook section 2.3 when deeper disruptive-innovation exploration is needed.

Test the idea against four questions:

Does it challenge a hidden assumption the field takes for granted?
Does it address an elephant-in-the-room problem everyone sees but nobody wants to touch?
Does it ride a technology-cycle shift (for example, LLMs making a previously impractical approach now feasible)?
If this problem solved itself, would the field change meaningfully? (Hamming's Rule)

Two or more yes answers means the idea has disruptive potential. Note that, and recommend reading handbook 2.3 to deepen the thinking on disruptive-innovation dimensions.

Step 6: Feasibility check

Against the user's stated resources (hardware, data access, team size, engineering skills, timeline), assess:

Compute risk: does the experiment fit on stated hardware?
Data risk: is the required data accessible, or does it need expensive annotation or private sources?
Engineering risk: does the implementation match the user's skill stack?
Timeline risk: does the estimated end-to-end duration (coding, experiments, writing, revision) fit within the idea's lifecycle?

If any risk is high, flag it explicitly with a suggested mitigation.

Step 7: Integrity gate

Before emitting the verdict, run the checks in the Integrity gate section below.

Step 8: Final verdict

Issue one of three verdicts:

Strong Accept: execute now. Two or more dimensions at 8+, no fatal flaws, capability match green, lifecycle fit.
Accept with Revisions: pivot the scope per recommendations before starting. Some dimensions weak, fixable flaws, or lifecycle mismatch that can be shortened.
Reject and Pivot: do not pursue this version. Dominated by a prior benchmark or method, unfixable capability mismatch, or more than one fatal flaw.

Emit the evaluation in the Output format below.

Integrity gate

Each bullet is tagged with an enforceability class. [inspection] means the LLM can verify the bullet from the produced output alone. [attestation] means the LLM states it has done the check, but the user remains responsible for verification. [user-attest] means the bullet is a user-side rule the skill cannot confirm.

Before returning the verdict:

[inspection] Every dimension score cites specific evidence from the user's stated contribution; no score is "gut feeling".
[inspection] Feasibility claims reference the user's stated resources, not generic assumptions.
[inspection] Novelty claims either cite specific prior work or are labelled "unverified; literature check required".
[inspection] Fatal flaws are specific and actionable; "this might not work" is not a flaw statement.
[inspection] Verdict is consistent with scoring: Strong Accept requires at least two dimensions at 8+ and zero CRITICAL flaws.
[inspection] Paradigm-shift claim cites which probing question was answered positively.
[attestation] Lifecycle prediction is reasoned from the field's recent pace; the user should sanity-check against their own knowledge of the subfield before acting on it.

If any [inspection] check fails, downgrade the verdict and mark the corresponding output section as "needs user attention". For [attestation] bullets, the skill states the check was run and the user confirms the result.

Output format

1. First impression

Paper type:
One-sentence story: <...>

2. Fatal-flaws audit (early gate)

#	Flaw	Severity	Defense
1	...	CRITICAL or MAJOR	...

If any CRITICAL flaw is present, skip sections 3-7 and go to section 8 with verdict Reject and Pivot.

3. Lifecycle and capability match

Aspect	User's input	Assessment
Idea category	...	...
Lifecycle	... months	...
Weekly effective hours	...	...
Fit	...	Green or Yellow or Red

4. Five-dimension radar

Dimension	Score 1-10	Evidence	Lift suggestion
Higher	...	...	...
Faster	...	...	...
Stronger	...	...	...
Cheaper	...	...	...
Broader	...	...	...

5. Paradigm-shift probe

Probe	Yes or No	Rationale
First Principles	...	...
Elephant in the Room	...	...
Technology Cycle	...	...
Hamming's Rule	...	...

Disruptive potential: <none, possible, strong>.

6. Feasibility

Risk	Level	Mitigation
Compute	...	...
Data	...	...
Engineering	...	...
Timeline	...	...

7. Integrity gate result

Gate 1 through 7:

8. Verdict

Top three actions to take first:

...

...

...

idea-evaluator

Más de este repositorio

Más de este repositorio

Idea Evaluator

Overview

When to use this skill

When NOT to use this skill

Core procedure

Step 1: First impression and paper-type positioning

Step 2: Fatal-flaws audit (early gate)

Step 3: Lifecycle and capability matching

Step 4: Five-dimension scoring

Step 5: Paradigm-shift probe

Step 6: Feasibility check

Step 7: Integrity gate

Step 8: Final verdict

Integrity gate

Output format

1. First impression

2. Fatal-flaws audit (early gate)

3. Lifecycle and capability match

4. Five-dimension radar

5. Paradigm-shift probe

6. Feasibility

7. Integrity gate result

8. Verdict

Idea Evaluator

Overview

When to use this skill

When NOT to use this skill

Core procedure

Step 1: First impression and paper-type positioning

Step 2: Fatal-flaws audit (early gate)

Step 3: Lifecycle and capability matching

Step 4: Five-dimension scoring

Step 5: Paradigm-shift probe

Step 6: Feasibility check

Step 7: Integrity gate

Step 8: Final verdict

Integrity gate

Output format

1. First impression

2. Fatal-flaws audit (early gate)

3. Lifecycle and capability match

4. Five-dimension radar

5. Paradigm-shift probe

6. Feasibility

7. Integrity gate result

8. Verdict