researcher-research

Stars: 0

Forks: 0

Updated: April 10, 2026, 15:57

Research public datasets, benchmarks, documentation, and source material for official skill eval cases. Use this skill whenever a prompt or skill needs grounded public examples, authoritative dataset references, or a primary-source brief before synthesis or optimization.

Source

Tyler-R-Kendrick

Tyler-R-Kendrick/copilot-auto-training

Related occupations (SOC)

Based on the SOC occupational classification

Social Scientists and Related Workers, All Other · Life, Physical, and Social Science · SOC 19-3099

SKILL.md

name: researcher-research
description: Research public datasets, benchmarks, documentation, and source material for official skill eval cases. Use this skill whenever a prompt or skill needs grounded public examples, authoritative dataset references, or a primary-source brief before synthesis or optimization.
license: MIT
compatibility: Requires Python 3.11+. Produces standalone research briefs, ranked source shortlists, and eval-authoring notes for this repository.
metadata: {"author":"Tyler Kendrick","version":"0.1.0"}

Research

Use this researcher-owned skill to research source material and produce a standalone research dossier before generating official skill eval cases.

Work primary-source-first. Resolve the task boundary and missing constraints before searching. End with approved sources and mapping notes, not guessed eval rows.

When to use this skill

The optimizer or evaluation workflow needs grounded eval cases, but no suitable local source material exists yet.
The user wants grounded public examples instead of purely simulated rows.
The agent needs a ranked shortlist of public datasets, benchmarks, or documentation sources that match a prompt task.
The user needs explicit judgment about source quality, data reliability, annotation quality, licensing, provenance, or leakage risk before authoring eval data.
The workflow needs to know whether no acceptable public source exists, so synthesis should stop instead of guessing.

If the source material is already known and the job is to convert it into eval rows, use a synthesis workflow instead.

Inputs

prompt_file: target markdown prompt
task_description: short description of the real task the prompt should solve
scoring_rule: expected answer format or evaluation rule
Optional constraints such as domain, language, geography or jurisdiction, recency, licensing, privacy, label taxonomy, or excluded source types

Resolve Before Searching

Resolve these inputs before recommending sources:

prompt interface and placeholders
real task boundary and evaluation target
expected answer format or scoring rule
domain, language, and jurisdiction constraints
licensing or privacy limits
recency and version expectations

If any of these materially affect source selection and are missing, ask first. Do not guess them from context.
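The "ask first, do not guess" rule can be sketched as a small pre-search check. This is an illustrative sketch only: the `ResearchInputs` class, the `missing_inputs` function, and the constraint keys are hypothetical names, not part of the skill or of `scripts/run_research.py`.

```python
from dataclasses import dataclass, field
from typing import Optional

# Hypothetical container; field names mirror the "Inputs" list above.
@dataclass
class ResearchInputs:
    prompt_file: Optional[str] = None
    task_description: Optional[str] = None
    scoring_rule: Optional[str] = None
    # Optional constraints such as domain, language, licensing, or privacy.
    constraints: dict[str, str] = field(default_factory=dict)

REQUIRED = ("prompt_file", "task_description", "scoring_rule")

def missing_inputs(inputs: ResearchInputs,
                   needed_constraints: tuple[str, ...] = ()) -> list[str]:
    """Return the names of inputs that must be elicited before searching.

    `needed_constraints` lists the optional constraints that materially
    affect source selection for this task (an assumption the caller supplies).
    """
    missing = [name for name in REQUIRED if not getattr(inputs, name)]
    missing += [c for c in needed_constraints if c not in inputs.constraints]
    return missing

inputs = ResearchInputs(prompt_file="prompts/classify.md",
                        scoring_rule="exact match")
print(missing_inputs(inputs, needed_constraints=("licensing",)))
```

A non-empty result means the workflow should ask the user rather than infer the values from context.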

Output

Return a standalone research brief that includes:

Target layout: the derived evals/evals.json path and any evals/files/ assets implied by the prompt
Query plan: a primary-source-first search plan tied to the task and scoring rule
Approved sources: a ranked shortlist with authority, provenance, licensing, fit, and risk notes
Rejected candidates: weak or incompatible sources and why they were rejected
Mapping notes: how approved sources can become prompt rows, expected outputs, optional files, and objective assertions
Unresolved gaps: anything still blocking safe synthesis, including a recommendation to stop if no source clears the approval bar

If the inputs are already complete, say the plan is satisfied and proceed.
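One way to picture the brief is as a record with one field per section listed above. The sketch below is an assumption about shape only: the skill does not prescribe a serialization format, and the `ResearchBrief` class and its method names are hypothetical.

```python
import json
from dataclasses import dataclass, field, asdict

# Hypothetical structure; one field per section of the research brief.
@dataclass
class ResearchBrief:
    target_layout: dict = field(default_factory=dict)
    query_plan: list = field(default_factory=list)
    approved_sources: list = field(default_factory=list)
    rejected_candidates: list = field(default_factory=list)
    mapping_notes: list = field(default_factory=list)
    unresolved_gaps: list = field(default_factory=list)

    def should_stop(self) -> bool:
        """Recommend stopping synthesis when no source cleared the approval bar."""
        return not self.approved_sources

    def to_json(self) -> str:
        """Serialize the brief so another workflow can consume it."""
        return json.dumps(asdict(self), indent=2)

brief = ResearchBrief(target_layout={"evals_json": "evals/evals.json"})
brief.unresolved_gaps.append("No candidate has explicit reuse terms yet.")
print(brief.should_stop())
```

Keeping rejected candidates and unresolved gaps as first-class fields makes an empty shortlist an explicit outcome rather than a silent failure.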

Research Plan

Before searching, build a short plan with these sections:

Target layout: derived eval paths, optional files directory, and any workspace directory implied by the prompt file
Observed interface: prompt placeholders and visible fields that source material must support
Research questions: what needs to be learned to ground eval authoring for this task
Approval bar: the evidence each approved source must provide
Missing inputs: any remaining blockers that need to be elicited

Use the plan to constrain the search. Do not collect sources first and rationalize them later.

Source approval bar

Approve a source only if it clears the relevant checks for this task:

accountable maintainer, publisher, or standards body
traceable data origin, schema, and label definitions
evaluation rules, annotation guide, or benchmark protocol from the owner when available
explicit license or reuse terms
stable version, date, or release identifier
acceptable contamination, leakage, privacy, and bias risk for authored eval use

If a candidate fails the bar, keep it only as a rejected lead, not an approved recommendation.
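The approval bar reads naturally as a checklist in which any failed relevant check demotes the candidate to a rejected lead. The following sketch assumes boolean check results recorded by the researcher; the check keys, the `Candidate` class, and the `triage` function are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass

# Hypothetical keys, one per check in the approval bar above.
APPROVAL_CHECKS = (
    "accountable_maintainer",
    "traceable_origin",
    "owner_protocol",
    "explicit_license",
    "stable_version",
    "acceptable_risk",
)

@dataclass
class Candidate:
    name: str
    checks: dict[str, bool]  # check name -> whether evidence was found

def triage(candidate: Candidate,
           relevant: tuple[str, ...] = APPROVAL_CHECKS) -> tuple[str, list[str]]:
    """Return ("approved", []) or ("rejected", [failed checks]).

    `relevant` lets the caller drop checks that do not apply to the task,
    for example an owner protocol that simply does not exist.
    """
    failed = [c for c in relevant if not candidate.checks.get(c, False)]
    return ("approved", []) if not failed else ("rejected", failed)

mirror = Candidate("scraped-mirror", {"explicit_license": True})
print(triage(mirror))
```

A check with no recorded evidence counts as failed, which matches the rule that an unverified candidate stays a rejected lead.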

Process

1. Inspect the prompt placeholders and derive the official evals/evals.json target path plus any evals/files/ assets.
2. Build the research plan before searching. Identify the task boundary, required evidence, and any missing constraints.
3. If key constraints are missing, ask for them before continuing.
4. Build research queries that match the task, scoring rule, and prompt-visible fields, starting with official maintainers, benchmark owners, dataset cards, annotation guides, standards bodies, and original papers.
5. Apply academic-style source triage: prefer primary sources first, use credible secondary sources only to discover or verify primary material, and reject derivative mirrors, listicles, or unverifiable blog summaries.
6. Judge each candidate source for authority, provenance, annotation quality, task fit, recency, version stability, licensing, contamination risk, and reuse constraints.
7. Rank the approved sources, record rejection reasons for weak candidates, and summarize the evidence behind each ranking.
8. Map approved source fields into realistic prompt rows, expected outputs, optional input files, and objective assertions, noting constraints or unresolved gaps.
9. If no candidate clears the approval bar, say so explicitly and explain what evidence is missing instead of forcing a recommendation.
10. Deliver the completed research brief as a self-contained artifact that another workflow can consume for eval authoring, or use directly.
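The first step of the process (inspecting placeholders and deriving eval targets) might look like the sketch below. It rests on two assumptions that the skill text does not confirm: that placeholders use `{name}` curly-brace syntax, and that the `evals/` directory sits beside the prompt file. The function name `derive_targets` is hypothetical and is not the interface of `scripts/run_research.py`.

```python
import re
from pathlib import PurePosixPath

# Assumption: placeholders look like {name}; adjust for other template syntaxes.
PLACEHOLDER = re.compile(r"\{([A-Za-z_][A-Za-z0-9_]*)\}")

def derive_targets(prompt_file: str, prompt_text: str) -> dict:
    """Derive eval paths and placeholder names for a prompt file.

    Assumption: evals/evals.json and evals/files/ live next to the prompt.
    """
    base = PurePosixPath(prompt_file).parent
    return {
        "evals_json": str(base / "evals" / "evals.json"),
        "files_dir": str(base / "evals" / "files"),
        "placeholders": sorted(set(PLACEHOLDER.findall(prompt_text))),
    }

targets = derive_targets("skills/demo/prompt.md",
                         "Classify {text} into one {label}. Input: {text}")
print(targets)
```

The placeholder list doubles as the "observed interface": approved sources need fields that can fill each of these names.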

Source hierarchy

Prefer sources in this order:

1. Official primary sources: benchmark owner sites, maintainer repositories, dataset cards, annotation guidelines, standards bodies, and original papers.
2. High-credibility secondary sources: trusted documentation mirrors, library docs maintained by the source owner, or peer-reviewed comparative surveys that cite the primary source.
3. Tertiary summaries: blog posts, tutorials, scraped mirrors, SEO roundups, or anonymous aggregators. Treat these as discovery hints only and do not rely on them when a primary source is available.
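The ordering above can be encoded as a tier value that drives ranking, with tertiary items kept apart as discovery hints. This is a minimal sketch; the `Tier` enum and `split_by_tier` function are hypothetical names, and assigning a tier to a real source remains a human judgment.

```python
from enum import IntEnum

class Tier(IntEnum):
    PRIMARY = 1    # benchmark owners, maintainer repos, dataset cards, papers
    SECONDARY = 2  # owner-maintained docs, peer-reviewed surveys
    TERTIARY = 3   # blogs, tutorials, scraped mirrors, aggregators

def split_by_tier(sources: list[tuple[str, Tier]]):
    """Return (recommendable, hints).

    Recommendable sources are sorted primary-first; tertiary sources are
    returned separately because they serve as discovery hints only.
    """
    ranked = sorted(sources, key=lambda s: s[1])  # stable sort keeps input order within a tier
    recommendable = [s for s in ranked if s[1] != Tier.TERTIARY]
    hints = [s for s in ranked if s[1] == Tier.TERTIARY]
    return recommendable, hints

recommendable, hints = split_by_tier([
    ("blog roundup", Tier.TERTIARY),
    ("peer-reviewed survey", Tier.SECONDARY),
    ("dataset card", Tier.PRIMARY),
])
print([name for name, _ in recommendable])
```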

Research standards

Prefer official primary sources even when secondary summaries are easier to read.
Verify that labels, schemas, and evaluation rules come from the source owner whenever possible.
Record version, publication date, and licensing details before recommending a source.
Note sampling bias, benchmark contamination risk, train-test leakage risk, and label ambiguity when they could affect eval quality.
Reject sources that cannot be traced to an accountable maintainer or publication.
Separate research from synthesis: stop at mapping notes unless the user explicitly asks to author eval rows.

Elicit If Missing

Ask for missing details that change which public sources are acceptable, such as:

target domain terminology or user population
language or locale coverage
licensing or commercial-use requirements
privacy or data-handling restrictions
label taxonomy, class balance needs, or edge-case priorities
acceptable publication date range or version floor

If none are missing, say so explicitly and continue.

References and helper

Read references/dataset-research.md when you need a compact triage checklist.
Use scripts/run_research.py to derive eval targets, placeholders, and a research-brief scaffold when deterministic setup will save time.

Naming rationale

researcher-research keeps ownership explicit while preserving a narrow scope that separates source discovery from later synthesis and conversion.
