trainer-train-prompt

Stars: 0

Forks: 0

Updated: 12 April 2026 at 20:19

Own the end-to-end trainer loop for prompt-like files (*.prompt.md, *.prompty, *.instructions.md, system prompts, and other natural-language instruction artifacts). Use this whenever the caller needs to research, synthesize datasets, optimize, validate, and write back a trained candidate for a prompt-type target. Prefer this specialized loop for any file whose primary content is natural-language instructions rather than code, skill configuration, or agent contracts.

Source

Tyler-R-Kendrick

Tyler-R-Kendrick/copilot-auto-training

name	trainer-train-prompt
description	Own the end-to-end trainer loop for prompt-like files (*.prompt.md, *.prompty, *.instructions.md, system prompts, and other natural-language instruction artifacts). Use this whenever the caller needs to research, synthesize datasets, optimize, validate, and write back a trained candidate for a prompt-type target. Prefer this specialized loop for any file whose primary content is natural-language instructions rather than code, skill configuration, or agent contracts.
argument-hint	Describe the target prompt file, the repository root, the validation command, the available stage capabilities (researcher, synthesizer, optimizer, elector), and any existing dataset or workspace artifacts.
license	MIT
compatibility	Python 3.11+. Works in any repository that keeps trainer artifacts in `.trainer-workspace/` next to the selected target.
metadata	{"author":"Tyler Kendrick","version":"0.1.0"}

Trainer Train - Prompt

Use this skill as the orchestration contract for one trainer run against a prompt-like target: any *.prompt.md, *.prompty, *.instructions.md, system prompt, or natural-language instruction file.

Read references/prompt-loop-contract.md for the full routing table, judge-mode rules, and prompt-specific validation constraints before any stage execution.

When to use this skill

The selected target is a prompt file, instruction file, or prompty artifact.
The caller needs to initialize or resume a trainer workspace for a prompt target.
Missing datasets or eval manifests need to be synthesized before optimization.
The optimization stage returns a manual follow-up payload and the loop must continue.
A winning candidate needs to be validated and written back to the source prompt file.

Do not use this skill for code files, skill files, or agent contract files. Read the parent trainer skill's references/target-routing.md to identify the appropriate specialist for those target types.

Required inputs

One selected prompt-like target file.
Repository root or enough path context to derive the local trainer workspace.
The validation command for the repository (e.g., python -m pytest -q).
The concrete stage capability map: researcher, synthesizer, optimizer, elector.
The currently available specialist-agent roster.
Any existing workspace artifacts to reuse.

Prompt-specific loop rules

Judge mode

Use the following routing table (sourced verbatim from references/prompt-loop-contract.md) to infer scoring mode from dataset row shape:

Row shape	Inferred mode
Explicit `scoring: exact_match`	`deterministic`
Explicit `scoring: llm_judge`	`llm_judge`
Explicit `scoring` (any other value)	Use that value as authoritative
`reference` + `criteria` fields, no explicit `scoring`	`llm_judge`
`expected` field only, task has one correct answer	Consider `deterministic`; default to `llm_judge` if ambiguous
No scoring fields	Default to `llm_judge` for prompt targets

If the caller explicitly supplies a scoring_mode override at invocation time, treat that value as authoritative and skip per-row mode inference. Even with a caller-supplied override, still validate that the train and validation splits are internally consistent (i.e., do not imply conflicting modes) before proceeding; report a blocker if they are inconsistent.
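The routing table and override rule above can be sketched in Python. This is a minimal illustration, not the skill's actual implementation: the function names, the `single_answer` flag (standing in for "task has one correct answer"), and the dict-shaped rows are assumptions made for the example.

```python
from typing import Any, Iterable, Mapping, Optional


def infer_judge_mode(
    row: Mapping[str, Any],
    override: Optional[str] = None,
    single_answer: bool = False,  # hypothetical flag: the task has one correct answer
) -> str:
    """Map one dataset row to a scoring mode, following the routing table."""
    if override is not None:
        return override  # a caller-supplied scoring_mode is authoritative
    scoring = row.get("scoring")
    if scoring == "exact_match":
        return "deterministic"
    if scoring is not None:
        return str(scoring)  # llm_judge or any other explicit value is used as-is
    if "reference" in row and "criteria" in row:
        return "llm_judge"
    expected_only = "expected" in row and not any(
        key in row for key in ("reference", "criteria")
    )
    if expected_only and single_answer:
        return "deterministic"
    return "llm_judge"  # ambiguous rows and rows with no scoring fields


def splits_consistent(
    train: Iterable[Mapping[str, Any]],
    validation: Iterable[Mapping[str, Any]],
) -> bool:
    """True when both splits imply exactly one shared mode (row-shape inference only)."""
    modes = {infer_judge_mode(r) for r in train} | {infer_judge_mode(r) for r in validation}
    return len(modes) == 1
```

A caller would run `splits_consistent` even when an override is supplied, and report a blocker when it returns `False`.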

Placeholder preservation

Never remove, rename, or reorder template placeholders (e.g., {{variable}}, {input}, <PLACEHOLDER>) during optimization or write-back. Confirm placeholder set is unchanged before any candidate write-back. Any rename is a failure — for example, changing {{user_query}} to {{query}} in a candidate is not an acceptable optimization and must be rejected.
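One way to check this rule mechanically is to extract the placeholders from both texts and compare them in order of appearance. The sketch below assumes the three placeholder styles named above and a deliberately strict comparison (same tokens, same order, same count); the regular expression and function names are illustrative, not part of the skill.

```python
import re

# Assumed placeholder syntaxes: {{variable}}, {input}, <PLACEHOLDER>.
# The double-brace alternative comes first so it is not split into a single-brace match.
PLACEHOLDER_RE = re.compile(r"\{\{\s*\w+\s*\}\}|\{\w+\}|<[A-Z][A-Z0-9_]*>")


def extract_placeholders(text: str) -> list[str]:
    """Return placeholders in order of appearance, duplicates included."""
    return PLACEHOLDER_RE.findall(text)


def placeholders_preserved(original: str, candidate: str) -> bool:
    """Strict check: nothing added, removed, renamed, or reordered."""
    return extract_placeholders(original) == extract_placeholders(candidate)


original = "Answer {{user_query}} using {context} and <TONE>."
rewritten = "Please answer {{user_query}} with {context}, in <TONE>."
renamed = "Answer {{query}} using {context} and <TONE>."

print(placeholders_preserved(original, rewritten))
print(placeholders_preserved(original, renamed))
```

The first call accepts a reworded candidate; the second rejects the `{{user_query}}` to `{{query}}` rename.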

Evaluator field isolation

Keep expected, reference, criteria, and scoring fields out of the prompt-visible render path. These are evaluator-only fields and must not appear in the optimized prompt text. See the write-back gate checklist in Step 11 for the corresponding pre-commit confirmation requirement.
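As a rough sketch of what isolation could look like, the snippet below separates evaluator-only fields from a dataset row before rendering. It assumes dict-shaped rows and a simple `{{name}}` substitution; `split_row` and `render` are hypothetical helpers, not functions from the trainer.

```python
from typing import Any, Mapping

EVALUATOR_FIELDS = frozenset({"expected", "reference", "criteria", "scoring"})


def split_row(row: Mapping[str, Any]) -> tuple[dict, dict]:
    """Separate prompt-visible inputs from evaluator-only fields."""
    visible = {k: v for k, v in row.items() if k not in EVALUATOR_FIELDS}
    evaluator = {k: v for k, v in row.items() if k in EVALUATOR_FIELDS}
    return visible, evaluator


def render(template: str, row: Mapping[str, Any]) -> str:
    """Fill {{name}} placeholders using prompt-visible fields only (assumed syntax)."""
    visible, _ = split_row(row)
    rendered = template
    for key, value in visible.items():
        rendered = rendered.replace("{{" + key + "}}", str(value))
    return rendered


row = {"input": "2+2", "expected": "4", "scoring": "exact_match"}
print(render("Q: {{input}}", row))
```

Because `render` never sees the evaluator dict, a template that mistakenly references `{{expected}}` is left unfilled rather than leaking the answer.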

Few-shot and chain-of-thought patterns

When the dataset rows expose example pairs or step-by-step reasoning traces, preserve those structural patterns in the optimized candidate. Do not flatten multi-turn or chain-of-thought structures into a single instruction block.

Core workflow

Follow this order. Consult references/prompt-loop-contract.md when artifact paths, scoring mode, or stage boundaries are uncertain.

1. Resolve target and workspace. Derive <prompt-name> using the canonical rule: strip .prompty entirely; for .md files strip only .md (e.g., summarize.prompt.md → summarize.prompt); otherwise use Path.stem. Use <target-dir>/.trainer-workspace/<prompt-name>/ as workspace root. If state indicates a resumed run, audit tracked artifact pointers and skip only stages that already produced valid outputs. The review checkpoint (Step 2) is exempt from this skip rule and must be re-confirmed on every run regardless of workflow state or tracked artifact pointers.
2. Require the workspace review checkpoint. Confirm the engineering review artifact exists before optimization starts. Report a blocker if it is absent.
3. Initialize or refresh workspace. Create or update workflow-status.json with an initial workflow_state value of pending_engineer_prompt. Create the review artifacts subdirectory, inputs/source/, and iterations/ directories. The exact review path is defined in references/prompt-loop-contract.md. Copy the target file as the source snapshot to inputs/source/<basename> (e.g., inputs/source/my-prompt.prompt.md).
4. Inspect existing datasets and evals. Prefer reuse when train, validation, and authored eval assets already fit the prompt target and scoring shape. Keep authored evals, train data, and validation data as separate artifacts in separate files.
5. Run missing-data path if needed. If any required dataset or eval is absent or the validation split is not a genuine holdout, pause optimization and gather them via the caller-supplied researcher and synthesizer before continuing.
6. Infer judge mode. Inspect representative dataset rows and apply the routing table in the Judge mode section above. Default to llm_judge for prompt targets. Treat an explicit row-level scoring declaration as authoritative. Stop and report inconsistency if train and validation splits imply different modes (see Blocker-first rule for resolution path).
7. Run at least one optimization pass. Pass the inferred judge mode and the prompt-specific constraints (placeholder preservation, evaluator field isolation) to the optimizer.
8. Handle manual follow-up if returned. Save the optimizer report as manual-followup-report.json, answer the model-facing prompt, persist the revised candidate as optimized-prompt.md, and continue the loop. After persisting optimized-prompt.md, confirm placeholder preservation (full set unchanged, no renames) before proceeding to election or write-back.
9. Run election if multiple candidates exist. Use the caller-supplied elector when optimization produces more than one defensible candidate. A candidate is defensible when it: (a) passes the repository validation command with exit code 0, (b) achieves a judge score strictly above the current baseline score, and (c) falls within the elector's acceptable margin relative to the top-ranked candidate. If election produces no defensible candidate, report a blocker: set workflow_state: pending_iteration_review, explain why no candidate passed the defensibility gate, and recommend a new optimization pass or caller override before continuing.
10. Publish iteration artifacts. Stage steering, candidate bundles, validation logs, and a decision summary under the active iteration directory.
11. Write back only when all gate conditions are satisfied. Confirm each of the following before writing the winning candidate back to the source prompt file:
   1. Validation passes: the repository validation command (e.g., python -m pytest -q) exits with code 0.
   2. Placeholder preservation confirmed: the full placeholder set is identical between the original and the candidate; no placeholder was added, removed, renamed, or reordered.
   3. Evaluator fields absent: expected, reference, criteria, and scoring fields do not appear in the candidate prompt text.
   4. Decision summary written: <workspace-root>/decision.md exists and records the winning candidate, scores, and justification.
   5. Baseline score gate: if a prior baseline score exists in the workspace, the candidate's judge score must be at or above that baseline. Record the candidate score in decision.md whether or not a prior baseline score exists.
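The workspace resolution and initialization steps at the start of this workflow can be sketched as follows. This is an illustration under stated assumptions: the function names are hypothetical, the status file is written only when absent so a resumed run is not overwritten, and the review artifacts subdirectory is omitted because its path is defined in references/prompt-loop-contract.md rather than here.

```python
import json
import shutil
from pathlib import Path


def derive_prompt_name(target: Path) -> str:
    """Apply the naming rule: drop .prompty, drop only a trailing .md, else use the stem."""
    name = target.name
    if name.endswith(".prompty"):
        return name[: -len(".prompty")]
    if name.endswith(".md"):
        return name[: -len(".md")]
    return target.stem


def workspace_root(target: Path) -> Path:
    """<target-dir>/.trainer-workspace/<prompt-name>/"""
    return target.parent / ".trainer-workspace" / derive_prompt_name(target)


def init_workspace(target: Path) -> Path:
    """Create the directories, the initial status file, and the source snapshot."""
    root = workspace_root(target)
    source_dir = root / "inputs" / "source"
    source_dir.mkdir(parents=True, exist_ok=True)
    (root / "iterations").mkdir(exist_ok=True)

    status_path = root / "workflow-status.json"
    if not status_path.exists():  # assumption: keep existing state on resumed runs
        status_path.write_text(
            json.dumps({"workflow_state": "pending_engineer_prompt"}, indent=2)
        )

    shutil.copy2(target, source_dir / target.name)  # snapshot at inputs/source/<basename>
    return root
```

For a target at `prompts/summarize.prompt.md`, `workspace_root` yields `prompts/.trainer-workspace/summarize.prompt`.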

Blocker-first rule

Stop and report a clear blocker before any optimization or rewrite when:

The workspace review artifact is absent.
Required datasets or authored evals are missing.
Tracked artifact pointers from a resumed run are missing or inconsistent.
Train and validation splits imply different judge modes — set workflow_state: pending_dataset_repair in workflow-status.json and choose one of two resolution paths: (1) repair or resplit the dataset so both splits share a consistent scoring shape, or (2) obtain an explicit caller-supplied scoring_mode override that supersedes row-shape inference.

A blocker report must name the missing artifact, explain why the loop cannot advance, and leave workflow-status.json in a resumable checkpoint state.
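A blocker report of this shape might be persisted as in the sketch below. The `blocker` key, its `artifact` and `reason` fields, and the `report_blocker` helper are assumptions for illustration; the source only specifies that workflow-status.json carries a workflow_state and must remain resumable.

```python
import json
from pathlib import Path
from typing import Optional


def report_blocker(
    workspace: Path,
    artifact: str,
    reason: str,
    state: Optional[str] = None,
) -> dict:
    """Record a blocker in workflow-status.json without discarding existing state."""
    status_path = workspace / "workflow-status.json"
    status = json.loads(status_path.read_text()) if status_path.exists() else {}
    if state is not None:
        status["workflow_state"] = state  # e.g. pending_dataset_repair
    # Hypothetical layout: name the missing artifact and explain why the loop stops.
    status["blocker"] = {"artifact": artifact, "reason": reason}
    status_path.write_text(json.dumps(status, indent=2))
    return status["blocker"]
```

Merging into the existing file, rather than rewriting it, keeps any tracked artifact pointers available when the run resumes.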

Output contract

Return:

Workspace status and any active blockers.
Current-turn decisions: reuse choice, judge mode, selected branch, blockers.
Optimization or manual follow-up status with artifact paths — manual-followup-report.json and optimized-prompt.md for manual follow-up branches, or optimize-report.json for direct optimization branches.
Placeholder preservation confirmation.
Validation status.
Write-back decision and justification.
Next required action to resume or continue the loop.