一键在 Manus 中运行任何 Skill

$pwd:

research-reporting

Name: Research Reporting
Author: ProfSynapse

// Create structured research notes from experiment runs and analysis artifacts. Use when creating a note at run launch, updating it as training/evaluation/loss stages finish, summarizing a finished run, comparing experiment outcomes, extracting hypotheses from eval/loss artifacts, or proposing next-run actions grounded in `.tracking/experiments/<id>/analysis/` outputs. This skill is about turning repo-native experiment evidence into stable, machine-readable markdown.

在 Manus 中运行

$ git log --oneline --stat

stars:23

forks:3

updated:2026年4月2日 23:33

文件资源管理器

3 个文件

SKILL.md

readonly

name	research-reporting
description	Create structured research notes from experiment runs and analysis artifacts. Use when creating a note at run launch, updating it as training/evaluation/loss stages finish, summarizing a finished run, comparing experiment outcomes, extracting hypotheses from eval/loss artifacts, or proposing next-run actions grounded in `.tracking/experiments/<id>/analysis/` outputs. This skill is about turning repo-native experiment evidence into stable, machine-readable markdown.
allowed-tools	Read, Bash, Write, Grep, Glob

Research Reporting

Generate compact research notes that are easy to read and easy to parse later.

Use This Skill When

The user wants a research note, experiment summary, post-run analysis, or structured markdown output.
The source of truth is an experiment bundle under .tracking/experiments/<id>/.
The output should include stable frontmatter and explicit evidence for claims.
The note should be created early and updated through the lifecycle of one experiment.

Default Workflow

Resolve the experiment id and open .tracking/experiments/<id>/experiment.json.
If spec_path is present, read the experiment spec so the note captures actual config numbers instead of only outcome artifacts.
Read primary analysis artifacts in this order:
- analysis/experiment_summary.json
- analysis/next_run_candidates.json
- analysis/hypothesis_context.json
- analysis/run_matrix.csv
Read failure slices only if you need representative examples:
- analysis/failure_slices/eval_failures.jsonl
- analysis/failure_slices/high_loss_examples.jsonl
Read stage lineage files when you need provenance, timing, commit, hardware, or cost details.
Write the note from assets/research_note_template.md.

Load reference/artifact-map.md when you need to know which artifact supports which section.

Lifecycle Modes

Use the same note template for all three modes:

Launch note:
- Create the note as soon as the experiment is launched or selected.
- Fill identity, config, and known runtime fields.
- Leave future metrics and recommendation fields empty.
Stage update:
- Re-open the same note after training, evaluation, loss, or analysis completes.
- Update only the fields now supported by artifacts.
- Preserve prior fields unless newer canonical artifacts supersede them.
Final note:
- After analysis/recommendation, ensure the note contains the final status, observed outcomes, hypotheses, and next-run recommendation.

Default policy: one evolving note per experiment, not one note per stage.

Reporting Rules

Keep frontmatter keys stable across notes. Prefer null, [], or omitted sections over ad hoc placeholder prose.
Keep the schema general: use grouped maps for metrics and config instead of adding one top-level key per experiment-specific number.
Support partial completion. A note does not need all stages populated to be valid.
Separate three things clearly:
- observed: directly supported by artifacts
- inferred: reasoned from observed evidence
- proposed: next-run actions or hypotheses
Prefer exact values from JSON for frontmatter. Round only in prose if readability improves.
Preserve config numbers exactly as found in the spec or lineage artifacts. Do not normalize 1.0e-4 into prose-only text and do not drop unset knobs that matter to interpretation.
Do not cite experiment_summary.md as the primary evidence source when the JSON exists.
Do not invent comparisons, baselines, or causes. If a baseline run is missing, state that it is missing.
When the analysis bundle includes a ranked recommendation, carry over:
- selected_candidate_rank
- selected_candidate_confidence
- recommended_next_action
If loss artifacts are absent or failed, keep loss fields null and note the missing stage in the body.
If a run has custom metrics, place them under metrics.summary or metrics.groups.<stage> rather than forcing them into a fixed eval/loss schema.
If a run has stage-specific knobs, place them under config_snapshot.<stage> rather than flattening them into root keys.
When updating an existing note, overwrite fields only when the new source is more canonical or more complete than the prior one.
For in-flight runs, prefer explicit stage_statuses and partial sections over vague prose like "still running."

Note Shape

Use the template exactly once per note and keep these sections in this order:

Summary
Run Context
Observed Results
Failure Analysis
Hypotheses
Next Run
Sources

The frontmatter is for machine-readable indexing. The body is for human judgment and downstream review.

Interpretation Heuristics

Treat experiment_summary.json as the canonical top-level snapshot.
Treat experiment.json plus the referenced experiment spec as the canonical source for config intent.
Treat next_run_candidates.json as the canonical source for ranked recommendations and high-loss snapshot summaries.
Use hypothesis_context.json when you need richer tag-level evidence or supporting context behind a recommendation.
Use run_matrix.csv to confirm stage status rather than inferring completion from one artifact alone.
If schema pass rate is materially higher than behavior pass rate, call out behavior reliability as the likely bottleneck instead of just "tool calling."

Output Discipline

Use short paragraphs and flat bullets.
Name concrete failure families or tags when the artifacts support them.
Include exact artifact paths in sources.
If the user asks for a comparison note, keep the same template and populate comparison_experiment_ids.

Bundled Resources

Template: assets/research_note_template.md
Artifact guide: reference/artifact-map.md

related-skills.json

同仓库

fine-tuning.md

from "ProfSynapse/Synaptic-Tuner"

Complete reference for the fine-tuning pipeline (SFT, KTO, GRPO), cloud HF Jobs workflows, autonomous experiment search, checkpoint evaluation, and LoRA surgery. Covers training CLI flags, YAML configuration, model presets, dataset requirements, LoRA settings, training monitoring, hyperparameter search, and post-training optimization. Use when training models, configuring training runs, choosing hyperparameters, running cloud experiments, inspecting HF jobs, or troubleshooting training issues. This skill is about USING the training system via CLI and YAML — never modifying source code.

2026-05-2923

synthetic-data-generation.md

from "ProfSynapse/Synaptic-Tuner"

Complete reference for the SynthChat synthetic dataset generation system. Covers CLI commands (generate, improve, validate), scenario YAML authoring, rubric YAML authoring, settings configuration, evaluation, and full workflow. Use when generating datasets, writing rubrics/scenarios, configuring models/workers, improving dataset quality, or running evaluations. This skill is about USING the system via CLI and YAML — never modifying source code.

2026-05-2923

case-studies.md

from "ProfSynapse/Synaptic-Tuner"

End-to-end case studies showing how to implement the full training pipeline for different skill types. Covers three complete worked examples — tool-calling training, essay-style training, and agentic search (RAG agent) training — demonstrating dataset design, synthetic generation, validation, fine-tuning, evaluation, and iteration. Use when onboarding to the project, understanding how all components fit together, explaining the pipeline to others, or planning a new training capability. This skill is about UNDERSTANDING the system holistically — reference the other skills for specific CLI commands.

2026-05-2923

upload-deployment.md

from "ProfSynapse/Synaptic-Tuner"

Complete reference for model upload and deployment. Covers HuggingFace upload, save strategies (LoRA, merged 16-bit, merged 4-bit), GGUF conversion, model merging, model cards, and the full upload workflow. Use when uploading models, creating GGUF files, merging LoRA adapters, or deploying to HuggingFace. This skill is about USING the upload/deployment tools via CLI — never modifying source code.

2026-05-0123

evaluation.md

from "ProfSynapse/Synaptic-Tuner"

Complete reference for the config-first model evaluation system. Covers the Evaluator CLI, assertion-driven YAML scenarios, response views, backend configuration, presets, scoring, LLM-as-judge, model comparison, and HuggingFace integration. Use when evaluating models, writing test prompts, comparing training runs, or interpreting eval results. This skill is about USING the evaluation system via CLI and YAML.

2026-04-2423

dataset-publishing.md

from "ProfSynapse/Synaptic-Tuner"

Publish local dataset artifacts to a Hugging Face dataset repo. Use when uploading a JSONL dataset, pushing a filtered dataset variant, syncing a matching .metadata.json sidecar, or renaming a dataset file in the target repo. This skill is about USING the checked-in dataset publish script via CLI — never ad hoc Python.

2026-03-2223

package.json

"author": "ProfSynapse"

"repository": "ProfSynapse/Synaptic-Tuner"

打开 GitHub 仓库查看创作者相关仓库

$ install --global

$ download --local

在 Manus 中运行

$ useful --forSOC

数据科学家计算机与数学类职业15-2051L4

name	research-reporting
description	Create structured research notes from experiment runs and analysis artifacts. Use when creating a note at run launch, updating it as training/evaluation/loss stages finish, summarizing a finished run, comparing experiment outcomes, extracting hypotheses from eval/loss artifacts, or proposing next-run actions grounded in `.tracking/experiments/<id>/analysis/` outputs. This skill is about turning repo-native experiment evidence into stable, machine-readable markdown.
allowed-tools	Read, Bash, Write, Grep, Glob

Research Reporting

Generate compact research notes that are easy to read and easy to parse later.

Use This Skill When

The user wants a research note, experiment summary, post-run analysis, or structured markdown output.
The source of truth is an experiment bundle under .tracking/experiments/<id>/.
The output should include stable frontmatter and explicit evidence for claims.
The note should be created early and updated through the lifecycle of one experiment.

Default Workflow

Resolve the experiment id and open .tracking/experiments/<id>/experiment.json.
If spec_path is present, read the experiment spec so the note captures actual config numbers instead of only outcome artifacts.
Read primary analysis artifacts in this order:
- analysis/experiment_summary.json
- analysis/next_run_candidates.json
- analysis/hypothesis_context.json
- analysis/run_matrix.csv
Read failure slices only if you need representative examples:
- analysis/failure_slices/eval_failures.jsonl
- analysis/failure_slices/high_loss_examples.jsonl
Read stage lineage files when you need provenance, timing, commit, hardware, or cost details.
Write the note from assets/research_note_template.md.

Load reference/artifact-map.md when you need to know which artifact supports which section.

Lifecycle Modes

Use the same note template for all three modes:

Launch note:
- Create the note as soon as the experiment is launched or selected.
- Fill identity, config, and known runtime fields.
- Leave future metrics and recommendation fields empty.
Stage update:
- Re-open the same note after training, evaluation, loss, or analysis completes.
- Update only the fields now supported by artifacts.
- Preserve prior fields unless newer canonical artifacts supersede them.
Final note:
- After analysis/recommendation, ensure the note contains the final status, observed outcomes, hypotheses, and next-run recommendation.

Default policy: one evolving note per experiment, not one note per stage.

Reporting Rules

Keep frontmatter keys stable across notes. Prefer null, [], or omitted sections over ad hoc placeholder prose.
Keep the schema general: use grouped maps for metrics and config instead of adding one top-level key per experiment-specific number.
Support partial completion. A note does not need all stages populated to be valid.
Separate three things clearly:
- observed: directly supported by artifacts
- inferred: reasoned from observed evidence
- proposed: next-run actions or hypotheses
Prefer exact values from JSON for frontmatter. Round only in prose if readability improves.
Preserve config numbers exactly as found in the spec or lineage artifacts. Do not normalize 1.0e-4 into prose-only text and do not drop unset knobs that matter to interpretation.
Do not cite experiment_summary.md as the primary evidence source when the JSON exists.
Do not invent comparisons, baselines, or causes. If a baseline run is missing, state that it is missing.
When the analysis bundle includes a ranked recommendation, carry over:
- selected_candidate_rank
- selected_candidate_confidence
- recommended_next_action
If loss artifacts are absent or failed, keep loss fields null and note the missing stage in the body.
If a run has custom metrics, place them under metrics.summary or metrics.groups.<stage> rather than forcing them into a fixed eval/loss schema.
If a run has stage-specific knobs, place them under config_snapshot.<stage> rather than flattening them into root keys.
When updating an existing note, overwrite fields only when the new source is more canonical or more complete than the prior one.
For in-flight runs, prefer explicit stage_statuses and partial sections over vague prose like "still running."

Note Shape

Use the template exactly once per note and keep these sections in this order:

Summary
Run Context
Observed Results
Failure Analysis
Hypotheses
Next Run
Sources

The frontmatter is for machine-readable indexing. The body is for human judgment and downstream review.

Interpretation Heuristics

Treat experiment_summary.json as the canonical top-level snapshot.
Treat experiment.json plus the referenced experiment spec as the canonical source for config intent.
Treat next_run_candidates.json as the canonical source for ranked recommendations and high-loss snapshot summaries.
Use hypothesis_context.json when you need richer tag-level evidence or supporting context behind a recommendation.
Use run_matrix.csv to confirm stage status rather than inferring completion from one artifact alone.
If schema pass rate is materially higher than behavior pass rate, call out behavior reliability as the likely bottleneck instead of just "tool calling."

Output Discipline

Use short paragraphs and flat bullets.
Name concrete failure families or tags when the artifacts support them.
Include exact artifact paths in sources.
If the user asks for a comparison note, keep the same template and populate comparison_experiment_ids.

Bundled Resources

Template: assets/research_note_template.md
Artifact guide: reference/artifact-map.md

research-reporting

Research Reporting

Use This Skill When

Default Workflow

Lifecycle Modes

Reporting Rules

Note Shape

Interpretation Heuristics

Output Discipline

Bundled Resources

同仓库更多 Skills

同仓库更多 Skills

Research Reporting

Use This Skill When

Default Workflow

Lifecycle Modes

Reporting Rules

Note Shape

Interpretation Heuristics

Output Discipline

Bundled Resources