Jeden Skill in Manus ausführen
mit einem Klick

Jeden Skill in Manus mit einem Klick ausführen

$pwd:

parameter-golf-submission

Name: Parameter Golf Submission
Author: mkurman

// Prepare and validate Parameter Golf record folders: self-contained train_gpt.py, README.md, submission.json, FineWeb SP1024 BPB accounting, artifact-size logging, run logs, and PR-ready folder hygiene.

In Manus ausführen

$ git log --oneline --stat

stars:308

forks:24

updated:26. Mai 2026 um 06:30

SKILL.md

readonly

name	parameter-golf-submission
description	Prepare and validate Parameter Golf record folders: self-contained train_gpt.py, README.md, submission.json, FineWeb SP1024 BPB accounting, artifact-size logging, run logs, and PR-ready folder hygiene.
tags	["parameter-golf","competition","fineweb","bpb","model-craft","submission"]

Parameter Golf Submission

Use this skill when creating or reviewing a Parameter Golf submission folder, independent of the cloud provider used for the run.

Record Folder Contract

A submission folder must contain:

records/<track>/<submission-name>/
  README.md
  submission.json
  train_gpt.py
  train.log          # after a real run

train_gpt.py must compile and run from inside this folder in a clean Parameter Golf checkout.

Competition Constraints To Preserve

Artifact cap: 16,000,000 decimal bytes.
Training cap for leaderboard records: 10 minutes on 8xH100 SXM-class hardware.
Evaluation metric: FineWeb validation bits per byte (val_bpb).
No validation-set leakage. Test-time training may only use validation tokens already scored, if implemented.
No hidden downloads/network calls during evaluation.
No local repository imports unless included and counted in the record folder.
If tokenizer changes, prove BPB accounting carefully; stock SP1024 is safest for first participation.

Self-Contained Script Checklist

Before running:

python -m py_compile train_gpt.py passes.
Imports are standard/allowed environment packages only (torch, numpy, sentencepiece, etc.).
DATA_PATH and TOKENIZER_PATH are env-configurable.
Script loads fineweb_train_*.bin and fineweb_val_*.bin with the Parameter Golf binary header format.
Script computes validation BPB from SentencePiece byte accounting, not just token loss.
Script logs parameter count and artifact-size estimate.
Script writes a compressed artifact, usually final_model.int8.ptz.
Script reloads/dequantizes the compressed artifact and evaluates the round-trip model.
Final log includes final_int8_zlib_roundtrip_exact.

README Contents

The README must include:

short architecture summary
dataset/tokenizer used
exact command
run hardware and time budget
final metrics after run
artifact-size line after run
caveats if the run is smoke/non-record/pending verification

submission.json Contents

Use actual values after the run, not placeholders:

{
  "run_name": "...",
  "author": "...",
  "github_id": "...",
  "track": "track_10min_16mb or track_non_record_16mb",
  "val_bpb": 1.2345,
  "val_loss": 2.1234,
  "artifact_size_bytes": 12345678,
  "command": "...",
  "status": "completed"
}

Add architecture fields as useful, but avoid claiming record eligibility unless the log proves it.

Post-Run Extraction

After a run, extract these lines:

grep -E "final_int8_zlib_roundtrip_exact|Total submission size int8\+zlib|stopping_early|train_time|model_params" train.log

Update:

submission.json.val_bpb
submission.json.val_loss
submission.json.artifact_size_bytes
README metrics section

Status Labels

Use precise status:

prepared_pending_run: folder created, no real run yet
smoke_passed: short/non-final run passed
completed_non_record: full run but not leaderboard-valid or not SOTA
completed_record_candidate: 8xH100 10-minute compliant run with full log and artifact under cap
failed: include failure reason and last good checkpoint/log line

Common Failure Modes

Accidentally importing local model code (from src...) not present in record folder.
Forgetting to copy train.log from logs/<RUN_ID>.txt.
Reporting pre-quant BPB instead of int8 round-trip BPB.
Exceeding 16MB after counting code + compressed artifact.
Running on 1 GPU and calling it leaderboard-valid.
Using a custom tokenizer without exact byte accounting proof.

related-skills.json

gleiches Repository

runpod-parameter-golf.md

from "mkurman/zorai"

Run Parameter Golf competition submissions on RunPod GPU Pods. Covers required operator inputs, RunPod pod specs, FineWeb SP1024 data caching, record-folder hygiene, torchrun launch commands, monitoring, artifact-size checks, and result collection.

2026-05-26308

triton-kernel-programming.md

from "mkurman/zorai"

Hands-on implementation template and API reference for writing, tuning, debugging, and benchmarking Triton GPU kernels. Covers the full triton.language API surface, autotuning patterns, profiling workflows, and production integration.

2026-05-17308

triton-kernel-programming.md

Tencent AngelSlim — accessible, comprehensive, and efficient toolkit for large model compression. Quantization (FP8/INT4/NVFP4/1.25-bit), pruning, speculative decoding (Eagle3), and diffusion model compression.

2026-05-12308

distilqwen.md

from "mkurman/zorai"

DistilQwen2.5 — Alibaba's industrial practices for training distilled open lightweight language models. Knowledge distillation from Qwen2.5 72B into smaller 0.5B-7B models.

2026-05-12308

intel-neural-compressor.md

from "mkurman/zorai"

Intel Neural Compressor — SOTA low-bit LLM quantization (INT8/FP8/INT4/NVFP4), sparsity, pruning, and distillation for PyTorch, TensorFlow, and ONNX Runtime.

2026-05-12308

package.json

"author": "mkurman"

"repository": "mkurman/zorai"

GitHub-Repository öffnen Creator-Repositorys ansehen

$ install --global

$ download --local

In Manus ausführen

$ useful --forSOC

SoftwareentwicklerInformatik- und Mathematikberufe15-1252L4

name	parameter-golf-submission
description	Prepare and validate Parameter Golf record folders: self-contained train_gpt.py, README.md, submission.json, FineWeb SP1024 BPB accounting, artifact-size logging, run logs, and PR-ready folder hygiene.
tags	["parameter-golf","competition","fineweb","bpb","model-craft","submission"]

Parameter Golf Submission

Use this skill when creating or reviewing a Parameter Golf submission folder, independent of the cloud provider used for the run.

Record Folder Contract

A submission folder must contain:

records/<track>/<submission-name>/
  README.md
  submission.json
  train_gpt.py
  train.log          # after a real run

train_gpt.py must compile and run from inside this folder in a clean Parameter Golf checkout.

Competition Constraints To Preserve

Artifact cap: 16,000,000 decimal bytes.
Training cap for leaderboard records: 10 minutes on 8xH100 SXM-class hardware.
Evaluation metric: FineWeb validation bits per byte (val_bpb).
No validation-set leakage. Test-time training may only use validation tokens already scored, if implemented.
No hidden downloads/network calls during evaluation.
No local repository imports unless included and counted in the record folder.
If tokenizer changes, prove BPB accounting carefully; stock SP1024 is safest for first participation.

Self-Contained Script Checklist

Before running:

python -m py_compile train_gpt.py passes.
Imports are standard/allowed environment packages only (torch, numpy, sentencepiece, etc.).
DATA_PATH and TOKENIZER_PATH are env-configurable.
Script loads fineweb_train_*.bin and fineweb_val_*.bin with the Parameter Golf binary header format.
Script computes validation BPB from SentencePiece byte accounting, not just token loss.
Script logs parameter count and artifact-size estimate.
Script writes a compressed artifact, usually final_model.int8.ptz.
Script reloads/dequantizes the compressed artifact and evaluates the round-trip model.
Final log includes final_int8_zlib_roundtrip_exact.

README Contents

The README must include:

short architecture summary
dataset/tokenizer used
exact command
run hardware and time budget
final metrics after run
artifact-size line after run
caveats if the run is smoke/non-record/pending verification

submission.json Contents

Use actual values after the run, not placeholders:

{
  "run_name": "...",
  "author": "...",
  "github_id": "...",
  "track": "track_10min_16mb or track_non_record_16mb",
  "val_bpb": 1.2345,
  "val_loss": 2.1234,
  "artifact_size_bytes": 12345678,
  "command": "...",
  "status": "completed"
}

Add architecture fields as useful, but avoid claiming record eligibility unless the log proves it.

Post-Run Extraction

After a run, extract these lines:

grep -E "final_int8_zlib_roundtrip_exact|Total submission size int8\+zlib|stopping_early|train_time|model_params" train.log

Update:

submission.json.val_bpb
submission.json.val_loss
submission.json.artifact_size_bytes
README metrics section

Status Labels

Use precise status:

prepared_pending_run: folder created, no real run yet
smoke_passed: short/non-final run passed
completed_non_record: full run but not leaderboard-valid or not SOTA
completed_record_candidate: 8xH100 10-minute compliant run with full log and artifact under cap
failed: include failure reason and last good checkpoint/log line

Common Failure Modes

Accidentally importing local model code (from src...) not present in record folder.
Forgetting to copy train.log from logs/<RUN_ID>.txt.
Reporting pre-quant BPB instead of int8 round-trip BPB.
Exceeding 16MB after counting code + compressed artifact.
Running on 1 GPU and calling it leaderboard-valid.
Using a custom tokenizer without exact byte accounting proof.

parameter-golf-submission

Parameter Golf Submission

Record Folder Contract

Competition Constraints To Preserve

Self-Contained Script Checklist

README Contents

submission.json Contents

Post-Run Extraction

Status Labels

Common Failure Modes

Mehr aus diesem Repository

Mehr aus diesem Repository

Parameter Golf Submission

Record Folder Contract

Competition Constraints To Preserve

Self-Contained Script Checklist

README Contents

submission.json Contents

Post-Run Extraction

Status Labels

Common Failure Modes