mit einem Klick
p2v-phase-1-script
// Generate and validate video_script.jsonl from a specific paper URL or pasted paper content. Use this when running phase 1 of the paper-to-video pipeline.
// Generate and validate video_script.jsonl from a specific paper URL or pasted paper content. Use this when running phase 1 of the paper-to-video pipeline.
Generate and validate video_script.jsonl from a code repository (GitHub URL or local path). Use this when running phase 1 of the repo-to-video pipeline.
Reference for manimlib animation primitives, transform families, and timing patterns used with self.play().
| name | p2v-phase-1-script |
| description | Generate and validate video_script.jsonl from a specific paper URL or pasted paper content. Use this when running phase 1 of the paper-to-video pipeline. |
| metadata | {"short-description":"P2V phase 1 script generation"} |
Use this skill when the user wants phase 1 of paper-to-video: paper input to validated video_script.jsonl.
outputs/<video_id>-<timestamp>)docs/educational-video-pedagogy-framework.mddocs/00-system-contract.mdvideo_script.jsonl.video_script.jsonl in the run folder.uv run python -c "from pathlib import Path; from paper2video.contracts.io import validate_artifact; validate_artifact(Path('<video_script.jsonl>'), artifact_type='video_script'); print('video_script contract ok')"
<run_dir>/video_script.jsonlBefore writing the first record, the agent must do this internally:
Do not ask the user for these artifacts. Build them internally, then emit only video_script.jsonl.
Before drafting, assign a complexity tier using paper content:
tier_1 (simple conceptual paper): one main claim, light empirical evidencetier_2 (moderate): multiple claims, some formal or empirical detailtier_3 (dense empirical/mechanistic): many experiments/ablations and non-trivial mechanismtier_4 (very dense): tier_3 plus multiple interacting mechanisms or heavy formal loadUse this mapping for script depth:
tier_1: 700-1100 words (~5-8 min)tier_2: 1100-1700 words (~8-13 min)tier_3: 1700-2600 words (~13-20 min)tier_4: 2400-3600 words (~18-28 min)For ML empirical papers with broad ablations and mechanism discussion (like grokking-style papers), default to tier_3 unless there is strong evidence for tier_4.
If draft word count is below tier minimum, expand with:
The script must reflect expert-level understanding:
If the current draft feels generic, refine before finalizing.
narration_text must sound like an educational video, not a lecture outline:
Chapter 1, Chapter 2, Section, Lecture, In this chapterrecord_type=chapter, chapter_id) but keep spoken text natural.Now let’s test this on..., Next we inspect..., Here’s the key result...Keep the script teachable for video viewers (not only technically correct):
segment should deliver one primary teaching point plus at most one supporting point.about, roughly, on the order of in speech.