بنقرة واحدة
voice-video-producer
Turn the approved manga visual package into speech, subtitles, and a final motion-comic video.
التثبيت باستخدام Codex أو Claude انسخ هذا Prompt والصقه في Codex أو Claude أو مساعد آخر ليراجع صفحة Skill ويثبّتها لك.
القائمة
Turn the approved manga visual package into speech, subtitles, and a final motion-comic video.
التثبيت باستخدام Codex أو Claude انسخ هذا Prompt والصقه في Codex أو Claude أو مساعد آخر ليراجع صفحة Skill ويثبّتها لك.
Review character references, candidate kids coloring story assets, and colored references for story-image correspondence, recurring character consistency, approved-text-only surfaces, and printable readiness.
Create prompt packs, visual style guides, recurring character references, generated or edited line-art assets, paired colored reference assets, and image logs for printable kids coloring stories.
Assemble approved coloring story assets into print-ready black-and-white exports and separate colored reference exports, then validate the final files.
Design the child-facing concept, brief, source text index, and approved storyboard for printable coloring stories.
Review investigation-led article packages for understanding sufficiency, evidence support, thesis strength, structure, style fit, fidelity, platform fit, and publication readiness.
Produce a publication-ready article package from investigation-led understanding through outline, draft, and revision using the team's shared writing principles.
استنادا إلى تصنيف SOC المهني
| name | voice-video-producer |
| description | Turn the approved manga visual package into speech, subtitles, and a final motion-comic video. |
Use this skill to convert the approved manga package into a narrated video.
series-bible.mdcharacter-bible.mdcharacter-registry.mdseries-state.mdchapter-registry.mdchapters/<chapter-id>/chapter-plan.mdchapters/<chapter-id>/storyboard.mdchapters/<chapter-id>/continuity-ledger.mdchapters/<chapter-id>/visual-style-guide.mdchapters/<chapter-id>/prompt-pack.mdchapters/<chapter-id>/image-generation-log.mdchapters/<chapter-id>/visual-production-package.mdstoryboard.md is the operative source of truth for clip structure and on-screen story beats. chapter-plan.md is only supporting scope context once the storyboard exists.
chapters/<chapter-id>/voice-package.mdchapters/<chapter-id>/audio-generation-log.mdchapters/<chapter-id>/audio/clip01.*, chapters/<chapter-id>/audio/clip02.*, ...chapters/<chapter-id>/subtitles.srtchapters/<chapter-id>/video-package.mdchapters/<chapter-id>/chapter-carry-forward.mdchapters/<chapter-id>/final/manga-video.mp4Use:
chapters/<chapter-id>/voice-package.mdDecide whether the final piece is:
Before locking the chapter voice map, inspect the live supported models and voices when the runtime exposes them.
If the runtime provides list_audio_models, use it first and choose from the currently available voices instead of assuming an older palette still exists.
Start from the current voice mappings recorded in character-registry.md for recurring characters.
Use the stable voice persona or delivery-style anchors from character-bible.md to keep casting and performance behavior coherent even if the runtime voice id changes.
If a preferred recurring-character voice is unavailable in the current runtime, choose the nearest supported replacement and record that change explicitly for carry-forward sync.
For this team's generate_speech workflow, multi-speaker means at most two mapped speakers per call.
For manga-style motion comics, the working default is one visible beat, one speaker, and one short clip.
Then map the storyboard into audio clips. Use the storyboard audio beats as the primary segmentation source instead of inventing clip boundaries ad hoc at this stage.
For each clip, record:
Do not invent story action that the viewer cannot see or infer from the canon package.
Do not build the voice package from chapter-plan.md if the storyboard already says something more precise.
Do not invent narrator lines, spoken-text splits, or subtitle phrasing that the storyboard did not already define unless the storyboard is being formally revised upstream.
chapters/<chapter-id>/audio-generation-log.mdFor every material speech-generation call, record:
sleep 60 was completed before the next speech-tool callGenerate one speech clip per audio beat with the approved speech tool for the run, such as generate_speech or speak.
Speech generation must be serial-only. Treat the clip list as a queue, not a batch:
generate_speech or selected speak requestaudio-generation-log.mdsleep 60 before making any further speech-tool callFor each clip:
voice-package.mdSpeech prompt rules:
[pause], [laughs softly], [under breath], or [more controlled now] are valid when the speech tool supports expressive prompting.[camera pushes in].Examples:
Single-speaker spoken prompt:
[calm, dangerous] 我最不需要的是只会点头的人。 [short pause] 你继续待在原来的位置。 [lower voice] 我要你看清楚,谁被允许进入同一个房间。
Matching subtitle text:
我最不需要的是只会点头的人。你继续待在原来的位置。我要你看清楚,谁被允许进入同一个房间。
Multi-speaker spoken prompt only when one visible beat truly needs it:
Kurapika: [quiet, controlled] 下一位。
Borksen: [after a short pause] ……我是来参加讲习的。
Matching subtitle text:
下一位。……我是来参加讲习的。
Preferred split for most manga motion-comic beats:
- Clip A
Kurapika: [quiet, controlled] 下一位。
- Clip B
Borksen: [after a short pause] ……我是来参加讲习的。
If three voices would otherwise collide in one beat:
- split narrator into its own clip, or
- split the exchange into sequential two-speaker / one-speaker clips
instead of sending one three-speaker mapping request.
subtitles.srt from the final clip order and actual durations.visual-production-package.md before final subtitle burn and respect its declared asset lettering state and subtitle overlay guidance.video-package.md.Use ffmpeg for:
Default output should feel like a clean motion comic, not fake full animation.
When sync precision matters, prefer deterministic per-render-unit assembly over a raw slideshow concat path.
Normalize still images before concatenation so mixed asset dimensions do not silently distort timing or ordering.
If the visual package declares video-frame, confirm the assets are already full-bleed single-frame images at the locked series aspect ratio before assembly.
Do not hide white paper margins, page gutters, collage layouts, or source art that mismatches the locked series aspect ratio with casual padding. Route those assets back to manga_illustrator unless the user explicitly approved a separate export variant with a full crop/regeneration plan.
Do not stop at a successful export command.
After the final MP4 is written:
video-frame chapters with 30 render units or fewer, sample at least one midpoint timestamp per render unitpanel-first chapters with 30 render units or fewer, sample at least one midpoint timestamp per render unitIf the sampled frames do not match the intended assets, or the subtitle progression is visibly out of sync with the frame/panel progression, or the subtitle block is unreasonably large or too high for the frame, the export is invalid and must be rebuilt.
chapters/<chapter-id>/video-package.mdRecord:
video-frameffmpeg commands usedchapters/<chapter-id>/chapter-carry-forward.mdCapture the durable handoff for the next chapter:
Then route the full package back to manga_showrunner for series-level canon sync.
storyboard_director.manga_illustrator.video-frame and any approved asset looks like a manga page, panel grid, white paper sheet, collage, reference sheet, or wrong-aspect frame without an explicit crop plan, route it back to manga_illustrator.manga_illustrator before final burn decisions.generate_speech multi-speaker request with more than two mapped speakers. Split it first.