mit einem Klick
seedance-audio
// This skill should be used when the user asks for Seedance 2.0 audio, dialogue, lip-sync, music, sound effects, ambience, beat-sync, audio-reference mapping, desync troubleshooting, or sound-driven visual timing.
// This skill should be used when the user asks for Seedance 2.0 audio, dialogue, lip-sync, music, sound effects, ambience, beat-sync, audio-reference mapping, desync troubleshooting, or sound-driven visual timing.
This skill should be used when directing Seedance 2.0 T2V, I2V, V2V, R2V, audio, safety, or API work.
This skill should be used when a Seedance 2.0 prompt contains generic AI filler, hollow superlatives, vague cinematic language, bloated adjectives, weak verbs, or needs sharper production-specific wording.
This skill should be used when the user asks for camera movement, shot scale, lens feel, framing, one-take direction, dolly, pan, tilt, push-in, handheld, aerial, macro, or camera-transfer guidance for Seedance 2.0.
This skill should be used when the user asks for character consistency, character tags, identity lock, multi-character blocking, wardrobe continuity, hand safety, expression control, or likeness-sensitive character guidance.
This skill should be used when a Seedance 2.0 prompt mentions named characters, franchises, studios, celebrities, public figures, private people, brand logos, copyrighted scenes, songs, voices, or real-person likeness workflows and needs an IP-safe rewrite.
This skill should be used when the user asks for Chinese Seedance 2.0 examples, Chinese prompt patterns, example rewrites, or safe versions of working Chinese video-generation prompts.
| name | seedance-audio |
| description | This skill should be used when the user asks for Seedance 2.0 audio, dialogue, lip-sync, music, sound effects, ambience, beat-sync, audio-reference mapping, desync troubleshooting, or sound-driven visual timing. |
| license | MIT |
| user-invocable | true |
| tags | ["audio","lip-sync","dialogue","seedance-20"] |
| metadata | {"version":"5.4.5","updated":"2026-05-30","parent":"seedance-20","author":"Iamemily2050 (@iamemily2050)","repository":"https://github.com/Emily2040/seedance-2.0","openclaw":{"emoji":"🎬","homepage":"https://github.com/Emily2040/seedance-2.0"}} |
Use this for dialogue, lip-sync, sound layers, music, ambience, beat-sync, audio-reference mapping, desync troubleshooting, or sound-driven visual timing. Audio should support the visible beat instead of becoming a second competing prompt.
Load [ref:audio-guide] for detailed constraints, beat-sync, desync repair, audio-reference conflicts, and multi-character workarounds. Load [ref:audio-post-delivery] when the user needs stems, M&E, dubbing, loudness, sync, mix, or delivery guidance.
Keep dialogue short, quote spoken lines, and assign every line to a named speaker. Prefer locked or stable framing for lip-sync. Remove head-turning, large face motion, extreme camera moves, or busy hand gestures while mouth accuracy matters. Treat [Audio1] as a rhythm, pacing, mood, voice-tone, or ambience reference unless the active platform documents exact playback behavior.
Use compact layers: Dialogue: ... Sound: ... SFX: ... Music: ... Silence: .... Include only the layers that matter. Silence is valid when it sharpens drama or avoids confusing lip-sync.
| Need | Stable audio direction |
|---|---|
| Lip-sync | Character A, locked medium close-up, says "I found it." Clear dry dialogue, no head turn. |
| Product ad | Sound: low room tone. SFX: magnetic click on lid open, soft glass chime at final frame. |
| Beat sync | [Audio1] provides tempo only; light pulses and foot taps match the downbeat. |
| Drama | Distant rain and refrigerator hum; no music during the line. |
| Action | Breathing grows louder, shoe squeak at landing, metal door buzzer at endpoint. |
Use one speaker per short clip when reliability matters. If two characters must speak, separate turns and keep the camera stable: Character A says... pause. Character B answers.... For complex exchanges, recommend generating controlled single-speaker clips and compositing in post.
If dialogue desyncs, shorten the line, lock the camera, remove head turns, clean the audio role, and reduce competing SFX. If the wrong speaker talks, assign tags and split lines by speaker. If audio is ignored, remove extra music/SFX instructions and make the reference role explicit.
If audio and video references fight each other, mute the reference video before upload when possible, or make the priority explicit: [Video1] controls camera only; [Audio1] controls tempo and energy.
Return speaker map, quoted dialogue, sound layers, audio reference role, lip-sync constraints, post/delivery notes if needed, and a compact prompt-ready audio block.