| name | audio-storytelling |
| description | Audio storytelling skill used by the podcast scriptwriter and show note editor. Provides narrative structure, pacing, and sound design methodologies that maximize listener immersion in an audio-only medium. Use this skill's knowledge for 'audio narrative,' 'listener retention,' 'story arc,' 'sound design,' and similar requests. |
Audio Storytelling — Audio Narrative Design Methodology
Storytelling techniques unique to the audio medium, used by the scriptwriter and shownote-editor agents.
Why Audio Storytelling Is Different
Video is visual + auditory. Text is visual. Podcasts are purely auditory. This is not a limitation — it's an opportunity.
- Theater of the Mind: Listeners create their own visuals in their heads — vivid description is everything
- Intimacy: A medium that whispers directly into someone's ear through earbuds — it's a friend's conversation, not a formal speech
- Multitasking Competition: People listen while driving/exercising/doing chores — you need devices that recapture attention
Episode Narrative Structure: 5 Arc Patterns
1. Classic Three-Act Structure
- Best for: Character interviews, case studies
- Opening: Show the outcome first, then ask "How did we get here?"
- Middle: Specific scenes of failure/crisis
- End: Lesson + current status
2. Mystery Reveal Structure
- Best for: Investigative journalism, history, incident reconstruction
- Key: End each segment with "But then something unexpected happened"
- Rule: Plant clues fairly so listeners can theorize along the way
3. Parallel Comparison Structure
- Best for: Comparing two people/events/perspectives
- Key: Alternate between A and B, then converge at a surprising commonality/contrast
- Tip: Gradually shorten A and B segments → increasing crossover frequency → rising tension
4. List Frame Structure
- Best for: Educational, self-improvement, tip-sharing episodes
- Key: Not a simple enumeration — attach one story to each item
- Rule: Place the strongest items at position 3 (middle) and last (primacy + recency effect)
5. Conversational Exploration Structure
- Best for: Philosophical/social topics, panel discussions
- Key: The goal is not reaching a conclusion but showing the thinking process
- Tip: "There's no right answer — we're exploring this together"
Pacing Formula: BPM (Beats Per Minute) Model
In audio, a "beat" is a transition point in information units — a new fact, emotional shift, topic change, etc.
| Episode Type | Recommended BPM | Description |
|---|
| News/Information | 4–5 BPM | New info every 12–15 seconds |
| Interview/Conversation | 2–3 BPM | Perspective shift every 20–30 seconds |
| Storytelling | 1–2 BPM | Scene change every 30–60 seconds |
| Meditation/Reflection | 0.5–1 BPM | Generous use of space and silence |
BPM Shifting Rules
- Opening 30 seconds: High BPM (4–5) — hook fast
- Segment Intro: Medium BPM (2–3) — set context
- Segment Deep Dive: Low BPM (1–2) — in-depth exploration
- Transitions: High BPM (3–4) — reignite energy
- Climax: Variable — intentional silence (0 BPM) followed by burst (5+ BPM)
12 Audio Devices for Boosting Listener Retention
Narrative Devices
| # | Device | Usage | Effect |
|---|
| 1 | Open Loop | "The ending to this story — coming up shortly" | The brain tries to complete unfinished information (Zeigarnik effect) |
| 2 | Cold Open | Start with the most dramatic moment from mid-episode | "What is this?" curiosity trigger |
| 3 | Preview Teaser | "Today we cover X, and Y was particularly shocking" | Removes reasons not to listen |
| 4 | Callback | Reconnect something mentioned earlier to a later point | Narrative satisfaction |
| 5 | Cliffhanger | Leave a segment ending unresolved | Prevents drop-off before the next segment |
Vocal Performance Devices
| # | Device | Usage | Effect |
|---|
| 6 | Intentional Silence | 2–3 second pause after a shocking fact | Conveys the weight of the information |
| 7 | Speed Variation | Speak faster during tense parts, slower on key sentences | Rhythm and emphasis |
| 8 | Tone Shift | Explicit signal when shifting light → serious | "Let me get serious for a moment" |
| 9 | Direct Address | "You, listening with your earbuds right now" | Strengthens the 1:1 conversation illusion |
Structural Devices
| # | Device | Usage | Effect |
|---|
| 10 | Countdown | "The third reason is the most surprising" | Builds anticipation |
| 11 | Summary Anchor | One-sentence summary every 7–8 minutes | Recaptures listeners who drifted |
| 12 | Next Episode Preview | Reveal one key piece of next week's content at closing | Drives series listening |
Sound Design Direction Guide
Notation for sound cues in the script:
> [BGM: Gentle piano — reflective atmosphere]
> [SFX: Typewriter sounds — archival feel]
> [SILENCE: 3 seconds — emotional space]
> [MUSIC TRANSITION: Bright guitar → dark ambient]
> [INSERT: Narration with a news archive clip feel]
Recommended BGM Genres by Mood
| Mood | BGM Genre | Suggested Segment |
|---|
| Energy/Start | Upbeat indie pop, electronic | Opening, transitions |
| Exploration/Discovery | Minimal piano, ambient | Interview deep dives |
| Tension/Suspense | Low drone, dissonance | Mystery, crisis |
| Emotion/Reflection | Strings, acoustic guitar | Climax, closing |
| Humor/Light | Jazz, retro synth | Bridges, casual talk |
Script Density Formula by Duration
| Episode Length | Segment Count | Per-Segment Length | Transitions | CTA Placement |
|---|
| 15 min (short-form) | 2–3 | 4–5 min | 30 sec | After final segment |
| 30 min (standard) | 3–4 | 6–8 min | 30–60 sec | After Segment 2 + closing |
| 60 min (long-form) | 4–6 | 8–12 min | 60 sec | At 20 min/40 min + closing |
| 90 min+ (deep dive) | 5–8 | 10–15 min | 60–90 sec | At 25/50/75 min + closing |