Run any Skill in Manus with one click

$pwd:

presentation-builder

Name: Presentation Builder
Author: kpachhai

// End-to-end skill for creating professional Lunch & Learn style presentations. Produces a markdown transcript, PPTX slide deck, and DOCX presenter reference with diagrams. Supports iterative refinement, fact-checking via research sub-tasks, and optional cross-agent review for content accuracy. Works with any agent CLI (Claude, Codex, Gemini) as the primary session.

Run Skill in Manus

$ git log --oneline --stat

stars:0

forks:0

updated:May 17, 2026 at 05:24

SKILL.md

readonly

package.json

"author": "kpachhai"

"repository": "kpachhai/idea-forge"

View GitHub Repository

$ install --global

$ download --local

Run Skill in Manus

Run any Skill with one click

name	presentation-builder
description	End-to-end skill for creating professional Lunch & Learn style presentations. Produces a markdown transcript, PPTX slide deck, and DOCX presenter reference with diagrams. Supports iterative refinement, fact-checking via research sub-tasks, and optional cross-agent review for content accuracy. Works with any agent CLI (Claude, Codex, Gemini) as the primary session.
tools-needed	file read/write, shell execution, user interaction, parallel sub-tasks

Presentation Builder Skill

Foundation: This skill inherits the quality framework from skills/builders/builder-foundation/SKILL.md. Presentation-builder already has mature quality patterns (Phase 3.5 Humanizer, cross-agent review, iterative feedback). The foundation's Quality Dimensions (Depth, Originality, Coherence, Completeness) and weakness-weighted review tiers supplement the existing patterns.

Purpose

Create a complete, polished presentation package from a topic brief. The output is three artifacts:

Markdown transcript (<topic>-transcriptv<N>.md) — speaker script with slide-by-slide notes
PPTX slide deck (<topic>-presentationv<N>.pptx) — visual slides for screen sharing
DOCX presenter reference (<topic>-transcriptv<N>.docx) — formatted reading guide with diagrams

This skill codifies the workflow used to build the "DeFi Under The Hood" presentation (24 slides, 30 min + 10 min Q&A) and generalizes it for any technical topic.

When To Use

User asks to create a presentation, Lunch & Learn, talk, or lecture
User has a topic and audience in mind
Output should be presentation-ready (not just an outline)

Prerequisites

# In the working directory:
npm init -y
npm install pptxgenjs docx

Ensure package.json has "type": "commonjs" (both pptxgenjs and docx use CommonJS).

Workflow Overview

Phase 1: Define → Phase 2: Research & Plan Review (Cross-Agent) → Phase 3: Transcript → Phase 3.5: Humanizer Pass → Phase 4: PPTX → Phase 4.5: Deliverable Review (Cross-Agent) → Phase 5: DOCX → Phase 6: Final Review

Each phase produces a versioned artifact. Versions increment on user feedback (v1 → v2 → v3).

Cross-agent review is used for the ENTIRE lifecycle — both planning (Phase 2) and implementation (Phase 4.5). Claude, Codex, and Gemini should all participate to produce the most robust presentation, transcript, and code.

Phase 1: Define the Presentation

Collect these parameters before writing anything:

Parameter	Example	Required
Topic	"DeFi Under The Hood"	Yes
Audience	"DeFi beginners — assume zero crypto knowledge"	Yes
Duration	"30 minutes + 10 min Q&A"	Yes
Presenter	""	Yes
Slide count	24	Yes
Date/event	"February 2026 \| Lunch & Learn"	Yes
Tone	Conversational, humorous, analogy-heavy	Yes
Key constraints	"No financial advice disclaimer", "Must mention Hedera"	Optional

Audience Calibration

The audience level drives everything:

Audience	Language	Analogies	Jargon	Additional Info
Zero knowledge	Simple, everyday	Heavy (vending machines, Legos)	Define every term	Detailed for Q&A prep
Some knowledge	Mix of simple + technical	Moderate	Define new terms only	Medium depth
Technical	Direct, precise	Sparse	Assume familiarity	Deep technical details

Timing Budget

Allocate time per slide based on content density:

Content Type	Time per Slide
Title / intro	0:30
Concept introduction	1:00 - 1:30
Technical deep-dive	1:30 - 2:00
Comparison / walkthrough	1:15 - 1:45
Summary / takeaways	1:00 - 1:15
Q&A / resources	Remaining time

Total must equal stated duration. Build the timing table before writing any transcript.

Phase 2: Research

Specialized Critic Roles

Each counterpart agent has a specialized focus area that plays to its strengths. Use role-specific prompts instead of generic review prompts.

Codex — Fact Verification & References

Codex is the fact-checker. Use it to verify every claim, URL, code snippet, and technical detail.

Fact-check the following presentation content. For each claim, provide:
  1. Verification status: VERIFIED / OUTDATED / INCORRECT / UNVERIFIABLE
  2. Source URL confirming or correcting the claim
  3. Date of source data
  4. Any version-specific concerns

Also check:
  - All code snippets for syntax correctness
  - All CLI commands for correct flags and syntax
  - All system contract addresses against current documentation
  - All URLs for liveness (LIVE / DEAD / REDIRECT)
  Flag anything outdated, disputed, or missing a reference.

Gemini — Humor, Engagement & Visual Design

Gemini is the engagement and visual specialist. Use it for humor, audience interaction, analogies, transitions, and image/diagram prompt refinement.

For a [duration] presentation on [topic] aimed at [audience]:
  1. Generate 2-3 jokes/funny lines PER SECTION that match the stated tone
  2. Generate 10+ analogies that explain technical concepts using everyday objects
  3. Generate transition lines between major sections
  4. Identify engagement dead zones (3+ slides without humor or strong hooks)
  6. Refine all Image Prompt fields for better visual output:
     - Better composition and focal points
     - Color harmony with the design system
     - Diagram layout improvements and visual hierarchy
  7. Suggest speaker note improvements for natural delivery cadence

Both critics run in parallel with per-agent timeouts. Codex gets 480s (8 min) because it does 20-30+ web searches per review and frequently falls back from WebSocket to HTTPS transport. Gemini gets 180s (3 min). If Codex times out, fall back to a general-purpose Claude sub-agent with web search for fact verification.

Codex output feeds into the Fact Bank and code verification. Gemini output feeds into transcript writing, speaker notes, and image generation.

Cross-Agent Plan Review

For presentations with high accuracy requirements (financial data, regulatory claims, technical specifications), run the cross-agent plan review loop with specialized prompts:

Draft the slide outline as a "plan"
Run Codex with the fact-verification prompt (see skills/presentations/presentation-planner/SKILL.md for full template)
Run Gemini with the engagement/visual prompt (see skills/presentations/presentation-planner/SKILL.md for full template)
Merge, triage, and iterate (max 3 rounds)

This is opt-in. Ask the user before running it. Cost scales with critic count.

Phase 3: Transcript (Markdown)

File: `<topic>-transcriptv<N>.md`

Structure for each slide:

---

## SLIDE <N> - <Title> (<start> - <end>)

**[On screen: <description of what the audience sees on the slide>]**

<Transcript line 1 — natural speaking style, not read-word-for-word>

<Transcript line 2 — **bold** for emphasis, _(italics)_ for stage directions>

<Transcript line 3>

**[Advance]**

### Additional Info — Slide <N>
- **Term/concept 1**: Deep explanation at implementation level — what it is, how it
  works internally, why it matters. For technical talks: function signatures, API
  endpoints, protocol internals, error codes. For strategic talks: market data with
  sources, competitive details, regulatory specifics. The presenter should be able
  to answer any follow-up question from this section alone. 2-4 sentences per bullet.
- **Term/concept 2**: Same depth. Every concept the slide mentions gets its own
  bullet with enough detail to satisfy the most skeptical audience member.
- **Common question**: "Why does X happen?" — prepared answer with specifics
- **Edge case**: What breaks, what's unsupported, what's surprising
- ...8-15 bullets per content slide, each with genuine depth

### Diagrams (on slides with visual/flow/architecture content)

**<Diagram title>:**

<Multi-paragraph prose description explaining every element of the diagram.
Each box, arrow, and flow should be explained — what it represents, why it's
there, and how it connects to the concepts being discussed. Include anticipated
questions inline: "If someone asks 'why does X route through Y?' — the answer
is..." This section should be 2-5 paragraphs, rich enough that someone who
cannot see the diagram still understands the full picture.>

` ` `mermaid
<Mermaid diagram code — single-line labels only, no \n or <br/>>
` ` `

Note: Replace ` ` ` above with actual triple backticks (shown spaced to avoid markdown parsing issues in this skill file).

Presentation Type Adaptation

Before writing the transcript, check the plan's North Star for the declared Presentation Type (Executive / Technical / Community / Educational). Adapt the transcript accordingly:

Executive: No jargon in spoken text. Define every domain term inline on first use. Keep transcript paragraphs short (2-3 sentences). Numbers and outcomes over mechanisms. Revenue/business model slides come AFTER the solution is understood. Never explain implementation details in the spoken transcript - put them in Additional Info only.
Technical: Use precise terms. Include code snippets, formulas, and specific numbers in the spoken text. The audience expects depth - do not simplify away the interesting parts. Show your work.
Community: Conversational tone. Balance accessibility with substance. Show how the audience can participate or contribute. Avoid marketing language - community audiences detect it instantly.
Educational: Define EVERY term. Use analogies for every concept. Build knowledge incrementally - do not reference concepts before they are explained. Check: could someone with zero domain knowledge follow slide N without having read a separate glossary?

If no type is declared, default to Executive (the most constrained profile - it is easier to add depth than to remove it).

Transcript Writing Rules

Write how you talk, not how you write. Short sentences. Sentence fragments are fine. Conversational tone.
One idea per paragraph. Each > block is one thought.
Bold the punchlines. The key insight in each section should be bold.
Stage directions in italics. Use sparingly — _(pause)_, _(smile)_ for key moments. Avoid excessive _(gesture left)_, _(point to chart)_ — trust the presenter to know where to gesture.
Analogies anchor every concept. If you can't explain it with an analogy, simplify further.
Humor is earned. One or two jokes per slide max. Self-deprecating > punching down.
Transitions are explicit. Don't just advance — tell the audience where you're going.
"Additional Info" sections are deeply detailed and encyclopedic. These are NOT surface-level summaries. They are the presenter's lifeline during Q&A. Every concept mentioned on the slide or in the transcript must be explained at depth appropriate to the topic. For technical presentations: include function signatures, API endpoints, protocol internals, error codes, configuration specifics, and implementation details. For strategic presentations: include market data with sources, competitive analysis, regulatory details, and financial assumptions. The rule: if the slide mentions a concept, the Additional Info must explain it thoroughly enough that the presenter can answer any follow-up question without external research. Use **bold prefix**: for scannable headers. Aim for 8-15 bullets per content slide, each with 2-4 sentences of real depth.
No redundant explanations across slides. Each concept is explained fully ONCE at its primary slide. Later slides reference it briefly ("as we saw earlier") but never re-explain the full mechanism. This applies to: technical flows, error descriptions, key phrases/mottos, and analogies. Review all speaker notes as a single narrative — if a paragraph re-teaches something already covered, cut it. The audience heard it the first time; repetition wastes time and dilutes impact.
No-repetition rule. Do not repeat the same concept, phrase, or explanation more than twice in the entire transcript. Once is ideal, twice maximum if warranted (e.g., an introduction and a recap). Track key phrases across all slides — if you've already explained something (like how a technology works or why something breaks), reference it later with a short callback ("as we saw earlier" or "remember the error from slide 2") instead of re-explaining. This prevents the presentation from dragging. Specifically watch for: technical error descriptions, architecture explanations, taglines/catchphrases, and "how X works" chains.
Mixed-audience language rule. Write for a room that has both technical and non-technical people. Default to simple, concrete language. When introducing a technical concept, use a one-sentence plain-English explanation FIRST, then optionally add the technical term in parentheses. Avoid jargon cascades (chains of 3+ technical terms in one sentence). Put deep technical details in Additional Info or speaker notes, not in the spoken transcript. Examples: "the EVM's error code" not "the INVALID opcode (0xfe) defined in the Yellow Paper, Appendix H"; "shells out to fetch data from the API" not "uses FFI to spawn a subprocess that executes curl against the REST API endpoint."
Executive structure rule. For leadership/decision-maker audiences, follow the thread: Problem -> Solution -> Why Us -> Outcome -> How We Make Money -> Risks -> Ask. The business model and revenue slides should come AFTER the audience understands what the product IS and why it matters. Explaining monetization too early distracts from the value proposition. The audience needs to be convinced the product solves a real problem before they care how it generates revenue.
Follow-the-thread rule. The presentation should feel like one continuous thread moving forward, not a series of loops back to earlier content. If the audience needs context from an earlier slide, use a one-sentence callback ("remember the base pool from earlier"), not a re-explanation. Each slide should advance the narrative, not revisit it. Test: read all speaker notes consecutively - if any paragraph sounds like "going back to" or re-explains a mechanism already covered, cut it or replace with a brief callback.

Natural Delivery Rules (Critical)

The transcript should sound like a real human talking to a room, not an AI reading slide content out loud. Follow these rules strictly:

Anti-Patterns — NEVER Do These

Never enumerate slide content. If the slide shows three bullet points, do NOT say "Number one... Number two... Number three..." or "First... Second... Third..." The audience can read. Your job is to add context, stories, and color around what's on screen — not narrate it like a teleprompter.
- BAD: "Three things make DeFi work. Permissionless — no application forms. Transparent — every transaction is auditable. Composable — protocols plug into each other."
- GOOD: "What makes it actually work? Well, it's permissionless — meaning no application forms, no credit checks. Your wallet is your ID, full stop. It's transparent — every line of code is public. Imagine if your bank had to publish every decision it ever made — they'd shut down voluntarily. And it's composable — this one's the sleeper. We'll come back to it because it's honestly the most powerful idea in all of DeFi."
Never use uniform paragraph structure across slides. If every slide follows the same pattern (introduce concept → list items → bold conclusion), the presentation sounds monotonous. Vary the rhythm — some slides lead with a story, some with a question, some with a surprising fact.
Never describe the slide layout as a script. Avoid "As you can see on the left side of the slide..." or "The flow diagram at the bottom shows..." unless the visual is genuinely confusing. The audience has eyes. Instead, let your words complement what's on screen, not describe it.
Never make every sentence informational. Real speakers have "breathing room" lines — throwaway comments, personal reactions, rhetorical asides. These are essential for pacing. A presentation that's 100% information is exhausting to listen to.

Positive Patterns — ALWAYS Do These

Weave in tidbits from Additional Info. Pull 1-2 small facts, origin stories, or "did you know" moments from the Additional Info section into the spoken transcript. This adds texture that the slide alone doesn't provide.
- Example: "Fun fact — this formula was first sketched out by Vitalik Buterin in a Reddit post in 2017. A Reddit post! That's how some of the most important financial infrastructure in the world got started."
Use conversational bridges. Real people use connective phrases: "Here's the thing...", "What's interesting is...", "So picture this...", "I'll be honest...", "This blew my mind when I first learned it...", "And here's the part that trips people up..."
Add personal reactions. The presenter should react to their own content as a human would — "When I first learned about this, I spent two hours staring at the screen going 'wait, that can't be right.'" This builds rapport and signals authenticity.
Vary sentence rhythm aggressively. Mix long flowing explanatory sentences with punchy short ones. A paragraph with five same-length sentences sounds robotic. Interrupt yourself. Use dashes. Drop a one-word sentence for emphasis. Like that.
Let the slide do its job. If the slide shows a comparison table, don't read both columns. Pick one side and tell a story about it. The audience will read the other column themselves.
Connect ideas across slides. Reference earlier concepts: "Remember the vending machine from earlier? Same idea here, but now..." This makes the presentation feel like a coherent story instead of 24 disconnected slides.
Include at least one "off-script" moment per 3-4 slides. A brief aside, a personal anecdote, a "just between us" moment. These are what make a talk feel human rather than rehearsed. They can be planned — the audience won't know.

Timing Markers

Every slide header includes timing: (start - end). The sum of all slide durations must equal the stated presentation duration. Q&A time is separate and listed on the final slide.

Versioning

v1: First draft from outline + research
v2: After user feedback (language, coverage, timing adjustments)
v3: After second feedback round (simplified language, more analogies, fact corrections)

Only keep the latest version. Delete previous versions when a new one is created (e.g., delete transcriptv1.md when transcriptv2.md is written).

Phase 3.5: Humanizer Pass (Mandatory)

After completing the transcript draft, run it through the humanizer skill (skills/humanizer/SKILL.md). This is a mandatory step, not optional.

What the Humanizer Does

Applies 24 pattern-detection rules to remove AI writing tells:

Significance inflation ("serves as", "testament to", "pivotal moment")
Copula avoidance ("serves as" -> "is")
Promotional language ("groundbreaking", "vibrant", "showcasing")
Rule-of-three forcing
Em-dash overuse
Filler phrases ("In order to", "It is important to note that")
Generic positive conclusions
Uniform paragraph structure

Process

Re-read the full transcript after completing the draft
Apply all 24 humanizer patterns systematically
Apply the 7 presentation-specific rules from the humanizer skill
Run the anti-AI audit: "What makes this obviously AI generated?" - list remaining tells
Fix remaining tells
Update the transcript file in place

When to Skip

Never. The humanizer pass is always applied to transcripts. The only content exempted is structured data: tables, Mermaid diagrams, Fact Bank entries, and appendix reference material.

Phase 3.7: Shared Transcript Parser (Drift Prevention)

File: `workspace/<project>/transcript-parser.js`

The PPTX speaker notes (Phase 4) and the DOCX presenter reference (Phase 5) BOTH need the same slide-by-slide spoken content from the transcript. Hardcoding speaker-note strings inside generate-pptx.js creates a drift mode: every transcript edit (Humanizer Phase 3.5 polish, Phase 4.5 cross-agent fixes, last-minute corrections) updates the DOCX at next regenerate but leaves PPTX speaker notes pointing at stale text. The presenter ends up showing a deck whose notes contradict the printed reference.

Fix: both generators import a small shared parser that reads transcript-v<N>.md at generation time. One source of truth, zero manual sync. The parser is dependency-free (just fs) and lives next to the two generators in the project workspace.

Parser Template

Write workspace/<project>/transcript-parser.js once per project. The template below is the verbatim reference shipped with the H3 context-engineering talk (workspace/context-engineering-talk/transcript-parser.js):

// Shared transcript parser for presenter bundles.
// Consumed by generate-pptx.js (speaker notes) and generate-docx.js (presenter reference).
// One source of truth (transcript-v<N>.md) eliminates the manual-sync drift mode
// where hardcoded speaker notes diverge from the transcript as the talk gets edited.
//
// Exports parseTranscript(transcriptPath, opts?) -> { slides, qaItems }.
//   slides[i] = { num, title, timing, onScreen, spokenParas, infoBullets, diagrams }
//   qaItems[i] = { question, answer }
//
// opts.stripStageDirections (default false): when true, removes _(pause)_-style
// italic stage directions entirely. PPTX speaker notes use true (clean prose for
// presenter view); DOCX presenter reference uses false (preserves cues in print).

const fs = require("fs");

function clean(text, opts = {}) {
  const stripStage = !!opts.stripStageDirections;
  let out = text
    .replace(/\*\*\[On screen: ([\s\S]+?)\]\*\*/g, "")
    .replace(/\*\*\[Advance\]\*\*/g, "")
    .replace(/\*\*\[Advance to Q&A\]\*\*/g, "")
    .replace(/\*\*(.+?)\*\*/g, "$1")
    .replace(/(?<!\*)\*([^*\n]+?)\*(?!\*)/g, "$1");

  if (stripStage) {
    out = out
      .replace(/\s*_\([^)]+\)_\s*/g, " ")
      .replace(/[ \t]+/g, " ");
  } else {
    out = out.replace(/_(\([^)]+\))_/g, "$1");
  }

  out = out
    .replace(/_(\w[^_\n]*?)_/g, "$1")
    .replace(/`([^`]+)`/g, "$1")
    .trim();

  return out;
}

function parseTranscript(transcriptPath, opts = {}) {
  const transcript = fs.readFileSync(transcriptPath, "utf-8");
  const slideRegex = /^## SLIDE (\d+) - (.+?) \(([^)]+)\)\s*$/m;
  const sections = transcript.split(/^---\s*$/m);

  const slides = [];
  sections.forEach(section => {
    const headerMatch = section.match(slideRegex);
    if (!headerMatch) return;
    const [, num, title, timing] = headerMatch;

    const onScreenMatch = section.match(/\*\*\[On screen:\s*([\s\S]+?)\]\*\*/);
    const onScreen = onScreenMatch ? onScreenMatch[1].trim().replace(/\s+/g, " ") : "";

    const spokenMatch = section.match(/\*\*\[On screen:[\s\S]+?\]\*\*\s*([\s\S]*?)\s*\*\*\[Advance/);
    const spokenRaw = spokenMatch ? spokenMatch[1] : "";
    const spokenParas = spokenRaw
      .split(/\n\n+/)
      .map(p => clean(p, opts))
      .filter(p => p.length > 0 && !p.startsWith("###"));

    const infoMatch = section.match(/### Additional Info - Slide \d+\s*\n([\s\S]+?)(?=\n---|\n## |\n### |$)/);
    let infoBullets = [];
    if (infoMatch) {
      infoBullets = infoMatch[1]
        .split(/\n- /)
        .map(line => line.replace(/^- /, "").trim())
        .filter(line => line.length > 0)
        .map(line => clean(line, opts));
    }

    const diagMatch = section.match(/### Diagrams - Slide \d+\s*\n([\s\S]+?)(?=\n### Additional Info|\n---|\n## |$)/);
    const diagrams = diagMatch ? diagMatch[1].trim() : null;

    slides.push({
      num: parseInt(num, 10),
      title, timing, onScreen, spokenParas, infoBullets, diagrams,
    });
  });

  slides.sort((a, b) => a.num - b.num);

  const qaMatch = transcript.match(/## Anticipated Q&A \(Presenter Prep\)\s*\n([\s\S]+?)(?=\n## |$)/);
  const qaText = qaMatch ? qaMatch[1] : "";
  const qaItems = [];
  if (qaText) {
    const qBlocks = qaText.split(/\n### Q: /).slice(1);
    qBlocks.forEach(block => {
      const lines = block.split("\n");
      const question = lines[0].replace(/^"|"$/g, "").trim();
      const answer = lines.slice(1).join("\n").trim();
      qaItems.push({ question, answer: clean(answer, opts) });
    });
  }

  return { slides, qaItems };
}

module.exports = { parseTranscript, clean };

Two Cleaning Modes (Same Source, Surface-Appropriate Output)

PPTX speaker notes show in PowerPoint's presenter view at delivery time. Pass stripStageDirections: true so stage cues like _(pause)_ get removed - cleanest possible prose, zero visual noise during the talk.
DOCX presenter reference is the read-through artifact (paper or screen) used for rehearsal. Use the default (stripStageDirections: false) so stage cues stay as (pause) text - they remind the presenter where to pause, slow down, or beat for emphasis during read-aloud.

Same transcript, surface-appropriate cleaning. Both surfaces auto-update from a transcript edit at next regenerate.

Markdown Conventions the Parser Expects

The parser assumes the markdown transcript follows the structure documented in Phase 3:

Slide blocks separated by --- on its own line
Each block opens with ## SLIDE <N> - <Title> (<timing>)
**[On screen: ...]** line for the visual cue (parser strips it from spoken output)
Spoken prose between the on-screen line and **[Advance]** / **[Advance to Q&A]**
Stage directions as _(pause)_, _(beat)_, etc. (italic markdown, parens inside)
### Additional Info - Slide <N> bullet list for presenter-reference depth
### Diagrams - Slide <N> for optional diagram description block
## Anticipated Q&A (Presenter Prep) appendix with ### Q: "<question>" blocks

Deviating from these conventions breaks the parser silently (empty spokenParas, missing Q&A). If a project needs a different transcript shape, fork the parser into the project. Do NOT generalize this skill's reference template to support multiple shapes - the multi-format-support tax compounds fast and degrades the failure-loud behavior the silent-deviation rule depends on.

Sync-Divergence Discipline When Adopting On An Existing Deck

When retrofitting an existing deck whose generate-pptx.js has hardcoded speaker notes, the runtime-parsed notes may differ from the hardcoded ones in subtle ways (most commonly: stage directions present in transcript but absent from hand-polished hardcoded notes; whitespace collapse; bold/italic markdown stripping). Before swapping the source of truth, run a one-slide sample diff (e.g., spot-check slide 1 + slide N/2 + last slide) so the user knows whether speaker-note content will change at next regenerate. If divergence is purely stage-direction stripping (covered by stripStageDirections: true), the swap is safe. Other divergences warrant naming explicitly before commit.

Phase 4: PPTX Slide Deck

File: `workspace/<project>/generate-pptx.js` → `workspace/<project>/<slug>-presentation-v<N>.pptx`

Use pptxgenjs to generate visual slides programmatically.

Design System

Read DESIGN.md at the idea-forge repo root before defining any palette. It is the single source of truth for idea-forge's visual identity (PPTX dark palette, DOCX light palette, typography, components, do's and don'ts). Do not invent a new color set; copy the relevant tokens from DESIGN.md into the project's generate-pptx.js.

For PPTX specifically, the palette to use is the dark / presentation mode from DESIGN.md. The minimum tokens to copy:

// Copy from DESIGN.md (presentation palette - dark mode)
const C = {
  bgDark:    "0B1121",   // surfaceDark
  bgMid:     "141B2D",   // surfaceMid
  bgCard:    "1A1F2E",   // surfaceCard
  purple:    "8259EF",   // primary
  teal:      "00E5CC",   // accent
  warn:      "FF6B6B",   // semantic.warn (PPTX variant)
  white:     "FFFFFF",   // textOnDark
  body:      "D1D5DB",   // textOnDarkBody
  dim:       "666666",   // muted text
};

Conventions inherited from DESIGN.md:

Dark backgrounds for projector visibility (white-on-bgDark exceeds WCAG AAA).
One accent color per slide max - teal is the visual hook, do not stack accents.
Sharp-edged rectangles only - never rounded corners.
Typography: Arial Black for titles only (32pt+); Calibri for body (14-18pt at projection distance); Consolas for code.

If a project's tone genuinely calls for a different palette (e.g., a brand-required color not in idea-forge's identity), document the deviation in the project's North Star and capture the chosen palette in the project's own north-star.md rather than mutating DESIGN.md. DESIGN.md describes idea-forge's identity, not per-project overrides.

Slide Types

Build helper functions for each slide type:

Type	Use For	Key Elements
Title slide	Opening/closing	Large title, subtitle, gradient background
Concept slide	Introducing ideas	Title, 2-3 cards with icons, supporting text
Comparison slide	A vs B	Two-column layout, colored headers
Flow/timeline	Process, evolution	Numbered steps with arrows
Data table	Numbers, comparisons	Styled table with header row
Formula/code	Technical detail	Monospace font, highlighted box
Callout slide	Key stat or warning	Large number/emoji, minimal text
Diagram slide	Architecture, flow, relationships	Native pptxgenjs shapes (rectangles, lines, arrows)

Diagram Approach: Native Shapes Default, Mermaid for Markdown Only

Two separate diagram surfaces exist in this skill, with different defaults:

Surface	Renderer	Default	Why
Markdown transcript (`<slug>-transcript-v<N>.md`)	Notion / GitHub / markdown viewer	Mermaid	The consumer's tool already renders Mermaid. Code blocks are diff-friendly. Authoring is text-only.
PPTX deck (`<slug>-presentation-v<N>.pptx`)	PowerPoint / Keynote / Google Slides	Native `pptxgenjs` shapes	The renderer does NOT speak Mermaid. Producing in-deck Mermaid requires `mmdc` (Mermaid CLI) to render to PNG, then embedding the PNG - an extra runtime dependency rarely installed on Mac dev machines (`brew install mermaid-cli` is opt-in). Native shapes are deterministic, scriptable in the same JS file, version-controlled as code, and have zero external dependencies.

Use native shapes for PPTX diagrams. Compose flow / architecture / relationship diagrams from slide.addShape(pres.shapes.RECTANGLE, ...), pres.shapes.LINE, and pres.shapes.RIGHT_TRIANGLE (for arrowheads). The pattern shipped in workspace/context-engineering-talk/generate-pptx.js (Slide 4 - five slot boxes in three group containers with central question node + connecting lines) is the reference - check it before writing your own.

When Mermaid in PPTX is acceptable:

High-complexity diagrams (10+ nodes, multiple interconnections) where native-shape positioning math becomes more code than the Mermaid source.
Diagrams already authored as Mermaid for the markdown transcript and the visual is identical (avoids duplication).

In those cases, install mmdc (npm install -g @mermaid-js/mermaid-cli), render to PNG out-of-band, and embed via slide.addImage(...). The Visual Self-QA gate's primary path (or its degraded XML-grep variant) MUST run on PNG-embedded diagrams - PNG renders are an opaque blob and structural grep cannot validate them at all.

Keep Mermaid for the markdown transcript regardless. The transcript's diagram section serves the presenter as reference material and renders inline in Notion or any markdown viewer. The PPTX shape diagram and the transcript Mermaid diagram describe the same content from different surfaces; both can exist for the same slide.

Helper Function Pattern

function card(slide, x, y, w, h, opts = {}) {
  slide.addShape(pres.shapes.RECTANGLE, {
    x, y, w, h,
    fill: { color: opts.fill || C.bgCard },
    shadow: { type: "outer", blur: 6, offset: 2, color: "000000", opacity: 0.2 },
    rectRadius: opts.radius || 0.1,
  });
}

function titleText(slide, text, opts = {}) {
  slide.addText(text, {
    x: 0.5, y: 0.3, w: 9, h: 0.6,
    fontSize: 32, fontFace: "Arial Black", color: C.white, bold: true,
    ...opts,
  });
}

Speaker Notes via Shared Parser (No Hardcoded Strings)

Speaker notes MUST be loaded from transcript-v<N>.md at generation time via the shared parser from Phase 3.7. Do NOT inline note strings inside generate-pptx.js - hardcoded notes are the drift mode Phase 3.7 exists to prevent.

At the top of generate-pptx.js:

const path = require("path");
const pptxgen = require("pptxgenjs");
const { parseTranscript } = require("./transcript-parser");

const { slides: transcriptSlides } = parseTranscript(
  path.join(__dirname, "transcript-v1.md"),
  { stripStageDirections: true },
);
const notesBySlide = Object.fromEntries(
  transcriptSlides.map(s => [s.num, s.spokenParas.join("\n\n")]),
);

At each slide block:

{
  let s = pres.addSlide();
  // ...visual content...
  s.addNotes(notesBySlide[N]);  // N matches the slide's number in the transcript
}

The slide-number key (N) matches ## SLIDE <N> in the transcript markdown; the parser indexes by that number so slide order in source code can differ from slide order in the deck without breaking notes (the parser sorts by num internally).

If a transcript slide is missing its corresponding s.addNotes(notesBySlide[N]), the Visual Self-QA degraded-mode notesSlide*.xml count check (Phase 4, below) catches it - that's why "Speaker notes present per slide" is a structural-check pass condition.

Execution

node workspace/<project>/generate-pptx.js
# Produces: workspace/<project>/<slug>-presentation-v<N>.pptx

Visual Self-QA Checkpoint (Mandatory Before Declaring Build Done)

This is a Completion Gate item per builder-foundation/SKILL.md Section 4 (Visual Honesty, Binary Outputs). The agent writes JS code that emits a binary PPTX it never sees. Without this checkpoint, visible bugs (Mermaid \n line-break renders, autofit collapse, hex/RGBA mismatch, font overflow) surface only when the user opens the deck. This step makes the agent the first reader of its own output.

Procedure (Primary Path - requires soffice CLI):

Convert the generated PPTX to PDF:

python scripts/office/soffice.py --headless --convert-to pdf workspace/<project>/<slug>-presentation-v<N>.pptx

Convert at least 3 representative slides (title, one mid-deck content slide containing a diagram, conclusion) to JPEG:
```
pdftoppm -jpeg -r 150 -f <N> -l <N> workspace/<project>/<slug>-presentation-v<N>.pdf workspace/<project>/qa-slide
```
Read each generated qa-slide-*.jpg via Claude Code's image-reading capability.
Verify per slide:
- Diagrams render fully (no \n literal text appearing; labels not cut off; native shapes positioned correctly)
- No text overflow, no autofit collapse, no overlapping shapes
- Colors match the plan (hex format applied correctly)
- Font choices and sizes match the template
If any visible issue is found, fix the generator script and re-render before proceeding.
Optional but recommended: run the same check on every slide for high-stakes presentations.
After the build is approved, delete the temporary qa-slide-*.jpg files (they are throwaway QA artifacts).

Degraded Mode (when soffice is unavailable):

The primary path requires LibreOffice's soffice CLI for PPTX -> PDF conversion. On machines without it (typical macOS dev machines unless brew install --cask libreoffice ran), the gate cannot render visually. Do NOT skip silently - run the XML-grep fallback below, then explicitly mark visual rendering as NOT verified in the completion summary. This catches structural breakage (missing slides, dropped speaker notes, lost titles, missing diagram labels) but does NOT catch visual failure modes the gate exists for: autofit collapse, hex/RGBA mismatch, font overflow, color clash, label cutoff, shape positioning bugs. Treat the gate as PARTIALLY PASSED.

Probe for soffice first: command -v soffice >/dev/null && echo "primary" || echo "degraded". If degraded, proceed below.

Unpack the PPTX (it is a zip) and run structural checks:

unzip -o workspace/<project>/<slug>-presentation-v<N>.pptx -d /tmp/pptx-qa-<slug>
# Slide count matches plan:
ls /tmp/pptx-qa-<slug>/ppt/slides/slide*.xml | wc -l
# Speaker notes present per slide:
ls /tmp/pptx-qa-<slug>/ppt/notesSlides/notesSlide*.xml | wc -l
# No literal \n leaking into diagram labels (Mermaid-era failure mode):
grep -l '\\n' /tmp/pptx-qa-<slug>/ppt/slides/slide*.xml || echo "OK no literal \\n"
# Each slide title appears in its XML (sanity check on title rendering):
for n in $(seq 1 <SLIDE_COUNT>); do
  grep -q "<expected-title-substring-for-slide-$n>" /tmp/pptx-qa-<slug>/ppt/slides/slide${n}.xml \
    && echo "slide $n title OK" || echo "slide $n title MISSING"
done
# DESIGN.md hex tokens appear at expected frequency (color application sanity):
grep -o 'val="[0-9A-Fa-f]\{6\}"' /tmp/pptx-qa-<slug>/ppt/slides/slide*.xml | sort | uniq -c

Cleanup: rm -rf /tmp/pptx-qa-<slug> after the probe.
EXPLICITLY document what was NOT verified in the completion summary (see Reporting below). The signal is load-bearing - "degraded mode + silent pass" collapses to "skipped" and defeats the gate's purpose.
Surface every degraded-mode run as a [Notice] candidate for the quarterly Skill Health Audit: is this machine missing soffice by design (no install allowed), by oversight (install it), or by tradeoff (acceptable for the deck's use context)? Three different answers, three different follow-ups.

Reporting:

Primary path: Visual Self-QA: <N> slides rendered and verified.
Degraded path: Visual Self-QA: DEGRADED MODE (no soffice CLI). Structural checks passed (<N> slides, <M> speaker notes, <K> title-presence checks). Visual rendering NOT verified - presenter MUST open the deck in PowerPoint, Keynote, or Google Slides before any live use. Failure modes NOT caught by this gate: autofit collapse, hex/RGBA mismatch, font overflow, color clash, label cutoff, shape positioning.

Why this isn't optional: PPTX is the one place the builder pipeline violates the agent-native-substrate principle (the agent writes JS, JS emits a binary, the binary is opaque to the agent). The recurring failure modes documented in CLAUDE.md (literal \n leaks, autofit, color formats) all fall in this gap. The gate's intent is to make the agent the first reader of its own output. The primary path closes the gap fully; the degraded path closes the structural half and ships an explicit signal for the visual half. Silent skip is the failure mode the gate exists to prevent.

Phase 4.5: Cross-Agent Deliverable Review (Required)

After generating the transcript, PPTX, and any code artifacts, run a cross-agent review on the deliverables themselves — not just the plan. This ensures the implementation matches the North Star and catches errors that single-agent generation misses.

Why This Phase Exists

Cross-agent review is not just for planning. The whole point is to utilize Claude, Codex, and Gemini for everything — planning, implementation, and QA — to get the most robust output. Each agent brings unique strengths to deliverable review.

North Star Alignment

Every review round must reference the North Star from the plan to prevent drift:

Extract the North Star statement from the plan (e.g., "A hands-on workshop where attendees build and deploy...")
Include it in every critic prompt so reviewers can verify deliverables serve the original vision
Reject any critic suggestion that contradicts or dilutes the North Star

Critic Roles for Deliverable Review

Codex — Code & Fact Verification

Review the following generated deliverables against the plan's North Star:

NORTH STAR: [paste from plan]

Verify:
1. All Solidity code compiles and follows best practices (if code repo exists)
2. All import paths are correct (e.g., @hiero/system-contracts/ — use the current package, not the legacy one)
3. All CLI commands (Foundry, cast, forge) use correct syntax and flags
4. All URLs are valid and point to correct resources
5. All technical claims in transcript match the plan's Fact Bank
6. Speaker notes in PPTX match transcript content
7. No placeholder text remains in any deliverable
8. Code examples in slides match the actual contract code

Output format:
1) CRITICAL_ISSUES (bugs, wrong facts, broken commands)
2) NON_CRITICAL_IMPROVEMENTS (style, clarity)
3) NORTH_STAR_ALIGNMENT (does the output serve the stated goal?)
4) READY_FOR_DELIVERY (yes/no + reason)

Gemini — Engagement & Visual Quality

Review the following generated deliverables for engagement and visual quality:

NORTH STAR: [paste from plan]

Verify:
1. Transcript flows naturally as spoken word (not robotic reading)
2. Humor lands and is appropriately placed
3. Slide layouts are visually balanced (no overcrowding, proper spacing)
4. Code blocks are readable (font size, contrast)
5. Transitions between slides are smooth
6. Engagement is driven by content hooks, humor, and pacing (never audience interaction prompts)
7. Timing allocations are realistic for each slide's content density
8. Visual hierarchy guides the eye correctly on each slide
9. NO slide-narration: spoken words add context beyond what's on screen, not repeat it
10. NO monotonous enumeration: slides don't follow "here are N things" + read each one
11. NO audience interaction prompts: never write "show of hands", "raise your hand", "who here has", polls, or any line that requires the audience to respond. Assume zero interaction except Q&A at the end. Engagement comes from content, humor, pacing, and rhetorical questions (ones you answer yourself immediately).
12. Tidbits from Additional Info woven into spoken parts (origin stories, fun facts, personal reactions)
12. Varied paragraph structure across slides (not the same intro-list-conclusion pattern)
13. Conversational bridges used ("Here's the thing...", "What's interesting is...")
14. At least one "off-script" human moment per 3-4 slides

Output format:
1) ENGAGEMENT_ISSUES (dead zones, missing humor, awkward transitions)
2) SLIDE_NARRATION_ISSUES (slides that read like bullet-point recitation)
3) VISUAL_ISSUES (layout problems, readability concerns)
4) NORTH_STAR_ALIGNMENT (does the output serve the stated goal?)
5) READY_FOR_DELIVERY (yes/no + reason)

Procedure

Concatenate deliverable content into inline prompts (never reference file paths for critics)
Run Codex and Gemini critics in parallel with per-agent timeouts
Merge feedback, prioritizing consensus items
Apply accepted changes to deliverables
Re-generate PPTX if slide code was modified
Max 2 rounds for deliverable review (lighter than plan review)

Integration with Code Repos

When the deliverables include a code repository (e.g., workshop code), Codex should also verify:

Solidity syntax and pragma version
Import path correctness against the target library structure
Test coverage matches specification
README accuracy against actual project structure
Build configuration (foundry.toml, remappings) is correct

Phase 5: DOCX Presenter Reference

File: `workspace/<project>/generate-docx.js` → `workspace/<project>/<slug>-transcript-v<N>.docx`

Use the docx npm package to generate a formatted Word document.

Data: Shared Parser (Default Mode, Preserves Stage Cues)

Load slide data and Q&A from the transcript via the shared parser from Phase 3.7. Default mode preserves stage directions like (pause) as cues in the printed presenter reference (PPTX strips them for clean presenter-view notes; DOCX keeps them for read-through rehearsal).

const path = require("path");
const { parseTranscript } = require("./transcript-parser");

const transcriptPath = path.join(__dirname, "transcript-v1.md");
const { slides, qaItems } = parseTranscript(transcriptPath);
// slides[i] = { num, title, timing, onScreen, spokenParas, infoBullets, diagrams }
// qaItems[i] = { question, answer }

The DOCX renders slides[].spokenParas as body text, slides[].onScreen as the visual-cue bar, slides[].infoBullets as the Additional Info box, and qaItems as the Anticipated Q&A appendix. Every regenerate picks up the latest transcript edits with zero manual sync.

Design System (DOCX)

const C = {
  purple: "6B46C1",      // Primary heading color
  purpleLight: "E9E0FA",  // Light background
  teal: "0D9488",         // Secondary accent
  tealLight: "CCFBF1",    // Light secondary
  dark: "1E293B",         // Dark text
  body: "475569",         // Body text
  border: "CBD5E1",       // Table borders
  blue: "2563EB",         // Info boxes
  blueLight: "DBEAFE",    // Info box background
  warn: "DC2626",         // Warning color
  green: "059669",        // Success/defense
};

Component Library

Build these reusable functions:

Function	Purpose	Visual
`slideHeader(num, title, timing)`	Purple badge + title + timing	Numbered header bar
`onScreen(text)`	Gray bar describing slide visual	"On screen: ..."
`transcript(lines)`	Indented speaker lines	Bold/italic formatting
`infoBox(title, bullets)`	Blue bordered box	"Additional Info" section
`comparisonTable(left, right)`	Two-column comparison	Colored headers
`flowArrow(steps)`	Numbered vertical flow	Steps with arrows
`diagramTable(items)`	Label + description rows	Colored left column

Key docx-js Rules

Always use WidthType.DXA — never WidthType.PERCENTAGE (breaks in Google Docs)
US Letter: width 12240, height 15840, content width 9360 (with 1" margins)
Table width = sum of columnWidths — both columnWidths array AND cell width must match
Use ShadingType.CLEAR — never SOLID for table shading
Colors must be exactly 6 hex digits — never append opacity suffixes
Never use \n — use separate Paragraph elements
PageBreak must be inside a Paragraph

Validation

python scripts/office/validate.py output.docx
python3 -m markitdown output.docx | head -100

Check that all slides and Additional Info sections are present.

Phase 6: Review & Iterate

User Review Cycle

Present the transcript (markdown) first — it's fastest to review
Generate PPTX only after transcript is approved
Generate DOCX after PPTX is approved (or in parallel)
Each round of feedback creates a new version (v1 → v2 → v3)

Common Feedback Patterns

Feedback	Action
"Too technical"	Replace jargon with analogies, add everyday comparisons
"Too long"	Compress timing, merge slides, cut redundant examples
"Need more detail"	Expand Additional Info sections, add historical examples
"More humor"	Add self-deprecating jokes, funny analogies, unexpected observations
"Wrong fact"	Spawn research sub-task to verify, update transcript + slides
"Add topic X"	Add to existing slide's Additional Info or create new slide
"Timing is off"	Recalculate timing budget, adjust per-slide allocations

Fact Verification Checklist

Before finalizing any version, verify:

All numbers/statistics have a source and date
No outdated claims (check if data is > 6 months old)
Technical claims are accurate (protocol features, supported assets)
Historical events have correct dates and amounts
Disclaimers are present where required (financial, legal, medical)

File Structure

workspace/<project>/
  north-star.md                         # North Star guardrail
  <slug>-plan-v<N>.md                   # Document plan
  <slug>-research.md                    # Research brief (if applicable)
  <slug>-transcript-v<N>.md             # Versioned transcript
  <slug>-presentation-v<N>.pptx         # Final slide deck
  <slug>-transcript-v<N>.docx           # Final presenter reference
  generate-pptx.js                      # PPTX generator script (project-specific)
  generate-docx.js                      # DOCX generator script (project-specific)
  critique-r<N>-*.md                    # Cross-agent review output
  demo-repo/                            # Tutorial/demo code (if applicable)
  skills/
    presentation-builder/SKILL.md       # This skill
    cross-agent-plan-review/SKILL.md    # Content review skill (optional)

Quick Start Template

1. Initialize

mkdir <topic> && cd <topic>
npm init -y
npm install pptxgenjs docx
# Set "type": "commonjs" in package.json

2. Define Parameters

Ask the user for: topic, audience, duration, presenter, slide count, date/event, tone, constraints.

3. Build Slide Outline

Create a table:

| Slide | Title | Time | Content Type |
|-------|-------|------|-------------|
| 1 | Title | 0:00-0:30 | Title |
| 2 | Why This Matters | 0:30-1:45 | Concept |
| ... | ... | ... | ... |
| N | Q&A | XX:XX-end | Resources |

4. Research (parallel sub-tasks)

Spawn fact-checking + engagement content sub-tasks simultaneously.

5. Write Transcript v1

Follow the markdown template (Phase 3). Include all Additional Info sections.

6. Review → v2 → v3

Iterate based on user feedback. Each version is a new file.

7. Generate PPTX

Write workspace/<project>/generate-pptx.js with the slide content. Run node workspace/<project>/generate-pptx.js.

8. Generate DOCX

Write workspace/<project>/generate-docx.js with the presenter reference. Run node workspace/<project>/generate-docx.js.

9. Final Validation

All slides present in all three artifacts
Timing adds up to stated duration
Facts verified
Visual QA on PPTX
Content QA on DOCX

Lessons Learned

These patterns emerged from building the DeFi, Catalyst, and hedera-forking presentations and should be followed:

Content

Analogies are the #1 tool for a non-technical audience. Every concept needs at least one.
"Additional Info" sections save the presenter during Q&A. Make them comprehensive.
Humor works best when self-deprecating or absurdist. Avoid insider jokes the audience won't get.
Bold the insight, not the setup. The punchline of each section should pop visually.
Stage directions should be sparse. Only include key moments: _(pause)_, _(smile)_. Don't over-direct with constant _(gesture left)_, _(point to chart)_ — it makes the transcript feel like a robot script.
The #1 transcript mistake is slide-narration. If the transcript sounds like someone reading bullet points off the screen, it's wrong. The spoken words should ADD to what's on screen — context, stories, tidbits, personal reactions — not repeat it.
Pull tidbits from Additional Info into the spoken part. Each slide should have 1-2 small facts, origin stories, or "did you know" moments woven into the spoken transcript. This is the difference between a boring lecture and an engaging talk.
Vary paragraph structure across slides. Some open with questions, some with stories, some with surprising stats. Monotonous structure = monotonous delivery.
No-redundancy enforcement. Explain each concept once at its natural point in the presentation. Later slides should reference the earlier explanation, not repeat it. For example, explain how htsSetup() works on the architecture slide, then on later slides simply say "the emulation layer we set up earlier" rather than re-explaining the mechanism.

Mermaid Diagrams in Transcripts

Every slide with a visual/flow/architecture component MUST have a Diagrams section with: a bold diagram title, multi-paragraph prose description, and a Mermaid code block. This applies to architecture diagrams, flow charts, comparison diagrams, timelines, and any slide where the [On screen:] description shows a visual structure.
Diagram descriptions are the presenter's reference material, not just captions. Each description should be 2-5 paragraphs explaining every box, arrow, and flow in the diagram. Include what each element represents, why it's there, and anticipated questions ("If someone asks 'why does X happen?' - the answer is...").
Single-line labels only in Mermaid. Never use \n or <br/> in node labels. If a label is too long, shorten it.
Color-code diagram elements using Mermaid style directives: green for success/solved, red for failure/broken, blue for the solution/product, amber/orange for warnings, purple for infrastructure.
Slides that typically need diagrams: "Without vs With" comparisons, architecture overviews, data flow diagrams, deployment pipelines, development cycles, component relationships, and any slide titled "DIAGRAM:" in the plan.
Slides that do NOT need diagrams: title slides, Q&A slides, pure text/quote slides, resource/link slides, and comparison tables (unless the comparison has a flow component).

Additional Info Depth Requirements

Additional Info sections are NOT summaries. They are deep reference material. Each bullet should be 2-4 sentences explaining a concept at implementation level, not just defining it.
Every concept mentioned on the slide or in the transcript gets its own bullet. The depth should match the presentation's audience and topic. Technical presentations: function signatures, API endpoints, protocol internals, error codes, configuration specifics. Strategic presentations: market data with sources and dates, competitive positioning details, regulatory framework specifics, financial model assumptions.
The depth test: A developer/analyst in the audience should be able to look at the Additional Info and either implement the described feature or validate the described claim without further research.
Aim for 8-15 bullets per content slide, each with genuine depth. Title and Q&A slides can have fewer.
Common patterns for deep bullets: "What X is", "How X works internally", "Why X was chosen over alternatives", "What breaks when X fails", "Edge cases with X", "Common question about X with prepared answer".

Demo-Heavy Presentations

Demo transition slides are shown during live demo segments. These slides use timing markers like "during demo" instead of specific timestamps, and serve as visual anchors while the presenter is in a terminal window. They should have minimal text — just the demo topic and a brief description.
Real-world example appendices provide deep-dive reference material for demo-heavy presentations. For example, the hedera-forking presentation included appendix sections for the SaucerSwap mainnet fork test walkthrough and Bonzo Finance lending/borrowing walkthrough. These give the presenter detailed step-by-step reference material for real-world examples shown during demos.
Anticipated audience questions appendix should include 15+ detailed anticipated questions with thorough answers. These serve as Q&A prep and as a reference for the presenter during the Q&A portion. Organize by topic area (e.g., "Technical Questions", "Security Questions", "Ecosystem Questions") with full-length prepared answers, not just bullet points.

Host Q&A Format

For community call format where a host asks guided questions during the presentation, include a Host Questions Q&A Prep section with 6-10 pre-planned host questions and full-length prepared answers. The host may ask these questions at any time during the session, so prepare answers that work regardless of when they're asked. Each answer should be 3-5 sentences — conversational, not scripted — and reference specific slides or concepts from the presentation.

Technical

Write scripts in parts (~200 lines each) and concatenate. Large scripts hit output limits.
Dark PPTX backgrounds + light text = projector-friendly. Avoid light backgrounds.
DOCX color values must be exactly 6 hex digits. Never append opacity (e.g., "05966920" fails).
Use ShadingType.CLEAR not SOLID in docx-js to prevent black backgrounds.
Validate DOCX with both validate.py and markitdown to catch structural and content issues.
PPTX helper functions (card, titleText, addSlideNum) keep slide code DRY. Define them once.

Process

Transcript first, slides second. The words drive the visuals, not the other way around.
Research sub-tasks run in parallel. Don't block on fact-checking — spawn them early.
Never overwrite versions. Keep v1, v2, v3 side by side.
Cross-agent review is overkill for most presentations but valuable for regulatory, financial, or medical topics where accuracy is critical.
The presenter should NOT read word-for-word. The transcript is a guide, not a teleprompter script. Say this explicitly in the document.

Cross-Agent Review Integration (Full Lifecycle)

Cross-agent review applies to the entire presentation lifecycle, not just planning:

Phase 2 — Plan Review (Before Implementation)

Reference skills/review/cross-agent-plan-review/SKILL.md for the full procedure
Treat the slide outline + transcript as the "plan"
Codex: fact verification, code accuracy, URL validation
Gemini: humor, engagement, visual design, analogies
Max 3 rounds. Stop when CRITICAL_GAPS_REMAINING is none.

Phase 4.5 — Deliverable Review (After Implementation)

Review generated transcript, PPTX, and code artifacts
Always reference the North Star to prevent drift from original vision
Codex: verify code correctness, CLI commands, import paths, fact bank alignment
Gemini: verify engagement quality, slide layouts, transcript flow, visual balance
Max 2 rounds. Lighter touch — focus on implementation errors, not redesign.

When to Use

Always. Cross-agent review is not optional for presentations built with this skill. The whole point is to utilize Claude, Codex, and Gemini for everything to get the most robust output. Each agent catches different classes of errors:

Claude catches structural and logical issues
Codex catches factual errors, outdated references, and code bugs
Gemini catches engagement gaps, visual problems, and delivery issues

Output Rules

Never write files larger than 200 lines in a single response
For large files, write in sequential chunks and continue automatically
If you hit an output token limit, resume exactly where you stopped

Version: 1.2.0

Changelog:

1.2.0 (2026-05-17): New Phase 3.7 - Shared Transcript Parser (workspace/<project>/transcript-parser.js). Both generate-pptx.js (PPTX speaker notes, stripStageDirections: true) and generate-docx.js (presenter reference, default mode) import the same dependency-free parser, eliminating the manual-sync drift mode where transcript edits propagated to DOCX but not to hardcoded PPTX speaker-note strings. Reference parser embedded inline; markdown conventions and sync-divergence discipline documented. Phase 4 PPTX gained "Speaker Notes via Shared Parser (No Hardcoded Strings)" subsection. Phase 5 DOCX gained "Data: Shared Parser" subsection. Proven on the H3 context-engineering talk (workspace/context-engineering-talk/) - DOCX byte-identical pre/post refactor; 20 PPTX speaker notes auto-loaded from transcript with stage directions cleanly stripped.
1.1.0 (2026-05-17): Phase 4 - documented Visual Self-QA degraded mode (XML-grep fallback + explicit NOT-verified callout for machines without soffice); added Diagram Approach subsection (native pptxgenjs shapes default for PPTX deck, Mermaid stays for markdown transcript). Phase 3 - swapped em-dash for hyphen in slide-header template per global no-em-dash rule.
1.0.0: Initial skill from DeFi Under The Hood presentation patterns.

name	presentation-builder
description	End-to-end skill for creating professional Lunch & Learn style presentations. Produces a markdown transcript, PPTX slide deck, and DOCX presenter reference with diagrams. Supports iterative refinement, fact-checking via research sub-tasks, and optional cross-agent review for content accuracy. Works with any agent CLI (Claude, Codex, Gemini) as the primary session.
tools-needed	file read/write, shell execution, user interaction, parallel sub-tasks

Presentation Builder Skill

Foundation: This skill inherits the quality framework from skills/builders/builder-foundation/SKILL.md. Presentation-builder already has mature quality patterns (Phase 3.5 Humanizer, cross-agent review, iterative feedback). The foundation's Quality Dimensions (Depth, Originality, Coherence, Completeness) and weakness-weighted review tiers supplement the existing patterns.

Purpose

Create a complete, polished presentation package from a topic brief. The output is three artifacts:

Markdown transcript (<topic>-transcriptv<N>.md) — speaker script with slide-by-slide notes
PPTX slide deck (<topic>-presentationv<N>.pptx) — visual slides for screen sharing
DOCX presenter reference (<topic>-transcriptv<N>.docx) — formatted reading guide with diagrams

This skill codifies the workflow used to build the "DeFi Under The Hood" presentation (24 slides, 30 min + 10 min Q&A) and generalizes it for any technical topic.

When To Use

User asks to create a presentation, Lunch & Learn, talk, or lecture
User has a topic and audience in mind
Output should be presentation-ready (not just an outline)

Prerequisites

# In the working directory:
npm init -y
npm install pptxgenjs docx

Ensure package.json has "type": "commonjs" (both pptxgenjs and docx use CommonJS).

Workflow Overview

Phase 1: Define → Phase 2: Research & Plan Review (Cross-Agent) → Phase 3: Transcript → Phase 3.5: Humanizer Pass → Phase 4: PPTX → Phase 4.5: Deliverable Review (Cross-Agent) → Phase 5: DOCX → Phase 6: Final Review

Each phase produces a versioned artifact. Versions increment on user feedback (v1 → v2 → v3).

Phase 1: Define the Presentation

Collect these parameters before writing anything:

Parameter	Example	Required
Topic	"DeFi Under The Hood"	Yes
Audience	"DeFi beginners — assume zero crypto knowledge"	Yes
Duration	"30 minutes + 10 min Q&A"	Yes
Presenter	""	Yes
Slide count	24	Yes
Date/event	"February 2026 \| Lunch & Learn"	Yes
Tone	Conversational, humorous, analogy-heavy	Yes
Key constraints	"No financial advice disclaimer", "Must mention Hedera"	Optional

Audience Calibration

The audience level drives everything:

Audience	Language	Analogies	Jargon	Additional Info
Zero knowledge	Simple, everyday	Heavy (vending machines, Legos)	Define every term	Detailed for Q&A prep
Some knowledge	Mix of simple + technical	Moderate	Define new terms only	Medium depth
Technical	Direct, precise	Sparse	Assume familiarity	Deep technical details

Timing Budget

Allocate time per slide based on content density:

Content Type	Time per Slide
Title / intro	0:30
Concept introduction	1:00 - 1:30
Technical deep-dive	1:30 - 2:00
Comparison / walkthrough	1:15 - 1:45
Summary / takeaways	1:00 - 1:15
Q&A / resources	Remaining time

Total must equal stated duration. Build the timing table before writing any transcript.

Phase 2: Research

Specialized Critic Roles

Each counterpart agent has a specialized focus area that plays to its strengths. Use role-specific prompts instead of generic review prompts.

Codex — Fact Verification & References

Codex is the fact-checker. Use it to verify every claim, URL, code snippet, and technical detail.

Fact-check the following presentation content. For each claim, provide:
  1. Verification status: VERIFIED / OUTDATED / INCORRECT / UNVERIFIABLE
  2. Source URL confirming or correcting the claim
  3. Date of source data
  4. Any version-specific concerns

Also check:
  - All code snippets for syntax correctness
  - All CLI commands for correct flags and syntax
  - All system contract addresses against current documentation
  - All URLs for liveness (LIVE / DEAD / REDIRECT)
  Flag anything outdated, disputed, or missing a reference.

Gemini — Humor, Engagement & Visual Design

Gemini is the engagement and visual specialist. Use it for humor, audience interaction, analogies, transitions, and image/diagram prompt refinement.

For a [duration] presentation on [topic] aimed at [audience]:
  1. Generate 2-3 jokes/funny lines PER SECTION that match the stated tone
  2. Generate 10+ analogies that explain technical concepts using everyday objects
  3. Generate transition lines between major sections
  4. Identify engagement dead zones (3+ slides without humor or strong hooks)
  6. Refine all Image Prompt fields for better visual output:
     - Better composition and focal points
     - Color harmony with the design system
     - Diagram layout improvements and visual hierarchy
  7. Suggest speaker note improvements for natural delivery cadence

Codex output feeds into the Fact Bank and code verification. Gemini output feeds into transcript writing, speaker notes, and image generation.

Cross-Agent Plan Review

For presentations with high accuracy requirements (financial data, regulatory claims, technical specifications), run the cross-agent plan review loop with specialized prompts:

Draft the slide outline as a "plan"
Run Codex with the fact-verification prompt (see skills/presentations/presentation-planner/SKILL.md for full template)
Run Gemini with the engagement/visual prompt (see skills/presentations/presentation-planner/SKILL.md for full template)
Merge, triage, and iterate (max 3 rounds)

This is opt-in. Ask the user before running it. Cost scales with critic count.

Phase 3: Transcript (Markdown)

File: `<topic>-transcriptv<N>.md`

Structure for each slide:

---

## SLIDE <N> - <Title> (<start> - <end>)

**[On screen: <description of what the audience sees on the slide>]**

<Transcript line 1 — natural speaking style, not read-word-for-word>

<Transcript line 2 — **bold** for emphasis, _(italics)_ for stage directions>

<Transcript line 3>

**[Advance]**

### Additional Info — Slide <N>
- **Term/concept 1**: Deep explanation at implementation level — what it is, how it
  works internally, why it matters. For technical talks: function signatures, API
  endpoints, protocol internals, error codes. For strategic talks: market data with
  sources, competitive details, regulatory specifics. The presenter should be able
  to answer any follow-up question from this section alone. 2-4 sentences per bullet.
- **Term/concept 2**: Same depth. Every concept the slide mentions gets its own
  bullet with enough detail to satisfy the most skeptical audience member.
- **Common question**: "Why does X happen?" — prepared answer with specifics
- **Edge case**: What breaks, what's unsupported, what's surprising
- ...8-15 bullets per content slide, each with genuine depth

### Diagrams (on slides with visual/flow/architecture content)

**<Diagram title>:**

<Multi-paragraph prose description explaining every element of the diagram.
Each box, arrow, and flow should be explained — what it represents, why it's
there, and how it connects to the concepts being discussed. Include anticipated
questions inline: "If someone asks 'why does X route through Y?' — the answer
is..." This section should be 2-5 paragraphs, rich enough that someone who
cannot see the diagram still understands the full picture.>

` ` `mermaid
<Mermaid diagram code — single-line labels only, no \n or <br/>>
` ` `

Note: Replace ` ` ` above with actual triple backticks (shown spaced to avoid markdown parsing issues in this skill file).

Presentation Type Adaptation

Before writing the transcript, check the plan's North Star for the declared Presentation Type (Executive / Technical / Community / Educational). Adapt the transcript accordingly:

Executive: No jargon in spoken text. Define every domain term inline on first use. Keep transcript paragraphs short (2-3 sentences). Numbers and outcomes over mechanisms. Revenue/business model slides come AFTER the solution is understood. Never explain implementation details in the spoken transcript - put them in Additional Info only.
Technical: Use precise terms. Include code snippets, formulas, and specific numbers in the spoken text. The audience expects depth - do not simplify away the interesting parts. Show your work.
Community: Conversational tone. Balance accessibility with substance. Show how the audience can participate or contribute. Avoid marketing language - community audiences detect it instantly.
Educational: Define EVERY term. Use analogies for every concept. Build knowledge incrementally - do not reference concepts before they are explained. Check: could someone with zero domain knowledge follow slide N without having read a separate glossary?

If no type is declared, default to Executive (the most constrained profile - it is easier to add depth than to remove it).

Transcript Writing Rules

Write how you talk, not how you write. Short sentences. Sentence fragments are fine. Conversational tone.
One idea per paragraph. Each > block is one thought.
Bold the punchlines. The key insight in each section should be bold.
Stage directions in italics. Use sparingly — _(pause)_, _(smile)_ for key moments. Avoid excessive _(gesture left)_, _(point to chart)_ — trust the presenter to know where to gesture.
Analogies anchor every concept. If you can't explain it with an analogy, simplify further.
Humor is earned. One or two jokes per slide max. Self-deprecating > punching down.
Transitions are explicit. Don't just advance — tell the audience where you're going.
"Additional Info" sections are deeply detailed and encyclopedic. These are NOT surface-level summaries. They are the presenter's lifeline during Q&A. Every concept mentioned on the slide or in the transcript must be explained at depth appropriate to the topic. For technical presentations: include function signatures, API endpoints, protocol internals, error codes, configuration specifics, and implementation details. For strategic presentations: include market data with sources, competitive analysis, regulatory details, and financial assumptions. The rule: if the slide mentions a concept, the Additional Info must explain it thoroughly enough that the presenter can answer any follow-up question without external research. Use **bold prefix**: for scannable headers. Aim for 8-15 bullets per content slide, each with 2-4 sentences of real depth.
No redundant explanations across slides. Each concept is explained fully ONCE at its primary slide. Later slides reference it briefly ("as we saw earlier") but never re-explain the full mechanism. This applies to: technical flows, error descriptions, key phrases/mottos, and analogies. Review all speaker notes as a single narrative — if a paragraph re-teaches something already covered, cut it. The audience heard it the first time; repetition wastes time and dilutes impact.
No-repetition rule. Do not repeat the same concept, phrase, or explanation more than twice in the entire transcript. Once is ideal, twice maximum if warranted (e.g., an introduction and a recap). Track key phrases across all slides — if you've already explained something (like how a technology works or why something breaks), reference it later with a short callback ("as we saw earlier" or "remember the error from slide 2") instead of re-explaining. This prevents the presentation from dragging. Specifically watch for: technical error descriptions, architecture explanations, taglines/catchphrases, and "how X works" chains.
Mixed-audience language rule. Write for a room that has both technical and non-technical people. Default to simple, concrete language. When introducing a technical concept, use a one-sentence plain-English explanation FIRST, then optionally add the technical term in parentheses. Avoid jargon cascades (chains of 3+ technical terms in one sentence). Put deep technical details in Additional Info or speaker notes, not in the spoken transcript. Examples: "the EVM's error code" not "the INVALID opcode (0xfe) defined in the Yellow Paper, Appendix H"; "shells out to fetch data from the API" not "uses FFI to spawn a subprocess that executes curl against the REST API endpoint."
Executive structure rule. For leadership/decision-maker audiences, follow the thread: Problem -> Solution -> Why Us -> Outcome -> How We Make Money -> Risks -> Ask. The business model and revenue slides should come AFTER the audience understands what the product IS and why it matters. Explaining monetization too early distracts from the value proposition. The audience needs to be convinced the product solves a real problem before they care how it generates revenue.
Follow-the-thread rule. The presentation should feel like one continuous thread moving forward, not a series of loops back to earlier content. If the audience needs context from an earlier slide, use a one-sentence callback ("remember the base pool from earlier"), not a re-explanation. Each slide should advance the narrative, not revisit it. Test: read all speaker notes consecutively - if any paragraph sounds like "going back to" or re-explains a mechanism already covered, cut it or replace with a brief callback.

Natural Delivery Rules (Critical)

The transcript should sound like a real human talking to a room, not an AI reading slide content out loud. Follow these rules strictly:

Anti-Patterns — NEVER Do These

Never enumerate slide content. If the slide shows three bullet points, do NOT say "Number one... Number two... Number three..." or "First... Second... Third..." The audience can read. Your job is to add context, stories, and color around what's on screen — not narrate it like a teleprompter.
- BAD: "Three things make DeFi work. Permissionless — no application forms. Transparent — every transaction is auditable. Composable — protocols plug into each other."
- GOOD: "What makes it actually work? Well, it's permissionless — meaning no application forms, no credit checks. Your wallet is your ID, full stop. It's transparent — every line of code is public. Imagine if your bank had to publish every decision it ever made — they'd shut down voluntarily. And it's composable — this one's the sleeper. We'll come back to it because it's honestly the most powerful idea in all of DeFi."
Never use uniform paragraph structure across slides. If every slide follows the same pattern (introduce concept → list items → bold conclusion), the presentation sounds monotonous. Vary the rhythm — some slides lead with a story, some with a question, some with a surprising fact.
Never describe the slide layout as a script. Avoid "As you can see on the left side of the slide..." or "The flow diagram at the bottom shows..." unless the visual is genuinely confusing. The audience has eyes. Instead, let your words complement what's on screen, not describe it.
Never make every sentence informational. Real speakers have "breathing room" lines — throwaway comments, personal reactions, rhetorical asides. These are essential for pacing. A presentation that's 100% information is exhausting to listen to.

Positive Patterns — ALWAYS Do These

Weave in tidbits from Additional Info. Pull 1-2 small facts, origin stories, or "did you know" moments from the Additional Info section into the spoken transcript. This adds texture that the slide alone doesn't provide.
- Example: "Fun fact — this formula was first sketched out by Vitalik Buterin in a Reddit post in 2017. A Reddit post! That's how some of the most important financial infrastructure in the world got started."
Use conversational bridges. Real people use connective phrases: "Here's the thing...", "What's interesting is...", "So picture this...", "I'll be honest...", "This blew my mind when I first learned it...", "And here's the part that trips people up..."
Add personal reactions. The presenter should react to their own content as a human would — "When I first learned about this, I spent two hours staring at the screen going 'wait, that can't be right.'" This builds rapport and signals authenticity.
Vary sentence rhythm aggressively. Mix long flowing explanatory sentences with punchy short ones. A paragraph with five same-length sentences sounds robotic. Interrupt yourself. Use dashes. Drop a one-word sentence for emphasis. Like that.
Let the slide do its job. If the slide shows a comparison table, don't read both columns. Pick one side and tell a story about it. The audience will read the other column themselves.
Connect ideas across slides. Reference earlier concepts: "Remember the vending machine from earlier? Same idea here, but now..." This makes the presentation feel like a coherent story instead of 24 disconnected slides.
Include at least one "off-script" moment per 3-4 slides. A brief aside, a personal anecdote, a "just between us" moment. These are what make a talk feel human rather than rehearsed. They can be planned — the audience won't know.

Timing Markers

Every slide header includes timing: (start - end). The sum of all slide durations must equal the stated presentation duration. Q&A time is separate and listed on the final slide.

Versioning

v1: First draft from outline + research
v2: After user feedback (language, coverage, timing adjustments)
v3: After second feedback round (simplified language, more analogies, fact corrections)

Only keep the latest version. Delete previous versions when a new one is created (e.g., delete transcriptv1.md when transcriptv2.md is written).

Phase 3.5: Humanizer Pass (Mandatory)

After completing the transcript draft, run it through the humanizer skill (skills/humanizer/SKILL.md). This is a mandatory step, not optional.

What the Humanizer Does

Applies 24 pattern-detection rules to remove AI writing tells:

Significance inflation ("serves as", "testament to", "pivotal moment")
Copula avoidance ("serves as" -> "is")
Promotional language ("groundbreaking", "vibrant", "showcasing")
Rule-of-three forcing
Em-dash overuse
Filler phrases ("In order to", "It is important to note that")
Generic positive conclusions
Uniform paragraph structure

Process

Re-read the full transcript after completing the draft
Apply all 24 humanizer patterns systematically
Apply the 7 presentation-specific rules from the humanizer skill
Run the anti-AI audit: "What makes this obviously AI generated?" - list remaining tells
Fix remaining tells
Update the transcript file in place

When to Skip

Never. The humanizer pass is always applied to transcripts. The only content exempted is structured data: tables, Mermaid diagrams, Fact Bank entries, and appendix reference material.

Phase 3.7: Shared Transcript Parser (Drift Prevention)

File: `workspace/<project>/transcript-parser.js`

Parser Template

// Shared transcript parser for presenter bundles.
// Consumed by generate-pptx.js (speaker notes) and generate-docx.js (presenter reference).
// One source of truth (transcript-v<N>.md) eliminates the manual-sync drift mode
// where hardcoded speaker notes diverge from the transcript as the talk gets edited.
//
// Exports parseTranscript(transcriptPath, opts?) -> { slides, qaItems }.
//   slides[i] = { num, title, timing, onScreen, spokenParas, infoBullets, diagrams }
//   qaItems[i] = { question, answer }
//
// opts.stripStageDirections (default false): when true, removes _(pause)_-style
// italic stage directions entirely. PPTX speaker notes use true (clean prose for
// presenter view); DOCX presenter reference uses false (preserves cues in print).

const fs = require("fs");

function clean(text, opts = {}) {
  const stripStage = !!opts.stripStageDirections;
  let out = text
    .replace(/\*\*\[On screen: ([\s\S]+?)\]\*\*/g, "")
    .replace(/\*\*\[Advance\]\*\*/g, "")
    .replace(/\*\*\[Advance to Q&A\]\*\*/g, "")
    .replace(/\*\*(.+?)\*\*/g, "$1")
    .replace(/(?<!\*)\*([^*\n]+?)\*(?!\*)/g, "$1");

  if (stripStage) {
    out = out
      .replace(/\s*_\([^)]+\)_\s*/g, " ")
      .replace(/[ \t]+/g, " ");
  } else {
    out = out.replace(/_(\([^)]+\))_/g, "$1");
  }

  out = out
    .replace(/_(\w[^_\n]*?)_/g, "$1")
    .replace(/`([^`]+)`/g, "$1")
    .trim();

  return out;
}

function parseTranscript(transcriptPath, opts = {}) {
  const transcript = fs.readFileSync(transcriptPath, "utf-8");
  const slideRegex = /^## SLIDE (\d+) - (.+?) \(([^)]+)\)\s*$/m;
  const sections = transcript.split(/^---\s*$/m);

  const slides = [];
  sections.forEach(section => {
    const headerMatch = section.match(slideRegex);
    if (!headerMatch) return;
    const [, num, title, timing] = headerMatch;

    const onScreenMatch = section.match(/\*\*\[On screen:\s*([\s\S]+?)\]\*\*/);
    const onScreen = onScreenMatch ? onScreenMatch[1].trim().replace(/\s+/g, " ") : "";

    const spokenMatch = section.match(/\*\*\[On screen:[\s\S]+?\]\*\*\s*([\s\S]*?)\s*\*\*\[Advance/);
    const spokenRaw = spokenMatch ? spokenMatch[1] : "";
    const spokenParas = spokenRaw
      .split(/\n\n+/)
      .map(p => clean(p, opts))
      .filter(p => p.length > 0 && !p.startsWith("###"));

    const infoMatch = section.match(/### Additional Info - Slide \d+\s*\n([\s\S]+?)(?=\n---|\n## |\n### |$)/);
    let infoBullets = [];
    if (infoMatch) {
      infoBullets = infoMatch[1]
        .split(/\n- /)
        .map(line => line.replace(/^- /, "").trim())
        .filter(line => line.length > 0)
        .map(line => clean(line, opts));
    }

    const diagMatch = section.match(/### Diagrams - Slide \d+\s*\n([\s\S]+?)(?=\n### Additional Info|\n---|\n## |$)/);
    const diagrams = diagMatch ? diagMatch[1].trim() : null;

    slides.push({
      num: parseInt(num, 10),
      title, timing, onScreen, spokenParas, infoBullets, diagrams,
    });
  });

  slides.sort((a, b) => a.num - b.num);

  const qaMatch = transcript.match(/## Anticipated Q&A \(Presenter Prep\)\s*\n([\s\S]+?)(?=\n## |$)/);
  const qaText = qaMatch ? qaMatch[1] : "";
  const qaItems = [];
  if (qaText) {
    const qBlocks = qaText.split(/\n### Q: /).slice(1);
    qBlocks.forEach(block => {
      const lines = block.split("\n");
      const question = lines[0].replace(/^"|"$/g, "").trim();
      const answer = lines.slice(1).join("\n").trim();
      qaItems.push({ question, answer: clean(answer, opts) });
    });
  }

  return { slides, qaItems };
}

module.exports = { parseTranscript, clean };

Two Cleaning Modes (Same Source, Surface-Appropriate Output)

PPTX speaker notes show in PowerPoint's presenter view at delivery time. Pass stripStageDirections: true so stage cues like _(pause)_ get removed - cleanest possible prose, zero visual noise during the talk.
DOCX presenter reference is the read-through artifact (paper or screen) used for rehearsal. Use the default (stripStageDirections: false) so stage cues stay as (pause) text - they remind the presenter where to pause, slow down, or beat for emphasis during read-aloud.

Same transcript, surface-appropriate cleaning. Both surfaces auto-update from a transcript edit at next regenerate.

Markdown Conventions the Parser Expects

The parser assumes the markdown transcript follows the structure documented in Phase 3:

Slide blocks separated by --- on its own line
Each block opens with ## SLIDE <N> - <Title> (<timing>)
**[On screen: ...]** line for the visual cue (parser strips it from spoken output)
Spoken prose between the on-screen line and **[Advance]** / **[Advance to Q&A]**
Stage directions as _(pause)_, _(beat)_, etc. (italic markdown, parens inside)
### Additional Info - Slide <N> bullet list for presenter-reference depth
### Diagrams - Slide <N> for optional diagram description block
## Anticipated Q&A (Presenter Prep) appendix with ### Q: "<question>" blocks

Sync-Divergence Discipline When Adopting On An Existing Deck

Phase 4: PPTX Slide Deck

File: `workspace/<project>/generate-pptx.js` → `workspace/<project>/<slug>-presentation-v<N>.pptx`

Use pptxgenjs to generate visual slides programmatically.

Design System

For PPTX specifically, the palette to use is the dark / presentation mode from DESIGN.md. The minimum tokens to copy:

// Copy from DESIGN.md (presentation palette - dark mode)
const C = {
  bgDark:    "0B1121",   // surfaceDark
  bgMid:     "141B2D",   // surfaceMid
  bgCard:    "1A1F2E",   // surfaceCard
  purple:    "8259EF",   // primary
  teal:      "00E5CC",   // accent
  warn:      "FF6B6B",   // semantic.warn (PPTX variant)
  white:     "FFFFFF",   // textOnDark
  body:      "D1D5DB",   // textOnDarkBody
  dim:       "666666",   // muted text
};

Conventions inherited from DESIGN.md:

Dark backgrounds for projector visibility (white-on-bgDark exceeds WCAG AAA).
One accent color per slide max - teal is the visual hook, do not stack accents.
Sharp-edged rectangles only - never rounded corners.
Typography: Arial Black for titles only (32pt+); Calibri for body (14-18pt at projection distance); Consolas for code.

Slide Types

Build helper functions for each slide type:

Type	Use For	Key Elements
Title slide	Opening/closing	Large title, subtitle, gradient background
Concept slide	Introducing ideas	Title, 2-3 cards with icons, supporting text
Comparison slide	A vs B	Two-column layout, colored headers
Flow/timeline	Process, evolution	Numbered steps with arrows
Data table	Numbers, comparisons	Styled table with header row
Formula/code	Technical detail	Monospace font, highlighted box
Callout slide	Key stat or warning	Large number/emoji, minimal text
Diagram slide	Architecture, flow, relationships	Native pptxgenjs shapes (rectangles, lines, arrows)

Diagram Approach: Native Shapes Default, Mermaid for Markdown Only

Two separate diagram surfaces exist in this skill, with different defaults:

Surface	Renderer	Default	Why
Markdown transcript (`<slug>-transcript-v<N>.md`)	Notion / GitHub / markdown viewer	Mermaid	The consumer's tool already renders Mermaid. Code blocks are diff-friendly. Authoring is text-only.
PPTX deck (`<slug>-presentation-v<N>.pptx`)	PowerPoint / Keynote / Google Slides	Native `pptxgenjs` shapes	The renderer does NOT speak Mermaid. Producing in-deck Mermaid requires `mmdc` (Mermaid CLI) to render to PNG, then embedding the PNG - an extra runtime dependency rarely installed on Mac dev machines (`brew install mermaid-cli` is opt-in). Native shapes are deterministic, scriptable in the same JS file, version-controlled as code, and have zero external dependencies.

When Mermaid in PPTX is acceptable:

High-complexity diagrams (10+ nodes, multiple interconnections) where native-shape positioning math becomes more code than the Mermaid source.
Diagrams already authored as Mermaid for the markdown transcript and the visual is identical (avoids duplication).

Helper Function Pattern

function card(slide, x, y, w, h, opts = {}) {
  slide.addShape(pres.shapes.RECTANGLE, {
    x, y, w, h,
    fill: { color: opts.fill || C.bgCard },
    shadow: { type: "outer", blur: 6, offset: 2, color: "000000", opacity: 0.2 },
    rectRadius: opts.radius || 0.1,
  });
}

function titleText(slide, text, opts = {}) {
  slide.addText(text, {
    x: 0.5, y: 0.3, w: 9, h: 0.6,
    fontSize: 32, fontFace: "Arial Black", color: C.white, bold: true,
    ...opts,
  });
}

Speaker Notes via Shared Parser (No Hardcoded Strings)

At the top of generate-pptx.js:

const path = require("path");
const pptxgen = require("pptxgenjs");
const { parseTranscript } = require("./transcript-parser");

const { slides: transcriptSlides } = parseTranscript(
  path.join(__dirname, "transcript-v1.md"),
  { stripStageDirections: true },
);
const notesBySlide = Object.fromEntries(
  transcriptSlides.map(s => [s.num, s.spokenParas.join("\n\n")]),
);

At each slide block:

{
  let s = pres.addSlide();
  // ...visual content...
  s.addNotes(notesBySlide[N]);  // N matches the slide's number in the transcript
}

Execution

node workspace/<project>/generate-pptx.js
# Produces: workspace/<project>/<slug>-presentation-v<N>.pptx

Visual Self-QA Checkpoint (Mandatory Before Declaring Build Done)

Procedure (Primary Path - requires soffice CLI):

Convert the generated PPTX to PDF:

python scripts/office/soffice.py --headless --convert-to pdf workspace/<project>/<slug>-presentation-v<N>.pptx

Convert at least 3 representative slides (title, one mid-deck content slide containing a diagram, conclusion) to JPEG:
```
pdftoppm -jpeg -r 150 -f <N> -l <N> workspace/<project>/<slug>-presentation-v<N>.pdf workspace/<project>/qa-slide
```
Read each generated qa-slide-*.jpg via Claude Code's image-reading capability.
Verify per slide:
- Diagrams render fully (no \n literal text appearing; labels not cut off; native shapes positioned correctly)
- No text overflow, no autofit collapse, no overlapping shapes
- Colors match the plan (hex format applied correctly)
- Font choices and sizes match the template
If any visible issue is found, fix the generator script and re-render before proceeding.
Optional but recommended: run the same check on every slide for high-stakes presentations.
After the build is approved, delete the temporary qa-slide-*.jpg files (they are throwaway QA artifacts).

Degraded Mode (when soffice is unavailable):

Probe for soffice first: command -v soffice >/dev/null && echo "primary" || echo "degraded". If degraded, proceed below.

Unpack the PPTX (it is a zip) and run structural checks:

unzip -o workspace/<project>/<slug>-presentation-v<N>.pptx -d /tmp/pptx-qa-<slug>
# Slide count matches plan:
ls /tmp/pptx-qa-<slug>/ppt/slides/slide*.xml | wc -l
# Speaker notes present per slide:
ls /tmp/pptx-qa-<slug>/ppt/notesSlides/notesSlide*.xml | wc -l
# No literal \n leaking into diagram labels (Mermaid-era failure mode):
grep -l '\\n' /tmp/pptx-qa-<slug>/ppt/slides/slide*.xml || echo "OK no literal \\n"
# Each slide title appears in its XML (sanity check on title rendering):
for n in $(seq 1 <SLIDE_COUNT>); do
  grep -q "<expected-title-substring-for-slide-$n>" /tmp/pptx-qa-<slug>/ppt/slides/slide${n}.xml \
    && echo "slide $n title OK" || echo "slide $n title MISSING"
done
# DESIGN.md hex tokens appear at expected frequency (color application sanity):
grep -o 'val="[0-9A-Fa-f]\{6\}"' /tmp/pptx-qa-<slug>/ppt/slides/slide*.xml | sort | uniq -c

Cleanup: rm -rf /tmp/pptx-qa-<slug> after the probe.
EXPLICITLY document what was NOT verified in the completion summary (see Reporting below). The signal is load-bearing - "degraded mode + silent pass" collapses to "skipped" and defeats the gate's purpose.
Surface every degraded-mode run as a [Notice] candidate for the quarterly Skill Health Audit: is this machine missing soffice by design (no install allowed), by oversight (install it), or by tradeoff (acceptable for the deck's use context)? Three different answers, three different follow-ups.

Reporting:

Primary path: Visual Self-QA: <N> slides rendered and verified.
Degraded path: Visual Self-QA: DEGRADED MODE (no soffice CLI). Structural checks passed (<N> slides, <M> speaker notes, <K> title-presence checks). Visual rendering NOT verified - presenter MUST open the deck in PowerPoint, Keynote, or Google Slides before any live use. Failure modes NOT caught by this gate: autofit collapse, hex/RGBA mismatch, font overflow, color clash, label cutoff, shape positioning.

Phase 4.5: Cross-Agent Deliverable Review (Required)

Why This Phase Exists

North Star Alignment

Every review round must reference the North Star from the plan to prevent drift:

Extract the North Star statement from the plan (e.g., "A hands-on workshop where attendees build and deploy...")
Include it in every critic prompt so reviewers can verify deliverables serve the original vision
Reject any critic suggestion that contradicts or dilutes the North Star

Critic Roles for Deliverable Review

Codex — Code & Fact Verification

Review the following generated deliverables against the plan's North Star:

NORTH STAR: [paste from plan]

Verify:
1. All Solidity code compiles and follows best practices (if code repo exists)
2. All import paths are correct (e.g., @hiero/system-contracts/ — use the current package, not the legacy one)
3. All CLI commands (Foundry, cast, forge) use correct syntax and flags
4. All URLs are valid and point to correct resources
5. All technical claims in transcript match the plan's Fact Bank
6. Speaker notes in PPTX match transcript content
7. No placeholder text remains in any deliverable
8. Code examples in slides match the actual contract code

Output format:
1) CRITICAL_ISSUES (bugs, wrong facts, broken commands)
2) NON_CRITICAL_IMPROVEMENTS (style, clarity)
3) NORTH_STAR_ALIGNMENT (does the output serve the stated goal?)
4) READY_FOR_DELIVERY (yes/no + reason)

Gemini — Engagement & Visual Quality

Review the following generated deliverables for engagement and visual quality:

NORTH STAR: [paste from plan]

Verify:
1. Transcript flows naturally as spoken word (not robotic reading)
2. Humor lands and is appropriately placed
3. Slide layouts are visually balanced (no overcrowding, proper spacing)
4. Code blocks are readable (font size, contrast)
5. Transitions between slides are smooth
6. Engagement is driven by content hooks, humor, and pacing (never audience interaction prompts)
7. Timing allocations are realistic for each slide's content density
8. Visual hierarchy guides the eye correctly on each slide
9. NO slide-narration: spoken words add context beyond what's on screen, not repeat it
10. NO monotonous enumeration: slides don't follow "here are N things" + read each one
11. NO audience interaction prompts: never write "show of hands", "raise your hand", "who here has", polls, or any line that requires the audience to respond. Assume zero interaction except Q&A at the end. Engagement comes from content, humor, pacing, and rhetorical questions (ones you answer yourself immediately).
12. Tidbits from Additional Info woven into spoken parts (origin stories, fun facts, personal reactions)
12. Varied paragraph structure across slides (not the same intro-list-conclusion pattern)
13. Conversational bridges used ("Here's the thing...", "What's interesting is...")
14. At least one "off-script" human moment per 3-4 slides

Output format:
1) ENGAGEMENT_ISSUES (dead zones, missing humor, awkward transitions)
2) SLIDE_NARRATION_ISSUES (slides that read like bullet-point recitation)
3) VISUAL_ISSUES (layout problems, readability concerns)
4) NORTH_STAR_ALIGNMENT (does the output serve the stated goal?)
5) READY_FOR_DELIVERY (yes/no + reason)

Procedure

Concatenate deliverable content into inline prompts (never reference file paths for critics)
Run Codex and Gemini critics in parallel with per-agent timeouts
Merge feedback, prioritizing consensus items
Apply accepted changes to deliverables
Re-generate PPTX if slide code was modified
Max 2 rounds for deliverable review (lighter than plan review)

Integration with Code Repos

When the deliverables include a code repository (e.g., workshop code), Codex should also verify:

Solidity syntax and pragma version
Import path correctness against the target library structure
Test coverage matches specification
README accuracy against actual project structure
Build configuration (foundry.toml, remappings) is correct

Phase 5: DOCX Presenter Reference

File: `workspace/<project>/generate-docx.js` → `workspace/<project>/<slug>-transcript-v<N>.docx`

Use the docx npm package to generate a formatted Word document.

Data: Shared Parser (Default Mode, Preserves Stage Cues)

const path = require("path");
const { parseTranscript } = require("./transcript-parser");

const transcriptPath = path.join(__dirname, "transcript-v1.md");
const { slides, qaItems } = parseTranscript(transcriptPath);
// slides[i] = { num, title, timing, onScreen, spokenParas, infoBullets, diagrams }
// qaItems[i] = { question, answer }

Design System (DOCX)

const C = {
  purple: "6B46C1",      // Primary heading color
  purpleLight: "E9E0FA",  // Light background
  teal: "0D9488",         // Secondary accent
  tealLight: "CCFBF1",    // Light secondary
  dark: "1E293B",         // Dark text
  body: "475569",         // Body text
  border: "CBD5E1",       // Table borders
  blue: "2563EB",         // Info boxes
  blueLight: "DBEAFE",    // Info box background
  warn: "DC2626",         // Warning color
  green: "059669",        // Success/defense
};

Component Library

Build these reusable functions:

Function	Purpose	Visual
`slideHeader(num, title, timing)`	Purple badge + title + timing	Numbered header bar
`onScreen(text)`	Gray bar describing slide visual	"On screen: ..."
`transcript(lines)`	Indented speaker lines	Bold/italic formatting
`infoBox(title, bullets)`	Blue bordered box	"Additional Info" section
`comparisonTable(left, right)`	Two-column comparison	Colored headers
`flowArrow(steps)`	Numbered vertical flow	Steps with arrows
`diagramTable(items)`	Label + description rows	Colored left column

Key docx-js Rules

Always use WidthType.DXA — never WidthType.PERCENTAGE (breaks in Google Docs)
US Letter: width 12240, height 15840, content width 9360 (with 1" margins)
Table width = sum of columnWidths — both columnWidths array AND cell width must match
Use ShadingType.CLEAR — never SOLID for table shading
Colors must be exactly 6 hex digits — never append opacity suffixes
Never use \n — use separate Paragraph elements
PageBreak must be inside a Paragraph

Validation

python scripts/office/validate.py output.docx
python3 -m markitdown output.docx | head -100

Check that all slides and Additional Info sections are present.

Phase 6: Review & Iterate

User Review Cycle

Present the transcript (markdown) first — it's fastest to review
Generate PPTX only after transcript is approved
Generate DOCX after PPTX is approved (or in parallel)
Each round of feedback creates a new version (v1 → v2 → v3)

Common Feedback Patterns

Feedback	Action
"Too technical"	Replace jargon with analogies, add everyday comparisons
"Too long"	Compress timing, merge slides, cut redundant examples
"Need more detail"	Expand Additional Info sections, add historical examples
"More humor"	Add self-deprecating jokes, funny analogies, unexpected observations
"Wrong fact"	Spawn research sub-task to verify, update transcript + slides
"Add topic X"	Add to existing slide's Additional Info or create new slide
"Timing is off"	Recalculate timing budget, adjust per-slide allocations

Fact Verification Checklist

Before finalizing any version, verify:

All numbers/statistics have a source and date
No outdated claims (check if data is > 6 months old)
Technical claims are accurate (protocol features, supported assets)
Historical events have correct dates and amounts
Disclaimers are present where required (financial, legal, medical)

File Structure

workspace/<project>/
  north-star.md                         # North Star guardrail
  <slug>-plan-v<N>.md                   # Document plan
  <slug>-research.md                    # Research brief (if applicable)
  <slug>-transcript-v<N>.md             # Versioned transcript
  <slug>-presentation-v<N>.pptx         # Final slide deck
  <slug>-transcript-v<N>.docx           # Final presenter reference
  generate-pptx.js                      # PPTX generator script (project-specific)
  generate-docx.js                      # DOCX generator script (project-specific)
  critique-r<N>-*.md                    # Cross-agent review output
  demo-repo/                            # Tutorial/demo code (if applicable)
  skills/
    presentation-builder/SKILL.md       # This skill
    cross-agent-plan-review/SKILL.md    # Content review skill (optional)

Quick Start Template

1. Initialize

mkdir <topic> && cd <topic>
npm init -y
npm install pptxgenjs docx
# Set "type": "commonjs" in package.json

2. Define Parameters

Ask the user for: topic, audience, duration, presenter, slide count, date/event, tone, constraints.

3. Build Slide Outline

Create a table:

| Slide | Title | Time | Content Type |
|-------|-------|------|-------------|
| 1 | Title | 0:00-0:30 | Title |
| 2 | Why This Matters | 0:30-1:45 | Concept |
| ... | ... | ... | ... |
| N | Q&A | XX:XX-end | Resources |

4. Research (parallel sub-tasks)

Spawn fact-checking + engagement content sub-tasks simultaneously.

5. Write Transcript v1

Follow the markdown template (Phase 3). Include all Additional Info sections.

6. Review → v2 → v3

Iterate based on user feedback. Each version is a new file.

7. Generate PPTX

Write workspace/<project>/generate-pptx.js with the slide content. Run node workspace/<project>/generate-pptx.js.

8. Generate DOCX

Write workspace/<project>/generate-docx.js with the presenter reference. Run node workspace/<project>/generate-docx.js.

9. Final Validation

All slides present in all three artifacts
Timing adds up to stated duration
Facts verified
Visual QA on PPTX
Content QA on DOCX

Lessons Learned

These patterns emerged from building the DeFi, Catalyst, and hedera-forking presentations and should be followed:

Content

Analogies are the #1 tool for a non-technical audience. Every concept needs at least one.
"Additional Info" sections save the presenter during Q&A. Make them comprehensive.
Humor works best when self-deprecating or absurdist. Avoid insider jokes the audience won't get.
Bold the insight, not the setup. The punchline of each section should pop visually.
Stage directions should be sparse. Only include key moments: _(pause)_, _(smile)_. Don't over-direct with constant _(gesture left)_, _(point to chart)_ — it makes the transcript feel like a robot script.
The #1 transcript mistake is slide-narration. If the transcript sounds like someone reading bullet points off the screen, it's wrong. The spoken words should ADD to what's on screen — context, stories, tidbits, personal reactions — not repeat it.
Pull tidbits from Additional Info into the spoken part. Each slide should have 1-2 small facts, origin stories, or "did you know" moments woven into the spoken transcript. This is the difference between a boring lecture and an engaging talk.
Vary paragraph structure across slides. Some open with questions, some with stories, some with surprising stats. Monotonous structure = monotonous delivery.
No-redundancy enforcement. Explain each concept once at its natural point in the presentation. Later slides should reference the earlier explanation, not repeat it. For example, explain how htsSetup() works on the architecture slide, then on later slides simply say "the emulation layer we set up earlier" rather than re-explaining the mechanism.

Mermaid Diagrams in Transcripts

Every slide with a visual/flow/architecture component MUST have a Diagrams section with: a bold diagram title, multi-paragraph prose description, and a Mermaid code block. This applies to architecture diagrams, flow charts, comparison diagrams, timelines, and any slide where the [On screen:] description shows a visual structure.
Diagram descriptions are the presenter's reference material, not just captions. Each description should be 2-5 paragraphs explaining every box, arrow, and flow in the diagram. Include what each element represents, why it's there, and anticipated questions ("If someone asks 'why does X happen?' - the answer is...").
Single-line labels only in Mermaid. Never use \n or <br/> in node labels. If a label is too long, shorten it.
Color-code diagram elements using Mermaid style directives: green for success/solved, red for failure/broken, blue for the solution/product, amber/orange for warnings, purple for infrastructure.
Slides that typically need diagrams: "Without vs With" comparisons, architecture overviews, data flow diagrams, deployment pipelines, development cycles, component relationships, and any slide titled "DIAGRAM:" in the plan.
Slides that do NOT need diagrams: title slides, Q&A slides, pure text/quote slides, resource/link slides, and comparison tables (unless the comparison has a flow component).

Additional Info Depth Requirements

Additional Info sections are NOT summaries. They are deep reference material. Each bullet should be 2-4 sentences explaining a concept at implementation level, not just defining it.
Every concept mentioned on the slide or in the transcript gets its own bullet. The depth should match the presentation's audience and topic. Technical presentations: function signatures, API endpoints, protocol internals, error codes, configuration specifics. Strategic presentations: market data with sources and dates, competitive positioning details, regulatory framework specifics, financial model assumptions.
The depth test: A developer/analyst in the audience should be able to look at the Additional Info and either implement the described feature or validate the described claim without further research.
Aim for 8-15 bullets per content slide, each with genuine depth. Title and Q&A slides can have fewer.
Common patterns for deep bullets: "What X is", "How X works internally", "Why X was chosen over alternatives", "What breaks when X fails", "Edge cases with X", "Common question about X with prepared answer".

Demo-Heavy Presentations

Demo transition slides are shown during live demo segments. These slides use timing markers like "during demo" instead of specific timestamps, and serve as visual anchors while the presenter is in a terminal window. They should have minimal text — just the demo topic and a brief description.
Real-world example appendices provide deep-dive reference material for demo-heavy presentations. For example, the hedera-forking presentation included appendix sections for the SaucerSwap mainnet fork test walkthrough and Bonzo Finance lending/borrowing walkthrough. These give the presenter detailed step-by-step reference material for real-world examples shown during demos.
Anticipated audience questions appendix should include 15+ detailed anticipated questions with thorough answers. These serve as Q&A prep and as a reference for the presenter during the Q&A portion. Organize by topic area (e.g., "Technical Questions", "Security Questions", "Ecosystem Questions") with full-length prepared answers, not just bullet points.

Host Q&A Format

For community call format where a host asks guided questions during the presentation, include a Host Questions Q&A Prep section with 6-10 pre-planned host questions and full-length prepared answers. The host may ask these questions at any time during the session, so prepare answers that work regardless of when they're asked. Each answer should be 3-5 sentences — conversational, not scripted — and reference specific slides or concepts from the presentation.

Technical

Write scripts in parts (~200 lines each) and concatenate. Large scripts hit output limits.
Dark PPTX backgrounds + light text = projector-friendly. Avoid light backgrounds.
DOCX color values must be exactly 6 hex digits. Never append opacity (e.g., "05966920" fails).
Use ShadingType.CLEAR not SOLID in docx-js to prevent black backgrounds.
Validate DOCX with both validate.py and markitdown to catch structural and content issues.
PPTX helper functions (card, titleText, addSlideNum) keep slide code DRY. Define them once.

Process

Transcript first, slides second. The words drive the visuals, not the other way around.
Research sub-tasks run in parallel. Don't block on fact-checking — spawn them early.
Never overwrite versions. Keep v1, v2, v3 side by side.
Cross-agent review is overkill for most presentations but valuable for regulatory, financial, or medical topics where accuracy is critical.
The presenter should NOT read word-for-word. The transcript is a guide, not a teleprompter script. Say this explicitly in the document.

Cross-Agent Review Integration (Full Lifecycle)

Cross-agent review applies to the entire presentation lifecycle, not just planning:

Phase 2 — Plan Review (Before Implementation)

Reference skills/review/cross-agent-plan-review/SKILL.md for the full procedure
Treat the slide outline + transcript as the "plan"
Codex: fact verification, code accuracy, URL validation
Gemini: humor, engagement, visual design, analogies
Max 3 rounds. Stop when CRITICAL_GAPS_REMAINING is none.

Phase 4.5 — Deliverable Review (After Implementation)

Review generated transcript, PPTX, and code artifacts
Always reference the North Star to prevent drift from original vision
Codex: verify code correctness, CLI commands, import paths, fact bank alignment
Gemini: verify engagement quality, slide layouts, transcript flow, visual balance
Max 2 rounds. Lighter touch — focus on implementation errors, not redesign.

When to Use

Claude catches structural and logical issues
Codex catches factual errors, outdated references, and code bugs
Gemini catches engagement gaps, visual problems, and delivery issues

Output Rules

Never write files larger than 200 lines in a single response
For large files, write in sequential chunks and continue automatically
If you hit an output token limit, resume exactly where you stopped

Version: 1.2.0

Changelog:

1.2.0 (2026-05-17): New Phase 3.7 - Shared Transcript Parser (workspace/<project>/transcript-parser.js). Both generate-pptx.js (PPTX speaker notes, stripStageDirections: true) and generate-docx.js (presenter reference, default mode) import the same dependency-free parser, eliminating the manual-sync drift mode where transcript edits propagated to DOCX but not to hardcoded PPTX speaker-note strings. Reference parser embedded inline; markdown conventions and sync-divergence discipline documented. Phase 4 PPTX gained "Speaker Notes via Shared Parser (No Hardcoded Strings)" subsection. Phase 5 DOCX gained "Data: Shared Parser" subsection. Proven on the H3 context-engineering talk (workspace/context-engineering-talk/) - DOCX byte-identical pre/post refactor; 20 PPTX speaker notes auto-loaded from transcript with stage directions cleanly stripped.
1.1.0 (2026-05-17): Phase 4 - documented Visual Self-QA degraded mode (XML-grep fallback + explicit NOT-verified callout for machines without soffice); added Diagram Approach subsection (native pptxgenjs shapes default for PPTX deck, Mermaid stays for markdown transcript). Phase 3 - swapped em-dash for hyphen in slide-header template per global no-em-dash rule.
1.0.0: Initial skill from DeFi Under The Hood presentation patterns.

presentation-builder

Presentation Builder Skill

Purpose

When To Use

Prerequisites

Workflow Overview

Phase 1: Define the Presentation

Audience Calibration

Timing Budget

Phase 2: Research

Specialized Critic Roles

Codex — Fact Verification & References

Gemini — Humor, Engagement & Visual Design

Cross-Agent Plan Review

Phase 3: Transcript (Markdown)

File: <topic>-transcriptv<N>.md

Presentation Type Adaptation

Transcript Writing Rules

Natural Delivery Rules (Critical)

Anti-Patterns — NEVER Do These

Positive Patterns — ALWAYS Do These

Timing Markers

Versioning

Phase 3.5: Humanizer Pass (Mandatory)

What the Humanizer Does

Process

When to Skip

Phase 3.7: Shared Transcript Parser (Drift Prevention)

File: workspace/<project>/transcript-parser.js

Parser Template

Two Cleaning Modes (Same Source, Surface-Appropriate Output)

Markdown Conventions the Parser Expects

Sync-Divergence Discipline When Adopting On An Existing Deck

Phase 4: PPTX Slide Deck

File: workspace/<project>/generate-pptx.js → workspace/<project>/<slug>-presentation-v<N>.pptx

Design System

Slide Types

Diagram Approach: Native Shapes Default, Mermaid for Markdown Only

Helper Function Pattern

Speaker Notes via Shared Parser (No Hardcoded Strings)

Execution

Visual Self-QA Checkpoint (Mandatory Before Declaring Build Done)

Phase 4.5: Cross-Agent Deliverable Review (Required)

Why This Phase Exists

North Star Alignment

Critic Roles for Deliverable Review

Codex — Code & Fact Verification

Gemini — Engagement & Visual Quality

Procedure

Integration with Code Repos

Phase 5: DOCX Presenter Reference

File: workspace/<project>/generate-docx.js → workspace/<project>/<slug>-transcript-v<N>.docx

Data: Shared Parser (Default Mode, Preserves Stage Cues)

Design System (DOCX)

Component Library

Key docx-js Rules

Validation

Phase 6: Review & Iterate

User Review Cycle

Common Feedback Patterns

Fact Verification Checklist

File Structure

Quick Start Template

1. Initialize

2. Define Parameters

3. Build Slide Outline

4. Research (parallel sub-tasks)

5. Write Transcript v1

6. Review → v2 → v3

7. Generate PPTX

8. Generate DOCX

9. Final Validation

Lessons Learned

Content

Mermaid Diagrams in Transcripts

Additional Info Depth Requirements

Demo-Heavy Presentations

Host Q&A Format

Technical

Process

File: `<topic>-transcriptv<N>.md`

File: `workspace/<project>/transcript-parser.js`

File: `workspace/<project>/generate-pptx.js` → `workspace/<project>/<slug>-presentation-v<N>.pptx`

File: `workspace/<project>/generate-docx.js` → `workspace/<project>/<slug>-transcript-v<N>.docx`

File: `<topic>-transcriptv<N>.md`

File: `workspace/<project>/transcript-parser.js`

File: `workspace/<project>/generate-pptx.js` → `workspace/<project>/<slug>-presentation-v<N>.pptx`

File: `workspace/<project>/generate-docx.js` → `workspace/<project>/<slug>-transcript-v<N>.docx`