Run any Skill in Manus with one click

lesson-verification

Stars0

Forks0

UpdatedJune 9, 2026 at 18:07

Use when checking a completed Remotion education video for lesson accuracy, visual clarity, narration, captions, music, and render readiness.

Installation

Install with Codex or Claude Copy this prompt, paste it into Codex, Claude, or another assistant, and let it review the skill page and install it for you.

Run Skill in Manus

Source

blueif16

blueif16/animate-svg

View GitHub Repository View Creator Repositories

Download

Run Skill in Manus

SKILL.md

readonly

name	lesson-verification
description	Use when checking a completed Remotion education video for lesson accuracy, visual clarity, narration, captions, music, and render readiness.

Lesson Verification

Purpose: Verify the finished lesson before delivery.

Primary review surface — the rendered contact sheet

Load lesson-data/<id>/<lesson-id>-contact.png first and only. This is one image containing the midpoint frame of every cue in narrative order — built automatically by scripts/make-contact-sheet.mjs at the end of lesson:render. The companion <lesson-id>-contact.json maps each tile (row, col, index) to its cue id and midpoint frame.

Why one image, not per-cue PNGs: an MP4 teaches across cues, not within a single frame. Reviewing per-cue PNGs in isolation lets aesthetic-checked-✓-per-frame results hide an arc that doesn't teach. Look at the whole strip and answer the one question that matters:

Would a child who doesn't already know this lesson learn it from this video?

If the answer is yes, the lesson passes the contact-sheet check. If no, name the cue(s) that break the arc — only then drill into per-cue PNGs or scrub the MP4.

Anti-pattern: opening verification-frames/cue-*.png one at a time and writing a per-frame checklist. That's how the visual-bug class survives — every frame "looks fine in isolation" while the lesson fails to teach. Do not do this.

Check artifacts (Wave 4 outputs)

After the contact-sheet review, cross-check the Wave 4 composer outputs:

Read lesson-data/<id>/bbox-manifest.json. The fast linear summary.collisionCount should be 0. Then read the measured block (written by npm run lesson:check -- --config <cfg> --measured): summary.measuredCollisionCount and summary.gatesFailed. The measured pass renders motion-peak frames and reads each element's TRUE getBBox(), so it catches easing-overshoot and between-keyframe overlaps the linear path is blind to — treat a measured-only collision as a real candidate, open the matching out/<id>/measured-frames/f<frame>.png, and rule it a true overlap or a by-design adjacency (a whole-card stacked on its diagram, etc.) IN WRITING. Also read the gate verdicts in measured.gates: lufs (voice integrated loudness vs −16 LUFS / true peak ≤ −1 dBFS), captionRedundancy (caption ≈ narration per cue — flags clutter; literacy/pinyin exempt), contrast (WCAG ≥ 4.5:1), legibility (glyph ≥ 24px), motionFast (WARN-only). Any non-zero collision (linear OR measured) or failed gate must either be fixed (kick back to composer) or carry an explicit written justification in this report — silent acceptance is forbidden; a SKIP: <reason> is acceptable but must be acknowledged.
If lesson-data/<id>/primitive-checks/*.png exist, inspect each. Does the primitive read as the visual-design §5 acceptance criteria specify (e.g. "kid reads 'sticks tied with a bow'")?
Cross-check that every element listed in visual-design §3 Visual Contract has a corresponding SceneElement entry in src/lessons/<camelLessonId>/manifest.ts.

Per-cue text-vs-audio checks (scene strings against the spoken phrase)

A frame can look correct yet show a word the audio never speaks, or hold a learner-response beat as silent dead air. The contact sheet alone won't catch these — walk every cue against lesson-data/<id>/script-cues.json:

On-screen target text == the cue's spoken audio. For each cue, compare every on-screen target STRING in src/lessons/<camelLessonId>LessonScene.tsx (DialogueExchange lines, ReadAlongHighlight word glyphs, name tags) against that cue's spoken phrase in lesson-data/<id>/script-cues.json. The on-screen target strings must be a SUBSET of the cue's spoken phrase, in spoken order. FLAG any on-screen word the cue's audio does not speak — e.g. a bubble that shows a word carried over from an earlier cue, or lifted from the brief, that this cue's audio never speaks. This is a HARD finding mapped to W4a composer.
Learner-response gap is a legible invitation, not dead air. For any cue with gap.reason === "learner-response", confirm the scene holds a READABLE "your turn" affordance during the gap window — a localized label ("你来说" / "Your turn") + a pulse/ring on the read-along row + a speech/mic glyph. A bare low-opacity glow with no label/icon FAILS (it reads as awkward silence, not an invitation to speak). HARD finding mapped to W4a composer.

Sound checks (when the lesson has `audio-cues.json`)

The master loudness target (≈ −16 LUFS / TP ≤ −1 dBFS) is the lufs gate above — do not re-measure it by hand. These are the QUALITATIVE checks that gate can't make; scrub the MP4 with sound on:

Melody NOT identifiable under narration. While any narration plays, the bed must read as warmth, not a tune you could hum. If you can follow the melody, the duck is too shallow → FAIL.
3-point duck. Confirm the arc: intro bed ducks cleanly as the first words start; the bed rises in a mid narration GAP then ducks again; the outro resolves to full as the last narration ends.
No SFX over instruction words, and no SFX louder than narration. Reward/interaction sounds land in gaps or after the line; ta-da fires once. SFX sit below the voice.
Tone lessons (toneSafe): the spoken tone is unmistakable over the bed — no melodic motif competes with the lexical pitch.

Verdict

The final verdict (GREEN / YELLOW / RED) cites:

contact-sheet teach test — does the arc teach the KP? (single paragraph)
bbox-manifest collision count + any justified collisions
text-vs-audio checks — on-screen target strings ⊆ each cue's spoken phrase; learner-response gaps hold a legible "your turn" affordance
primitive-check observations (per redesigned primitive)
sound checks (if audio-cues.json present): melody-under-narration, 3-point duck, SFX discipline, lufs gate verdict
pedagogy + pacing checks against audio-captions.md and visual-design.md

The contact sheet is the canonical review surface. The bbox/primitive artifacts complement it.

Lesson Verification

Purpose: Verify the finished lesson before delivery.

Primary review surface — the rendered contact sheet

Would a child who doesn't already know this lesson learn it from this video?

If the answer is yes, the lesson passes the contact-sheet check. If no, name the cue(s) that break the arc — only then drill into per-cue PNGs or scrub the MP4.

Check artifacts (Wave 4 outputs)

After the contact-sheet review, cross-check the Wave 4 composer outputs:

Read lesson-data/<id>/bbox-manifest.json. The fast linear summary.collisionCount should be 0. Then read the measured block (written by npm run lesson:check -- --config <cfg> --measured): summary.measuredCollisionCount and summary.gatesFailed. The measured pass renders motion-peak frames and reads each element's TRUE getBBox(), so it catches easing-overshoot and between-keyframe overlaps the linear path is blind to — treat a measured-only collision as a real candidate, open the matching out/<id>/measured-frames/f<frame>.png, and rule it a true overlap or a by-design adjacency (a whole-card stacked on its diagram, etc.) IN WRITING. Also read the gate verdicts in measured.gates: lufs (voice integrated loudness vs −16 LUFS / true peak ≤ −1 dBFS), captionRedundancy (caption ≈ narration per cue — flags clutter; literacy/pinyin exempt), contrast (WCAG ≥ 4.5:1), legibility (glyph ≥ 24px), motionFast (WARN-only). Any non-zero collision (linear OR measured) or failed gate must either be fixed (kick back to composer) or carry an explicit written justification in this report — silent acceptance is forbidden; a SKIP: <reason> is acceptable but must be acknowledged.
If lesson-data/<id>/primitive-checks/*.png exist, inspect each. Does the primitive read as the visual-design §5 acceptance criteria specify (e.g. "kid reads 'sticks tied with a bow'")?
Cross-check that every element listed in visual-design §3 Visual Contract has a corresponding SceneElement entry in src/lessons/<camelLessonId>/manifest.ts.

Per-cue text-vs-audio checks (scene strings against the spoken phrase)

On-screen target text == the cue's spoken audio. For each cue, compare every on-screen target STRING in src/lessons/<camelLessonId>LessonScene.tsx (DialogueExchange lines, ReadAlongHighlight word glyphs, name tags) against that cue's spoken phrase in lesson-data/<id>/script-cues.json. The on-screen target strings must be a SUBSET of the cue's spoken phrase, in spoken order. FLAG any on-screen word the cue's audio does not speak — e.g. a bubble that shows a word carried over from an earlier cue, or lifted from the brief, that this cue's audio never speaks. This is a HARD finding mapped to W4a composer.
Learner-response gap is a legible invitation, not dead air. For any cue with gap.reason === "learner-response", confirm the scene holds a READABLE "your turn" affordance during the gap window — a localized label ("你来说" / "Your turn") + a pulse/ring on the read-along row + a speech/mic glyph. A bare low-opacity glow with no label/icon FAILS (it reads as awkward silence, not an invitation to speak). HARD finding mapped to W4a composer.

Sound checks (when the lesson has `audio-cues.json`)

Melody NOT identifiable under narration. While any narration plays, the bed must read as warmth, not a tune you could hum. If you can follow the melody, the duck is too shallow → FAIL.
3-point duck. Confirm the arc: intro bed ducks cleanly as the first words start; the bed rises in a mid narration GAP then ducks again; the outro resolves to full as the last narration ends.
No SFX over instruction words, and no SFX louder than narration. Reward/interaction sounds land in gaps or after the line; ta-da fires once. SFX sit below the voice.
Tone lessons (toneSafe): the spoken tone is unmistakable over the bed — no melodic motif competes with the lexical pitch.

Verdict

The final verdict (GREEN / YELLOW / RED) cites:

contact-sheet teach test — does the arc teach the KP? (single paragraph)
bbox-manifest collision count + any justified collisions
text-vs-audio checks — on-screen target strings ⊆ each cue's spoken phrase; learner-response gaps hold a legible "your turn" affordance
primitive-check observations (per redesigned primitive)
sound checks (if audio-cues.json present): melody-under-narration, 3-point duck, SFX discipline, lufs gate verdict
pedagogy + pacing checks against audio-captions.md and visual-design.md

The contact sheet is the canonical review surface. The bbox/primitive artifacts complement it.

lesson-verification

Lesson Verification

Primary review surface — the rendered contact sheet

Check artifacts (Wave 4 outputs)

Per-cue text-vs-audio checks (scene strings against the spoken phrase)

Sound checks (when the lesson has audio-cues.json)

Verdict

More from this repository

More from this repository

Lesson Verification

Primary review surface — the rendered contact sheet

Check artifacts (Wave 4 outputs)

Per-cue text-vs-audio checks (scene strings against the spoken phrase)

Sound checks (when the lesson has audio-cues.json)

Verdict

Sound checks (when the lesson has `audio-cues.json`)

Sound checks (when the lesson has `audio-cues.json`)