تشغيل أي مهارة في Manus بنقرة واحدة

asr-timing-review

النجوم٠

التفرعات٠

آخر تحديث١٧ مايو ٢٠٢٦ في ٢٣:٣٦

Use for global recorded-video transcript and timing work: run ASR, inspect timestamped transcript segments, infer intended speech, manually decide visual cue timestamps, and produce timing-cues.json without relying on automatic cue matching alone.

التثبيت

التثبيت باستخدام Codex أو Claude انسخ هذا Prompt والصقه في Codex أو Claude أو مساعد آخر ليراجع صفحة Skill ويثبّتها لك.

تشغيل في Manus

المصدر

blueif16

blueif16/vlog_test

فتح مستودع GitHub عرض مستودعات المنشئ

تنزيل

تشغيل في Manus

مستكشف الملفات

2 ملفات

SKILL.md

readonly

name	asr-timing-review
description	Use for global recorded-video transcript and timing work: run ASR, inspect timestamped transcript segments, infer intended speech, manually decide visual cue timestamps, and produce timing-cues.json without relying on automatic cue matching alone.

ASR Timing Review

ASR is evidence for editorial timing, not the final authority.

Required Loop

Run the local ASR command when available: LESSON_ID=<lesson-id> npm run asr:recording
Read shared/composer/<lesson-id>/asr-transcript.json.
Compare the raw ASR with the intended script, recording context, and audio/video timing.
Write a human-review ledger:
- time range
- raw ASR text
- inferred intended speech
- visual event
- confidence
- review note
Manually write timing-cues.json from the reviewed ledger.
Update render-plan shot starts/durations only after the ledger explains why each cue time is chosen.

Rules

NEVER treat programmatic cue matching as final timing when ASR is weak.
NEVER match only sparse English tech tokens in Chinese recordings.
Use rough script timestamps only as prior expectations, not final timing.
If the ASR is crowded but the idea is inferable, state the inferred intended speech and use the timestamp range.
If the idea is not inferable, mark needs_manual_review and do not call the render final.
Prefer cue anchors that the speaker can say clearly next time, such as 第一条：动画时钟, 第二条：浏览器资产, 警告：不是迁移口号, and 最后：声音驱动画布.

Output Shape

Create or update:

shared/composer/<lesson-id>/asr-human-review.md
shared/composer/<lesson-id>/timing-cues.json
render plan timings that reference timingSource: human-reviewed ASR ledger

المزيد من هذا المستودع

نفس المستودع

apple-keynote-motion

blueif16/vlog_test

Define Apple-Keynote-style motion grammar for educational videos. Use when selecting builds, handoffs, object continuity, stage transitions, framed content motion, typography reveals, or smooth presenter-to-canvas movement for the Remotion/Hyperframes vlog pipeline.

2026-06-120

generated-video-director

blueif16/vlog_test

Specify inserted generated-video clips for the composer pipeline. Use when a transcript section needs a generated visual metaphor, illustrative clip, b-roll, product-style animation, or cinematic insert that must match the global style and neighboring shots.

2026-06-120

narration-slide-division

blueif16/vlog_test

Split content between the voice and the slide so they complement instead of repeat. Use at node 03 (script / cue-plan) AND node 04 (visual / slide copy) when authoring any beat that has both narration and on-screen text — the anti-duplication contract that keeps the voice from reading the slide aloud.

2026-06-120

render-lab-design-system

blueif16/vlog_test

Use when a recorded-video composition should use the Render Lab design system: production-control-room visuals, browser/render/QC instruments, frame-driven Remotion motion, and semantic components instead of generic cards or decorative UI.

2026-06-120

render-qc

blueif16/vlog_test

Quality-check rendered video outputs, Hyperframes atoms, Remotion previews, and composer artifacts for this project. Use after creating or changing any atom, generated clip, render plan, timeline, caption layout, presenter framing, or final MP4.

2026-06-120

shot-brief-writer

blueif16/vlog_test

Convert video-composer decisions into precise shot briefs for sub-agents or sub-nodes. Use when a section needs a self-contained spec containing global style, transcript excerpt, neighboring shots, dependencies, asset requirements, presenter/content layout, and expected output.

2026-06-120

name	asr-timing-review
description	Use for global recorded-video transcript and timing work: run ASR, inspect timestamped transcript segments, infer intended speech, manually decide visual cue timestamps, and produce timing-cues.json without relying on automatic cue matching alone.

ASR Timing Review

ASR is evidence for editorial timing, not the final authority.

Required Loop

Run the local ASR command when available: LESSON_ID=<lesson-id> npm run asr:recording
Read shared/composer/<lesson-id>/asr-transcript.json.
Compare the raw ASR with the intended script, recording context, and audio/video timing.
Write a human-review ledger:
- time range
- raw ASR text
- inferred intended speech
- visual event
- confidence
- review note
Manually write timing-cues.json from the reviewed ledger.
Update render-plan shot starts/durations only after the ledger explains why each cue time is chosen.

Rules

NEVER treat programmatic cue matching as final timing when ASR is weak.
NEVER match only sparse English tech tokens in Chinese recordings.
Use rough script timestamps only as prior expectations, not final timing.
If the ASR is crowded but the idea is inferable, state the inferred intended speech and use the timestamp range.
If the idea is not inferable, mark needs_manual_review and do not call the render final.
Prefer cue anchors that the speaker can say clearly next time, such as 第一条：动画时钟, 第二条：浏览器资产, 警告：不是迁移口号, and 最后：声音驱动画布.

Output Shape

Create or update:

shared/composer/<lesson-id>/asr-human-review.md
shared/composer/<lesson-id>/timing-cues.json
render plan timings that reference timingSource: human-reviewed ASR ledger