Skip to main content
Manusで任意のスキルを実行
ワンクリックで
$pwd:

doubao-multimodal

// Async audio/video understanding with Doubao-Seed (火山方舟 Ark). Handles ASR (plain / per-character timestamps / multispeaker), AST translation, speaker diarization, subtitle alignment, audio-visual captioning, video timeline JSON, technical-blog keyframe selection (with optional transcript hint), and free-form prompts over an audio or video file. Auto-uploads local files to Volcano TOS and auto-downloads remote URLs. Auto-splits videos > 20 min (≤ 50 MB/segment) and audio > 120 min (≤ 50 MB/segment) and merges results. Use when the user wants to transcribe, translate, diarize, caption, pick blog illustration frames, or ask questions about an audio/video file or URL.

$ git log --oneline --stat
stars:26
forks:8
updated:2026年5月6日 14:21
ファイルエクスプローラー
21 ファイル
SKILL.md
readonly