Skip to main content
Execute qualquer Skill no Manus
com um clique
$pwd:

audio-transcribe

// This skill should be used when the user explicitly asks to "transcribe a meeting", "transcribe audio", "transcribe a meeting recording", "convert audio to text", "generate meeting minutes from audio", "do speech-to-text", "transcribe with speaker diarization", "identify speakers in audio", "transcribe Chinese audio", "transcribe English audio", "transcribe Japanese audio", "multi-speaker transcription", "transcribe a podcast", "transcribe podcast episode", "transcribe an interview", "convert podcast to text", "podcast to transcript", or mentions FunASR, Paraformer, SenseVoice, Whisper, MiMo, MiMo-V2.5-ASR, meeting transcription, podcast transcription, or speaker diarization. Supports multi-speaker meeting and podcast transcription in Chinese, English, Japanese, Korean, Cantonese, and 99 languages (via Whisper), plus Xiaomi MiMo-V2.5-ASR (8B, local GPU) for stronger proper-noun and code-switching accuracy. Automatic speaker diarization via CAM++, hotword biasing (FunASR path), LLM cleanup. FunASR works on GPU

$ git log --oneline --stat
stars:2
forks:2
updated:1 de maio de 2026 às 15:37
Explorador de arquivos
12 arquivos
SKILL.md
readonly