whisper-audio-transcriber

Updated April 13, 2026 at 17:52

Use when you need to transcribe a local audio file with whisper-cpp. The skill creates a timestamped folder next to the source audio, copies the original file there, converts it to a Whisper-friendly WAV, and writes a plain-text transcript. Default language is Italian; optionally force English.

Source: mameli/dotfiles (GitHub)

SKILL.md

name	whisper-audio-transcriber
description	Use when you need to transcribe a local audio file with whisper-cpp. The skill creates a timestamped folder next to the source audio, copies the original file there, converts it to a Whisper-friendly WAV, and writes a plain-text transcript. Default language is Italian; optionally force English.

Whisper Audio Transcriber

Use this skill for one-off local transcriptions with whisper-cli and ffmpeg.

Default language is Italian. Use English only when the source audio is clearly in English.

Workflow

Confirm the input audio path.
Run scripts/transcribe_audio.sh with the audio path.
By default, do not pass a language flag, so the script uses Italian.
If the user says the audio is in English, pass --language en.
Return the output folder path and the transcript path.
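The workflow above can be sketched as a small wrapper. This is only an illustration: run_transcription is a hypothetical helper name, the script path is the one shown in the Command section, and the input check is an assumption about what "confirm the input audio path" means in practice.

```shell
# Script path as given in the Command section; override SCRIPT to point elsewhere.
SCRIPT="${SCRIPT:-/Users/mameli/.codex/skills/whisper-audio-transcriber/scripts/transcribe_audio.sh}"

# run_transcription AUDIO [LANG]
# Hypothetical helper: confirms the input exists, then calls the script.
# With no LANG, no language flag is passed, so the script's Italian default applies.
run_transcription() {
  audio="$1"
  lang="${2:-}"
  if [ ! -f "$audio" ]; then
    echo "input audio not found: $audio" >&2
    return 1
  fi
  if [ "$lang" = "en" ]; then
    # The user said the audio is in English.
    "$SCRIPT" --language en "$audio"
  else
    "$SCRIPT" "$audio"
  fi
}
```

Calling run_transcription /absolute/path/to/file.m4a en would forward --language en; omitting the second argument leaves the language choice to the script.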

Command

Italian by default:

/Users/mameli/.codex/skills/whisper-audio-transcriber/scripts/transcribe_audio.sh /absolute/path/to/file.m4a

Force English:

/Users/mameli/.codex/skills/whisper-audio-transcriber/scripts/transcribe_audio.sh --language en /absolute/path/to/file.m4a

Optional model override:

/Users/mameli/.codex/skills/whisper-audio-transcriber/scripts/transcribe_audio.sh --model /Users/mameli/Ai_models/ggml-medium.bin /absolute/path/to/file.m4a

Output Layout

The script creates a folder next to the source file with this naming pattern:

YYYY-MM-DD_HH-MM_file-name/

Inside it, the script writes:

the original audio file
*_whisper.wav converted to mono, 16 kHz, pcm_s16le
*_transcript.txt as plain text
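A minimal sketch of how such a layout could be produced follows. It is not the actual transcribe_audio.sh, whose contents are not shown here. The function names are hypothetical, and the sketch assumes that the folder and file prefixes use the source name without its extension. The ffmpeg options (-ac 1, -ar 16000, -c:a pcm_s16le) and the whisper-cli options (-m, -l, -f, -otxt, -of) are standard flags of those tools.

```shell
# output_dir AUDIO STAMP
# Prints the folder path next to the source file: <dir>/<STAMP>_<name>
# Assumption: <name> is the file name without its extension.
output_dir() {
  audio="$1"
  stamp="$2"
  dir=$(dirname "$audio")
  base=$(basename "$audio")
  name="${base%.*}"
  printf '%s/%s_%s\n' "$dir" "$stamp" "$name"
}

# convert_and_transcribe AUDIO MODEL [LANG]
# Hypothetical end-to-end sketch; requires ffmpeg and whisper-cli on PATH.
convert_and_transcribe() {
  audio="$1"
  model="$2"
  lang="${3:-it}"                       # Italian unless told otherwise
  base=$(basename "$audio")
  name="${base%.*}"
  out=$(output_dir "$audio" "$(date +%Y-%m-%d_%H-%M)")

  mkdir -p "$out"
  cp "$audio" "$out/"                   # keep the original audio file

  # Mono, 16 kHz, 16-bit PCM WAV.
  ffmpeg -i "$audio" -ac 1 -ar 16000 -c:a pcm_s16le "$out/${name}_whisper.wav"

  # -otxt writes plain text; -of sets the output path without extension.
  whisper-cli -m "$model" -l "$lang" -f "$out/${name}_whisper.wav" \
    -otxt -of "$out/${name}_transcript"

  echo "$out"
}
```

Passing the timestamp into output_dir as an argument keeps the naming logic separate from the clock, which makes it easy to check on its own.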

Requirements

ffmpeg
whisper-cli
a local GGML model, defaulting to /Users/mameli/Ai_models/ggml-medium.bin when present

If the default model is missing, pass --model.
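Before running a transcription, the requirements can be checked with a short preflight. The sketch below is an assumption about how such a check might look, not part of the skill itself; missing_tools and resolve_model are hypothetical names, and the default model path is the one listed above.

```shell
# Default model path as listed in the requirements.
DEFAULT_MODEL="/Users/mameli/Ai_models/ggml-medium.bin"

# missing_tools TOOL...
# Prints the name of every tool that is not found on PATH.
missing_tools() {
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || echo "$tool"
  done
}

# resolve_model [OVERRIDE]
# Prints the model path to use, or fails if the file does not exist.
resolve_model() {
  model="${1:-$DEFAULT_MODEL}"
  if [ -f "$model" ]; then
    echo "$model"
  else
    echo "model not found: $model (pass --model with a valid path)" >&2
    return 1
  fi
}
```

For example, missing_tools ffmpeg whisper-cli prints nothing when both tools are installed, and resolve_model with no argument succeeds only when the default model file is present.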

Notes

Keep the transcript as plain text only. Do not request subtitle output unless the user asks.
If the input audio is already WAV, still place all generated files inside the timestamped folder.
If language is not specified by the user, use Italian.
