mit einem Klick
openai-whisper-api
// Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
// Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Create designed, editable PowerPoint .pptx presentations with PptxGenJS. Use when the user asks to create, generate, update, or inspect a deck, slide deck, presentation, or .pptx file.
Gemini CLI for one-shot Q&A, summaries, and generation.
CLI to manage emails via IMAP/SMTP. Use `himalaya` to list, read, write, reply, forward, search, and organize emails from the terminal. Supports multiple accounts and message composition with MML (MIME Meta Language).
Notion API for creating and managing pages, databases, and blocks.
Use when work should span one or more detached tasks but still behave like one job with a single owner context. TaskFlow is the durable flow substrate under authoring layers like Lobster, ACPX, plugins, or plain code. Keep conditional logic in the caller; use TaskFlow for flow identity, child-task linkage, waiting state, revision-checked mutations, and user-facing emergence.
Remote-control tmux sessions for interactive CLIs by sending keystrokes and scraping pane output.
| name | openai-whisper-api |
| description | Transcribe audio via OpenAI Audio Transcriptions API (Whisper). |
| homepage | https://platform.openai.com/docs/guides/speech-to-text |
| metadata | {"emoji":"🌐","requires":{"bins":["curl"],"env":["OPENAI_API_KEY"]},"primaryEnv":"OPENAI_API_KEY"} |
Transcribe an audio file via OpenAI's /v1/audio/transcriptions endpoint. Set OPENAI_BASE_URL to use an OpenAI-compatible proxy or local gateway.
curl -sS "https://api.openai.com/v1/audio/transcriptions" \
-H "Authorization: Bearer $OPENAI_API_KEY" \
-F "file=@/path/to/audio.m4a" \
-F "model=whisper-1" \
-F "response_format=text" \
> transcript.txt
Defaults:
whisper-1text# With language hint
curl -sS "https://api.openai.com/v1/audio/transcriptions" \
-H "Authorization: Bearer $OPENAI_API_KEY" \
-F "file=@audio.ogg" \
-F "model=whisper-1" \
-F "response_format=text" \
-F "language=en" \
> transcript.txt
# With speaker hint (prompt)
curl -sS "https://api.openai.com/v1/audio/transcriptions" \
-H "Authorization: Bearer $OPENAI_API_KEY" \
-F "file=@audio.m4a" \
-F "model=whisper-1" \
-F "response_format=text" \
-F "prompt=Speaker names: Peter, Daniel" \
> transcript.txt
# JSON output
curl -sS "https://api.openai.com/v1/audio/transcriptions" \
-H "Authorization: Bearer $OPENAI_API_KEY" \
-F "file=@audio.m4a" \
-F "model=whisper-1" \
-F "response_format=json" \
> transcript.json
Set OPENAI_BASE_URL to use an OpenAI-compatible proxy or local gateway:
API_BASE="${OPENAI_BASE_URL:-https://api.openai.com/v1}"
curl -sS "${API_BASE}/audio/transcriptions" \
-H "Authorization: Bearer $OPENAI_API_KEY" \
-F "file=@audio.m4a" \
-F "model=whisper-1" \
-F "response_format=text" \
> transcript.txt
Set OPENAI_API_KEY environment variable before running commands.