openai-whisper-api

Name: Openai Whisper Api
Author: the-open-agent

// Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

stars: 5,095

forks: 583

updated: May 21, 2026, 16:58

SKILL.md

name	openai-whisper-api
description	Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
homepage	https://platform.openai.com/docs/guides/speech-to-text
metadata	{"emoji":"🌐","requires":{"bins":["curl"],"env":["OPENAI_API_KEY"]},"primaryEnv":"OPENAI_API_KEY"}

OpenAI Whisper API (curl)

Transcribe an audio file via OpenAI's /v1/audio/transcriptions endpoint. Set OPENAI_BASE_URL to use an OpenAI-compatible proxy or local gateway.

Quick start

curl -sS "https://api.openai.com/v1/audio/transcriptions" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -F "file=@/path/to/audio.m4a" \
  -F "model=whisper-1" \
  -F "response_format=text" \
  > transcript.txt

Defaults:

Model: whisper-1
Output format: text
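
The defaults above can be made overridable with shell parameter expansion. The following is a minimal sketch, not part of the skill itself: the variable names WHISPER_MODEL and WHISPER_FORMAT and the helper build_form_args are hypothetical, chosen only for illustration. The helper prints the form fields it would hand to curl, so it can be inspected without making a request.

```shell
# Hypothetical helper: print the form fields for a transcription request.
# WHISPER_MODEL and WHISPER_FORMAT are illustrative names, not defined by the skill.
build_form_args() {
  local audio_file="$1"
  local model="${WHISPER_MODEL:-whisper-1}"   # falls back to the skill's default model
  local format="${WHISPER_FORMAT:-text}"      # falls back to the skill's default output format
  printf '%s\n' "file=@${audio_file}" "model=${model}" "response_format=${format}"
}

# Possible usage in bash (not executed here): turn each printed field into a -F argument.
#   args=(); while IFS= read -r field; do args+=(-F "$field"); done < <(build_form_args audio.m4a)
#   curl -sS "https://api.openai.com/v1/audio/transcriptions" \
#     -H "Authorization: Bearer $OPENAI_API_KEY" "${args[@]}" > transcript.txt
```

Keeping the defaults in one place means a different model or output format can be selected per invocation without editing each command.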

Options

# With language hint
curl -sS "https://api.openai.com/v1/audio/transcriptions" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -F "file=@audio.ogg" \
  -F "model=whisper-1" \
  -F "response_format=text" \
  -F "language=en" \
  > transcript.txt

# With speaker hint (prompt)
curl -sS "https://api.openai.com/v1/audio/transcriptions" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -F "file=@audio.m4a" \
  -F "model=whisper-1" \
  -F "response_format=text" \
  -F "prompt=Speaker names: Peter, Daniel" \
  > transcript.txt

# JSON output
curl -sS "https://api.openai.com/v1/audio/transcriptions" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -F "file=@audio.m4a" \
  -F "model=whisper-1" \
  -F "response_format=json" \
  > transcript.json
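
With response_format=json, the transcript is returned in the text field of the response object. As a small sketch, assuming jq is installed (it is not listed among the skill's required binaries) and using a hypothetical helper name extract_text, the plain transcript could be pulled out of the saved file like this:

```shell
# Hypothetical helper: print only the transcript text from a saved JSON response.
# Assumes jq is available; the skill itself only requires curl.
extract_text() {
  jq -r '.text' "$1"
}

# Possible usage after the JSON output command above:
#   extract_text transcript.json > transcript.txt
```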

Custom base URL

Set OPENAI_BASE_URL to use an OpenAI-compatible proxy or local gateway:

API_BASE="${OPENAI_BASE_URL:-https://api.openai.com/v1}"
curl -sS "${API_BASE}/audio/transcriptions" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -F "file=@audio.m4a" \
  -F "model=whisper-1" \
  -F "response_format=text" \
  > transcript.txt
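
The `${OPENAI_BASE_URL:-...}` expansion falls back to the public endpoint when the variable is unset or empty. A minimal sketch of the same idea wrapped in a function follows; the name transcription_url is hypothetical, and stripping a trailing slash is an added convenience rather than something the skill does.

```shell
# Hypothetical helper: build the transcription endpoint from OPENAI_BASE_URL.
# Falls back to the public API when the variable is unset or empty.
transcription_url() {
  local base="${OPENAI_BASE_URL:-https://api.openai.com/v1}"
  base="${base%/}"   # assumption: drop one trailing slash to avoid "//" in the URL
  printf '%s/audio/transcriptions\n' "$base"
}

# Possible usage:
#   curl -sS "$(transcription_url)" -H "Authorization: Bearer $OPENAI_API_KEY" ...
```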

API key

Set the OPENAI_API_KEY environment variable before running any of the commands.
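
A script that wraps these commands may want to fail early when the key is missing, rather than send an unauthenticated request. One way to sketch that check, using a hypothetical helper name require_api_key:

```shell
# Hypothetical guard: return non-zero with a message if OPENAI_API_KEY is unset or empty.
require_api_key() {
  if [ -z "${OPENAI_API_KEY:-}" ]; then
    echo "OPENAI_API_KEY is not set" >&2
    return 1
  fi
}

# Possible usage at the top of a script:
#   require_api_key || exit 1
```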

related-skills.json

same repository

powerpoint.md

from "the-open-agent/openagent"

Create designed, editable PowerPoint .pptx presentations with PptxGenJS. Use when the user asks to create, generate, update, or inspect a deck, slide deck, presentation, or .pptx file.

2026-05-27 · 5.1k stars

gemini.md

from "the-open-agent/openagent"

Gemini CLI for one-shot Q&A, summaries, and generation.

2026-05-21 · 5.1k stars

himalaya.md

from "the-open-agent/openagent"

CLI to manage emails via IMAP/SMTP. Use `himalaya` to list, read, write, reply, forward, search, and organize emails from the terminal. Supports multiple accounts and message composition with MML (MIME Meta Language).

2026-05-21 · 5.1k stars

notion.md

from "the-open-agent/openagent"

Notion API for creating and managing pages, databases, and blocks.

2026-05-21 · 5.1k stars

taskflow.md

from "the-open-agent/openagent"

Use when work should span one or more detached tasks but still behave like one job with a single owner context. TaskFlow is the durable flow substrate under authoring layers like Lobster, ACPX, plugins, or plain code. Keep conditional logic in the caller; use TaskFlow for flow identity, child-task linkage, waiting state, revision-checked mutations, and user-facing emergence.

2026-05-21 · 5.1k stars

tmux.md

from "the-open-agent/openagent"

Remote-control tmux sessions for interactive CLIs by sending keystrokes and scraping pane output.

2026-05-21 · 5.1k stars

package.json

"author": "the-open-agent"

"repository": "the-open-agent/openagent"

