بنقرة واحدة
mlx-audio-server
Local 24x7 OpenAI-compatible API server for STT/TTS, powered by MLX on your Mac.
التثبيت باستخدام Codex أو Claude انسخ هذا Prompt والصقه في Codex أو Claude أو مساعد آخر ليراجع صفحة Skill ويثبّتها لك.
القائمة
Local 24x7 OpenAI-compatible API server for STT/TTS, powered by MLX on your Mac.
التثبيت باستخدام Codex أو Claude انسخ هذا Prompt والصقه في Codex أو Claude أو مساعد آخر ليراجع صفحة Skill ويثبّتها لك.
استنادا إلى تصنيف SOC المهني
Convert URL to PDF suitable for mobile reading.
Provision and reuse a global uv environment for ad hoc Python scripts.
Extract a HackerNews post (article + comments) into single clean Markdown for quick reading or LLM input.
Extract content from a GitHub url.
Convert URL to PNG suitable for mobile reading.
Speech-To-Text with MLX (Apple Silicon) and opensource models (default GLM-ASR-Nano-2512) locally.
| name | mlx-audio-server |
| description | Local 24x7 OpenAI-compatible API server for STT/TTS, powered by MLX on your Mac. |
| metadata | {"openclaw":{"always":false,"emoji":"🦞","homepage":"https://github.com/guoqiao/skills/blob/main/mlx-audio-server/mlx-audio-server/SKILL.md","os":["darwin"],"requires":{"bins":["brew"]}}} |
Local 24x7 OpenAI-compatible API server for STT/TTS, powered by MLX on your Mac.
mlx-audio: The best audio processing library built on Apple's MLX framework, providing fast and efficient text-to-speech (TTS), speech-to-text (STT), and speech-to-speech (STS) on Apple Silicon.
guoqiao/tap/mlx-audio-server: Homebrew Formula to install mlx-audio with brew, and run mlx_audio.server as a LaunchAgent service on macOS.
mlx: macOS with Apple Siliconbrew: used to install deps if not availablebash ${baseDir}/install.sh
This script will:
mlx-audio-server from guoqiao/tapmlx-audio-serverSTT/Speech-To-Text(default model: mlx-community/glm-asr-nano-2512-8bit):
# input will be converted to wav with ffmpeg, if not yet.
# output will be transcript text only.
bash ${baseDir}/run_stt.sh <audio_or_video_path>
TTS/Text-To-Speech(default model: mlx-community/Qwen3-TTS-12Hz-1.7B-VoiceDesign-bf16):
# audio will be saved into a tmp dir, with default name `speech.wav`, and print to stdout.
bash ${baseDir}/run_tts.sh "Hello, Human!"
# or you can specify a output dir
bash ${baseDir}/run_tts.sh "Hello, Human!" ./output
# output will be audio path only.
You can use both scripts directly, or as example/reference.