mlx-audio-server

Last updated: February 11, 2026 at 22:07

Local 24x7 OpenAI-compatible API server for STT/TTS, powered by MLX on your Mac.

Source: guoqiao/skills

SKILL.md


name	mlx-audio-server
description	Local 24x7 OpenAI-compatible API server for STT/TTS, powered by MLX on your Mac.
metadata	{"openclaw":{"always":false,"emoji":"🦞","homepage":"https://github.com/guoqiao/skills/blob/main/mlx-audio-server/mlx-audio-server/SKILL.md","os":["darwin"],"requires":{"bins":["brew"]}}}

MLX Audio Server

Local 24x7 OpenAI-compatible API server for STT/TTS, powered by MLX on your Mac.

mlx-audio: an audio processing library built on Apple's MLX framework, providing text-to-speech (TTS), speech-to-text (STT), and speech-to-speech (STS) on Apple Silicon.

guoqiao/tap/mlx-audio-server: a Homebrew formula that installs mlx-audio with brew and runs mlx_audio.server as a LaunchAgent service on macOS.

Requirements

mlx: requires macOS on Apple Silicon
brew: used to install any missing dependencies

Installation

bash ${baseDir}/install.sh

This script will:

install ffmpeg and jq with brew if they are missing
install the Homebrew formula mlx-audio-server from guoqiao/tap
start the brew service for mlx-audio-server
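
The three steps above can be sketched as follows. This is an illustration under stated assumptions, not the contents of the actual install.sh: the `run` helper, the `install_mlx_audio_server` function, and the `DRY_RUN` variable are hypothetical names introduced here, while the `brew install` and `brew services start` commands are standard Homebrew usage.

```shell
#!/usr/bin/env bash
# Sketch only: approximates the steps listed above, not the real install.sh.
set -euo pipefail

# Hypothetical helper: run a command, or just print it when DRY_RUN=1.
run() {
  if [ "${DRY_RUN:-0}" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

# Hypothetical function name; mirrors the three steps in the list above.
install_mlx_audio_server() {
  local bin
  # Step 1: install ffmpeg and jq only when they are not already on PATH.
  for bin in ffmpeg jq; do
    if ! command -v "$bin" >/dev/null 2>&1; then
      run brew install "$bin"
    fi
  done
  # Step 2: install the formula; the user/tap/formula form taps guoqiao/tap.
  run brew install guoqiao/tap/mlx-audio-server
  # Step 3: start the service so it keeps running in the background.
  run brew services start mlx-audio-server
}
```

Calling `DRY_RUN=1 install_mlx_audio_server` prints the commands without changing the system, which is a convenient way to check what would be installed first.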

Usage

STT (Speech-To-Text), default model mlx-community/glm-asr-nano-2512-8bit:

# Input is converted to WAV with ffmpeg if it is not already WAV.
# Output is the transcript text only.
bash ${baseDir}/run_stt.sh <audio_or_video_path>
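
Because the server is described as OpenAI-compatible, a transcription request could plausibly be made by hand as sketched below. This is not the actual run_stt.sh. Several things are assumptions: the base URL `http://localhost:8000`, the OpenAI-style path `/v1/audio/transcriptions`, and a JSON response with a `text` field. The `MLX_AUDIO_BASE_URL` variable and the `wav_path_for` and `transcribe` functions are hypothetical names.

```shell
#!/usr/bin/env bash
# Sketch only: an OpenAI-style transcription call, not the real run_stt.sh.
set -euo pipefail

# Assumed address of the local service; adjust to your setup.
BASE_URL="${MLX_AUDIO_BASE_URL:-http://localhost:8000}"
STT_MODEL="${STT_MODEL:-mlx-community/glm-asr-nano-2512-8bit}"

# Print the WAV path to upload: the input itself if it is already .wav,
# otherwise a path in the temp directory with the same base name.
wav_path_for() {
  local input="$1"
  case "$input" in
    *.wav|*.WAV) echo "$input" ;;
    *) echo "${TMPDIR:-/tmp}/$(basename "${input%.*}").wav" ;;
  esac
}

# Convert to WAV when needed, upload the file, and print the transcript.
transcribe() {
  local input="$1" wav
  wav="$(wav_path_for "$input")"
  if [ "$wav" != "$input" ]; then
    ffmpeg -y -loglevel error -i "$input" "$wav"
  fi
  # Assumed endpoint and response shape, following the OpenAI audio API.
  curl -sS --fail "$BASE_URL/v1/audio/transcriptions" \
    -F "file=@$wav" \
    -F "model=$STT_MODEL" | jq -r '.text'
}
```

With the service running, `transcribe ./meeting.mp4` would print the transcript, provided the assumptions above hold for your installation.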

TTS (Text-To-Speech), default model mlx-community/Qwen3-TTS-12Hz-1.7B-VoiceDesign-bf16:

# Audio is saved to a temporary directory as `speech.wav` by default.
bash ${baseDir}/run_tts.sh "Hello, Human!"
# Or specify an output directory:
bash ${baseDir}/run_tts.sh "Hello, Human!" ./output
# In both cases, only the path of the audio file is printed to stdout.
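
A speech request against the same server might look like the sketch below. It is not the actual run_tts.sh. It assumes the OpenAI-style path `/v1/audio/speech`, the base URL `http://localhost:8000`, and that the server honors a `response_format` of `wav`; none of these are confirmed by this document. The `speech_path_for` and `synthesize` functions and the `MLX_AUDIO_BASE_URL` variable are hypothetical names.

```shell
#!/usr/bin/env bash
# Sketch only: an OpenAI-style speech call, not the real run_tts.sh.
set -euo pipefail

# Assumed address of the local service; adjust to your setup.
BASE_URL="${MLX_AUDIO_BASE_URL:-http://localhost:8000}"
TTS_MODEL="${TTS_MODEL:-mlx-community/Qwen3-TTS-12Hz-1.7B-VoiceDesign-bf16}"

# Print the output file path for a given output directory.
speech_path_for() {
  local out_dir="${1%/}"   # drop one trailing slash, if any
  echo "$out_dir/speech.wav"
}

# Synthesize text to speech.wav and print only the resulting path.
synthesize() {
  local text="$1" out_dir="${2:-}" out_file payload
  if [ -z "$out_dir" ]; then
    out_dir="$(mktemp -d)"   # default: a fresh temporary directory
  fi
  mkdir -p "$out_dir"
  out_file="$(speech_path_for "$out_dir")"
  # Build the JSON body with jq so the text is escaped correctly.
  payload="$(jq -n --arg model "$TTS_MODEL" --arg input "$text" \
    '{model: $model, input: $input, response_format: "wav"}')"
  # Assumed endpoint, following the OpenAI audio API.
  curl -sS --fail "$BASE_URL/v1/audio/speech" \
    -H "Content-Type: application/json" \
    -d "$payload" \
    -o "$out_file"
  echo "$out_file"
}
```

For example, `synthesize "Hello, Human!" ./output` would write `./output/speech.wav` and print that path, mirroring the behaviour described for run_tts.sh.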

You can run both scripts directly or use them as a reference for your own.

