elevenlabs
Text-to-speech, sound effects, music generation, and audio processing using the ElevenLabs API via the el CLI. Use for voice generation, audio content creation, sound design, or audio processing.
| name | elevenlabs |
| description | Text-to-speech, sound effects, music generation, and audio processing using the ElevenLabs API via the el CLI. Use for voice generation, audio content creation, sound design, or audio processing. |
| argument-hint | [tts|sfx|music|voices] [text or options] |
| allowed-tools | ["Bash","Read"] |
EL_CLI: python3 ./scripts/el.py  # Path to the el CLI wrapper
DEFAULT_VOICE: Aa6nEBJJMKJwJkCx8VU2  # Quentin: calm, male, narrator. Override with --voice
DEFAULT_MODEL: eleven_multilingual_v2  # Options: eleven_v3, eleven_multilingual_v2, eleven_flash_v2_5
OUTPUT_DIR: ./audio  # Where generated audio files are saved
OUTPUT_FORMAT: mp3_44100_128  # Options: mp3_44100_128, wav_44100, pcm_44100, opus_48000_128
reference/commands.md

Check Prerequisites
- If which python3 fails → report "python3 not found" and stop (no pip packages needed; stdlib only)
- If the el.py script at the EL_CLI path is not found → report "el CLI missing" and stop
- Run <EL_CLI> models as a lightweight auth check. If it fails with "ELEVENLABS_API_KEY not set", stop and tell the user:
ELEVENLABS_API_KEY is not configured. Set it in one of:
- Project-level: add ELEVENLABS_API_KEY=your-key to ./.env
- Global (recommended for personal use): add ELEVENLABS_API_KEY=your-key to ~/.claude/.env
- Shell: export ELEVENLABS_API_KEY=your-key

Get your API key at: https://elevenlabs.io/app/settings/api-keys
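
The three key locations listed above can be checked with a short lookup. The following is a minimal sketch only: the helper names `parse_dotenv` and `find_api_key` are hypothetical, and the precedence order (shell environment first, then ./.env, then ~/.claude/.env) is an assumption, since the source does not say how el.py itself resolves the key.

```python
import os
from pathlib import Path


def parse_dotenv(text):
    """Parse simple KEY=value lines; blank lines and # comments are skipped."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def find_api_key(environ=None, dotenv_paths=None):
    """Return the first ELEVENLABS_API_KEY found, or None if none is configured."""
    # Assumed precedence: shell environment, then ./.env, then ~/.claude/.env.
    environ = os.environ if environ is None else environ
    if dotenv_paths is None:
        dotenv_paths = [Path(".env"), Path.home() / ".claude" / ".env"]
    key = environ.get("ELEVENLABS_API_KEY")
    if key:
        return key
    for path in dotenv_paths:
        path = Path(path)
        if path.is_file():
            found = parse_dotenv(path.read_text(encoding="utf-8"))
            if found.get("ELEVENLABS_API_KEY"):
                return found["ELEVENLABS_API_KEY"]
    return None
```

If `find_api_key()` returns None, that corresponds to the "ELEVENLABS_API_KEY is not configured" case, and the message above is what the user should see.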
Discover Voices
<EL_CLI> voices --search "calm female narrator" --limit 5
<EL_CLI> voices --category cloned --limit 10
<EL_CLI> voice <voice_id>
<EL_CLI> voices [--search <query>] [--category <cat>] [--limit <n>]

Generate Speech (TTS)
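
As an illustration of how a tts call in this section might be assembled from the configuration values near the top of this document, here is a minimal sketch. The function `build_tts_command` is hypothetical and not part of el.py. It only builds the argument list, using the flags this document names (--voice, --model, --out, --stability, --similarity, --style, --speed), and falls back to DEFAULT_VOICE and DEFAULT_MODEL when none is given.

```python
import shlex

# Values copied from the configuration block in this document.
EL_CLI = "python3 ./scripts/el.py"
DEFAULT_VOICE = "Aa6nEBJJMKJwJkCx8VU2"
DEFAULT_MODEL = "eleven_multilingual_v2"

# Optional voice-setting flags named in the tts syntax line.
VOICE_SETTINGS = ("stability", "similarity", "style", "speed")


def build_tts_command(text, voice=None, model=None, out=None, **settings):
    """Return the argv list for a tts call, filling in configured defaults."""
    unknown = set(settings) - set(VOICE_SETTINGS)
    if unknown:
        raise ValueError(f"unsupported settings: {sorted(unknown)}")
    argv = shlex.split(EL_CLI) + ["tts", text]
    argv += ["--voice", voice or DEFAULT_VOICE]
    argv += ["--model", model or DEFAULT_MODEL]
    if out:
        argv += ["--out", out]
    for name in VOICE_SETTINGS:
        if settings.get(name) is not None:
            argv += [f"--{name}", str(settings[name])]
    return argv
```

The resulting list can be passed to `subprocess.run` directly, which avoids shell quoting problems when the text contains quotes or other special characters.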
- No --voice given → use DEFAULT_VOICE
- No --model given → use DEFAULT_MODEL
<EL_CLI> voices --search "calm male narrator" --limit 3
Optional voice settings: --stability, --similarity, --style, --speed
<EL_CLI> tts "Welcome to the show." --voice Aa6nEBJJMKJwJkCx8VU2 --out welcome.mp3
<EL_CLI> tts "Hello world" --model eleven_v3 --out hello.mp3
<EL_CLI> tts <text> --voice <id> [--model <id>] [--out <path>] [--stability n] [--similarity n] [--style n] [--speed n]

Generate Sound Effects
<EL_CLI> sfx "thunder rolling across mountains" --duration 5 --out thunder.mp3
<EL_CLI> sfx <description> [--duration <secs>] [--out <path>]

Generate Music
--instrumental
<EL_CLI> music "upbeat jazz intro for a podcast" --duration 15 --out intro.mp3
<EL_CLI> music "ambient lo-fi beat" --duration 60 --instrumental --out lofi.mp3
<EL_CLI> music <prompt> [--duration <secs>] [--instrumental] [--out <path>]

Process Audio
<EL_CLI> isolate noisy-recording.mp3 --out clean-voice.mp3
<EL_CLI> stems song.mp3 --variation six --out stems.zip
--variation: two (vocals + accompaniment) or six (vocals, drums, bass, guitar, piano, other)
<EL_CLI> isolate interview.mp3 --out clean.mp3

Check History & Models
<EL_CLI> history --limit 10
<EL_CLI> models
Add --json to any command for raw JSON output
<EL_CLI> history --voice Aa6nEBJJMKJwJkCx8VU2 --limit 5
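
When the --json output needs to be consumed by a script, a thin wrapper can append the flag and parse the result. This is a sketch under assumptions: the helper `run_json` is hypothetical, it assumes the JSON is written to stdout, and it makes no assumption about the shape of the returned data. The `runner` parameter exists only so the call can be replaced in tests.

```python
import json
import shlex
import subprocess

# Value copied from the configuration block in this document.
EL_CLI = "python3 ./scripts/el.py"


def run_json(args, runner=subprocess.run):
    """Run an el command with --json appended and return the parsed output."""
    argv = shlex.split(EL_CLI) + list(args) + ["--json"]
    # check=True raises CalledProcessError if the CLI exits with a non-zero status.
    result = runner(argv, capture_output=True, text=True, check=True)
    return json.loads(result.stdout)
```

For example, `run_json(["history", "--limit", "5"])` would run the history command shown above with --json added.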