بنقرة واحدة
بنقرة واحدة
Control a browser via Chrome DevTools Protocol (CDP) proxy. Use when: (1) navigating to URLs and reading page content, (2) taking screenshots, (3) executing JavaScript in the browser, (4) clicking elements or filling forms, (5) searching and installing Chrome Web Store extensions, (6) interacting with web APIs that require a real browser. NOT for: simple HTTP requests (use curl), local file operations, or when no CDP proxy is available.
Control Eight Sleep pods (status, temperature, alarms, schedules).
Gemini CLI for one-shot Q&A, summaries, and generation.
Batch-generate images via OpenAI Images API. Random prompt sampler + `index.html` gallery.
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
ElevenLabs text-to-speech with mac-style say UX.
| name | openai-whisper |
| description | Local speech-to-text with the Whisper CLI (no API key). |
| homepage | https://openai.com/research/whisper |
| metadata | {"openclaw":{"emoji":"🎤","requires":{"bins":["whisper"]},"install":[{"id":"brew","kind":"brew","formula":"openai-whisper","bins":["whisper"],"label":"Install OpenAI Whisper (brew)"}]}} |
Use whisper to transcribe audio locally.
Quick start
whisper /path/audio.mp3 --model medium --output_format txt --output_dir .whisper /path/audio.m4a --task translate --output_format srtNotes
~/.cache/whisper on first run.--model defaults to turbo on this install.