ワンクリックで
voice-setup
Complete voice configuration in chat - PTT key, microphone permissions, ElevenLabs TTS, and troubleshooting
Codex または Claude でインストール この Prompt をコピーして Codex、Claude、または他のアシスタントに貼り付けると、Skill ページを確認してインストールできます。
メニュー
Complete voice configuration in chat - PTT key, microphone permissions, ElevenLabs TTS, and troubleshooting
Codex または Claude でインストール この Prompt をコピーして Codex、Claude、または他のアシスタントに貼り付けると、Skill ページを確認してインストールできます。
SOC 職業分類に基づく
Read, search, send, and manage messages across Gmail, Outlook, Telegram, and other platforms
An on-demand personal daily briefing — weather, headlines, the shape of your day, and one thing worth your attention — in a sharp executive-assistant voice. The general-purpose morning brief; richer work or admin digests compose it as their general layer.
One-time migration of an existing memory-v2 concept corpus into the memory-v3 section-grain "wiki" — topical articles with a stand-alone lead and queryable sections — with loss-proof staging, assistant-reviewed authoring, and a retrieval-eval gate before cutover.
Delegate a big or high-stakes job to a fleet of parallel subagents, orchestrated deterministically; runs unattended and reports back
Manage contacts, communication channels, access control, and invite links
Build and edit small, personal visual tools and artifacts — dashboards, trackers, calculators, data visualizations, charts, simple landing pages, and slide decks the user wants for THEMSELVES. This is the right skill whenever the user asks to "visualize this," "make a chart," or "build an artifact" for their own use, or to edit an app they already built here. Do NOT reach for a ui_show dynamic_page to fake an artifact — build a real persistent app here. NOT for complex, multi-user, or shippable products — those go to a real project folder with a coding agent (see Scope below).
| name | voice-setup |
| description | Complete voice configuration in chat - PTT key, microphone permissions, ElevenLabs TTS, and troubleshooting |
| compatibility | Designed for Vellum personal assistants |
| metadata | {"icon":"assets/icon.svg","emoji":"🎙️","vellum":{"category":"voice","display-name":"Voice Setup","includes":["elevenlabs-voice"],"activation-hints":["Guided setup or troubleshooting (walkthrough, PTT not working, mic issues, ElevenLabs/TTS)","Simple voice setting changes (PTT key, wake word) -> use voice_config_update directly"],"avoid-when":["If \"voice\" is in a Twilio/phone context, load phone-calls instead"]}} |
You are helping the user set up and troubleshoot voice features (push-to-talk, text-to-speech) entirely within this conversation. Do NOT direct the user to the Settings page for initial setup - handle everything in-chat using the tools below.
voice_config_update - Change any voice setting (PTT key, conversation timeout, TTS voice ID)open_system_settings - Open macOS System Settings to a specific privacy panenavigate_settings_tab - Open the Vellum settings panel to the Voice tabassistant credentials prompt - Collect API keys securely (for ElevenLabs TTS)Walk the user through each section in order. Skip sections they don't need. Ask before proceeding to the next section.
Check <channel_capabilities> for microphone_permission_granted.
If false or missing:
open_system_settings with pane: "microphone" to open the right System Settings pane.If true: Tell them microphone is already granted and move on.
Present common PTT key options:
Ask which key they prefer, then use voice_config_update with setting: "activation_key" and the chosen value.
Common issues to mention:
Ask if they want high-quality text-to-speech voices via ElevenLabs (optional - standard TTS works without it).
If yes, the included ElevenLabs Voice skill (automatically appended below via includes) provides the full setup flow: curated voice list, API key collection, advanced voice selection, and tuning parameters. Follow the instructions there.
Note: The config key services.tts.providers.elevenlabs.voiceId controls the voice for both in-app TTS and phone calls. If the user sets up phone calls later, they will automatically use the same voice for a consistent experience.
After setup is complete:
navigate_settings_tab with tab: "Voice".When the user reports a problem, follow the appropriate decision tree:
microphone_permission_granted in capabilities. If false, guide through granting it.open_system_settings with pane: "speech_recognition".navigate_settings_tab to confirm the setting was persisted.For persistent issues, suggest checking system logs:
log stream --predicate 'subsystem == "com.vellum.assistant"' --level debug
Key log categories:
voice - PTT activation, recording statespeech - Speech recognition resultsnavigate_settings_tab only for review/verification after in-chat setup, not as the primary setup method.