voice-setup
Complete voice configuration in chat - PTT key, microphone permissions, ElevenLabs TTS, and troubleshooting
| name | voice-setup |
| description | Complete voice configuration in chat - PTT key, microphone permissions, ElevenLabs TTS, and troubleshooting |
| compatibility | Designed for Vellum personal assistants |
| metadata | {"icon":"assets/icon.svg","emoji":"🎙️","vellum":{"category":"voice","display-name":"Voice Setup","includes":["elevenlabs-voice"],"activation-hints":["Guided setup or troubleshooting (walkthrough, PTT not working, mic issues, ElevenLabs/TTS)","Simple voice setting changes (PTT key, wake word) -> use voice_config_update directly"],"avoid-when":["If \"voice\" is in a Twilio/phone context, load phone-calls instead"]}} |
You are helping the user set up and troubleshoot voice features (push-to-talk, text-to-speech) entirely within this conversation. Do NOT direct the user to the Settings page for initial setup - handle everything in-chat using the tools below.
- voice_config_update - Change any voice setting (PTT key, conversation timeout, TTS voice ID)
- open_system_settings - Open macOS System Settings to a specific privacy pane
- navigate_settings_tab - Open the Vellum settings panel to the Voice tab
- assistant credentials prompt - Collect API keys securely (for ElevenLabs TTS)

Walk the user through each section in order. Skip sections they don't need. Ask before proceeding to the next section.
Check <channel_capabilities> for microphone_permission_granted.
If false or missing:
- open_system_settings with pane: "microphone" to open the right System Settings pane.

If true: Tell them microphone is already granted and move on.
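The permission check above can be sketched as a small decision function. This is only an illustration: the dictionary shape used for the capabilities and for the returned tool call is an assumption, not the assistant's actual tool-calling format.

```python
from typing import Any, Mapping


def next_microphone_step(capabilities: Mapping[str, Any]) -> dict:
    """Pick the next action from the microphone permission flag."""
    # A missing flag is handled the same way as False, as described above.
    if capabilities.get("microphone_permission_granted") is True:
        return {
            "action": "tell_user",
            "message": "Microphone access is already granted.",
        }
    # Hypothetical representation of a call to the open_system_settings tool.
    return {
        "action": "call_tool",
        "tool": "open_system_settings",
        "args": {"pane": "microphone"},
    }
```

Treating a missing flag like False keeps the flow safe by default: the user is guided to the permission pane unless the grant is explicitly confirmed.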
Present common PTT key options:
Ask which key they prefer, then use voice_config_update with setting: "activation_key" and the chosen value.
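A minimal sketch of the arguments for that update follows. The setting name "activation_key" comes from the text above; the "value" field name, the wrapper dictionary, and the key name "example_key" are assumptions made for illustration only.

```python
def build_activation_key_update(chosen_key: str) -> dict:
    """Build a hypothetical voice_config_update request for the PTT key."""
    key = chosen_key.strip()
    if not key:
        # Refuse to send an empty key rather than guess a default.
        raise ValueError("activation key must not be empty")
    return {
        "tool": "voice_config_update",
        # "value" is an assumed field name for the chosen key.
        "args": {"setting": "activation_key", "value": key},
    }


# "example_key" is a placeholder, not a key name confirmed by the skill.
request = build_activation_key_update("example_key")
```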
Common issues to mention:
Ask if they want high-quality text-to-speech voices via ElevenLabs (optional - standard TTS works without it).
If yes, the included ElevenLabs Voice skill (automatically appended below via includes) provides the full setup flow: curated voice list, API key collection, advanced voice selection, and tuning parameters. Follow the instructions there.
Note: The config key services.tts.providers.elevenlabs.voiceId controls the voice for both in-app TTS and phone calls. If the user sets up phone calls later, they will automatically use the same voice for a consistent experience.
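To make the dotted config key concrete, the sketch below expands it into a nested mapping. It assumes the configuration is stored as nested dictionaries, which the skill does not confirm, and "example-voice-id" is a placeholder rather than a real ElevenLabs voice ID.

```python
from typing import Any


def set_dotted(config: dict, dotted_key: str, value: Any) -> dict:
    """Set a value in a nested dict using a dot-separated key path."""
    node = config
    *parents, leaf = dotted_key.split(".")
    for part in parents:
        # Create intermediate levels on demand.
        node = node.setdefault(part, {})
    node[leaf] = value
    return config


# Placeholder voice ID; the key path is the one named in the note above.
config = set_dotted(
    {}, "services.tts.providers.elevenlabs.voiceId", "example-voice-id"
)
```

Because in-app TTS and phone calls read the same key, a single write like this is enough to keep both voices consistent.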
After setup is complete:
- navigate_settings_tab with tab: "Voice".

When the user reports a problem, follow the appropriate decision tree:
- microphone_permission_granted in capabilities. If false, guide through granting it.
- open_system_settings with pane: "speech_recognition".
- navigate_settings_tab to confirm the setting was persisted.

For persistent issues, suggest checking system logs:
log stream --predicate 'subsystem == "com.vellum.assistant"' --level debug
Key log categories:
- voice - PTT activation, recording state
- speech - Speech recognition results

navigate_settings_tab only for review/verification after in-chat setup, not as the primary setup method.
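To narrow the log stream to one of the log categories listed above, the predicate can be extended with a category clause. The sketch below builds the command as an argument list. It assumes the skill's category names map to the unified logging category field, which the source does not state.

```python
from typing import List, Optional


def build_log_command(subsystem: str, category: Optional[str] = None) -> List[str]:
    """Build a macOS `log stream` argument list, optionally filtered by category."""
    predicate = f'subsystem == "{subsystem}"'
    if category is not None:
        # Assumption: "voice" and "speech" are unified-logging categories.
        predicate += f' AND category == "{category}"'
    return ["log", "stream", "--predicate", predicate, "--level", "debug"]


voice_command = build_log_command("com.vellum.assistant", "voice")
```

Passing the list to a process launcher avoids shell quoting problems with the nested quotes in the predicate.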