一键在 Manus 中运行任何 Skill

开始使用

voice-setup

星标744

分支114

更新时间2026年6月15日 21:08

Complete voice configuration in chat - PTT key, microphone permissions, ElevenLabs TTS, and troubleshooting

安装

用 Codex 或 Claude 帮你安装复制这段 Prompt，粘贴到 Codex、Claude 或其他助手里，让它检查 Skill 页面并帮你完成安装。

在 Manus 中运行

来源

vellum-ai

vellum-ai/vellum-assistant

打开 GitHub 仓库查看创作者相关仓库

下载

在 Manus 中运行

Available Tools

voice_config_update - Change any voice setting (PTT key, conversation timeout, TTS voice ID)
open_system_settings - Open macOS System Settings to a specific privacy pane
navigate_settings_tab - Open the Vellum settings panel to the Voice tab
assistant credentials prompt - Collect API keys securely (for ElevenLabs TTS)

Setup Flow

Walk the user through each section in order. Skip sections they don't need. Ask before proceeding to the next section.

1. Microphone Permission

Check <channel_capabilities> for microphone_permission_granted.

If false or missing:

Explain that macOS requires microphone permission for voice features.
Use open_system_settings with pane: "microphone" to open the right System Settings pane.
Tell the user: "I've opened System Settings to the Microphone section. Please toggle Vellum Assistant on, then come back here."
After they confirm, verify by checking capabilities on the next turn.

If true: Tell them microphone is already granted and move on.

2. Push-to-Talk Activation Key

Present common PTT key options:

Right Option - Default, good general choice
Fn - Dedicated key on most Mac keyboards
Right Command - Easy to reach
Right Control - Familiar from gaming

Ask which key they prefer, then use voice_config_update with setting: "activation_key" and the chosen value.

Common issues to mention:

If they pick a key that conflicts with their emoji picker (Fn or Globe on newer Macs), warn them and suggest an alternative.
If they use a terminal app heavily, warn that some keys may be captured by the terminal.

3. Text-to-Speech / ElevenLabs (Optional)

Ask if they want high-quality text-to-speech voices via ElevenLabs (optional - standard TTS works without it).

If yes, the included ElevenLabs Voice skill (automatically appended below via includes) provides the full setup flow: curated voice list, API key collection, advanced voice selection, and tuning parameters. Follow the instructions there.

Note: The config key services.tts.providers.elevenlabs.voiceId controls the voice for both in-app TTS and phone calls. If the user sets up phone calls later, they will automatically use the same voice for a consistent experience.

4. Verification

After setup is complete:

Summarize what was configured.
Suggest they test by pressing their PTT key and speaking.
Offer to open the Voice settings tab if they want to review: use navigate_settings_tab with tab: "Voice".

Troubleshooting Decision Trees

When the user reports a problem, follow the appropriate decision tree:

"PTT isn't working" / "Can't record"

Microphone permission - Check microphone_permission_granted in capabilities. If false, guide through granting it.
Key check - Ask what key they're using. Confirm it matches their configured PTT key.
Emoji picker conflict - On newer Macs, Fn/Globe opens the emoji picker. If they're using Fn, suggest switching to Right Option or Right Command.
Speech Recognition permission - Some voice features need this. Use open_system_settings with pane: "speech_recognition".
App focus - PTT may not work when Vellum is not the frontmost app or if another app has captured the key.

"Recording but no text" / "Transcription not working"

Speech Recognition permission - Must be granted for transcription.
Microphone input - Ask if they see the recording indicator. If yes, the mic works but transcription is failing.
Locale/language - Speech recognition works best with the system language. Ask if they're speaking in a different language.
Background noise - Excessive noise can prevent transcription. Suggest a quieter environment or a closer microphone.

"Changed a setting but it didn't work"

Event broadcast - The setting should take effect immediately. If it didn't, suggest restarting the assistant.
Verify - Open the Voice settings tab with navigate_settings_tab to confirm the setting was persisted.

Deep Debugging

For persistent issues, suggest checking system logs:

log stream --predicate 'subsystem == "com.vellum.assistant"' --level debug

Key log categories:

voice - PTT activation, recording state
speech - Speech recognition results

Rules

Always handle setup conversationally in-chat. Do NOT tell the user to go to Settings for initial configuration.
Use navigate_settings_tab only for review/verification after in-chat setup, not as the primary setup method.
Be concise. Don't explain every option exhaustively - present the most common choices and let the user ask for more.
If a permission is denied, acknowledge it gracefully and explain what features won't work without it.

同仓库更多 Skills

同仓库

messaging

vellum-ai/vellum-assistant

Read, search, send, and manage messages across Gmail, Outlook, Telegram, and other platforms

2026-06-23744

start-the-day

vellum-ai/vellum-assistant

An on-demand personal daily briefing — weather, headlines, the shape of your day, and one thing worth your attention — in a sharp executive-assistant voice. The general-purpose morning brief; richer work or admin digests compose it as their general layer.

2026-06-22744

vellum-memory-v3-migration

vellum-ai/vellum-assistant

One-time migration of an existing memory-v2 concept corpus into the memory-v3 section-grain "wiki" — topical articles with a stand-alone lead and queryable sections — with loss-proof staging, assistant-reviewed authoring, and a retrieval-eval gate before cutover.

2026-06-22744

workflows

vellum-ai/vellum-assistant

Delegate a big or high-stakes job to a fleet of parallel subagents, orchestrated deterministically; runs unattended and reports back

2026-06-22744

contacts

vellum-ai/vellum-assistant

Manage contacts, communication channels, access control, and invite links

2026-06-22744

app-builder

vellum-ai/vellum-assistant

Build and edit small, personal visual tools and artifacts — dashboards, trackers, calculators, data visualizations, charts, simple landing pages, and slide decks the user wants for THEMSELVES. This is the right skill whenever the user asks to "visualize this," "make a chart," or "build an artifact" for their own use, or to edit an app they already built here. Do NOT reach for a ui_show dynamic_page to fake an artifact — build a real persistent app here. NOT for complex, multi-user, or shippable products — those go to a real project folder with a coding agent (see Scope below).

2026-06-22744

name	voice-setup
description	Complete voice configuration in chat - PTT key, microphone permissions, ElevenLabs TTS, and troubleshooting
compatibility	Designed for Vellum personal assistants
metadata	{"icon":"assets/icon.svg","emoji":"🎙️","vellum":{"category":"voice","display-name":"Voice Setup","includes":["elevenlabs-voice"],"activation-hints":["Guided setup or troubleshooting (walkthrough, PTT not working, mic issues, ElevenLabs/TTS)","Simple voice setting changes (PTT key, wake word) -> use voice_config_update directly"],"avoid-when":["If \"voice\" is in a Twilio/phone context, load phone-calls instead"]}}

Available Tools

voice_config_update - Change any voice setting (PTT key, conversation timeout, TTS voice ID)
open_system_settings - Open macOS System Settings to a specific privacy pane
navigate_settings_tab - Open the Vellum settings panel to the Voice tab
assistant credentials prompt - Collect API keys securely (for ElevenLabs TTS)

Setup Flow

Walk the user through each section in order. Skip sections they don't need. Ask before proceeding to the next section.

1. Microphone Permission

Check <channel_capabilities> for microphone_permission_granted.

If false or missing:

Explain that macOS requires microphone permission for voice features.
Use open_system_settings with pane: "microphone" to open the right System Settings pane.
Tell the user: "I've opened System Settings to the Microphone section. Please toggle Vellum Assistant on, then come back here."
After they confirm, verify by checking capabilities on the next turn.

If true: Tell them microphone is already granted and move on.

2. Push-to-Talk Activation Key

Present common PTT key options:

Right Option - Default, good general choice
Fn - Dedicated key on most Mac keyboards
Right Command - Easy to reach
Right Control - Familiar from gaming

Ask which key they prefer, then use voice_config_update with setting: "activation_key" and the chosen value.

Common issues to mention:

If they pick a key that conflicts with their emoji picker (Fn or Globe on newer Macs), warn them and suggest an alternative.
If they use a terminal app heavily, warn that some keys may be captured by the terminal.

3. Text-to-Speech / ElevenLabs (Optional)

Ask if they want high-quality text-to-speech voices via ElevenLabs (optional - standard TTS works without it).

4. Verification

After setup is complete:

Summarize what was configured.
Suggest they test by pressing their PTT key and speaking.
Offer to open the Voice settings tab if they want to review: use navigate_settings_tab with tab: "Voice".

Troubleshooting Decision Trees

When the user reports a problem, follow the appropriate decision tree:

"PTT isn't working" / "Can't record"

Microphone permission - Check microphone_permission_granted in capabilities. If false, guide through granting it.
Key check - Ask what key they're using. Confirm it matches their configured PTT key.
Emoji picker conflict - On newer Macs, Fn/Globe opens the emoji picker. If they're using Fn, suggest switching to Right Option or Right Command.
Speech Recognition permission - Some voice features need this. Use open_system_settings with pane: "speech_recognition".
App focus - PTT may not work when Vellum is not the frontmost app or if another app has captured the key.

"Recording but no text" / "Transcription not working"

Speech Recognition permission - Must be granted for transcription.
Microphone input - Ask if they see the recording indicator. If yes, the mic works but transcription is failing.
Locale/language - Speech recognition works best with the system language. Ask if they're speaking in a different language.
Background noise - Excessive noise can prevent transcription. Suggest a quieter environment or a closer microphone.

"Changed a setting but it didn't work"

Event broadcast - The setting should take effect immediately. If it didn't, suggest restarting the assistant.
Verify - Open the Voice settings tab with navigate_settings_tab to confirm the setting was persisted.

Deep Debugging

For persistent issues, suggest checking system logs:

log stream --predicate 'subsystem == "com.vellum.assistant"' --level debug

Key log categories:

voice - PTT activation, recording state
speech - Speech recognition results

Rules

Always handle setup conversationally in-chat. Do NOT tell the user to go to Settings for initial configuration.
Use navigate_settings_tab only for review/verification after in-chat setup, not as the primary setup method.
Be concise. Don't explain every option exhaustively - present the most common choices and let the user ask for more.
If a permission is denied, acknowledge it gracefully and explain what features won't work without it.