Run any Skill in Manus with one click

$pwd:

deepgram-go-voice-agent

Name: Deepgram Go Voice Agent
Author: deepgram

// Use when writing or reviewing Go code in this repo that runs a Deepgram Voice Agent session over WebSockets, including runtime settings, prompt updates, speak updates, injected messages, and event handling. Route standalone STT/TTS work to deepgram-go-speech-to-text or deepgram-go-text-to-speech.

Run Skill in Manus

$ git log --oneline --stat

stars:84

forks:52

updated:April 27, 2026 at 09:40

SKILL.md

readonly

name	deepgram-go-voice-agent
description	Use when writing or reviewing Go code in this repo that runs a Deepgram Voice Agent session over WebSockets, including runtime settings, prompt updates, speak updates, injected messages, and event handling. Route standalone STT/TTS work to deepgram-go-speech-to-text or deepgram-go-text-to-speech.

Using Deepgram Voice Agent from the Go SDK

When to use this product

Use this skill for live Voice Agent runtime flows in pkg/client/agent.

open an agent WebSocket session
send initial settings
stream audio
react to agent events and function-call messages
update prompt/speak settings during the session

Use a different skill when:

you only need STT (deepgram-go-speech-to-text)
you only need TTS (deepgram-go-text-to-speech)
you need management/admin endpoints (deepgram-go-management-api)

Authentication

Set DEEPGRAM_API_KEY before opening the agent connection.

export DEEPGRAM_API_KEY="your_api_key"

Quick start

package main

import (
	"context"
	"fmt"
	"log"

	agentws "github.com/deepgram/deepgram-go-sdk/v3/pkg/api/agent/v1/websocket"
	agent "github.com/deepgram/deepgram-go-sdk/v3/pkg/client/agent"
)

func main() {
	if err := run(); err != nil {
		log.Fatal(err)
	}
}

func run() error {
	ctx := context.Background()
	settings := agent.NewSettingsConfigurationOptions()
	handler := agentws.NewDefaultChanHandler()

	conn, err := agent.NewWSUsingChanWithDefaults(ctx, settings, handler)
	if err != nil {
		return err
	}
	defer conn.Stop()

	if ok := conn.Connect(); !ok {
		return fmt.Errorf("connect failed")
	}

	conn.Start()

	// The handler receives Welcome, ConversationText, FunctionCallRequest, and audio events.
	// Stream audio frames, watch agent events, and respond to function calls as needed.
	return nil
}

Key parameters

constructors
- agent.NewSettingsConfigurationOptions()
- agent.NewWSUsingChanWithDefaults(...)
- agent.NewWSUsingChan(...)
runtime methods
- Connect, Start, ProcessMessage, Stream, Write, KeepAlive
- reconnect helpers like AttemptReconnect and error handling helpers in the WS client
message and event payloads in pkg/api/agent/v1/websocket/interfaces/types.go
- UpdatePrompt
- UpdateSpeak
- InjectAgentMessage
- InjectUserMessage
- FunctionCallResponse
- server events such as WelcomeResponse, ConversationTextResponse, FunctionCallRequestResponse, AgentStartedSpeakingResponse, AgentAudioDoneResponse

API reference (layered)

In-repo reference
- README.md
- pkg/client/agent/client.go
- pkg/client/agent/v1/websocket/client_channel.go
- pkg/client/agent/v1/websocket/new_using_chan.go
- pkg/client/interfaces/v1/types-agent.go
- pkg/api/agent/v1/websocket/interfaces/types.go
OpenAPI
- https://developers.deepgram.com/openapi.yaml
AsyncAPI
- https://developers.deepgram.com/asyncapi.yaml
Context7
- /llmstxt/developers_deepgram_llms_txt
Product docs
- https://developers.deepgram.com/reference/voice-agent/voice-agent
- https://developers.deepgram.com/docs/voice-agent
- https://developers.deepgram.com/docs/configure-voice-agent
- https://developers.deepgram.com/docs/voice-agent-message-flow

Gotchas

This repo exposes live Voice Agent runtime over WebSockets, not a persisted configuration-management surface.
Keep audio streaming, event handling, and any function-call response loop running concurrently.
Follow the example session setup instead of inventing your own event names; the repo already defines concrete message structs.
Channel-based constructors require an AgentMessageChan handler; events are routed there rather than pulled from the client.
Use defer conn.Stop(), and remember that Connect() returns bool rather than error.

Example files in this repo

examples/agent/websocket/simple/main.go
examples/agent/websocket/no_mic/main.go
examples/agent/websocket/arbitrary_keys/main.go
tests/unit_test/agent/agent_speak_test.go

Central product skills

For cross-language Deepgram product knowledge — the consolidated API reference, documentation finder, focused runnable recipes, third-party integration examples, and MCP setup — install the central skills:

npx skills add deepgram/skills

This SDK ships language-idiomatic code skills; deepgram/skills ships cross-language product knowledge (see api, docs, recipes, examples, starters, setup-mcp).

related-skills.json

same repository

deepgram-go-audio-intelligence.md

from "deepgram/deepgram-go-sdk"

Use when writing or reviewing Go code in this repo that applies summaries, topics, intents, sentiment, language detection, diarization, redaction, or entity extraction to audio inputs through Listen v1 REST. Route plain transcription to deepgram-go-speech-to-text and plain-text Read requests to deepgram-go-text-intelligence.

2026-04-2784

deepgram-go-conversational-stt.md

from "deepgram/deepgram-go-sdk"

Use when planning or reviewing Go SDK work for Deepgram conversational STT / Flux v2. This repo does not currently ship a first-class v2 listen client, so route supported v1 transcription to deepgram-go-speech-to-text and document raw WebSocket fallback honestly when v2 is requested.

2026-04-2784

deepgram-go-management-api.md

from "deepgram/deepgram-go-sdk"

Use when writing or reviewing Go code in this repo that works with Deepgram management endpoints for projects, keys, members, scopes, invitations, usage, balances, or models. Route live voice runtime to deepgram-go-voice-agent and repo workflow questions to deepgram-go-maintaining-sdk.

2026-04-2784

deepgram-go-speech-to-text.md

from "deepgram/deepgram-go-sdk"

Use when writing or reviewing Go code in this repo that transcribes prerecorded audio with Listen v1 REST or streams live audio with Listen v1 WebSockets. Route text generation to deepgram-go-text-to-speech, text analysis to deepgram-go-text-intelligence, audio analytics overlays to deepgram-go-audio-intelligence, and Flux or other v2 conversational work to deepgram-go-conversational-stt.

2026-04-2784

deepgram-go-text-intelligence.md

from "deepgram/deepgram-go-sdk"

Use when writing or reviewing Go code in this repo that sends text to Deepgram Read via the analyze client. Route speech/audio inputs to deepgram-go-speech-to-text or deepgram-go-audio-intelligence, and management/admin work to deepgram-go-management-api.

2026-04-2784

deepgram-go-text-to-speech.md

from "deepgram/deepgram-go-sdk"

Use when writing or reviewing Go code in this repo that synthesizes audio with Speak v1 REST or Speak WebSockets. Route transcription work to deepgram-go-speech-to-text, voice conversation runtime work to deepgram-go-voice-agent, and repository maintenance work to deepgram-go-maintaining-sdk.

2026-04-2784

package.json

"author": "deepgram"

"repository": "deepgram/deepgram-go-sdk"

View GitHub Repository View Creator Repositories

$ install --global

$ download --local

Run Skill in Manus

$ useful --forSOC

Software DevelopersComputer and Mathematical Occupations15-1252L4

name	deepgram-go-voice-agent
description	Use when writing or reviewing Go code in this repo that runs a Deepgram Voice Agent session over WebSockets, including runtime settings, prompt updates, speak updates, injected messages, and event handling. Route standalone STT/TTS work to deepgram-go-speech-to-text or deepgram-go-text-to-speech.

Using Deepgram Voice Agent from the Go SDK

When to use this product

Use this skill for live Voice Agent runtime flows in pkg/client/agent.

open an agent WebSocket session
send initial settings
stream audio
react to agent events and function-call messages
update prompt/speak settings during the session

Use a different skill when:

you only need STT (deepgram-go-speech-to-text)
you only need TTS (deepgram-go-text-to-speech)
you need management/admin endpoints (deepgram-go-management-api)

Authentication

Set DEEPGRAM_API_KEY before opening the agent connection.

export DEEPGRAM_API_KEY="your_api_key"

Quick start

package main

import (
	"context"
	"fmt"
	"log"

	agentws "github.com/deepgram/deepgram-go-sdk/v3/pkg/api/agent/v1/websocket"
	agent "github.com/deepgram/deepgram-go-sdk/v3/pkg/client/agent"
)

func main() {
	if err := run(); err != nil {
		log.Fatal(err)
	}
}

func run() error {
	ctx := context.Background()
	settings := agent.NewSettingsConfigurationOptions()
	handler := agentws.NewDefaultChanHandler()

	conn, err := agent.NewWSUsingChanWithDefaults(ctx, settings, handler)
	if err != nil {
		return err
	}
	defer conn.Stop()

	if ok := conn.Connect(); !ok {
		return fmt.Errorf("connect failed")
	}

	conn.Start()

	// The handler receives Welcome, ConversationText, FunctionCallRequest, and audio events.
	// Stream audio frames, watch agent events, and respond to function calls as needed.
	return nil
}

Key parameters

constructors
- agent.NewSettingsConfigurationOptions()
- agent.NewWSUsingChanWithDefaults(...)
- agent.NewWSUsingChan(...)
runtime methods
- Connect, Start, ProcessMessage, Stream, Write, KeepAlive
- reconnect helpers like AttemptReconnect and error handling helpers in the WS client
message and event payloads in pkg/api/agent/v1/websocket/interfaces/types.go
- UpdatePrompt
- UpdateSpeak
- InjectAgentMessage
- InjectUserMessage
- FunctionCallResponse
- server events such as WelcomeResponse, ConversationTextResponse, FunctionCallRequestResponse, AgentStartedSpeakingResponse, AgentAudioDoneResponse

API reference (layered)

In-repo reference
- README.md
- pkg/client/agent/client.go
- pkg/client/agent/v1/websocket/client_channel.go
- pkg/client/agent/v1/websocket/new_using_chan.go
- pkg/client/interfaces/v1/types-agent.go
- pkg/api/agent/v1/websocket/interfaces/types.go
OpenAPI
- https://developers.deepgram.com/openapi.yaml
AsyncAPI
- https://developers.deepgram.com/asyncapi.yaml
Context7
- /llmstxt/developers_deepgram_llms_txt
Product docs
- https://developers.deepgram.com/reference/voice-agent/voice-agent
- https://developers.deepgram.com/docs/voice-agent
- https://developers.deepgram.com/docs/configure-voice-agent
- https://developers.deepgram.com/docs/voice-agent-message-flow

Gotchas

This repo exposes live Voice Agent runtime over WebSockets, not a persisted configuration-management surface.
Keep audio streaming, event handling, and any function-call response loop running concurrently.
Follow the example session setup instead of inventing your own event names; the repo already defines concrete message structs.
Channel-based constructors require an AgentMessageChan handler; events are routed there rather than pulled from the client.
Use defer conn.Stop(), and remember that Connect() returns bool rather than error.

Example files in this repo

examples/agent/websocket/simple/main.go
examples/agent/websocket/no_mic/main.go
examples/agent/websocket/arbitrary_keys/main.go
tests/unit_test/agent/agent_speak_test.go

Central product skills

npx skills add deepgram/skills

This SDK ships language-idiomatic code skills; deepgram/skills ships cross-language product knowledge (see api, docs, recipes, examples, starters, setup-mcp).

deepgram-go-voice-agent

Using Deepgram Voice Agent from the Go SDK

When to use this product

Authentication

Quick start

Key parameters

API reference (layered)

Gotchas

Example files in this repo

Central product skills

More from this repository

More from this repository

Using Deepgram Voice Agent from the Go SDK

When to use this product

Authentication

Quick start

Key parameters

API reference (layered)

Gotchas

Example files in this repo

Central product skills