| name | starters |
| description | Clone a ready-to-run Deepgram demo app and start building on top of it. Use whenever someone wants a quick working demo, needs to prototype with Deepgram, or is starting a new project that uses speech-to-text, text-to-speech, voice agents, audio intelligence, or live streaming. Match the user's language, framework, and desired Deepgram feature to the right starter.
Deepgram Starter Apps
Clone a working demo and start building. Every starter is a minimal, runnable app you can extend.
1. Pick Your Feature
What do you want to build?
- Transcribe a file → transcription: send audio/video, get text back (REST, Nova)
- Transcribe a live stream → live-transcription: real-time speech-to-text (WebSocket, Nova)
- Generate speech → text-to-speech: send text, get audio back (REST)
- Stream speech → live-text-to-speech: real-time text-to-audio (WebSocket)
- Analyze text or audio → text-intelligence: sentiment, topics, intents, summaries (REST)
- Build a voice agent → voice-agent: conversational AI agent (WebSocket, agent.deepgram.com)
- Conversational STT with turn detection → flux: Deepgram Flux for voice agents and interactive assistants (WebSocket, /v2/listen)
Nova vs Flux for speech-to-text: use transcription or live-transcription (Nova, /v1/listen) for general-purpose transcription, captions, and batch workloads. Use flux (Flux, /v2/listen) when you need built-in turn detection for conversational audio. See the api skill for a full comparison.
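The Nova vs Flux rule above can be sketched as a small shell function. This is only an illustration: `stt_starter` and its two yes/no arguments are hypothetical names, not part of any Deepgram tool.

```shell
# Hypothetical helper: pick a speech-to-text starter feature name.
# $1 = "yes" if built-in turn detection is needed
# $2 = "yes" if the audio arrives as a live stream
stt_starter() {
  needs_turn_detection=$1
  is_live=$2
  if [ "$needs_turn_detection" = "yes" ]; then
    echo "flux"                 # Flux, WebSocket, /v2/listen
  elif [ "$is_live" = "yes" ]; then
    echo "live-transcription"   # Nova, WebSocket, /v1/listen
  else
    echo "transcription"        # Nova, REST, /v1/listen
  fi
}

# Live captions with no turn detection
stt_starter no yes
```

Turn detection is checked first because it is the only condition that moves you from Nova to Flux.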
2. Pick Your Stack
| Language | Frameworks |
|---|---|
| JavaScript | node |
| TypeScript | bun, deno |
| Python | fastapi, flask, django |
| Go | go |
| Java | java |
| C# | csharp |
| Rust | rust |
| Ruby | ruby |
| PHP | php |
| C++ | cpp |
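If you want to script the lookup, the table above maps directly onto a `case` statement. The function name `frameworks_for` is a hypothetical helper for illustration; the language and framework names are the ones in the table.

```shell
# Hypothetical helper: print the starter framework names for a language,
# exactly as listed in the table above.
frameworks_for() {
  case "$1" in
    JavaScript) echo "node" ;;
    TypeScript) echo "bun deno" ;;
    Python)     echo "fastapi flask django" ;;
    Go)         echo "go" ;;
    Java)       echo "java" ;;
    "C#")       echo "csharp" ;;
    Rust)       echo "rust" ;;
    Ruby)       echo "ruby" ;;
    PHP)        echo "php" ;;
    "C++")      echo "cpp" ;;
    *)          echo "unknown language: $1" >&2; return 1 ;;
  esac
}

frameworks_for Python
```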
3. Clone and Run
Every starter lives at https://github.com/deepgram-starters/{framework}-{feature}:
git clone https://github.com/deepgram-starters/{framework}-{feature}.git
cd {framework}-{feature}
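The naming pattern can be wrapped in a small function that fills in the two placeholders. `starter_url` is a hypothetical helper, and the sketch only checks the feature name against the list in step 1. It does not confirm that a repository exists for every framework and feature pair, so treat a valid-looking URL as a candidate until the clone succeeds.

```shell
# Hypothetical helper: build the clone URL for a starter.
# $1 = framework (e.g. fastapi), $2 = feature (e.g. voice-agent)
starter_url() {
  framework=$1
  feature=$2
  # Accept only the feature names listed in step 1.
  case "$feature" in
    transcription|live-transcription|text-to-speech|live-text-to-speech|text-intelligence|voice-agent|flux) ;;
    *) echo "unknown feature: $feature" >&2; return 1 ;;
  esac
  echo "https://github.com/deepgram-starters/${framework}-${feature}.git"
}

# Print the clone command instead of running it, so the sketch needs no network.
if url=$(starter_url fastapi voice-agent); then
  echo "git clone $url"
fi
```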
Set your API key and follow the README:
export DEEPGRAM_API_KEY=your_key_here
Get an API key at https://console.deepgram.com.
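A starter cannot call Deepgram without the key, so it can help to check the variable before running anything. The guard below is a minimal sketch; `require_api_key` is a hypothetical name, and each starter's README remains the authority on how it reads its configuration.

```shell
# Hypothetical guard: fail early when DEEPGRAM_API_KEY is unset or empty.
require_api_key() {
  if [ -z "${DEEPGRAM_API_KEY:-}" ]; then
    echo "DEEPGRAM_API_KEY is not set; get a key at https://console.deepgram.com" >&2
    return 1
  fi
}

if require_api_key; then
  echo "API key found, ready to run the starter"
fi
```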
Examples
"I want to build a voice agent in Python"
→ git clone https://github.com/deepgram-starters/fastapi-voice-agent.git
"I need live transcription in my Node app"
→ git clone https://github.com/deepgram-starters/node-live-transcription.git
"I want to add text-to-speech to my Go service"
→ git clone https://github.com/deepgram-starters/go-text-to-speech.git
"I want to analyze audio for sentiment in C#"
→ git clone https://github.com/deepgram-starters/csharp-text-intelligence.git
All Starters
Need something more specific?
- Focused feature snippets (one feature, one language, < 50 lines) → recipes skill: https://github.com/deepgram/recipes
- Third-party integrations (Twilio, LiveKit, LangChain, Vercel AI SDK, Discord, etc.) → examples skill: https://github.com/deepgram/examples
- SDK-specific code skills (idiomatic imports, async patterns, gotchas) → npx skills add deepgram/deepgram-{lang}-sdk (see the api skill for the full list of 9 SDKs)
Related Deepgram skills
api: consolidated REST + WebSocket API reference
recipes: minimal runnable feature snippets per language
examples: full integration examples with third-party platforms
docs: documentation finder
setup-mcp: Deepgram MCP server installation