| name | starters |
| description | Clone a ready-to-run Deepgram demo app and start building on top of it. Use whenever someone wants a quick working demo, needs to prototype with Deepgram, or is starting a new project that uses speech-to-text, text-to-speech, voice agents, audio intelligence, or live streaming. Match the user's language, framework, and desired Deepgram feature to the right starter.
|
Deepgram Starter Apps
Clone a working demo and start building. Every starter is a minimal, runnable app you can extend.
1. Pick Your Feature
What do you want to build?
- Transcribe a file →
transcription — send audio/video, get text back (REST, Nova)
- Transcribe a live stream →
live-transcription — real-time speech-to-text (WebSocket, Nova)
- Generate speech →
text-to-speech — send text, get audio back (REST)
- Stream speech →
live-text-to-speech — real-time text-to-audio (WebSocket)
- Analyze text or audio →
text-intelligence — sentiment, topics, intents, summaries (REST)
- Build a voice agent →
voice-agent — conversational AI agent (WebSocket, agent.deepgram.com)
- Conversational STT with turn detection →
flux — Deepgram Flux for voice agents and interactive assistants (WebSocket, /v2/listen)
Nova vs Flux for speech-to-text: use transcription or live-transcription (Nova, /v1/listen) for general-purpose transcription, captions, and batch workloads. Use flux (Flux, /v2/listen) when you need built-in turn detection for conversational audio. See the api skill for a full comparison.
2. Pick Your Stack
| Language | Frameworks |
|---|
| JavaScript | node |
| TypeScript | bun, deno |
| Python | fastapi, flask, django |
| Go | go |
| Java | java |
| C# | csharp |
| Rust | rust |
| Ruby | ruby |
| PHP | php |
| C++ | cpp |
3. Clone and Run
Every starter lives at https://github.com/deepgram-starters/{framework}-{feature}:
git clone https://github.com/deepgram-starters/{framework}-{feature}.git
cd {framework}-{feature}
Set your API key and follow the README:
export DEEPGRAM_API_KEY=your_key_here
Get an API key at https://console.deepgram.com.
Examples
"I want to build a voice agent in Python"
→ git clone https://github.com/deepgram-starters/fastapi-voice-agent.git
"I need live transcription in my Node app"
→ git clone https://github.com/deepgram-starters/node-live-transcription.git
"I want to add text-to-speech to my Go service"
→ git clone https://github.com/deepgram-starters/go-text-to-speech.git
"I want to analyze audio for sentiment in C#"
→ git clone https://github.com/deepgram-starters/csharp-text-intelligence.git
All Starters
Need something more specific?
- Focused feature snippets (one feature, one language, < 50 lines) →
recipes skill → https://github.com/deepgram/recipes
- Third-party integrations (Twilio, LiveKit, LangChain, Vercel AI SDK, Discord, etc.) →
examples skill → https://github.com/deepgram/examples
- SDK-specific code skills (idiomatic imports, async patterns, gotchas) →
npx skills add deepgram/deepgram-{lang}-sdk — see the api skill for the full list of 9 SDKs.
Related Deepgram skills
api — consolidated REST + WebSocket API reference
recipes — minimal runnable feature snippets per language
examples — full integration examples with third-party platforms
docs — documentation finder
setup-mcp — Deepgram MCP server installation