speech to text real time | Deepgram | Page 1

sick wolf Sep 6, 2025, 8:30 PM

I wanna build something where the bot sends responses in real time into our kickchat

real lanternBOT Sep 6, 2025, 8:30 PM

Hey there! This channel allows you to ask Kapa.ai for assistance. But it looks like you haven't connected your Discord profile to your Deepgram account yet. You can do so at https://community.deepgram.com - by doing so, you'll be able to ask questions in #1115960287183511643 if you need human assistance!

hazy spindleBOT Sep 6, 2025, 8:31 PM

Hi @sick wolf. I'm kapa.ai, your friendly support bot. I will try to answer your question by searching through the knowledge base. :wave:

If you want real-time speech-to-text that you can stream into your chat, use Deepgram’s Streaming STT WebSocket API. It sends incremental transcripts as audio is received, ideal for live chat updates.

Basic approach:

1. Open a WebSocket to wss://api.deepgram.com/v1/listen with your API key.
2. Stream audio chunks to the socket.
3. Listen for messages of type "Results" and post transcripts to your chat as they arrive.
4. Send {"type":"CloseStream"} when done.
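The message-handling side of those steps can be sketched without opening a real socket. This is a minimal sketch, not Deepgram's SDK: the function names are hypothetical, and the field layout it reads (`channel.alternatives[0].transcript`) and the `interim_results` query parameter are assumptions about the response shape that should be checked against the Live Audio reference.

```python
import json
from typing import Optional
from urllib.parse import urlencode

# Endpoint taken from the steps above.
DEEPGRAM_LISTEN_URL = "wss://api.deepgram.com/v1/listen"


def build_listen_url(interim_results: bool = True) -> str:
    """Build the WebSocket URL; the query parameter name is an assumption."""
    query = urlencode({"interim_results": str(interim_results).lower()})
    return f"{DEEPGRAM_LISTEN_URL}?{query}"


def extract_transcript(raw_message: str) -> Optional[str]:
    """Return the transcript from a "Results" message, or None otherwise.

    Assumes the transcript lives at channel.alternatives[0].transcript.
    """
    message = json.loads(raw_message)
    if message.get("type") != "Results":
        return None
    alternatives = message.get("channel", {}).get("alternatives", [])
    if not alternatives:
        return None
    transcript = alternatives[0].get("transcript", "").strip()
    return transcript or None  # treat empty transcripts (silence) as None


def close_stream_message() -> str:
    """JSON text to send when the audio stream is finished."""
    return json.dumps({"type": "CloseStream"})


# Hand-written sample message for illustration, not captured from the API.
sample = json.dumps({
    "type": "Results",
    "channel": {"alternatives": [{"transcript": "hello chat"}]},
})
text = extract_transcript(sample)
if text is not None:
    print(text)  # in a real bot, forward this to your chat instead
```

In a real bot you would call `extract_transcript` on each message received from the socket and pass any non-empty result to whatever function posts to your chat.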

Docs and examples:

Streaming STT overview and SDK/non-SDK examples (including interim results) (Getting Started with live streaming audio) (Getting Started).
WebSocket API reference with handshake details (Live Audio reference).
Node.js example streaming a remote audio source and printing transcripts (pipe these into your chat) (JS sample).
Python streaming examples (threading and asyncio) you can adapt similarly (Python sample 1, Python sample 2).
For conversation flows (detecting utterance boundaries), see utterance segmentation guidance in the Dialogflow CX integration doc; same pattern applies if you’re forwarding to your own bot (High-level architecture).

Notes:

If you need the text to appear as the speaker talks, enable interim results (your chat can update progressively) (Interim results).
Pre-recorded vs streaming can differ slightly in accuracy; streaming is optimized for low latency (Model behavior note).
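To make the interim-results note concrete, here is a rough sketch of how a chat message could be updated progressively. The `ProgressiveTranscript` class is hypothetical, and it assumes each result carries a boolean flag (called `is_final` here) that marks whether the segment is settled; interim text replaces the pending segment, and final text is appended permanently.

```python
from typing import List


class ProgressiveTranscript:
    """Tracks finalized segments plus one pending interim segment."""

    def __init__(self) -> None:
        self.finalized: List[str] = []
        self.pending: str = ""

    def update(self, transcript: str, is_final: bool) -> str:
        """Apply one result and return the full text to display."""
        if is_final:
            # A final result replaces whatever interim text was pending.
            if transcript:
                self.finalized.append(transcript)
            self.pending = ""
        else:
            # Interim results overwrite each other until a final arrives.
            self.pending = transcript
        return self.text()

    def text(self) -> str:
        """Join finalized segments and the pending segment, if any."""
        parts = self.finalized + ([self.pending] if self.pending else [])
        return " ".join(parts)


display = ProgressiveTranscript()
display.update("hello", is_final=False)
display.update("hello wor", is_final=False)
display.update("hello world", is_final=True)
print(display.update("how are", is_final=False))
```

Whether you edit one chat message in place or only post finalized segments depends on what your chat platform allows; posting only finals is the simpler option if messages cannot be edited.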

If you share your stack (Node/Python/etc.), I can point you to the closest sample and show a minimal snippet to forward transcripts into Kickchat.

Note: you can ask me a follow up question by @ mentioning me again :speech_balloon:

@sick wolf Kapa.ai is still learning and improving, please let me know how I did by reacting below.

sick wolf Sep 6, 2025, 8:45 PM

I'm using Cursor, ever heard of it?
