#Slight static with voice agent

1 messages · Page 1 of 1 (latest)

full willow
#

Hey team! We're experiencing some static/distortion in our Voice Agent audio when using Twilio and could use some help diagnosing.

Setup:
Twilio media streams → Voice Agent API (wss://agent.deepgram.com/v1/agent/converse)
Listen: flux-general-en
Speak: aura-2-thalia-en
Think: OpenAI gpt-4.1
Config (matching the docs):
{ "type": "Settings", "audio": { "input": { "encoding": "mulaw", "sample_rate": 8000 }, "output": { "encoding": "mulaw", "sample_rate": 8000, "container": "none" } }}

What we've verified:
✅ SettingsApplied received after sending config
✅ Using container: "none" (to avoid WAV header issues)
✅ mulaw @ 8kHz (matching Twilio's format)
✅ Audio buffered until SettingsApplied before forwarding
✅ Raw bytes forwarded directly (no transcoding)

Issue:
Subtle but noticeable static/distortion in TTS output throughout the call (not just at the start).
Sample for debugging:
Request ID: 75c60da2-a3f9-40c0-bf2d-1ca96b8451b8
Timestamp: 2025-12-30 21:55:05 UTC

Questions:
Anything in SettingsApplied we should check to confirm audio config was applied?
Any known issues with flux-general-en or aura-2-thalia-en causing artifacts?
Can someone check the request ID above for anything unusual?
Happy to provide recordings or additional logs. Thanks!
Want me to shorten it further or add anything?

plucky joltBOT
#

Thanks for asking your question. Please be sure to reply with as much detail as possible so the community can assist you efficiently.
-# If you haven't done so, ensure your Discord and Github profiles are linked to Deepgram so you can earn points to redeem on cool stuff just by being active!

#

It looks like we're missing some important information to help debug your issue. Would you mind providing us with the following details in a reply?

  • The programming language you are working in (e.g. JavaScript, Python).
full willow
#

Javascript using node js

terse badge
#

Hey @full willow thanks for the message. Can you share some audio static that you're hearing?

We heard one other report of static a couple days ago. When they sent a sample, I could not hear any static though. The only common issue I can think of is that using an audio container format for TTS sends back header info in each message. If the client application attempts to interpret a JSON payload as raw audio bytes, this can result in static / pops.

You can DM me a file or e-mail me at first.last.