Slow realtime transcription speeds with nova and nova2 | Deepgram | Page 1

azure frost Nov 7, 2023, 4:20 PM

#

I'm experiencing 3-4s of latency with both the nova and nova2 models. for testing purposes I'm reading from an audio file and streaming it to deepgram using the code below. What am I doing wrong?

@unkempt crest

const deepgramLive =  deepgramClient.transcription.live({
  punctuate: true,
  interim_results: false,
  language: "en-US",
  // model: "nova-2-ea",
  model: "phonecall",
  tier: "nova",
  encoding: "mulaw",
  sample_rate: 8000,
  keywords: keywords
});

crude juniperBOT Nov 7, 2023, 4:20 PM

#

Thanks for asking your question. Please be sure to reply with as much detail as possible so we can assist you efficiently. Such as:

Provide the request_id if you've a question about a transcription response.
The options you used or the api.deepgram.com URL you sent your request to, including parameters.
Any code snippets you can include.
Any audio you can include, or if you can't share it here please email it to us at [email protected] and provide a link to this thread.

azure frost Nov 7, 2023, 4:21 PM

#

here's a recent request ID 49b21d08-0677-4245-9ef1-cae2e956a640

unkempt crest Nov 7, 2023, 4:34 PM

#

okay, so the real-time transcription will wait for a natural pause in the speech to finalise the utterance.

#

i would turn on interim results... you'll get a response several times a second. that utterance is complete when one includes is_final: true in the response

#

check out how mine is working here: https://live-nextjs-starter.vercel.app/

Create Next App

Generated by create next app

#

And although it is typescript, you can see here my options https://github.com/deepgram-starters/live-nextjs-starter/blob/main/app/microphone.tsx#L80-L84

GitHub

live-nextjs-starter/app/microphone.tsx at main · deepgram-starters/...

Contribute to deepgram-starters/live-nextjs-starter development by creating an account on GitHub.

#Slow realtime transcription speeds with nova and nova2