Hi,
Reading this doc, I see there are different nova2 models with specific use cases.
Seeing one of old posts, it seems like "primary difference is the data that was used to train the model." And "[I] would need to try audio to determine which is better suited."
It makes sense that different audio was used (presumably different audio quality etc), but does it mean both models are trained the same in terms of amount of data?
Context is I currently use -general model but am trying to see if other models (likely -phonecall or -converstaionalai) would perform any better, as our application is converstaion ai over phone call.
But what I am worried is if general would generally perform better due to amount of data used to train each model.
Any tips/guidance would be very appreciated 🙏
Another side question is, do you happen to have any guidance on measuring quality of transcriptions?
I know confidence value is returned from Deepgram, but wonder if there is any recommended way of measuring transcription quality.
Thanks!
Model options allows you to supply a model to use to process submitted audio.