#Need help to change from google stt to deepgram

1 messages · Page 1 of 1 (latest)

faint spindle Dec 10, 2024, 10:17 PM

I tried like this attachment. plz let me know how can I do this.

📎 message.txt

outer shoalBOT Dec 10, 2024, 10:17 PM

Hey there! This channel allows you to ask Kapa.ai for assistance. But it looks like you haven't connected your Discord profile to your Deepgram account yet. You can do so at https://community.deepgram.com - by doing so, you'll be able to ask questions in #1115960287183511643 if you need human assistance!

Deepgram Community

Deepgram’s community offers tons of resources to support you on your voice AI journey

feral solsticeBOT Dec 10, 2024, 10:17 PM

Hi @faint spindle. I'm kapa.ai, your friendly support bot. I will try to answer your question by searching through the knowledge base. :wave:

I apologize, but I don't see any attachment in your question. However, I can provide some general guidance on migrating from Google Speech-to-Text (STT) to Deepgram based on the information available in the knowledge sources.

To migrate from Google STT to Deepgram, you'll need to follow these general steps:

Create a Deepgram account and obtain an API key Migrating From Google Speech-to-Text (STT) to Deepgram (https://developers.deepgram.com/docs/migrating-from-google-speech-to-text-stt-to-deepgram#getting-started).
Familiarize yourself with Deepgram's API and understand the differences from Google STT Migrating From Google Speech-to-Text (STT) to Deepgram (https://developers.deepgram.com/docs/migrating-from-google-speech-to-text-stt-to-deepgram#differences).
Test your audio with Deepgram to ensure compatibility and accuracy.
Update your code to use Deepgram's API instead of Google's. This will involve changing the API calls and handling the response format differences.
Configure your environment for Deepgram. The requirements vary depending on whether you're using pre-recorded audio or live streaming Configure Environment (https://developers.deepgram.com/docs/migrating-from-google-speech-to-text-stt-to-deepgram#configure-environment).

Here's a basic example of how to transcribe audio using Deepgram in Python:

from deepgram import Deepgram
import asyncio, json

DEEPGRAM_API_KEY = 'YOUR_API_KEY'
PATH_TO_FILE = 'some/file.wav'

async def main():
# Initializes the Deepgram SDK
deepgram = Deepgram(DEEPGRAM_API_KEY)
# Open the audio file
with open(PATH_TO_FILE, 'rb') as audio:
# ...or replace mimetype as appropriate
source = {'buffer': audio, 'mimetype': 'audio/wav'}
response = await deepgram.transcription.prerecorded(source, {'punctuate': True})
print(json.dumps(response, indent=4))

asyncio.run(main())


This is different from Google's approach, which might look like this:

```python
from google.cloud import speech

client = speech.SpeechClient()

gcs_uri = 'gs://my-bucket/audio.raw'
audio = {"uri": gcs_uri}

config = speech.RecognitionConfig(
    encoding=speech.RecognitionConfig.AudioEncoding.LINEAR16,
    sample_rate_hertz=16000,
    language_code="en-US",
)

response = client.recognize(config=config, audio=audio)

for result in response.results:
    print("Transcript: {}".format(result.alternatives[0].transcript))

Remember to handle the differences in response formats and features between Google STT and Deepgram. For example, Deepgram provides confidence scores and word timings by default, which you may need to handle differently in your application.

For more detailed information on the migration process and best practices, please refer to the Migration Best Practices (https://developers.deepgram.com/docs/migrating-from-google-speech-to-text-stt-to-deepgram#migration-best-practices) section of the Deepgram documentation.

If you have a specific code snippet or more details about your current implementation, I'd be happy to provide more targeted advice on how to adapt it for Deepgram.

Note: you can ask me a follow up question by @ mentioning me again :speech_balloon:

@faint spindle Kapa.ai is still learning and improving, please let me know how I did by reacting below.