It's happening intermittently. Here are the logs from one of the affected runs today:
2024-11-19 21:47:35 INFO Downloading S3 mp3 file locally to pass to transcribe function from front_end/mp3_files/e52f92d7b8b347c592545c06dc114518.mp3
2024-11-19 21:47:36 INFO Kicking off transcribe function
2024-11-19 21:47:36 INFO Connecting to Deepgram API
2024-11-19 21:47:36 INFO LOCAL Audio duration: 55.60 minutes for ./downloads/e52f92d7b8b347c592545c06dc114518podcast_s3_downloaded.mp3
2024-11-19 21:47:36 INFO Connected. Passing local mp3 podcast file - ./downloads/e52f92d7b8b347c592545c06dc114518podcast_s3_downloaded.mp3 - to API
2024-11-19 21:47:37 INFO API response from Deepgram returned
2024-11-19 21:47:37 INFO {'metadata': {'transaction_key': 'deprecated', 'request_id': '04e3a98d-0289-430d-be02-ee2f85d72771'...
2024-11-19 21:48:07 INFO Grabbing full transcription and word index from Deepgram API
2024-11-19 21:48:07 INFO Posting to slack
2024-11-19 21:48:07 INFO Message: ❌ ERROR: transcript from Deepgram less than 4000 characters
2024-11-19 21:48:07 INFO <Response [200]>
This request returned only the first few sentences of a 55-minute file: 46 words.
I just kicked it off again and it returned the full transcript (10,729 words) as expected. Logs below:
Downloading S3 mp3 file locally to pass to transcribe function from front_end/mp3_files/e52f92d7b8b347c592545c06dc114518.mp3
Kicking off transcribe function
Connecting to Deepgram API
LOCAL Audio duration: 55.60 minutes for ./downloads/e52f92d7b8b347c592545c06dc114518podcast_s3_downloaded.mp3
Connected. Passing local mp3 podcast file - ./downloads/e52f92d7b8b347c592545c06dc114518podcast_s3_downloaded.mp3 - to API
API response from Deepgram returned
{'metadata': {'transaction_key': 'deprecated', 'request_id': '5543dcb0-f4d0-4ad8-a9a0-940d30f49b53'...
Grabbing full transcription and word index from Deepgram API
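Since the same file succeeds on a second attempt, one possible stopgap is to check the word count against the audio duration and retry when the transcript looks truncated. The sketch below is only an illustration, not the code in my pipeline: `transcribe` is a hypothetical callable standing in for whatever function sends the file to Deepgram and returns the transcript text, and the 50 words-per-minute floor, the attempt count, and the delay are assumed values that would need tuning.

```python
import time
from typing import Callable

# Assumed lower bound on speech rate; a spoken-word podcast should be well above it.
MIN_WORDS_PER_MINUTE = 50


def looks_truncated(transcript: str, audio_minutes: float,
                    min_wpm: float = MIN_WORDS_PER_MINUTE) -> bool:
    """Return True when the transcript is implausibly short for the audio length."""
    return len(transcript.split()) < audio_minutes * min_wpm


def transcribe_with_retry(transcribe: Callable[[str], str], path: str,
                          audio_minutes: float, attempts: int = 3,
                          delay_seconds: float = 5.0) -> str:
    """Call the hypothetical `transcribe(path)` until the result looks complete."""
    transcript = ""
    for attempt in range(1, attempts + 1):
        transcript = transcribe(path)
        if not looks_truncated(transcript, audio_minutes):
            return transcript
        # Wait before retrying, but not after the final attempt.
        if attempt < attempts:
            time.sleep(delay_seconds)
    raise RuntimeError(
        f"transcript still looks truncated after {attempts} attempts "
        f"({len(transcript.split())} words for {audio_minutes:.2f} minutes)"
    )
```

With the numbers from the runs above, a 46-word result for 55.60 minutes of audio would be flagged and retried, while the 10,729-word result would pass. This only works around the symptom; it doesn't explain why the first request came back short.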