I want the bot to listen to each user in a call and transcribe their speech to text, then send the transcript as a file once the call has ended. I already know how to record and store the raw audio data, but how would I go about transcribing it?
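
To make the question concrete, here is a minimal sketch of the shape the pipeline could take. It rests on assumptions that are not stated above: that the stored audio is raw 16-bit PCM at 48 kHz stereo, that recordings are kept as one byte string per user, and that some speech-to-text backend is available. The `transcribe` callable is a hypothetical placeholder for that backend, and the names `pcm_to_wav_bytes` and `build_transcript` are made up for illustration. Only the standard-library `wave` module is used, since many transcription tools expect a WAV container rather than headerless PCM.

```python
import io
import wave
from typing import Callable, Dict


def pcm_to_wav_bytes(
    pcm: bytes,
    sample_rate: int = 48000,  # assumed; use whatever rate the audio was recorded at
    channels: int = 2,         # assumed stereo
    sample_width: int = 2,     # assumed 16-bit samples (2 bytes each)
) -> bytes:
    """Wrap headerless PCM data in an in-memory WAV container."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(sample_width)
        wav.setframerate(sample_rate)
        wav.writeframes(pcm)
    return buf.getvalue()


def build_transcript(
    recordings: Dict[str, bytes],
    transcribe: Callable[[bytes], str],
) -> str:
    """Return one "user: text" line per recorded user.

    `transcribe` is a hypothetical stand-in for a speech-to-text backend:
    it takes WAV bytes and returns the recognized text.
    """
    lines = []
    for user, pcm in recordings.items():
        text = transcribe(pcm_to_wav_bytes(pcm))
        lines.append(f"{user}: {text}")
    return "\n".join(lines) + "\n"
```

When the call ends, the string returned by `build_transcript` could be written to a text file and sent. The open part of the question is what to plug in as `transcribe`.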