Ok so basically elevenlabs websocket sends the audio in chunks via websocket. I can get it work using the python demo, where you play straight from the stream straight from the listen function here https://docs.elevenlabs.io/api-reference/text-to-speech-websockets
But I'm struggling when:
I need to send the audio recieved over websocket to a client, and then have the client play it
Whether that client is in python or javascript. I can hear the little delay between playing the consecutive audio chunks, if I do play the audio (bytes) I receive from elevenlabs. Sometimes that delay is in the middle of the words. Any advice on this? 🙂