#Lot of variation in generated voice through sockets

1 messages · Page 1 of 1 (latest)

jovial nimbus
#

"voice_settings": {
"stability": 0.75,
"similarity_boost": 0.8
},

Using Sarah voice
model: eleven_turbo_v2

if you listen the attached recording, you'd notice how sweet the voice towards the end (last 20 seconds)

odd apex
#

The length of your audio is too long, these variations happen the longer the audio being created is. I suggest you break your content into smaller chunks. Or use Projects as it is more stable in these cases..

jovial nimbus
#

thanks @odd apex I am using websockets during a phone call to generate so looks like I d have to reconnect after every 30 seconds or so. should that solve the problem?

odd apex
#

I'm not sure about that, seems like a hard thinkg to solve if you need to use Websockets..

jovial nimbus
#

What about with stream?

#

POST request would be slow i guess

odd apex
#

most likely

#

unfortunately apart from projects and even there, once you get over around 2000 characters in one api call you can start to see these variations..

jovial nimbus
#

hmm in that case reconnect on websocket, should perhaps solve the issue?

#

the voice is getting extra sweet as we go long 😄