#Hi <@1093927595063660645> How does the break time (It works with https APIs call) feature work wit
1 messages · Page 1 of 1 (latest)
The <break time="x.xs"> feature works with ElevenLabs' API, including HTTP calls, to introduce natural pauses in speech. However, WebSocket communication does not natively support SSML tags like <break> for pauses. Instead, you can consider the following approaches to introduce pauses between words when using WebSocket communication:
-
Chunked Input: Since WebSocket streaming generates audio from partial text input, you can send smaller chunks of text with pauses implied by the segmentation. For example:
"Hello, how are you?" [Pause for 2 seconds] "I am fine, thank you."This method relies on breaking the input text into separate chunks and sending them sequentially with delays between transmissions.
-
Manual Timing: You can programmatically introduce pauses by controlling the timing of when you send the next chunk of text. For example, after sending one chunk, wait for a specific duration before sending the next.
-
Alternative Symbols: If pauses are critical, you can experiment with alternative symbols like dashes (
-or—) or ellipses (...) in your text. These can sometimes create a sense of hesitation or pause in the generated speech, though they are less consistent than<break>tags.
For more precise control over pauses, using the HTTP API with SSML tags is recommended. If you need further assistance, feel free to ask!
"Hello, how are you?"
[Pause for 2 seconds]
"I am fine, thank you."
This will not work since text length can vary and pausing for 2 seconds will be of no help for longer rest