#TTS API reading out emotion tags as well
Hi @rocky hedge, were you able to solve that? The same thing happens to me.
Hi! Did you solve it?
Hi! Same question here
I find that if it is reading it out I have to run it multiple times. Eventually I can get it to work if I make small changes as I run the software. Sucks, but right now it is where the software is at. Learned to work around it but hope they improve that at some point in the future.
What @sharp grail said. You have to regenerate. My hit ratio on emotional emphasis tags is about 1 in 6: one take uses the tag correctly and the rest go in the trash. After around 10 regenerations the returned sample starts getting severely mutated, so wait, generate other sentences, and come back later. Repeat.
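The regenerate-and-pick workflow described above could be scripted roughly like this. It is only a sketch: `generate_candidates`, `take_NN.mp3`, and the `candidates` folder are made-up names, `synthesize` is any callable that returns audio chunks for a text, and the default of 6 takes and the cap of 10 simply mirror the numbers reported in this thread. It does not detect whether the tag was spoken, so you still have to listen to the takes and pick one.

```python
from pathlib import Path
from typing import Callable, Iterable, List

# Cap taken from the report above that output degrades after ~10 regenerations.
MAX_ATTEMPTS = 10


def generate_candidates(
    synthesize: Callable[[str], Iterable[bytes]],
    text: str,
    attempts: int = 6,
    out_dir: str = "candidates",
) -> List[Path]:
    """Render `text` several times and save each take for manual review."""
    attempts = min(attempts, MAX_ATTEMPTS)
    target = Path(out_dir)
    target.mkdir(parents=True, exist_ok=True)

    paths: List[Path] = []
    for i in range(1, attempts + 1):
        path = target / f"take_{i:02d}.mp3"
        with open(path, "wb") as f:
            # Each call to `synthesize` is a fresh generation of the same text.
            for chunk in synthesize(text):
                f.write(chunk)
        paths.append(path)
    return paths
```

With the SDK calls quoted in this thread, `synthesize` could be something like `lambda t: session.tts(TTSRequest(text=t), backend="s1")`.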
I came here for the same problem. I'm using almost exactly one of the samples from the SDK developer docs, and it's reading the emotion tags out loud. This doesn't happen in the playground, only when using the API:
```python
from fish_audio_sdk import Session, TTSRequest

session = Session("xxx")
request = TTSRequest(text="(happy) I'm excited to share this!")
with open("test.mp3", "wb") as f:
    for chunk in session.tts(request):
        f.write(chunk)
```
Well, is it possible that the playground uses S1 by default, while the API uses v1.5? I don't see how to specify the voice model in an API request using the Python SDK.
Hi! Make sure you are using S1 for emotion tag control
We are updating our docs. In the meantime, please refer to this repo for setting the backend in Python: https://github.com/fishaudio/fish-audio-python
```python
from fish_audio_sdk import Session, TTSRequest

session = Session("xxx")  # your API key
request = TTSRequest(
    reference_id="79973fbd9f374e21b29354e0477389b6",
    text="(happy) It brings me such joy to connect with you",
    format="mp3",
    # top_p=0.8,        # adjust as needed (0.0–1.0)
    # temperature=0.7,  # adjust as needed (0.0–2.0 typically)
)
with open("output1.mp3", "wb") as f:
    # The model is selected here, per call, not on Session or TTSRequest.
    for chunk in session.tts(request, backend='s1'):
        f.write(chunk)
```
See this for the Python SDK.
Can you provide an example of how to do that? I don't see any way to specify the model to use on either the Session or the TTSRequest (Session has some model related calls, but from what I can tell it looks like that is actually creating reference_ids, not models)
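Going by the staff snippet in this thread, the model is not set on `Session` or `TTSRequest` at all; it is passed as the `backend` keyword on `session.tts`. A small wrapper like the one below makes that explicit so the default cannot be forgotten. This is a sketch, not part of the SDK: `tts_to_file` is a made-up helper, and it only assumes that `session.tts(request, backend=...)` yields byte chunks, as in the snippet quoted in this thread.

```python
from typing import Any


def tts_to_file(session: Any, request: Any, path: str, backend: str = "s1") -> int:
    """Stream TTS audio to `path` with an explicit backend; return bytes written."""
    written = 0
    with open(path, "wb") as f:
        # The backend is chosen per call, mirroring session.tts(request, backend='s1').
        for chunk in session.tts(request, backend=backend):
            f.write(chunk)
            written += len(chunk)
    return written
```

Usage would then look like `tts_to_file(session, request, "output1.mp3")`, with `session` and `request` built as in the earlier messages.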