#how to start using it

1 messages · Page 1 of 1 (latest)

warm zenith
#

I m a beginner and just started exploring these things it would be really helpful if someone told me how to use the voice models provided here like text to speech like is there any opensource for it and can i add them to wokada. thnk u😄

open lintel
#

the models on this server are not for text to speech

hearty jungle
# warm zenith I m a beginner and just started exploring these things it would be really helpfu...

text to speech

There are different Text To Speech (TTS) AIs:

GPT So Vits: RVC isn't as good as GPT So Vits for tts, but gpt so vits (few shot tts, which means needs just a lil training for models) can't use rvc models (and viceversa), and its only limited to: english, chinese, Cantonese, japanese & korean, if you wanna check gpt so vits instead, read https://docs.aihub.gg/tts/gpt-sovits/

Freemium 11labs: Easy way to do TTS is https://elevenlabs.io/, you can't use RVC model on this but its a mostly premium easy way for good quality TTS

FishSpeech: FishSpeech is a 0 shot (no explicit training needed) TTS, if you got a good pc you can use it locally else use their site

You can check TTS in our tts index

With RVC Models:

RVC is natively for Speech To Speech, but forks such as Applio have built in tts (using Microsoft Edge TTS to make a generated tts audio, which i suggest you to choose a tts model that is the same gender and language of the rvc model you wanna use, and then convert it with rvc)

If you wanna do tts locally with RVC Voice Models (if you got a good pc):

  • You can get Applio in our docs

If you don't got a good pc you can do tts with RVC Voice Models on cloud: