#🧬│ai-chat
1 messages · Page 357 of 1
Since both SoVitsSVC and RVC are STS
old is gold my dude just uh have to give time but i say its not worth it 100 to 150 good sample crazy man
Well, nope unless you use another TTS to make first the audio then use it as an input for RVC
Bro it's not good 😭
We used to have so vits SVC models
Everyone transitioned to RVC
It's way better
didn't under can uh elobrate a bit
yeah i guess uh r righ it gives better voice in lesss time
TTS = text to speech
STS = speech to speech
So-VITS-SVC and RVC are both STS, not TTS
yeahh we have to use another tts i knw that brother
The only way to use them as "TTS" is to use first another TTS program to make a TRS audio, then change the voice of that audio by using it as an input in RVC
If you want, there's other TTS options, ofcourse there will be less models and less popular than RVC though
thats the reason i was asking are they good ? the ones that are in voice models
It got better quality and it's still maintained ye
That depends on how good the person trained the model, you can listen to the audio samples or try it out yourself
are there any anime girl model and are they good in expressing emotions?
bro i need pre-trained i canot train coz i don't have good samples
There aren't much TTS models so I'm not sure on your anime girl question
About emotions, can depend, F5, Fish Speech and GPT-SoVITS are good with emotions
And I'm not telling you to train, I'm telling you to test the models yourself, RVC is good but it depends on how good other users also trained the model
well i can use rvc it will work just have to work little more and thats not a problem
Those ones in the channel are already trained by other users
Ofcourse the best TTS is 11labs, but it's paid and not open source
ahh uh mean those who are hosting the model ?
yh i knw but right now i am flat broke
The models posted by users in #1175430844685484042 are trained by model makers in this server
can uh suggest me few with good voice ?
Some models can sound more realistic, some others a lil bit less, it's better you test the model you want to use or use it yourself
that's why i am asking for recommendation
I can just tell you the ones in our guides, but I haven't tested 10k+ models
i knw but anyways thank i will chk myself too
Alright, goodluck
I gave you those 2, u could check them out
thanks dude
Yw
yeah trying to download
In case you are having issues downloading, To Download a Model from Weights.gg:
- Login
- Click the 3 dots at the right of the image of the model
- Click download
- Download Anyways
- Unzip the zip, and you might wanna rename the pth and index since all models on weights are renamed as 'model'
uh read my mind lol
Weights.gg is our Partner, it's a site where models get hosted too
It's ofc still RVC models
Thanks i appreciate the help
You're welcome
by LUSBERT!?!???
well can't be helped free always comes with hardships
Yep
Zamn
Huh?
i have to write extra code tts and thn sts
this is teas voice model
iirc it was a harrasment campaign or some bullshit
oh dear heavens!
5 lines of ass code!
how difficult!
you could use Edge TTS for the TTS, it's by Microsoft but free API to use, good with quality and multilingual, but it only has premade voices and not that good with emotion
bro i am frustrated i was stuck in a f*cking error for 2 whole days can't write a single line of code right now 4 lines are hell for me
i used google tts but nahh voice was shit
why the fuck r u censoring ur own shi
literally use any tts model
even kokoro
rvc will just en-de-shittify it
Google tts is another tts unrelated to edge tts
not talking abt model
yeah
anyways guyz see ya, will see uh soon
keep ur shit to urself please
trying to get attention how miserable a persona can be 
ok monkey butt headass boy
@rich path @west burrow let's not do useless drama here, and Ofcourse respect each other.
i ain even doin shit Lol
im just talkin about Bullshiot
Fork
🍴
i was but they say give respect take respect
Lmfao gg
Insulting someone is not accepted here, let's be civil
firefox is fucked
ok monkey butt headass boy
this makes no sense LMfao
its just random ass words stringed together to sound like an insult
but it just means that ur a monkeybutt
that's barely insult that's like 4th grade shir
@tranquil lantern monkeybutt headass boy
That still ain't enough to make someone cry. But if someone call you with a hard slur, you'd getting cooked haha. 
lowkey id be the guy spewing slurs
THATS SO OLD LMAO
Hello
Guys help pls. What the app and voice model would be best for hide voice in Investigated video
I think L's voice would be good
Sorry, I didn't really understand how covers are created. I was in a telegram group of an Italian developer who had created a site where covers could be made, but now it's closed and the site that he had provided where to retrieve the models it was closed. he said he is now operating as admin on this server. could you explain to me how it works now?
I would need the model of an Italian rapper "Guè"
I be looking at the voice models section and I am just thinkin
What are some of these models even being used for?
What use is somebody getting out of "Jeremy finkleforter from highschool 5 years ago"
how you create a custom voice cover?
what's your pc gpu
ig ai covers or realtime lol
all that sites use is RVC, it's better you use that, what's ur pc gpu
rtx 3070 why
why
Because computer power is needed for doing heavy tasks, just like how games and blender rendering does too for example
anyways, ur pc gpu is good
I just dont even know how to make stuff
I wanted like to choose a song, create a clone voice with AI using clips of videos of me talking, and then put my clown voice singing that music
how could I do that
Everything you need to know is in our docs
Last update: Oct 21, 2024
I would suggest you to use locally applio
which is a fork of rvc, fork means a modified version
But I need to donwload something?
you need to first train yourself your own voice model, then separate the vocals and instrumentals of the song, then inference (use model) on the vocals, then mix the converted vocals with the instrumentals
Yeah, it's local, which means it runs on your pc
I suggest you that since your GPU is good
can't be online?
it can be cloud, which uses remote good pc, but you will have to deal with limited GPU time on the free tier
It's used for people who got a bad pc, which isn't really your case tbh
you still want me to give you the links?
for both cloud and local there are written guides dw
all you gotta do is read them
if you're wondering about youtube videos, they are pretty outdated
Hey guys
I'm a junior js developer (mainly frontend) and currently using this platform - https://sparkengine.ai and building some AI tools cause I like building them without coding. Where can I find more no-code tools?
which tools uh want i am also building a ai assistant but my project is a big one
monkey butt headass
Which chunk I can use in rtx 3060 mobile 6gb (notebook) in Ai voice changer?
@late nest sorry but promos aren't allowed here
tell the guide link u followed in #🔍│help-w-okada
is mmvc down? I cannot get it to start for the life of me. I would reinstall it but I can't find the link
Stupid ass boy. 
You insult like what a bully from an elemental school would do. I insult with something harder than yours.
For W-Okada, go to #🔍│help-w-okada. The chat isn't where you asking where to download W-Okada.
Server mad dead now
bro left
mikus been here for
Guys does someone knows when the day resets on Weights.com? I tried to look around the site but im dumb 
You meant Weights.gg?
Please be specific on which problem you encountered.
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
yeah
I'm not sure what you mean by the day resetting. Do you mean something like this on Weights?
like, i try to create images and it says theres a limit of generations per day on the F12 console
I don't know how many features are limited for non premium users. I got premium, and I found no problem using image generator. It's more likely there are too many queue at the time trying to generate.
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
Elon Musk's Grok 3 AI Is A GAMECHANGER
Game deez nutter. 
fix yo shi and maybe people'll like it
@elder willow in #🤖│bots
do you know tts i want to make a model that can speak and when chatting will speak and will have to move when speaking
I want to do it on reactjs only but I don't know if it's possible
No, thanks. I can work on my own.
Promo is NOT allowed here @leaden halo
Any careers channel?
Nope, sorry.
hi guys my name is abhiraj shah and i am NOT the ceo of a new company
@cyan onyx Promo is not allowed.
No worries!
any content creators on here?
roomie was here
@tepid basin did you ever play humanornot.ai ?
No
great bc it has a javascript vulnerability 😭
Thinking about investing in the mcchicken
huh
It’s a classic for a reason. Savor the satisfying crunch of our juicy chicken patty, topped with shredded lettuce and just the right amount of creamy mayonnaise, all served on a perfectly toasted bun. The McChicken has 390 calories.
Hi guys, I have a question. Can we monetize an AI influencer? And if so, how? I've seen a lot of videos about this, but I'm not sure, and I don't want to waste my time. I'm too old to waste time without a payback
With ads
no with onlyfans and fanvues
@jaunty osprey asking for NSFW help isn't accepted here.
Question: How good is Grok3 compared to ChatGPT and Deepseek?
I've been feeding ChatGPT with sources to learn an obscure language and it generate texts pretty well (I ask it to make an essay about genetical engineering and how this is good to our future and it's pretty decent)
So I want to know if Grok can do better
I haven't tested grok
Hmmm.. I've been wondering for a while, is there any known and more or less reliable workflow for " recreating " character ai locally?
any particular model / backend / frontend etc?
Looking for something as advanced ( and functional ) as it can get
either way, any info / ideas appreciated ~
ew
^ 100% disgusting
guys im trying to upload my model and i put the hugging face link and it still isnt accepting it, it says (must be a hugging face link or weights.gg)
and im putting the hugging face link and it still wont allow
作編曲MIX&Mastering映像 / 柊マグネタイト
絵 / 瀬奈悠太
歌 / 重音テトSV
<歌詞>
どうしてすぐ食べてしまうの
空腹で苦しんでし罵倒
Q.行進で行くマクドナルド可?
悪くない
ひる限定すぐに食べたいの
常備ねえおひる食べBUGER
Q.超うめえひるてりやき食べようか?
金による
昼飯ねえ空きっ腹で参上
おうちにする? いやマクドナルド
Q.定番を食いたいダメCancelは?
昼食える
どうしてすぐ知ってしまうの
どうしてすぐ行ってしまうの
どうしてすぐ食べてしまうかな
(セセセセットセット)
ビッグマックしか勝たん
フィレオフィッシュのえだまめコーン
二重乾酪(ダブルチーズ)しか勝たん
ランチタイムの貯金でひるまック
おひるだけ210分コー...
holy chat this neew tupac data sety go crazyyy at 116 this me rapping
guys how to add model to tts
how do i train a model? can someone point me towards a direction
this mixed with adobes audio enhancer would sound so real
you could probably put some audio of tupac before it too so it really gets the job done
bro this is at 160
bro 320 sooo crazy just updated
the breathing parts sound weird tho
yeah thats because its me rapping and im white lol
hi
hello is anyone here who is good in setup and use of RVC i was trying to use but i cannot i don't know how can anyone please help me its urgent.
Nope, that has nothing to do if a model sounds robotic.
If a model's breaths sound robotic that's because you didn't clean the dataset properly or you under/or overtrained it.
Will reply in #✨│ai-help
Please never use Voice.ai again. This site is a scam, and eats more performance that any of W-Okada.
I try W-Okada but the changed voice sucks and doesn't even sound good, I have a really good pc aswell😭
Actually the results will depend on the model you use and your own voice. (also your settings)
ik
I think you might have used the original version of W-Okada, you didn't set anything right, and you gave up that way instead of getting help from AI Hub by Weights.
Also.. I would suggest you to use Deiteris' fork instead of OG W-Okada, including reading the docs.
What's your GPU?
Thanks vtarcelia for corrections, Nick088 for contributions. Most technical information comes from deiteris.
Latest Version b2332 from December 2024
RTX 5000 series no support yet. Not clear when they will work
Translations (outdated but works)
German: https://rentry.co/ForkVoiceChangerGuide_de
T...
Refrigerator full of game cartridges. 🔥
well my processor is a AMD Ryzen 7 7800X3D 8-core Processor, I also have a NVIDIA GeForce RTX 4070
Cool set up you have.
Hi Unity.
Nice setup, you should be able to use Deiteris' fork with no problems.
yeah, I actually already tried that lol
no performance issues
Might need a couple more 4070s for the voice changer tbh
yeah
That's good. Altho maybe your issue is getting a proper model that works with you.
I'm joking btw
ik ik
Runs good on my 4070
lol, I already uninstalled it xd
If i had an actual GPU i would test realtime with my BM800 mic that will arrive on next week hehe.
use the wokada deiteris fork 🔥
also the quality can depend on which model you use
yeah
Nick i think i got motivated and ordered a mic.
It won't load but based on the colors of the preview I bet its that spongebob gif isnt it
a good one
spongebob throws voice.ai in the trash
nice
there's no way you have any performance issues unless when running a demanding game
yeah ik, it's probably because of my low quality goofy ah mic🙏
btw which F0 detector would you recommend?
Rmvpe
oh okay, I was using fcpe this whole time😅
fcpe is lower quality
it uses lower resources but has worse quality
yeah, is Rmvpe the highest quality one?
Very technically crepe with a low hop length is but you need very clean audio
I have crepe_full and crepe_tiny lol
I imagine full is better?
It would be crepe full but use rmvpe
okay thanks!👍
oh lol
crepe beats rmvpe when has a hop length of 64
so which one is ultimately the best for quality if you have a really good pc?
a good model
rmvpe f0 (both for training the model and for realtime inference)
Wokada has 2 main versions:
- Original made by Wok
- Deiteris fork (modified version) made by Deiteris
each version has it's own updates
the latest deiteris fork has way better performance and quality than the latest original
you probably downloaded an old original version many months ago
and want to update it
pls tell me ur pc gpu in #🔍│help-w-okada
Is the a Applio Collab that's able to use the new hifigan pretrains?
oh I read it as refinegan somehow 💀
I gotta buy new eyes
my bad I did mean the new ones, I wrote the wrong one
this is because the latest stable release 3.2.8-bugfix don't contain the latest experimental code
refinegan is the new ones
does that only work with the no ui applio colab? or can I do that with the ui colab
both
just remove that part I told you
Alright, thank you
it would only work in UI
noUI needs to pass a vocoder parameter, that's not there yet
oh it wasn't added there yet?
@pliant laurel then nvm, just UI
Thank you, that's ok. I perfer the UI version anyway
Hey guys. Maybe someone knows a good and cheap API to create AI avatars from photos/videos and stream them?
why does the lightning ai applio have more features than the local one am i tweaking 
what?
there's more training options in the lightning ai version of applio
than the local version I have
and i can't find a more recent version anywhere
which lightning.ai link are u using?
and also what local version
https://lightning.ai/guilhermecardoso1/studios/applio-latest?section=all&query=applio
https://huggingface.co/IAHispano/Applio/tree/main/Compiled
Just both of the ones linked on the guide page
that's because the latest precompiled stable applio version is 3.2.8-bugfix, while the lightning.ai notebook uses the main branch, which has newer experimental updates that aren't in a stable release yet
my tupac ai new model with suno is really good
damn bruh this is good
do i need python/any other things installed for okada
thanks bro
yo
do you know abt this?
hello, are there people here who make complete music videos like me?
what to do in Realtime Voice Changer when I speak, my voice is choppy?
to many of your models were bad so they were removed along with your role
how do people do this i wanna do this for a kick user https://www.youtube.com/watch?v=EC9Fyz4exII
🔴 LIVE: AI TRUMP Q&A (ask questions!)
#deepseek #ai #grok3
Tip $2 and put a topic in your donation message: https://streamelements.com/jazza42261/tip
join the discord to vote on questions: https://discord.gg/Hj72XDtFZF
clone any voice with elevenlabs: https://elevenlabs.io/?from=partnercarson4100
buy the dev a coffee: https://ko-fi.com/jazza1...
is there any more beatrice v2 voice models?
@dusky grove any account requesting or sharing is strictly prohibited
sorry
So u guys have any good ai voice models for girl voice trolling
For W-Okada, go to #🔍│help-w-okada
any Developer here
girl idk is there
Hello, what's the best TTS you guys could recommend me? Especially for Japanese
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- Hina's Mod AICoverGen WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Hina's Modified Original W-Okada's Realtime Voice Changer, Google Colab
- FaceFusion UI, by Nick088 Google Colab
- FaceFusion NO UI, by Nick088 Google Colab
- EasyGUI, by Rejekts Google Colab
- 🆕 Music Source Separation Training (Inference), by Jarredou & Makidanye Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
I'm really struggling to use colabs, they all seem to be broken
I used ultimate and hina usually but both are broken
elaborate:
- ur pc gpu
- what google colab link you used
- what you want to do
- the issue/error
Are the voice models open source?
wdym
Can it be freely used without restrictions on social media?
The Voice. ai rvc models
I use ultimate google colab and it just doesn't show any mp3 after ive generated it just says 0 seconds
i just need a simple one which works
I tried applio but i couldn't figure it out and also rvc disconnected didnt work
I just need a realistic female voice translator that these catfish vids use on YouTube.
the rvc technology is, but don't expect anyone disclose the model dataset used
ditch that paywalled garbage
Any recommendation for free sources that is realistic?
Clownfish isn't good
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
voice.ai sucks
it uses RVC on the background, which is free and open source
but makes u pay for them and does distributed training with ur pc
I'm guessing you want realtime voice changing right? tell your pc gpu in #🔍│help-w-okada
None of them are working 😦
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
you need to explain the things I said to make so people are able to help you
Does anyone know how to make your own song parody with changed lyrics?
For that you gotta record yourself singing the changed lyrics and then pass your vocals thru a model.
okay'
hi y'all, quick question
i used to use the voice models with clownfish vc but are they useable with voicemod ?
don't use those
you want a realtime voice changer right?
tell ur pc gpu in #🔍│help-w-okada
I'm not very tech savvy so i'm not sure on some of these but my pc gpu is fine, i have used multiple diff google colab links, I just want to use singers voicers and apply them to other songs and there is a few diff eras
errors*
You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
tell me the name of your pc gpu
also send the links of the colabs you used
My pc gpu is fine because it was working yday it just randomly stopped today
okay lemme send 1 sec
and also applio, rv disconnected
I'm asking you to tell me the name
there's many different gpus
rvc disconnected got its end of life update, it won't ever be updated again and it's old
is a specific model not working on those colabs? if so, send the model download link
Do the things I told you on your pc
.
Do you remember the model download link
Hi, how do i find these exact models listed in the Best models in UVR beta? https://docs.aihub.gg/rvc/resources/dataset-isolation/
don't worry sir the spider on the bread will get him
mm this model seems fine
did you try what I said? #🧬│ai-chat message
which part sorry
the one to check your pc gpu
I was talking about your pc one
to check if it's good enough to run it locally
bc google colab is a cloud (remote good pc) service only for people with a bad pc
also I have just tested this on the Applio colab and it works all fine
I have an idea for a YouTube channel where AI-generated celebrities teach you different skills. Imagine Danny DeVito showing you how to fish or Arnold Schwarzenegger teaching you how to drive. It’ll be funny and a bit silly. My plan is to use text-to-speech AI to create the videos, then use AI to generate the celebrity voices. Alternatively, we could have someone record the script and then use AI to mimic the celebrity voices. The final videos will be AI-generated. I’m looking to collaborate with someone on this project.
update: codename fork's up to date with applio changes
@arctic tinsel Damn man, I just tested ur hifi pretrains
Truly outstanding 🔥
( Just a lil short test on rather low-mid quality -roformer set, [ 32 secs bs4 ] and it already outperformed stock ones by at least 75% )
Hello, I want to ask you something. I cannot view the Turkish voice models by typing "Turk" in the voice models channel. Could I have made incomplete or incorrect choices while registering? Can you help me with how to fix it?
There's an app or website for Android or mobile that we can change the lyrics of existing songs?
nick
?
I don't think so
Okay thanks
هلا
@covert lake
type turkish
Also, You can search rvc ai voice models at:
- #1175430844685484042
- In #🔍│find-models , Do /find with @hidden grotto
- https://weights.gg/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.ai-hub.wtf/essentials/how-to-make-voice-models/
:wave: @covert lake, How can I help?
Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image
thanks
yw
where is the download link
currently I am in second year of CSE and I was making Deepfake detection project, now I got stucked in training model, means I take dataset of images from Kaggle and while training model it requires more RAM and GPU, I used colab's TPU , but it is free for 4 hours only, and I need more time because my dataset are too large
and if I train the model on 10,000 images then it provides only 0.25 accuracy and if I use 60,000 images then it provides me 0.77 accuracy,
can anyone suggest me with this?
serbian artillery is guided by god
Download link of what? What are you doing/want to do?
yet still cannot put down Kosovo 😂
@heavy bone promos ain't allowed
Anyone have the AI documenting where it tells you which one you should use for clean vocals
Hello everyone! How to create something simalar to this project on the youtube video (it uses old photos with paralax effects and probably AI Kling, as I guessed). Thanks you for your afvices! https://www.youtube.com/watch?v=T7X-9qkd0iQ&ab_channel=ZAJACFOTO.COM-REMASTERING
By the way, that 4 minutes project took roughly 2 months of creator's time, but quality of work is amazing!
promos ain't allowed
people, what should I do if I have a gtx 560 ti?
buy something better
Are you mentally insane?
You cant say such shit to a most likely minor
@tepid basin you may want to look at this lol
Why ping me 😭
You dont see the problem here?
Bc Sky probably wants you to nag Noobies for what he said to a supposed "minor"
I don't do critical thinking but I'll forward your concern to staff
how can i run ai models locally
RVC models?
Welp, you can install RVC locally by reading a guide i'll give you.
Last update: Apr 01, 2024
Yup, of course.
Wassup
For further help, use the #✨│ai-help channel 🙂
wassup my clone
anyone has the download because when I follow a tutorial the downloaded file is always different
tf kinda tutorial youre downloading brah
what do you guys use for rt voices? beatrice?
Beatrice v2 models are Faster and more Lightweight than RVC v2 models, but have Less Quality and there are Less Publicly Available models
RVC V2 is better
ok, thank you
Tell your PC GPU and what program download link you used in #🔍│help-w-okada
I will check if u use the newer one
Wokada deiteris fork, the best quality and most optimized one
Lemme guess, you seen YouTube tuts?
Please tell your PC GPU and program download link in #🔍│help-w-okada
That is the right channel for this
where can I download that one?
Explained you in #🔍│help-w-okada
Anyone wanna help me sell subscriptions to my social media automation tool?
Use ollama
I don't share my email account to anyone.
Do you know of any AI to help me generate an image from another?
basically pretty much every AI image generator using image prompt

Hey what is the best voice changer for male to female ?
is there any deep voice models?
tell ur pc gpu in #🔍│help-w-okada
You can search rvc ai voice models at:
- #1175430844685484042
- In #🔍│find-models , Do /find with @hidden grotto
- https://weights.gg/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.ai-hub.wtf/essentials/how-to-make-voice-models/
:wave: @covert lake, How can I help?
Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image
how tf can i make the ai sound good in spanish
Are you all AI
To generate images for free (text2img), either:
- Use @elder willow in https://discord.com/channels/1159260121998827560/1202754985255764060 (It's powered by DALLE3, from ChatGPT+), pretty easy
- Another easy and good ways with weighs.gg are:
- Use /image with @hidden grotto in https://discord.com/channels/1159260121998827560/1202754985255764060
- Create an image on their site https://www.weights.gg/ (which you can also use LoRAs, Low-Rank Adaptations, basically a small trained additional model to adjust your generation)
- Use Open Source Models like stable diffusion & flux that could be a bit **harder **but good, what's ur pc gpu? As you could run them locally (on ur pc) or on cloud (remote good pc)
:wave: @covert lake, How can I help?
Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image
be sure to get a good quality model, or dataset if you’re training one
also use help channels
Thank you

Dont die
Make a game
Fail with the game
make a new game
succeed with the game
Make lots of money
Make a new game thats literally the same but new story
Make lots and lots and lots and lots and lots of money
Name your studio Rockstar
does anyone know a free alternative to elevenlabs ?
There are different Text To Speech (TTS) AIs:
GPT So Vits: RVC isn't as good as GPT So Vits for tts, but gpt so vits (few shot tts, which means needs just a lil training for models) can't use rvc models (and viceversa), and its only limited to: english, chinese & japanese, if you wanna check gpt so vits instead, read https://docs.ai-hub.wtf/tts/gpt-sovits/
Freemium 11labs: An easy way to do TTS is https://elevenlabs.io/, you can't use RVC model on this but its a mostly premium easy way for good quality TTS
FishSpeech: FishSpeech is a 0 shot (no explicit training needed) TTS, if you got a good pc you can use it locally else use their site
With RVC Models:
RVC is natively for Speech To Speech, but forks such as ilaria rvc mainline & applio have built in tts (using Microsoft Edge TTS to make a generated tts audio, which i suggest you to choose a tts model that is the same gender and language of the rvc model you wanna use, and then convert it with rvc)
If you wanna do tts locally with RVC Voice Models (if you got a good pc):
If you don't got a good pc you can do tts with RVC Voice Models on cloud:
-
Ilaria RVC Zero (Running on A100 GPU, free fasted rvc on cloud) and the guide
-
Use Applio UI Colab (with google colab T4 free daily limit gpu)
-
if you don't wanna use edge tts, you could try another tts ai from our tts index and use the output as an input in rvc
hey
can anyone help me out why my rvc trained models sound like robotic vocoder
idk were i am going wrong
new colab instance?
see the output for the install cell
what does that mean sorry - yes for colab
Traceback (most recent call last):
File "/content/program_ml/app.py", line 1, in <module>
import gradio as gr
ModuleNotFoundError: No module named 'gradio'
How do you find a jet ai model?
I have my own question though
I haven't messed with ai in multiple years, so there is probably new software. So what is the best app I can run locally on my pc to create tts that allows custom models and stuff?
I also need it to be able to emotion to the voice
There are different Text To Speech (TTS) AIs:
GPT So Vits: RVC isn't as good as GPT So Vits for tts, but gpt so vits (few shot tts, which means needs just a lil training for models) can't use rvc models (and viceversa), and its only limited to: english, chinese & japanese, if you wanna check gpt so vits instead, read https://docs.ai-hub.wtf/tts/gpt-sovits/
Freemium 11labs: An easy way to do TTS is https://elevenlabs.io/, you can't use RVC model on this but its a mostly premium easy way for good quality TTS
FishSpeech: FishSpeech is a 0 shot (no explicit training needed) TTS, if you got a good pc you can use it locally else use their site
With RVC Models:
RVC is natively for Speech To Speech, but forks such as ilaria rvc mainline & applio have built in tts (using Microsoft Edge TTS to make a generated tts audio, which i suggest you to choose a tts model that is the same gender and language of the rvc model you wanna use, and then convert it with rvc)
If you wanna do tts locally with RVC Voice Models (if you got a good pc):
If you don't got a good pc you can do tts with RVC Voice Models on cloud:
-
Ilaria RVC Zero (Running on A100 GPU, free fasted rvc on cloud) and the guide
-
Use Applio UI Colab (with google colab T4 free daily limit gpu)
-
if you don't wanna use edge tts, you could try another tts ai from our tts index and use the output as an input in rvc
OMG NICK GOT HACKED, HE IS A BOT NOW
or he always was 🤔
What is that supposed to mean?
Real
Nuh uh. I don't accept friend request from random people, so don't worry about me.
When I use a voice model to speak with the voice I have to use a virtual cable. All that is fine and also working. Though once I do that I dont hear myself anymore and I'd like to do that. I there a solution?
For W-Okada, let's go to #🔍│help-w-okada.
oh wah, missed that channel. Sorry!
huh'
This channel has been set up to receive official Discord announcements for admins and moderators of Community servers. We'll let you know about important updates, such as new moderation features or changes to your server's eligibility for Server Discovery, here.
You can change which channel these messages are sent to at any time inside Server Settings. We recommend choosing your staff channel, as some information may be sensitive to your server.
Thanks for choosing Discord as the place to build your community!
100% rial 
Did you copy a message from an admin channel of a Discord server you managing? 
ded chat
There are two chats: #✦│chat and #🧬│ai-chat.
Hi.
yus
this is new o- o
What is this Gradio UI for?
uvr ui
Elevenlabs ai is good
As I know it’s just a UI front end used for easy development of web application or chatbot kind of things
YUPPPPI
HELLO GM
Hi
Can someone tell me why character consistency is difficult?
Or anyone know how is this possible?
And how the characters can have customised in faces and features as per input image?
Is it possible??
If not why not?
If yes then how?
Do anyone know any tool.
?
some are broken
applio ui and noui should work
a colab update broke some notebooks
of all kinds of AIs
hallo
😦
I was using it for real time voice changing any alternatives in games my gpu was too weak for it :/
Hey I installed okada tts, how could I get new models for it?
How to I find ai voices from some ppl?
I made a cover with scott knuckles and jennifer amy models and it sounded so awful
If youre using RVC, use these for models ^
yeah I use the voice models from that channel but It doesnt seem to work with tts version
or you only find tts ones in that website then?
yep pretty much dead chat
Question, do you really need to normalize the backing vocals?
is this a mixing question
I'd say yeah, normalize the backings for consistency, but don't let it be as loud as the lead vocals
because backings supposed to be subtle (depends on the music though)
hi, im fairly new to making my own AI but I know how to code,
i want to make an ai for a game i made in python using pytorch. I trained some models and even got claude 3.7 and it made a completely new structure but it still sucks
Could someone help me to train a good model and maybe explain to me a couple of things?
Might take an hour or two
@ancient swan Hmmm, got lost a lil in the doc but, is the dim_t conversion / necessity to align it, still a thing?
also how about the ui's overlap?
so, to be more exact, say I wanna use the ' dereverb_echo_mbr_fused v2 '
now, if to assume we aren't meant to tweak or align anything related to dim_t, but focus on chunk_size, it's 352800 samples, that'd be / 44100 = 8
So now, most important question:
- Config has " num_overlap: 4 " should it be aligned with the yaml's chunk ( 8 ) or nah?
- Does ui's overlap have any kind of effect? ( as in, override the yaml ) Specifically these two:
edit1: ( Hmmm.. seems like advanced tab's overlap does work I think )
edit2: Correct me if I'm wrong; I read up a lil more and it seems it can go both ways for most rofo if not for all recent ones?
- chunk_size = 352800, dim_t = 801, segment = 8
- chunk_size = 485100, dim_t = 1101, segment = 11
and I suppose, a bonus,
3. Is overlap's misalignment a major issue? ( any form of degradation / discontinuity happens? )
hola

Tell ur PC GPU in #🔍│help-w-okada
Show a screenshot of ur wokada in #🔍│help-w-okada
where i download the voice changer?
tell ur pc gpu in #🔍│help-w-okada
Hell o
ai
guys, what's wrong with the weights site?, it stopped working, error Internal Server Error
idk
Hello guys
i need some help to find something !!
int his video at 0:55 i love the way the voice is
do you guys think its possible to find an ai that can do the same kind of voice with an accent
https://www.youtube.com/watch?v=4sA3TyjS3k0
Hey I created a VC for myself can I get approved
chunk_size = 485100 corresponds to dim_t = 1101
overlap doesnt relate to them, still the higher the better consistency (less noticable cut parts) unless you are using the older patch. for most cases 4-8 is enough and power of two numbers are desirable (otherwise the higher number like 50 could decrease the SDR).
augh
Staff Applications Open
We're looking for dedicated team members to help grow and manage our community! If you're passionate about AI and want to contribute, apply now!
Click here to apply!
VC of which?
get approved of what?
Are you developing your own fork of W-Okada or you trained a voice model and wanted to get approved in this server?
@solar torrent no I'm trying to create a custom voice channel
Ehh, if you mean by #jointocreate, you'll have to be in a voice chat for a while and then you'll get approved to create a custom voice channel in this server.
And also please use the reply feature instead of pinging me in a separated message.
Why the #promo channel has been removed but maybe because of the copyright?
Or maybe it's just changed the name/title?
keep overlap at 2, i think no allignements are needed, you can just use whatever values the configs are coming with
promos ain’t allowed anymore
Oh thanks ☹️
Ramadan mubarak
ramadhan mubarak to you too, talha
where do i download
elaborate:
- what you want to download
- what's your pc gpu
- what do you want to do
just download the voice changer in question
realtime voice changer for calls?
also, what's your pc gpu still?
yes for calls
and ion got a gpu
are you really sure? you got a pc right?
let's talk in #🔍│help-w-okada
yes
tho, according to yaml values overlap, mathematically would be either 8 or 11, according to models I use. So why 2?
people have tested different overlap values and their speed and their both audible and scores quality changes
and the only thing that improves when you put something like 8 overlap instead of 2, sdr rises by 0.1-0.2, but no audible differences
so putting anything higher than 2 is just not worth it
same with batch size, it doesn't affect anything so now everyone just uses batch size 1 for inference
higher batch sizes should increase the speed of inference at the cost of more vram being consumed but that just doesn't happen for some reason
I mean, I did notice different alignment of low frequency range / respiratory range, becomes less " chaotic "
the higher the overlap
I tested 8, 11 and custom 16
speed however doesn't matter for me, you know me, I favor accuracy over anything else
it is only slower but still has virtually same vram (4 GB on MSST colab), unless batch_size is changed
oh yeah, also In higher overlap and settings generally, some clicks are ignored ( background related clicks )
if you can hear the difference then go for it what can i say, i can't really spend 20 minutes inferencing an audio that will sound mildly different from audio inferenced in 12 seconds
don't use it that high with apollo models though, it'll ruin them
use batch_size = 2 in the model config file and there wont be noticable clicks/pops
nah, clicks and pops now go from overlap setting only i'm pretty sure, i think jarredou fixed a bug that was causing batch size 1 to cause some weird clicks and pops
Well then, I'll stick to formula values. I can wait 3 mins no issues, given I extract few mins by few mins, not all at once ( my workflow )
was just curious on proper methodology / mathematically correct settings
in the zfturbo script there are no clicks for batch_size = 2 and any overlap value, but the higher overlap the less cut parts and more consistency (less noticable on >8)
yeah, but i think batch size 1 now also shouldn't cause any problems like it used to
btw, does batch have any effect on inference?
still couldn't make it work? hmm, i wonder what could be causing it not to work
tried 2 in models where 1 is in config and tbf, absolutely no difference
yeah, no budge
pretty sure it doesn't
maybe I'll reinstall all the packages but then, probs no point given uvr works just fine
unless msst does bring audible difference ( which I think it doesn't cause why would it
if you match the settings then it should be the same
ah ye
ig it should work
Also, I'm curious.. Anyone already tested gpt-sovits v3?
released yesterday iirc
( I personally haven't tried it at all, incl. v1 and v2 but supposedly, v3 compared to v2 doesn't " average ish " over fine-tune dataset and is more faithful to reference audio )
There's also Zonos that came out and ngl.. that thing is instantly outstanding for zero-shot
You can test it on the site: https://playground.zyphra.com
And repo's in here:
https://github.com/Zyphra/Zonos
( There's also an exp windows fork, just need to read the desc )
Your email is on the bottom left btw
oh, lol
good point, but then, that's the one I barely even use, almost tras/risky content-dedicated one with fake info on it but, good point still 🤔
@night lake Either way, you should give it a try, it's actually op and actually outputs / uses 44.1khz
compared to whatever else
Really?
Will try then
But imo tts is overrated
from what I've read here n there or on their disc, windows exp has some issues with some jp related stuff
Any amd support?
Sadge
Butttt, from all zero-shot I tested
only that one was capable of handling Esdeath's voice
In any case, looking forward to it and potentially fine-tuning, in future
( currently not supported
That's wild is her voice that unique
No no, that's not my point
my point is, it's super robust, compared to sovits / F5 / E2
yeyeeee
As for tts being overrated.. well
voice to voice is amazing and all
but if you're like me and want to, in future, play around roleplaying llm + stable diffusion + input voice recog. / tts output ( silly tavern ish workflow pretty much
that's some nice shit
Hmm I see
where is the screaming emoji
Staff Applications Open
We're looking for dedicated team members to help grow and manage our community! If you're passionate about AI and want to contribute, apply now!
Click here to apply!
lfg ?
dont mind if i ask , is there any website that generated ai music covers?
ai covers or u mean just the music?
AI generated music
you can try weights.com, suno or udio
yw
yw?
it's an abbreviation for "you're welcome"
so can you extend a song in weights? like upload an audio and extend the song from a specific time stamp?
you can try using a reference track
like instrumental reference track?
nvm
bizarro
@flint zenith never come back. thank you.
🥶
hello im cris nice to meet eevryone
im need help make a good ai cast for my realities shows of big brother or survivor
Hi everyone, I hope you're all doing well! I'm looking for a specialized website that allowed users to upload recordings of their voice to create a personalized voice model saved online. This platform even let you install different AI models and use your cloned voice to sing over songs. Unfortunately, I lost the link and can’t find it anymore. Has anyone come across this site or know of something similar? Any help would be greatly appreciated. Thank you so much!
@covert lake Hmmm, you up?
hmmm.. about to test gpt-sovits but 🤔 curious if 3-bert should be empty? Tried to find anything on that but, nope, no info
guess I'll try first lol ( edit, all works ye. Now to figure out optimal settings.. gosh
Why the promos ain't allow anymore its because of copyright right?
I believe that's the case yeah. I've seen some people started to get account claims on content they uploaded even months ago n stuff
Oh okay
Likely some people here kept promoting some random job applications in promo chat with no trace of origin.
oh, it got that bad?
Oh yeah
I know such things been here for a while. I meant it more in a " it got that that bad? " manner
someone should make a song maker based on the artist
like for example "Lady Gaga Song, Synthpop
and u put the lyrics
and u can upload audio files for ideas
up to 5
Really? If it comes to DAW music making software, nah I just suck at replicating other artists styles.
i'm making a lady gaga album
lyric snippet for song on album "she's breaking the system, breaking it down"
Laga Dyga 🔥
this one my friend did and they won't tell me what they used
Can anyone guide me how to install the voice model once I have downloaded it from voice-models .com ? I am using tortoise tts btw
Bc of scams and other things that would have kept it hard to moderate
Oh yeah okay
Sorry but I don't know much of Gpt so vits
@chilly lake Yo when next (Kaggle) Applio update so the spilt infer bug would be fixed?
I have a voice model that isn't in #🔍│find-models or #1175430844685484042 Link is here: #1345696074005876808
next official release? dunno
you can always clone the latest code
@chilly lake is it me or this looks like applio with built in music separation
seems like an applio x aicovergen/aicover maker lol
got nice models tho
well, rvc originally had uvr built in
so no surprise someone decided to bring it back
welcome back mainline
yeah
mainline with a cool ui
ngl I like applio's ui
it got a cool theme and easy
yeah, it does
oh well the colab is dead bc of using uv
applio has uvr plugin as well
no
when it will be back, the engineer will tell me and #📰│dev-updates will be updated
Any estimates though
duks
idk when Hina will update it
I can just suggest u to use Applio for now
hello I need a model for using it on my male voice to turn it into women voice
tell ur pc gpu in #🔍│help-w-okada
ım using colab till my rtx 3070 came from my order
let's talk in #🔍│help-w-okada
This guy got 19k subs with a single video. https://www.youtube.com/watch?v=r2OckF3Gk_8&t=210s
What AI Image generator is he using to create the black and white animated pictures, any idea? Please@me if u reply
Can someone make a big scarr ai voice pls😭🙏🏻
⊣∷ᒷᒷℸ ̣ ╎リ⊣ᓭ! ╎ ᓵᔑᒲᒷ ╎リ !¡ᒷᔑᓵᒷ, ╎ ᓵᔑᒲᒷ ⎓∷𝙹ᒲ !¡ꖎᔑリᒷℸ ̣ ꖌᒷ!¡ꖎᒷ∷-452ʖ, ᒲ|| リᔑᒲᒷ ╎ᓭ ᒲᔑ∷⍊╎リ ᔑリ↸ ╎'ᒲ ᔑリ ᔑꖎ╎ᒷリ.
purple
minecraft enchanting table
where do i msg if im trying to find this ai voice i found this one on this video and im trying to find it so i can change it a bit for a use on this edit
hey guys, so I am running the AMD fork of A1111 stable diffusion, but years later, I don't think there has been any updates. Is there any upgrades I can give to my Stable diffusion to get better outputs?
I am using an 8GB VRAM GPU (yea, shush), and I am using the DPM++ 2M Karras, but i wana know if there are better stuff
that should be enough for SDXL and 1.5
flux models could give better results but 12 GB vram is bare minimum for it and Nvidia gpus are more recommended as idk if the rocm version could work well
imma be real, I don't use the SDXL nor 1.5 checkpoints.
i use some anime checkpoints. However, Imma need suggestions to expand my negative prompt and some prompts to improve finger issue which I noticed some AIs have now "resolved that" this year.
plus, would hypernetworks help or the styles.csv ?
it is actually finetuned from the base SDXL/pony/1.5 model
in the civitai page it should tell if it is based on which SD
is that DirectML one?
Staff Applications Open
We're looking for dedicated team members to help grow and manage our community! If you're passionate about AI and want to contribute, apply now!
Click here to apply!
Yes
SD.Next would be a better option
but it requires some effort to install hip sdk and tensile libraries
what's your gpu?
rx 6600
i mostly use a checkpoint called perfect world which I just checked and its base model is SD 1.5
I use A1111 SD
I also noticed I have some LoRas of certain anime characters, but when using the LoRa, the output image ends up totally destroyed, as if the dataset somehow got contaminated by NIGHTSHADE (fugg nightshade)
You've just got trolled for using certain LoRA models lmao. Unless you're using some badly trained Stable 1.5 models you downloaded from somewhere.
that is true
You could've download the good ones from CivitAI instead. Make sure you don't just click the download button before you use it, try look at some generated image posts under the model to see if it generates good or not.
which i have been doing even since I started using SD
Hi
For anything about W-Okada, go to #🔍│help-w-okada.
okok thank youu
Hiii so im trying to change the lyrics with suno for the song, suno gets the verses right but ruins chorus
No, thanks. This is not where you promote your thing.
Hmmm.. funny enough, gpt-sovits has vram leakage
at least in v3 sovits training department, ain't sure about the rest
good morning to all members i have a question how you manage to have cleaner ai cover voice
i tried to converts vocals using weight g models but it fails the vocals have some glitch etc also the result obtain is far from the selected artist
This is not where you post an entire codebase.
DIlly ding, dilly dong! A new RegalHyperus drum model just released!
Sweet Dreams (Are Made of This) (PuppetMaster cover) (Drum model no. 581)
scammers about to get rich 
Uh oh
Imagine scammers using "original" W-Okada for phone calls. 
How ai works here
Which AI? Each AI works on its own.
This ai chat
#🧬│ai-chat is basically just another normal chat.
it depends all on how the model was trained
@warped dune we allow only english here
hm thats a little trivial but u could give mel band a shot on https://mvsep.com/
DIlly ding, dilly dong! A new RegalHyperus drum model just released!
Energy (Drum model no. 582)
ai chat was verified chat before weights took ai hub :/
and the lounge is general chat
czy ktoś z polski może zrobić model andrzeja dudy
sup yolo
Are you sure you remembered it right? Because #✦│chat was once named the verified chat.
No, thanks. If there's no game demo, I ain't gonna take a survey anyways.
Well, he deleted it real fast. I could see that.
I'm hiring prompt engineers. Anyone interested?
Why are you hiring a prompt engineer?
Because I'm a prompt engineer and my workload is too heavy at this point I have about 7 pending projects I can't finish alone. I offer compensation for time, subscriptions to premium services (v0.dev), hourly rate, and training
DM if interested thx. Sorry if this wasn't the appropriate place
Hai
Mel loss goes up to 30, I'm going crazy, I can't figure it out
mel loss should not go up
how
The sources I researched say that if it increases, the naturalness of the sound will deteriorate.
So what can I do to increase it?
you're more likely to have messed up some training parameters and collapsed the model
normally the default rvc parameters and decent pretrains shouldn't cause that issue
I don't know, I will try writing code to fix loss mel
which code are you working on?
One more time - mel loss does not go up.. ever
it has to converge down to some value
you prob meant g/total?
g total may go up if fm loss goes up
thank you
Guys, what's the best way to mix songs with AI for free? I'm trying to combine a music track with a vocals track to make it sound perfect.
I'm trying to find an AI that can perfectly sync vocals to music in the best way possible.
There's an AI-assisted track mixing website that was once used by a team of Kanye East, I just don't remember the name. But this is what I can do about mixing.
Hi
breh
INFO:testoo:loss_disc=4.044, loss_gen=0.000, loss_fm=0.000,loss_mel=0.100, loss_kl=0.100 These are my model values. After working for a day, I coded it myself and achieved a great result 🙂
Damn what happend to Mike Dean Doing the mixing
Staff Applications Open
We're looking for dedicated team members to help grow and manage our community! If you're passionate about AI and want to contribute, apply now!
Click here to apply!
Nitrous also got him.
Oh yea, I forgot about that
What’s the curling best model for duets
I have extreme AGI
Hii guys we are organising a hackathon in our college and looking for monetary sponsors is this the right place??
I'm hiring prompt engineers for $12/hr 30 hours a week if interested
Tell me more
I also tried to make an agi
Why does voice challenger work like a bug for me, like it says how...... it's going and at the end it's Korean?
speaks in syllables
aye anybody know if its possible to use raltime ai in dc
guys whats the best tool to make a tts in hebrew with trained model?
like elevenlabs can make it kinda realistic which is great but they don't support hebrew
bro left to improve his DEX
Question, what's the best AI program that can create really good Bas Relief images from a source image?
Hi, I love what you do, you don't have to answer if it bothers you, by the way I have a project and I need to do a study: “What are the 3 most repetitive and boring tasks in your business right now?”. Thanks
🔥 Join the Revolution! Be Part of the Creator Magic Community: https://mrc.fm/cmc
Join me as I dive deep into conversation with Sesame AI's groundbreaking voice assistant, powered by their innovative Conversational Speech Model (CSM). This AI doesn't just talk—it understands context, emotion, and nuance like never before. Founded by tech vision...
Lol.
Any prompt engineers I can train and hire?
hi dude
hey you don't bully my friend
fine......... ughhhh i hate this......
that's what i thought
When did I bully your friend?
Please speak English.
I still don't understand what you're trying to say, but let me make that readable.
Nah, I don't talk politics here.
Im eliot scott Obama
Sigh.
Please chill with your emotes.
Im chilling with my brother emote 🙅 🧏 🤙
Im canada
Mexico peperoni
No, you aren't reading my messages that right.
I don't do crack, sorry.
😐
Bloodborne
Well, that's pretty much it, kid.
Fuck.
why does ai art still suck ass
Is there something similar to Mellotron but newer?
Explain:
- your PC GPU
- what guide link u followed
- the issue
In #🔍│help-w-okada
We don't allow promos here
Explain:
- your PC GPU
- what guide link u followed
- the issue
In #🔍│help-w-okada
There are different Text To Speech (TTS) AIs:
GPT So Vits: RVC isn't as good as GPT So Vits for tts, but gpt so vits (few shot tts, which means needs just a lil training for models) can't use rvc models (and viceversa), and its only limited to: english, chinese & japanese, if you wanna check gpt so vits instead, read https://docs.ai-hub.wtf/tts/gpt-sovits/
Freemium 11labs: Easy way to do TTS is https://elevenlabs.io/, you can't use RVC model on this but its a mostly premium easy way for good quality TTS
FishSpeech: FishSpeech is a 0 shot (no explicit training needed) TTS, if you got a good pc you can use it locally else use their site
You can check TTS in our tts index
With RVC Models:
RVC is natively for Speech To Speech, but forks such as ilaria rvc mainline & applio have built in tts (using Microsoft Edge TTS to make a generated tts audio, which i suggest you to choose a tts model that is the same gender and language of the rvc model you wanna use, and then convert it with rvc)
If you wanna do tts locally with RVC Voice Models (if you got a good pc):
If you don't got a good pc you can do tts with RVC Voice Models on cloud:
-
Ilaria RVC Zero (Running on A100 GPU, free fasted rvc on cloud) and the guide
-
Use Applio UI Colab (with google colab T4 free daily limit gpu)
-
You could try another tts from our tts index and use the output as an input in rvc
Maybe you can find something better in our tts index, but I wouldn't expect that much
similarly, do you still hate DLSS?
dunno what that is
Maybe you can try flux.1-dev since it's the best open source image model, LoRAs might help
Use better models and prompts
no
Balls
then yeah that it won’t look good if you don’t use good models like flux
That's a true real fact. I like real art that made by human better.
DLSS is a feature found in newer NVIDIA GeForce RTX GPUs. This feature uses AI technology to generate some frames for a game you're playing, rather than using raw power to process.
If you ever feel like you wanna compete with AI fans about art drawing, I suggest try to pick a pencil, draw anything you like and tell me what you feel about it. ✅
We weren’t comparing it with human art btw
i was afk while my model trained the and the runtime after it finished where can i find the pth file
humblo
check #📰│dev-updates
does anyone know any good realistic ai for tts in hebrew that is realistic and emotional i also can get trained models in hebrew if that helps and referance for emotions of someone speaking to speak like him
I think I gave you some TTS options #🧬│ai-chat message
AI HUB Docs


