#ai-chat
ye
Oh, I didn't know that, because I just knew about this
ooh i have same meme but other text xdd
so i've been working on integrating an Open WebUI bot into a discord bot but every time i try to run the install command it gives me an ENOENT error. no clue what's causing it. the log files say it doesn't see the json even though the json is there, unmoved, and unchanged
does any body have any ideas?
well, i fixed it. now im getting a sharding error. YAY
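For anyone hitting a similar ENOENT ("no such file or directory") on a JSON file that looks like it is right there: a common cause is that a relative path is resolved against the directory the command was launched from, not the directory the file sits in. The fix used above was not shared, so this is only a minimal sketch of how to check that; `describe_path` and the file name `config.json` are made up for illustration.

```python
from pathlib import Path
from typing import Optional


def describe_path(path_str: str, base_dir: Optional[Path] = None) -> dict:
    """Show where a (possibly relative) path resolves to and whether a file is there."""
    # Relative paths are resolved against the current working directory
    # unless another base directory is passed in.
    base = base_dir if base_dir is not None else Path.cwd()
    resolved = (base / path_str).resolve()
    return {
        "base": str(base),
        "resolved": str(resolved),
        "exists": resolved.is_file(),
    }
```

Calling `describe_path("config.json")` from the same terminal that runs the install command shows which folder the tool is really looking in.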
This is so real ngl
Lol
that describes it perfectly
release this
why can I not make characters on the weights gg app
Is there a local method of creating sound effects from a text description?
stable audio tools
Generate Image is slow for me
My daily life 
hi
Is there a sort of AI website builder for free?
Uh
hi
i'm here to make AI covers of caseoh
chatgpt, otherwise it's a prompting skill issue
Do u need help? If so, u can say ur PC GPU
You can search rvc ai voice models at:
- #1175430844685484042
- In #find-models, do /find with @hidden grotto
- https://weights.gg/ (login required)
- https://huggingface.co/models (but watch out because on Hugging Face there aren't only RVC AI voice models)
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.ai-hub.wtf/essentials/how-to-make-voice-models/
:wave: @covert lake, How can I help?
Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image
real
guys why my voice ai laggging?
Use deiteris' fork instead, we don't support voice ai
i use start http is it?
You're following an old YouTube tut of wokada, don't use that, tell ur PC GPU in #help-w-okada
K
has anyone been able to fine tune fish speech locally? the training process throws up a whole bunch of errors about lora keys not being loaded and the resulting models ive tried training aren't able to speak properly
Yep, also read the guide.
Guys, why does my voice with this voicechanger sound like just a whisper? this didn't happen an hour ago
Try increasing the out
Else show ur wokada in #help-w-okada
Guys which model will be most similar to voice from changed speech from Death Note. I mean changed voice from L's notebook or video recordered by Kira 1, 2
You can try searching any Death Note models at:
- #1175430844685484042
- In #find-models, do /find with @hidden grotto
- https://weights.gg/ (login required)
- https://huggingface.co/models (but watch out because on Hugging Face there aren't only RVC AI voice models)
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isn't one, you can:
- #1159289738314919936
- #1191429836321849435
- Make it yourself with our docs guides https://docs.ai-hub.wtf/essentials/how-to-make-voice-models/
hi
Question for the ppl here
How are people making these animations of plankton or SpongeBob on TikTok? I'm trying to do my own with the songs of my choice
But I can't find how to make these animations anywhere
Use stable diffusion for videos imo
Hello. can you please give me a link to the github where you can download AI Voice Changer
where can I find the virtual audio cable line 1?
Thanks vtarcelia for corrections, Nick088 for contributions. Most technical information comes from deiteris.
Latest Version b2332 from December 2024
Translations (outdated)
German: https://rentry.co/ForkVoiceChangerGuide_de
Turkish: https://rentry.co/ForkVoiceChangerGuide_tr
Hindi: https://rentry...
for help ask in #help-w-okada
What should I do if my microphone is not selected in this "in" field?
Does your mic work in Discord or anything else
When people type "280 Epoch" in the title, does it mean I have to put 280 somewhere? Literally first time using this vcc, or any for that matter
it means how long it was trained
Are "in" "out" and "mon" important? I don't hear any difference, just the volume
did you mean changing your voice realtime or
cause that's not what rvc stands for
i mean real time voice changer
yes
windows
gtx1660 4gb
what
why
what are you smoking? there's only the 1650 4 GB or the 1660 6 GB
little guide to reasoning models : https://synaptiks.ai/p/from-base-models-to-reasoning-models
I love ai voice changer
hi
what do epochs mean, im new here
epochs are a unit of measuring the training cycles of the AI model
basically the amount of times the model went over its dataset and learned from it
they don't mean how good is the model, it's just an info provided on how they trained the model by the model maker
More ≠ better
Less ≠ better
There's no way to determine how good an RVC model is until you try it out or listen to the audio samples, if there are any
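To make the epoch explanation concrete: one epoch is one full pass over the dataset, so the number of weight updates depends on dataset size and batch size as well as the epoch count. A rough sketch, with all numbers invented for illustration (they are not RVC defaults):

```python
import math


def total_training_steps(num_samples: int, batch_size: int, epochs: int) -> int:
    """Number of optimizer steps for a run: steps per pass times number of passes."""
    # One epoch = one pass over every sample; the last batch may be partial.
    steps_per_epoch = math.ceil(num_samples / batch_size)
    return steps_per_epoch * epochs


# Hypothetical example: 1000 audio slices, batch size 8, a "280 Epoch" model.
print(total_training_steps(1000, 8, 280))
```

This is why the same "280 epochs" label can mean very different amounts of training on a small dataset versus a large one, and why it says nothing on its own about quality.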
@reef marten please don't promote
@flint saffron use #1159289738314919936 not #1159516963014451302
Hey, I was wondering if anyone has any information for like Image/video to Video AI rendering software? Like I specifically want one where I can render it myself would be preferable. I'm assuming most/all of the online ones are either bad or paid services.
I'm using AMD CPU and GPU btw. So one that facilitates that field would be nice. Thank you for reading this smol wall.
like text/image to video ?
Yes
I've been meaning to make some wallpapers animate with some old photos I had, video to video if possible would be interesting too.
text/image to video AIs:
- Locally (runs on ur pc):
  - pyramid flow (Image/Text to Video)
  - cogvideox 1.5 5b: Image to Video, Text to Video
- Cloud (remote good pc, running on an online website for example, easier to setup):
  - Weights.gg
  - pyramid flow (Image/Text to Video) (HuggingFace Space)
  - OpenAI Sora (paid only, in some countries)
  - lumalabs
  - Hailuo AI
I don't think there's any video to video tho
also AMD gpu isn't the best for ai
Is it just slow rendering or just bad output?
I meant as compatibility support
you might wanna research if there's amd support for those AIs
Nvidia has the best support bc of their CUDA platform and TensorRT
there's also ZLUDA which is a CUDA compatibility layer for AMD, might wanna check this
you can try SD/flux animate though mostly limited its feasibility to just short clips/animated GIFs
Hey there. Regarding live voice changers, is okada still the best or has anything changed? I hear there is also eleven labs but not sure if it works live. And they probably don't have so many voices
Deiteris' fork of W-Okada works best. The original version has had no updates as of now.
Thanks I will check this out.
Red name is crazy
oh ? will check this too I guess
that was pointed at namari's role color
𤪠oupsie dummy me
Im sleeping awake half dead brain is not braining
Ever had a customer support call where an AI either completely failed or totally surprised you?
lol nope coz everytime the support calll was picked by a human 
Hey everyone, last week we released Spark Engine v1 after over a year of public beta!
It's the most powerful no-code AI sandbox available right now with over 80 models to generate text, music, images, videos, web search and more.
Just looking for people to let me know what they think at the moment
Do people still use tortoise tts?
tortoise tts is slow and has crappy zero-shot
ai bad at maths
when did tupac make this?
Well things are changing... At least from the way I'm seeing it. Currently I'm working with such a company that allow businesses to build AI voice agents using drag and drop. And the kind of videos I'm seeing on Instagram regarding that is just insane.
help me to create a picture of two people holding hands on a table in gold
well yeah that's true, since ai is coming to the world I think we are going to see a lot of changes which we never expected, also what you mentioned here is super interesting. is there anything more to know about them?
if you want something which is hard to understand by ai versions, I suggest you to create LoRa models and train them with the type of images you want to create, sometimes ai can't really understand what you mean since it's not human but when you create a LoRa model and train it with the images u want, it can help a lot.
also you can use good ai models like weights, stable diffusion, kling ai or flux 1 model
guys how do I increase ai voice changer's quality?
I tried to change a audio's voice,it works. but kinda low quality
You're using the latest Applio?
First link on there is best w-okada realtime
Amazing guide
Deiteris fork real nice
Wait I'm double stupid
like change a audio file's voice
is it free?
Ye its a fork of what you're already using
More updated more good
Only other ways to improve quality are
• Make sure your input audio is clean (shitty stem separations can cause some issues. If it's clean and well-enunciated, there's not much you can do)
• Use a better model lol
If it voicecracks you can mess with crepe hop-length but thats more advanced. RMVPE is by far the most stable
thank you so much
Unsure what style of edit you're doing, sometimes it helps to mimic the voice if you're recording it yourself
I usually record like 5 takes of the same vocals and chop in the good ones
no
ok I have a problem with the software the file called RVC GUI.bat does not launch even after a long wait what should I do???
Hello friends, I want to add my own voice to a song, how do I do this? Can I model my own voice in rvc gui or is there a simpler way?
already replied in #ai-help
rvc gui is old, replied in #making-models
Is there any way to integrate these models into https://sparkengine.ai ?
search megaphone effect pls bro
It's this no-code editor my team and I made. We want to integrate more technology through APIs
megaphone effect voice pls search
Can any voice model pronounce the alphabet correctly in English? I tried and none can say the letter E and G for example
I also tried to say a singular word, "Me" for example and it fails at it
Is there a way to use the zip-format voice models for voice to voice on a phone? I remember it was possible but it's been a while since I last used it
Ok, and how do I get a voice model I downloaded and would like to use, to do that?
I have no AI knowledge
please re-state your question. What do you want to do?
Well, if I try to say the same letter twice in a row, it will say the letter in such a awkward manner it's not even recognizable
are you talking about the realtime voice changer?
Yes, rvvc 2.0.73--beta cuda
i can say abcdefg, but if i say aa or ee, it sounds really off
#help-w-okada
can someone help me for voice model create
hey
hi
where can i find the face swap collab
idk
cmon man
you hurt my heart
this is your monthly jay message
stfu
Why when my models sing high notes they break voice, no matter I train them using 50 epoch or 200 epoch?
if you use a proper pretrain that wont happen
Sorry for the late response someone told me they have been using it but have no idea if this is true or not
mostly a scam or like some "half life 3" unofficial mods
How do I do that?
for mainline rvc https://discord.com/channels/1159260121998827560/1265083834039533588
Ty
im tryna get my freak on
Brody you good?
can someone show me how to download the voice changer
i will give 10 dollar giftcard
on god fr
go to #help-w-okada to check the pinned guide
hi
damn that;s crazy
you know damn well ts not what it meant
oh
Guy Dm for fun now
DMs open for fun now
Dm for fun
What fun?
biology fun

It's an alternative account.
becareful !
yeah
member since 12 feb
Best image generator I can use rn for free? Need it for uni
To generate images for free (text2img), either:
- Use @elder willow in https://discord.com/channels/1159260121998827560/1202754985255764060 (It's powered by DALLE3, from ChatGPT+), pretty easy
- Other easy and good ways with weights.gg are:
  - Use /image with @hidden grotto in https://discord.com/channels/1159260121998827560/1202754985255764060
  - Create an image on their site https://www.weights.gg/ (where you can also use LoRAs, Low-Rank Adaptations, basically a small trained additional model to adjust your generation)
- Use Open Source Models like stable diffusion & flux that could be a bit **harder** but good, what's ur pc gpu? As you could run them locally (on ur pc) or on cloud (remote good pc)
i recently tried fooocus but my pc specs are so bad it couldnt cope 
what's ur pc gpu and cpu
no gpu
i got a laptop
so it cant really work
If I record someone's voice, I can't see if anyone can model that sound. Just bring it to index and pth format
it can
u need to train it not just rename the format
hi
yes i know but i cant do this
i try maybe for 1 days
ŃŠ¾Ńал?
Wdym, release this? You can download it via using a local AI apps installer, https://pinokio.computer
Late Ahh Response
what ai voice changer software do you guys use
Realtime for calls? Wokada, especially the deiteris fork, tell ur PC GPU in #help-w-okada
lmfao
Ok
You can do Stable Diffusion locally on a PC with no GPU but only CPU. Just don't expect it to generate fast.
do we need to download models from stable diffusion
Stable Diffusion is not a website service where you can download a checkpoint model. It is an AI image generator program. You may need to download and put the model into folder separately.
CivitAI provides most of Stable Diffusion models you can download there.
okayy i will try it
Anyone knows where I can find those old models from Applio?
wdym those old models from applio?
from their site before they took it down?
you can't, anyways iirc the majority were the same of the ones in #1175430844685484042
It still missing what I want that used to be in Applio link 
You can search rvc ai voice models at:
- #1175430844685484042
- In #find-models, do /find with @hidden grotto
- https://weights.gg/ (login required)
- https://huggingface.co/models (but watch out because on Hugging Face there aren't only RVC AI voice models)
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.ai-hub.wtf/essentials/how-to-make-voice-models/
I'm guessing you were talking about the models in applio.org so I gave u all sources to find models since that site will never have a model page again
But do you know why all of it got removed? And when?
any of you know how long a 20 minute dataset usually takes to train
You know any website or something where I can use an a.i voice like I want to do stuff dennis.main is doing how do I do that
if u check on youtube I think you'll recognise the stuff he's made
he makes the goku skits etc
dap up tournaments
etc
no but I dont believe
he uses parrotai
i checked it
it was GARBAGE
So I dont think its that
not the same quality at all
sounds like he uses a tts kind of ai voice
maybe elevenlabs?
they have similiar intonations
I also thought that
but then theres a lot of parts
where they sing
and it fits hella well
and the emotions to
when they're angry
or the way they talk
etc
So, Random quesiton. How many people here use suno?
to use the voice models, you must download the rvc gui
oh ok and thats it?
and watch a tutorial on how to use it
import the voice model into it, yada yada
or alternatively
you can go to weights.gg which has voice models too
that you can use without having to install any 3rd party stuff
its all in-website
but you wont get 48k quality models on there
ok so so
if I download rvc gui
I can use the voice models here
and I dont have to train anything
etc
I have been having issues with the AI Voice shiz
Omg I am so blind
Theres a help channel
Right in front of me
Hey all!
I have audio files of a single person (monologue basically) that is in german and I would like to voice clone it into italian.
Is there a good guide here somewhere I could follow to achieve this?
hi im trying to find the most realistic rvc voice models but ones i find online is not always the best quality. Can someone help me?
em what sigma
Check the #1175430844685484042 channel and test which one fits you better.
hey, does anyone know how to make those ai videos of celebrities talking over a script?
like lebron talking over a video about how some math works
the old one from tiger18n repo?
DMs open for fun now
Dm for fun
HMU for fun now
I don't take a direct message from a random person I don't know.
Just letting you know, I don't Diddy people. 
me neither glad we on the same page 
imagine diddling in the big 2025
could not be me
yo
slide some
i dont know how to dowlaod
it needs tutorial video
there's no youtube tut, also I replied to u in #help-w-okada and RVC gui is old
@brisk birch rvc gui is old, tell what u are looking to do and what's ur pc gpu and cpu in #help-w-okada
!howtoask
How To Troubleshoot
- Don't simply mention your issue, like "my rvc is not working".
- Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
also use the appropriate help channels and elaborate pls
realtime for calls? Tell ur pc gpu and cpu in #help-w-okada
You can search rvc ai voice models at:
- #1175430844685484042
- In #find-models, do /find with @hidden grotto
- https://weights.gg/ (login required)
- https://huggingface.co/models (but watch out because on Hugging Face there aren't only RVC AI voice models)
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.ai-hub.wtf/essentials/how-to-make-voice-models/
how fast do RVC models work?
it depends on ur pc gpu
.what is it and what are u looking to do
translate sounds ?
I don't understand
isn't RVC speech to speech?
rvc is speech to speech, but wdym sounds
like u just want to take a random sound and make it a human?
no
should've worded it better
I meant speech generated from a TTS model
then through an RVC
Ohh
yeah u can do that
u got rvc already or need help for installing it
Just wondering if it will increase time taken by a lot
Well not much, for example Applio has a "built in" TTS, which generates the TTS audio via the MS Edge API and then uses it as the RVC input
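The TTS-then-RVC chain described above can be sketched as two stages run back to back, timed separately so you can see how much the conversion step really adds. This is only an outline: `synthesize` and `convert` are hypothetical stand-ins for whatever TTS engine and RVC inference call you actually use, not functions from Applio or any other tool.

```python
import time
from typing import Callable, Dict, Tuple


def tts_then_convert(
    text: str,
    synthesize: Callable[[str], bytes],   # hypothetical: text -> TTS audio
    convert: Callable[[bytes], bytes],    # hypothetical: audio -> converted audio
) -> Tuple[bytes, Dict[str, float]]:
    """Run TTS, feed its output into voice conversion, and time both stages."""
    t0 = time.perf_counter()
    tts_audio = synthesize(text)
    t1 = time.perf_counter()
    converted = convert(tts_audio)
    t2 = time.perf_counter()
    return converted, {"tts_seconds": t1 - t0, "rvc_seconds": t2 - t1}
```

Comparing `tts_seconds` with `rvc_seconds` on your own hardware answers the "will it increase time by a lot" question better than any general estimate.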
I see. On the topic of TTS, which models are a good balance between realness and speed?
might be useful for training, but even a GTX 1060 or a cpu can still do inference
also you should mention the quadro generation (whether pascal, turing, or ada) since it could be confusing
like which one of the tts models in applio ? Or talking about any tts?
any in general
The fastest one is PiperTTS but it isn't really as good in quality
Would PiperTTS through RVC yield good results?
maybe you could try Fish Speech or F5 which are zero-shot (no training required, unlike rvc which is few-shot and needs training)
eh haven't tested that
if u want there's https://docs.aihub.gg/tts/tts-tools/
Last update: Dec 12, 2024
I would suggest pipertts only if you desperately need speed, but it's worse in quality
also yw
I'll see how PiperTTS works with RVC
does PiperTTS have a hugging face link?
thanks
u know anything that give better results than rvc gui cause some times the words dont match right or sound muffled and I keep seeing on youtube how these ai voices sound hella good
rvc gui from t4ger on github?
that one is old
tell ur pc gpu and cpu and what u are looking for
You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
Im looking for good enough quality where I can make shi like Timonsdreams or dennis.main
RVC = Retrieval-based Voice Conversion, the best Speech To Speech AI models (on v2); it does inference (uses models) on pre-recorded audio (AI covers) and trains (makes) models
Wokada = uses RVC for realtime inference
which one do u want
AMD RYZEN 7 5800 With radeon graphics (CPU)
RADEON RX 5500M GPU 0
AMD RADEON (TM) GRAPHICS GPU 1
whatever is good enough where I can make the content like dennis.main or timonsdreams
idk who those are
read that message, and say which fills your needs
i explained you what they do
they are different programs
they both use the same model architecture
which one gives the best voice for the character like I put a mp3 file and it says it the best or sing it etc
then rvc which does inference (use models) on pre-recorded audios
ehh
Your AMD GPU is good enough to do inference (use models) locally (on ur pc), you won't be able to train (make models) but use them
You can:
- Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides):
  - Applio (AMD Windows): A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
  - Mainline (AMD Linux/Windows): The original RVC
- Cloud (remote good pc, easier and faster than ur PC but it's limited):
  - Ilaria RVC Zero: fastest and simplest that you can get for free
  - Weights.gg: Partnered with AI Hub, lets u do them easily but u may be in a queue
  - Applio Colab: max 4 hours daily, not granted, of GPU
Easiest possible (automatically separates vocals & instrumentals) : weights.gg
easiest cloud: Ilaria rvc zero
easiest local: Applio
ok how do I get this
most videos about rvc are too old and it might have some editing magic that may sound too good unlike the reality
following the cloud links
u know what they use to boost it then?
thanks unc
How old are u
18
I'm literally 16
damn my fault youngin
got the GPT SoVITS v3 hit and running
v3 doesn't like torch.float16 for some reason
btw v3 switched to DiT, compared to VITS in v2 and v1
where did you find v3?
damn that's crazy
What's the best voice for narration?
I want to create a voice message for my student as English listening test.
Plz be American accent
Is there a Realtime Voice Changer for free?
-rt
Tell your PC GPU and CPU in #help-w-okada
I think you mean models
You can search rvc ai voice models at:
- #1175430844685484042
- In #find-models, do /find with @hidden grotto
- https://weights.gg/ (login required)
- https://huggingface.co/models (but watch out because on Hugging Face there aren't only RVC AI voice models)
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.ai-hub.wtf/essentials/how-to-make-voice-models/
GPT-SoVITS v3 before RVC v3 lol
i did check... what a mess
stealing shit from other projects
bigvgan 
and DiT from F5-tts
iirc bigvgan, bigvsan, and evagan seemed to be sota, and then the bigvgan v2 was out in the middle of last year
BigVGAN 44k 512 upsampling ratio....
44100/512 = 86.132
does not compute
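Spelling out the arithmetic above: if the 512 upsampling ratio means each spectrogram frame expands to 512 audio samples (that reading is an assumption here, not something confirmed in the chat), then at 44.1 kHz the frame rate is not a whole number. A quick check:

```python
def frames_per_second(sample_rate: int, samples_per_frame: int) -> float:
    """How many spectrogram frames cover one second of audio."""
    return sample_rate / samples_per_frame


def leftover_samples(sample_rate: int, samples_per_frame: int) -> int:
    """Samples left in one second after the last whole frame."""
    return sample_rate % samples_per_frame


print(frames_per_second(44100, 512))  # 86.1328125
print(leftover_samples(44100, 512))   # 68
```

So one second of audio is 86 full frames plus 68 extra samples, which is the "does not compute" part: the two rates do not divide evenly.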
I think the main difference is the cuda kernel
new mel loss (already have that in Applio), cqt discriminator - does shit all
Wait What?
"stealing" maybe a wrong word, but..
dont think the diffusion transformer is useful for us
old VITS is what is used to turn a spectrogram into latents and latents into a spectrogram
so thats what rvc boss was into all of this time of not saying anything
killing rvc for that
well, you see... you done one project for university graduation, then to go to phd and do a better one
i mean yes but hurts seeing rvc getting killed just like that
im aware he did that in order to do something better
the original rvc was a mess
i see why he doesn't want to be involved in that anymore
Well diffsinger is getting new development
I never liked diffsinger, utau or vocaloid
Why?
Well it's kind of meant for a different purpose
Chalaka (Remix) Jeyceu ft. SombraPR x Hades66 x CDobleta x ClarentKP - (Official Video IA)
Instagram: jhayy.66
Email: proddbyjhay@gmail.com
Cdobleta & ClarentKP: @nelsonvg_
Cover art: @EstelarProductions
Everything is AI-generated; for any questions, contact me on Instagram.
AI models made by: @carbonfiberjr
Instagram: https://www.insta...
@calm vapor Listen to this
ah f5
yup
I mean I think that's the wrong word
Hi guys, anyone here has heard about kluster.ai? I've been wanting to give it a try but wanted some opinions first
Sorry if asking this is against the rules, I'm new here
I've never heard of this website name, sorry.
From what I understand, it's using F5 code right 
Mcdingle fartling
One of the only websites I remember I did some things is Weights.gg.
Ur new name is Mcdingle fartling
But I'm not Quandale Dingle. 
the diffusion transformer
vocaloid1-quality song
There doesn't seem to be any place here to promote your things now.
thai
Oh
Well technically F5 tts is MIT licensed, tho this is really weird
I don't understand why basically every AI project is MIT licensed tbh
@chilly lake https://f5tts.org/playground do you know about this site
I can't find anything about it in the F5 official github
I feel like it's fake, the playground is just embedding the hf space demo
probably
There's new contender in tts called Zonos
took almost 3 hours to instal it on windows
seems interesting, does voice cloning and emotions
So I'm still waiting in an endless queue just trying to hear a sample
Where is the local version
How's it?
Also did u hear about kokoro?
Damn, I thought u wanted to switch to linux
Thanks, I'll look when I'm at my pc. No gui?
There's a web UI and cli mode
both kokoro and zonos use espeak as phonemizer. kokoro has no voice cloning, but there are some voices included; not much emotion, but pretty good reads. zonos has a 30s segment limit (~500 characters), otherwise it starts to hallucinate; voice cloning is decent, emotions too
Do u think it's worth adding them to the docs?
Also thanks for the clarification
kokoro definitely yes
zonos i'd wait until there's easier windows install that only requires torch and no other bs
https://www.zyphra.com/post/beta-release-of-zonos-v0-1
Has anyone tested this? I saw mention or pricing, is there a local version to render for free
read above
Oh, I see
right now you need python 3.11, cuda torch, triton, flash attention, mamba_ssm
last 3 require some effort to acquire on windows
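A quick way to see which of those pieces are already present in an environment before attempting the install. This is a rough check only: the import names `flash_attn` and `mamba_ssm` are the usual module names for flash attention and mamba_ssm, and it does not verify versions or that torch was built with CUDA.

```python
import importlib.util
import sys

# Module names for the requirements listed above.
REQUIRED_MODULES = ["torch", "triton", "flash_attn", "mamba_ssm"]


def missing_modules(names):
    """Return the module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def python_matches(required=(3, 11), version=None):
    """True when the running (or given) Python major.minor equals the required one."""
    current = tuple(version or sys.version_info[:2])
    return current[:2] == tuple(required)
```

Running `missing_modules(REQUIRED_MODULES)` and `python_matches()` shows which of the hard-to-get packages still need work on Windows.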
Mm I will see
I like the inflection of the zonos demos
I was also thinking of adding Piper and explicitly say it's only for people who want speed, since I saw some asking for fast ones
It was made by my fellow friends.
I made the models for them hehe.
I need to hear standly ai bro
i got a 11/11 model but its mid
not mid
but its okay
you think that the no te valoro guy can cook with it?
If you had to go back to age 14, what would you do differently to change your current life (choices, projects, relationships, friends, etc.)?
how fast is piper?
Fast enough to even run fine and optimized on a raspberry pi 4
so basically pure espeak?
kokoro takes <4s for 1600 symbol text
beats f5 for sure
they use their own phonemizer https://github.com/rhasspy/piper-phonemize the models are trained with VITS and converted to onnx
C++ library for converting text to phonemes for Piper - rhasspy/piper-phonemize
"When using eSpeak phonemes, requires an espeak-ng fork"
yeah it's a fork
tho, it's more lightweight and faster since it's made by an open smart homes foundation
tried Zonos with long text, very bad

looks kinda weird at the end
since it does not support long sentences I had to split and then stitch them together, but it sounds really weird and disconnected
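For the split-and-stitch workaround, one option is to cut at sentence boundaries and pack sentences into chunks under the limit, so each generated clip at least ends on a natural pause. A minimal sketch, assuming the roughly 500-character figure mentioned earlier in the chat; `split_for_tts` is a made-up helper, not part of Zonos.

```python
import re
from typing import List


def _split_long(sentence: str, max_chars: int) -> List[str]:
    """Break one oversized sentence on spaces (a single huge word stays whole)."""
    pieces, current = [], ""
    for word in sentence.split():
        candidate = f"{current} {word}".strip()
        if len(candidate) <= max_chars:
            current = candidate
        else:
            if current:
                pieces.append(current)
            current = word
    if current:
        pieces.append(current)
    return pieces


def split_for_tts(text: str, max_chars: int = 500) -> List[str]:
    """Pack whole sentences into chunks of at most max_chars characters."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        if not sentence:
            continue
        parts = _split_long(sentence, max_chars) if len(sentence) > max_chars else [sentence]
        for part in parts:
            candidate = f"{current} {part}".strip()
            if len(candidate) <= max_chars:
                current = candidate
            else:
                if current:
                    chunks.append(current)
                current = part
    if current:
        chunks.append(current)
    return chunks
```

This only handles the text side. The "weird and disconnected" sound when stitching comes from each chunk being generated independently, which chunking alone will not fix.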
agreed
GOYDA
xtts is the best story teller
goydaaaaaaaaa
are pretrains basically like mannequins and the datasets we provide them are like clothes?
Nah bro i don't think so
He's currently signed to a label called Moneywayy
NG is currently occupied with his irl job and making AI demos for a producer from the label.
(I don't wanna be rude but we also don't care too much about Standly)
pretrains are like half-made marble statue where you just finish the finer details. It is very hard to reshape a finished statue to another shape
i see
well, even more, it is a collection of half-finished statues of various shapes
and you pick one to finish (singer) and break the others
can someone point me to the download for windows i look at the forum and i cant find shit idk if im dumb
download what? realtime voice changer?
!howtoask
How To Troubleshoot
- Don't simply mention your issue, like "my rvc is not working".
- Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
alright so i got the file but how to i unzip it like ive unzipped in winrar but nun comes up
For those who use W Okada, is client better than server?
My legs are getting stronger. I'm paraplegic from a gunshot wound but I'm getting better, it's been 4 years
this is not a hook up server
i know right
Does anyone know a good instrumental generator? @barren mauve is looking for something to generate a film score. Something a little better than Suno if possible
Skibidi Toilet.
Imagine being so lonely online, you gotta expect someone to talk to you. 
seems more likely a bot
When I use "Arona President" voice model instead of a voice model of Arona, it sounded like some girl who tryna sound like Arona.
@graceful patio why is bro in this server as well
Is she interested using voice changer to troll some "kid" after all? I'm pretty sure she is. 
There have been some dramas against her recently, especially when she was caught leaking some kid's phone number last year.
okay how does this work haha
how does what work?
what do u want to do
what's ur pc gpu
There's thousands of ai programs and models
Ragebait
hey any one to talk with me
iam arthur morgan
this is a big chance to talk with me bastards
dutch tell me about something named as chatting idk
ok fu!@#$ you bastards go away i dont love chatting any way
he prob means that ragebait drama above
hello, which collab separates and joins automatically?
im not sure what you exactly mean but that sounds creepy
for example, I put the voice model in collab and then collab separates the audio from the music and after doing the work it puts it all back together again
DIlly ding, dilly dong! A new RegalHyperus drum model just released!
Supernova (Drum model no. 579)
wrong chat mb
Please speak only English here; otherwise use #русский
In #ai-help tell:
- ur PC GPU
- what google notebook u are using (link)
Hello
Question, is there a way to replace my android tts voice with a voice model from here, if so how?
I dont know to ask in help considering how small of a question this is
is there an AI to improve the quality of vocals?
Jeez why does everyone find a.i. vocals and music so interesting. We really should be focusing on the a.i. code that's possible not pretending to be a girl on Omegle or making a new a.i. drake release
selam
Currently using LM Studio and thinking of changing to AnythingLLM, Ollama or Open WebUI
any opinions or experiences?
Open WebUI is only the Web User Interface, it can't work alone, it needs to be in pair with another tool like Ollama
I personally prefer Ollama, easy to use and Open Source
Oh I see. By the way, does Ollama perform better than AnythingLLM or LM Studio? Or is it just because it is open source
does Ollama have its own UI?
I haven't tested AnythingLLM or LM Studio, but I have heard from others that LM Studio might be slower https://www.reddit.com/r/LocalLLaMA/comments/1c18hgj/why_ollama_faster_than_lmstudio/
I can't confirm that tho, but I care about transparency
ollama is cli and also can be used as an api, you can just use Open WebUI which is originally meant also for ollama
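For the "can be used as an api" part, here's a rough sketch of what calling Ollama from Python could look like. It assumes Ollama is running locally on its default port (11434) and that you already pulled a model; the model name "llama3" and the helper names are placeholders, not something from this chat.

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_generate_request(model: str, prompt: str, url: str = OLLAMA_URL) -> urllib.request.Request:
    """Build (but do not send) a non-streaming request for Ollama's generate endpoint."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(model: str, prompt: str) -> str:
    """Send the request and return the generated text (needs a running Ollama server)."""
    with urllib.request.urlopen(build_generate_request(model, prompt)) as resp:
        # With "stream": False the server answers with one JSON object.
        return json.loads(resp.read())["response"]


# Example (placeholder model name, only works with the server up):
# print(generate("llama3", "Say hi in five words."))
```

Open WebUI talks to this same local server, which is why it can't work on its own.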
I see. I'll keep that in mind and do my own tests.
I'll have a look at Open WebUI & Ollama together
Great, let me know about your tests, I'm interested
hello
rus discrimination
bruh
hey anyone know how to set up an ai to read for me ?
just use immersive reader
i don't know really, haven't been here for a bit
isn't that what immersive reader does? or do you mean smt else?
maybe he meant Text To Speech for content creation but I'm not sure
i want it to have a specific voice from a show i like, if thats possible
okay so you want to touch a specific part of your phone to read it out loud with a custom voice ?
basically you want to make an ebook?
first you would need to use an OCR (Optical Character Recognition), like https://www.ilovepdf.com/ocr-pdf or https://github.com/tesseract-ocr/tesseract
this basically gives you the text from the pdf
then you would need a TTS tool
There are different Text To Speech (TTS) AIs:
GPT So Vits: RVC isn't as good as GPT So Vits for tts, but gpt so vits (few shot tts, which means needs just a lil training for models) can't use rvc models (and viceversa), and its only limited to: english, chinese & japanese, if you wanna check gpt so vits instead, read https://docs.ai-hub.wtf/tts/gpt-sovits/
Freemium 11labs: An easy way to do TTS is https://elevenlabs.io/, you can't use RVC model on this but its a mostly premium easy way for good quality TTS
FishSpeech: FishSpeech is a 0 shot (no explicit training needed) TTS, if you got a good pc you can use it locally else use their site
With RVC Models:
RVC is natively for Speech To Speech, but forks such as ilaria rvc mainline & applio have built in tts (using Microsoft Edge TTS to make a generated tts audio, which i suggest you to choose a tts model that is the same gender and language of the rvc model you wanna use, and then convert it with rvc)
If you wanna do tts locally with RVC Voice Models (if you got a good pc):
If you don't got a good pc you can do tts with RVC Voice Models on cloud:
- Ilaria RVC Zero (Running on A100 GPU, free fastest rvc on cloud) and the guide
- Use Applio UI Colab (with google colab T4 free daily limit gpu)
- if you don't wanna use edge tts, you could try another tts ai from our tts index and use the output as an input in rvc
I have never done this myself for making an ebook but this info could help you out
hi
hello
Guys, how do I find more GPT Sovits Model?
I am using deepseek in ollama in anything llm ,so if deepseek gets new version, how do i update the deepseek model
p^m$*
hello
you can check #1175430844685484042
Last update: October 20, 2024
ok
it's not like a program (like discord) that gets auto updates, if deepseek releases a new model, you gotta download the new model
ok
yo you single hmu
she might be a minor
She isnt LOL
shes a youtuber shes not afaik
ok
im insane but i have standards


XD
u sure
THE FUCK??
XD
source??
not sure tho
anyways I doubt that's even her official acc
hmm
it's just a tiktok troll anyways
official links are linked tho
and she surely doesnt seem 14 from her recent video apology
if shes really a minor damn my bad
im adult tho
what should i do with this information
damn
idk
wait, she's a minor and posting nsfw on twitter?
what
yep
atp I feel like this is just a guy trolling for some little reactions lol
the acc died too since years anyways
maybe
u make models?
yea
sure
yay ^^
how can i help you?
follow guides
by?
how to make a voice model
gather a dataset first
how?
were can i get the database?
i meant dadaset
wdym how, it really depends on what voice you wanna make
what the holy fuck
Hi all, I want to change my voice in real time. I have an RX 6600, will it be enough for this, and what app do I need for this?
You can start by using Deiteris' w-okada fork.
Lemme give you the docs.
Thanks vtarcelia for corrections, Nick088 for contributions. Most technical information comes from deiteris.
Latest Version b2332 from December 2024
RTX 5000 series is not supported yet, but an update will come either today or in the next days (12th February)
Translations (outdated but works)
Ger...
For further questions go to the #help-w-okada channel
Thanks
You're welcome.
So like... Who are you? And why are you here
I feel like I've heard some horrible drama but IDK if it's real or what the drama was
Hi
@icy pendant correction in the doc: the valentine's day has passed and still no confirmed torch fix for RTX 50-series. perhaps until the RTX 5070 Ti or RX 9070 XT is released 
can someone help me with the voice changer stuff
nooo way less goooo
are u really taking the gemini summary at face value
i mean open ai did say that its going to kill their product i dunno
Now read the reference website blog of it and then summarize.
where the download for the voice changer
This channel here is not where you ask about W-Okada. For W-Okada, go to #help-w-okada
No, I'm not giving my account to anyone. Search Udio on Google.
Hi
Hlo i want to know why this server was created, i mean what benefits i may get in this server
none
AI cover, but now it's mostly AI voice changer.
hi
wassup, can someone train a model for me? I already got the dataset
is there a specific channel on here to ask for that?
To request someone to do a voice model for you, you can do one in #1159289738314919936
thanks
namari would you be the ts to my pmo
Give me the voice changer
Pls
What does this mean? 
This channel isn't where you ask for W-Okada. I'm not Santa Claus.
what a hottie replied to me
Deiteris said he could make a fix happen, thats all
IM CTFU
btw someone said that torchaudio is the culprit; it doesn't support the latest nightly torch, hence falling back to the stable torch 2.6/older which the RTX 50-series cards refuse to run
What free courses or videos do you recommend if i want to start an AAA agency? (i am a beginner)
can someone explain to me what pretrained models are?
they are basically a base for training your own models
is it better than just using your own model? also how can i train a model from vocals? i can't find anything in #1159513888199540817 maybe i'm blind
every model uses a pretrain, you wouldn't train without one bc it would take a massive amount of data and time
what's ur pc gpu
1660 super
i can't pretrain through a website?
also what is the best pretraining model to use?
Your Nvidia GPU is good enough to do inference (use models) locally (on ur pc), not the best to train (make models) even if still possible
You can:
- Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides):
- Cloud (remote good pc, easier and faster than ur PC but it's limited):
- Ilaria RVC Zero: fastest and simplest that you can get for free
- Weights.gg: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio Colab: max 4 hours of GPU daily, not guaranteed
Easiest possible (automatically separates vocals & instrumentals) : weights.gg
easiest cloud: Ilaria rvc zero
easiest local: Applio
there isn't one, it depends on your model's dataset length and language
my gpu isn't better than running it on the cloud no?
your gpu is old, it can technically do both inference and training but it isn't better than cloud for training
I would suggest you to use cloud for training
Hello
any good alternatives for Suno ?
iknow a few of them but i was wondering if there is something new i dont know about yet
I hate the new update of suno
Hello
What did they change? Idk, since it's been a while since I used it...
hello
Are you ai or real
What's 2+2= AI
Hello mr Resalesmelo, it equals to 4.
Okay Ai I have couple question for you how many letters are in alphabet
There are 26 letters in the Modern English Alphabet.
Okay what is my birthday ai
Your bday is the 32 of February 2077
Wrong ai

you are an ai assistant by Deepseek. how many 'r's in "pretrained recurrent generative imitator"
So now I question you are you real or fake
<think>
I'm not actually an AI, but the user wants me to be an AI assistant by Deepseek, so I gotta do it
</think>
There are 7 'r's in "pretrained recurrent generative imitator"
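For anyone who wants to double-check that count, a quick sketch (the variable names are just for illustration):

```python
phrase = "pretrained recurrent generative imitator"

# Count the letter per word, then over the whole phrase.
per_word = {word: word.count("r") for word in phrase.split()}
total = phrase.count("r")

print(per_word)  # {'pretrained': 2, 'recurrent': 3, 'generative': 1, 'imitator': 1}
print(total)     # 7
```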
Are u srs
@tepid basin I should get a job acting as an AI atp, I think I'm pretty believable
Full time AI
Not AI dev
Just AI
You're right, United shoes!
prompt engineer, etc
@covert lake you and @opal marsh should fight
Smhh I don't sound like an ai that much
Right?
you kindaaaa do
prime example: #staff-chat message
Hi, I'm studying computer science with AI and want to work in the AI industry (UK). What are some warnings or key considerations I should know about ahead of time?
If I were a fake ass person, I wouldn't have been here telling people how to install RVC/W-Okada, you know.
You can't add any bad stuff in the songs like funk or beach or sheet
Swearing? ON MY CHRISTIAN MINECRAFT SERVER?!
Correct me if I'm wrong, but AMD's ROCm is pretty much ready to ship in windows… I was looking into it and the HIP SDK seems to have all the required calls and functions. Is a version of PyTorch built with the SDK the only thing missing, or am I a dummy and missing something else?
That sounds like a great human message
last time I checked ROCm was silly in terms of what it supported, did that change?
Nick, why don't we ping you when we need help, you are a bot already /jk
real 
ahahaha
rocm works fine in Linux
or WSL2, which is linux
AMD's Windows support for AI is pitiful
but you can use HIP SDK + Zluda emulator to run most CUDA Pytorch applications
hello guys, does it exist some ai that create for you an ai voice model based on some sample? Because i'm using applio but it would be cool to have a program that automatically create an AI voice model based on some sample o more and to be use after on Applio, thanks in advance!
Replied in #making-models
No.
anyone knows what's the python code to put the korean-hubert-base as embedding model?
Guys, who has a subscription to leonardo.ai?
Not me, but if I had to recommend a good image generator, I would recommend Foocus
bro left


Is there an AI song generator where I can upload an existing song (let's say a spanish song), then I'll provide lyrics in different language (english) and the AI will "sing" the song in english?
with suno you can change the words of a song, but I don't know if it will come out well, I do it sometimes but with songs in my language and I make them say other things
Just checked it out, seems like its a premium feature?
no, it's free, but the song must be 1 minute long at most, otherwise you need the premium version
I see, thanks! Hopefully, in the future, these things will be more accessible and more advanced. Only time can tell...

Anyone have experience with CNNs?
Looking for a model maker to create a model asap. Will pay. Please send dm of your work samples!
RVC has a bunch of CNNs
Why is this channel always flooded with people looking for voice mod? You guys should just pin the way to do it to the topic
I use play.ht
We literally have a full documentation on it, and guides linked all in pinned messages
smh not open source
So why are ppl still begging here lol
It's annoying. I come here daily and see the same conversation over and over
people refuse to read
rah
what
I'm joking
ppl used to ask before reading the rules and guides
I'm looking for help developing a CNN for interpolation between frames for a Vid2Vid model. Just looking to chat with someone with some experience
I thought it's using transformers already?
only HuBERT model from transformers
The guy downstairs has so many dicks in his ass
DIlly ding, dilly dong! A new RegalHyperus drum model just released!
Sing Me to Sleep (Drum model no. 580)
hello
have a nice day
hello, have a nice day too
hello
i'm good wbu?
hmm
guys, make pls "папич"
how to use intel arc gpu in voice changer?
replied in #help-w-okada
we allow only english here except in some language channels
has anyone here worked with fine tuning open source text to speech models using custom voice??
does someone have the link to español latino models? pls
https://discord.com/channels/1159260121998827560/1175430844685484042 and select the tag "spanish"
or you can go to ai hispano, there's more latin spanish models there
i need a link? or how i can go there
i don't have the ai hispano server link, you can ask vidal
he maybe has the link
engineer of ai hub and member of ai hispano
how can i talk with him
leave it for later, i just send him a dm to see if he gives me the link or not
oh nice, thank you so much
you're welcome
i was about to ask ideas for a model
but i will do a model of mendora from the madness disks lol
Generate Image keeps on Waiting to start
hi
hello
anyways y'all, don't expect me to stay here for too long xd
hi people, can someone help me to enhance a video please? :c
Imagine talking for a moment and then leave thereafter. 
Did you know with gpt search you can find contact information for people?
Is there any free AI Lyric Swap?
hi
Huh. Better not finding the information of someone they don't want you to know.
oooo refinegan seems to sound really cool!
i wonder if we will be able to use it realtime one day
Is there like any dogwater mic voice changers?
What?
Any like Voice models that convert your audio to dogwater
A webcam mic
In a nutshell
Grab a bit crusher and crank it
Maybe also increase gain
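If you'd rather do the "bit crusher + gain" thing in code than in a plugin, here's a minimal sketch. It assumes float samples in [-1.0, 1.0]; the function name and the `bits`, `hold` and `gain` parameters are made up for illustration, not taken from any voice changer.

```python
def bitcrush(samples, bits=4, hold=4, gain=2.0):
    """Crude bit crusher for float samples in [-1.0, 1.0]."""
    levels = 2 ** (bits - 1)  # quantization steps per polarity
    out = []
    held = 0.0
    for i, s in enumerate(samples):
        if i % hold == 0:
            # Sample-and-hold: only refresh every `hold` samples,
            # which fakes a lower sample rate.
            boosted = max(-1.0, min(1.0, s * gain))  # apply gain, hard-clip
            held = round(boosted * levels) / levels  # reduce bit depth
        out.append(held)
    return out
```

Lower `bits` and higher `hold` both make it sound worse, which is the point here.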
hey what's the step to epoch conversion?
btw razer your voices are awesome!!
Thanks
check ur weights folder
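On the step to epoch question, the usual arithmetic is steps per epoch = ceil(dataset size / batch size), then epoch = step / steps per epoch. A small sketch, assuming one step is one batch; the function names and the example numbers are illustrative, so check what your trainer actually logs.

```python
import math


def steps_per_epoch(num_segments: int, batch_size: int) -> int:
    """Batches needed to see every training segment once."""
    return math.ceil(num_segments / batch_size)


def step_to_epoch(step: int, num_segments: int, batch_size: int) -> float:
    """Convert a global step count into (fractional) epochs."""
    return step / steps_per_epoch(num_segments, batch_size)


# Hypothetical numbers: 1000 audio segments, batch size 8.
print(steps_per_epoch(1000, 8))      # 125
print(step_to_epoch(2500, 1000, 8))  # 20.0
```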
just wondered, do you use your voices on realtime ? If yes how can you make the MOST out of it
Also act it, it's more convincing if you act like a girl too
On W-Okada, you can just lower the extra number down to the lowest number it can reach so the audio can sound in very low quality. 
Imagine using a voice model that trained from your voice over your actual vocal. 
tht'd be CRAZY

mf might be playing gta 7
Dang, i am stuck in 4
im stuck at 3
Emulate the 4 on your phone 
do u guys have ai server for spanish people?
ai hispano ?
true!
can someone help me with invoke ai?
i want to download stable diffusion model v3.5-large, ive accepted tos or something on huggingface and i got this error: We recommend visiting the repo. The owner may require acceptance of terms in order to download. stabilityai/stable-diffusion-3.5-large
figured it out
What's the best girl ai voice?
hi
Curious what people are using for ai voiceovers or lipsyncs
does anyone know any repos for adversarial attacks on ai detectors to bypass them
ok who the fuck is dis
So, if there's a classifier slapped on top of the model (like llama guard or anthropic's constitutional classifier), your best bet is to hit the edges of the datasets that these were trained on
But I always find that the dumbest prompts ever, work
I dont know how to use it yet
Ill try and figure it out
i have a shit ton of work due on friday
so
hopefully all goes well
Ohhh you're on about those classifiers
i dont know what a classifier is
Easiest thing to do is just reword it yourself
Detector, turns a series of words into an overall class
"This movie is bad" -> negative
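To make the "series of words -> class" idea concrete, here's a toy sketch. Real detectors are trained models, not word lists; the word sets and the function name below are invented purely for illustration.

```python
# Toy word lists, assumed for the example only.
NEGATIVE = {"bad", "awful", "boring", "terrible"}
POSITIVE = {"good", "great", "fun", "amazing"}


def classify(text: str) -> str:
    """Map a sentence to one overall class by counting known words."""
    words = [w.strip(".,!?\"'").lower() for w in text.split()]
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


print(classify("This movie is bad"))  # negative
```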
doesnt look to hard to set up
it has provided data
with pretty good accuracy
read the paper js now
hello guyz! wassup?
I am currently working on a project to make a Highly Advanced AI Assistant, so i need a voice model for my Assistant. i can't train one myself because i don't have good samples, so anyone with a Girl voice model please help, if you have an anime girl model that would be awesome.
Voice overs can be done via voice acting and then use RVC to change voice, or via TTS but won't have emotions
Lip sync can be done by anitalker or face fusion
You can search rvc ai voice models at:
- #1175430844685484042
- In #find-models , Do /find with @hidden grotto
- https://weights.gg/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.ai-hub.wtf/essentials/how-to-make-voice-models/
:wave: @covert lake, How can I help?
Available Commands:
⢠@weights find <query> or /find <query> - Search for RVC Voice Models
⢠/create - Create an AI Cover
⢠/image - Generate an Image
RVC models are STS, not TTS natively unless you use another TTS to make an audio then use it as an input
yes emotions can be done, if you train so-vits-fork heavily with more than 100 - 150 good samples with different emotions you can achieve good emotions
So vits SVC?
That's old asf
It was replaced by RVC V2 about 2 years ago
Unless you're talking about another program
