#🧬│ai-chat
1 messages · Page 321 of 1
hi
@covert lake
there you go
small correction - inference audio was 12 min, the training set was 9:41m, after split it is a bit longer in Applio
so extra 75 seconds to work with and still faster
where can i find some english valorant voices (pls ping with answer)
first infer may take a bit of time as the model may take a few extra seconds to load
hi
but in general there should be no noticeable difference between rvc and applio for inference using zluda
direct ml though in my expecience is much slower
anyone know a good free voice changer?
dubbing AI
It has a lot of voice changers and it has a good soundboard
lmao
h
hi
original: https://rentry.co/voicechangerguide
fork (optimized): https://rentry.co/forkvoicechangerguide
directml is problematic in both rvc mainline and applio (and in general) so no wonder. But well, for me it takes 58s to convert 12m audio with directml. Given that rx 6600m is something like 40% slower than 6700 XT, it sounds reasonable
where can i find some english valorant voices (pls ping with answer)
hi
Hi
hIh
hello
h
yea inference seems the same
I can't access the channels..
training seems like it took some sligthly seconds more in mainline ?
if i read it correctly
what channel?
voice models
also forgot to reply, i was updating uvr5 ui kaggle 😭
cant u click in #1175430844685484042
mainline training 69s/epoch, applio - 42s/epoch, so about 25-30% faster with torch 2.3.1
yes but I don't have permission
You don't have permission to post a model
but you can view the channel
Yea, weird that the inference was slower on termux tho
yes
ig it depends by accelerator too
ahhh
Ayo? @soft portal level 1 !!! 
docs are down rn, but here is an archive of the guide: https://web.archive.org/web/20240915173810/https://docs.aihub.wtf/extra/model-maker-role/
Last update: May 20, 2024
this is how u become model maker to upload models
ok thanks 😉
Good Afternoon
this is @jovial totem
today
uhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh ok
b
hi
koi hai ?
im genuinely sped😭
skibidi
yo
Ayo? @wary olive level 1 !!! 
what program should i use for AI voices
i was used RVC BAT but it doesn't work anymore for me
help
where is epochs?
what
weights
thanks
Hello
ага, давай
Does anyone know of a good discord server to get help with ComfyUI?
Sup
@orchid slate can i use real time voice changer in amd cards like rx 6600 ?
Sup
You can use directml build, or you can use a cuda build with Zluda emulator... both methods I've found are lacking
how do i make it so i dont hear myself from the client
upscale image
4o canvas is a death sentence for entry programmers
Ayo? @elder willow level 2 !!! 
why and o1 preview canvas once that comes out will be even better
Blud I can't even imagine o1 preview with canvas
4o canvas obviously has better coding training than 4o
I've used it and it did a better job at coding than o1 preview, astonishingly faster also
o1 preview has the best coding
Doubt
Also I explained it in #🔊│ai-development
Anyone?
anyone play coc?
fork wokada works with less cpu usage than og one https://rentry.co/forkvoicechangerguide
Guide style is in the same as Blanc_dot's. Thanks Blanc_dot for corrections. Most technical information comes from deiteris.
Last update October 6th, 2024: Multi PC setup explanation added
Translations added for:
German: https://rentry.co/ForkVoiceChangerGuide_de
Turkish: https://rentry.co/ForkVo...
#🔍│help-ai-art
what specific purpose? because u can use it for SD/Flux, LLM prompting, and even RVC
Ayo? @queen jay level 1 !!! 
Hello?
@scenic token What happened to your group that banned everyone?
Ayo? @fair viper level 2 !!! 
skill issue
i see now, it is 3 am all around the world
okay dude
Hi, is there a website that will ask me a load of questions then give me a character sheet I can give to a ai model? I want to replicate some fictional characters but I'm terrible at writing character sheets
that answer ur question?
No
Ayo? @cloud sentinel level 1 !!! 
Can someone tell me what's going on, am I having a stroke? I don't smell toast
You don't need any help anymore. The dead don't need help
c.ai?
spicychat?
You can try to find a character that you like or create your own
Civitai
h
Hi, i want to make a voice model from a character
I want to learn how to do it
Is there a guide that i can follow?
I got a lot of audio files of the voice alone
My birthday is on 22rd this month
too bad its not on halloween
hi
They let you create characters by adding personality traits and descriptions. I am bad at thinking of personality traits and I was hoping there was a site that would give me those traits based on questions
Ayo? @sweet harbor level 1 !!! 
hello there
no entiendo
Ayo? @grim osprey level 1 !!! 
hello guys
yo who wanna duo catfish with me 
you're weird if you do that and also you're gay if you catfish also
hehe
he is probably doing it for both
anyw hmu i been making mad bread from this sh

No you haven't and if you did then you are gay and you can't say otherwise
lolll
lollllllllllllllll
so you're gay
yea lets duo?
Nah I'm not gay and I can make money other ways that are not weird.
self report
Ayo? @tacit falcon level 8 !!! 
you're the gay one here I'm just calling you weird for catfishing.
bro got flustered took 5 minutes to reply and had to edit L
ayt buddy u typing too much, saw you had to rewrite too lmfao ayt bruh take the L im out ezpz
self report
At least I can make money not being a James Charles copycat hitting on straight men like you do
lmao lemme guess u make money by straight up having gay sex? got it
learn to type faster gay boi ur wasting my time, closeted lmao L
No I'm not gay at all and the money I make is from doing sfw non controversial deepfakes.
oh hell nah thats even weirder holy shit, blocked
that's not weird at all what you do is weird what I do is make a deepfake model of someone that's popular or have consent from.
I need a code to use a models files the model has metadata.json model.pth and something else
hey, is so vits old news now that rvc has become so popular?
What are people using to make AI covers nowadays?
Hello, what voicechanger should I install? Please help
Va faire kaka
Ayo? @stable furnace level 2 !!! 
@covert lake sorry but i needa ask one more time
how do i use this model after downloading?
Has anyone tried swapping out the optimizer or really doing anything else to tune the voice models being trained other than just changing the epochs/batch size?
I don't have many datasets to test with but had some p interesting results swapping out the set-and-forget default adam implementation with prodigy
unfortunately not having a quality control on hand makes real evaluation kind of difficult since the default is p tainted too
@polar flax not sure if this is a virus or not i don't wanna risk it lmfao
with the rvc cli?
Hello everyone
I’m jah am I welcome here
whats u rpc gpu?
yes, rvc is way better
Whats ur pc gpu?
its been since a year people use rvc
whats ur pc gpu?
how to train model?
if u dont know ur pc gpu:
You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
I have a 4090
alr its good enough to do it locally (on ur pc)
docs are temporary down but i will give u the temporary one in a sec
ty
thanks
yw, pls tell me the name of the gpu so i can help you
oh lol nw
Ayo? @merry laurel level 1 !!! 
So you are saying Applio should be good with my pc?
it say GPU 0 Intel(R) UHD Graphics
GPU 1 NVIDIA GeForce RTX 3050 Laptop GPU
Ayo? @robust vale level 1 !!! 
seems like you got an rtx 3050 (kinda bad) and integrated graphics which is worse
Could you also tell me the gpu memory of the rtx 3050?
i mean the number next to that
should be like: number of usage / total memory
7.9 gb
cool thanks
dang im surprised no ones made a scream voice yet for Halloween
"It's funny what an angel you think you've been"
BUY THE SONG HERE!!! https://vocalokat.bandcamp.com/album/self-proclaimed-angel
join my discord server if ur cool :33 https://discord.com/invite/mcVYxRYqX6
Wow....It's finally here. This has been a long time coming, I remember starting this song back in May 2022 when I was still 17. I was heavi...
man
not really sure if u would be able to do it locally as i heard rtx 3050 is bad
ok
Or you can use cloud (remote good pc):
- Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
- Applio (ui)
- Mainline (UI)
- RVCDISCONNECTED (no ui)
- Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
- Mainline (UI)
- Applio by Vidal (UI)
- Applio by Shirou (UI, no guide as of right now)
- Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
the best cloud one is kaggle
i would suggest to use cloud as i heard rtx 3050 is bad
ok
im not sure if that one is good enough for local
for any issues u can always ask in help channels!
ok
ultimate dry texter
I've tried a few different optimizers, it is not much different if at all
3050 with 8GB can run training and inference
gonna be about the same speed as colab
not too bad, not too great
so its only bad for the rtx 3050 who have less vram?
i was told rtx 3050 is like a gtx 1650
it has tensor cores, so can to AI stuff, 8GB VRAM is sericeable
as long as it is not a laptop 3050 that is
desktop 3050 has 8x PCIe so it does not do well on old motherboards that only have PCIe Gen 4
its a laptop gpu
so ye its better he does cloud
yeah, kaggle will be a better choice
can anyone explain what divergence is?
as for local, maybe inference can be done locally
for training usually its suggested 8 or more gb
@atomic vector explain “divergence”
so nick asked you to explain divergence, right rocococococo
ok, so divergence is a concept that can be applied in different fields like math, physics, and even finance
in simple terms, divergence refers to the process of something moving away or spreading out from a central point it can also be the point where two or more things separate or move in different directions
for example, in math, divergence is used to describe how a function or a vector field changes as you move away from a certain point in physics, it can refer to the way a fluid or a gas spreads out from a central point
in finance, divergence can refer to the way different economic indicators or market trends move away from each other
does that make sense
-# AI-generated responses may be inaccurate; please verify important information.
LLMs are 🔛 🔝
with 4GB vram and nvidia some features may need to be disabled, like CUDNN to prevent pytorch from running out of memory
btw, u work for applio & mainline?
i remember seeing u in both of their servers
ok the bot is not that useful here 😂 divergence/convergence has something to do with the graphs?
not the mainline, I went to check RVC devs server, but it is mostly dead lol
ye rvc hype is kinda ded
seems pretty surprising, at the very least something lr-free like prodigy should be substantially different than the untuned AdamW that everyone is using as a baseline
had some questions but nobody could answer there, so I assume original devs have moved on to better things
yup
rvc boss now works on gpt so vita
Wok works on a wokada-like tts app
u mean tensorboard? I thought u meant the word lol
Might need to ask in #1192011222023950368 as i dont really know much what to reply
rvc v3 is as much probable to be released as gta 7
yeah I'm more into seedvc and svc fusion the rvc nowadays
never heard about those
i mean, ai covers hype is kinda ded in general
it would be a bit unrealistic to expect continued support for rvc at this point anyway, it's actually kinda surprising how long this architecture has lasted without more significant change
Hello,
I'm Christian work as a software developer and project manager. Have a Youtube channel on Project Management.
Extremely happy to be here.
seedvc is zeroshot sts and svc fusion is a not that known version of svc
image gen has swapped substantial portions of its core multiple times over the last 2 years
rvc is better than svc tho
last svc version i heard was 4 or 5 tho
yea, rvc kinda stayed like this
there really isn't any svc fusion and rvc comparison
seedvc seems cool tho
didnt see a sts 0shot before
most people swapped from svc to rvc, i personally have never tried svc but from tea i heard svc was way harder to use
if i remember correctly
not sure if it was tea or someone else who said that
there is many version of svc there is even ddsp
gotcha, thanks
dunno about that
but since like a year everyone kinda uses rvc
like most voice models i ever seen are rvc
almost never seen a svc model
yeah and they don't like trying other sts stuff that's not rvc really
any i js wanna be able to use the model
but locally even if slow
I'll have an online version yeah but I wanna make it work on my computer by itself yk
would u mind sending the projects u said tho?
Also is svc fusion few shots or
im mostly interested into the 0shot one tbh
didn't you send me a archived link of the mainline
as the docs are still down, i deployed temporarely the docs, hyperlinks work correctly, the guide doesnt change tho
if you dont want to smash your head against a wall, get a "compiled" rvc version, dont try building from source
atleast tried but uts like 5gb and keeps failing
svc fusion is few shots
latest applio can be build from source or download a prepackaged build too
I'm trying again from the link you sent
ye the guides on our ai hub docs are pre compiled
Would rvc be considersted few shots or, i forgot tbh
yeah few shots or many shots
nvm searched and the right term should be multi shot
bc it needs way more training than a few shot ai like for example gpt so vits
btw could u send a link of svc fusion? Was able to find only seedvc on github
it seems also pretty updated
installed the mainline file that was like 5gb and extracted to desktop what now?
unzip, preferable to a folder that is not under OneDrive / has no spaces
run whichever .bat file
well on my desktop theres no spaces in the file path
but i can put it on the drive if thats needed
"C:\Users\natha\OneDrive\Desktop\SnippetEditor\RVC1006Nvidia"
there's go-realtime-gui.bat for realtime voice changer, there's go-web.bat for inference and training
okay
yeah but no space in it
there reason for that is you dont want to sync all the files it creates to MS
"C:\SnippetEditor\RVC1006Nvidia"
and wait until it saves stuff there
ngl
ion fw onedrive
i uninstalled it after it ran outa space and everytime i ever reset or get new computer immediately uninstall
thats funny
nevermind
so, do you have a gpu? 🙂
well...
RVC1006Nvidia why then
anyway, with CPU the best you can do is a very slow local inference
and training that sets your laptop on fire
quite literally?
it is very demanding, so yeah...
where do i put the takeoff model to use it 😭
index goes into logs, pth goes into assets\weights
my computer is finna catch fire not even training
Ayo? @elder willow level 6 !!! 
just use https://huggingface.co/spaces/TheStinger/Ilaria_RVC for inference
yeah I'm downloading it right now to upload to Google drive and what's better for you pkg or windows version
this is beautiful
at least it is not your laptop on fire
only if you have melting connector on a 4090
im on windows, bUt integrated graphics so i will use cloud (so using linux), its not on igthub ?
No they have a website and stuff just all there downloads are chinese download links which I can download from
okay then will get the pkg for you
if u could send the website in dms or here would be cool
一个 SVC Fusion的非官方整合包文档
why cant it infer anything at all
it all errors
i did a normal song then a snippet that had the instrumental removed
Hi
try without the “”
Sorry but i dunno about local, seems like a low memory issue
Hello
Who’s here??
if u need help ask in help channels and be specific
alright
hi, i cant see voice models
there's no way it can run out of memory while processing not too long voice audio in cpu mode and 16 GB ram
#1175430844685484042 , you have access to view them but not upload them
🤷♂️ i dont do things locally
this is what made me guess that
i also tried searching for that error but seen there was no solution so
also u sure he got 16gb of ram
@polar flax yea thats a not enough ram problem, barely 4gb usuable
so yea i dont think u can do anything on local
is that to the ai split thing
ai split? U mean uvr?
if thats seperating vocals
if u mean uvr, i really dont think so or it would take hours
im not sure the terms exactly
uvr = ultimate vocal remover
to split vocals
i dont think u can do literally anything local
too weak pc
idk its not the same
its the same program
i wanted to do my project and have it simplified now i gotta goto multiple random sites for each process 😭
Cloud is your only choice
lmao 4 GB ram in 2024 💀
Well not everyone is rich
im poor what u expect
idk why u thought he had 16gb of ram tho
computer was 300 😭
it's bare minimum even in work laptops
Even my laptop got only 8gb of ram
i3 🔥
yea there are still macbooks with 8 GB ram
twins?
bare minimum for things now day is atleast 8-16gb but most people dont got money
also macbooks with only 8gb of ram is a scam
real
bare minimum for anything now days is 8gb of ram
much more suggested to have more ram
i dont have my pc rn but should be 11th gen for me too
hello
mine is also 11th gen
can i use weights gg to train model
For training on weights.gg, you can:
- Pay
- Make a creation for 5 days streak to get a free premium training, you can repeat this
- Use referal code, getting 5 friends to subscribe via your training
u pay to train them?
actually i got 10th gen 🔥
nope, weights.gg uses rvc, so i just use the other cloud options of rvc
like google colabs or kaggles
lemme send rq
As you dont got a good PC, its better you use cloud (remote good pc) for training an RVC Voice Model:
- Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
- Applio (ui)
- Mainline (UI)
- RVCDISCONNECTED (no ui)
- Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
- Mainline (UI)
- Applio by Vidal (UI)
- Applio by Shirou (UI, no guide as of right now)
- Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
google colab = easy but risk of losing work for getting disconnected
kaggle = harder but no risk
but weights gg is free?
its free only for inference (use model), or for training only if u do streaks or referall code
but nit directly free training
oh
I would personally suggest using kaggle mainline
i cant see anything
damn i think one week would be enough to do takeoffs entire discogrophy not gonna lie
huh kinda weird, may need to refresh discord, could be a bad internet issue
i dont think so but thanks
nick
yea 30 hours are good asf
can you add me
btw, u just need 10 mins of audio for training
You dont need anything over 1 hour, more than that makes no difference
like run the ai split on every song from that year and cut them together
yea u could make each different one, but its useless to make a single program with all of them
wdym
might be better if u have any issues to ask in #✨│ai-help , im at school ipad rn so could disappear every second
like, no need to take hours and hours of his voice, even 10 mins are enough, and could make different models for each era
im honestly thinking now like
one for each year and branched off that different ways he may've sounded
i started from the beginning so its one album and from 2011
im really disappointed tho i wanted to do atleast of this locally
yea
u didnt add me 💔
Ayo? @elder willow level 7 !!! 
Guys, is the program definitely safe?
yes, dont watch yt videos tho, the program is Open Source
whats ur pc gpu?
i js mean like add
with 4GB vram? way
had they had 32RAM, they could've used shared memory, not ideal, pretty slow, but they only have 4GB ram as well
he said 4 GB system ram and he attempted to run in cpu mode
yeah, they are trying to do rich boy stuff with poor boy equipment
I thought 16 GB is in average modern laptops/PCs and 8 GB is the lowest one
yo guys can someone humanize a document made using chatgpt for me or suggest some tools to do it?
i need it real bad
there are still low ram stuff unfortunately
ai detections stuff arent trustable
Hi. Is there a guide that explains what each setting in the RVC GUI do?
RVC GUI is outdated
dont watch yt tuts
whats ur pc gpu?
I have a ARM MacBook.
MacBook with MPS?
It’s RVC Mangio I’m using.
mangio is outdated too
i mean usuable but would suggest another one
An M3 16gb MacBook Air.
Ayo? @untold saddle level 1 !!! 
Like applio or mainoime
Applio has no realtime (yet)
Well, I couldn’t get Mangio to work in MacOS, so I have to run it through windows in a virtual OS.
Mangio has outdated requirements and some modules do not have them, so it installs a mish-mash of incompatible mix
I’m made my own model .pth file online using the Google tool.
Applio is up to date, should have no issues installing on Mac
performance-wise MPS it is not particularly good
but somewhat faster than CPU
at least last time I checked
you can attempt it
Like I said, I did it on the Google Labs(?).
Colab
Which is better?
Kaggle should be faster
-gui
as far as I know, Mainline with torch 2.0.0 is slower than Applio with torch 2.3.1
Kaggle > Colab
So what should I do for the best results? It’s a bit robotic atm.
again if you try a local training on MPS, newer torch is obviously a better choice
both have the same back-end, so your results will be the same in both
quality-wise it depends on the size of your training set and avoiding overtraining
5 minute audio trained for 400 epoch gonna have bad output
It’s around 15 to 20 mins of clean audio.
Ayo? @untold saddle level 2 !!! 
usually you get good results even before 200 epochs
depends on the variety of the set and other stuff
I had no idea I could do local training on a MacBook.
Wokada fork is better for realtime anyways
What should I use for training and then the conversion on a Mac? I’m completely new to this.
Okay. Thanks. And that does both?
yes, training, inference, a bit more
Okay. Just downloading it now.
heh
Why does it say this: “rm: *.bat: No such file or directory
curl: (7) Failed to connect to raw.githubusercontent.com port 443 after 15 ms: Couldn't connect to server
Creating venv...
Checking if python exists
./run-install.sh: line 40: python: command not found
Please install Python3 or 3.10 manually.”?
When I try and run the install command in Terminal.
skibidi
when you run linux install it depetes windows .bat files
and you dont seem to have python 3.10 installed
Okay. I have installed python now..
Now I just get this “rm: *.bat: No such file or directory.”
ignore it, the previous run did delete them
Now I get “Traceback (most recent call last):
File "/Users/****/Downloads/Applio-3.2.6/app.py", line 1, in <module>
import gradio as gr
ModuleNotFoundError: No module named 'gradio'”
@marsh ferry yo
esex
Hi guys, how can I beat educate myself on private AI companion potential?
I have a question
can you just activate the virtual environment and install requirements manually?
No
Does anyone know how to remove the robotic sound at the end of words?
Hello
anyone know of a good TTS or STTS program for Streaming on twitch?
hi
Ayo? @abstract estuary level 1 !!! 
How to watch voice model? I can't see
u mean view voice models? try #1175430844685484042
u have the permission to view them but not upload
u needa be a model maker to upload
I still can't see voice model..
I can see pretrain models but I can't watch voice models
Ayo? @mental snow level 1 !!! 
...?
d
huh, very weird, could u show me a screenshot maybe in #✨│ai-help if u cant sitll see it?
I uploaded screenshot
☑️
hello
Ayo? @gloomy gull level 1 !!! 
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- 🆕 FaceFusion UI, by Nick088 Google Colab
- 🆕 FaceFusion NO UI, by Nick088 Google Colab
- 🆕 EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
-rvc
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
ку
Hello everyone!
hello
Hi
hi
Hi, can you please help me, when I speak I have system sounds coming out of my microphone, how can I solve this problem? I don't have sound monitoring or listening.
-rvc
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
e
is this still relevant?
probably

Hello dose anyone know a free image to video
That I can use my picture
Hello dose anyone know a free image to video that I can use my picture
assalamu'alaikum
Hello, I am a junior for learning AI.
RVC isnt gonna make my pc blow up is it?
No cus my pc isnt the best and apparently ai requires alot of processing power
shits literally going to set it aflame
I cant tell if your serious
haiii
whats ur pc gpu?
it wont blow up
just asking if it can run it
nvidea gtx 1060
Ayo? @cursive forge level 1 !!! 
You could be able to only inference (use models) local (on ur pc), not train (make models)
Its just better to use cloud (remote good pc) in ur case
Are you looking for inference or train
Both but for now just inference
Table Of Contents Introduction (with website link) Model Loader (Download & Upload) Inference (use RVC AI Voice Models) Ilaria TTS Settings (Inference) Vocal Separator (UVR) Troubleshooting “No gpu is available for you for 60s” Introduction (with website link) Ilaria RVC Zero, is an RVC (Re...
Last update: June 15, 2024
can u even use the realtime apps on that?
For rvc training cloud you can choose between:
- Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
- Applio (ui)
- Mainline (UI)
- RVCDISCONNECTED (no ui)
- Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
- Mainline (UI)
- Applio by Vidal (UI)
- Applio by Shirou (UI, no guide as of right now)
- Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
nuh
idek tbh
is rx 7700 xt good for rvc?
How do I use models from the voice models chat on this?
both of the blue texts i sent are hyperlinks, text that u click and it redirects u to the site, which are guides
Ight thanks
what u lookin for? realtime for calls ? ai covers ? training?
realtime for calls
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
1st link, wokada fork, which is optimized
ah okay
Ayo? @austere eagle level 2 !!! 
It'll be very slow probably
Yeah slow
Realtime won't work but you can convert audio files at a reasonable speed

hey everyone,
I've been looking for a completely SFW image generation models,
But most of the "SFW" models are not actually SFW,
Do you suggest any models that are Actually SFW?
AMD has its own totally SFW generator
do you know where I can find a tutorial for doing it in google colabs?
Blackforest FLUX.1 is pretty SWF
The blue texts are hyperlink, click them and it redirects u to the site
Also I'd suggest HIGHLY to use kaggle
Way better GPU time but harder
amd 780m should be fast enough for realtime
i dont recommend any integrated gpu because of vram
and AMD cards are worse for realtime because they dont have CUDA
Hey there
how do i use the voice model in #1175430844685484042 after i download it
is there some type of software i use to run the voice, and input files into it
whats ur pc gpu?
does anyone have a voice model called better mommy i cant seem to find ot anymore
wot
i mean just like i said
Ayo? @elder willow level 1 !!! 
there was a voice model called bettermommy but its gone now, so im curious of someone still has the files for that one
a model?
and is it supported by ComfyUI and so?
Ayo? @past smelt level 1 !!! 
it is a standalone thing for RDNA3
Amuse 2.0 or something like that
it's from AMD,
so i dont expect it to support CUDA, right?
nope
i recently had a voice model from #1175430844685484042 called better mommy but my pc shit the bucket and lost the pth file for it but i cant seem to find it anymore in that channel
for comfyui you can download flux
Nice
most of models are posted with tag "sfw",
but they produce the most nsfw images on first prompt with word "girl" in it
Flux girl is a pretty girl
all/most flux models,
or base model?
base
of course if you try to force it to show boobs or other parts, it will try to avoid that
like showing an object in front of the most interesting parts
but as long as you dont force it, there wont be any naughty stuff
that's what i am after tbh
does anyone know a good tex to speech app for ai? i have the voice i js need it to read a text...
the most basic vanilla tts is the free MS Speech engine from Edge browser
something like Xtts?
good reads, at leat in english, no mistakes, no hallucinations like xtts
so you can dump a book into it and it will read it most properly
as soon as i thought of xtts lol
xtts - run same text 100 times, you get 100 different results
some good, some bad, some awful, so it is not a hand-off tool
but when it is good, it is really good
iam not really in-depth with tts, but it's the latest i saw on "Aitrepreneur" the yt channel
and i thought it was a good one
could u send me the link in dms i cant find it and i have to help a friend w one of her school project 
in cmd
Ayo? @past smelt level 2 !!! 
or terminal
and the long text in a .py file, and you can run it in cmd
search on youtube of how to run python
is there any good open source Prompt generation webui?
install python 3.10, run the command in cmd prompt to get the module, edit tts.txt to add your text, edit the script to use a speaker for your specific language
how many epochs should i train my IA model?
....... bruh lemme cry
take python 101 lesson
you could just use an HF space btw https://huggingface.co/spaces/Nick088/Edge-TTS
it is the very basic stuff, but I guess the iPad generation may struggle
has a ui so its easier to use
or that, didn't know there was one
i dont want to take no classes i js want to help my friend w her project..
edge tts is an api how will it struggle 😭
ty babe
but i have my own voice
run the output thru RVC
go take a rope
😂
bruh js teach me
Nick gave you a link to text-audio
make an audio with a text, then use illaria rvc to change the voice
bruh
a school project is supposed to teach you something, is it not?
well, here's an opportunity to learn something new and magical
you can give this a try: https://huggingface.co/spaces/coqui/xtts just use a text and 30 sec of your own voice recording
how hard is that?
no need to train models or other crap
but i dont want to learn anything i js want a easy way to make text to speech w an ai voice reading it
i have a trained model
i want to use it

use the link Nick gave
then use https://huggingface.co/spaces/TheStinger/Ilaria_RVC to get that audio converted using your model
What does the "model architecutures" mean?
v1 or v2?
idk
V2 goated
hi
but the accents suck
it is your accent my dude
text -> edge tts -> speech in a chosen language, usually neutral no accent
speech from edge tts -> rvc with your model -> speech with your voice and accent
dont select french speaker for english voice in tts, lol
what did you use then?
When trying to combine models, does the epoch matter?
no
where's the accent then?
as in i had to pick an estonian accent
you pick a speaker
cus i cant find any better ones that fit what i need
for the language of the text
suntem aici în plattoul de filmări platanos alături de regizorul filmului doina rusti ce va inspirat să scrieți această poveste și să o faceți întrun film de cinematograf?
try this without an accent and a fem voice
is it estonian language?
its romanian
so why you're talking about estonian?
BECAUSE THE BEST I COULD DO
that sounded close to it was estonian
i cant find any neutral voice model
who wont sound weird but all of them dont sound right
you need to start with an obvious - there's no romanian speaker in the list
why are you picking a random speaker instead of saying 'there is no romanian, what do I do?"
girllll bye its not my fault there arent neutral voice models who can just mimic the models accent
the accents come from the voice model
i tried with other ones and it sounded completely different like another language so the accents also matter
english audio + french speaking voice model = english audio with french accent
so if you want estonian accent, you need to train the model on estonian speaking audio
no u

start from the start and explain what you want in details
https://www.youtube.com/watch?v=oReykFI_yCM look this AI
no ty
i literally js did
explain like i'm 5 years old
okay go to therapy
go back to school 😛
how i can download the voice?
What to use to make ai covers? I have nvidia gpu
Ayo? @little robin level 1 !!! 
Does anyone know if there are any reasonably high quality voice models that were distributed along with their dataset? I want to test some training settings but I have no good baselines
there's not much magic is making a model that sounds crisp
I know, this is less about searching for "magic" and more just needing a solid baseline to compare against
get a reasonably clean audio (not too clean), reasonably long (mine's 45 min)
my source was a podcast downloaded from youtube
good mic, no boomy room echos, no reverb
Where do we post Ai-Art?
Well, mine is not stable diffusion
Alright then
Ayo? @vestal crag level 1 !!! 
I remember being here to make an ai cover for a song but I can’t find the ai cover generators
what tools would I use to make a shitty song sung and made by a friend sound halfway decent as a joke
why not perfectly clean
autotune then rvc of his voice ontop
seems to be that the perfect is the enemy of good
but the better the audio the better it will work
well, it has something to do with what the model is learning and how it is easier to do nothing than something
whatever the explanation for the silence was, I dont recall right now
but what if it's perfect with no silence
i have a model trained using the default noisy pretrain, the source audio has a faint fan noise in the beckground, yet it sounds crisp af
so magic, I guess
here's an example of audio quality that is just not good at all: https://www.youtube.com/watch?v=1fA8JUf_swU
terrible microphone
but what if no faint background sound and also no subsonic or ultrasonic frequencies.
in my understanding the model would learn to produce pure silence instead of something as complex as speech
with a faint noise the model is more adaptable to be used in real world scenarios where a perfect silence is rarely an option
it the output of such inference may be weird
so if the audios you infer have perfect silence, then sure, use the model trained with perfect silence
There is anyone really good at IA making here ?
I need one Image to my RPG, its a male character, I want to mescle 2 imagens to have this character
hii i have a cuestion i can use this voices to talk with my friends on discord?
Yes
-rt
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
@tepid basin scammer there
Ty
I have also got rid of his messages in other chans

'HParams' object has no attribute 'data'
hi everyone glads to be here
how safe are these models
horribly dangerous. viruses everywhere. watch out!
perfectly fine and anyone that says otherwise doesn't know what they're talking about
Be careful of the one with too many likes, could be riddled with viruses.
-rt
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
Cxsmo you going to get the 5090 when it’s released
do you guys know how to fix a sensitivity issue, where the voice changer is picking up sounds like my chair squeaking?
they are safe
they are all safe
this is for up coming projects https://docs.google.com/forms/d/e/1FAIpQLSdGzt2lun5dr2NyUkEW4Q7ApS_ZJBQb-7vuRotUQMIRC0qUuQ/viewform?usp=sf_link
probably not but next gpu for me will be a 5090
hi
I wonder how much faster the training process will be with a 5090
training?
Yeah, for training a model
Oh, I think you misunderstand me. I was saying I wonder how faster it would be on the 5090 with training a AI model
😄 i just joined i dont understand anything here. apologies
an AI model* sorry forgive me
can anybody online give me a walkthrought whats goin on here
Ayo? @rose flume level 1 !!! 
if it's twice as fast everything then it would be twice as fast but if not then if will be whatever percentage faster.
Ai rvc voice models
rvc? remote voice control?
can i make William Dafoe say : You'd better run.... BATMAN!!!!!!!!!!!
no stands for retrieval based voice cloning
you take a trained voice model and use it on clean audio that you want the person to say or sing and then it will sound like them
do you have anything ready to share?
-hf second link
Suggestions for @rose flume
- UVR5 UI, by Eddy and Ilaria Huggingface Spaces
- Ilaria RVC Zero, by thestingerx Huggingface Spaces
- RVC⚡ZERO, by r3gm Huggingface Spaces
- Applio, by IA Hispano Huggingface Spaces
- 🆕 FaceFusion UI, by Nick088 Huggingface Spaces
if it's trouble then do not, but i'd like to listen to whatever your working on
nobody has anything to share/ show-off?
@earnest dragon btw, speaking of noisy audio, inference of a noisy audio can actually clean it up and remove the noise
I ran the original audio thru UVR just to see what kind of noise there was
none of that made into the output
обезьяна
anyone want to play asia server dm me
Ayo? @visual hornet level 1 !!! 
Hello, we need your help guys, please help us to make our voices heard from your social media accounts, Turkish women need to live, we are trying to make our voices heard in all applications such as Twitter instagram with the hashtag #TurkishWomenNeedHelp , please be one of us, put yourself in our place and help us 🙏🏻
You mean 𝕏??
Ayo? @paper yoke level 1 !!! 
Is anyone using root Stable diffusion around here ?
what does bro think this is
lmao
no
guys are there available Laswell's RVC model from call of duty modern warfare 2022-23?
hi
Check #📰│dev-updates message
You can search rvc ai voice model at:
- #1175430844685484042
- #🔍│find-models
- https://weights.gg/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://applio.org/models
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.aihub.wtf/essentials/how-to-make-voice-models/
thanks!
NAME
main.exe
SYNOPSIS
main.exe COMMAND
COMMANDS
COMMAND is one of the following:
module_download
cui
rest_client
convert_file
D:\chenger\dist\main>
Anyone have a good website to download youtube videos / youtube audio?
Ayo? @clever verge level 3 !!! 
I'm doing homework and listening this masterpiece rn https://www.youtube.com/watch?v=II47GsODBSo
hiiii
y2meta?
Hi guys
How can i dowloand applio? (I don't know very much)
I'm glad if you're helping me.
I'm sorry if I make you busy
What's ur pc gpu?
You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
I dont have a pc, I didn't know this was required to download applio.
But ty for helping
Ayo? @wispy kite level 1 !!! 
there's cloud services you could use if you're planning to make ai covers on mobile
-hf
- UVR5 UI, by Eddy and Ilaria Huggingface Spaces
- Ilaria RVC Zero, by thestingerx Huggingface Spaces
- RVC⚡ZERO, by r3gm Huggingface Spaces
- Applio, by IA Hispano Huggingface Spaces
- 🆕 FaceFusion UI, by Nick088 Huggingface Spaces
Tysm i try it after👍
Wow very nice
дло
yo is there any way of using rvc with an amd gpu
directml build, or nvidia build with zluda patch
are you able to link me the directml build
hiiiiiiiiiii
Hiiii sonunda bi Türk buldum
I just arrived here, and I'm a little scared of downloading my first AI voice model.
o7
Documentation for a high-quality, open-source speech conversion ecosystem designed for simplicity and optimized performance
I don't know if it's safe to download, because I'm a bit paranoid about downloading things without knowing if it's safe.
Is there any AI website or program that is like a TTS but you can upload audios so it try to follow the rhytm? (like old uberduck ai tts used to do)
the programs are safe
Be sure to be following our tuts and not some random youtube tutorial
First of all, whats ur pc gpu?
You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
My GPU is NVIDIA GeForce 940MX. Is it any good?
Nope, not good for local (on ur pc)
You can use cloud tho (remote good pc)
What are you looking for?
- make models
- use models for pre-recorded audios
- use models realtime for calls/games
The 3rd one
Ayo? @delicate frigate level 1 !!! 
Anyone have good ways to remove intrumentals from vocals thats free?
For Realtime Voice Changing for Calls on Cloud (remote good pc for those who don't have a good one, YOU CANT DO THIS ON MOBILE):
- Google Colabs (4 hours daily of free T4 gpu, easy to use, require only a google account) :
- How to use Original W-Okada's Voice Changer Google Colab
- Modified W-Okada's Voice Changer Google Colab
- Kaggles (30 hours weekly of better GPUs, T4x2 & P100, harder to use, requires an account and a phone number)
- Original W-Okada's Voice Changer Kaggle
- Modified W-Okada's Voice Changer Kaggle
Id suggest kaggle for way more hours
I'll try Google Colabs. Thanks!
I remember u said u got an rtx 3060 12gb, so you can do it locally, use UVR (Ultimate Vocal Remover): https://docs.ai-hub.wtf/rvc/resources/vocal-isolation/#local-uvr
Last update: Feb 29, 2024
id suggest kaggle, but yw
Thanks so much!
yw
Is there for the AI Training Folder, a certain amount of training i should use? Or is it one of those things where the more i have the better it sounds?
I have a data set of around maybe 3 hours but i think it may be to little so im getting more is that bad?
3 hours may be a bit too much, quality and variety of content > quantity
Gotcha so i should narrow it down to bits that are clear then?
good clean-ish audio (may have some background noise), preferably good quality mic, no reverb, no echo, no boomy room
noted
the model inherits the general feel of the audio, if someone records with a shitty mic and bad quality, all results gonna have the same shitty quality
Im currently spliting vocals and background noise of everything then ima manually edit out bad parts and stick with the soilds in Audicity.
Im training for a "Documentary Voice" for a personal project so i downloaded a few documentarys that have good voices and going through them.
but if you infer a shitty audio using a good model, the result is generally good
?
even 10 mins are good for an rvc model training, over than an hour is not needed
Noted
it will be worse than say 30 minutes, and 30 min will be lil worse than 60, but an extra quality from more than an hour probably not worth the time spent training
lets jus say, even 5 min will be generally recognizable as a specific person speaking, but very poorly
similarly with the duration of the training - the model captures most features during initial 10-20 epochs, after that it is just small improvements in quality
10 minutes sound unnatural vs 30 minutes and above
what matters the most is quality than quantity
though it is significantly, significantly easier to gather a large volume of good enough data than a small volume of nearly perfect data
you should remove reverb and other post-processing effects/distortions/artifacts, if you can't remove it like e.g. poor mp3 compression (don't even expect Apollo perfectly restore it) and spectrogram cutoff below 16khz, exclude those parts and find other good sources
i am kind of curious about this, though. i gathered about 15 mins of voice data and trained it for 100 epochs and it's definitely not done. some of the fundamental aspects are definitely there, but they're not mapped well enough to accurately recreate the trained voice
even if the features are initially picked up quickly, it still seems that you have to train a fairly significant amount of time to actually map them out properly
hi
see the tensorboard's charts
the usual - fm, mel, kl, total d and g
if it is a language vastly different from pretrain, then you may need more data
Ayo? @chilly lake level 16 !!! 
other than not enough epochs, it might be due to inconsistency, i.e. different sources, different EQ profile, roleplaying other voices, etc.
Hallo! im a new user with no GPU and no idea what im doing does anybody know of any non GPU voice changers that i can use through call?
the roleplaying other voices thing is probably a part of it. i'm sure it's just not enough epochs
though the loss was starting to rise toward the end, which could be on account of the lr schedule or just actual divergence
tensorboard splits everything into like d_g/1, d_g/2, d_g/3 etc and i'm not quite sure what that means
the voice would blend, so you shouldn't
i'll cut that section out of the audio to be safe
Ayo? @vernal abyss level 4 !!! 
how about excited utterances / gasps/ screams etc? remove them too?
i'm using a rip of some game character voice lines, so i've got everything rn including non-dialogue
what is the most reliable realtime voice changer at the moment?
w-okada is a virus according to my pc
pls help
Ayo? @tiny pewter level 1 !!! 
Ilaria rvc doesn't work anymore ?
can somebody help me in dms?
What is a epoch and how important is it?
epoch is one cycle of training
a model has to go thru multiple cycles in order to reliably reproduce the desired result
Whats a good amout of EPoch?
What's the best budget card (450$) for realtime voice-changing
depends used or new


AI HUB Docs

