#✨│ai-help
1 messages · Page 202 of 1
Will my audio sample model lose some of model's voice if I set index to 0?
it's really hard for me to wrap my mind how can accent(pronunciation of sounds) be separate from the actuall voice. I mean I understand that a person can have one voice, yet use different accents, like American or British. But how come pronunciation of sounds is not behind the person voice. Is it then pitch? Tember?
If accent is shaped by our mouth, then voice is shaped by our throat? And AI can distinguish it?
if you use index to 0, it will change the model's accent exactly like the inferred sample
on higher indexes, it will try to change the accent exactly to the model
idk bout the other question tho that's outta my league
Wrong channel, use #🔍│help-w-okada
Don't use that software nor yt tuts
What's ur PC GPU and what u looking for
That's old, and use #🔍│help-w-okada
Hell yes
@river smelt #🧬│ai-chat message this is the right channel for inference on prerecorded audios
tell me ur pc gpu
I have laptop, but not with gpu. It's Dell Latitude 5285.
- Cloud (remote good pc, easier and faster than ur PC but it's limited):
- Ilaria RVC Zero: fastest and simplest that you can get for free
- Weights.gg: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio Colab: max 4 hours, not granted, of GPU
Easiest possible (automatically separates vocals & instrumentals) : weights.gg
easiest cloud: Ilaria rvc zero
Your PC is not that good for it locally, it could run on CPU, but will be very slow, I would suggest to use cloud since you got integrated gpu and only 8gb of ram ddr3
You can run an RVC fork program locally on your laptop with only CPU, but it won't be that fast. I tried it with my Dell Inspiron laptop. 
What do you mean very slow? It does make the result of audio become bad?
Thanks for this. I'll try it now
Uh.
8 GB DDR3 in 2025
Whether if it's slow or fast, it has nothing to do with the quality of the output audio, unless you did some settings on RVC.
I planned to make cover voice ai from beginning, not the RVC
RVC isn't like W-Okada the realtime voice changer, where the speed of CPU or GPU matters to the quality of the output audio.
Wha-
Thats already alright...
Misunderstand
In the conversion
So, use models on pre-recorded audios or for calls realtime?
Why'd you ask me like that? You'll need a "Line In" or microphone recording program or use the built-in monitor recorder on W-Okada to record its output audio.
Is their a voice model for "suisei" in blue archive, im still a noob anyways on this shtuff
You mean Sensei?
I found some on Weights.
No, i didn't mean that. I mean ai can cover audio from music.
Oh wait suisei is not in blue archive mb
So, that's mean I've never heard of this girl name.
Its a vocaloid i think idk a fellow filipino challanged me to ai cover an opm using her voice:|
To download a voice model, go to #1175430844685484042. To find a voice model, use Weights bot in #🔍│find-models.
Yip
nice!
Brvuh
Wait... since theirs no rules stating age, am i allowed?
Since in some servers i get banned and sheesh without an explanation...
and this is an alt acc
how old are you?
Disi-singko...
@low shard underage user (minor of 13)
Im cooked then
If you're under 13, this can get you banned everywhere. Huh.
Eh im 15 but some servers i get banned without a single explanation even though there are no rules stating a specific age allowed in the server im just gonna ask now if under 18 is allowed here.
Using a Discord account at such underage is against Discord's Terms of Service.
why would you get banned if youre 15
What age?
13?
This number of age should be fine.
Idk, i made this alt for that server and i just got banned again, though the server is just a community server for a vtuber(yes im a dweeb)
Hmm i see, lets end this convo before it has the potential to become some political sheesh
what
Cuz in some servers im in, the trivial matter were talking about just boils down into a political argument.
Though that will not happen here ig
Wait what
15 or under 13 then

Misunderstanding or all what i said was vague?
Something like this can make me and anyone here worried.
Alright
why not?
I'm on a ryzen 5 with a 2060
bc it's very very old, and youtube tuts aren't updated
you're looking for inference (use models) on pre-recorded audios right
I'm looking for something like that RVC converter. Something simple. This pne is pretty much drag and drop. I like it cuz it's easy to use and convert.
You can:
- Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides):
- Cloud (remote good pc, easier and faster than ur PC but it's limited):
- Ilaria RVC Zero: fastest and simplest that you can get for free
- Weights.gg: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio Colab: max 4 hours, not granted, of GPU
Easiest possible (automatically separates vocals & instrumentals) : weights.gg
easiest cloud: Ilaria rvc zero
easiest local: Applio
I'll go locally. Thank you. I will try the applio and see how it goes. I kinda wanted to update the other RVC anyway
Is there a way to keep the libraries and whatever it needs within the pkg folder?
I don't want to lose it again.
U can get the precompiled one
Microsoft: UPDATE RIGHT NOW
how many samples at the time the model will learn
Thanks for helping me
a safe value is 8
I always leave it at 8 or 9
not 9
perfect you fine
any batch size works, but i agree that 9 is a bit unsafe
How do I get model singers to sing falsetto perfectly without that screaming sound?
I've got an issue with my rvc, For some reason its not working on discord i hadnt messed around with the settings much but one day it just started not catching the voice anymore, anybody else got this issue before?
Can someone help me rq? I'm using the voice changer on discord but its just speeds my message up and makes it crispy
How do I make applio link public?
Adding --share but not sure where
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- FaceFusion UI, by Nick088 Google Colab
- FaceFusion NO UI, by Nick088 Google Colab
- EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
you Can use #🤖│bots for commands
well this also working fine as of now
i mean use correct channels
Hello everyone! i'm currently trying to set up google collab w okada, but i have no option to connect it to google drive, i need to connect it to google drive for everyday use. please help, thank you.
is someone online?
uh
how do you make the output and input so you can use the voice changer in roblox?
also ngrock is not working for me but the other option is dissapeared too
I was wondering if someone could vc with me and help me
[Errno 2] No such file or directory: '/content/voice-changer/server'
/content
ModuleNotFoundError Traceback (most recent call last)
<ipython-input-1-e49dd8a37163> in <cell line: 24>()
22 get_ipython().run_line_magic('cd', '/content/voice-changer/server')
23
---> 24 from pyngrok import conf, ngrok
25 MyConfig = conf.PyngrokConfig()
26 MyConfig.auth_token = Token
ModuleNotFoundError: No module named 'pyngrok'
NOTE: If your import is failing due to a missing package, you can
manually install dependencies using either !pip or !apt.
To view examples of installing some common dependencies, click the
"Open Examples" button below.
gonn ask again since nobody really answered but I've got an issue with my rvc, For some reason its not working on discord i hadnt messed around with the settings much but one day it just started not catching the voice anymore, anybody else got this issue before?
better if you ask in #🔍│help-w-okada if you are talking about Realtime voice tool (okada)
Oh i thought i was using the correct channel since this is rvc
Alright
nope its just unfortunate thats the same abbreviation, rvc is retrieval based voice conversion
I'm trying to convert a lot of small files but I seem to be getting rate limited, is there anything else other than https://huggingface.co/spaces/TheStinger/Ilaria_RVC I can use? or is there another one I can use at the same time to double my conversions?
or I guess the other question is...can batch conversion be done with the same level of quality as that link
Is it generally safe to make the Learning rate lower for small datasets?
Hey guys quick question here. What's the fastest way to train in cloud? Paid Colab? Wich gpu? do we have some info about that?
free tier kaggle and lightning ai are the fastest
for paid options no idea which one is the fastest, paid colab has an a100 which has decent speeds
maybe but remember that a lower lr requires training more epochs than usual
depending in how much u decreased it
runpod has h100s
paid tho
ty @analog obsidian @crude flame gonna check those later :3
The model I did train sounds pretty decent as-is but the total loss for the generator (I think) consistently stayed at 41 and the graph was really inconsistent so I'm just wondering if I either need to just train it more with the current LR or lower it more
I noticed that when training there was something about Multi speaker datasets but idk how that works or if I can even get the model to output the right voice if I do try that
how to get this rvc v2 thing
thats just because the dataset is small, a lower lr is not gonna make it better
/??
i dont get it
i just want the best local software that i can run locally which works with msot of the models in #1175430844685484042
I just use colab.
for simplicity use the compiled version https://huggingface.co/IAHispano/Applio/blob/main/Compiled/Windows/ApplioV3.2.8-bugfix.zip
thank you 🙏 does it use my gpu and cpu? i want it to use both
yes, uses your gpu for converting audios
is there any special way i have to set it up or is there a file i just run which will do it for me?
Isn't it called run.*
nope, just unzip the file in a folder and use run-applio.bat
place your models in the logs folder
What can I use to make AI covers of songs live? does anyone know
uh does this support real time?
no
ok welp what does support realtime
who knows how to help me? I've been trying to fix this sense 11:00 am
ita the wrong pth
ok where would the right one be?
show folder of the weights
lets go to #🔍│help-w-okada
For overclocking my gpu is core clock or memory clock more important for rvc
no
its useless to overclock for rvc
voicechanger or normal rvc?
normal
you'll get 1% performance increase and within error margin
I have a 30 minute audio that actually takes a whole hour and that's a long time so I want to use a graphics card
then no, check online options
That doesn't answer the question.
split it into 5 minutes each or use split audio feature in applio
You might either mean the RVC if you wanna train or inference a 30-minute audio.
yes but he have an amd carf
inference
Yes, inference. I did not understand correctly at first. I sorry.
the gpu doesn't matter but that's recommended to do
note that applio supports amd gpus
for inference you can use Applio
really?
Thank you Do you think this will take less time right?
I have a processor Rayzen 5 3600
and a graphics card RX590 8g
It is possible to inference a single 30-minute audio. However, it would be taking too long to inference like that.
it could run OOM without splitting
I can make a Python script that splits a 30 minute audio into 5 minute parts.
better do it manually without cutting voice sentences
Sorry what is oom I'm new to this
OutOfMemoryError
yeah
If it was run on colab, would it be better? I want the shortest possible time.
colab is not reliable for this
Inferencing a longer audio duration, it would be better to done on a super fast GPU with more VRAM.
For free version of Google Colab, no.
Well thank you
you helped me
Hey, ! JOE! Please use the command !howtoask to increase your chance of getting help by structuring your question in a way others can understand better. Also make sure you're asking in the right help channel:
- General RVC help: #✨│ai-help
- W-Okada / Realtime RVC: #🔍│help-w-okada
- AI image related: #🔍│help-ai-art
However, the paid version like Google Colab Pro, you will get more RAM and GPU VRAM.
actually a voice sentence couldn't be more than 1 minute and that doesn't take much vram to infer
and you just need to denoise and noise gate before doing so
In fact, I am converting the audio of a lecture video to another audio model. The lecture is on average 3 hours long. I know that it is crazy, but I am trying.
@humble cobalt After trying several times, I found out that the amplification threshold is the volume of the primary stem, and the normalization threshold is the volume of the secondary stem. I changed the default values to these, compared the SDRs and these values seem to be fine, since as I told you very high values introduce noise.
again just do denoise and noise gate, then split it into 1-3 minutes each esp if you have 4 GB vram
try restarting the training, its stuck it seems
you should loose much data
just re-input everything like before
RESTART
Not a real Windows XP. 
WAT U MEAN IT TOOK ME 10 HOUR
Calm down.
and i have to do it all again
it will restart from the latest checkpoint dw 😭
oef thank god
how do i restart
Or you can wait for it to finish again, anything you want.
yes
doesnt even do nothing
just keep continuing
i just wanted to have a rvc deep voice for trolling on roblox
its for content
Wait for it a while, it's kinda slow to suspend the training process.
but i litterly dont understand how this works at all
@proven hill by any chance do you have a deep voice rvc model to troll with
ayo
actually yes but its paid
Nuh uh.
paid my bum
mines 
It would be a shame to go for e-girl, like fucking shame ass. However, if you go for a voice model of a real man voice, I'd leave it go anyways.
yes
oooh no am crying
do u have a free one
i mean its a market 
how much did u pay
free mhh
theres something in my shop free
I'm sorry, but I don't make a voice model.
i dont pay, i make them
well vat is it
You can look for a voice model from #1175430844685484042. To #🔍│find-models, use Weights bot in #🔍│find-models.
dude i was camping there 24 7 only see playboi carti
check link in bio
Hi, my name is Ilaria and im a 25yo girl from Italy 🇮🇹
i dont see no link
nvm i do
srry
😭
Boo. 👻
all of em bad
most of em playboi carti
am very sad
my training completly failed
did you checked weights.gg too?
every website u name it
train on Weights.gg
theres not much market as “male voices” tbh
u litterly have it
If you're looking for a generic voice model used in a commercial-type ahh website, then no. AI Hub by Weights doesn't have a voice model like that.
but it over my budget
its impossible theres nothing for free
ow man
furry
its corpse
Skibidi Toilet. 
regarding?
for high five?
I like anime women. 
trust me you can find it for free
nos i cant pease can u find me a good or give me own of yours 
try finding an eboy voice
they are alll badd
i wanna hear the one of yours
wait matter fact
this is what i want https://www.youtube.com/watch?v=SGFxMxxUboE&ab_channel=ItsKashB
theres the demo
oh ok but can u see like what i mean in that video
this one like am tryna find that but it wont say nothing
unlucky ig idk what to say anymore
Hi how many epoch is actually good? I set my epoch to 1000 and it has been 1 day already i am at 385 epoch lol
and my voice source is only less than 2 min like 1 min
Would it be bad to stop my training at 500 when i set it to 1000?
epochs ≠ quality
so is not quality?
no, the quality is made by the dataset mostly
the epoch is how many times the file is being trained on
Alright is there a way to enhance the trained model
you cannot improve an already trained model
ok thank you
If the model sound
is quite low
when speaking is it possible to make it louder
like the pitch
do i need to use another software then? or ai tools
interesting question, i think yes that you need something else
afaik theres no option to be louder in okada
no, kits is bad
what do you prefer?
how do i get rid of popping bubble sound or static
i mean, im ilaria, the creator of ilaria rvc, so id say ilaria rvc 😭
yo guys can someone help me pls? Running with the system Python.
python.exe: can't open file 'C:\Windows\system32\rvcgui.py': [Errno 2] No such file or directory
Press any key to continue . . .
how can i fix that
Hey, levangabriel! Please use the command !howtoask to increase your chance of getting help by structuring your question in a way others can understand better. Also make sure you're asking in the right help channel:
- General RVC help: #✨│ai-help
- W-Okada / Realtime RVC: #🔍│help-w-okada
- AI image related: #🔍│help-ai-art
could be a model problem
youre using rvc gui?
yep tryna install
Llaria when done training do i need to press stop training or go to the settings menu and restart applio? How do i extract my pth files and other files?
use something else like applio
wait the train finishes
but is at 1000 epochs
lol
i think at 500 is enough
then stop training
👎
White noise was the only voice model I've ever trained with this shitass site. 
import gradio as gr
ModuleNotFoundError: No module named 'gradio' ☠️
did you set up everything correctly?
LMFAO
oh theres some warnings :d
Never use python -m pip install into a Python program if you don't know what you're doing. 
im studying ce bro no problem i fixed it <3
-gui
better try download applio precompiled, so u dont need to worry about pip install shit
If you don’t finish all the epochs sessions
Will there be negative effects afterwards?
I set it to 1000
If I finish it at 500 is there a downside?
no
Thank you
Guys, does anyone know how to download a female voice changer and how to set it up?
-rt
Interaction has expired, use the command again for a new interaction.
first link
is rvc or w-okada bettter?
its two different things
what are you searching
wdym
depends on what you need
girl voice changer
RVC is the audio conversion program. W-Okada is the realtime voice changer program. These two programs have different usages.
oh ok
A program that works better than RVC-GUI is Applio. A program that works better than W-Okada is Deitris fork W-Okada. Both programs can both run locally and on a cloud service.
Which audio is primary and which is secondary and what does SDRs means
U can check model's stems on leaderboard tab by filtering by vocals, instrumental, etc. Primary stem is showed on Stem 1 on gradio interface and secondary stem on Stem 2. Maybe a few models don't have their stems listed cause I'm still working on that cause there are too many models. About SDR..
Hello again guys
I heard that it is necessary to add a silent space in the dataset, is this true?
No
It don't make any sense
its in the logs folder
did you train it?
yeah
uh
is it important?
what did you use to train?
applio
idk if it was somewhere else then
model pth is in the weights folder
there is no folder named weights
oh
assests -> weights
there is no weights in assests
You mean assets? 
you know how to find index?
file
you can run 'create index' any time
not the full training
there is nothing in logs folder i swear
then it didnt traint the index
with the equipped model in the interface?
yah looks like
do i just use the model?
that i trained
to make the index then?
Do you mean like some custom pretrained models?
go to training tab, select your trained model, then click create index
just do everytjing like if youre about to train the model but just press “train index”
if you deleted the sliced audios and extracted features, you'll need to re-do those steps
what if i closed the program
just re-open and do as I said
would the index have affect since i did not created the index from the start?
no
Faiss Integration (.index file): The Faiss library enables efficient approximate nearest neighbor search in RVC during inference, retrieving and combining training audio segments with closest embeddings. For your final RVC model, include the one which the file name starts with added.
Example:
added_IVF157_Flat_nprobe_myModel.index
The train index button is "Generate Index"?
i did'
whats the differences lol i know that the last one is stopped before being processed
the first one is fully finished
original
there isn't a best one
it depends on ur dataset language and lenght
usually original (which has been trained on english) is the most used one
Thank you do you know where or best place to test your model/.pth file
Any website?
Or any ai tools that don’t require so much effort to download
Thank you agian Dr Jr
Your welcome again @polar ridge
I got an ASUS Vivobook S 14 with Ryzen AI 9 HX 370 on board.
Still no such software, that uses NPU for processing AI voice changing?
Not something I bought my laptop for, I'm just curious, if there's anyone, who made it to work it on CPU's neural processing unit all of the sudden for the last 6-7 months instead of iGPU, when I've been considering getting one for myself.
Starting pitch extraction with 4 cores on cuda:0 using rmvpe...
0%| | 0/1 [00:00<?, ?it/s]An error occurred extracting file /kaggle/working/program_ml/logs/riffy_v3/sliced_audios_16k/0_0_0.wav on cuda:0: CUDA out of memory. Tried to allocate 1.93 GiB. GPU
100%|█████████████████████████████████████████████| 1/1 [00:04<00:00, 4.54s/it]
Pitch extraction completed in 10.34 seconds.
Starting embedding extraction with 4 cores on cuda:0...
100%|█████████████████████████████████████████████| 1/1 [00:00<00:00, 2.18it/s]
Embedding extraction completed in 5.08 seconds.
An error occurred extracting the index: need at least one array to concatenate
If you are running this code in a virtual environment, make sure you have enough GPU available to generate the Index file.
Not sure what to do
depends on NPU
slice the audio
It's a Ryzen AI with 50 TOPS.
I also happen to have a mini PC with their older R7 7840HS, though it has 5 times less the NPU performance (only tops out at 10 TOPS in theoretical tests), and I doubt it will suffice for such a task.
if it is AMD, expect pytorch on linux supporting it sometime in 2035
yeah, but is there any RVC software right now, that can take advantage of CPU's NPU?
voice changer with onnx maybe?
I don't think any software is capable because no one knows whether it's possible to choose NPU device
There're examples for DirectML in CPP, but none in Python. And no one reported whether python directml reports NPU device
And I think ryzen ai series use Vitis AI and separate execution provider, so it won't work with directml
Hello ! I have install Applio on my pc on local and I would like to share for my friends but i don't know where a create a public link with. "set share=True in launch()."
go in the applio folder, at the search bar of file explorer, write cmd, then in cmd write env\python.exe app.py --open --share
-# guessing that you're on windows and used the precompiled downloaded from HuggingFace
@simple ore btw adding another bat or another way to easily do this would be cool, just a suggestion
Was just saying for newbies to do this easier
dont think it makes it available on public internet, does it?
It's the gradio share link so it works
It's the same way used for the Colab
But if you don't think it's useful,ofc u can disagree, I didn't mean this suggestion for myself nor going to force anyone
Hey I wanna make my own model soon I’m currently working on making the vocal batch. I wanted to ask how much all vocals should amount to, as in referring to time. I have a 40min studio session I want to use and I think I can get about 15 minutes of full vocals from it all ranging in different octaves
Nope, NPUs are basically never supported anywhere,
The only project I have ever seen support it was FastSDCPU iirc
-# from what I have seen personally
Where can I download applio that uses refinegan? It's not in version 3.2.8...
thank you i found it
how is RVC still not AMD supported, I come here every 6 months I feel like
and I am once again disappointed
(on windows)
-COLAB
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- FaceFusion UI, by Nick088 Google Colab
- FaceFusion NO UI, by Nick088 Google Colab
- EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
You can run Applio on AMD GPU, ideally 5700+
6600xt
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- FaceFusion UI, by Nick088 Google Colab
- FaceFusion NO UI, by Nick088 Google Colab
- EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
@low shard what do accent mean on Illyria zero like that setting? Is it like how accurate you want the voice to sound like them or?
when i upload my audio on applio, the runtime is 0:00 to 0:00. would anyone know how to fix it or know why it's like that?
its when
sorry?
dont say sorry
can you continue pls
Feature retrieval rate basically indicates to the AI model how close or far off from the prompt/image it should work in terms of inference which dictates how close to the subject in input you want the output to look like.
If you need more sophisticated answer, it controls the usage of feature index.
You see, when one trains a model, model learns features extracted from the voice ( it's own ) but, for simplicity I'll refer to features as " accent " or " way the voice articulates, stresses " certain phonetics etc.
Basically, if it's 0, model doesn't use the features from own index, heavily relies on what it can retrieve from the audio you use for cover ( inference ) and similarly but reversed, if it's 1, it'll use all it can from own index.
yet another take on it is:
see it as " strength of searching for own voice's features that'd be closely matched ( if possible ) with the audio "
Note: Poorly trained models or those trained on limited data will struggle with high index ratio; can manifest as artifacts or jammed / jittery pronunciation or phonetic handling
if something like that happens (overtraining) where should i stop/export model
How can I use a zip for training dataset instead of a single audio file on Applio Colab?
you cant
Aight lemme boot up Foobar again
unzip it first to a new folder
Doesn't it take a single file input?
you can have multiple files
The index value, which higher the value more accurate is the accent
upload failed to import no module name utils
can you help with this?
Hey, Sophie! Please use the command !howtoask to increase your chance of getting help by structuring your question in a way others can understand better. Also make sure you're asking in the right help channel:
- General RVC help: #✨│ai-help
- W-Okada / Realtime RVC: #🔍│help-w-okada
- AI image related: #🔍│help-ai-art
Wrong channel, Wokada is different than RVC, tell the link tutorial you followed in #🔍│help-w-okada
Use an entire zip to train? No, Applio doesn't read zip a file to train like that. You must extract its files to a dataset folder.
help me install rvc
-docs
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
Which gpu do you have
And vram ?
ram?
No video RAM of your GPU
Go to task manager and open performance
its useless to know the vram its good enough anyway
16 gb
Okay.
Go it
im install virtual cable
You can try applio
RVC is not Realtime voice conversation
what are you looking for? what you wanna do?
If you want realtime conversation go to #🔍│help-w-okada
RVC is the audio conversion program. W-Okada is the realtime voice conversion program.
Do you need a virtual cable?
Bruh.
^^^
ok
-realtime
Interaction has expired, use the command again for a new interaction.
im install realtime voice changer in disk c
So many times people keep mistaking W-Okada as RVC. Click on the first link for Fork W-Okada, this fork W-Okada runs better than the second link.
windows 11 work?
Any Windows version works, but Windows 10 is the minimum.
ok
start start_htpp.bat?
Damn. I said click on the first link to download fork W-Okada, not the second link. Did I say it wrong?
blat
Let's be real. For more information about using W-Okada, go to #🔍│help-w-okada. This #✨│ai-help is about RVC programs.
Anyways, screenshot your process at #🔍│help-w-okada.
I ran it as explained in this link and succeeded in recognizing AMD.
RuntimeError: CUDA error: operation not supported
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
Compile with TORCH_USE_CUDA_DSA to enable device-side assertions. An error appeared. Does anyone know about this error?
card?
Did you use the NVIDIA version instead of DirectML?
Let me check in a moment.
are you using an AMD card? try follow the instructions carefully, including HIP SDK stuff
I had no issues wit 6700xt
AMD card? What is this?
you also ened to post a bigger piece of log, not just cuda error.. I need to know where it happened
I am also using 6700XT
How can we find out? Would it be better to start from the beginning?
hip sdk 5.7, these libraries https://github.com/brknsoul/ROCmLibs/blob/main/Optimised_ROCmLibs_gfx1031.7z
Should I download that?
Oh, I downloaded that too, but it still doesn't run. How can I fix the TT CUDA error... I think I'll have to try it from the beginning.
show a bigger error message
I will check and let you know. Thank you...
After checking, there are no other errors other than that. The AMD graphics card is also recognized well.
Thank you all for your replies. Let me start again from the beginning. I may have missed something.
YES
DEPRECATION: omegaconf 2.0.6 has a non-standard dependency specifier PyYAML>=5.1.*. pip 24.0 will enforce this behaviour change. A possible replacement is to upgrade to a newer version of omegaconf or contact the author to suggest that they release a version with a conforming dependency specifiers. Discussion can be found at https://github.com/pypa/pip/issues/12063
Installing collected packages: torch, torchvision, torchaudio
ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
torch-directml 0.2.5.dev240914 requires torch==2.4.1, but you have torch 2.3.1+cu118 which is incompatible.
torch-directml 0.2.5.dev240914 requires torchvision==0.19.1, but you have torchvision 0.18.1+cu118 which is incompatible
Is this error a problem? There were no problems running it at all.
I think you are using this. Is there a separate amd version of the 3.2.8 bugfix version?
no
Even though I use that, a cuda error appears TT
use the compiled version, there's nothing to install (nothing that could trigger "DEPRECATION: omegaconf 2.0.6 has a non-standard dependency specifier PyYAML>=5.1.*. pip 24.0 will enforce this behaviour change. A possible replacement is to upgrade to a newer version of omegaconf or contact the author to suggest that they release a version with a conforming dependency specifiers. Discussion can be found at ")
make sure you replace torch as the guide explains
make sure you patch in the correct zluda version
make sure the path to HIP sdk is present in the environment variables
I will try it all at once. Thank you.
why does rvc use the card so much
because yes
On 4090 40%-60% while using it
Voice changer
wdym by that?
Rn im using smth with 72 or 73 i think
send screenshot
Interaction has expired, use the command again for a new interaction.
download the new one from the first link
Im using this
yea its old
With this setup can i get good ovr performance?
ovr?
yes you can ofc
pls discuss your topic in #🔍│help-w-okada
that spec is overkill already for the usual valorant etc. use cases tho
though 4090 should be more suitable for those who train models and flux loras
Idk what does it even mean
woman stuff
Ah
@viscid moss Separation by Link is not working I'm UVR UI (Local) How to fix it ?
Can u show me the error on CMD?
If the video has any age, region or similar restrictions you will not be able to download it.
i try with multiple videos but it not worked
checking...
That video gives me an error too
I tried 10 random videos from YOUTUBE and all of them worked except the one you are using, I think there is some restriction. I even tried that video lyrics version and it worked for me
I'll try to figure out what's going on anyways, to see if I can improve the video downloader to avoid restrictions
okay. lemme try with other videos
if the original one isn't working try looking for lyrics version or re-uploaded versions
okay,
I have a doubt, will it download audio in best quality?
yep, download the audio in the best possible quality and in 32-bit float WAV
okay what about sample rate ?
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- FaceFusion UI, by Nick088 Google Colab
- FaceFusion NO UI, by Nick088 Google Colab
- EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
italiano?
It depends on the source, I think most of the times 44100. I'm also not quite sure if all UVR models will work with 48KHz or 32KHz
emm okay. thank you for information
ur welcome
ur welcome in my house

si
come ti posso aiutare?
hru eddyyy
grazie, nulla mi avevi già aiutato mesi fà, ora so come fare
ah perfetto 💖
era molto comodo il tuo link del nuovo aggiornamento su yt, peccato perchè io che ho un pc lento e uso solo colab, mi sarebbe toranto utile, ora uso applio che però è lento ad aprirsi, e molte volte si blocca e devo riaprirlo, spero ritornerà un tuo aggiornamento su yt
questo solo per testare la voce finita
non lo so.. nessuno è più interessato ormai
ah peccato
io lo sono ancora, e anche molto, lo uso per cover e anche per doppiaggi di serie tv fan made
*uso
carino
anche se per fare dei doppiaggi perfetti aspetto che tortoise tts aggiunga l'italiano e che ci sia una versione si colab
tortoise se non sbaglio non è più supportato
ah non sapevo, io ogni giorno controllo su yt se esce una versione italiana di tortoise tts o di una tts decente, avevo già provato eleven labs, ma non puoi decidere le pause nel testo, non puoi scegliere le emozioni, e parla troppo veloce per un doppiaggio
grazie
di nulla :)
@viscid moss I tried it with more videos but it only worked with 2 video. Out of 10
Coding stuff :3
damn. For now if it doesn't work try downloading from a web page or something like that. I'll check it later, I just finished fixing the UnicodeDecodeError and I'm publishing the update
how do i update ??
That's related to yt-dlp (audio downloader) not related to separator so dw so much
and after updating can i disable that thin after updation
running the updater
now ? or later ?
I tested that on my end and seems to be working without that thing
okay.. thank you.. im bug reporter for you 😅
fr ngl
what does it mean ?
Certainly. Not gonna lie
doing batch conversion and geting this problem again and again it dont process few audios
show me ur input and output paths
i closed because i got frustated but the paths are input C:\Users\The Beast\Desktop\DIN Output C:\Users\The Beast\Desktop\NoOUT
@viscid moss
I think the problem is the audio path, it's outside the UVR5 UI folder. Try using the inputs and outputs folders inside the UVR5 UI folder.
Other folders may require administrator permissions to write files to them
Also how can I select stems. Like I don't want noise output or reverb only output ?
when i try running rvc
I tried using an audio on the desktop like you did and it worked for me, I recommend you try placing the audios in the inputs folder of UVR5 UI and placing the outputs folder in the output path, it is not necessary to put the full path you can use: ./input and ./outputs. You can also rename the audios to a shorter name.
If it still doesn't work it could be because your Windows user contains a "The Beast" space but I'm not entirely sure.
literally invalid address, it should open localhost address, not like that
also for greater good, stop using opera gx
i changed it to Admin but idk why it still show The Beast
On Output Single Stem on Advanced Settings u can write the stem u want, but i haven't tested with all models. May not work with some of them (the new ones)
its dont work on denoise model. sometime
Which denoise?
Well... u can't change that username 😅
Have u tried the another workaround?
LOL
emmm Why ? why i cant im admin of this PC 😅
whats so bad about opera gx
^
Mel-Roformer-Denoise-Aufr33
whats the best denoise now?
idk.. just using randomly
who else is getting the 5090
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- FaceFusion UI, by Nick088 Google Colab
- FaceFusion NO UI, by Nick088 Google Colab
- EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
I mean, yes, you can change it, but it's a bit complicated, and it could cause problems with Windows. So I really don't recommend changing it.
does this actually approve speed still if i am doing a 20-30 min dataset on a 4090
not much
but with 24GB, why not?
stems for that model are "dry" as primary stem and "other" as secondary stem
i saw a tutorial on yt he changed his name but it my case it shows admin on lock screen but when i open CMD it show old name "The Beast"
what ??
I mean if u want the No Noise Stem of that model u should put "dry" on the single stem output thing
okay. and is it a good model which im using for de noise
Lemme check the bible
ye seems like
i sent something to you in DM pls check it only for one time..
everytime i try and use the Ilaria RVC and click convert, it keeps saying Error and GPU task aborted
its a glitch with huggingface 😩
try with a shorter audio
should i change any settings while man to woman vocal?
only the pitch
okay zero to what
12
is -12 enough?
-12 is female to man
no probs
My RVC keeps crashing, and I have it set to these settings. Any way I can fix it?
Okay, nvm I fixed the issue, but now I can't hear my voice
this is ANCIENT
use the new version
-rt
Interaction has expired, use the command again for a new interaction.
first link
Oh, alright.
i have problem w voicechanger, i hear my voice but i dont hear it on discord
i havent made an ai model since google colab banned gradio is there a good gui i can use
what gpu do you have?
nvidia gtx 1650
okay then no local options
use rvc disconnected
tho it doesnt have a ui
yup
alr
thx
Preprocess completed in 0.01 seconds on 00:00:00 seconds of audio.```Why am I getting this on Applio Colab
The dataset is a .flac
uh strange
I'm looking to buy Colab Pro, but I'm wondering if I can use RVC without running out of resources for a month, has anyone tried it?
Btw have people considered making cloud port on Github Codespaces? Wonder how it compares to Colab and Kaggle
i dont think rvc supports flac
eksdee
if it does just change it just incase
Why is that
I don't need to change anything on the copied path right?
Did I mess up 
-realtime
Interaction has expired, use the command again for a new interaction.
I bought Google Colab Pro for two months in 2023. There will be three GPUs available for Pro: NVIDIA A100 and L4, and now newly added v5e-1 TPU. However, these three GPUs eat more compute units than CPU, T4 and v2-8 TPU, even with no code going on when a Colab notebook is connected. The longer you use one of these GPUs for anything, the more compute units will be used for that.
Hey everyone i'm using realtime voice changer for the first time. I can hear my own voice but without any voice changes just my regular voice. How do I fix this
And supposedly, your compute unit count will be shown here on how much of it left before running out.
For W-Okada the voice changer, go to #🔍│help-w-okada. This #✨│ai-help is about RVC.
oh so the other one is for the client right?
RVC is the audio conversion program. W-Okada is the realtime voice changer program. The only W-Okada related to RVC is W-Okada uses RVC voice model to interence, not the code and GUI themselves huh.
ok thanks!
You're welcome. 
how do i make it so i can hear myself?
i put my input as my microphone and output as the vb cable audio input
For W-Okada the voice changer, go to #🔍│help-w-okada. This #✨│ai-help is about RVC.
Does anyone had this problem?
does anybody have the link to download the voice changer ?
-realtime
Interaction has expired, use the command again for a new interaction.
First link.
Light blue and light purple named users be asking in wrong channel lol. 
i dont see it
If the voice model doesn't exist, you can request someone to do that at #1159289738314919936.
there are cheaper renting alternatives like runpod, vast.ai, etc
guys windows defender is saying no for rvc gui what do i do
-gui
RVC-GUI in 2025? Wild.
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- FaceFusion UI, by Nick088 Google Colab
- FaceFusion NO UI, by Nick088 Google Colab
- EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
Does anyone know what happened to the Ilaria huggingface
Why is it no longer on the google collab?
use it on huggingface
is it used online again?
what?
sorry, my apologies I think I understand now
The google colab is broken
The huggingface space is better
I managed to find it thank you!
delete it and get applio😭
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- FaceFusion UI, by Nick088 Google Colab
- FaceFusion NO UI, by Nick088 Google Colab
- EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
kaggle moment
si
si
forgot to slice audios?
show preprocess and exrract feature step logs
this?
how big is the batch size?
12
isnt 8 best?
2 gpus, so 4x2 = 8
kaggle is double gpu?
There's T4x2 and P100
gotcha
ah i see
i'm using rvc disconnected the gpu stopped at 435 out of 500 epochs is there anyway i can get my model without retraining it
also is 500 epochs nescecarry for a 14 minute long dataset if i do need to retrain it
cuz i didnt rlly fuck with the settings
did you enable the save option?
I have no idea but i dont think so
I didnt think i'd need to
then you lost everything
Well shit
Question still stands
i mean, yea but you should also look at the tensorboard
I have no idea how the tensorboard works
-tensorboard
Rvc disconnected in 2025 smh
i thought there was a command
how do i reduce delay bruh
ciao
Last update: Dec 24, 2024
ciao
che gpu hai?
wokada? usa #🔍│help-w-okada
rtx 3060 ti
andiamo sull’altro canale
buona
google colab gives max 4 hours, NOT GRANTED, of GPU
Kaggle gives 30 hours weekly granted
Kaggle is better but harder
you have to read the guide, there isn't a right amount of epochs
harder as in ?
needs a phone number, less user friendly
hello someone now a real time voice changer for amd gpu pls ?
RVC is for inference on pre-recorded audios and training
Wokada is for RVC inference in realtime for calls, and there's the deiteris fork for better performance, tell me your GPU name in #🔍│help-w-okada , this is the wrong channel
oups sorry
it's fine
so I'm used with Applio
what are you using now
whats your gpu?
it was named EasyGUI before?
yea a long time ago there was the easygui
I don't have one, I have an old pc rn, I'm running it on the incorporated GPU
then you cant use it locally
-cloud
mh
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- FaceFusion UI, by Nick088 Google Colab
- FaceFusion NO UI, by Nick088 Google Colab
- EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
first one
I don't wanna train models or create
what you wanna do?
AI voice covers, I want to make a producer tag
you can use applio on colab, ilaria rvc on huggingface or weights.gg
it was an old colab, it's still up a newer version btw
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- FaceFusion UI, by Nick088 Google Colab
- FaceFusion NO UI, by Nick088 Google Colab
- EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
no😭😭😭
I thought that bcs you're both Italian
nono we are just friends and hes a lot younger than me
How old is he? 😂
i wont tell unless he does
its easy to make it yourself, sadly i just turned my pc off
I know it's easy, but I wanted something real and a woman's voice
isn't this one for making and training models?
Fml
never used it, and I don't have the patience to learn how to use it
🤷♀️
no prob
no
16 yo bro
it's the easiest between all of them
Really?
yeah, I would suggest it
Okay
how old are u
almost 23
damn
folks
sorry if this comes off as insensitive
but is there any way to make the ai's pronunciation of S sounds less noticeably artificial
wym
robotic or harsh?
uhhhhhhh
if it's robotic try a different model, if it's too harsh you can fix that while mixing the vocals
robotic = to small of a dataset so rvc started overfitting the sibilants
harsh = all the sibilances in the dataset were very harsh so rvc only knows those
yes
sounds fine imo (i havent played tf2 so idk what he normally sounds like)but you can try de-essing the output
de-essing?
T de-ess or RX de-ess
They just makes the esses not as harsh
https://techivation.com/t-de-esser/?srsltid=AfmBOopc5H9unNMz9ZhLZCfqs1CNoaknxdU-VRX99YG8_jUDaI0F2QTX here is T de-ess its free
Are those free pulgins?


AI HUB Docs