#✨│ai-help
Im but some part just left me confused
And then click run all?
wait
first click this
click the "(1):"
an then click the play button that shows up
Ok afterthat continue with the rest?
yeah, then click the "(2):"
two times to click the play button too
and then click the last one
After that wait for the applio link?
Okay
when it looks something like this, its ready
Yes
no problem, try to ask later or wait to someone else that knows about it
Ok
something unexpected always happens when something nearly succeeds
36 epochs
you can do inference while training?
yeah
got my friend to do it
you can stop training and then resume it
and yes, you can while training, but it is too slow
okay okay
Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
by IA Hispano
Google Colab
by Hina
Google Colab
by Eddy
Google Colab
by Eddy
Google Colab
by Deiteris & Hina
Google Colab
by Shiro & Eddy
Google Colab
by Nick088
Google Colab
by Nick088
Google Colab
by Jarredou & Makidanye
Google Colab
can i use mangio and applio at same time?
click on the link
Error
The applio link error
I wish applio google colab is working 😭 kaggle is hard af for me
yeah it's hard, but at least it faster than colab😭
😭Still i prefer gooogle colab
?
Now im stuck with this error link
just locally train it
i know, cause it's easier to use
Im doomed
How
Is llaria rvc zero also not working?
yes only for inference, not training
Ughhh
colab is broken again, the previous uv workaround does not work
Maybe cause I made a copy in my drive
Again right
new fix for Applio colab
!UV_CONSTRAINT= UV_BUILD_CONSTRAINT= UV_PRERELEASE=if-necessary-or-explicit uv pip install -r requirements.txt -q
!UV_CONSTRAINT= UV_BUILD_CONSTRAINT= UV_PRERELEASE=if-necessary-or-explicit uv pip install torch==2.3.1 torchvision==0.18.1 torchaudio==2.3.1 --upgrade --index-url https://download.pytorch.org/whl/cu121 -q
!UV_CONSTRAINT= UV_BUILD_CONSTRAINT= UV_PRERELEASE=if-necessary-or-explicit uv pip install numpy==1.23.5 -q
!UV_CONSTRAINT= UV_BUILD_CONSTRAINT= UV_PRERELEASE=if-necessary-or-explicit uv pip install gradio==5.23.1
Can you post it in a web browser...
basically add this to all uv install lines
In a separate cell?
no, in the install cell
Why can't you edit the Colabs
Separate cell
no
Row by row
the whole cell
```
# @title Install Applio
!sudo apt update
!sudo apt install python3.10
!sudo update-alternatives --install /usr/bin/python3 python3 /usr/bin/python3.10 1
!sudo update-alternatives --set python3 /usr/bin/python3.10
!curl -sS https://bootstrap.pypa.io/get-pip.py | python3
import sys
sys.path.append('/usr/local/lib/python3.10/dist-packages')
import codecs
from IPython.display import clear_output
rot_47 = lambda encoded_text: "".join(
    [
        (
            chr(
                (ord(c) - (ord("a") if c.islower() else ord("A")) - 47) % 26
                + (ord("a") if c.islower() else ord("A"))
            )
            if c.isalpha()
            else c
        )
        for c in encoded_text
    ]
)
org_name = rot_47("Vkkgdj")
new_name = rot_47("kmjbmvh_hg")
uioawhd = rot_47(codecs.decode("pbbxa://oqbpcj.kwu/QIPqaxivw/Ixxtqw.oqb", "rot_13"))
uyadwa = codecs.decode("ncc.cl", "rot_13")
A = "/content/" + rot_47("Kikpm.ovm.bu")
!pip install uv pyngrok
!git clone --depth 1 $uioawhd $new_name --branch 3.2.8-bugfix --single-branch
%cd $new_name/
clear_output()
print("Installing requirements...")
!UV_CONSTRAINT= UV_BUILD_CONSTRAINT= UV_PRERELEASE=if-necessary-or-explicit uv pip install -r requirements.txt -q
!UV_CONSTRAINT= UV_BUILD_CONSTRAINT= UV_PRERELEASE=if-necessary-or-explicit uv pip install torch==2.3.1 torchvision==0.18.1 torchaudio==2.3.1 --upgrade --index-url https://download.pytorch.org/whl/cu121 -q
!UV_CONSTRAINT= UV_BUILD_CONSTRAINT= UV_PRERELEASE=if-necessary-or-explicit uv pip install numpy==1.23.5 -q
!UV_CONSTRAINT= UV_BUILD_CONSTRAINT= UV_PRERELEASE=if-necessary-or-explicit uv pip install gradio==5.23.1
clear_output()
print("Finished installing requirements! ")
```
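A note on the `rot_47` helper in that cell, for anyone puzzled by it: despite the name, it shifts letters back by 47 positions modulo 26 and leaves every other character alone. A minimal standalone sketch of the same idea follows; `caesar_shift` and `decode_name` are names made up here for illustration and are not part of the Colab.

```python
import codecs


def caesar_shift(text: str, shift: int) -> str:
    """Shift ASCII letters by `shift` positions, wrapping within the alphabet."""
    out = []
    for c in text:
        if c.isascii() and c.isalpha():
            base = ord("a") if c.islower() else ord("A")
            out.append(chr((ord(c) - base + shift) % 26 + base))
        else:
            # Digits, punctuation and non-ASCII characters pass through unchanged.
            out.append(c)
    return "".join(out)


def decode_name(encoded: str) -> str:
    """Same effect as the cell's rot_47 lambda on plain ASCII text."""
    return caesar_shift(encoded, -47)


print(decode_name("Vkkgdj"))  # Applio
url = decode_name(codecs.decode("pbbxa://oqbpcj.kwu/QIPqaxivw/Ixxtqw.oqb", "rot_13"))
print(url)  # https://github.com/IAHispano/Applio.git
```

So the obfuscated strings only hide the project name and repository URL; they do not change what gets installed.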
for UI colab
similar for noUI
My reaction
Not a separate one?
noobies said the whole cell
The install cell w additional rows
you can either edit the existing cell or make a new one
it is not additional rows
it is just adding parameters for UV
Edit most likely
?
graph at 91 epochs
If I copy this will it be formatted at different rows.
is that from your friend?
no this is mine
but using the same vocals
im the only one training
lyery said avg loss 50 is more accurate than loss g total
aight bet
this one?
yeah, keep training until the graph line flattens out and starts to rise
set the smoothing to 0.6
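For context on what that slider does: a rough sketch, assuming the smoothing value is the weight of a debiased exponential moving average (which is how TensorBoard's smoothing is commonly described). The function name `smooth` and the loss values are made up for illustration.

```python
def smooth(values, weight=0.6):
    """Debiased exponential moving average over a list of loss values."""
    smoothed = []
    last = 0.0
    for step, value in enumerate(values, start=1):
        last = last * weight + (1.0 - weight) * value
        # Divide out the bias introduced by starting the average at zero.
        smoothed.append(last / (1.0 - weight ** step))
    return smoothed


# Hypothetical noisy loss readings, not taken from any real run.
raw = [30.0, 34.0, 27.0, 31.0, 26.0, 29.0]
print([round(v, 2) for v in smooth(raw)])
```

A higher weight gives a flatter line that reacts more slowly, which makes the overall trend easier to read.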
oh yeah thats easier to read
ikr
its at 1.9k now
yeah, just keep training
hopefully this best one cus we have made so many models of it
a lot of em been good
but not all of em are raw vocals
just like you, i've made a bunch of models too, but I never really knew how to train them properly, only recently started learning more about it
this one was from yesterday
just there was bit of noise that fucked it cus of vocal remove
it works, just that ilaria rvc zero is meant only for inference
which model did you use to remove the vocals?
How is it for no UI...
mvsep
bs melformer
i think sum like that
similar change
i recommend you to use melroformer voc_fv4 by gabox, this model can extract fuller vocals than the bs model from mvsep, it's old tho
i also mixed this with raw vocals
In what sense also it's hard to edit w a phone keyboard.
voc_fv4 models can gives you almost like raw quality vocals (imo)
ill try it now rq
https://huggingface.co/spaces/TheStinger/UVR5_UI you can find the model here
oh yeah i normally use melband
my bad i forgot
np, make sure you used the right melband model, because some of them is separating fuller vocals and generating annoying noise
@simple ore I need applio no UI for training
https://colab.research.google.com/drive/1y42NG3StPnbx_BzhCj94CE4aQCJx8gMR @simple ore done, how do I fix applio no UI?
Nite
goodnight
Same replacement without the gradio row?
u think if i open mangio while training on applio it will blow my pc up 😭
Mangio rvc should be banned smh
I hope you were just kidding since mangio is abandoned since 2023
Can you put the links to the Colabs I just fixed somewhere?
Just asking
WHERE DO I FIND MY PTH FILE I GOT THE INDEX NOT THE PTH (applio)
if you save all weights you'll have the small model saved every time d/g saves
its just 1 index file and folders
IDK HOW TO DO THAT
im on 3.2.8
there was a bug that if you chose saving every 100 epoch and max 100 it did not save the model
i dont have that option
you need to save every x epoch
bettt
save only latest saves one copy of D/G
save every weight also saves the small model
you may run out of space as D/G are really big
betttttttttttttt
for Image models, is there currently a workaround for creating copyrighted characters?
My brothers paid gpt won't create me comics based on celebrities and I need them urgently
that's ridiculous, my career kinda depends on this
local install does everything
The collab for RVCAICoverMakerUI isn't working
It's just giving this
Traceback (most recent call last):
  File "/content/main_program/main_program/main.py", line 1, in <module>
    import gradio as gr
ModuleNotFoundError: No module named 'gradio'
@simple ore
which ?
instead of backend container stuff add UV_CONSTRAINT= UV_BUILD_CONSTRAINT= UV_PRERELEASE=if-necessary-or-explicit uv to uv pip install lines
This is inside the applio installation but it still throws an error
I'm going to try
rvc support online?
@title Install Applio
!sudo apt update
!sudo apt install python3.10
!sudo update-alternatives --install /usr/bin/python3 python3 /usr/bin/python3.10 1
!sudo update-alternatives --set python3 /usr/bin/python3.10
!curl -sS https://bootstrap.pypa.io/get-pip.py | python3
import sys
sys.path.append('/usr/local/lib/python3.10/dist-packages')
import codecs
from IPython.display import clear_output
rot_47 = lambda encoded_text: "".join(
[
(
chr(
(ord(c) - (ord("a") if c.islower() else ord("A")) - 47) % 26
+ (ord("a") if c.islower() else ord("A"))
)
if c.isalpha()
else c
)
for c in encoded_text
]
)
org_name = rot_47("Vkkgdj")
new_name = rot_47("kmjbmvh_hg")
uioawhd = rot_47(codecs.decode("pbbxa://oqbpcj.kwu/QIPqaxivw/Ixxtqw.oqb", "rot_13"))
uyadwa = codecs.decode("ncc.cl", "rot_13")
A = "/content/" + rot_47("Kikpm.ovm.bu")
!pip install uv pyngrok
!git clone --depth 1 $uioawhd $new_name
%cd $new_name/
clear_output()
print("Installing requirements...")
UV_CONSTRAINT= UV_BUILD_CONSTRAINT= UV_PRERELEASE=if-necessary-or-explicit !uv pip install -r requirements.txt -q
UV_PRERELEASE=if-necessary-or-explicit !uv pip install torch==2.3.1 torchvision==0.18.1 torchaudio==2.3.1 --upgrade --index-url https://download.pytorch.org/whl/cu121 -q
UV_PRERELEASE=if-necessary-or-explicit !uv pip install numpy==1.23.5 -q
UV_PRERELEASE=if-necessary-or-explicit !uv pip install gradio==5.23.1
clear_output()
print("Finished installing requirements! ")
nope
and it has to be the same "UV_CONSTRAINT= UV_BUILD_CONSTRAINT= UV_PRERELEASE=if-necessary-or-explicit" on all lines
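To make the placement concrete: the variables go after the `!` and before `uv`, on every `uv pip install` line. Here is a small sketch that patches the text of an install cell that way. `patch_install_cell` is a hypothetical helper written for this explanation, not something shipped with the Colab.

```python
UV_PREFIX = "UV_CONSTRAINT= UV_BUILD_CONSTRAINT= UV_PRERELEASE=if-necessary-or-explicit"


def patch_install_cell(cell_text: str) -> str:
    """Prefix every `!uv pip install` line with UV_PREFIX, leaving other lines alone."""
    patched = []
    for line in cell_text.splitlines():
        stripped = line.strip()
        if stripped.startswith("!uv pip install"):
            # Keep the leading "!" first, then the variables, then the uv command.
            line = "!" + UV_PREFIX + " " + stripped[1:]
        patched.append(line)
    return "\n".join(patched)


cell = (
    "!pip install uv pyngrok\n"
    "!uv pip install -r requirements.txt -q\n"
    "!uv pip install numpy==1.23.5 -q"
)
print(patch_install_cell(cell))
```

Lines that already carry the prefix start with `!UV_CONSTRAINT`, so running the helper twice does not add it a second time.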
It worked, thank you very much
@simple ore NotADirectoryError: [WinError 267] The directory name is invalid: 'C:\Users\vedant\Videos\sample.mp3'
end preprocess, for first step of training, the directory is valid
it asks for a folder name, not a file name
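A quick way to check a path before pasting it into the dataset field, as a sketch only: `dataset_folder` is a made-up helper, not part of Applio, and falling back to the parent folder is just one reasonable choice.

```python
import tempfile
from pathlib import Path


def dataset_folder(path_text: str) -> Path:
    """Return a folder usable as a dataset path, or raise with a clear message."""
    path = Path(path_text)
    if path.is_dir():
        return path
    if path.is_file():
        # A single audio file was given; its parent folder is the closest valid answer.
        return path.parent
    raise FileNotFoundError(f"No such file or folder: {path_text}")


# Demo with a throwaway folder containing one empty file.
with tempfile.TemporaryDirectory() as tmp:
    sample = Path(tmp) / "sample.mp3"
    sample.write_bytes(b"")
    print(dataset_folder(str(sample)) == Path(tmp))  # True
```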
im so stupid ty
😭 @simple ore
show the changes
@title Install Applio
!sudo apt update
!sudo apt install python3.10
!sudo update-alternatives --install /usr/bin/python3 python3 /usr/bin/python3.10 1
!sudo update-alternatives --set python3 /usr/bin/python3.10
!curl -sS https://bootstrap.pypa.io/get-pip.py | python3
import sys
sys.path.append('/usr/local/lib/python3.10/dist-packages')
import codecs
from IPython.display import clear_output
rot_47 = lambda encoded_text: "".join(
[
(
chr(
(ord(c) - (ord("a") if c.islower() else ord("A")) - 47) % 26
+ (ord("a") if c.islower() else ord("A"))
)
if c.isalpha()
else c
)
for c in encoded_text
]
)
org_name = rot_47("Vkkgdj")
new_name = rot_47("kmjbmvh_hg")
uioawhd = rot_47(codecs.decode("pbbxa://oqbpcj.kwu/QIPqaxivw/Ixxtqw.oqb", "rot_13"))
uyadwa = codecs.decode("ncc.cl", "rot_13")
A = "/content/" + rot_47("Kikpm.ovm.bu")
!pip install uv pyngrok
!git clone --depth 1 $uioawhd $new_name
%cd $new_name/
clear_output()
print("Installing requirements...")
!UV_CONSTRAINT= UV_BUILD_CONSTRAINT= UV_PRERELEASE=if-necessary-or-explicituv uv pip install -r requirements.txt -q
!UV_CONSTRAINT= UV_BUILD_CONSTRAINT= UV_PRERELEASE=if-necessary-or-explicituv uv pip install torch==2.3.1 torchvision==0.18.1 torchaudio==2.3.1 --upgrade --index-url https://download.pytorch.org/whl/cu121 -q
!UV_CONSTRAINT= UV_BUILD_CONSTRAINT= UV_PRERELEASE=if-necessary-or-explicituv uv pip install numpy==1.23.5 -q
!UV_CONSTRAINT= UV_BUILD_CONSTRAINT= UV_PRERELEASE=if-necessary-or-explicituv uv pip install gradio==5.23.1
clear_output()
print("Finished installing requirements! ")
Finished installing requirements!
still the error appears in start applio
Finally, you are my angel, thank you for your patience
copy-pasting in colab cells is ass
What settings do you recommend for this, after having played the audio?
Or truncated*
Eddycrack864/RVC-AI-Cover-Maker-UI/blob/main/assets/RVCAICoverMakerUI.ipynb
it no longer works, error :
probably the same uv instal issue
is there any channels got removed in the server lately ?
An error occurred extracting the index: need at least one array to concatenate
If you are running this code in a virtual environment, make sure you have enough GPU available to generate the Index file.
im having this issue with applio
never had it before but now its just suddenly popping up
you dataset is wack
lol
too short / slices too big / forgot to slice
yes
so what do i do
really happy to see a conversation where people aren’t always trying to be the responders, but actually being good listeners
when someone asks the same thing over and over, it’s not curiosity, it’s just ego talking
I've got keyboard click in this part what I have to use to get rid of it ?
the overlaps
from the sliced method
no, you need some so it captures the context
ah okay
Did the uh, did the making-models channel get deleted?
it seems like that
Remove that whole piece from your set lul
If you can't get it with declick and stuff without distorting
Soo where do we ask questions about model crafting
Here I guess?
nvm
maybe you can ask here
Here or https://discord.com/channels/1159260121998827560/1192011222023950368 here ig
Imo this change was bad
All my incredibly specific questions about models and the helpful answers to them 
tbh I like this new channel
@brittle wing refrain from posting promotional or sketchy link in this support channel
Apologies. It was a link to an article in LessWrong about a behavior in LLMs that I am curious if anyone has any information about.
🔥
-kaggle
Kaggle is a Cloud (Remote Good PC) Service that offers 30 hours of GPU weekly, but needs a phone number verification
by Vidal
Kaggle
by Hina
Kaggle
by Hina & Deiteris
Kaggle
by Eddy, ArisDev & Nick088
Kaggle
by Eddy
Kaggle
by Shirou & ArisDev
Kaggle
by Shirou
Kaggle
is Applio working for you guys on google colabs today?
I'm getting gradio error
How to use any ai models?
I linked the fixed colabs, even pinged u
Dawg ain't no way I'm linking my phone to nothing. This is whack wtf
you're not forced to use it
Well what do you suggest big pimpin ?
hi , i have a decent voice recording, but its from a phone .. is it possible to run it trough an ai that will make it sound full and like it was on a good mic enhanced with full spectrum frequencies not missing
like its an studio recording
Horizon is a tunnel service
Elaborate the issue
!give-media-perms 1h @balmy blaze
Elaborate more what you want to do, what's your PC GPU, what are you following, what's the issue
You need to use the workaround, check #📰│dev-updates
What rvc is this using? Can anyone tell me?
For realtime voice changing? Or for pre recorded audios? What's your PC GPU?
That seems to be Applio
Realtime
Mobile
thank u
You can't on your phone
Even if you use a cloud service (remote good PC), phones lack the virtual audio cable needed for rerouting the modified voice to other apps
@brittle wing do you have any laptop? That's the bare minimum for realtime voice changing; even if it's not powerful you can use cloud, since laptops can run a virtual audio cable
Else it's impossible
Yes
Sorry if I accidentally interfered into someone's conversation, but I can't understand one thing. Do anyone know how to make some voice model(for example English one) speak other languages without problems?(without, like, a little bit distorted words and etc.)
Tell your PC specs
Your only options are either train it to that language, or lower the index (lower means it uses less trained accent and uses more accent from the input audio)
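Roughly what "lowering the index" means, as a sketch: this assumes the index rate acts as a linear blend weight between features retrieved from the `.index` file and features taken from the input audio. The function name and the plain-list representation are simplifications for illustration.

```python
def blend_features(input_feats, retrieved_feats, index_rate):
    """Linear mix: index_rate=1 uses only retrieved features, 0 only the input's."""
    if not 0.0 <= index_rate <= 1.0:
        raise ValueError("index_rate must be between 0 and 1")
    return [
        index_rate * r + (1.0 - index_rate) * x
        for x, r in zip(input_feats, retrieved_feats)
    ]


# Toy numbers only: a lower rate keeps the result closer to the input audio.
print(blend_features([1.0, 2.0], [3.0, 4.0], 0.5))  # [2.0, 3.0]
```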
Ok, thanks
I am in a meeting right now. I talk everything after 20-30 minutes.
Alright, @ me when you're free
Yh
Yw
i feel like i was pointed with a gun every time you ask this😭
Why 😭
Y'all don't ever check if your PC meets the minimum requirements when you're going to install a program?
It's very needed for AI and Games for example
why not add separate model making channel under this? useful information was shared on the previous one
idk, probably i'm weird
yes, i checked the requirements before installing rvc
Users were confused on which channel to use and also kept asking in multiple channels, so we merged them all in one
All the info should be already in our Docs https://docs.aihub.gg/
Last update: Oct 21, 2024
We ask only because we need to know whether your PC is powerful enough; it has happened many times that users tried to train models on a 10 year old laptop
yeah i understand
i think i can't joking about this, because my english skill will make it so rude or misunderstanding
Nahh don't worry
I know you're joking lol
try AudioSR but still not recommended for making rvc datasets
Resemble enhanced could be good too for speech
Whats the best channel to creat voice model
Wdym channel?
You need to use a program
What's your PC GPU
about w-okada forked and normal w-okada, if i want max extras do i put the forked one to like 5s
and idk what extras does tbh
2.7 s unless for weaker gpus
esp for running games
Extra controls quality a bit
Chunk controls delay
It causes more delay for more resources being used, and also cutoff issues
U can also send a screenshot of your wokada settings
Your settings seem fine, you can also enable force fp32 for more stable models in advanced settings but it can cause more delay
Also you should set the chunk to the perf number + 60
You get the perf number when running the program
it would be fine for 4080
Oh I didn't know that
Cuz like I followed the fork guide
It say xx80 192 chunk
2.7s
I didn't know what chunk was
That's the general suggested one, the chunk depends on which programs u are gonna pair the Wokada with
Also what's force fp32
It uses the fp32 inference mode
For models
Meaning it takes more resources
But it's more stable and slightly better quality
You're welcome
I'm getting ./run-applio.sh: line 3: .venv/bin/activate: No such file or directory when I try to run applio on linux
did you do run-install.sh?
The folder does not contain such a file
did it create .venv (the virtual environment) ?
how to add another voice to the program?
Yes, it did
But it rather already was there
When I unzipped it
well, personally I prefer just making an environment using pyenv
and then installing manually
yeah, not sure what's the deal with the compiled version
add rocm/bin path to Path env variable
install python 3.11.11 using that
then just download the source/clone the repository and pip install -r requirements.txt
Do I need exactly 3.11.11?
preferably, does not work with 3.12+
Ok, got it
if the system uses <3.12 you dont need pyenv
just a manual python -m venv venv will do
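A small check to run before the manual install, based only on the "does not work with 3.12+" remark above. `python_ok` is a made-up name, and no lower version bound is checked because none was given in the chat.

```python
import sys


def python_ok(version=sys.version_info) -> bool:
    """True when the interpreter is older than 3.12, per the advice above."""
    return tuple(version[:2]) < (3, 12)


if python_ok():
    print("OK: create a venv with `python -m venv venv`, then pip install -r requirements.txt")
else:
    print("Python 3.12+ detected: install 3.11 first (for example via pyenv)")
```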
I'm installing python 3.11.11 already
Wait, do I need to clone the repo from hugging face or do it in that compiled folder?
git clone https://github.com/IAHispano/Applio.git
k
Oh, sorry, I thought, HF was also a repos hub
no, we use hf to store big files
Got it
Difference between an "added" .index and "trained" .index?
"added" one is usable, and another is not
Makes sense as "trained" is only 2.3 MB and "added" is 91.4 MB.
yep good Idea
"Using HiFi-GAN vocoder
Converting audio 'assets/audios/audio.wav'"
Why does Applio hang here? It never finishes the conversion? @simple ore
The timer just keeps going with nothing happening (no CPU usage).
If i try without the index file, it works. With the index file, it doesn't work.
remove the whole part or try separating using another effective model like mel roformers or bandit v2
done
Do I need to run install and other files then?
I'm getting this error when I'm running run-applio.sh:
Oh, and also this warning:
A module that was compiled using NumPy 1.x cannot be run in
NumPy 2.2.4 as it may crash. To support both 1.x and 2.x
versions of NumPy, modules must be compiled with NumPy 2.0.
Some module may need to rebuild instead e.g. with 'pybind11>=2.12'.
If you are a user of the module, the easiest solution will be to
downgrade to 'numpy<2' or try to upgrade the affected module.
We expect that some modules will need time to support NumPy 2.
Re: mixing models, is it advisable to mix models from different pretrains together, or is that taboo
Like if i have a KLM and an OGpretrain model, does mixing them together degrade something? Or should I only mix two versions of KLM / two verions of OGpretrain
just try and see if it works good for u
og has less vocal range so idk how it would react being merged with a klm model
Which program? RVC or W-Okada?
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "my rvc is not working".
- Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
run just 'python' for me please
requirement are supposed to install numpy 1.25.3
it fails with numpy >2.0
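To see which NumPy the environment actually picked up, something like the sketch below can help. The function names are made up; `importlib.metadata` is part of the standard library, and the `numpy<2` suggestion simply repeats the warning text quoted above.

```python
def is_numpy_1x(version_text: str) -> bool:
    """True for NumPy 1.x version strings such as '1.23.5'."""
    major = int(version_text.split(".")[0])
    return major < 2


def installed_numpy_version():
    """Return the installed NumPy version string, or None if it is missing."""
    from importlib.metadata import PackageNotFoundError, version

    try:
        return version("numpy")
    except PackageNotFoundError:
        return None


found = installed_numpy_version()
if found is None:
    print("numpy is not installed in this environment")
elif is_numpy_1x(found):
    print(f"numpy {found} is a 1.x release")
else:
    print(f"numpy {found} is 2.x or newer; consider: pip install 'numpy<2'")
```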
got it
if I got dataset 1H length, can I reach something below 27 in training?
total g? it is affected by batch size
larger set requires larger batch, thus the value goes up
What seems to be the problem?
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
use GUIDE
do u know waht the download is thp
it doesnt show where to download
Bro i asks if i want to download nvidia control panel
nvm thx
how tf do i use this
my batch size was 12
I'll check total g when I go home
how can i use it on collab
im having a hard time getting it to work
please help me
@pastel oak is
oh ok
hi
Do you need help
Did you read the whole guide
Also what's your PC GPU and OS
Use what on colab? What do you want to do? What's your PC GPU?
is wokada still working on google collab
i got a i5 1235U and integrated graphics
w okada definitly wont work nativly on my machine
Damn, I mean it does work on CPU but would be mostly shitty performance
Cloud is having A LOT of issues, cloud is unstable ASF compared to local
Hina mod original wokada is broken
yeayh i guessed
Wokada deiteris fork colab works but you get detected on the free tier for using a web UI, which isn't allowed, so it will actually work only if you pay for colab
isnt there anyway around
Kaggle (another cloud service site) Wokada deiteris fork is the only way, which has a harder interface so it's not easy like colab, needs a phone number verification (it's owned by Google), but gives you 30 hours weekly of better GPU compared to colab
There's no other way on cloud for free than this
Else you can pay for colab or buy a good pc
i tried verifying my phone number with collab but for some reason i wont recive the verification code
Not colab, I'm talking about kaggle
Those are 2 different cloud sites
i ment kaggle sorry
Try contacting kaggle support then
There's no other way for free
No other work around
We even tried encrypting the Wokada deiteris fork colab code, it still gets detected no matter what since months
if i try to run it nativly on my muchine will it be atleast usable?
For discord VC?
yeah
You could try, it might be decent
Be aware that it could possibly harm your CPU if done for a long time
Tbh just contact Kaggle support, it's free and they will reply to you
how long will they take to respond
hello. what is that mean ? log interval should be equal to step count until that specific epoch? For example if epoch 150 has 2000 steps, log interval should be edited to 2000 ?
Who knows, maybe a day or some
Let me know for any other issues or questions
You need to train till you reach the number of epochs of the save frequency, get how many steps it took that, delete the saved model, and change the log interval to the steps amount
in this situation, log interval should be 80 ? epoch save frequency is 10
You should have stopped training long time ago, you had to stop it after going over 10 like at epoch 11 when the epoch 10 was saved
Now you are at epoch 310, so you kinda wasted time
nah that's ok, each epoch tooks only 8 seconds
Uhh what is your batch size and dataset size
8 batch size and 6 minutes dataset.
Since every 10 epoch takes 80 steps, log interval should be 80 ? Am I right?
a single epoch should be 100+ steps if everything is right.. for a decently good dataset
<10 steps/epoch is atrocity
graph synching only applies to old applio and mainline rvc
no longer needed as it was dumbest idea to begin with
applio now logs every 50 steps for averages and at the end of every epoch
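The step arithmetic from this exchange, as a sketch. It assumes one step per batch and that the last partial batch is kept; whether the trainer drops it is not stated in the chat. The segment count is hypothetical, picked only so the result matches the 8 steps per epoch mentioned above.

```python
import math


def steps_per_epoch(num_segments: int, batch_size: int) -> int:
    """Optimizer steps in one epoch, assuming the last partial batch is kept."""
    return math.ceil(num_segments / batch_size)


def steps_until(epoch: int, num_segments: int, batch_size: int) -> int:
    """Total steps completed at the end of the given epoch."""
    return epoch * steps_per_epoch(num_segments, batch_size)


# Hypothetical: 64 sliced segments with batch size 8.
print(steps_per_epoch(64, 8))  # 8
print(steps_until(10, 64, 8))  # 80
```

With numbers like these, a bigger dataset or a smaller batch size is what raises the steps per epoch toward the 100+ suggested above.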
Can anyone tell me how to do inference on a Mac locally?
Replay. That is what I use on my Mac.
Thanks. I looked at that, but I couldn't find where to add the .index file.
I zip my models when I add them to Replay.
You add them under Model +
Yes, I found that but it only asks for the .pth.
Do you have the latest version?
Yes. The old one used to acknowledge that an index was included, but not this one.
It works without the index.
Yeah, but it doesn't sound right because you need the index to do an accurate conversion.
Index is for the accent.
keep getting this when trying to use the google colab
$ python
Python 3.11.11 (main, Apr 11 2025, 16:11:07) [GCC 14.2.1 20250207] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>>
today's colab fix is to add --prerelease if-necessary-or-explicit to the install cell's uv calls
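An alternative to editing every line, offered as an untested sketch: set the equivalent environment variable once at the top of the install cell. This assumes uv reads `UV_PRERELEASE` as the counterpart of `--prerelease` (the earlier workaround in this channel relies on the same variable) and that `!` commands in the notebook inherit the kernel's environment. `enable_uv_prerelease` is a made-up helper name.

```python
import os


def enable_uv_prerelease(env=None):
    """Set UV_PRERELEASE so later `!uv ...` calls in the notebook inherit it."""
    env = os.environ if env is None else env
    env["UV_PRERELEASE"] = "if-necessary-or-explicit"
    return env


enable_uv_prerelease()
print(os.environ["UV_PRERELEASE"])
```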
tried that and now its just been stuck on installing requirements for like 5 minutes
oh wait ignore me it worked
big thank
why does start http on my vc client just instantly close and nothing opens up
how can i get less delay in rvc
hey anyone know any tools similar to VASA-1 but not text-to-speech?
something where you could use your own face to animate an image of another face with ai
is there any way i can increase the quality/ realism of the The W-okada Voice changer, my current settings are 5s extra and 130ms. (i dont mind if it takes a little longer, i just want a better quality)
getting a better model
real quick, is it just me or does realtime just is not good enough for using it passively on lets say, a game match?
i dont think it has to do with my pc specs, it might have to do with my mic quality i dunno
but like, voice will randomly go super robotic out of nowhere
Both Discord as an app program and web browser would basically work the same. The difference of audio depends on your microphone, a version of W-Okada you're using, and the settings of it.
Are you trying to use the original version of W-Okada? This one is too outdated. There's a better one available.
There's no audio delay in RVC program, unless you mean by a realtime voice changer, which is W-Okada.
Are you using fork W-Okada on a cloud service like Colab and Kaggle? Because typical extra number setting isn't supposed to be too high if running locally.
yes w okada
Which W-Okada version are you using? And what is your PC GPU?
rtx 4060 laptop gpu and latest cuda version
Audio sounding robotic can happen when you didn't set chunk and extra settings properly according to your GPU.
The version number of W-Okada you're using, something like 1.x.x.x or bXXX. A "latest CUDA version" doesn't mean it.
Download and use this better W-Okada instead. https://rentry.co/ForkVoiceChangerGuide#download-nvidia-on-windows
You're welcome. Make sure you read everything on the guide I sent to you, especially NVIDIA parts. 
oke
when i launch it from Windows it opens up a browser page instead of an app
and in that page i cant choose input and output device
they dont show up
nvm that
Make sure you have given audio permission to the tab in your browser.

wheres the fork ai voice changer?

does anyone here know how you could make okada work with fivem?
If its a game or anything, then get a virtual cable, all explained on the guide
https://rentry.co/ForkVoiceChangerGuide
i have that, but when i get into fivem it's messed up, every other game works fine tho im pretty sure
Define "messed up"
like um, you've looked at my forum on ai support right? i listed what happens in there
hey i been using the changer but when i listen back i notice it picks up like crackling noise and i tried it with another mic, anyways to fix that?
how to know if the w okada is safe
Is this a good spectrogram? I'm not audio engineer guy
https://i.imgur.com/qRT8MvS.png
ye thats normal. the program is safe since its open source and its been checked out by experts
we wouldnt recommend a program to 500k users in the server if we didnt think its safe
all the developers are here
is there any way to use it in the app
don't use it in the app, just use it in your browser
but wont it lag more
the great majority of AI programs use a web user interface, original wokada did too, just that it opened it's own window
no, or do you have around an extremely shit ton of tabs opened in your browser?
yeah
like 20 rn
it doesn't open in it's own window also to solve some performance issues
ohh
what browser are you using btw?
opera gx
that browser could lead to trouble, many users have reported issues with it when using wokada
also, it eats even more ram for it's fancy effects
which is why we don't suggest it
o
all those fancy things eat performance ofcourse
what browser is good
usually chrome or firefox don't give issues with it
you could still use operagx, but just be aware of more ram being eaten and possible issues
ok ill use chrome
also why does my voices not sound realistic
show your wokada settings
it sounds realistic for like a second but then it goes back to sounding like a voicemod voicechanger
!give-media-perms 1h @tribal pivot
hold on wait iam reinstall cuz avast got mad at it and i cant remove it from the thing
ima
also, the program is open source, the entirety of the code is public to see at https://github.com/deiteris/voice-changer , we even use the programs ourselves
I'm guessing you don't know coding, but just letting you know
yeah i dont😭
yeah anti virus sometimes could have false positives, it's normal, it's better you make an exception for the program folder
yeah especially avast
TL;DR: the code is public, meaning we can check all the lines and all the things the program does, and we wouldn't suggest or use programs that we know have malware
that's what open source is, public code
let me know when you're done and got in wokada
alr
show the command line window
and also show the wokada settings
what's your pc gpu and also show the wokada settings
f0: rmvpe without onnx
extra: 2.7
be sure windows and your gpu drivers are up to date
my drivers are up to date
what about your windows version?
windows 10 and updated my windows this morning
could you try to:
- download vac lite if you haven't (3rd step of the guide)
- restart your pc and the program
- set the output in wokada to line 1
ok hold on
itt works now ty :D
yw, about the chunk:
it controls the delay, it's better you find your own best chunk by closing all useless programs in background, leaving only wokada, the browser and the program you want to use open, click start, and check the perf value at the top left of wokada, then set the chunk to that value + 60 (or generally just a bit higher)
also, there is a force fp32 setting in the advanced settings that will make your model a bit higher quality and more stable, at the cost of some delay though
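A minimal sketch of the chunk rule of thumb above (chunk = perf value + 60). The function name and the `margin` parameter are made up for illustration, and it assumes perf and chunk are shown in the same unit in the wokada UI. It is not part of wokada itself.

```python
def suggest_chunk(perf: float, margin: float = 60) -> float:
    """Suggest a chunk value from the measured perf value.

    Hypothetical helper: it only encodes the "perf + 60" rule of thumb
    from the message above, assuming both values use the same unit.
    """
    if perf <= 0:
        raise ValueError("perf must be a positive measurement")
    return perf + margin


# Example: a perf reading of 120 suggests a chunk of about 180.
print(suggest_chunk(120))
```

If the audio still stutters at that value, raising the margin a bit is the same idea as "generally just a bit higher".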
thank you :D
you're welcome, for any future issues ask there or make your own post #1192011222023950368
ok :D
any one know of any good ai software's? like am using Okada, but the latency is so unbearable long
guys how do i use model without WebUI in python or js?
and i just can't find config for shape
be sure to be using wokada deiteris fork and not youtube tuts, video tutorials are all old
share a screenshot of your wokada settings
!give-media-perms 1h @kindred pine
so use rvc for pre-recorded audios in python/js?
i keep getting those errors
`RuntimeError: mat1 and mat2 shapes cannot be multiplied (80x77 and 768x192)`
maybe try:
- https://pypi.org/project/infer-rvc-python/#history
- https://github.com/blaisewf/rvc-cli
- use as an api button on the bottom of the web ui
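About the shape error above: a matrix product only works when the number of columns of the first matrix equals the number of rows of the second. Here the input is 80x77 and the layer weight is 768x192, so 77 does not match 768. A plain Python sketch of that check (the function name is made up, and it does not use PyTorch or any RVC code):

```python
def matmul_shape(mat1, mat2):
    """Return the shape of mat1 @ mat2 for 2-D shapes given as (rows, cols).

    Raises ValueError when the inner dimensions differ, which is the
    condition behind the RuntimeError quoted above.
    """
    rows1, cols1 = mat1
    rows2, cols2 = mat2
    if cols1 != rows2:
        raise ValueError(
            f"mat1 and mat2 shapes cannot be multiplied "
            f"({rows1}x{cols1} and {rows2}x{cols2})"
        )
    return (rows1, cols2)


# The layer expects 768 values per row, so an 80x768 input would be accepted.
print(matmul_shape((80, 768), (768, 192)))

# The shapes from the error message fail the check.
try:
    matmul_shape((80, 77), (768, 192))
except ValueError as err:
    print(err)
```

So the tensor reaching that layer probably has the wrong size or layout (77 instead of 768 in the last dimension). Which input that is depends on the script being used, so this is only a hint on where to look.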
damn that's ancient
XD
you deffo used a youtube tutorial
the one you got has performance issues, it's an old version of the original wokada
plus vb cable gives random issues on windows
delete everything you got off youtube
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
u deffo did since the settings were even wrong too
only written guides are updated
so its not the right software?
exactly, the one you got is an old version of the original wokada
Wokada has 2 main versions:
- Original made by Wok
- Deiteris fork (modified version) made by Deiteris
each version has its own updates
the latest deiteris fork is better than the one you're using
possible to send me the link to the actuall software?
i gave you the guide
^
the 1st link
you have to read the guide to understand how to download it and how to use it
oh ok ok, am high af so kinda just spacing out XD, thx ill be reading the guid then
alright, let me know
I'm stuck because a voice is downloading but it's making me spit because it's bugged help me please..
elaborate more
- what's your pc gpu
- what tutorial link are you using
- a screenshot of the program settings
- what you want to do
- what model link did you use
be sure to not use youtube tutorials
Ok
video tutorials are all old, only updated ones are the written ones
you need to provide the info I mentioned for getting help
I have photos of what it does to me
you need to provide all the information I asked for
!give-media-perms 1h @pure gust
be sure to provide everything I said here
@pure gust Don't post in #🏙│ai-images
Ok
that's not the right channel for doing that, I deleted your messages
this is the right support channel
Ok
don't use voice.ai, it's paywalled and sucks
you still need to provide all the info
^
reply to all those questions
if you don't, we can't help without any info, AI isn't a program that can run locally on a 10 year old pc, we need to check if your pc is good enough
also be sure to not follow video tutorials
you have image permissions here, use this channel not #🏙│ai-images for help
the processor is an AMD Ryzen 5 3500U with Radeon Vega Mobile Gfx
what is a tutorial uh ..
that's the processor, aka CPU, Central Processing Unit
I asked for your PC GPU, does your pc have a dedicated gpu?
You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
the gpu, graphics processing unit, is very important for intensive tasks like graphic, gaming, and AI
AMD Radeon(TM) Vega 8 Graphics
that's integrated graphics, which is weak and not good for intensive tasks like AI
are there any other GPUs?
Uh...
if not, your pc is cooked since it's too weak to run ai locally
locally means that the program will run on your hardware
cloud means it will run on a remote good pc
it's better you use cloud
the only working cloud method is Wokada Deiteris Fork
wokada is a program designed to run RVC models (speech to speech) in realtime for calls/games, and deiteris fork is a modified version which is better
kaggle is a cloud provider by google, it gives you 30 hours weekly of free gpu, but it's hard to use and needs a phone number
you don't have many other choices than this, voice.ai makes you pay and also always stays running on your pc to use its power for their own services
I see
follow that thing I told you and let me know
I succeeded I think
are there any issues
u can also show a screenshot of the wokada so I can suggest you settings
Actually no, there are mistakes uh
elaborate the issues
!give-media-perms 1h @pure gust
you can also send screenshots
I managed to do it but it gives errors at the end because I don't really know how to do it
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
actually it works omg
that happened because you probably ran it twice, opening multiple tunnels
if u want I can check your settings
do you need any help?
Mmh
probably
all help channels got merged into one
so it's easier for users to find help, and not to post the same question in 6 different channels
help bro
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "my rvc is not working".
- Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
please elaborate
yeah, also you need a virtual audio cable to make the wokada work in your discord vc/game
follow only this part of this guide https://rentry.co/forkvoicechangerguide#virtual-audio-cable for getting vac lite
i need a good free ai to create animated videos for tiktok with my script
Ok
also after that, make sure to set your wokada output to line 1 and the other program (like discord vc) input as line 1
you can also send a screenshot of your wokada settings
so, text to video?
text/image to video AIs:
- Locally (runs on ur pc):
- pyramid flow (Image/Text to Video)
- cogvideox 1.5 5b: Image to Video, Text to Video
- Cloud (remote good pc, running on an online website for example, easier to setup):
- Weights.gg
- pyramid flow (Image/Text to Video) (HuggingFace Space)
- OpenAI Sora (paid only, in some countries)
- lumalabs
- Hailuo AI
thhank you bro i will try them
Help channels for RVC, W-Okada, AI image and others all merged into one, but there's also #1192011222023950368.
Ohhhhhhhhhhhhhhh, btw is the colab version abandoned now or just stored somewhere
last time I used it, it was broken
It's good ?
what colab version? what link are you talking about? what you want to do? what's your pc gpu?
set your usual devices as default ones
what version i download on hugging face can someone send me link
Ok
elaborate
what do you want to do?
what's your pc gpu?
3060 rtx and intel 7
there's thousands of AI programs and models, with different versions for each gpu and OS
nice but what do you want to do?
The green tick is supposed to be your main speaker/microphone.
can you send me download link for the ai voicechanger
realtime for calls/games right?
if so, wokada deiteris fork
yes
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
apri it bro thank you
yw, let me know for any issues
Intel 7? Is it the Intel Core CPU generation or Intel Core Ultra? 
intel(R) Core(TM) i7-10700F CPU @ 2.90GHz 2.90 GHz
my pcs in shambles need to get a new one fr
That Intel CPU is at least decent enough to run AI.
nothing works bro and the most are paid
elaborate more the issue, what doesn't work? what did you try? what's your pc gpu?
also, if you're using cloud, you can't expect free service 24/7, those run on gpus that are extremely expensive
i paste my script, that should create a 1minute tiktok but i only get 1 second clips or videos with stock images
Then there's nothing you can do if you think paid services are bad and you don't have a great PC with great GPU to run AI video generator locally. 
my pc is not a problem
those are tools for text/image to video, not auto tiktok generators with family guy on top and a random subway surfer gameplay below if that's what you're looking for ?
i just have no money to buy ai
Still.
thats not what i want either
can you show an example then?
i dm you my script ok?

it's better we keep things here
you can just show an example of a video that gave you inspiration, or one you want to do similarly
Do not direct message someone without their consent.
Crazy.
can't you just explain what you want exactly?
like an ai that automatically generates the video you want, edits it, adds subtitles and automatically posts it for content farm ?
At least just say some keywords of it, it ain't that hard.
It ain't hard to tell.
english is bad cant explain
No shit.
TUNG TUNG TUNG SAHOOR!!!
That is what I can think of for a video, according to what you say.
tell me what ai to use please
If you don't explain some more, like voice acting, TTS or image/video generator, then I don't know what else you're looking for.
this animation style is fine
bro i just cant talk in english
this gay ass comunity all sitting on dick
no one with real brain
you can use chatgpt to translate your message, tell gpt to fix the grammar and make it sound natural
that's what i always do
true but this crazy i just improve like this
If you can't explain things right now, just get out from here for a while and come back when you know you can. 
the problem is you lol, don't blame anyone here
i did explain but no one get it bruh like is it hard to understand i just need invideo for free
i dont even know good paid ai
No one can read what's going on your mind, so you'd have to be more precise about what you're gonna do I guess.
this shit crazy yall are maxed out nerds
Don't try to play a victim like that. 
thank bro i will try
On Weights, there's a feature that you can generate video there, but you'll have to pay for premium to use it.
https://youtu.be/c38vtLw1nSk?si=cG2s9-m07_k7Mysy here's the tutorial, idk if this tutorial is outdated or not for now
Learn how to use Runway AI for image, video and character generation (Gen-3, Gen-2, Gen-1). Runway ML tutorial, Runway tutorial for beginners, how to use Runway ML.
There...
bro nice ty
There's Luma Dream Machine. I did these videos last year. While you can use the service for free for a limited period, you may often encounter the long queue number.
Only if I could explain at least understandable to non-English speakers while my English is too good though.
You can generate some videos there for free, but again you will encounter queued task there if you have free plan. That's pretty much it. For subscription plans, you will have to go read everything by yourself. Same goes to some other websites.
it is more demanding and there's nothing more affordable than queue for several hours
I don't know how would you even understand this, but this is my best I could explain.
@redwonder3 who are you 🤣
is it possible to use one index file on different models , on voice changer?
Well, normally it's not necessary to use index on realtime, but i think you only should use it on it's respective model.
I'm looking for an AI model with high accuracy for STT, TTS, emotion detection, video analysis and generating reports. From my research I was not able to find any multimodal model that has all those things, only separate ones like Deepgram, Whisper, Hume, proctoring tools and so on
For a voice model, look him up in #1175430844685484042. 
It's possible to use index from one voice model with another, I tried it on RVC and Applio, but as an experiment. While using index on W-Okada is possible, the program gonna use more resource than setting it to none.
which is best between 50 & 100 epochs
I am training a voice model for singing with a 1:10 hours long high quality dataset and TITAN pretrains. Should I stop training or train further? It has already trained to 650 epochs. In tests by ear I can't hear a significant difference between 330 and 650 epochs
well, I can tell you that if you take klmn 4.9 and cut your dataset down somewhat, you may get a better result than with titan
titan is a "finetuned" pretrain, not sure how big the variety is there, but it is not singing pretrain
I used the single WAV file by joining all samples. Normalized and cleaned. I suppose the Preprocess step does it, doesn't it? What is "klmn"? Which pretrain can be ideal? I want at least 40k sample rate
@low shard
whats ancient? LMAO
your program
oH
yeah
If I want to use w-okada on server, is there anything I can use to block out the noise from me typing from being picked up?
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
Me or him?
you
am I gonna have to read the whole thing?
yup
it's not that long
you have to follow only the parts related to windows nvidia, since you got an nvidia gpu and your OS is probably windows
I don't think so
maybe @pastel oak knows another way, but server shouldn't be able to use sup2, sup1, nor echo suppression
I've seen some people say nvidia broadcast when searching here but I'm not sure how to go about setting it up, trying to figure it out
yeah server can't use any of those
I already have nvidia installed its essential
Is it trained with Russian vocals? And how about cutting the dataset? How should I cut the dataset? Should I use 48k sample rate?
with windows nvidia, I meant the wokada deiteris fork windows nvidia version
everything you got off youtube tut was really old
oh you meant that
and if you downloaded vb cable, uninstall it
it gives random issues on windows
youtube tutorials get so easily outdated
idk what that is but I dont think that I installed it thankfully
which is why we usually don't make them, written guides are way easier to keep updated
did the yt tut make you install any other thing along the wokada
Ah I figured it out for server noise reduction, you put your regular mic input into broadcast, enable the sound settings then set the input for okada to nvidia broadcast
I dont think that I have this installed do I need this?
do you have an nvidia gpu?
yeah
then no
no you don't, you need only to install the wokada deiteris fork nvidia windows version
use the nvidia
perfect
ofc not the 5000 serie, since you got a 3000 rtx serie
also delete the older version of the original wokada that you got
yeah couldnt find the link to that
and also make sure you don't skip the 3rd step
just delete the folder
oh alright bet
Directly above the picture you sent is a link for the 5000 series nvidia, and above that is for the normal nvidia gpus
Can smb help me with Voice changer?
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "my rvc is not working".
- Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
please elaborate
- what's your pc gpu?
- what issue are you facing?
- what tutorial link did you use?
- show a screenshot of your program
found it thanks
!give-media-perms 1h @patent trellis
- [Error] Giving <@&1313370885650124821> to @patent trellis for 1h
oops wrong user lmao
!give-media-perms 1h @prisma dove
also, be sure to never follow youtube tutorials for wokada, they are old
To be more accurate,
I have laptop with rtx3050 , and Iris xe
The issue was that I chose the RTX in the voice changer, but it's not being used at all.
Tutorial was my frendo
this is the third step correct?
yes, get the vac lite
this guide is a bit confusing sorry
show a screenshot of your wokada
also thanks for the info
yeah your friendo is cooked too then
that is an old version of the original wokada
it's like over a year old
but he has nothing problemo at all
your friend probably used a youtube tutorial
the issue is your settings, but it's not worth to fix it in that program, because the version you and your friend got have worse performance and worse quality
both of yall should first uninstall this version
then fix the settings
oke
@prisma dove please delete the version you got, then read the guide for the wokada deiteris fork https://rentry.co/forkvoicechangerguide
also did your friend make you install something called like vb audio cable?
this isn't rvc, this is wokada,
RVC and Wokada are 2 different programs
RVC = Retrieval-based Voice Conversion, the best speech-to-speech AI models (on v2). It runs inference (uses models) on pre-recorded audio (ai covers) and trains (makes) models
Wokada = uses RVC for realtime inference
You need wokada since you will use it in vc
Wokada has 2 main versions:
- Original made by Wok
- Deiteris fork (modified version) made by Deiteris
each version has its own updates
the version you're using is an old one of the original wokada
which has performance and quality issues, so don't use it
I already gave you the link for the wokada deiteris fork tutorial, https://rentry.co/forkvoicechangerguide
you need to read it
there's no updated video tutorial
oke
why that reaction? written guides don't get outdated like video tutorials do
we can freely keep written guides up to date everytime
unlike video tutorials that are harder to keep up to date since they require more time
I hope you understand
ofc, just cause im lazy asf
for any issues or question you can ofcourse ask here
ty for support
we know that users find video tutorials easier, but making a video tutorial that would comprehend every single os with every single gpu brand would be difficult, along with the fact that AI progresses at a sonic speed
you're welcome, we are here for that
nono, i think that's tutorial link is more easier than i uses later for another programs. And reason is more correct that could be
Goodluck and let me know how it goes for you then
are you facing any issues?
rvc mangio is ancient, that's what you're doing wrong
ahh alright
mangio rvc fork has been abandoned since 2023
NEVER follow youtube tutorials for rvc
lmao sorry
nah don't say sorry lol
what's your pc gpu? and what do you want to do?
got a 4070 laptop gpu
and im just using voice models to sing songs
like homer simpson sings not like us
etc.
thank you!
cloud (remote good pc) also exists, which is easier but there are free tier limits (like time) and are less reliable, if you want I could tell you them but it's not worth it when your laptop is good for local inference (inference is using the models basically)
you're welcome, let me know for any issues
i will
no, there's no russian vocal, but there are high pitch samples that help with high pitch inference on a finetuned model. one big file is fine, don't use process effects in applio if you already have it normalized and cleaned, do audio cutting in preprocess
OK. Thank you very much 🙏🏻. How about 40k and 48k . Should I try 48k ?
depends on the dataset, check with spek
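A rough sketch of why it depends on the dataset: a sample rate can only represent frequencies up to half of it (the Nyquist limit), so 40k covers up to 20 kHz and 48k up to 24 kHz. If the spectrogram in spek shows the audio cutting off below 20 kHz, 48k has nothing extra to capture. The function names and the decision rule below are assumptions for illustration, not an official Applio recommendation.

```python
def max_frequency_hz(sample_rate_hz: int) -> float:
    """Nyquist limit: highest frequency a sample rate can represent."""
    return sample_rate_hz / 2


def suggest_sample_rate(dataset_cutoff_hz: float) -> int:
    """Hypothetical rule of thumb: pick 48k only if the dataset has
    content above what 40k can represent (20 kHz)."""
    if dataset_cutoff_hz > max_frequency_hz(40_000):
        return 48_000
    return 40_000


# A dataset whose spectrogram cuts off around 16 kHz gains nothing from 48k.
print(suggest_sample_rate(16_000))
# A dataset with content up to 22 kHz would lose the top end at 40k.
print(suggest_sample_rate(22_000))
```

The cutoff value is whatever you read off the spectrogram in spek, so treat the result as a starting point and compare by ear.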
Server mode cant use those, need 3rd party program
I figured out how to use broadcast with server mode 🙂
Hello, I'm currently using an old version of w-okada's mainline RVC for real-time voice changing. It's been a few years since I checked the space, and it seems like there have been many advancements. Are the w-okada mainline RVC and the Deiteris fork currently the only/best real-time voice changers, or are there other/better options?
i keep getting an error now when i hit convert i am using applio
I couldnt find in guide the settings blank??
Is there easier use client with low-speed internet, or server?
whenever i try using server it doesn't work but when i go to client it does work, can anyone help me?
show a screenshot
!give-media-perms 1h @echo turtle
wdym?
what browser are you using and show a screenshot
operagx
show a screenshot and elaborate the issue
!give-media-perms 1h @nocturne stone
show a screenshot of the CMD, and also a link to the model you're using
so when i use server my voice doesn't work, and when i use client it does with the same output device and input.
operagx is known for giving issues with wokada, and it also eats more ram because of its fancy effects, which costs performance
we suggest chrome or firefox
ok
also, set
f0: rmvpe without onnx
extra: 1.0
did you download vac lite?
no