#✨│ai-help
1 messages · Page 227 of 1
yea 0.2
ah ok, set it to 0.1s
alr
longer crossfade length gives results comparable to rvc but increases delay
I see that they migrated to web browser now
tho it gives error when trying to set the name of the model. also it only accepts RVC? can't upload GPT sovits models then?
I was eating lmao wtf happened
So I don't remember what batch size and other settings I used to make a particular model, is any of that info stored in the logs folder?
tho it gives error when trying to set the name of the model
You should not rename the model in the UI, it's bugged
you should rename the model files then re-upload the slot
can't upload GPT sovits models then?
RVC is Speech To Speech
GPT-So-VITS is only Text To Speech
Those are 2 different AI types
what were you eating?
chaos
smt similar to focaccia
ooh okie, gotcha. Thanks for explanations ^^
do you need any other help ?
As of right now as far as I can remember, not yet but i will come with any questions/problems as I explore this web version if thats fine
got it working again
alright
guess it doesn't have as well an app version that can be added to the system tray right?
it's still a program running locally on your pc hardware
both original wokada and wokada deiteris fork use a Web User Interface coded in Javascript & TypeScript
Web User Interfaces are used by the great majority of AI programs because:
- easy to edit
- can be used for cloud
the original wokada just made it's own 'browser' to open the local hosted url webui, and iirc that was removed in the deiteris fork for performance issues
I see. Thankyu ^^
You're welcome
why does my voice sound like a broken Roberter
I don't understand it, I have everything like in a Yt video
Yt video
That's one of the issues, all video tutorials contain outdated info
AI progresses at sonic speed
you can't expect an over year old video tutorial to set you up
soo that video was yesterday
share a link to the video you used so i can check if they used updated software
and also tell what's your pc gpu
Is it because of the voice model?
an Gefroce 4090
great, share a link to the video you used
share a screenshot of the program you used
it could be both a bad model and outdated software/ wrong settings
!give-media-perms 1h @karmic egret
Should I send a screenshot of the voice changer?
an entire screenshot of it yes
or the youtube tutorial since you did the same thing, but you said u forgot which
yep, awful settings and extremely outdated software lmfao
what video did you even watch
idk
that's an old version of original wokada
a yt video haha
okay
and also the settings were pretty fucked, crepe tiny sucks in quality
great
so what is the new vc?
wokada deiteris fork
there's 2 main versions of wokada:
- original made by wok
- deiteris fork (modified version) made by deiteris
the one you're using is the old version of original wokada which has worse performance and quality
read the wokada deiteris fork guide: https://rentry.co/forkvoicechangerguide
all video tutorials are outdated, only the written ones are up to date
Another supporter sent this to me too, I have no idea where the VC download link is
you have to read the guide, it's all in it
it's exactly in the windows nvidia part https://rentry.co/forkvoicechangerguide#download-nvidia-on-windows which redirects you to https://huggingface.co/Shadicti/deiteris-Fork/blob/main/voice-changer-windows-nvidia-b2332.zip
it's crucial to read all the guide and not miss steps
also be sure to not skip step 3 and get vac lite
then do the other steps
you're welcome and let me know
you should just set your usual microphone as the default one
seems like you did that already
alr thanks Download VoiceChanger now
I am gong to change my Asian voice to European voice. Which model is fast and efficient?
it's nrcessarily to uninstall virtual cable that i installed some time ago or can install vc lite along?
what is that 😦
Are you all using realtime voice changer client?
let me know
idk what to do know
not necessary but would be better you uninstall vb audio cable
while i do not know what are those scripts are exactly, u don;''t need to run them as the app is inside the folder
okayy ty
called MMVCServerSIO.exe
The force gpu clocks, Forces ur gpu to be constantly active, this helps if ur gpu goes idle state
it's usually not needed tho
you can just go inside the folder and run the .exe
okay thanks
shortly:
Server = can have less delay with wasapi and asio
Client = can use noise suppression and echo reduction
that's the difference
why im on an website
thats the interface
it's basically the same as old okada
just better in performance and easier to edit
it's not a website, it stills runs locally on your pc gpu, just it's a web user interface
please check https://rentry.co/forkvoicechangerguide#why-does-it-run-in-a-browser-and-not-its-own-window for futher details
broken in what way? did u try the crackle fix ?
yue can you go in a call with me for an sec to see if it sounds weird to you too
sure, can try
im a cracky roberter
wait
hello?
wait
was sounding fine i believe, i mean no crackle or robotic voice
i cant talk discord dont accsp my cable
okay thanks
np
in discord, set input as line 1 and output as headphones
and ofc remember to set the input as microphone and output as line 1 in wokada
do u want to share a screenshot of ur settings?
i can help u set them up
I am getting this error: RuntimeError: PytorchStreamReader failed reading zip archive: failed finding central directory in Applio Google Collab. Any suggestions?
Elaborate:
- your PC GPU
- the google colab link
- what you want to do and did
- the model download link
what do I have to do to change my tone of voice?
change the pitch slider
the robot's voice is not natural.
- Not PC 2. Google Collab Link from AI Hub Guide 3. What you did? Nothing. 4. What am I doing? Waiting. 5. What did I do? Pressed Start Training. 6. What was I doing before that? Placed KLM pretrain from AI Hub Link 7. The model download link from KLM Hifigan 32 K available through AI Hub.
What I think was going on? Maybe that KLM download is corrupted.
What do I do next?
those settings are wrong
gpu: rtx 4060
f0: rmvpe without onnx
extra: 2.7
about tone, change the pitch, it depends by your voice and the model
did you download vac lite?
Not PC
Are you on mobile? Was asking if you're on a desktop/laptop pc since you could have a good one and do it locally instead
Google Collab Link from AI Hub Guide
yeah but there's applio no ui and ui collab
The model download link from KLM Hifigan 32 K available through AI Hub.
Which?
I don't understand why the old one is worse, and I can't find anything about it.
The old one has worse performance, the newer one, deiteris fork, has better optimization and also settings that can help the quality
for example, if you turn force fp32 mode in advanced settings, you will have better quality and a more stable model at the cost of a bit of performance, as of my test on my 4060 ti 16gb, turning that on gave a 20ms delay but it did help with quality
yes
!give-media-perms 1h @karmic egret
@waxen pike try the settings I told you, and be sure you downloaded vac lite
I don't know how to set everything up correctly. When I opened it, it was like this.
scroll down
Applio UI Collab and this is the 32Khz
G-https://huggingface.co/SeoulStreamingStation/KLM49_HFG/resolve/main/G_KLM_HFG_32k.pth?download=true
D-https://huggingface.co/SeoulStreamingStation/KLM49_HFG/resolve/main/D_KLM_HFG_32k.pth?download=true
I'm going to use another link and see if that is the issue.
the rest works perfectly
you already setted the extra, f0, and everything else?
because you shouldn't leave them on default
on default it uses your CPU
I have to come later, my parents want me to come down
alright, let me know
I have everything perfect When I turn it on everything sounds good but I think I have to use a different voice haha
seems like the pth wasn't downloaded fully #✨│ai-help message
maybe retry
I have few small questions regarding onnix files/settings. Basically based on the guide for deiteris fork, onnix isn't needed anymore for NVIDIA users but can help AMD/Intel users right?
And regarding chunk, iirc on old okawa, if i was giving more, there was the delay but was giving better quality, and this remains the same here right? tried 85ms (hard to hit 72 but that was recommended for 3090 as was highest card with recommended settings) and i believe it gives good results but if i want better, having on idk.. 100-200 should help the quality right? altho delays
and F0 est can be changed from rmvpe_onnix toi simple one as no onnix file is used i guess
amd users werent able to run .pth before, so they were forced to use onnx
nowadays the fork allows the usage of .pth in amd/intel
you can use rmvpe_onnx in a non onnx model but why? its a downgraded version of rmvpe that is meant to be used with an onnx model
you're just increasing your cpu usage for no reason if you use onnx f0 with a .pth model
onnx is there for compatibility, not because its better
hey guys what is this? Please open the following URL in your browser.
http://<IP>:<PORT>/
like I downloaded 1 model and tried to reopen start http
guess i can't post the link to the guide
but basically i followed the settings in the guide from rentry link and set the settings there (+ advanced) and helped the voice to sound more smoothly.
can you provide further log? theres not much to make from that single line, perhaps the errors/outputs before it?
actually the fork should give better results because fp32 inference is way better than fp16
but that depends in the model
a trash model is gonna sound trash no matter what u do
i think fp32 helped more ? not sure but anyway using as well now vc lite
its not a quality setting
is a stability setting
fp32 calculations are more precise than fp16
in simple words, that translates in the voice not randomly glitching for no reason
it said in guide that greenm means stable right? can it be misleading? i mean be a bit unstable but still showing green?
bcs it was having some weird cuts first time
I message you
check out
screen
then prob thats why it wasn't working that well
original w-okada has only fp16 inference
so things randomly explode there
random artifacting, voice completely glitching, etc
was on this one from deiteris at begging when i said that it was having random cuts
a thats a you problem
try chunk 150ms, 2,7s extra
.pth
don't use onnx
1s crossfade length
did that after looking more
wanted to show final advanced settings pic but my perms expired i think
changing browsers also help, i've heard people have issues while using operagx
its weird idk
i know that random cuts happen when the extra is too low or when you're lagging
or also when chunk size is too low
onnx puts more stress in your cpu so if u were using onnx that also might explain why the voice was lagging
but anyway, i have:
protocol: sio
crossfade 0.15
silence on
fp32 on
disable jit on
false onnx
protect 0.5 (left as it is as idk what to set exactly...)
skip pass through conf no
chunk 85
extra 2.7s
i have a model with pth file no onnx
and no index as i heard it's not needed anymore
anything that needs change?
using rmvpe as f0det
-90 db in. sens
it means the file it tries to open is damaged/not downloaded properly
and at audio i set the SR to 48k
i would try:
close the voice changer > install chrome > set default chrome as default browser > open the voice changer > change chunk to 150ms, protocol rest
will try as i have chrome as well but not using it unless smth doesn't work on opera or firefox :))
well it's more or less ok. just asking if those settings are fine as they are or needed small changes but gonna try now chrome
chunk 85 is fine as long you're not gaming
for games increase chunk to 150ms ~ (well that depends how gpu heavy the game is)
its random so i cant tell ya
gotcha!
every game has their unique chunk value lets say lol
iirc, i think it said somewhere to increase if i use a game
yeah
just play with the chunk value, you know u did it well when the voice is stable

thanks!
uhhh, wanted to try another pth model with index file and got this error...
wait, i stopped this and started again and it works now?!?
lemme guess, are you using a youtube tutorial for a realtime voice changer?
ye
all video tutorials are outdated, uninstall everything you got off it
what's your pc gpu?
I reloaded everything but now its even dont opening
download the fork made by deiteris. that should work after setting up and reading the guide from rentry website
good enough
ty
just uninstall the version you got off youtube and vb audio cable
vb audio cable gives random errors on windows
I mean I installed this of website
the version you got is an old original wokada
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
from the git hub
did u follow the 1st link guide?
I can send link
follow the guide here for deiteris fork
the updated link is the one above
the wokada deiteris fork
Wokada has 2 main versions:
- Original made by Wok
- Deiteris fork (modified version) made by Deiteris
each version has it's own updates
the wokada deiteris fork is more suggested
also contains the link for virtual cable lite
it's better you don't use the version you got off youtube
it's outdated, it got worse performance and quality
if you're talking about https://github.com/w-okada/voice-changer , this is the version i'm talking about
you should not use it
it's the original wokada, it's not as good as the wokada deiteris fork
it's better you just forget what you got off youtube and follow the deiteris fork guide: https://rentry.co/forkvoicechangerguide
video tutorials aren't up to date
Hello, is there a way to do some TTS client that automatically applies an RVC model onto the TTS?
Currently bashing my head against my desk rn, I'm trying to get llama py to properly use my gpu with cuda to generate llm text. and no matter how I install the pip packages, no matter how i set my build flags. It keeps saying it's assigning layers to my cpu, any help?
I've tried pip install llama-cpp-python --upgrade --force-reinstall --no-cache-dir --compile --extra-index-url=https://pypi.nvidia.com, set CMAKE_ARGS=-DLLAMA_CUBLAS=on before building, CMAKE_ARGS="-DLLAMA_CUBLAS=on" pip install llama-cpp-python --force-reinstall --no-binary llama-cpp-python, $env:CMAKE_ARGS = "-DGGML_CUDA=ON". etc etc
Python 3.12.6
RTX 4070
Toolkit 12.8
May I know the effect of Pitch, Index, F0EST, N.GATe parametsrs in Voice Changer Client?
python 3.11
I do have build tools installed though
Yeah I've followed through with all that in the past on 3.12.6 and it just uses a cpu version, not sure if that different py version will make a difference
Unless I have an issue with my py
llm = Llama(model_path="./models/ana-v1-m7.Q4_K_M.gguf", n_ctx=2048, n_gpu_layers=32)
I do specify ctx and gpu layers
make sure you uninstall CPU one, then remove the pip cache
then pip install llama-cpp-python --extra-index-url https://abetlen.github.io/llama-cpp-python/whl/cu126
although that failed for me
sure ill clear my venv make a new one and use that and see if it makes any difference
not venv, the pip cache
well pip is in the venv no?
clearing the venv would clear the pip cache unless im mistaken
no, it is un global repo
huh okay
pip cache purge
pip install llama-cpp-python --extra-index-url https://abetlen.github.io/llama-cpp-python/whl/cu126
states requirements already satisfied for everything
Oh 1s
Okay I've cleared everything out including cache, install llama-cpp-python using that.
When I start my python script it still appears to not raise my gpu usage or my vram, and it states cpu layers in the terminal.
load_tensors: layer 0 assigned to device CPU
Does LM Studio have an api?
Huh well I finally found this error after trying to compile my own with cuda
No CUDA toolset found.```
-- CUDA Toolkit found
So I have cuda toolkit but not toolset.. No clue what that is and can only find toolkit downloads
there's just way too much wizardry required for installing things on windows
LM studio provides a standard API for the model
send a payload request, get whatever back
I run orpheus TTS with it
I sound like a stoner, at the beginning everything was fine and after 30minutes of use I sound like I've smoked weed
How do i run this through vrchat without the thing echoing there voice
share a screenshot of ur wokada
!give-media-perms 1h @sick fog
The frick is wokoda
downloaded this do not know if this si the one
the github is a mess to traverse
like it just will not work on vrchat even when i got vb audio connected
or smtimes the oputput of my actual game goes to my vb audio which is so annoying
i got these settings if i have to I'll swapout my 4060 for my rtx 4090
or just go afk ig
0%| | 0/56 [00:00<?, ?it/s]/kaggle/tmp/.venv/lib/python3.10/site-packages/torch/autograd/graph.py:744: UserWarning: Grad strides do not match bucket view strides. This may indicate grad was not created according to the gradient layout contract, or that the param's strides changed since DDP was constructed. This is not an error, but may impair performance.
grad.sizes() = [64, 1, 4], strides() = [4, 1, 1]
bucket_view.sizes() = [64, 1, 4], strides() = [4, 4, 1] (Triggered internally at ../torch/csrc/distributed/c10d/reducer.cpp:325.)
return Variable._execution_engine.run_backward( # Calls into the C++ engine to run the backward pass
/kaggle/tmp/.venv/lib/python3.10/site-packages/torch/autograd/graph.py:744: UserWarning: Grad strides do not match bucket view strides. This may indicate grad was not created according to the gradient layout contract, or that the param's strides changed since DDP was constructed. This is not an error, but may impair performance.
grad.sizes() = [64, 1, 4], strides() = [4, 1, 1]
bucket_view.sizes() = [64, 1, 4], strides() = [4, 4, 1] (Triggered internally at ../torch/csrc/distributed/c10d/reducer.cpp:325.)
return Variable._execution_engine.run_backward( # Calls into the C++ engine to run the backward pass
i keep getting this error
with applio
Hey guys, I’m having an issue with Codename-RVC-Fork (v3.1.0-rev1).
When I start training, it crashes while loading the filelist.txt due to an encoding error.
Anyone know how to fix this or had the same problem before?
Im having troubles with the Real time voice changer, it keeps buffering and not play the full audio just parts of it how can I Fix this
it is a mess because there is a better one here https://rentry.co/forkvoicechangerguide
i just got that one
took a long time searching through old support messages tho
it seems fine tested it through discord idk bout vrchat thoo
vrchat in vr mode or desktop
?
coz the 4060 might not be strong enough to handle vr mode + the voice changer
but i mean the only way to know is to try it
just disable badly optimized avatars and reduce every gpu intensive task in the game
want me to swap to my 4090?
also i have all avatars off cus I'm friends with alot of crashers n they join me n crash so its annoying
ah you've got 2 gpus in one pc
Nah
my 4090 is in my other pc behind my desk
I'm upgrading the case but I'm too lazy to swap the parts but if i need to I'll do it rn
first try if the 4060 can handle vr mode, iirc the 3060 can't, and the 4060 isnt that different from a 3060 sooo
i think its like 10% faster?
Can anyone help me fix this?
there's something wrong with your dataset
it sounds perfectly fine and is at the right sample rate
i did not say how it sounds, I mean the files are fked
did you resume training with a wrong sample rate?
no everything at 40k
double check
the error is because it tries to cut a piece of the file and gets past the end of the file
yeah im sure i tried deleting the problematic dataset but even then its still not working even with the previous datasets that worked
the preprocessed files might have fked up
if not sure, start over from preprocessing
not sure what to do now
i mean i started a new voice model with the same ones as before and i mea nits working now..? only issue is now i gotta wait for 1000 epochs sadly but it is what it is i guess if anyone knows how to fix the other one please lmk
yo
quick question
does mic quality in okada matter?
cause my mic is fucking ass
What settings should i use for a pretty quick ai voice but for it to sound realistic
define being realistic
if you want it sound less robotic, try another model
Hello, I am looking to get information or an analysis on a specific Ai instagram model / influencer.
This influencer uses a unique approach to video creation. And I cant pinpoint exactly how they are creating their videos
I am willing to pay someone to do research for me or tell me how they are doing this, how they set it up, and How I can recreate it.
Here is the information I know:
This is the creator I want to analyze: https://www.instagram.com/marina_hasegawa_/
I suspect that they are using real reference videos taken from other sources and replacing the characters / scenery with their own. I'm not sure what software they are using to do it though
I think I know how they created the character(s) in their videos. But I don't know:
- how they applied them to the video format mentioned above
- How they are able to maintain photorealistic quality
- How they are able to do all of the above and keep the custom Text and Jersey number 28 in all of the videos
- How they change the outfit for the characters.
Stop.
Is Okada colab working?
No where in this server is allowed to promote your own things.
Im not promoting?
I'm requesting, Trying to learn how this ai influencer makes their videos
??
What is your PC GPU? Most of W-Okada Colab notebooks are broken currently.
I see. I don't have a pc as of now. I plan on getting set up with one later this December.
Thank you for letting me know. I'll be patient.
The what? You don't have one? That's bad.
W-Okada won't gonna work with mobile devices since they lack the feature to change input/output audio devices like the one in PC.
sorry but that sounds like another "I will pay you after I get some money from that"
W-Okada can run locally. If you're looking to buy a PC, I'd say to buy one that has a decent Intel/AMD CPU and NVIDIA GeForce RTX 30-40 series GPU.
afaik it may still have issues #📰│dev-updates message
you can try this kaggle notebook https://www.kaggle.com/code/suneku/voice-changer-public
i replied in #🧬│ai-chat, that isnt an ai influencer, its a real person
still they could have been using some AI tools for certain tasks
is that all it is to make it sound less robotic is different models?
How do you make a model with applio when im in the training path it says the parameters of the model do not match
start from the start
select correct sampling rate that matches the pretrain you're going to use
umm guys i was trying the ai on a game
and it kept capturing the voices of the ppl playing asw
idk how to fix this can some1 helpp ??
select the actual mic, make sure you playing in headphones and they dont leak audio
yes i put my actual mic as input and virtual cable as output
and i have echo sup1 and sup2 turned on
ughh i think i need to buy new headphones
use headphones that leak audio and turn down its volume to listen
that is wokada, the original wokada
you got an old version
wokada is the program meant for doing RVC realtime inference (use models)
never use video tutorial, delete that wokada along with vb audio cable
since you got a 4060, you're good enough
read the updated wokada deiteris fork (modified version): https://rentry.co/forkvoicechangerguide
elaborate:
- your pc gpu
- what you want to do
- what tutorial link did you use
- a screenshot of the program and error
best rvc fork for gtx 1660 I wanna train my own voice model
- if anyone knows any slovenian pretrain models
best gas for 1975 pinto? I wanna race F1
or slavic languages
original wokada hina mod colab is broken
wokada deiteris fork colab works, but you need a pro subscription because google detected it uses a web ui, which isn't allowed in free ui
tell your pc gpu and what you want to do
I already made 2 with this gpu
but I forgot which rvc I used cuz it was like 2 years ago
yes it took 2 days of my pc running non stop but it was worth it
elaborate:
- what's your pc gpu
- what you want to do
- what tutorial link did you use
- a screenshot of the program
simply that gpu is cooked
.
you could even train but it would be slow asf and limited by vram
it's not worth to train locally when cloud exists
alr thx I'll check it out
Train (make) RVC Models on cloud:
- Prepare the Dataset
- Setup RVC:
Choose a cloud way to use RVC,
- Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
- Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
- Mainline (UI)
- Applio by Vidal (UI)
- Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
- Be sure to know about the tensorboard
Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time
If you are looking for the easiest way and for free, is using https://weights.com/ which ofc uses RVC
RVC Inference (use models) on pre-recorded audio on Cloud
You can use either:
- Weights.com: Easiest Possible Ever Automatic
- Ilaria RVC Zero: Fastest free on cloud
- Applio UI Colab: RVC Fork with some extra features like TTS
- RVC AI Cover Maker UI: Automatically Separates the Vocals and Instrumentals, converts the voice and mixes them back
Mainline is broken though
thx 🙏🙏
it's just better to use applio kaggle or colab
?
or any slavic pretrains
yw, if u really want to use local, applio is good https://docs.aihub.gg/rvc/local/applio/ but it would be just way slower
Last update: Apr 01, 2024
don't think so
maybe check #1235952130855010365
but last time i checked i don't remember seeing one
ye that's why im asking haha
didn't know I can train on cloud
I'll check that out first
cloud means you're using using a remote good pc btw
I would not recommend those claiming to have some, they may have the audio of specific language, but the content so variable it sounds shit
cloud isn't as much stable as local ofc, it does happen it breaks easily, but currently applio colab and kaggle are working, just mainline is broken but anyways applio is better
regular default pretrain + 30-60 min set produces good output
so what pretrain should I use?
with default pretrain, no issues with the language
alr
Noobies tried to train a 44k pretrain for refinegan but it didn't work out well
Interesting
Hi,
I haven’t been able to find any Kaggle notebook that works with Codename-RVC-Fork 3.
Has anyone here managed to adapt it or get it running on Kaggle?
i dowlaod The VoiceModulation but how do i open it
Heya, since it's possible to use it on multi pc setups, i have an older pc with 1660 3gb vram. i believe that won't be enough even if i run just the app on it, right?
Explain about the "app" you're trying to run over LAN network.
altho i know that i must run at these settings for 16xx series:
256 ms chunk + 2.7s extra
the deiteris fork
but just wanted to ask if it may work decently
since it has just 3gb vram...
With NVIDIA GeForce GTX 10/16 GPU in a host PC, it will still work fine, especially Dertis' fork W-Okada. Don't expect it to run W-Okada with a game on host PC that well.
sorry if it doesn't make sense or u don't understand. please ask and i'll try to explain better.
Yeah, ive tried with the old one 1y ago+ and didn't like it at all :))
and wasn't deiteris fork on that time too :))
tried to use kaggle applio and got error when starting : ERROR: Exception in ASGI application
when I try to use the ui link ngrok says : dial tcp 127.0.0.1:6969: connect: connection refused
TypeError: argument of type 'bool' is not iterable
An error occurred launching Gradio: When localhost is not accessible, a shareable link must be created. Please set share=True or check your proxy settings to allow access to localhost.
then thanks for the fast reply ^^
local install?
for kaggle import the latest notebook from github
create a notebook, import notebook,
alr
it wont let me run the ngrok cell?
only the first install one
i'll run the install one first ig
works now 👍
sooo, did you try the settings I told you?
btw how's it going
can i have some guide to use?
just install it
wait
what's your pc gpu?
aren't you using wokada deiteris fork already?
input: real mic, output: line 1 - out of the virtual cable
yeah, uninstall vb audio cable, get vac lite at step 3, it's said in the same guide you followed to get the program
discord input: line 1 - in of the virtual cable
don't skip steps
No, I will pay you to get me the information about it.
Check the Request, Its not a real person.
Read my responce to you
It is Ai generatred, and I'll prove it to you.
Check your dms
I tried using aicovermaker keggle but it gives me erorr : python: can't open file '/kaggle/working/main.py': [Errno 2] No such file or directory
and ngrok when i go to link : dial tcp 127.0.0.1:7755: connect: connection refused
@viscid moss
your ai covermaker ui is broken
copied from here btw
should I import it from somewhere else?
what is the github for the kaggle version
,
aaa lm try rq
haha rip
lemme try
are u sure that u run the right notebook?
Version 6 btw
it's working for me
can you send me the link?
on Kaggle and Colab
the run.bat?
Ok, make sure to run version 6
ye
not the run.bat, it has dependencies conflicts
ppl said that precompiled version isn't working too
the run.bat seems to be working so far on my end, cloning main branch
let's see
My cpu is amd Ryzen 7 5700g
did you check in task manager, performance tab ?
That is what it says
GPU is one of the most important things in intensive tasks like AI, running it on cpu is useless since you're gonna have unstable ping
you can run it on cpu but it's not worth at all
Ohhh fairs right I’ll leave it then
so yeah your pc would be too weak for a decent local experience
you can use cloud
I’ll just get a 1080 then
which is remote good pc
The google drive thing?
Is it free?
i think you meant google colab
you can use google colab for 4 hours daily max for free but the gpu time is kinda random
the issue is that hina mod original wokada is broken, and that the wokada deiteris fork colab gets detected for using a web user interface which isn't allowed in the free tier, so you'd need a pro subscription
but, you could use Kaggle for free, with 30 hours of weekly gpu, it's just a bit harder to use and needs a phone number verification (it's owned by google)
this is the wokada deiteris fork kaggle: https://www.kaggle.com/code/suneku/voice-changer-public
I mean that's kind of decent, but I advise you atleast an RTX card, since that card is barely higher than the bare minimum
are you going to use this in discord vc or games too?
Thank you appreciate it I’ll check it out
Dc vc
Is a rtx 3050 good?
with a gtx 1080 it's decent only for discord vc, like it works but not max quality and low delay
btw are you going to buy a laptop?
Naaa I’ll just add it to my current pc
Ohhh I see
I would suggest an rtx 3060, btw you're not really forced to buy it since you got cloud
yeahh its for the future possibly
yeah, if you want even an rtx 3050 would be good but an rtx 3060 might be better for higher vram, tbh depends also on what you're going to use the PC for other than this program
Yeah that’s also true I would be prolly be using it for gaming
I’m just setting up my acc rn
Then yeah if you want a budget GPU an rtx 3060 might be better
Or you could get an AMD card since they are cheaper, it would be good for gaming
Just keep in mind that ai support on AMD isn't as good as Nvidia, but fortunately Wokada deiteris fork improved amd performance
Oh is it? That works quite well ngl
I’m just confused rn with the installation
Yeah, you could have issues on AMD for other ai programs though
What's wrong ?
And that’s true also
i cant find the notebook option
Click edit my copy, then check session options
Kaggle recently changed the name of notebook options to session options
okay im on session options what should i do?
oh wait
nvm
i just need to put the gpu
how do i do the 2nd part
like this?
That's just the name of the cell basically, just run what's below
okay
so like this?
just paste it into an empty line?
and for this do i copy the whole thibf?
Nope
All you gotta do is run the cell
There's a play button at the top left of the cell
yeahh like this?
i pressed this command but im unsure if its doing anything
Is that ok if Time per epoch is 0:00:01 while training model?
yyes
yup
yup it did as u see in the output
Nice, be sure to set https://rentry.co/forkvoicechangerguide#virtual-audio-cable only this part on your pc
A VAC (Virtual Audio Cable) makes a fake audio device, used to re-route the audio of different programs
In Wokada context, it's used to get the output of wokada as the input in other programs
- what's your pc gpu
- what tutorial link are u using
- what's your dataset size and training settings
1 seconds per epoch seems weird
yhyh i got all of that
thank you tho
u got vac lite, not vb audio cable, right?
bc vb audio cable gives random issues on windows
nooo i got vb

i got it yesterday
uninstall it
ill change it dw
get vac lite from that link i sent, unless u want more issues lol
na no more issues pls
also share a screenshot of ur wokada, so i can suggest settings
how do i uninstall?
okay ill do that right after i get vac lite
go in the windows settings, app > installed aps > find it > 3 dots at its right and click uninstall
yep i just done that
🔥
vac lite
yh i just done that]
@viscid moss
be sure to check the guide installation too below the blue installation link
set gpu to the kaggle gpu you set before in session options
set f0 to rmvpe without onnx
set extra to 2.7
set input to your microphone, and output to line 1
theres 2 options
select the first
okay
looking good?
ill set my mic in a bit
and for the characters shall i just import them?
- RTX 3060TI
- First tutorial - https://youtu.be/Hx2IHzt5tAc , Second - https://docs.applio.org/applio/getting-started/training
- Dataset Size - 5 minutes.
Settings:
RVC V2, Sampling Rate 32K, HIFI-GAN, rmvpe, contentvec, Batch size 8, Total Epochs 500, Save only latest, Save every weight, Pretrained, Custom pretrained (G_SnowieV3.1_32k.pth and D_SnowieV3.1_32k.pth), Index alghoritm - Auto
you can just click edit and upload the models
you need to set the input to microphone and output to line 1
yeahyh ill do that later as i havent got my mic rn
oh yh i forgot
the first tutorial seems outdated
the settings look fine, are you sure it's doing 1 seconds per epoch?
could you send a screenshot?
I can't send screenshots
Mel Spectrogram Similarity: 99.84%
Time per epoch: 0:00:01
!give-media-perms 1h @gusty cloak
okay wtf 😭
idk 💀
something surely went wrong in the training
I'm using Codename-RVC-Fork v3.1.0, but i have same results in Applion
it's the first time I see such thing, have you also tried retraining it? maybe changing the pretrain?
@simple ore have you ever seen this before?
nope, check #💬│engineer-chat
I had time per epoch 10 seconds before i started using pretrained
@gusty cloak training a model on mute files because they forgot to slice stuff
kinda random
but would u know the name of this song?
Should I change Pretrained or what?
do the preprocessing again
I haven't heard the name of this song, though it's written to be named "Phantom"?
yeah, i tried looking for it but i cant find it oh well
does anyone know why my thing doesnt work when i try use my gpu for okada
Checked all checkmarks in preprocessing and it works fine thx
elaborate:
- what's your pc gpu
- what you want to do
- what tutorial link did u use
- a screenshot of ur wokada
!give-media-perms 1h @last robin
Hey I used some google colab back when I used to make covers and I found it here but now it does not seem to work so like... How do I make covers lmao
i have a RTX 3060 im trying to use a voice changer i dont know what you mean by tutorial link but i have been using it for the last couple days and has been work fine but noe it says invaild or unsupported data type ComplexFloat on the command thing
the wokada stays the same but just says failed to fetch when i try switch
voices
it works with everything else but my gpu even tho it had been working on it for the last couple days
click edit, select a slot, upload model (.pth) and index (.index)
then just click the slot
and start
what's your pc gpu first of all
yes, i suggest you to rename the model file to smt else though, not just model
okay do i just press edit?>
Geforce RTX 2070
can you share a screenshot of your wokada?
no don't edit directly by the ui, it's broken, you need to edit the model files name, and if you already uploaded the slot, you can just re-reupload it
ah okay
lmfao you didn't even need to use google colab
google colab is a cloud method, meaning it lets you use a remote good pc, and it's meant for people with a bad pc, it has limited gpu time in free tier
As you got a good PC, you can use RVC locally, you can choose between:
- Applio: A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
- Mainline: The original RVC
locally means you will able to utilize your own gpu and have no limits
or do you still wish to use cloud?
yes
okay
lemme guess, you used a youtube video tutorial?
yes i did is that wrong
also weird it's not showing all settings for some reason
yup, all video tutorials are outdated
you're using original wokada which is more bugged, and also you are using vb audio cable which gives random issues on windows
so yeah, using this it's normal that it will stop working randomly lol
you can uninstall everything you got off youtube
i uploaded it
where would i find the updated version
there's 2 wokada main versions:
- original made by wok
- deiteris fork made by deiteris
each has it's own unrelated updates
check https://rentry.co/forkvoicechangerguide for the latest wokada deiteris fork
be sure to uninstall the original wokada + vb audio cable
alright, are you having any issues?
to uninstall t is there a certain thing or do i just delete the files
yeahh i cant seem to choose it
show a screenshot of what u see and what browser do u use
wait it shows you didn't upload anything, did you click upload ?
yeah i did
Oh LMAFO ty bro I just watched a yt vid like a year back that said to use the colab so I did
so did you do everything as said as in https://rentry.co/forkvoicechangerguide#uploading-models ?
youtube video tutorials are outdated asf and don't give all info for rvc/wokada lol, yw and lmk
yep i did
if it helps
the person that uploaded it says that
you cant use a gpt sovit can you on here
RVC and GPT-So-VITS are 2 completely different AIs
RVC is meant for Speech to Speech
GPT-So-VITS is meant for Text To Speech
not all voice models are RVC
yup
in rvc context:
- pth = the voice
- index = the accent (be sure it's the added_index and not trained one, anyways most models should only include just the pt and added index)
i see makes sense now
I just started making models and i'd like to know if everything going fine
hi again im very confused where i actully down load the new wokada
elaborate:
- what's your pc gpu
- what tutorial link did u use
- what you want to do
- a screenshot of ur wokada
!give-media-perms 1h @somber kraken
did you read the guide? it's all inside the guide, get the windows nvidia version
ohhhh sorry im very dumb i had misread the guide
it's fine lol, lmk
so pretty much
im running a 4060
i was searching for a voice changer
to troll my friends
good enough
i remembers watching this video
when i was little
This is How Villager AI Videos are Made on TikTok and YouTube.
THIS VERSION ONLY WORKS WITH NVIDIA GRAPHICS CARDS AND WINDOWS 10
Voice Changer Download:
https://drive.google.com/file/d/14IeoglqoxXk9D_SbCU70grZgLBt_8EwA/view?usp=sharing
AI Hub Discord Server:
https://discord.gg/aihub
(For in-game use) VB Virtual Audio Cable:
https://vb-audio....
I hope not a youtube video tutorial
nvm
and that it had
a voice changer
so i searched it back u[
it works but
i have allot of output media
does anyone know how to get more gpu usage from applio
this shit is an old version of original wokada
it's over a year old lmfao
simply you wasted time watching that youtube video
how do i hear myself
i just dont know which one to choose
no, it's bugged and has worse performance and quality
oh
forget everything of that youtube video
all video tutorials are outdated
only written guides are up to date
hmm alr
@low shard how do i hearmyself?
but which one would i theoretically have to pick to stream it to my mic
uninstall everything
wokada has 2 main versions:
- original made by wok
- deiteris fork made by deiteris
each has it's own updates
latest wokada deiteris fork is currently the best to use, read https://rentry.co/forkvoicechangerguide
it's just better you forget that guide at all
it's bugged, just don't use it bro
i just need to know which output device is
even the vb audio cable you used
it gives random issues on windows
it's not worth to use those things at all
i dont have an audio cable
you're just going to waste time trying to fix something that is not meant to work
A VAC (Virtual Audio Cable) makes a fake audio device, used to re-route the audio of different programs
In Wokada context, it's used to get the output of wokada as the input in other programs
i have a cheap usb mic
i will
it has everything you need in it, the updated version is much better, and it got the vac lite
but its 10:30 pm rn
without a vac, wokada can't work at all, in all versions
so yeah you just kinda wasted time lol
set the output as your headphones in wokada
okay
i mean the vac is at the 3rd step of the wokada deiteris fork guide, and that should work also with the version you got rn
but anyways let me know
are there any other issues?
no but forsome reason i cant hear myself
can you share a screenshot of ur wokada?
thanks man it works
set output to line 1
and monitor to your headphones
-# sorry for my confusion lol i was helping 2 people at the same time
after doing that, you should be just able to select any model, start, and hear yourself
does anyone know anything similar to kaggle i can use to run applio for training
wdym?
ohh thats fine
i ran out of gpu usage and i cant find like a paid option or anything to get more
also be sure that the chunk has a bit higher value than the perf value at the top left
the rest seems fine
-# let me know ofc
ran out of gpu usage?
could you show a screenshot and explain more?
did you seriously use already the 30 hours weekly of kaggle?
yeah😭
okay done that its still not working
how many models are u cooking 😭
30 hours is already alot for free tier
legit just 1 but all the errors ive had i had to restart the model from scratch completely like 6 times
kaggle doesn't have a paid tier to increase gpu usage
what if i just export the log folder and make a new account
well, google colab gives max 4 hours of gpu daily (which can be random), but there's a paid option there
you need to verify with new acc, so new email and new phone number
if it's the same email or phone number, it won't work
yeah thats fine ill probably js use one of my familys numbers
what's not working exactly?
so if i export the log folder for the model and put it in a new one can i just resume the training like normal or is it more complicated
which to select? the output? you should uninstall vb audio, and then select line 1 as the output in wokada
and then select the input in the game/discord vc as line 1
i believe my mic is working i just cant hearmyself
should i try a new broswer/?
huh
be sure the site has microphone permissions, also try chrome or firefox since operagx usually has issues with wokada has reported to other users
can you elaborate your help request?
yh will do
well without info it's hard to help
alr
🔥
i just refreshed and it works
which vb audio should i select, there are multiple options. i dont know which one to picka
or should i not use vb audio
cuz i'd rather stay on the old version for now
do i change it
You shouldn't, yep
ye but for now
Vb audio cable is bugged on its own for windows
The vac lite from the 3rd step of the wokada deiteris fork guide will also work on the old version
So you'd just need to follow that single step of the new guide to make it work on the old version
https://rentry.co/forkvoicechangerguide#virtual-audio-cable specifically only this
If you want to update it, you will just need to run the other steps, but for now, you can just get the newer vac lite to make the old version work
I could just suggest you the force fp32 mode, this will use a more stable and higher quality inference (use models) mode, but will higher up a bit the delay
i want to use this on ig would u know how to do thar?
oh no it works now
IG? Instagram?
yeahh
It's an advanced option to get higher quality at the cost of some delay
It's optional lol
ill try it
i mean my delay is not that bad
I don't use it, but You'd just need to find the voice settings, set input as line 1, and then set output as headphones
okay ill see
Yeah bc it runs on cloud lol, the only thing that could kinda nerf the delay is your internet speed in this case
yhh thats true
so if i wanna use it on dc or any other app
how would i do that?
you'd need to do the same process I said in #✨│ai-help message
it works for every app
where is that?
you'd need to check the voice settings inside the settings of the app
in discord just click the gear, go to voice & audio tab, and here you can see the input and output
ohh okay
that's for wokada
for other apps (like use wokada in disocrd):
input: line 1
output: headphones
how do you delete a model that youve uploaded
elaborate:
- your pc gpu
- what program are you talking about (example: Applio, Original Wokada, Wokada Deiteris Fork, etc)
alright
idk what any of that is
what tutorial link did u use?
I hope you didn't use a random youtube video tutorial lol
so, everything is fine now right?
i just kinda did it all by hand i didnt need a tutorial
where did you download the program?
yhh should be however my voice doesnt sound liek the sample one
idk hwo to remove them off my list though
either reupload the slot, or remove the model files in the model_dir folder inside the program
also, be sure you're using the b2332 version
wheres the folder
the accent of it is not there
and whats b2332
can you share a screenshot of your program folder?
where is it
this program is in a web browser idk where its folder would be
you can higher up the index value, which will make your accent similar to the trained accent, but could sound like using autotune
could you send a screenshot of whatever you're using to opening the program ?
thats true yeah ill do it a bit
there's thousands of AI programs
this
i was told this is the most optimsed one
that looks like a shortcut,
right click it > properties > check the absolute path
you will get the path of where the program is
is it the latest version?
check inside model_dir, here you will find the model files
yup
you can also check the version by opening the program itself, you should see b2332 at the top left
ok
it's wokada deiteris fork, not sure if it's the latest though
it says B2332
yup latest
guess its abandonware
here you can delete the model files of that slot
welp nope, the b2332 version is actually from 2024 december
set monitor to none
okay ty
-# which is why it's weird you said you got it from october
any other issues ?
there's thousands of AI models, maybe try using other models
yhh i might do that
do yk any girl ones that are good
also it doesn't automatically add emotions, it depends on how you talk too
ty
or
You can search rvc ai voice models at:
- #1175430844685484042
- In #🔍│find-models , Do /find with @earnest musk
- https://weights.com/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.aihub.gg/essentials/how-to-make-voice-models/
:wave: @low shard, How can I help?
Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image
okay will do
nick
i want to update now :D
for sum reason
when i start a discordcall the model stops properly outputting
yeah, I would have guessed that since the version you used works on hopes and dreams
please, start by reading https://rentry.co/forkvoicechangerguide and let me know for any issues/errors/questions
If I wanna use it again what do I do?
you will just need to run the last cell of the kaggle since you already installed it on kaggle and the files remain
okkk
@low shard i just throw the folder inside of the logs and continue training thru applio?
or do i need anything else
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
1st link, wokada deiteris fork
i desperately need help on how to even get the voice changer
it was like a year ago when I installed okada, I got a windows new pc but can't find the download link
what's your pc gpu
what's your pc gpu
GTX 1660 TI
good enough
my CPU is a Ryzen 9900x, maybe its more powerful
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
why are people using a fork?
video tutorials are all outdated btw
what happened to the original
fork means modified version in IT term
the wokada deiteris fork has improved performance, and has advanced settings that can help with better quality
well alright, I'll see if it works

