#✨│ai-help
1 messages · Page 231 of 1
hmmm
It will work still
alrr
should i just stop the training now or should i see if it goes back down
I think it won't go down
so ya
bet
IT STILL DOESNT WORK

do i need to download the vb audio cable bc i uninstalled it
even tho it still shows in the thing..
it lets me do it on server but not client and idek what im doing anymore
What cloud
?
wHAT SHOULD I TRAIN ON COLAB
I'm gonna train a robot voice
can i use UVC to just remove bg noise on audios and stuff
i got a voicemodel someones making for me but he needs more dataset and im wondering if that works
hmmmm can u send a screenshot of the CMD? Is there any error?
yep
it depends on what type of bg noise tho
not much
its mainly just like bits of music or backround noise on my end when someone messages me
that could be removed with UVR
alr
prob with MelBand Roformer | Vocals FV4 by Gabox
i was using MDX23C
this is not 2023/2024 anymore
Thats what it said on the document
it still doesn’t let me put my input in
replace the word 'token'
I'm noticing a weird issue with my fresh install of the deiteris fork of w-okada/wokada; I'm on Windows, and I grabbed the voice-changer-windows-amd64-cuda.zip.001 and 002 from https://github.com/deiteris/voice-changer/releases. I'm just following the install guidance at https://github.com/deiteris/voice-changer.
Whenever I attempt to launch by double-clicking MMVCServerSIO.exe, I get a strange crash.
- I double-click MMVCServerSIO.exe
- Terminal opens up
- Lines are written stating I have Python 3.12.7, noting the Voice changer version as b2332 NVIDIA-CUDA, loading weights, and noting all items are downloaded with all weights loaded.
- One final line is written "[main] protocol: HTTP"
- About a second passes
- Loading icon / spinning ring appears on mouse cursor briefly
- Terminal crashes
In effect, the terminal window that opens when double-clicking MMVCServerSIO.exe seems to crash after the "[main] protocol: HTTP" line.
I can see that all of this is suitably being written to vcclient.log, given the file's updating with a new Date modified date/time whenever the terminal crashes... But there's no error recorded. When I open vcclient.log, the final entry is always just "[main] protocol: HTTP"
I'm not exactly sure what I may be doing wrong here; I should be on a stable GPU driver for my 40-series card, and I have no pending Windows updates. BIOS for my MSI motherboard seems to be fine. I'm not having issues running or launching other AI-related local programs, just the deiteris fork of w-okada.
This is additionally extra weird, because I was using w-okada perfectly fine about six months ago in October 2024. I can't tell what might have changed between then and now, if anything.
I just keep seeing the MMVCServerSIO.exe terminal crash on that "[main] protocol: HTTP" line.
It's very strange.
===============
IMPORTANT EDIT
Unplug your monitor from your motherboard. It fixes this. I don't know why.
I use Intel(R) Core(TM) i5-8265U CPU can it run rvc stably on discord
You run RVC or W-Okada the realtime voice changer with that CPU? That doesn't sound right.
You got any GPU?
Should I use RVC o W-Okada? :o
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
That's a bad question. RVC is regular voice changer, while W-Okada is "realtime" voice changer. Both uses RVC voice model to inference, the difference is how they supposed to work.
i have but it is intel R UHD graphics 620
That's an integrated GPU, and even worse.
yeh ☠️
so what should i do
What should you do? That's one of the most asked questions I've ever seen. Running W-Okada with only CPU isn't so recommended. You can try W-Okada on Kaggle instead, but you'll have to register with your phone number on this one, and ngrok account needed for token. https://www.kaggle.com/code/suneku/voice-changer-public
ok =()
Terribly sorry for the bad question, though thank you for replying!
I just tried RVC and it is just freezing upon launching the realtime gui
so I'll try wokada
What is your PC GPU?
4070ti!
Download and use this better W-Okada. https://rentry.co/ForkVoiceChangerGuide#download-for-nvidia-gpu-on-windows
Oh thank you!
Perhaps a bad question again, but these forks are just like distros for linux, right? :o
This link actually also helped me diag another issue
With this up here
It's very handy
Specifically in my case, I've tried the... admittedly unorthodox play of attempting to use the 5000-series-specific download, which is mentioned on that Rentry
that version of W-Okada and its MMVCServerSIO.exe will mention process exit codes / error codes
I don't know why that functionality isn't in the main version or the deiteris fork.
But still, I now know that this behavior up here's happening in my case during the HTTP launch due to some form of access violation
good ol' 0xc0000005
I was about to reply to your message, but the way you word your message is kinda confusing.
Yeah that's on me
The gist is that running MMVCServerSIO fails at the [main] protocol: HTTP line
I can now see that it's specifically failing / exiting process with code 0xc0000005
This can mean a whole host of potential points of failure varying from RAM issues to CPU issues to motherboard issues to... really many, many things.
In my case, it may also indicate some level of failure in my intel CPU which, to be frank, isn't... unexpected for my gen of i7.

The issue mentioned up here likely isn't a W-Okada thing, it's probably local to my machine and may be indicative of CPU failure.
not surprising that it refuses to run on non-50 series gpus
?
still the virtual cable problem?
tell me if u need to post screenshot
Crazy. Download Virtual Audio Cable lite now. https://software.muzychenko.net/freeware/vac470lite.zip
so i have an Nvidia 1080. i know, super outdated, but it functions for now. i just dunno what the hot new model is nowadays. so i wanted to ask what the hottest new is
i have a 1070 and they said use this
There's Applio.
-rvc
i'll look at that Applio doc rn
ty
says 2k but i know 1080 is pretty comparable. do you think that's good enough?
check the console window for actual error messages (the black background one)
alr
Any NVIDIA GeForce GTX 10/16 GPU is fine for basic AI program. If you wanna "train a voice model", NVIDIA GeForce RTX 20 series GPU or up is definitely recommended for that task.
noted. I'll probably try and suffer with the slow. buuuuuut i do appreciate you pointing me in the right direction. plus for letting me know which channel was right now. last time i was here there were WAAAAAAAAY more. so TY for all the help. i'll def be back in the future of Mangio's current training while i do HW goes poorly and i need some help with figuring out Applio
You're welcome. Mangio as an RVC program is outdated, as there's no more development going on.
mangio is abandoned indefinitely
Hi I’ve been using w-okada voice changer, and i rebooted it recently and now every model sounds very robotic, even the base models, how could this have happend?
someone help me. im using w-okada normal one last ver of cuda not forked. this never used to happen basically im using the voice changer and yk how theres a terminal running aka the .bat file for running the voice changer basically it terminated itself and the w-okada voice changer didnt close but everything stopped working now when i keep opening it just keeps terminating itself without even me even getting the change to use a model yet
if you wanna use w-okada on a game like roblox, what input and output do you use for the voice changer and roblox? im mac, so im using blackhole
how can i use rvc on kaggle
hi guys. just wondering. which of the UVR models is the best for you guys?
hi, wi want to ask. why my voice is not change
The original version of W-Okada is outdated, and very buggy at the moment. What is your PC GPU?
There are two Macs: Intel Mac and Apple Silicon (M CPU) Mac. Running W-Okada locally on any of Mac won't be that good, but I think it's possible to run it from Kaggle.
guys i need help, why does my cmd show pipeline not initialized. how to fix that
RVC or W-Okada?
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
Hi everyone, I’m looking for an expert who can build a real-time deepfake video and voice system for live Zoom/Teams meetings.
The goal:
- Live avatar (my face) + live AI voice cloning (my voice) during meetings.
- It needs to look natural, respond to conversations, and work with remote laptops.
If you’ve done something like this (or close), please DM me!
This is a serious paid project. Thank you!
Looking for someone who can start immediately or has demos to show. Open to custom solutions or existing tools integration.
This is not where you promote your job application.
Uhmm w-okada
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
Ah, it's such a mess when there are many people in #✨│ai-help looking for W-Okada at the same time.
Who’s w-okada??
W-Okada is not a person; it's an AI program where it changes your voice in realtime.
Are you using this one? https://www.kaggle.com/code/suneku/voice-changer-public
ye bruh
but i don't know how to use it =v
Hi everyone,
I’m hiring for a paid project: I need a live deepfake system (my face + my voice) for real-time Zoom/Teams calls.
Goal:
- Live avatar of my real face during meetings
- Live AI voice cloning (my voice)
- Must work smoothly with remote laptops and screen sharing
This is a real paying opportunity. DM me if you’re confident you can deliver.
Thanks!
W-Okada not changing your voice can happen when you didn't press green start button on UI, or you just didn't set something right. What is your PC GPU? Did you follow any tutorial video on YouTube before this?
Please stop doing this.
send it once and wait, it doesnt need to be the latest message
hello, i need help. i tried using rvc, but i have a problem,when i press start, in my CMD it always says Pipeline is not initialized, and my sound doesn't change at all, but when i use passthru my sound appears but it's my original sound, i'm using cuda_v.1.5.3.15 version. currently i'm using Amd Rx 570 8gb
You forgot which message you should reply to? The W-Okada version you're using is the original version, which is outdated in the end. AMD Radeon RX 570 will work with W-Okada, but it won't be that fast if you wanna use it with a game.
If you have an AMD GPU, download and use this better W-Okada instead. https://rentry.co/ForkVoiceChangerGuide#download-for-amd-intel-and-cpu-on-windows
Wow, my Kaggle tutorial is lost, so I think I have to teach someone on how to use Kaggle again.
GTX 1050 I used a fork version and I didnt like it tbh
And also this never happened before
Been using it for so many months just started getting this issue now
I know someone
@royal jacinth
Okay let’s talk then @latent kettle
Check your dm
Ok
what should i do next after extracting it?
You didn't like it? If you used the older version of fork W-Okada, it could happen. Or else something is wrong with your PC.
How do I actually make a full on ai.
how? i haven't even installed anything
while using w-okada
it just randomly happened
Okay
@hallow thistle the cmd is this right before it closes:
using default ngrok token.
checking the modules...
2025-04-27 13:38:15 | INFO | ngrok.session | Session created with auth token
2025-04-27 13:38:15 | INFO | ngrok.listener | Created listener "cd85d7f8982c8c703bb7114ff6d3bdff" with url "https://private-------.ngrok-free.app"
2025-04-27 13:38:15 | INFO | ngrok.listener | Listener "cd85d7f8982c8c703bb7114ff6d3bdff" forwarding to "tcp://localhost:18000"
vcclient
2025-04-27 13:38:15 | INFO | ngrok.tunnel_ext | forward_tunnel; tunnel_id="cd85d7f8982c8c703bb7114ff6d3bdff" url=tcp://localhost:18000
----------------------------------------------------------
Application | http://localhost:18000
Log(rich) | http://localhost:18000/?app_mode=LogViewer
Log(text) | http://localhost:18000/vcclient.log
API | http://localhost:18000/docs
License(js) | http://localhost:18000/licenses-js.json
License(py) | http://localhost:18000/licenses-py.json
Ngrok | (private-----ngrok)
----------------------------------------------------------
Please press Ctrl+C once to exit vcclient.
and i'd still have the voice changer open but would be broken too
This is the original version of W-Okada. I use fork W-Okada. Original and fork W-Okada are whole different looks..
i know. most of the stuff dont load becaue the batch terminates its self for some reason before it finishes loading and even if it finished loading voice changer models wont work nor would u be able to switch
fixed it.
Do not run W-Okada through batch. Run W-Okada on CMD instead or double click on the program.
Does W-okada on colab work ok?
Original W-Okada notebook is broken. Using fork W-Okada on your free Colab can get your account terminated.
what should i do to use w-okada =v
Running W-Okada directly on your PC and Kaggle are the only hopes.
my pc uses intel R UHD Graphics 620 gpu☠️
That leaves Kaggle your only hope now, until you buy a better PC with better GPU.
but i don't know how to use kaggle
just use virtual cables
input (your microphone) output (virtual cable you have), monitor (your speakers [this is optional its only to monitor your self])
why is it not working =v
did u give it perms
Elaborate
What's your PC GPU? What you want to do? What google colab link are you using?
that message thing that pops up in colab
more likely outdated colab that Google is banning it
Yes, elaborate what I asked you
Runtime disconnected
Your runtime has been disconnected due to executing code that is disallowed in our free of charge tier. Colab subsidizes millions of users and prioritizes interactive programming sessions while disallowing some types of usage as outlined in the FAQ. If you believe this message is in error, file an appeal. Please include any relevant context about your usage.
Your compute unit balance is 0. Purchase more
To connect to a new runtime, click the connect button below.
Yes, elaborate what I asked you
t100
What's your PC GPU? What you want to do? What google colab link are you using?
it is colab's, what's yours?
That's google colab GPU, not yours
Always check #📰│dev-updates
That one gets detected by google
Bc you can't use web UIs in Google colab free tier
And also original wokada is broken on colab
So u less you pay you can't use wokada on colab but there are other ways
so is there no ui one?
@zealous shoal first of all, use task manager to tell your PC GPU
Maybe you can do it locally if it's good enough
i have a 780ti
Only Kaggle, 30 hours weekly but it's harder to use and also you need to have a Google account and verify your phone number
https://docs.aihub.gg/w-okada/cloud/w-okada-kaggle/ this is the guide for the wokada deiteris fork kaggle
Last update: Feb 19, 2025
the acc isnt a problem
Reminder that you need also to verify your phone number else it won't work
alr
damn
Because the providers update the dependencies
Of course, it has an ancient GPU
That's why I highlighted having a good pc
The bare absolutely minimum is a GTX 900, but your PC is even way weaker than that
well the 980 performs 11% better than 780ti
Summarizing, your only options are:
- buy a new PC
- use Kaggle
- pay for colab
- wait months if not more for colab to get fixed, if it ever does for wokada
The 780ti doesn't even get recognized by CUDA anymore which is needed for AI
It's ancient
aw man
AI is an intensive task
You can't expect to run it on ancient hardware
@zealous shoal well, let me know
dang i cant verify a number
If it's giving issues, contact Kaggle support and wait till they email you back
Also no, it needs a user interface
Good luck, it's like this since 4 months
i think my friend has one
You can't expect everything for free
@low shard
Not sure but worth a try
wait about the t4 gpu
does it require display?
ill save money for it
i have an extra pcie x16
@zealous shoal Hina is busy and can't make original Hina mod Wokada to work, and no matter what we do Wokada deiteris fork get detected, soo it could be like this forever
I would just suggest you to use Kaggle support, they will reply to u TMR prob
Wdym? You don't have a monitor?
yes i do
It doesn't use a regular UI, it uses a Web User interface that opens in your browser
bc it saids on the nvidia website, you can have two gpus
one for ai
and one for any task
I mean you can just use that GPU for ai yeah
Tho you should prob take in consideration that your main GPU is old and your other use cases of this PC other than using this program
me too poor
RTX 2060 could be at most affordable for you
Just contact Kaggle support https://www.kaggle.com/contact
or better, 3060
You don't even need to write an email to contact them for ur phone number not working
They have a 780ti
I mean the upgrade suggestion
What to do if the microphone is not selected in the voice changer
elaborate:
- your pc gpu
- what you want to do
- what tutorial link are you using
- whats your browser
they are deffo using a yt tut 😭
well better to use that support forum
GTX 1050 TI Laptop
replied already in the post forum
hi im frands Meow:3
yo can anyone help me? i have problems with ookada
any problem you need to get help on?
you too, pls explain it
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
Is there a guide to install on linux mint? I was using https://rentry.co/ForkVoiceChangerGuide#linux but it says to use "sudo yum" and yum isnt a command on mint
elaborate:
- what's your pc gpu
- what u want to do
- what tut link are u using
- what is the issue
maybe try apt instead of yum?
I don't use linux
tried that, it just says unable to locate package portaudio
Same, this is my first time really trying it haha
what if u try sudo apt install portaudio19-dev ?
That seems to have worked, I'll continue the guide and see what happens
let me know, just updated the guide
Oh well I guess there wasn't much more to continue for installing, just need to figure out how to open it now haha
Like if I double click it, it generates the pretrain folder and some files, but nothing actually pops up
doesn't it show a terminal window?
it's normal that it generates internal files, but it should show that it's doing that in a terminal window, atleast on windows
maybe wait and see if it opens the browser alone when it's done generating files
Nope, I am kind of getting it though, I opened terminal myself and used chmod +x, then just ran ./MMVCServerSIO now it's doing some stuff
chmod +x may have been unnecessary but oh well
is collab doesnt work anymore?
this one?
/bin/bash: line 1: ./ML_Program: Is a directory
WARNING:pyngrok.process.ngrok:t=2025-04-27T14:45:00+0000 lvl=warn msg="Stopping forwarder" name=http-18888-5b0d435b-886b-4599-abea-a0f13e8760db acceptErr="failed to accept connection: Listener closed"
--------- SERVER STOPPED! ---------
Cool it works now! I got the error "MMVCServerSIO/_internal/libstdc++.so.6: version `GLIBCXX_3.4.32' not found (required by /lib/x86_64-linux-gnu/libjack.so.0)
[PYI-26775:ERROR] Failed to execute" after it finished installing everything, so I went to the _internal folder and renamed libstdc++.so.6 to libstdc++.so.6 .bak, then ran "sudo apt update
sudo apt install libstdc++6 libportaudio2 jackd2"
now it works fine
the original hina mod colab is broken
the one you're using, wokada deiteris fork gets detected, it works only if you pay colab because web uis aren't allowed on the free tier
@tough fiber
what's your pc gpu and what do you want to do?
Does anyone had success with rvc webui on amd gpu using dml
If i use the req-dml.txt i can get dml working but the conversion always gives a error about not finding the source file
Its a wav file i used fine on the cpu version
Windows 10
elaborate:
- whats ur pc gpu
- what u want to do
- what tutorial link are u using
6700xt
Use ai models to make covers and maybe train my own model
As far as tutorial i asked chatgpt
Downloaded the latest build for amd from github
Made virtual environment
Activate it
Installed torch dml
And dml.txt
Made sure to use python 3.10.9 and pip 23.3
As far as tutorial i asked chatgpt
chatgpt doesn't know about RVC
your pc gpu is good enough
but the original RVC(also known as mainline), doesn't have windows amd support
it only has linux amd support
@woeful obsidian what's your OS?
(Operative System)
Windows 10
Thats why i used the directml
Which works on windows
Its not as good as nvidia but much faster than cpu
3070 but on cs2 its pretty bad
the mainline rvc doesn't have windows amd support, only linux with rocm
it's shown in the readme
you need to use applio (rvc fork) with zluda (cuda emulator) https://docs.applio.org/applio/getting-started/installation#amd-gpu-support-windows
AMD support isn't that well for AI, especially on windows where you gotta use an emulator lol
you don't even need to use cloud (remote good pc), your pc gpu is good enough to run it locally
you'd just need to set the graphics to the lowest for games
check the wokada deiteris fork https://rentry.co/forkvoicechangerguide
Im confused
for Nvidia graphics cards
pip install -r requirements.txt
for AMD/Intel graphics cards on Windows (DirectML):
pip install -r requirements-dml.txt
From readme
okay nvm it was in the other readme inside the docs
soo, can you share a screenshot of the error?
we usually suggest applio anyways, because it has better performance and support (generally speaking)
along with an easier User Interface
Anyone know how to run hugging face models?
elaborate
what's your pc gpu? what you want to do exactly? what models are yout alking about?
huggingface is one of the biggest site that contains AI models of all type, RVC, image to video, text to image, etc etc
along with also datasets and other things
Performance as in quality or time to convert
Does it have all the modes of rvc ?
performance as in speed mostly
wdym with all modes? maybe you meant models? yes ofcourse it can run all rvc models
No i meant algorithm i guess
There is like 4 or 5 to choose from in rvc forgot the names
wdym algorithm
it's still rvc, it runs rvc models fine
I mean pitch extraction algorithm
Like pm rmvpe
yes ofcourse it has rmvpe
rmvpe is the best one
the others one are there but aren't suggested at all
model quality depends on the dataset
the fastest f0 is fcpe, it uses 1gb of vram in fp32 mode
but it's a bit unstable
I know
So this applio and all its dependcies have all been scanned and safe?
Im paranoid about that stuff lol
it's safe yes, it's been a fork since like a year and it's open source
the code is public at https://github.com/IAHispano/Applio
yea dont use mainline if you got an amd gpu
Does it do anythinf better than main rvc or just more user friendly
better performance, better amd support via zluda and more user friendly
it's the most suggested one for amd
Ok dope
Thanks
Can I dm you about it
No need to, you can elaborate here or in #1192011222023950368
I already asked you the things you need to elaborate
nick are you a mod
Hello, are you looking for tech help or something related to moderation?
I am a Junior admin, and mod yes
i guess moderation so there is a user who is scamming people from this server
i have screenshots if you need
please dm @vital hedge so it will let you open a ticket on the server and the situation can be seen by the whole staff team
what you mean i have screenshots why is there a process to whole thing lol
i thought i should let you guys know i am not dumb enough to fall for it but maybe someone will you know
sorry but we do this so we can check the situation as a team and have the ticket saved for moderation proof, it takes only 2 minutes don't worry, you don't need to do anything else than just dming the bot and showing the info in the ticket
oh thank you so much nick i did it ticket is generated i hope it resolves soon, thank you again bro ❤
Where do I find the latest and correct version for the voice changer?
When I launch mine now, I goes up to like 30k ms...
I have AMD and Windows 11
What happened ?
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
Thanks a lot!
what's your pc gpu btw
Any idea what a good female voice is for someone with an accent? xD
i'm guessing realtime for calls
great
yeah then just use the 1st link, it's the wokada deiteris fork which is the most optimized and has alot of improvements generally, but even for amd
Yep, I just need to test with a good model now I suppose
be sure to download vac lite at the 3rd step and the wokada deiteris fork amd windows version
My last fork began spiking up to 30k ms, I dont know why
if you had vb audio cable, uninstall it as it causes issues reported by other users
alr, do u may want me to check ur settings?
I would love that
it will be named as "Line 1" in the wokada output options
!give-media-perms 1h @carmine fox
u can share screenshots now
Yep!
This is lite?
yep
u can share a screenshot of the rest of ur wokada settings too
gpu: your amd gpu
extra: 2.7
chunk: 128
set input as your microphone
also, you should be able to use force fp32 mode in advanced settings in amd too, it makes the models more stable and better quality at the cost of some delay, your choice if you want to use this
Yes, perfect!
you could optionally also use https://rentry.co/forkvoicechangerguide#reduce-more-delay for less delay, but you won't be able to use echo & noise suppression and it's a bit more complex
The issue I always had with Okada is how you can just naturally tell the voice isnt real, bugs out too at certain tones. But it could just be my settings or setup? I never heard from someone who is actually good at this
Is there a good, realistic female voice you can recommend? For someone who is going to speak English, but has a slight accent
The settings you're using right now are good, I would suggest you to try models with those settings
if you still have the same issues, it might be the model aren't properly trained, like at a specific pitch range, since those are models trained by the community, there aren't like 'official rvc models' since it's an Open Source project
it would be better you play around with the pitch option and try other models
https://rentry.co/forkvoicechangerguide#voice-models-to-try-out might help you out
or if you want to search them yourself, You can search rvc ai voice models at:
- #1175430844685484042
- In #🔍│find-models , Do /find with @earnest musk
- https://weights.com/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.aihub.gg/essentials/how-to-make-voice-models/
:wave: @low shard, How can I help?
Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image
Thank you so much, I really appreciate the help
you're welcome, need any other help? or is this all solved?
Im actually getting the same issue with this FORK as the last...
ms spiking up to infinity
And it just keeps spiking
It stopped now at 36k ms
36k? The perf value showed only 288
Are you using it with a game too?
Can you close every useless program in background, then put the chunk to 400?
Just remember to set the graphics to the lowest, and if the perf value goes red, put the chunk a bit higher than it
I have nothing open now except my browser and discord
Still the same issue
I never had this issue before
Guess I can try restart my PC
Like, you keep rising the chunk, and then the perf goes even higher and red?
Did restarting fix this? Or just increasing the chunk?
Just increased the chunk
Yeah, I think the issue was related to the perf being red
Be sure that the perf is always green, else increase the chunk
Any other issues?
Should be good thanks! Last question, should I keep all the volumes at default? 100%?
Should be fine, increase them only if you hear yourself too low
Also, monitor is optional, it's only to hear yourself if you set it to your headphones
How do I enable the monitor audio?
Do I just put it to my headset?
Yeah that fixed it
Yup
You're welcome, have a nice day
for the next step after setting up applio and tensorboard
what do i do from here?
nvm
where do i do this?
that's for applio colab not applio kaggle btw
yeahh ik idk how to do this in kaggle
oh wait
nvm
i didnt even realise i pressed the collab one omd
the only issue is idk what to do from here
im trying to downlaod cuda and all that for voice training im on the step where i downlaoded cudnn and powershell is aking my to find the path, but im like so lost and cant find any of the files i donwlaoded it, would anyone be able to help me
im doing batch right? since i got multiple files?
lemme guess, you're following a youtube video tutorial?
wdym
nvm i found the guide
basically im here right
nope
elaborate:
- your pc gpu
- what you want to do
- what tutorial link are you following
oh yeah batch inference is just inferencing multiple files, you just need to put the folder that contails all files
be aware that it's not much suggested since you might need to play with the pitch on some specific files
hey! Im using APPLIO COLLAB and my dataset is 25 min, and when i try to pretrain it says the track lenght is 00:00
do i need to cut my wav. into 2 parts or what should i do?
elaborate:
- your pc gpu
- what you want to do
- what link are you using
- a screenshot of the error
oh-
do you need any help
when you say i need to put it in the folder what folder do i put it in?
i have the files on my pc
and i copied the path for it but its not appearing in the voice model
the input folder is this, go there in the file url
i figuired it out and its now dowlnading python but when it started doing thing it moved my cudnn file away from my file i want my rvc in, is that normal, do i need to move it back?
pls elaborate #✨│ai-help message
or does it need to be in kaggle im guessing?
no, it's on cloud, it's connected to its own pc files not your pc at all
ah shit
you can either upload the files via the UI or use the file url to put the files in that folder
how do i uppload this to kaggle?
how do i use the file url
this>?
it should be given in the kaggle output near to the applio url
yes
yeah i just found it lmao
Nvidia geforce rtx 3080 ti
I want to make a voice trainer, using RVC-Project
/
Retrieval-based-Voice-Conversion-WebUI
and ive been follwing just asking questions in chat gpt, and i downloaded cudnn off of nvidia and by using powershell and followed some prompts, and its currently downlaoding python now
audio extension which can be .wav, .mp3, etc
its not showing up for some reason
Nvidia geforce rtx 3080 ti
great
I want to make a voice trainer,
I guess training a model
chat gpt
chatgpt doesn't know about RVC
You have been trying to use the original RVC (aka mainline), which isn't much suggested to Applio (an RVC fork, modified version)
it's better you uninstall everything you did right now, and follow our docs guide for applio https://docs.aihub.gg/rvc/local/applio/
Last update: Apr 01, 2024
windows
trying to create and train a new model in Applio Collab
the error says:
Preprocess completed in 0.01 seconds on 00:00:00 seconds of audio.
An error occurred extracting the index: need at least one array to concatenate
If you are running this code in a virtual environment, make sure you have enough GPU available to generate the Index file
No wav file found.
WARNING: All log messages before absl::InitializeLog() is called are written to STDERR
E0000 00:00:1745779166.624992 3045 cuda_dnn.cc:8310] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered
E0000 00:00:1745779166.631806 3045 cuda_blas.cc:1418] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered
/usr/local/lib/python3.11/dist-packages/torch/utils/data/dataloader.py:558: UserWarning: This DataLoader will create 4 worker processes in total. Our suggested max number of worker in current system is 2, which is smaller than what this DataLoader is going to create. Please be aware that excessive worker creation might get DataLoader running slow or even freeze, lower the worker number to avoid potential slowness/freeze if necessary.
warnings.warn(_create_warning_msg(
Not enough data present in the training set. Perhaps you forgot to slice the audio files in preprocess?
An error occurred extracting the index: need at least one array to concatenate
windows
That's your OS, Operative System, not your pc gpu
You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
Colab is cloud (remote good pc), it's meant only for people with a bad pc
so this will allow me to train a voice by me speaking my normal voice and evntually no matter what i say it will sound like what im trying to mimic?
what? you can't train an ai model to work perfectly specifically for your voice
the ai training will work based on the dataset you give it, how good is the dataset, and how you monitor the training with the tensorboard
Also, RVC (and Applio) is for training and inference (use models) on pre-recorded audios, you should check wokada deiteris fork if you want realtime inference for calls or games (along with Applio to train the voice)
i think that what i ment, using a model and then traing it for me specifically
no, you can't train a model for it to be perfect only for your voice
you can train a model based on the voice, not to work specifically with a voice
if you just found a model randomly online and try to 'train it' to work with your voice, that's impossible
just find another model, not all models are super high quality, or try to play with the pitch
@sharp lava are u trying to train a model specifically to work with your voice because the realtime voice changer isn't working well? may u want me to check your settings too?
then what is RVC-Project, is there no difference
Appilio and RVC-Project what is the differnece
Applio is a fork (modified version) which has better performance and easier to use
there is no single way in any program to train a model specifically to work with your voice
but, you can train a model based on a voice (like peter griffin for example)
yeah thats what i mean i dont mean my voice, i just mean like when i talk and train a model itll sound like the model, peter griffern for example would be trained from peter griffen clips and all that
i dont mean my voice specically
you don't just talk into the program and magically it trains itself
yeah i know that
you need to make a dataset based on the voice, clean it to be perfect, make sure it's only vocals, check the sample rate, set the correct training settings, then start training and monitorage it with the tensorboard
so yeah, if you're looking for this, this is how RVC training works, and it's the best you use Applio for best performance
got it
can you also use Appilio as the voice changer it self, or is it just for training, I currently use W-okada
no, not as realtime voice changer
it's just for training and pre-recorded audios inference
also, are you using wokada deiteris fork?
share a screenshot of ur wokada
!give-media-perms 1h @sharp lava
What i need to put it in the F0 Det
elaborate:
- your pc gpu
- what you want to do
- what tutorial link are you using
- whats the error
NVIDIA RTX 4090
VOICE CHANGER
you didn't elaborate everything
reply to each thing I asked
@brittle wing do you remember what tutorial link did you use? if not, send a screenshot of the program
Quick question when I want to use Applio do I need to do run all of the cells
!give-media-perms 1h @brittle wing
for kaggle? no just the start ui (last) one
for training? as long as it's good quality
Yeah I believe so
And would u know anything abt extracting game audio from a game file?
hey, do u know why i can hear ppl in the game thru my owakada ?
like its echoing in the game sometimes
and than ppl can hear themselves thru my mic
share a screenshot of ur wokada
u should check the game modding community
Okay
set output to line 1
set in sens a bit more to the right
if it still persists, you will need to either:
- lower your headset volume and put ur mic away
- use client and use echo sup2
output line 1 means i cant hear the changed voice right?
yes, because for that you need to set monitor to ur headphones
could u pls explain what monitor means?^^
ohh perfect it worked now after i changed my monitor to my ehadphones ❤️
oh and where do i finde the turorial for the settings?
Hello, I am looking for an RVC file in Bulgarian.
it's basically for monitorage of the voice conversion
so you can both use it to the program and both hear it urself at the same time
you were the guy who had an rtx 4070
well wdym rvc file in bulgarian? maybe an rvc model in bulgarian?
oh but like what diffrent does it make if i put monitoring to my headphones or lite
shouldn't have a difference
it's just meant as a 2nd output to hear yourself at the same time while using the model
but i cans till ehar myself
i mean atleast the changed voice
but the echoing is fixed i think its a bug sometimes
i think that's just because of echo in your headphones, you're not supposed to hear urself if u had output to line 1 and monitor to none
my settings before was output my headphones and monitorm was line 1 i learned that from a youtube turoial back than
you should forget about what you see on youtube for wokada
there's people who suggest even crepe_tiny or harvest on youtube for wokada 😭
yes
btw where do i found the turoial in the server or do i always have to look into this chat?
just right above the support channels
Last update: Oct 21, 2024
ty very much
yw
what app shoudl i use for the voice changer
Im having troubles with the Real time voice changer, it keeps buffering and not play the full audio just parts of it how can I fix this. I have a AMD RX 6600 and I want to use the audio in discord or any other games.
yes I am the same I am looking for a file in Bulgarian
rvc models are not language specific
so i just uploaded the audio to the online file link how do i get the link and put it into the applio input place
how do i paste the linkkk
what colab, what link, what are you trying to do?
Applio on kaggle
So I got Applio all set up
And I need to fill in the input box with a link
I got the Audio on a folder here
But idk how to copy it and paste it into the input
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
It will be fine except for intensive games
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
1st link
Download vac lite at the 3rd step, then the wokada deiteris fork Nvidia windows version
is there any video too installing this?
what are you trying to do? run inference? train a model?
batch inference is only if you're able to upload multiple files / local install
easiest way to upload files using UI to colab/kaggle is to go to the training tab and make a dataset
you'll get /something/something/assets/foldername -- that's what you can use in batch inference
No, video tutorials are outdated
I'm trying to install Zluda and I got a fatal error on my cmd prompt
More specifically "fatal: too many arguments"
show a screenshot
of what you're trying to do and the error
I'm trying to do the setup steps on the github page
That's directly copied from github
i'll let the maintainer know
no, that's message from attempted uninstall
How do i use kaggle for training models
Can we use a longer reference audio for a more accurate voice cloning in Zluda
-colab
Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
by IA Hispano
Google Colab
by Hina
Google Colab
by Eddy
Google Colab
by Eddy
Google Colab
by Deiteris & Hina
Google Colab
by Shiro & Eddy
Google Colab
by Nick088
Google Colab
by Nick088
Google Colab
by Jarredou & Makidanye
Google Colab
Can Zluda use the amd gpu instead of the cpu?
do you know what zluda is supposed to be for?
Mb I mean fish speech using zluda
Ngl someone just recommended it to me, and I've been taking shot after shot in the dark to try and make it work
I have 0 idea on what I'm doing
it does
if you haven't, you can search the documentation for that
I tried making a voice clone and have it said 3 words and it took almost 4 minutes, is it possible for it to be faster?
it is compiling the kernels, gonna take 20 min 1st time
Wait so before I proceed with this, I can use this for Realtime tts right?
Kinda like an ai assistant?
Are you talking about something in the #1159513888199540817 ?
Actually this is the whole thing... Idrk if there's an issue but I think it's worth asking at least
not sure why it says cuda not available
you probably did not start it properly
So apart from "Cuda not available" everything is fine?
yeah
So I did relaunch it, I just clicked on the fsz batch file in the folder
Is there another way of opening it?
no
if that fails it means you not do the preliminary stuff - hip sdk, adding it to the path
I did do that tho
Probably did it wrong then...
I think I did it correctly
Do I have to do step 6 even if my GPU is the 7900xtx? Could that be the issue?
that looks fine
no, 7900xtx is supported by default without hacks
Then idk the issue then
which python version do you have installed?
run cmd.exe, from it run where python
okay, that's fine too
yes mine is 49mb but it can't be importyed
This is what my Microsoft Visual installer looks like when I launch it, I tried repairing and reset my pc still the same issue
let me just install it myself
it seems there's a missing line in install-amd.bat
python -m pip install torch==2.3.1 torchvision==0.18.1 torchaudio==2.3.1 --upgrade --index-url https://download.pytorch.org/whl/cu118
that is the pth file alone, with index file included in the zip it can be more than 500 MB
@elfin nebula it should be after pip uninstall torch torchvision torchaudio -y --quiet
you'd have to reupload each of the pth and index file in the space
Wait so what should I do?
Do I run that in cmd?
no, edit the install-amd.bat
add the line I mentione after that line
delete venv
then re-run install-amd.bat
Okay this is where I get confused. Where do I edit that?
What does An error occurred connecting to Discord: Could not find Discord installed and running on this machine. mean
the cloud environment doesn't have discord within it
mostly only in local system with discord being run
open .bat file in notepad
insert the line, save
Ah gotcha.
Malware
I can't find the .bat file
you can always just nuke the folder and start over
yes
Bro how do i access the gradio with ngrok
Seemed to work but it's using the wrong gpu
Or is it saying that it's disabling that gpu
Okay where do I insert that?
in either system or user environment variables
or in fsz.bat
set HIP_VISIBLE_DEVICES="0"
Seemed to work in fsz
How do I "train" the voice model for it to mimic as accurate as possible?
Dose anyone know how to install rvc i forgot
Okay
Which RVC? Applio or W-Okada?
Does this model sound
Run interface first
But I understand wym
Would it be good
If I combine all of my
Audio files?
in applio where do i put the pth and index files
can someone help me
why does the llm break starting from the second input message
it starts spouting nonsense
import os
import logging
from fastapi import FastAPI
from pydantic import BaseModel
from transformers import AutoProcessor, LlavaForConditionalGeneration, TextIteratorStreamer, BitsAndBytesConfig
import torch
import asyncio
from collections import deque
# FastAPI app setup
app = FastAPI()
# Define the incoming request schema
class SpeechInput(BaseModel):
speaker: str
text: str
# Setup logging
logging.basicConfig(level=logging.DEBUG, format="%(asctime)s [%(levelname)s] %(message)s")
# Constants
MODEL_ID = os.getenv("LLAVA_MODEL_ID", "CreitinGameplays/pixtral-1")
QUANT_CONFIG = BitsAndBytesConfig(load_in_8bit=True)
HISTORY_LEN = 20
# Initialize model and processor
logging.debug("Loading model and processor...")
try:
model = LlavaForConditionalGeneration.from_pretrained(
MODEL_ID,
quantization_config=QUANT_CONFIG,
device_map="auto"
)
processor = AutoProcessor.from_pretrained(MODEL_ID)
processor.patch_size = model.config.vision_config.patch_size
processor.vision_feature_select_strategy = model.config.vision_feature_select_strategy
logging.debug("Model and processor loaded successfully.")
except Exception as e:
logging.exception("Failed to load model and processor.")
raise e
# History buffer for chat
chat_history = deque(maxlen=HISTORY_LEN)
# Function to generate a response asynchronously
async def generate_response_from_llm(text: str):
"""
Generates a response from the LLM using the provided text input.
"""
logging.debug(f"Generating response for: {text}")
# Define the system prompt
system_prompt = "You are Pixel, a cute little AI girl. You livestream sometimes and change moods occasionally. Your responses are rather short and never longer than two sentences."
# Prepare input for the model
conversation = [{"role": "system", "content": system_prompt}]
conversation.append({"role": "user", "content": text})
conversation.extend(chat_history) # Add history if necessary
prompt = processor.apply_chat_template(conversation, tokenize=False)
inputs = processor(text=prompt, return_tensors="pt").to(model.device)
for k, v in inputs.items():
if isinstance(v, torch.Tensor) and v.is_floating_point():
inputs[k] = v.half()
# Generate a response from the model using async IO to offload blocking tasks
streamer = TextIteratorStreamer(processor.tokenizer, skip_prompt=True, skip_special_tokens=True)
generation_kwargs = {
**inputs,
"streamer": streamer,
"max_new_tokens": 512,
"do_sample": True,
"temperature": 0.7,
"repetition_penalty": 1.1,
"top_p": 0.95,
"top_k": 50
}
try:
# Perform inference asynchronously
response = await asyncio.to_thread(model.generate, **generation_kwargs)
# Process the response from the streamer
response_text = "".join([new_text for new_text in streamer])
logging.debug(f"LLM response: {response_text}")
# Append model response to history
chat_history.append({"role": "assistant", "content": response_text})
return response_text
except Exception as e:
logging.error(f"Error generating response: {e}")
return "Sorry, I couldn't generate a response."
@app.post("/process_speech/")
async def process_speech(input: SpeechInput):
"""
Endpoint to receive speech input, generate a response from the LLM, and return the response.
"""
logging.info(f"Received speech input: {input.speaker} - {input.text}")
try:
# Generate a response from the LLM
response_text = await generate_response_from_llm(input.text)
logging.info(f"Generated response: {response_text}")
except Exception as e:
logging.error(f"Error generating response: {e}")
response_text = "Sorry, there was an error generating a response."
return {"status": "success", "response": response_text}```
why is this always flagged?
I recall another helper staff showed workaround for apple quarantine thing, try searching it
but its infected file
make sure you don't have some other viruses in your system and try redownloading it
then do the workaround on apple quarantine
yea but my pc never had this issue or pop up but did as soon as i downloaded so i dont have a virus otherwise.
false positive, the app source code is open and public to see at https://github.com/deiteris/voice-changer
that’s not apple quarantine, macos doesn’t run .exe
that sus antivirus tryna be like apple 
How?
https://rentry.co/forkvoicechangerguide#mac apple quarantine would say that pytorch is damaged
you have a shitty antivirus then
how often does the voice changer get updated?
@knotty moth hai I'm trying to make a Neco Arc song cover but huggingface only let me make one song & it wasn't the voice model I wanted. Do u know how I can make more covers?
This site looks so complicated I don't know what it's talking about when I go to pricing
Ok I duplicated to generate privately but now it's saying I need to upgrade Gradio SDK & IDK where the options at
heya! i was just wondering if you can make a voice model just from a laugh or is it not possible?
anyone know this problem
not possible
What program are u trying to run? And what's ur PC GPU? NVIDIA 50 serie ig
seems more like possibly unsupported old gpu
- unsupported gpu or 2) trying to infer 1hr+ long file
Selam
I got issus on google colab its not downloading the files it says it does not work on the py Version while downloading the Modules after that i tierd using ngrok it didnt even install the pyngronk Module cuz it does not work ?
RVC voice model can be trained on any audio. But with just laughing track, the voice model result would be just laughing sounds.
thanks for telling!😊
how do i stop iy?
nvm
@low shard i managed to increase my audio from 12mins to 38 mins is that fine there may be repeats but idk
repeats wouldn't really help the training
repeating the dataset wont help
it's better you get as much high quality, non repetitive audio as you canr
heya! how can i make this laugh into a voice model? idk i'm just bored https://voicemod.link/item-asylu-e903-tn
ohhhh okay
what if i want it to be low quality for fun?
that was actually a reply for the other user lol
well, you can just get as much shitty data you want then follow https://docs.aihub.gg to train that laugh model
Last update: Oct 21, 2024
trust me it wouldn't be the craziest, we had models made out of toothbrushes and planet sounds 😭
thank you and i'm sorry for that message i didn't know🙏
it's fine dw, you're welcome
hey uh do i need ultimate vocal remover for this?
that's for spearating vocals and instrumentals, i don't think that laughing is in a song, so i don't think so
btw don't expect it to sound good, it will mostly be just a 'shitpost model'
okie i think i got this now...
and also i wanted to do this since someone made the minecraft villager just from sounds and it kinda sounded like he was speaking english
but ik that the laugh is just going to be goofy but i still wanna try!
for fun
cuz i'm bored
tbh the minecraft villager model still makes more sense than some sans or spamton model
fr
i still wonder how it sounds english even tho it's just sounds and the punching noise..
you can even try finding some cat model
cat model?
okie!
Oh
Oh okay so that means I prolly will get 28 mins
yea rtx 50 series
didnt nvidia updated cuda drivers for 50 series?
months passed and rvc cant work on 50 series still thats suprised
RTX 50 series need cuda 12.8 which doesnt work on torch 2.6/older
Applio? if so u should follow these steps
#✨│ai-help message
w-okada
voice changer
optimized version
so on voice changer theres no way to work?
oh, in that case
there is a version exclusively for 50 series
Can I send you a DM? 
i was looking for someone with an 50 series GPU
recruit them 🔥
UVR5 UI update will be tomorrow, finally i got the korean translation but I'll add more stuff
Nicee
Like... 50-series GPU detection and installation of the required pytorch version, and perhaps a more robust way to update UVR5 UI
yeah that's nice
will help the few people who actually bought the 50 serie for using it rather than scalping
you can just go ahead and use torch2.7.0 cu128
it's been officially released after all
hey guys, is it possible to run gpu intensive games with voice changer? if so how? i heard that you can use dual GPU but is there any other way?
what's ur pc gpu
did u use the 50 serie version
nvidia 2060 super rtx
yea this things happens on 50 series version
i think its cuz of server use instead client thats why
got the same problem too
switch client instead server
alright i have 50 series gpu and this error is appearing
object has no attribute 'vc_pipeline'
yeah I did thanks but this error above is spamming the cmd prompt
File "/usr/local/lib/python3.11/dist-packages/torch/utils/data/dataloader.py", line 631, in next
data = self._next_data()
^^^^^^^^^^^^^^^^^
File "/usr/local/lib/python3.11/dist-packages/torch/utils/data/dataloader.py", line 1326, in _next_data
return self._process_data(data)
^^^^^^^^^^^^^^^^^^^^^^^^
File "/usr/local/lib/python3.11/dist-packages/torch/utils/data/dataloader.py", line 1372, in _process_data
data.reraise()
File "/usr/local/lib/python3.11/dist-packages/torch/_utils.py", line 705, in reraise
raise exception
FileNotFoundError: Caught FileNotFoundError in DataLoader worker process 1.
Original Traceback (most recent call last):
File "/usr/local/lib/python3.11/dist-packages/torch/utils/data/_utils/worker.py", line 308, in _worker_loop
data = fetcher.fetch(index) # type: ignore[possibly-undefined]
^^^^^^^^^^^^^^^^^^^^
File "/usr/local/lib/python3.11/dist-packages/torch/utils/data/_utils/fetch.py", line 51, in fetch
data = [self.dataset[idx] for idx in possibly_batched_index]
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/usr/local/lib/python3.11/dist-packages/torch/utils/data/_utils/fetch.py", line 51, in <listcomp>
data = [self.dataset[idx] for idx in possibly_batched_index]
~~~~~~~~~~~~^^^^^
An error occurred extracting the index: [Errno 2] No such file or directory: '/content/Applio/logs/JONY/extracted'
If you are running this code in a virtual environment, make sure you have enough GPU available to generate the Index file.
forgot to run extract features step?
It is resume training
restore backup?
Everything was fine until 13 april (last backup time). Today i see this. If I not mistaken such folder not been
Yes I repeated the same steps
go to here
and see what it restored
you need to have the google drive mounted when you do the backup restore
you can also check your files in the google drive
really?) what was wrong with "v2_extracted"?
since we dont do v1 training in Applio, the folder has been renamed
also may need to edit filelist.txt as well
actually it is strange, it should be using the filelist with whatever the filename is there
Yes now it shows error related to missing files in v2_extracted ))). I need to download filelist and use VS Code to replace everything
ah
I see
it was the mute files
make sure both mute and your model have the folder named extracted
and then change all v2_extracted in the filelist to extracted
mute not included to backup. It created automatically. I just need to change backup and reset runtime
Is there a difference between Okada and RVC Webui?
in terms of realtime yeah
tell your pc gpu and what you want to do
Can someone pls send a link to a covers inference?
what is the difference between w-okada and deiteris' optimized W-Okada RealTime Voice Changer Client (Fork)
i have rtx 4060 laptop, just want to do high quality RVC
Oh, i see, overall just better rounded fork. Will try it out
@low shard ngl i have no idea on what im doing
its not even letting me convert
@simple ore
why did you not select a voice model at the top?
It wasn’t giving me an option to do so I refreshed asw
Unless there’s something else I need to do
is it possible to make the voices sound more excited when I make them?
Just using weights website
Yes my model has been rejected i need to put all the files
If i train longer my model might be overtrained
I think this contains malware
wrong
nxdomain means the website doesnt exist
hi
I understand ngrok but when i load up the site it says this site can't be reacged
If the "website is malicious", it would throw up a red warning instead of a sad paper.
correct, the site cant be reached because it doesnt exist
imagine if you tried to watch a video on youtube and youtube doesnt exist

classic 404 moment
you might have entered incorrect, nonexistent url, or the ngrok session run through the colab/kaggle cell might have aborted with some error messages
cant be reached = malware 
and any kind of big corps telemetry/backdoors are not malware /s
Pls bring me back my model maker after you've approved a model that meets the requirements
model makers could have their role revoked if fail to meet the QC standard
why was submit-model replaced with model-maker-role
Where is the QC standard?
So if i train a longer dataset my model will still sound robotic and muddy
more likely overly denoising problem
Where do i read the QC standard
whats for the voice models
Last update: October 20, 2024
Anyone got a good recommendation for a decent voice changer that doesn’t stutter? I tried one, I forgot the name of it.
Specs of my pc is
RTX 4070 TI
And an I7
MMVC i think is the one
Have you used it?
Realtime right?
For calls/games?
If so
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
1st link Wokada deiteris fork
How can I use these model voices?
What's your PC GPU and do you want to use them realtime for calls?
@low shard is there a way to solve this in deiteris fork "RuntimeError: Failed to find NodeArg with name: skip_head in the def list"
Please elaborate a bit more, what did you do and what's your PC GPU
i press "start" and this error happen. I have rtx 4060 laptop, trying to run RVC model
okay, i reuploaded the model and it's gone
@low shard so I think I’m currently using Wokada, I’ll have to show you the interface. But sometimes it’ll glitch out and sound very robotic. Or it’ll basically stutter and repeat itself. It’s another real Time voice changer.
are you using the wokada deiteris fork from the guide i sent you?
if u saw a video tutorials, those use an old version of original wokada with vb audio cable, and it’s not good
Ahhh yes. I use VB audio cable. I’ll check your guide out in a moment! Thank you
I’ll definitely have to try that one out because the one that I currently use is not good whatsoever. I mean, it works fine sometimes but then other times it just glitches out or stutters
vb audio cable gives random issues, and the old original wokada has worse performance
Yeah, I’ll definitely check out the one that you sent me then. I just don’t like having replay issues,. Thank you, Nick.
yw and lmk
Well, it’s between either the voice changer or doing vocal training, correction, continuing vocal training
Well yeah, try the program with the best settings, if it still sounds bad, it’s a model issue
tho this will solve performance issues
I will definitely try it, because am all for trying out different good voice changers so I will let you know!! thank you
Yeah
I have RTX 4050
I want to use it in realtime calling
whats this ?
Yo could someone help me get the ai software for the voice changer, because might get hub looks alot different to the vids
@acoustic scarab
literally 2 messages up
I'm a bit dyslexic so I was trying to ask if someone could like call and help
Because I was trying to get it for a video
@simple ore do I install the 5000 series one is i got an rtx4070super because that's the only one I see for windows
Or the download for nvideo gpu and windows
<5000 series - you download the regular
read what it says
"NVIDIA RTX-5000 series, the newest release of GPU's, require a separate download. You do not need it if you have an older GPU, follow the normal Nvidia link in that case
https://github.com/IllIlIlIllIl/voice-changer/releases/tag/b2335"
It's a bit hard when I'm dyslexic
I read stuff then I read another and then it gets confusing
Alr appreciate the help btw
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
1st link Wokada deiteris fork
do you know any way to not have like the robotic like sound? im spanish and almost all models in spanish sounds robotic, if i speak in english they sound better
show a screenshot of ur wokada
guys will people hear me like how i sound in discord mic test or no i’m really confused
because i test it in voice changer and the discord mic test both with the voice changer working and it sounds more realistic and cleaner in discord mic test
show a screenshot of ur wokada and discord settings
i’m not home i can do it when i’m home
@ruby basalt
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
???
https://huggingface.co/wok000/vcclient000/tree/main
which one do i download? i just reset my pc and cant remember
hi guys, im trying to train a voice model but it keeps saying no files found even when i uploaded like a 5 minute sample of the og
can anyone help?
oh its disabled
apparently
Your settings are good, If it sounds bad it's just because of the model
You don't, this is the original wokada, which is worse from quality compared to the new Wokada deiteris fork, video tutorials are outdated since a while
What's your PC GPU?
oh okay, so will people hear me like they do in discord mic test and what do i put my discord output and input to so i can use the voice changer in call
yes, set discord input as line 1 and output as ur headphones
A VAC (Virtual Audio Cable) makes a fake audio device, used to re-route the audio of different programs
In Wokada context, it's used to get the output of wokada as the input in other programs
so line 1 takes the wokada output and puts it as input in discord
ohhh
why when i'm using vcclient the voice lags so much? i have a rx 6700 xt and i followed this guide
where can i train voice model
-colab
Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
by IA Hispano
Google Colab
by Hina
Google Colab
by Eddy
Google Colab
by Eddy
Google Colab
by Deiteris & Hina
Google Colab
by Shiro & Eddy
Google Colab
by Nick088
Google Colab
by Nick088
Google Colab
by Jarredou & Makidanye
Google Colab
Should I get the Applio app or just stay on the one I already have
It'd be nice if you gave info like, what you already have, what your usecase is, yada.
If youre replying to the forkvoicechangerguide link and saying "vcclient" then you definitely didnt follow the guide, those are 2 different things
Download the amd version from the link that i sent and make sure to do everything it says
Especially changing gpu to cpu to gpu after changing chunk or extra
Use recommended chunk and extra too
I don't know how you followed the guide. But the "fork" W-Okada DirectML should supposedly look like this, not the one with too many files in one folder.
Also, make sure you didn't follow any tutorial video on YouTube before this one.
Please specify on which reason you should stay for Applio.
This link is the Hugging Face repo for original W-Okada versions, which are all outdated. What is your PC GPU? There's a better one to download.
How do you get a more accurate voice model with fish speech? I don't understand the guide in fish speech. Which words do I use for it to be more efficient. I was planning to just slam a 2 hour recording. But I don't think it'll be necessary
You have a model maker role, so I think you know where to train voice model. 
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
Which RVC program/website are you trying to use? Is it Weights? Because Weights often limits most features for free users.
If it's me you're asking, it's Fish Speech by Zluda
I wasn't asking you. I was asking another user.
Hello guys
I was making a jjk edit i wanted the voice of the narrator from hakari voice lines
i am really really new to this
ai voice stuff
can someone help
Alr so I don't know if this is a channel I can ask for help in with this topic.
So I'm trynna figure out what ai voice was used in this
I've looked through the internet and haven't found a tts like that
Hi, can anyone please give me a guide how to download the voice changer?
using the applio for training a model, I read the guide but I didn't understand if the dataset has to be a 10-30 minutes audio or several 10-15 second audios?
realtime for calls? What’s ur pc gpu?
10-30 minutes are suggested
@gusty granite actually let’s talk in https://discord.com/channels/1159260121998827560/1367065669862031501
Just a short question.
Is deiteris' optimized W-Okada still the best in terms of performance and quality (if the model is good)?
Or is there new stuff out there (maybe paid?).
Didn't checked for a long time but was interested if there is any new bigger steps with RVC things.
I'm mostly interested if the pitch situations are still a thing or if this got better?
!give-media-perms 2h @stoic aspen
- [Error] Giving <@&1313370885650124821> to @stoic aspen for 2h

