#✨│ai-help
1 messages · Page 201 of 1
Sure, it plays but it doesn't matter, if there's any sort of corruption, shit can happen
Where did you get it from, what did you do with it, what program u used for exporting etc
Recorded it in audacity
try re-export the audio using audacity, or another application without including metadata
^ reopen it and export again
How do i excport without metadata
I tried re exporting
to multiple different formwats
makes no sense
uncheck "include markers and other metadata"
funny enough, that never caused issues on my end
( typically export such in rx for archival purposes
Is there an audacity equivalent? I didn't record it in audition
or
do you want me to import it into audition
then export without metadata???
izotope RX: uncheck "preserve non-audio data"
I have no idea what izotope is
Im using Audacity
audacity should be able to open files without metadata, so the last thing is to make sure it is a valid stereo/mono file and not corrupted
No i mean like
I
recorded the file initially in audacity
so do you want me to export it normally from audacity, put it in audition, then export without metadata?
... whatever you use, doesn't matter, just exclude the metadata
even audacity has it
" edit metadata "
how do you get that export screen?
Whenever I hit export it just sends me to the basic save file
export audio / sound depending on your release
or ctrl+shift+e
( and keep it mono. )
??
erhm, then update your audacity
Alrighty
no idea which version u use
oh there we go
alright its workinbg im updated now
ok so
Mono
then export without metadata?
I dont see anything metadata related
in that case, just export
alr
and retry in applio
use another working audio file and paste your recording there
Paste it, where?
ctrl+C your recording, close it then open the working audio, ctrl+A, del, ctrl+V
^
Oh, you just want me to combine both into one file?
no
you use file A as a dummy file / " container " for your target audio, File B
or treat it as a bucket, whatever
So, you want me to grab my recording, copy it, go to my working audio, and replace it with my recording?
yea
It says the audio splits into 1 chunk
time to update your applio to 3.2.8 bugfix
no, yours is 3.2.6
wtf???
oh
no
the
folder name is 3.2.6
but
I extracted the 3.2.8 bugfix into that folder
I just forgot to change the folder name
Huh, thought I am the only one to write msgs like that
Another thing, when I added something to the file
it split into 4 chunks
and then after 3 chunks converted it gave the CUDNN error
how about:
- download the fresh 3.2.8 bugfix
- update your nvidia driver
i already got the bug fix
and i already updated my driver a couple hours ago
Alright yeah im pretty sure the main issue here is that the audio splitter isn't cutting it into enough chunks
so when it tries to convert, the files too big
and it gives an error
well then, that must mean the audio's specific enough so the slicer hates it ~ To put it simply
Does it not have any silence parts?
test it on few chunks ( cut em from ur audio ) 3, 5, 8 mins
See what clicks, manually cut it into chunks ur gpu handles and once you get all infers, concatenate 'em and call it a day
once again, your file has no spot where it can be safely split at a silent spot
so it fails
it could be a file with a noticeable background noise or music playing
I bet he had recorded in a noisy environment or bad mic without denoising 💀
Hi. I've written a script for a short story in the witcher universe, and I'm voicing it with voices from the game. I found a repository with almost all the voices for so-vits-svc, but either the models are not trained well enough, or the neuronics itself is outdated and not good enough, often the voice gets computerized or has pronunciation problems. Perhaps someone can tell me if there is a neuron that helps to remove the computerized voice, or in other words, a neuron that kind of makes the voice more natural, so that for example you can generate some replica, and then pass it through this neuron that would improve it, something like “the final touch”? Or maybe there are suitable models for something more advanced, with voices from the game and I just can't find them?

So vits SVC is old
Use RVC v2
RVC guys discover SOLA to chunk audio and achieve infinite output in realtime at small cost, while the same RVC guys use silence to split audio in chunks that's never guaranteed and get OOM during infer. Now I've seen everything
Wasn't it supposed to split chunks simply by time? I think I've seen infer code in rvc mainline, and they pre-calculated estimated vram usage and just chunked the audio by specified amount of time
😭

it is funny that the pipeline itself splits audio into small chunks
but no, it does not do audio splititng before pipeline
Ah, no, it does preprocessing for training, but not for infer
Hecking lmao
Preprocessing interval (and padding) is set in config here based on VRAM like I said
https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/blob/7ef19867780cf703841ebafb565a4e47d1ea86ff/configs/config.py#L167
But it's used only here and used for 1-click training, and not sure about padding
https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/blob/7ef19867780cf703841ebafb565a4e47d1ea86ff/infer-web.py#L231
https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/blob/7ef19867780cf703841ebafb565a4e47d1ea86ff/infer-web.py#L743
Anyway, whatever... It's a huge mess I don't even want to look into
i tried to add model to RVC GUI but it didnt work for me can someone tell how to do it?
I was not sure what that was about, applio still has (if fp16 chunk = 3.6, if fp32 chunk =3.0
oops.. other way around
per = 3.0 if config.is_half else 3.7
weird
but in config.py preprocess_target_version = "3.7" if precision == "fp16" else "3.0"
can anyone give me a link to a colab where i can train models?
Train (make) RVC Models on cloud:
- Prepare the Dataset
- Setup RVC:
Choose a cloud way to use RVC,
- Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
- Applio (ui)
- Mainline (UI)
- RVCDISCONNECTED (no ui)
- Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
- Mainline (UI)
- Applio by Vidal (UI)
- Applio by Shirou (UI, no guide as of right now)
- Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
- Be sure to know about the tensorboard
Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time
If you are looking for the easiest way and for free, is using https://weights.gg which ofc uses RVC
RVC Inference (use models) on pre-recorded audio on Cloud
You can use either:
- Weights.gg: Easiest Possible Ever Automatic
- Ilaria RVC Zero: Fastest free on cloud
- Applio (ui)
However, u might wanna check ur pc gpu first in case it's good enough
as cloud is only for bad pc
that was what i meant @low shard , its been like that for almost 2 hours , this just happened suddenly
You have 2 training items, while I have like 30 training items. 
weird, have you tried refreshing?
Just, give me more prenium trainings.
yeah
Weights works like normal, even on my slowest laptop. Have you ever tried to refresh this page?
i did but nothing new
works all fine for me, did you check if this issue persists on other devices? might be ur WiFi
or where are u from ?
With no premium mode, you can use a premium item to train a voice model.
I only have last 2.
id rather not say since i do want private info to stay private , but im sure this just happened today , if the issue was in my wifi , any other site wouldv been also like that but i will check again on other devices
I'm not sure why would you wanna train 69 models there at once. I only train a voice model on Weights as a draft voice model.
id rather not say since i do want private info to stay private
Understood, I meant the staty like "USA" or "Italy" rather than anything more private ofc
I just asked since it seems it was blocked in South Korea and would be weird if it was blocked in other states
Send your Weights link to #1159290752195633273, not here.
I already have my own Weights account, so unfortunate. I started from the start. 
alright so it does load on a phone , even recognized my account
but weird thing it didnt on my pc , ig id have to switch to the phone
thanks for help
Oh, i having my phone.
yw, @oak plank u might wanna check on that since weights.gg doesn't seem to load on his PC but does on mobile
vpn?
Try clearing cache and loading it again
@eager cliff if u could tell this
idk u should ask charles
Most of the times it works
nope
lemme try
Okayyy
either ISP or browser issue
prod yeah
RVC
what program should I use to use RVC
elaborate more on ur pc gpu and what u need
a real time one and I got an RTX3060
for realtime u should use Wokada, especially the fork
lemme explain u in #🔍│help-w-okada
alright iv got it fixed , google chrome really sucks sometimes
Use opera gx, easy solution. Chrome is trash
I trained in weight model and the truth is that it was bad, on top of that it only gives the PTH, it doesn't add the index. Training in weight is horrible.
thats what i acually did
Oh yea. Gjob, you're a W
🙏
So much they said that weight is good, it is literally horrible, updates are needed and in general everything
In what sense?
( asking as I do not use it so, don't know what's wrong / bad etc with it )
@bleak pasture
Might pass the message to some proper people
Everything is wrong, everything does not generate good covers, the voice models are trained poorly, the images are generated poorly, everything is wrong, literally nothing is done well on the page
How much of user's input is needed on weights
for generating covers / using rvc / applio
Cause like, have you excluded possibility you've done something wrong / user's error
Is there an RVC TTS software i can download that reads documents in my browser? im doing online classes and the select to read tts is absolutely horrible, and i was wondering if there is a rvc line reader i can use for my browser
There are different Text To Speech (TTS) AIs:
GPT So Vits: RVC isn't as good as GPT So Vits for tts, but gpt so vits (few shot tts, which means needs just a lil training for models) can't use rvc models (and viceversa), and its only limited to: english, chinese & japanese, if you wanna check gpt so vits instead, read https://docs.ai-hub.wtf/tts/gpt-sovits/
Freemium 11labs: An easy way to do TTS is https://elevenlabs.io/, you can't use RVC model on this but its a mostly premium easy way for good quality TTS
FishSpeech: FishSpeech is a 0 shot (no explicit training needed) TTS, if you got a good pc you can use it locally else use their site
With RVC Models:
RVC is natively for Speech To Speech, but forks such as ilaria rvc mainline & applio have built in tts (using Microsoft Edge TTS to make a generated tts audio, which i suggest you to choose a tts model that is the same gender and language of the rvc model you wanna use, and then convert it with rvc)
If you wanna do tts locally with RVC Voice Models (if you got a good pc):
If you don't got a good pc you can do tts with RVC Voice Models on cloud:
-
Ilaria RVC Zero (Running on A100 GPU, free fasted rvc on cloud) and the guide
-
Use Applio UI Colab (with google colab T4 free daily limit gpu)
-
if you don't wanna use edge tts, you could try another tts ai from our tts index and use the output as an input in rvc
does this allow me to select a line and have it read it or does that not exist yet and i have to manually paste it
i want to be able to select text and have it read with rvc directly from browser if possible
i think you can find them in #1159513888199540817
whay do you plan to use it for?
i see you are playing roblox but i wouldnt use rvc on there because i got warned when i did that
like the roblox vc auto moderation warned me lmao
@acoustic scarab is this using my intergrated graphics instead of my gpu
if so how do i change it
oh and it just errored
can't say for sure with given amount of info
Which type of pre-training is the best?
i just downloaded and ran
There isn't a best one, depends on ur dataset lang and size
oh uhh i tried one of the ones you sent and i keep getting this error
i uploaded a model
and selected it
and tested it
and it uhh
didint work
Could you exactly link me what u are using
What should be the target_sample_rate? 40k?
the cut off seems to be around 16k, which multiplied by 2, is 32k
I understand it will be 32k
?
ye
thank you
yw
The best one is the default one which is original, it is better to leave it as default which is original.
thx
How can i upgrade my Applio? I am using ApplioV3.1.1
Download the latest compiled release, unzip it into a folder of your choice, like C:\Applio-3.2.8
move your models and audios over from the old folder to new
delete old folder
Thank you

Bro's still not over about Weights even after a month ago. 
I use Weights to train some "draft" voice models to make sure what they could sound like if trained locally. It ain't a big deal.
I want to use Arabic Vocals to train. Which pre-trained models can do it perfectly?
What does OG pretrain means?
original pretrain that came with RVC
use the default pretrain or try make an arabic pretrain by yourself. using 20+ hours dataset of wide range of speakers and renting some A100s are recommended.
Thank you
Where can i find it?
Is it automatically uses OG one when i click start train?
The other files I had, had background noise though
and they worked
oh
I did a denoiser on my audio and it worked
yup, so we were on the right track then
splitter had no places it deemed as split-able
yeah you should definitely
what are the models made of? vstt stt gtts
Huh? Those are RVC (Retrieval-based-Voice-Conversion) models, which are STS (Speech To Speech)
They are made of a .pth file which is the voice, and an index which is the accent
what is the best local rvc
Hi guys. how much slower training with FP32 is compared to FP16? Does training with FP32 provide enough advantages?
VRAM is the main concern, speed is not 2x slower, but depends on GPU
if you're finetuning you may get away with just FP16
Although, i already made my rvc models. But i had 2 prenium training.
may even not explode during training
#ask Can the voice model here be used for the F5-TTS?
There isn't a best one, all have the same quality
It mostly changes the UI, even if in the latest applio or codename fork there are newer features
What's ur PC GPU
No, those are 2 different type of models, RVC is STS while F5 is TTS
The only way to use it like that is to make a random tts audio with any voice, then convert it to the RVC voice
But you can already just give 30 secs of sample to F5 to make a similar voice, it's just not as good quality as RVC
There are different Text To Speech (TTS) AIs:
GPT So Vits: RVC isn't as good as GPT So Vits for tts, but gpt so vits (few shot tts, which means needs just a lil training for models) can't use rvc models (and viceversa), and its only limited to: english, chinese & japanese, if you wanna check gpt so vits instead, read https://docs.ai-hub.wtf/tts/gpt-sovits/
Freemium 11labs: An easy way to do TTS is https://elevenlabs.io/, you can't use RVC model on this but its a mostly premium easy way for good quality TTS
FishSpeech: FishSpeech is a 0 shot (no explicit training needed) TTS, if you got a good pc you can use it locally else use their site
With RVC Models:
RVC is natively for Speech To Speech, but forks such as ilaria rvc mainline & applio have built in tts (using Microsoft Edge TTS to make a generated tts audio, which i suggest you to choose a tts model that is the same gender and language of the rvc model you wanna use, and then convert it with rvc)
If you wanna do tts locally with RVC Voice Models (if you got a good pc):
If you don't got a good pc you can do tts with RVC Voice Models on cloud:
-
Ilaria RVC Zero (Running on A100 GPU, free fasted rvc on cloud) and the guide
-
Use Applio UI Colab (with google colab T4 free daily limit gpu)
-
if you don't wanna use edge tts, you could try another tts ai from our tts index and use the output as an input in rvc
In F5, it is very difficult to find an Indonesian model. Applio, on the other hand, has many. Unfortunately, Applio does not support voice emotion and Local LLM. Do you have any suggestions?
Applio uses Microsoft's voice service (Edge screen reader0
Is there any way to run Applio with a local LLM?
no
hello, does anyone know why my conversions sound grainy? the original was clear, the model was some of the most popular ones( I tried multiple)
Applico Just uses Edge TTS API to make a TTS Audio, then converts it with RVC
Hi, my mel spectrogram similarity is stuck at around 40-50%. Could this mean the training is going wrong?
Could the edge tts thing be specified in the info notes btw? People get confused and think RVC is TTS too
here's a sample
The quality depends on the voice model
it may be a recent addition
I haven't used applio since September
after so many questions
ah. I have the default pretrain model. and I tried a few models on weights.gg, only the karina one works fine, the rest produces grain like the above (jisoo, jennie, ... all had >20k downloads)
I didn't know if it was the model or some parameter tuning I'm not aware of (big noob)
The default Pretrain model isn't made to be used, it's made only as a base for training your own models
That's how Pretrains work
The quality depends heavily on the model
However applio also has a Clean Audio option in the advanced settings
Maybe that can help
when I say default pretrained, it means I didn't do anything to the default pretrained_v2 folder. I just add models to the weights and used it
Yeah dw even if u change Pretrain, it won't matter unless you train your own
I'll give applio a try I guess
Thanks @low shard
Applio Is a fork of it, meaning more updated with some things
You aren't forced to switch since it's same quality type except some new features
However I remind you it's better to just search for a better model
Ye we call that Mainline
Hello, What are some easy ways to spot overfitting by looking at the charts in TensorBoard?
yeah I'll try to find other models too. I tried all the most popular models and only karina one worked for me I thought it was something else other than model. Thanks a lot!
What should be the target_sample_rate: 32K?
Yup
thx
Yw
That's right. Both Applio and F5 can run TTS.
Are the models in this group only for Applio?
The slicer GUI?
yup
I don't remember that, do you mean this part https://docs.ai-hub.wtf/rvc/resources/dataset-isolation/#step-4-manual-audio-slicing-optional of the docs?
Last update: Dec 24, 2024
So it cuts the sound every 10-11 seconds
I'm not sure what program you're running that's why I'm asking what tutorial (link) are u using
I don't know either. I have been using this program for a year. It divides the sound in just 10-11 seconds and I make the model this way. I saw it on a YouTube video
@crude flame doesn't RVC automatically slice the audio
Hello everyone, I have the opportunity to familiarize myself with RVC in Google colab. https://colab.research.google.com/github/w-okada/voice-changer/blob/v.2/w_okada's_Voice_Changer_version_2_x.ipynb.
Everything starts fine and even the standard voices work well, but if I add custom voices, then RVC stops working and throws this error.
RuntimeError: ('Exeption in RVCInferencerv2F0', RuntimeError('CUDNN_BACKEND_TENSOR_DESCRIPTOR cudnnFinalize failed cudnn_status: CUDNN_STATUS_NOT_INITIALIZED'))
- vcclient_dev/server/validation_error_logging_route.py - 33
2024-12-28 13:49:05,899 - vcclient - validation_error_log - ERROR - Exception: ('Exeption in RVCInferencerv2F0', RuntimeError('CUDNN_BACKEND_TENSOR_DESCRIPTOR cudnnFinalize failed cudnn_status: CUDNN_STATUS_NOT_INITIALIZED')) - vcclient_dev/server/validation_error_logging_route.py - 32
2024-12-28 13:49:05,901 - vcclient - validation_error_log - ERROR - Exception tb: Traceback (most recent call last):
File "vcclient_dev/voice_changer/voice_change_manager/inferencer/rvc_inverencer_v2_f0.py", line 92, in infer
File "vcclient_dev/voice_changer/voice_change_manager/models/rvc/synthesizers/rvcv2_f0.py", line 118, in infer
File "torch/nn/modules/module.py", line 1553, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "torch/nn/modules/module.py", line 1562, in _call_impl
return forward_call(*args, **kwargs)
File "vcclient_dev/voice_changer/voice_change_manager/models/rvc/components/TextEncoder768.py", line 49, in forward
File "torch/nn/modules/module.py", line 1553, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "torch/nn/modules/module.py", line 1562, in _call_impl
return forward_call(*args, **kwargs)
File "vcclient_dev/voice_changer/voice_change_manager/models/rvc/attentions.py", line 52, in forward
File "torch/nn/modules/module.py", line 1553, in _wrapped_call_impl
return self._ca
There can be no problems with the voice file, because I checked locally and everything worked fine.
i cant use voice model
It's too late now, I've already started making the model
how can i use model
You need the RVC GUI
what's that
10sec slices are waaaaaaaay to long
AI helps you use voice models
Wrong channel, use #🔍│help-w-okada
RVC is NOT wokada
Use models how and what's ur PC gpu
Well I have been using the same method for 1 year.
Maybe don't say that else they think it's the GitHub rvc GUI from t1ger that is outdated
Hmm
You're not using that.. right
For W-Okada, go to #🔍│help-w-okada. This #✨│ai-help channel here is about RVC.
and it is wrong. 10 sec slices require way too much audio to achieve the same as, say 1/3 set with 3s slices
-gui
Isn't it too short?
i want to send image of whats the problem is, but le me text it, it says u can't upload file of that type
What RVC are u using and what tutorial
And what's ur PC gpu
U need to give info
each step RVC only uses 0.36s segment out of each sample, so with 10s it uses ~1/30th of the set
how to check
You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
See my message above. If you're using Windows, press Windows key + R, type taskmgr and click run.
Is it okay to use japanese hubert while using a pre-trained model?
Intel(R) UHD Graphics 620
Yes but remember to inference only using japanese hubert after training
That's integrated graphics, very bad, u can't use models locally but can use cloud
Do u want to use them in realtime for calls or on pre-recorded audios like ai covera
yea, real time
Wrong channel then
Rvc isn't for realtime, it's for making models and pre-recorded audios
Go to #🔍│help-w-okada
thank you!
For japanese models I guess
how do i train ai voice model
Which GPU Do you have
i never understood, why do some people train models and some dont?
hi can someone help how to put voice model in rvc
Hey, ! Sam._! Please use the command !howtoask to increase your chance of getting help by structuring your question in a way others can understand better. Also make sure you're asking in the right help channel:
- General RVC help: #✨│ai-help
- W-Okada / Realtime RVC: #🔍│help-w-okada
- AI image related: #🔍│help-ai-art
Some people train models because they need them and they are not available, Some people actually want learn or do experiments because of their creosote to learn something new, Some people also use pre trained models (the models already trained by other users) because of different reasons (they may not have enough time or resources or it can be very complicated for them)
Are you using RVC locally or on cloud. And which rvc exactly?
i never understood, why do some people bake some cakes and some eat them?
Bro... you roasted him 🤣
Roasted like a corn. 

im not a tech nerd cmon...
:blizzard: 👉 :buddha:
:blizzard: 👉 :buddha: à
à:blizzard: 👉 :buddha:
:blizzard: 👉 :buddha: à
:blizzard: 👉 :buddha: à
:blizzard: 👉 :buddha: à
:blizzard: 👉 :buddha: à
I answered your question btw
i saw it, thanks 😊
Don't feel bad or sad for above massages by @knotty moth it's all for fun
Ciao @low shard e @proven hill. Vi ricontatto poichè ho provato a fare come mi avete detto (registrare la mia voce, cantando), ma la mia tonalità è molto diversa da quella di Lazza (o di altri artisti), perciò non gli somiglia. Mi esprimo meglio: ho provato a fare "100 MESSAGGI" di Lazza con le mie parole (esempio: il 3° audio, chiamato 100 MASSACRI è identico a Lazza. Link: #1287422682446565578 message), ma purtroppo come detto in precedenza la mia tonalità è molto diversa da quella degli artisti perciò se faccio per esempio Lazza, si gli assomiglia perchè la voce ricorda la sua, ma purtroppo non è proprio uguale, a differenza dell'audio caricato sopra tramite Link.
Avete altre idee? Ad esempio esiste un TTS Singer che posso mettere un audio di riferimento o qualcosa simile? Tra l'altro ho notato che con un TTS semplice viene meglio rispetto alla mia voce (anche se non a tempo, ma questo non è un problema).
Attendo Vostre,
Lorenzo
Scusate il ping, buona serata.
no, non esiste purtroppo
purtroppoo non credo che esiste
Ciao, vi ringrazio per la risposta. Avete altre idee su come migliorare la voce e farla il più simile possibile
Perchè purtroppo ho aspettato (sperando in una possibile risposta), ma purtroppo @civic meteor non risponde.
forse, potresti provare https://www.weights.gg/it/create/song , qua è gratis (se non ricordo male) e puoi mettere una traccia di riferimento, ma non l'ho mai provato onestamente
potresti provare cosi, ma ho un grosso dubbio che non sara simile, però non so cosa altro dirti
Grazie. Perchè mi sembra alquanto strano che il link di PlasmaTi viene identico.
dimmi come va con weights.gg, non ho mai provato onestamente ed ho dei dubbi vistoo che le voci italiane sono meno conosciute
Massimo 190 caratteri mi dice... Non riesco a fare nemmeno un verso.
ah, mi dispiace
How should i put my model there?
Last update: June 15, 2024
Step Inference: 1. Upload voice model
still 32k
thx
I was busy my bad
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- FaceFusion UI, by Nick088 Google Colab
- FaceFusion NO UI, by Nick088 Google Colab
- EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
I've done voice to voice before but how would you make a parody where you would need to change the original song lyrics
udio ai maybe
What is more important, Fm or Mel?
mel > fm > kl... but if fm is all fucky, you wont get good results
mel as in some latest Applio releases
original one in RVC is not good
You need to sing
And convert the voice
does anyone have free time to walk me through how to download the voice changer top to bottom
Realtime voice changer for calls? That's Wokada, not rvc, go to #🔍│help-w-okada
- train better or get a better model
how do i import file into okada for ai to cover
w-okada is not for covers 🤦♂️
rvc / applio = covers
w-okada = realtime voice changing
#✨│ai-help
#🔍│help-w-okada
Both have their respective channels
ai hubbers am i right?
oh?
im new to this stuff mb😭
's good, dw
ok ima train my model
so rvc is how i make song covers right?
yea
Have the audio file of your song ready, & let's extract the vocals from it with an audio isolation software.
can u send the guide to the train thingy
In the context of RVC, the dataset is an audio file containing the voice the model will replicate. It can be either speaking or singing.
guys, what's up with Ilaria RVC on huggingface? everytime i try to put a huggingface link for a model it tells me that it's not an huggingface link lol
Ok, it's a problem with her space, because the links are working on other spaces
there is no problem.
it's okay now, but before it was glitchin
I had some problems. Turnes out I needed to clean audio of sample and of data. No back vocals, no echo, check your audios for any anomalies that doesn't sound right.
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- FaceFusion UI, by Nick088 Google Colab
- FaceFusion NO UI, by Nick088 Google Colab
- EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
if you have like a pretty good computer, do u think it would be better to do rvc stuff locally
depends
r u the person that made llaria rvc
actually yes lmao
thank you 😭
basically i got a pc with a 4070 and a ryzen 7 9700x recently, i dont know much about rvc i kinda just followed the guide for llaria rvc on i think it was huggingface and use pretrained models. I just wanted to know if u could make like, better quality or better sounding covers by running locally, and also the gpu limit on huggingface got annoying. I also wanted to train models that sound like really good but idk how.
- quality is the same local or cloud
- the limit is lame but thats the max i can give to yall without nothing in return
- for training models you should ask a staffer
I dm'd you by the way though it's your preference on where you want to train. I can't help with that
does anyone know what kits uses for the settings to train their models, tested a dataset on it and it came out a lot better than applio's model
Hey everyone!
I'm currently looking for an internship in AI/ML and I'm eager to gain hands-on experience. If anyone knows of any opportunities or has leads on guaranteed internships, please let me know. I'm passionate about learning and contributing to AI projects. Any help or guidance would be greatly appreciated!
Feel free to DM me. Thanks in advance!
Let's say I want to resume training a model on RVC disconnected that has been trained for 100 epoch, I input total epochs as 200, will it be trained for 100 more epochs to a total of 200, or 200 more epochs to a total of 300?
What does Index search FAILED or disabled failed mean
it resumes from saved 100 to your selected max epoch
does not work, neither the application nor discord sees the sound
gonna talk in #🔍│help-w-okada
Nothing, just that you left index at 0. Not an error
how can i download vocie models from wheigts
To Download a Model from Weights.gg:
- Login
- Click the 3 dots at the right of the image of the model
- Click download
- Download Anyways
- Unzip the zip, and you might wanna rename the pth and index since all models on weights are renamed as 'model'
@raw stirrup It's better to ask here rather than in DMS
about your loss fluctuating, maybe wait a bit, it's just the 40 epoch
ok thx
also check the tensorboard
mainline ?
yeah the og one not mangio or applio
Maybe try checking https://docs.ai-hub.wtf/rvc/resources/training/#tensorboard
Last update: Dec 24, 2024
its the latest nvidia build but i dont see a tensor board batch
I haven't used mainline locally
ok thanks bro
yw
chat gpt seems to think itsfine so far but its just a bot lol Conclusion
From the logs provided, your model is progressing well. The loss trends are typical of a generator-discriminator setup in mid-training. Assuming no major artifacts in the generated audio, you're in a great position to continue training and refine your model further. Keep monitoring the losses and evaluating samples for the best results.
😄
u need to download th tensorboard bat from the link nick sent, put it in the rvc folder, run it in there
cant seem to open link 4sum reasom
this is the tensor download link:
https://docs.ai-hub.wtf/rvc/resources/tensorboardfiles/tensorvenv.bat
thanks bro
can someone teach me how to use a voice channger
this is the loss graph is it doing well?
hey guys im a newbie and i was trying to train my voicemodel but i get no-feature-todo instead of all features done in the step 2b
Ye, going down is good
Are all ur files in the dataset.wav?
Try to report them aw wav in audacity
thanks brother 🙂
Rvc is not for realtime voice changer for calls, ask in #🔍│help-w-okada
ohhh, ty lol it was in mp3 format
@low shard i still get the same thing :/
You should never train on mp3
It's lossy
Be sure to always get wav sources for ur dataset
Show me how ur dataset is formed
Like a ss
oh
Try restarting discord
Is RVC Nvidia 1006 still the go-to for realtime voice conversion and training?
realtime its https://rentry.co/forkvoicechangerguide
training you can use it, people been using applio lately or forks, check at https://docs.ai-hub.wtf
Guide style is in the same as Blanc_dot's. Thanks Blanc_dot for corrections. Most technical information comes from deiteris.
Last update December 12: NEW UPDATE VERSION b2332
Translations added for:
German: https://rentry.co/ForkVoiceChangerGuide_de
Turkish: https://rentry.co/ForkVoiceChangerGuid...
Last update: Oct 21, 2024
Thanks b 🙂
can someone help me humanize a really short text
i just cant get something in my head
dont even need an AI
guys i have a rtx 4060 ti but dont know what to set the epoches
idk somewhere ig
Last update: Dec 24, 2024
u have to install python
and then you need to add pip to the PATH environment variable.
Recently, some people have reported that when inferring with applio, the beat of a song becomes increasingly out of sync.
Does anyone have any idea what might be causing this?
Sorry if it's hard to understand, I'm using machine translation.
applio is a experimental version of rvc, i recommend rvc mainline or mangio
ive tested all 3 versions mainline just works much better and is more accurate in training models and less bugs
i been doing this stuff before rvc and so-vits began
im from the tacotron 2 deep faking
Thank you for your advice.

applio is not an experimental version of rvc
at all
its taken from rvc not a new thing
is what i meant
applio is a fork
not an "experimental version"
and mangio is very outdated
yeah its forked from rvc mainline tho, and is worse in my opinion
na ive trained 3 models on both versions rvc mainline is much more accurate and infers just as fast
? ive used all 3 versions and been doing this since 2020 with extensive testing
and soon applio is going to be better if refineGAN gets a pretrain
when tacotron2 and talknet weas around
ill test future updates but right now mainline for me is much more stable and accurate
when was the last time you used applio?
2 weeks ago
i even tried the new beta thing they made
was worse in comparison to mainline
mrf and refine dont have pretrains so ofc they suck
think they either messed with the configs or some other code
they made it better
thats all
if it was better the models would be more accurate than rvc mainline which isnt the case
i purpose u to try it yourself
better as in faster
propose*
i do
ive been using applio since pretty much the start
the quality is the same
user error then
same datasets on applio and rvc and rvc mainline comes out more precise
every time
ive trained big modles 2pac andrew tate solid snake mainline just is on another level
wish i could agree with u
if it sounds worse thats just user error
it cant be user error when the dataset is exactly the same and same settings
ill provide u with a audio sample from both models 1 trained on applio and 1 trained on rvc mainline and prove ur wrong
Can anyone help me? My VB audio doesn't come out with my voice so I can use AI's voice on discord
Hey, kakauinworld! Please use the command !howtoask to increase your chance of getting help by structuring your question in a way others can understand better. Also make sure you're asking in the right help channel:
- General RVC help: #✨│ai-help
- W-Okada / Realtime RVC: #🔍│help-w-okada
- AI image related: #🔍│help-ai-art
did u set the mic outputs etc in youre gui
Yes, I even selected the microphones in Windows, but the voice doesn't come out
probably best to ask here https://discord.com/channels/1159260121998827560/1159290161683767298
did u train the models using applio and try to infer in applio and mainline?
no i trained my dataset on both applio and mainline both at 200 epochs and same settings mainline came out more accurate
not just my 2pac model biggie andrew tate etc mainline wins for me
im using latest nvidia mainline build
did you infer the applio model in applio
yea sending now
and did you use default pretrain or another?
default but ive tested multiple like custom on both mainline and applio mainline always comes out on top
infer using the same audio
same batch size?
i always train on batch size 4
always same settings
my rvc mainline model is damn near hard to distinguish from the real 2pac shakur in my opinion very lifelike
lol ive done nothing to it like i said try it for yourself train all on same settings same dataset and mainline latest build will come out on top
anyone saying otherwise is telling lies
whether its bias or something i dont know
but hey whatever flaots youre boat for me mainline is the best thing to train on
🙂
The "AI's voice", you might mean by W-Okada. For virtual line audio, I'd suggest for Virtual Audio Cable (VAC) Lite instead, as this one works better than VB-Cable.
Guide style is in the same as Blanc_dot's. Thanks Blanc_dot for corrections. Most technical information comes from deiteris.
Last update December 12: NEW UPDATE VERSION b2332
Translations added for:
German: https://rentry.co/ForkVoiceChangerGuide_de
Turkish: https://rentry.co/ForkVoiceChangerGuid...
The only thing W-Okada related to RVC is W-Okada uses RVC voice model to inference, the code and GUI themselves aren't related. It's best to go to #🔍│help-w-okada for more information.
It's when the AI audio takes a long time to come out, how to solve it?
It's best to go to #🔍│help-w-okada for more information. This channel here #✨│ai-help is about RVC and certain RVC fork GUIs.
okayy
you likely ran process effects that does normalization of the full source file
while RVC normalizes each individual chunk without asking
that's why there's a difference
but if you take sliced audios from RVC and copy them to Applio's model/sliced audios (and 16k), the result should be the same
again there's a possibility that RVC does yet another normalization for inferred audio
but that's just how it is
@simple ore I ran no effects i trained on both versions with the same dataset at 40k pretrains v2 lol, no effects no tweaks both equal settings same fresh install default settings and rvc comes out more accurate
ive been testing these methods for months on both applio mainline and the mangio, But mainline rvc just trains more accurately
ive tested this on biggie dataset michael jackson andrew tate 2pac freddy mercury
etc
all to the same result
i used to love so-vits before rvc then i favoured rvc, then mangio was a thing and i tried that for a while but it had many bugs then applio was introduced but it never had the accuracy or same impact as mainline rvc i dont know what was changed or altered in the code but the accuracy in model performance is not the same, for me anyway
but you know each to their own whatever works for you guys
you still judge the same thing on the latest applio 3.2.8 bugfix? and have you tried some other pretrains like KLM 4.3 on both mainline and applio?
note that the default pretrain was trained from vctk dataset with background noise, and not denoised, so it might be fine for average dataset being not very "clean"
Joined this server a long long time ago, but was on a laptop that didnt have good enough specs to run anything. On my new pc now, anyone wanna put up with my bs an help me set it up?
There's no other way?
how can u make ur the voice u want to work its not working for some reason like the thing work it just i can't get it to wokr on discordd
anyone else have an issue of split audio shortening an audio clip?
this is most likely a placebo
I've been training using rvc for over a year and recently switched to applio as my fork's base
it's most likely a coincidence man because there's not a single thing in rvc that makes it better architecture wise
Trust me, it's a matter of randomization, weights init and the nature of stochasticity in a way
Enforce fp32, use deterministic tweak and test them both side by side.
Then, If you want, send some test samples from both and I'll do spectral comparison for you to rule out " differences "
(( oh yeah, also each inference is randomized and different even if by a little. Sometimes better, sometimes worse
Hey, guys. Are there noticable difference in training from loss and lossless audio samples?
Don't think so
Lossy files are lower quality
But does it matter if I can't tell a hearing difference in original audio sample be it loss or lossless?
Realtime Voice Changer for calls? Wrong channel, tell me ur GPU name in #🔍│help-w-okada ( a screenshot and also the link of the tut u followed)
thx but i can it working some how lol tho
I don't have a comparison but usually you shouldn't on lossy
I have a question: is there a difference if we train a model with 1 hour of audio or 5 hours of audio?
iirc there shouldn't be a difference after 1 hour
Guys, could someone send me the link to Amplio for training?
Another question! Which is better? KLM 4.3 [x3] or Original?
im just explaining my experience ive been using all versions since they were released i started with tacotron 2 in 2020 then talknet2 so-vits etc and for me rvc mainline outperforms everything based on default setups and default pretrains ive tested custom pretrains on both applio latest version and mainlinme rvc and rvc still trains my models to much better accuracy im not sure why that it is but im just saying thats the case for me so-vits isnt even in the picture anymore im not trying to shit on applio i respect people trying to make better things and improve stuff but for me it hasnt been the case mainline latest nvidia build just works to perfection for me ive trained many models on both applio and mainline mainline wins in accuracy
again u can hear the difference night and day rvc is almost realistic where as applio just sounds off
u still haven't answered whether you have tried other non-default pretrains using mainline and applio
i said in the reply i have tried custom pretrains on both versions
Does the voice changer **MMVCServerSIO **program lead to a ban in the Fortnite game?
i dont know what you want me to say bro rvc mainline is just more accurate but u dont have to accept that its fine i have 0 reasons to lie i would love for applio to work the best as its newer and more maintained but the reality is maniline for now just works much better
shouldnt do unless youre miss using it
For W-Okada, go to #🔍│help-w-okada for more information. This channel here #✨│ai-help is all about RVC and certain RVC GUI forks.
However, I don't think using W-Okada in Fortnite can get you banned. Unless you're using it for something bad.
-gui
Training model with better quality, with the same pretrain model and dataset, doesn't have to be just RVC GUI, Mangio, Applio or the Weights. The better quality can happen randomly, even with the same pretrain model and dataset. Unless you've got that good ears to hear the difference.
i get what youre saying and that would be the case if i only tested one dataset on all 3 versions but ive been using multiple datasets from different people biggie 2pac andrew tate buklau freddy mercury loads and they always sound more realistic on rvc mainline
i train all models to 200 epoch for a starter and they just train better on rvc cant explain why i didnt code these programs but thats been my experience
i even tried adding the configs from rvc to applio and they still trained differently off sound etc
so weird
French Montana voice model exists on Weights. But it seems like no one made a model of him in #1175430844685484042, sadly.
You can request a voice model of him in #1159289738314919936 or train one by yourself.
doesn't sound like hima at all
KITS AI? or any website free?
Kits.ai is now a scammy site, and has some expensive tiers to use their service. Weights is one of the only free websites alive.
respect to the owners/creators of weights.gg
does anyone have most updated guide how to create a dataset and rvc model of my own voice, goal to use it to merge with rvc's so w-okdada would sound really good
How do i in port models with RVC Gui
Last update: Oct 21, 2024
just add them to folder weights
inside rvc folder
if youre using the latest rvc mainline its assets/weights
Hey is anyone else getting a "build error" when trying to duplicate an RVC space on Huggingface
The space works normally without duplication, but whenever I duplicate it gives me an Exit Code 1 err.
wich space?
For the RVC V2 spaces made by rejekts
maybe its a space problem (?)
That's what I was wondering, I have another space that I recently restarted and it seems to be working fine.
Idk if this helps
This is the err I keep getting on only RVC V2 spaces (Rejekts ed.)
Send link
That's old, don't use it
What's ur PC GPU
I use standard CPU
I used to duplicate space with no issue. Is there a more up-to-date one for RVC?
Oh NVM do it on cloud
- Cloud (remote good pc, easier and faster than ur PC but it's limited):
- Ilaria RVC Zero: fastest and simplest that you can get for free
- Weights.gg: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio Colab: max 4 hours, not granted, of GPU
Easiest possible (automatically separates vocals & instrumentals) : weights.gg
easiest cloud: Ilaria rvc zero
is there any ai voices that support things like laughs?
rvchuggin face plz
-hf
Suggestions for @jaunty mason
- UVR5 UI, by Eddy and Ilaria Huggingface Spaces
- Ilaria RVC Zero, by thestingerx Huggingface Spaces
- RVC⚡ZERO, by r3gm Huggingface Spaces
- Applio, by IA Hispano Huggingface Spaces
the last 3 options
depends on what you wanna do
Uh
I wanna do a real time voice changer
and something just for audio recordings
to make voicelines
for combine soldiers
-rt
Interaction has expired, use the command again for a new interaction.
Thanks
for the voicechanger
@low shard can help you better
theres the download section
For W-Okada, you can go to #🔍│help-w-okada. This channel #✨│ai-help is about RVC.
Yes, click that .exe program.
This means W-Okada successfully launching. It should launch one of your web browsers automatically.
text comprehension 0
Damn.
yes with viruses and rats
now
i gues
what now
Do I just drop the folder somewhere
I'm not gonna tell you every one by one step.
-realtime
Interaction has expired, use the command again for a new interaction.
First link.
kk
For more information or any issue using W-Okada, you can go to #🔍│help-w-okada. This channel here #✨│ai-help is about RVC and RVC GUI forks. 
not the tiger18n one 
Does it work on Roblox?
This is like the second time I've repeating this.
I can't hear myself
And it just
is not working
I can't hear myself
I'm trying different settings
this
what is
this
Woo. Why did you set the audio system to "server"? The client mode works best for certain scenario.
where is the "Unwa's big mel roformer beta 4"?
._.
Have you tried setting like this on W-Okada? This could make a difference.
Again, if you have any issue about using W-Okada, go to #🔍│help-w-okada. This is like the damn third time I've been repeating this to you. 
Anyways. It seems like you've downloaded the DirectML version of fork W-Okada, two GPUs appeared on your W-Okada terminal, which can mean your PC either has an integrated or AMD GPU. That GPU part on GUI, you're still setting it to CPU, which means W-Okada will use your CPU to inference instead of your GPU.
Almost every setting on your W-Okada seemed to be set way wrong. 
it works
the issue is it's nto workin ingame
in roblox
I ask players
and its
not working
I'm literally in the Help w okada
It keeps saying cpu for gpu
can you show the error?
what? @oak plank could u check this?
It keeps saying cpu in the gpu area
I have given em AI Members role, they should be able to send images here now
I also updated the bot, now it gives AI Members role at level 2
So basically after 10-15 messages
hello who are you
you didnt know?
Nice to meet you Ilaria!
nice to meet you!!
I think I've heard about you, not sure when tho hahaha
probably negative 😔✊
I joined in September so yeah lol
fresh meat!
Not really, idk I read somewhere you were an ex admin ig 
i assume youre from weights then
Maybe when I was cleaning up roles 
Yep!
yea then something happened and baboom
SO IT WAS YOU!!
my role :(

how things are going with the server btw?
recently staying here sometimes but idk
In AI Hub or Weights?
contributor? i remember specifically told to not give me that 💀
Not that
both :3
Something special for admins 
It's going good, currently I'm focusing on Dreamtavern by Weights activity, it has 4.6k members lol
Coool 
what is dreamtavern?
Next, it will be AI Hub's activity, making it active af
It's a web app where you can chat with AI Characters at https://dreamtavern.ai
So they also have a server
Dreamtavern app is owned by Weights now, so yeah
good luck!
when i was admin it was pretty active, then most of the usuals left for good and it kinda died
im gonna check it out!
is something similiar to sillytavern?
Mhm! We will try our best to make this server even more better ☺️
Not sure hahaha
I am also new to AI stuff tbh
double fresh meat!
I am good with discord bots and web apps tho
😆
what are you specialising into?
web dev mainly, 3d websites, also curious to learn more about AIML
you also removed my bot, i think to replace with yours(?)
It was being discussed to remove the bots that are not used anymore, so we removed a bunch of bots 😅
you should def check out a new LLM that its better than chat gpt 4o
make sense, i didnt update it anymore
let me grab the name of the thing
claude 3.5 sonnet

Thanks for sharing that, Illaria! I will definitely check it out 
let me know if you like it!
its free on their site
oh wow its cracked
Suree
its also open source but you kinda need a bunch of A100 to run it lol
Hello Razer
hewo
How was your day? :D
I suppose it's available in hf zerogpu? if so I'll check it later
is good just suffering in val and looking into DeepSeek V3
i truly dont know!
im gonna try making a hf port maybe
Niceeee
claude still mostly better at coding
chinese people do be really stay on top regarding ai stuff tbh
all types?
I use AI for understanding programming concepts sometimes
because sometimes there is code I have no idea how it works
😭
Especially GLSL
DeepSeek V3 is worse then claude at coding
rip
unless this is wrong https://github.com/deepseek-ai/DeepSeek-V3?tab=readme-ov-file
oh wait
am i blind
or dumb
or both
ok flip what i said
i am both
nvm DeepSeek V3 is better

Hello Maki, How's it going?
new year to me 12 hrs ago
Happy New Year Maki!
also, youtube is shit
i dont remember where i was talking about youtube quality today
Do you know what exactly happens there? Is the 100th epoch retrained or will the model end up with 1 additional epoch?
It seems to read the epoch for D and G files correctly tho so I'm confused why the training is offset by 1
probably a glitch
try the latest applio that should resume from epoch 101 instead of 100 again
anyone have an idea why in discord for the input devices it says cable output when it should be cable input?
actually tihs happens everywhere
Other than small dataset and 2^n efficiency, is there any advantage in setting lower batch size than the GPU can handle?
batch 8 for most cases, or 4 if short and less diverse (<10 mins)
Is there any open source project available like Suno Ai ?
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- FaceFusion UI, by Nick088 Google Colab
- FaceFusion NO UI, by Nick088 Google Colab
- EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
Going over 8 makes training faster per epoch, is there any drawback? In case of big dataset
easier to overfit and make sure the dataset is diverse enough, though more recommended for training a pretrain
also batch 16 fp32 could take up to 32 GB vram without bottleneck
I'd rather choose batch 8 fp32 better than batch 16 fp16 while taking the same amount of vram
guide
because RVC-Boss is too lazy to make one line fix for resume
batch 16, fp32 even without checkpointing
fp32 does not double the whole vram
5GB vram was the base for everything other than fp32 weights, with fp16 it may take total of 8GB, with fp32 it may take total of 11GB
prob different hardware, in kaggle T4x2, batch 8x2 causes slower time per step
multigpu configuration may be slower. Though which Applio version?
4 am 😭
:3
Interaction has expired, use the command again for a new interaction.
guys.. how do i use find models💀
You can search rvc ai voice models at:
- #1175430844685484042
- In #🔍│find-models , Do /find with @earnest musk
- https://weights.gg/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.ai-hub.wtf/essentials/how-to-make-voice-models/
:wave: @low shard, How can I help?
Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image
How can I fix this sound so that it sounds like purah from zelda totk. He's very robotic and doesn't have proper facial expressions.
yass get it king
Is it related to this topic?
how do i need to fix weird sounds, because i'm speaking one time its not take my voice, second time its really loud at voice still dont got, my mic is very good
hi im having a problem with uploading models ? it doesnt work when i press upload then select the pth and index files, and press upload again it just doesnt do anything at all and stays like that, also the voice cuts off like insane no one would understand
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- FaceFusion UI, by Nick088 Google Colab
- FaceFusion NO UI, by Nick088 Google Colab
- EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
is applio broken on mac? i have a friend with an m4 mac and conversion isn't working for them
whats good foe
Hey guys quick question here. What's the fastest way to train in cloud? Paid Colab? Wich gpu? do we have some info about that?
@acoustic scarab help this fella out
i dont use applio
- a windows user
@hearty idol help this fella out
That's an insulting. 
what
nevermind, it just takes longer than usual
🙏
realtime voice changer client stops working suddenly
i know it stopped working when the numbers beside vol are not moving when i talk
is pytorch down?
I formated my pc and can't run RVC anymore
I know is really old by now but I got used to it and would like to keep using it
could somebody help me?
you're installing oudated software with libraries no longer available
https://download.pytorch.org/whl/cu118
Does manual installation also fail?
damn. is there a newer version or a way to get those libraries?




qERW