#✨│ai-help
1 messages · Page 115 of 1
UH
it says convert
ye i get it now
@pastel oak you got a clue? o-o
thanks for the help lmao
@slim geyser Sorry for the ping o-o
please avoid pinging random people
what are you trying to do with ASIO?
follow this guide: https://rentry.co/VoiceChangerGuide
I need their help, and Mares/Shad are the only ones that were talking about it in the past day
Mare helped Shad set it up
https://github.com/dechamps/FlexASIO/releases/tag/flexasio-1.9
https://github.com/flipswitchingmonkey/FlexASIO_GUI/releases
get these two things then ping me again
Done
also make sure to get this https://dotnet.microsoft.com/en-us/download/dotnet/6.0
Ok did you install everything aswell
start the Flexasio GUI app
mhm
Ayo? @simple igloo level 4 !!! 
i mean mares already sent screenshots of what to tick since you saw our conversation
copy that, and save to flex toml
alr lemme find it
done
just changed the mic to mine for input
copied mare's latency and other settings
thats pretty much everything
if youre using realtime gui then select flexasio as input and output, thats it
hmmm
gimmie a sec
think my first installation might've went wrong somehow?
im not sure if this is the fix but select audio SERVER and see if its there?
I see it there now
Okada's getting errors now though
ill check myself too
yea doesnt work for me either
just download rvc mainline its imo better anyway
fixed it by changing output to line 1
what does rvc mean
the difference changed drastically
Way faster now
😱
im using 256 like its 192
niceeee
is rvc ai voice
did you use MME line 1?
wait youre using 192 chunk on a 4060?
I mean I dont got a problem with it 
im using 80 chunk atm with a 3060 ti
gives me the best quality and I can maintain a convo
overall its not bad but on rvc i can use max Extra for better quality with barely any cpu usage, on wokada if i choose 32k it goes up to 50% ish already
if youre happy with it, nice 😄
retrieval based voice conversion
is it bad that i have RVC Quality set on high 😭
retrieval based voice conversion
i guess you can sum it up as "ai voice"
everybodies telling me to set it to low
but setting it on high gets rid of the cut off
its unnecessary resources used for no difference
everytime I stop speaking
what cut off do you mean?
you know how normally when you finish speaking, the ai just eerily goes silent
like cuts off
think its the noise supression
noise suppression is deactivated when you use audio server
what you could try is start 0.0 end 1.0 trancate 100
and silencefront off
silencefront always resets to On when you restart the app or change a voice model fyi
it helped a little bit I think
Also, not sure if you have the issue, but if i play a video game with wokada then it adds another layer of delay. Not sure if this would still be the case with the asio routing inside wokada, havent tried it yet - but its an issue i dont have with realtime. if you have something similar, do let me know if the issue still persists with the new routing and let me know 👀
Will do 👍
What time zone does the app use?
question can i have any nvidia gpu for this?
cause it says there is no compabatible gpu or it has tto be aspecific ndvidiagpu?
ayooooooo
what kinda of software should i download for AI voice changer
whats your gpu
i have used some rvcs but which is the best one that gives the best result for you guys
since i will mostly likely buy a new cheapest laptop which nvidia gpu is worst that can ne ablle to train like a picture a posted up before or which model is nessecary for it
yo so the colabs aren't working anymore correct
i see about of 40+ of nvidia gpus on wikipedia which is the best or worst of the best @shad ?
how much money do you got to spend
if you can get one of the rtx gpus you should be fine. ideally 2060 or better
if you want to get technical, one with 8-12gb vram
can you send me pls a video for the best configuration in order to get the best and the realistic voice sir
Ayo? @red mural level 1 !!! 
i mean i got about 1000 euros but also my dad would lend some money soo
so basicallz i can get any nvidia gpu for it right?
there are no videos, youll have to ✨ read ✨
go through each step, ignore amd, colab, mac, intel
which country are you from
tysm < 3
serbia
can you search for a " GIGABYTE G5 KF-E3DE313SH " on any tech stores you got
or GIGABYTE G5 KF-E3DE313SD
and then what
or ASUS TUF Gaming A15 (FA507NV-LP002W)
these should be in the 900 - 1200€ area
well, buy one of them?
getting this error when running batch ASR in GPT SoVITS
RuntimeError: CUDA failed with error CUDA driver version is insufficient for CUDA runtime version
hmm ok or i want buy cheapest or since i have sapphire nitro+ radeon rx580 4gb gddr5 can that gpu do for mangio voice training help me or nah also by putting 2060 on medium laptop can that help me? i have a father friend who knows these type of stuff so can that help me?
i mean we already talked about this, i sent you the amd version of voice training to try it out and you ignored it
it might be possible with rx 580 idk
but very slow and maybe not the best outcome
sorry for annoyin ya i just suck at this
trust m e i tried it and nothing i think
ah then unlucky
and im not sure if you can just put a different gpu laptop inside another that easily, but you can ask your tech friend you mentioned
actually i have saphire nitro ericx 580 4gb would it help?
How do I add my own voice model?
did you already upload the voice model you trained?
should i donwload it in ssd or hdd or its not a prblm ?
ssd is always better
Hi guys, beginner here. I was trying to figure out how to make features with two models. My approach was to make two full songs and stitch them together but if the 2 singers in the original sound too diff then it will distort the voice. Does anyone know a different approach?
when i chose a voice it shows me this message
unknwon message
If you clear the information being managed by this app, it may be recoverable
i dont understand, can you send screenshot also use #🔍│help-w-okada for voice changer help
done
Hello, how do I make TTS with RVC have a less monotone and more animated intonation?
well, the integrated tts in ilaria & applio rvc is edge tts which hasnt much emotions, you could to use bark tts #1212368971127590922 and then put the generated audio as an input in tts with the voice model you want
i see. are there tts models trained on people's intonations?
like, italian, brazil, etc accents? well that depends from tts to tts, by testing, technically rvc should change the tts output to the voice model accent
well, not exactly accents, but people's intonation and ups and downs in the voice
to make it less flat if that makes sense
you mean make it more real ?
yeah i guess so
well, edge tts looks real but emotionless, you could try bark tts which has more emotions in it
RVC Guides (How to Make AI Cover)
Translation by country
i've tried the google colab test notebook for it, but i've noticed it's often inconsistent
like the voice changes a lot, sometimes it just sounds like random noises, etc
be sure you have this checked on and select one of the voice presets
i don't see that
op nvm found it
and btw the blue "HERE" redirects you to a page with more info about all the voice presets
there are english ones
brazilian ones
etc
ok, i don't see "HERE"
Ayo? @strong fulcrum level 6 !!! 
this is what i have
!pip install git+https://github.com/suno-ai/bark.git
from bark import SAMPLE_RATE, generate_audio, preload_models
from IPython.display import Audio
preload_models()
text_prompt = """
I have a silky smooth voice, and today I will tell you
about the exercise regimen of the common sloth.
"""
audio_array = generate_audio(text_prompt, history_prompt="v2/en_speaker_1")
Audio(audio_array, rate=SAMPLE_RATE)```
send the google colab you are using
idk where you got that cus it isnt in the ai tts guide
its better if you use only the colabs/hugging face spaces inside of that #1212368971127590922 guide
I got it from here:
https://github.com/suno-ai/bark?tab=readme-ov-file
thats the official one but its not really user friendly, thats why i made that google colab i sent you which is in the guide
its easier to use
yeaa that makes sense
but so i would install it either from this github or pip install it for local right
you wanna do it locally? youd have to look up in their official github guide https://github.com/suno-ai/bark?tab=readme-ov-file#-installation
and speaking of, there are still random noises even with voice presets. like a random loud beep, random faint music in the back (from which i assume is from it being trained from videos?), and also it often randomly switches voices even with voice preset set
from what i just tested, i checked the use voice presets option and it was always the same voice, there might be some weird noises sometimes tho as it is focused only on emotions and its not still in development anymore
hiii, i was trying to use a model, but often the link doesn't work, for example this one: #1175500281979605034 message
can anybody tell me if there is something i can do?
what google colab do you use? send it
Its the right one yea
You dont need to use another one
The problem is the model download
That model download is deleted
So thats why
oh really? interesting
also yea, idk, seeing how barktts works, it would be a bit slow for local :(
I mean, local depends all on the power of you pc
not sure how this would fair
colab ^
How long was the text😭
If u put very long text ofc its gonna be slow 😭
tho each generation is only a sentence or two
This all is just for one of the sentence?
no, each one is a sentence or two
You sure you are connected to the colab gpu btw
Check to the up right and see if theres a "T4" near to ram
Okay good
Well, it all depends on the lenght of the text, considering bark is abbandoned from some months😭
unfortunately
can anyone understand the graph?
I did 400 epoch
Could I have continued training him?
or is the model overtrained?
I used a quality dataset extracted from the games with deverd and denoise
Ayo? @bronze maple level 2 !!! 
yea, and it seems the voice bank isn't very balanced. like en voices only 9 males and 1 female, while japanese voices have like 8 female and 2 male
Sorry but i can't really do much about that as i didnt make the tts model😭
yeye im just sayin
ur good
sei italiano ?
also, are there other tts that have similar goals/functions as bark?
Si
Piacere cosa cerchi da fare ?
In che senso che cerco di fare
Sto solamente aiutando alcune persone
a capito 😂 mi sembrava che eri tu che ai bisogno
Mmm maybe you could try tortoise tts and xtts2 which are in that guide i sent yoi
Nono stavo parlando del modello di bark ai tts, visto che mi aveva chiesto perché ci sono di meno voci feminili e gli ho detto che non so visto che lha fatto suno ai non io il modello
I just crashed my computer lmao
aaah ok
Alleni anche le voci tu con RVC ?
How did you even manage to do that😭
Si ho trainato alcune voci
Lmaoo idk
Screen froze, then pink screen appeared, then turned off XD
Are you using the ai locally
yea lol
it's a mac m1 16gb lmao
Macs arent well known for running ai things 
well m1 seems to be alright from other ppl
E' una settimana che sto cercando di scambiare con qualcuno, da 2 settimane mi sto allenando per allenare le voci, voglio migliorare nonostante le conoscenze che ho appreso
tho tbf, i have a million tabs open in safari
It also depends on the gpu, for example m1 is very very less powerful than T4 (google colab gpu)
huh interesting
tho can't be using google colab all the time
In che senso allenando? Stai trainando una voce al giorno?
Where do I find readymade datasets?
the more u take advantage of it, the more they nerf it for ur account
I mean true, if you want theres also an hf space on the guide for bark tts (it uses T4 gpu paid by suno AI)
yea i've seen it
Btw bark technically could run even on cpu if im not wrong, not sure why you got that error on mac but honestly not even the newests mac are able to train ai models, there's a difference from an ai task to safari tabs lol
Sì ahah 😂 , padroneggio il dataset, ma non capisco il grafico della tensorboard, voglio capire come realizzare una voce nel modo più perfetto possibile
so the only thing i have on right now is discord, terminal, finder, and textEdit, and i'm not running any AI programms rn (no applio no nothin), and this is my mem
Se vuoi questo ti può aiutare https://docs.aihub.wtf/rvc/resources/epochs--tensorboard/
Last update: Feb 10, 2024
Si o visto fra , ma non capisco Non so quando dovrei interrompere o continuare la formazione, se il mio modello è addestrato meglio o meno
I honestly dont use mac but its using 11gb of ram by just having that apps you said in the message what😭😭
Dovresti cercare "g/total"
Con la g
how many epocs might i need for a 20 minute dataset?
E monitorare quel grafico
sì, è nell'immagine che ho inviato
There isn't a right amount of epochs for training an rvc ai voice model, check this pls https://docs.aihub.wtf/rvc/resources/epochs--tensorboard/
Last update: Feb 10, 2024
Tecnicamente il modello non sta overtrainando, sembra un po sus quel punto alla fine che si rialza, aspetta un altro po e controlla come va, il punto di overtraining é il punto più basso prima che si rialza tutto, se vedi che overtraina stoppa il training
I tried closing everything and now it’s using 5gb
You could retry using it locally but i dont know if it will work for mac honestly
Bro i had like maybe 500 tabs open in safari and closing it made it drop like 4 gigs
😭
Lolll
hows this im at 130 epocs and 5.95k steps
Ok grazie riprovo allora
Ayo? @bronze maple level 3 !!! 
its set to go to 500
How many minutes of dataset do you have?
20 min
Ayo? @hoary zephyr level 3 !!! 
and do you think that model is training correctly and that he is not over-training?
You putted smoothing to max right
mhm
idk im new to this
Model looks fine right now dw
well would you think 500 is ok based on it
Well, that depends model to model, it could be yours overtrains before 500 epochs, or doesnt, 500 epochs is just used to see how the model training is going, like as its much epoch and you will have a detailed graph
Which RVC do you use? I have Applio v3 but I don't know if it's good
its a model of my voice with talking and singing
You mean locally or on google colab?
Oke, just check the graph time to time and see how it goes
Local fra
i need help with paperspace
kk thx ill keep yall updated
Ah non ho un buon pc lho faccio in colab io, cmq applio é buono da quante ne so
I might passout later its 3 am lol
google collab e una cagata oh un buon pc i9 RTX Studio 4080
im gonna be here till 3 am mst at this rate lol
Google colab offre una T4 gpu gratuita che è diciamo lo stesso buona per ai, ovviamente non super veloce come una rtx 4080 ma non una cagata per persone che hanno un laptop o vogliono farlo da mobile
how do i install rvc on paperspace
Aggiungimi in amico se vuoi posso train un modello per te proviamo
Sinceramente non ho nessuna richiesta di modello da creare per ora, tutti i modelli che volevo (40 se non mi ricordo male) gli ho già fatti
how do i fix my rvc not creating any .weights file
posso vedere un modello che ai fatto bene ?
@low shardlo sai comme si fa per riprendere il train di un modello ? cosi non ricomincio da capo
Mi dispiace ma in locale non so molto, potresti provare a vedere il 5 resuming step qua https://docs.aihub.wtf/rvc/local/applio/#training-
Last update: Mar 8, 2024
Sinceramente gli ho fatti mesi fa ormai, pero li trovi in #1175430844685484042
why arent weights showing up
Ayo? @austere ice level 1 !!! 
in the folder
Are you using ilaria rvc google colab version?
same problem as me too
Ima delete it i dont think you want others to use it lol
Yup lemme check it
That isn't a rvc model
That's a gpt so vits model (the tts ai tool)
So ye u cant use it in ilaria rvc
Youd have to use gpt so vits which is inside the #1212368971127590922 guide (the docs link one)
Or just find another rvc model
Yw
yo my rvc isnt making a .weights file 
ah fuck im just stupid
forgor to look in the assets folder
hey is there anyway to export a model early with the disconnected Collab its about to reach 250 epocs and i want to save this iteration for use in case of it getting overtrained at 500
Look guys, I tried to resume training on my old model, the graph has drifted, is this normal? 😂
Type loss/g/total
First go to scalars, then reload data, from the settings icon type loss/g/total set smoothing to 0,987
Hey man another question for ya, do u use a serverless/cloud computing service for ur personal ml stuff? What are some good ones? Are there any good free ones?
You mean what do i use for using my personal ai projects online ?
Google colab or hugging face spaces
Yeah
I dont really do anything local as my laptop doesnt let me lol
O ic. Arent there a lot of limitations tho
Well, not sure about huggingface spaces tho. Can u connect it to ur local program?
I mean for google colab you get free daily gpu and you can just switch google alt accs yk, and some stuff works without taking much time on cpu on hugging face space like #1163571683848900629
Well hugging face spaces work as like docker online, you have only free cpu 24/7 and if you build smt really good you could ask for a community gou grant for always have gpu on your hf space
But i dont think you can connect it to your local pc
I see
What about for colab? Can u hook up outputs to ur local program?
Ehh never tried that but i don't think you can hookup local stuff to cloud one
how do i fix error 1 in paperspace
Ic. So all thats rlly possible with those two would be in an enclosed environment then
Ayo? @strong fulcrum level 7 !!! 
in tensorboard what are epoch number are steps equivalent to?
Yup i really think so
ic, ok. thanks man
Yw
@coarse roost, I have found 5 results that match your search!
oop
so rvc wont launch
not sure what to do
ill send the log if anyone can help
Running with the runtime Python, Please wait.
Error processing line 1 of C:\Users\myuser\Downloads\RVC-GUI-pkg-20220525-mp3fix\RVC-GUI-pkg\runtime\lib\site-packages\distutils-precedence.pth:
Traceback (most recent call last):
File "site.py", line 169, in addpackage
File "<string>", line 1, in <module>
ModuleNotFoundError: No module named '_distutils_hack'
Remainder of file ignored
Error processing line 1 of C:\Users\myuser\Downloads\RVC-GUI-pkg-20220525-mp3fix\RVC-GUI-pkg\runtime\lib\site-packages\google_auth-2.16.2-py3.9-nspkg.pth:
Traceback (most recent call last):
File "site.py", line 169, in addpackage
File "<string>", line 1, in <module>
File "<frozen importlib._bootstrap>", line 562, in module_from_spec
AttributeError: 'NoneType' object has no attribute 'loader'
Remainder of file ignored
Error processing line 1 of C:\Users\myuser\Downloads\RVC-GUI-pkg-20220525-mp3fix\RVC-GUI-pkg\runtime\lib\site-packages\matplotlib-3.6.2-py3.9-nspkg.pth:
Traceback (most recent call last):
File "site.py", line 169, in addpackage
File "<string>", line 1, in <module>
File "<frozen importlib._bootstrap>", line 562, in module_from_spec
AttributeError: 'NoneType' object has no attribute 'loader'
Remainder of file ignored
Error processing line 7 of C:\Users\myuser\Downloads\RVC-GUI-pkg-20220525-mp3fix\RVC-GUI-pkg\runtime\lib\site-packages\pywin32.pth:
Traceback (most recent call last):
File "site.py", line 169, in addpackage
File "<string>", line 1, in <module>
ModuleNotFoundError: No module named 'pywin32_bootstrap'
Remainder of file ignored
Traceback (most recent call last):
File "C:\Users\myuser\Downloads\RVC-GUI-pkg-20220525-mp3fix\RVC-GUI-pkg\rvcgui.py", line 3, in <module>
from tkinter import filedialog
ModuleNotFoundError: No module named 'tkinter'
Press any key to continue . . .
@coarse roost, I have found 4 results that match your search!
- Uploaded: <t:1693353600:d>
- Likes: 0
300
RVC
Rvmpe
Hey does anyone know where this error would be coming from?
File "/Users/USERNAME/pinokio/api/rvc.pinokio.git/app/infer-web.py", line 496, in click_train
& set([name.split(".")[0] for name in os.listdir(f0_dir)])
FileNotFoundError: [Errno 2] No such file or directory: '/Users/USERNAME/pinokio/api/rvc.pinokio.git/app/logs/my-test/2a_f0'
I am using pinokio which help me to easily setup RVC on my Macbook Pro M1
has anyone managed to get Applio to run on Mac (m1) locally?
RVC Guides (How to Make AI Cover)
Translation by country
are there any angelic, holy, or echoy voices? sorry this is all very new to me
Ayo? @kindred rain level 1 !!! 
You could look on weights.gg and type the keywords in and see or use one of the bots or just look in the #1175430844685484042 channel
okay! thank you so much!
idk how to use the bots but ill look at that website thank you!
Np
does anyone know why i'm getting this when i try to run the go-web file in rvc?
(i closed the console by accident while running it the first time, but after deleting and unzipping the whole folder again it still doesnt do anything)
D:\RVC\Mangio-RVC-v23.7.0>runtime\python.exe infer-web.py --pycmd runtime\python.exe --port 7897
'runtime\python.exe' is not recognized as an internal or external command,
operable program or batch file.
i have python 3.12.2-amd64 and pytorch pip and nvidia cuda12.1 installed
python is a programming language code developers use, similar to other ones you might have heard of e.g. java, c++, ruby, etc
i was told its needed to run the program
is pytorch good?
pytorch is a library that some apps use for machine learning (basically stuff to do with AI) in tandem with python
Ayo? @frozen quartz level 2 !!! 
alr
Ayo? @red spire level 1 !!! 
can some tell me how to use conpiled version of applio. What should i have to download. .exe or .zip or both
i would like help with setting up the jcshlatt link on the google page. it doesn't seem to work
try with python 9.8
what error do you get?
uhh
Ayo? @sage seal level 1 !!! 
JSONDecodeError Traceback (most recent call last)
<ipython-input-11-20feb5d5be2f> in <cell line: 31>()
31 if os.path.exists(config_path):
32 # File exists, proceed with creation of creds and client
---> 33 creds = Credentials.from_service_account_file(config_path, scopes=scope)
34 client = gspread.authorize(creds)
35 else:
5 frames
/usr/lib/python3.10/json/decoder.py in raw_decode(self, s, idx)
353 obj, end = self.scan_once(s, idx)
354 except StopIteration as err:
--> 355 raise JSONDecodeError("Expecting value", s, err.value) from None
356 return obj, end
JSONDecodeError: Expecting value: line 1 column 1 (char 0)
is there at least a way to voice clone with rvc by tts? not ai cover.
and this is the link for the model: https://huggingface.co/analogspiderweb/analogspiderweb_RVC/resolve/main/JSchlatt-HD.zip
now it just doesnt respond
returns this instead when i try to run as administrator:
C:\WINDOWS\system32>runtime\python.exe infer-web.py --pycmd runtime\python.exe --port 7897
The system cannot find the path specified.
edit: nvm it works now, sorry for ping
working? i think you forget to copy python to path. and now you have fixed it ryt ?
are you using rvc on mac locally?
working now, installed python 3.9.8 and installed to path, then let go-web run for a while (also copied to path last time with 3.12.2, i think i just didnt let it run long enough)
you only really wanna play with:
- pitch or pitch all ( not sure how they implemented the transpose in here )
- index rate
rest leave untouched
index = accent, voice gestures of the model and such alike.
- more index, more style of your model you retain but can results in artifacts if the model isn't the best.
- less index, more features you take from input audio ( acapella )
in simple words, it's " do I want it to sound ( vocal features and accent wise ) closer to the model or closer to whoever sings the input / acapella "
0.3, 0.5 and 0.6/0.75 are typically fine. you rarely, if at all, ever go above 0.8.
You can also choose to not use index at all by setting it to 0.
Do 2 renders, one at 1.0 and one at 0.0 index, you'll understand better what it does.
when OT happens is the "step" number on tensorboard the same as number of epochs?
also if i am trying to continue training someone else's model, is there any way can i tell what sample rate their model has been trained at if they havent told me?
(i got the model off applio from the applio bot search and it doesnt seem to mention it anywhere)
steps compose epochs, so no
For sample rates, you can check the output audio
and you can continue training someone else's model if you have their d/g files and their preprocessed files
just tell them to send the model folder inside logs, that's probably safer
oor check the .json and .log files, iirc
either f0 extract and preprocessing have the sample rate in the beginning
wich are good e girl voices
i cant download anything like its just 0 percent all the time
same question
kiddan broo
Ayo? @hidden saddle level 1 !!! 
whenever i click start nothing happens
someone help
pls
command prompt pops up then disappears
RVC Guides (How to Make AI Cover)
Translation by country
use #🔍│help-w-okada
can anyone tell me how the appio overload sensor works? because it's a little strange that i set the workout to 300 epochs, but the sensor can be set to a maximum of 100 and it stops the workout, but it doesn't matter what i set it to, it can be 5 or 10 and it still stops. it wouldn't make sense to set a number for the workout and then the sensor detects if there is an overload and stops it. it's totally unnecessary to set a number
I don't think you should set a number for overtraining, but the program itself should detect when it are overtraining and stop himself.
do you know the problem i have??
no
okay
how do i fix the voice changer to stop echoin
enable "echo" and "sup2"
dint work and also it seems to stop like it just stops
@shaq since you told me to search GIGABYTE G5 KF-E3DE313SH " and found it to be 900 dollars or 1100 euros is it good for training voicees and i found it to be rth4060 is it good and enough?
yes any choice with rtx 4060 is great.
Ok thanks
Ayo? @quartz coral level 6 !!! 
which pre trainer is best?ov2 or rin_3
rin_e3 is super sensitive to noise
ov2 is good for shorter datasets
can i use it for longer dataset or for shorter dataset
longer
i think it works by seeing the loss and if it doesnt go down based on the threshold amount it stops training
if your model has mode collapse it prob breaks
how to use conpiled version of applio. What should i have to download. .exe or .zip or both
both should work
is it Compulsory to download both
or anyone of them will work
nope only one
which one is best
what is the index file and how do i get it
i trained my ai model some time ago and i didnt get the index file
anyone know where i can find voice samples for characters?
You mean voice samples for making your dataset to train the ai voice models?
I mean
Like YouTube
Where can I try to train a model?
I went on a side quest to make it (the default takes 44mins for me to install while this takes 4mins)
https://colab.research.google.com/github/Ran-Mewo/voice-changer/blob/main/Voice_Changer.ipynb
yea but they either have special effects or dont have every voiceline and its gonna take ages to splice all the voicelines together
44mins ?
Seems off to me, unless you're using the old one from the voice-changer repo ?
I think he is
Its not hina one
also if my audio is 44khz sample rate do i set target sample rate as 40 or 48khz
yeah for me google colab downloads at 2MB/s
idk if it's aussie or something
Ayo? @ripe jacinth level 1 !!! 
Doesn't this graph converge too quickly for just 1h of training data? Running 32 batches on 4090 so that might be why? https://imgur.com/kNmC0IJ
Btw you dont have to splice all the voice lines in 1 file or use more files, it doesn't really matter as long all of them are in a folder, and you could try to remove the effects via uvr
but 8m for a 1h dataset seems too quick
- I am paying for google colab so it taking an hour to install is like rip monei
also if i have a model that i want to train further do i just put the same model as model g and model d?
yes just look for these files D_2333333.pth & G_2333333.pth
only 1 model and 1 index though, neither state whether they are d or g
also btw with a 2:30 voice sample how many epochs do i go for? 500?
Ayo? @frozen quartz level 3 !!! 
check the logs folder, iirc that's where they are
i got the model off #🔍│find-models with the applio bot and it just gave me the model and index
⠀
Google Colabs 
⠀
AICoverGen-WebUI
Useful for making quick covers, by Hina.
AICoverGen-NoWebUI
Useful for making covers, doesn't inclued a UI, by Ardha, by Eddy, Hina and Gdr.
RVC Disconnected
To train new voice models, by Kit Lemonfoot.
EasyGUI
The OG interface, by Rejects.
⠀
use w-okada instead, also free
check pins in #🔍│help-w-okada for installation guide
alright thanks!
Oh I see sorry, that won't work for further training, unless the author shares the D_2333333.pth & G_2333333.pth files from the model's folder, you would need to train from scratch
Ayo? @haughty plaza level 1 !!! 
ty for help anyways though
actually one more thing, what do i set batch size per gpu to? im using a nvidia gtx 1660 super
oh and also at around how many epochs will the model quality start dropping off if im using a 150s voice sample (was thinking will train to 2k epochs or so, idk)
that's 14 GB VRAM right? I think it's okay to start with 16 and if it errors out just lower it to 8 maybe? Also for 2-5m it should be fine to train for 200-250 epochs
anything over 300 or so epochs and model quality will start going down right
theoretically yes but if you want to be 100% accurate I would install tensorboard and check as it's training #✨│ai-help message
im just gonna start training and go to sleep lol, in that case ill just set for 500 epochs and check tomorrow to find which one is best. thanks!
no worries good luck 
where did you take this
the epochs? #✨│ai-help message
lol, funny
i made that stuff 1 year ago
and still goes around
weird
If it's one year old I'm guessing it's obsolete 
well I'm just running tensorboard as I'm training so yeah thought it's really not a one size fits all thing. this is 1hr dataset converging at 8mins btw
⠀
Google Colabs 
⠀
AICoverGen-WebUI
Useful for making quick covers, by Hina.
AICoverGen-NoWebUI
Useful for making covers, doesn't include a UI, by Ardha, by Eddy, Hina and Gdr.
RVC Disconnected
To train new voice models, by Kit Lemonfoot.
EasyGUI
The OG interface, by Rejects.
⠀
oh I thought hina's one was outdated my bad
but I just looked at hina's code and it's basically the same without the colab quality of life stuff
like
# Install dependencies that are missing from requirements.txt and pyngrok
!pip install faiss-gpu fairseq pyngrok --quiet
!pip install pyworld --no-build-isolation --quiet
# Install webstuff
import asyncio
import re
!pip install gdown
!pip install torchfcpe
print(f"{Fore.CYAN}> Installing dependencies from requirements.txt...{Style.RESET_ALL}")
!pip install -r requirements.txt --quiet
``` (hina's code)
it's those dependency installing stuff that takes like so long to do
my colab thing has everything preinstalled as a 4.5 GB image
Anyone know how I remove echo?
hi guys,i dont know how to train a voice model,i search on youtube and down a file
i cannot find go web.bat
can anyone give me the file
all tutor tell me to down RVC1006Nvida
thx
lemme try
wait
same thing
Did you download the files?
Ayo? @spare gull level 1 !!! 
sad
My .PTH file is not creating, but the rest is fine, or I can't find it. Any ideas on how to fix this?
Is that all you downloaded?
no
i down the whole file
but i wait for it 10 min
nothing happen after the black window cames out
seems my pc hate me
I have created .index files "added_IVF923_Flat_nprobe_1_Test_v2" but that's all
Ayo? @brittle wing level 1 !!! 
i gonna do something crazy
@violet pecan, I have found 8 results that match your search!
let my pc become the original mode
Does it display any errors or is it empty?
Not there either :/
You did train, right?
nothing happen after i i open go web
yep
i even wait 10 min
nothing happen
my laptop
I can send you screenshots on DM what folders and files I have after training
You can post it here.
so i can just give up?
sad
@wispy burrow sir,is this mean my laptop too bad
but i can open the rvc voice changer
i have a wav.file,is that possible other ppl help me train
Don't you have anything in your weights folder?
The ModelName.pth goes in there.
Yuo may be missing something.
Do you have Python?
Ayo? @spare gull level 2 !!! 
Maybe I've messed something up here?
batch size too high for ur GPU, don't think it'll train like that
Batch size 16?
I have the same GPU.
I can't go higher than Batch size 2.
You very likely ran into an OutOfMemoryError.
alr i will change that
next time you click on "Train model" check cmd, as it can help to identify errors
go batch size 4, as I know 1660 super 6 gb is fine on that
It's a GTX 1060 6GB though.
An even older model.
same vram but idk if 1060 could throw OOM error on that
why the hell are you using harvest in 2024
Mine does...
I can't go higher than Batch Size 2
Yes...
just no
i kinda wanna disappear
17 hours for a 15 minute dataset...
17 hours is a very, very positive est.
1060 is a 3GB Vram, it can barely run okada, let alone training with cuda cores
rvc disconected how to create model
I'm new, first time using this 🫸 🫷
use the guides, trust me
anybody provide me regulations
@brittle wing is using 1060 6 gb
still a 1060
Ayo? @proven hill level 101 !!! 
vram depends on the variant but yes you're right
Last update: Mar 10, 2024
even a 24gb 1060 would still be a 1060
By the way, now that you're here.
I can't get llaria RVC Mainline beta to work...
Yeah...
okay lets troubleshoot it together
whats the error?
Anyone know how to connect rvc to discord?
follow guides
just follow the steps as written
this is what you actually need
okay 🙂
Ayo? @brittle wing level 2 !!! 
I don't know...
"Ilaria RVC is starting..."
Than after a short while it displays some text but it immediately closes.
I have no idea what it said...
open it with the cmd
Ok
It's the cmd that closes.
open the cmd inside the folder and paste
Ilaria-RVC-Launcher.bat
just prepare a dataset uf you dont know follow the guides
does it do the instant close thing?
if so try the video that gdr pinned in #🔍│help-w-okada but instead use ilaria rvc mainline's path
Ah...now it shows ModuleNotFoundError: No module named 'audio_separator'
use the Assistant to get the update
should fix
if it doesnt
open cmd again
and paste this
pip install audio-separator
after that
pip install audio-separator[gpu]
but again, with the assistant should work
or you can simply do
pip install -r requirements.txt
to fix every dependencies problem
sorry for asking but can you please guide me how to start learning Ai development ?
dont
for the sake of your mind
i think i have pinged a wrong person. 😰

you don't want to guide me that's why i said this. i think i have to find someone else. Sorry 😊
how can i guide you tho
i just said that please guide me how to start learning Ai development ?
i dont know where to start
dont creat a folder inside RVCdisconnect folder
put the zip file in rvcDisconnected folder, not in a subfolder
Hi, I am trying to use the local version of RVC, but when connecting to the web, I get "ERR_SSL_PROTOCOL_ERROR" in Chrome and the terminal says that "Invalid HTTP request received". Anyone met with same problem?
RVC Guides (How to Make AI Cover)
Translation by country
Ok I've done that now.
But now I've got:
ModuleNotFoundError: No module named 'importlib_resources'
pip install importlib-resources
Alright!
It opened Ilaria RVC.
Oof...Red over Brown is hard to read...
Red over Brown?
wdym
The brown buttons with red text are pretty hard read.
theres no brown button and no red text..
im struggling to find an avatar for my custom voice can someone help._.
...My text is red, not orange...
Ah that solved it.
Looks much better and easier to read.
Thank you.
I created a model, but for some reason when I want to make a cover, the site says that the file is not zip, although the model and the index are in the zip folder
no problem, i need feedback, if you want you can DM me !
I really hate that the bot responded with the pages I follow to install it and there is no mention of my problem
where do I download rvc
define RVC
real time voice changer ?
or the "real" rvc to train models with ?
The Mainline branch of Original RVC
Ayo? @trim escarp level 1 !!! 
make sure to connect via http and not https
Yes, both http and https conenctions give the same result
System time is also okay
Oh, seems like a browser issue, it seems to work on Edge
ive been trying to get into the ilaria rvc but whenever i do the run part it just shows me this
Is rvmpe the best or is that just because it's faster with decent quality?
I would like to clarify a question: how to use artificial intelligence to create music using AMD's video cards? I'm using Gradio and it's using my processor to run the model, which is taking too long. Is it possible to use the video card instead of the processor?
I created a model, but for some reason when I want to make a cover, the site says that the file is not zip, although the model and the index are in the zip folder. What should I do?
what rvc did you use
Ayo? @modern marlin level 3 !!! 
did you do it locally
no did you do it on google colab or using your pcs resources
ok
-help
-help
-rvc
Full AI Voice Model Training Guide (Local)
Link: YouTube
Credits: Christopher Villanueva
Model training with Mainline RVC
Link: Rentry
credits: Raven (ravencutie21)
AICoverGen Colab Guide
Link: Google Docs
Credits: Eddy (Spanish Helper)
Create a model with RVC disconnected (colab)
Link: Google Docs
Credits: Angetyde
How To Make an AI Cover With Ilaria RVC
Link: Rentry
Credits: 👽 Julia (ailen2091)
ive been making audiobooks but the time it takes varies wildly from a minute and a half to five minutes similar length text so why is it settings or sumthing
does having a really long clean dataset make the model better or worse
long as in about 30 minutes
the voice-models tab wont open for me
Ayo? @wise wind level 1 !!! 
no no it doesnt make things worse as long as its well cleaned, the max id suggest for making a dataset is 1 hour as after that u dont really hear changes
please be more specific
i mean 30 mins is good
there isnt really an average tbh
some models can be good even on 10 mins or 50 mins
what mostly depends is quality over quantity
for a good model u require 5 or more mins (i dont mean u have to throw ur dataset, its good u have a 30 min one)
@low shard if fix e#it nevermind sry
oke dw
does denoise work for keyboard noises in the background?
why does this not work
Ayo? @pure lance level 1 !!! 
i already replied to you in https://discord.com/channels/1159260121998827560/1224832529807446056
please check that
Hellooo !
Can someone please help me?
Basically when I try to use RVC, there are the normal stats at top left corner (vol, buf, res, rtf) and for some reason I couldn’t hear myself or record anything. Then I noticed the vol thing was at 0.00000 too and idk what to do honestly:)
srry mb
dw its fine
Anyone ? :p
are you using the realtime voice changer wokada?
Not sure what you mean by wokada
Ayo? @balmy creek level 1 !!! 
I’m really confused rn I just got it lol
are you using the tool for ai covers or realtime voice changer
Realtime
I’m on the realtime voice changer client rn
yea i think what you are using is called wokada, for that please ask in #🔍│help-w-okada or #1192011222023950368
Alright
-colab
- Applio, by IA Hispano Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- easyGUI, by rejects Google Colab
- Applio, by IA Hispano Huggingface Spaces
- Ilaria RVC, by thestingerx Hugginface Spaces
- RVC-HFv2, by r3gm Huggingface Spaces
- AICoverGen, by r3gm Huggingface Spaces
- Advanced RVC Inference, by r3gm Huggingface Spaces
- RVC v2 Huggingface version, by Clebersla Huggingface Spaces
do dataset clips HAVE to be under 10 seconds?
RVC Guides (How to Make AI Cover)
Translation by country
no no, usually for a model is reccomended the dataset is atleast 5 mins for a good one
i mean like
the cut clips
if u have a 5 min dataset does it have to be cut into small 10 second clips
nope, it doesnt matter if you cut your dataset into lil clips or put it all into a single wav file
yea but i dont want silence
its your choice
for that you can read this https://docs.aihub.wtf/rvc/resources/datasets/#cleaning
Last update: Mar 8, 2024
10 HOURS???
ON M1?????
what the hell
what fork r u using
oh, so original rvc?
damn
😭
damn
honestly i suggest using replay in this case, dunno much about it but people seem to like it, and it's the best one for the Apple Silicon Macs: https://www.tryreplay.io/
Might be RVC issue. M1 can easily run W-Okada so idk
should i train in v2 or v1
realtime conversion with rvc
v2
is v2 always better
yeah
What program should I use to play RVC voices?
what batch size should i use (using an rtx 4060 8gb vram on a 21 minute dataset)
8
damn i did 7 is that okay
can i just cancel the training once it reaches a checkpoint then change it?
is 500 epochs on a 21 minute dataset overtraining
i think i mightve messed up
I tried with my voice and it was minimal between 250 and 500 id do 250.
yo bandit
im so bad at detecting overtraining
tu a réussi a creer ton model ?
Did you manage to create your model?
Think of it as a strategy game. I don't think you even have to look at mel that much #1213509354343637065 message
only g/total, d/total, kl
im taking a look at my previous models when i didnt know overtraining existed and im just really confused
the thread inside explains it all. I don't see how it can be confusing since you're looking for graphs that are going up indefinitely
No 500 was too much and I don't know how to use trained models with colab im not sure how to upload it because it requires a link to huggingface plus it's my own voice which I don't want used
idk how to use d total to detect ot
it's not possible I made models with 5 minutes of dataset at 600 epoche and it was good
The shorter the Dataset the more Epochs.
500 is usually too much.
can anyone help me switch it so it uses my gpu insted of cpu it just says gpu dml
Ayo? @native scaffold level 1 !!! 
How many minutes of dataset do you think you need for 500 epochs?
what is complicated is that 500 epoche does not mean a quality model...
Ayo? @hazy thunder level 6 !!! 
I'm like you, I can't understand when to stop training so as not to be overtrained.
is it the higher the epochs the better?
no
the shorter ur dataset the more epochs u use
dont be an idiot and make it really high
oh
cause im an idiot and i made it really high and now my shit my overtrain
ok good to know lol, so lower is better cause more dataset
what about an hour dataset
500 epochs
ok
u could do 600 but that might overtrain
is this an OT??
yeah the step value should tell you like 32.5 for example
if it goes down while its going up
then is it overtrained
like goes down by a bit
Yes past that red circle is overtraining
it just dipped down again
can I see?
no, it's done
alright
how do i turn the checkpoint file into an actual model though
cause ive set it at 500 epochs and now it overtrained at 350
Look at your step count before you stop the training
yea then?
then when you stop it, it gets saved in your weights folder under Mangio RVC/fork
oh you are using local? Idk. I only used rvc disconnected colab so I can't help you there
yea i am
which stop training button
in the rvc web gui?
how do i know
it just says rvc webui
think its the mainline one
alright
am i gonna have to redo the training from the closest checkpoint?
increase or decrease
yea
ok i understand thanks
Help
I try to make ai cover but it just shows ERROR: Could not open requirements file: [Errno 2] No such file or directory: 'requirements.txt'
And I don't understand code
Android
Motorola
😦
Ohh Google
I also tried using my ipad
I do
Hi, Idk if this has a quick solution, but here is the thing: everything works well, but then, my mic from the headset is picking the desktop sound (so if I play a video while having it activated, the video count as a voice, so I do hear the conversion too besides my voice)
?
Ayo? @fresh mantle level 2 !!! 
where do i use a .ckpt?
Bro
WhAt
Tell us the name of the google colab
I'm not that experienced with this stuff so give me time
AICoverGen_colab.ipynb
Idk
Which link is it
Share
You maybe using an old version
-colab
- Applio, by IA Hispano Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- easyGUI, by rejects Google Colab
- Applio, by IA Hispano Huggingface Spaces
- Ilaria RVC, by thestingerx Hugginface Spaces
- RVC-HFv2, by r3gm Huggingface Spaces
- AICoverGen, by r3gm Huggingface Spaces
- Advanced RVC Inference, by r3gm Huggingface Spaces
- RVC v2 Huggingface version, by Clebersla Huggingface Spaces
Ok
RVC Guides (How to Make AI Cover)
Translation by country
The percentage become lower and lower
Is that any problem??
Or it is normal
Pls someone help me
Hello, did someone ping me?
hi :)
:)
How can I use voice-lines for voice.ai
Since voice.ai isn't partnered with AI HUB, i do not know, refer to their support instead:
https://support.voice.ai/hc/en-us
or their discord :
https://voice.ai/discord-invite-tb
The most advanced Live Voice Changer & Voice Cloning tool for Discord or any other software running on a PC. Join Now! | 185696 members
can somebody help me on w-okada, I only hear a robotic voice
I'm currently using
Ryzen 2600x
Radeon 6600
uhm
this video shows
you can use ai hub x voice.ai
oh, you want a custom model
yh
Ayo? @hybrid rivet level 1 !!! 
upload the zip 😭
np
is there any better free* alternetive for voice.ai
w-okada
how do i download it ;)..
blud pinged 5 people for that
damn sorry
my bad
just copied from the guide 😭
i just downloaded applio and i have on the drive standalone versions of rvc and rvc for training models, so i saw applio have these functions, these work like them? better or worse?
is the sense of keeping them all?
help me pls
Ayo? @sharp osprey level 1 !!! 
TypeError Traceback (most recent call last)
<ipython-input-2-2952be55a264> in <cell line: 56>()
56 if param_link == "":
57 paramset = requests.get("https://pastebin.com/raw/SAKwUCt1").text
---> 58 exec(paramset)
59
60 clear_output()
<string> in <module>
TypeError: VoiceChangerParams.init() missing 1 required positional argument: 'whisper_tiny'
Can anyone assist me with the VCC? I'm having insane delays and am curious on how fix or speed it up
Ayo? @lapis fox level 1 !!! 
yeah
Deecho-dereverb is crazy icl
what audio length is that? 
Well, this is... Quite the predicament.
Traceback (most recent call last):
File "threading.py", line 980, in _bootstrap_inner
File "threading.py", line 917, in run
File "C:\Users\[CENSORED]\Downloads\RVC1006AMD_Intel1\gui_v1.py", line 733, in soundinput
with sd.Stream(
File "C:\Users\[REDACTED]\Downloads\RVC1006AMD_Intel1\runtime\lib\site-packages\sounddevice.py", line 1800, in __init__
_StreamBase.__init__(self, kind='duplex', wrap_callback='array',
File "C:\Users\[UNKNOWN]\Downloads\RVC1006AMD_Intel1\runtime\lib\site-packages\sounddevice.py", line 898, in __init__
_check(_lib.Pa_OpenStream(self._ptr, iparameters, oparameters,
File "C:\Users\[DONTHACKMEPLSTHANKYOU]\Downloads\RVC1006AMD_Intel1\runtime\lib\site-packages\sounddevice.py", line 2747, in _check
raise PortAudioError(errormsg, err)
sounddevice.PortAudioError: Error opening Stream: Illegal combination of I/O devices [PaErrorCode -9993]
It also stopped responding altogether.
Basically crashed.
Classic RVC realtime moment
I was going for a more optimized performance setting because of my graphics card, so that's why the sample length is at minimum.
So that's the opposite; you want it to the max
I was doing it mainly as a test.
Again, I have a terrible GPU, so I set it to minimum as a test.
Yeah, that has the inverse effect though
Error log, I wouldn't know
The realtime module has never behaved properly on my end, the device management stuff has always been iffy
Then again, the log has information that senior software engineers have instead of me.
I will never understand this stuff, so I need someone to explain it to me.
thats for the index
That did absolutely nothing.
Full error (screw censoring my PC name)
C:\Users\Gamer\Downloads\RVC1006AMD_Intel1\runtime\lib\site-packages\torch\nn\utils\weight_norm.py:25: UserWarning: The operator 'aten::_weight_norm_interface' is not currently supported on the DML backend and will fall back to run on the CPU. This may have performance implications. (Triggered internally at D:\a\_work\1\s\pytorch-directml-plugin\torch_directml\csrc\dml\dml_cpu_fallback.cpp:17.)
return _weight_norm(v, g, self.dim)
Exception in thread Thread-1:
Traceback (most recent call last):
File "threading.py", line 980, in _bootstrap_inner
File "threading.py", line 917, in run
File "C:\Users\Gamer\Downloads\RVC1006AMD_Intel1\gui_v1.py", line 733, in soundinput
with sd.Stream(
File "C:\Users\Gamer\Downloads\RVC1006AMD_Intel1\runtime\lib\site-packages\sounddevice.py", line 1800, in __init__
_StreamBase.__init__(self, kind='duplex', wrap_callback='array',
File "C:\Users\Gamer\Downloads\RVC1006AMD_Intel1\runtime\lib\site-packages\sounddevice.py", line 898, in __init__
_check(_lib.Pa_OpenStream(self._ptr, iparameters, oparameters,
File "C:\Users\Gamer\Downloads\RVC1006AMD_Intel1\runtime\lib\site-packages\sounddevice.py", line 2747, in _check
raise PortAudioError(errormsg, err)
sounddevice.PortAudioError: Error opening Stream: Illegal combination of I/O devices [PaErrorCode -9993]
I can't read Japanese.
I can't either
I looked at the code, and it seems to be related to how it decides to pick the settings for the audio devices
Perhaps it'll allow it to not screw itself up
Nothing changed.
¯_(ツ)_/¯
select MME type on output
has to be matching
Oh yeah...
start sample length on 0.50 or tbh higher and work your way down after it works aswell
Oh, it's perfect!
For a GPU that's 12 years old...
It works!
I am so grateful for your help.
Nice
So... How would I be able to hear what I say?
Probably can't.
If that's the case, then whatever.
I think I can handle it.
you need virtual cable, go to the help wokada channel, pinned message open the rentry guide, go to step 6 download VAC
I have an AMD card.
That has nothing to do with virtual cable
I also don't have any lines.
Line 1 is the default name of the VAC virtual cable
if you dont have it do what i said
Ah you have the other one
Wait...
then use that
There's two of them?
yeah
Huh.
Installing it now. I'll have to restart after this, so I'll be back.
Wait, which one do I use?
vac doesnt require restart pc but you can
64 without the a
Alright.
I forgot to mention: I meant to say, "Hear myself whilst using it in-game."
Ah this rvc doesnt have that option. youd need something like voicemeeter banana