#🧬│ai-chat
js put my new juice wrld ai in #1159290752195633273 🫡
How to put files into eleven labs?
It only gave me index and pth files
No audio files
RVC models and 11labs are 2 different programs
U can't use RVC models in 11labs nor the other way around
RVC is Speech to Speech, 11labs is text to speech
applio supports eleven labs as input for tts to rvc
You sure? Applio TTS uses Microsoft Edge TTS API
Documentation for a high-quality, open-source speech conversion ecosystem designed for simplicity and optimized performance
I mean there's a plugin
But it's still not natively using them
It's just using an 11labs model to make a TTS file audio, then using it as an input in RVC, then converting that to the RVC model
Hi, I made a clean audio track in Audacity (.flac). How do I get the .index and .pth files from it? I can't find this anywhere on the internet. Thanks in advance
Hi
there is a addon that allows Elevenlabs support
Yes I replied there's a plugin for that but it's still not native
And also seems like the guy wanted to use an RVC model in 11labs
yes it is its linked on the applio docs under advanced and is made by applio creator
elevenlabs for applio tts
either way still does the same thing
We got another one in the U.S just recently because of port people doing work strikes
I have two AI-generated audio clips from Suno. I'm looking for someone who can help me copy the style of both the music and the vocals from one of them and create a special voice that I love, with similar yet different lyrics.
Who can help me with this?
If you manage to get me to 150 instagram followers before the end of the year ill help
why not just do all with suno
No shit lmao im getting desperate because my old account with like 300 got banned
Ayo? @vivid flicker level 1 !!! 
I'm trying to get the exact style of one clip
no one is going to get you insta followers
Ayo? @magic hazel level 1 !!! 
someone just ban this guy
Ayo? @vernal abyss level 2 !!! 
then use suno to do it all
I tried and its not working and there's no way I'm buying premium on suno.
if you’re gonna promo do it in #1159290752195633273
Nah I feel like he should get the Russian only chat punishment
not a bad idea
Thats a good idea
You saying that means you don't know how ai hub Russians are.
Lmao its fine
o-kada is just not opening
which one the fork or the orginal
Yall mfs get mad when i self promo lmao
lol, my anti virus was on.
you're fucking annoying and nobody cares about your instagram. what, do you think people just look at one channel forever and so you have to send that fucking garbage in every single channel?
how about zero?
hello. does anyone have ChatGPT Plus and have multiple people using their account?
what is the point of doing this here is the good question.
and will i get banned if that happens
why
The real question is if yall mfs were so ignorant and like yall say "dont care" why tf reply to me and keep this thing going?
because it's a product for sale? and you'd be giving it to people who didn't pay for it? not like they're ever gonna be able to find out anyway
i want to get chat gpt plus and have multiple people use my account
Karen lmao, you the type of mf to not share your netflix password so you dont get banned
but i dont want to get banned . i dont think they will ban me bc i will just call my bank and get my money back
Ayo? @long jasper level 1 !!! 
how do i make money from AI songs?
useless
what do i do if i can't hear myself talking with the voice changer?
Because it's funny your responses
Don't if it uses someone else's voice then you can't.
I'm the person to use a service that has all movies and tvshows in highest quality for cheap sub and then share it with my whole family so it's 40 every 6 to 8 months instead of 20 or more a month
You are the type of person that gets mad at people that pirate stuff.
I know reading and context aren't your strong suit, but I was saying why it's against the terms of service, not defending it. I don't give two fucking shits what you do with your stuff
bro u guys are sad. focus on making money or sum shit cuz u shouldnt let ppl waste ur time and emotiions like thia
cya
who cares about terms of service if they really can't catch you.
exactly, it's all fluff
Yeah I do make money it's funny these people's responses
it's like the AI companies trying to put conditions in their models to not train on their output - it's wishful thinking
Why would you train a ai using ai that would just give a bad result.
you can kind of supplement a large enough dataset with some synthetic data but yeah it's really pretty shit and not a good idea
Not really
doesn't embed, but there's been at least one paper on this subject that was p interesting
Model collapse is possible. I see it happen with deepfakes and stuff when people train too long on not-so-good data
of course model collapse is possible, all i was talking about is that you can mix some synthetic data with a large enough real dataset before synthetic data becomes a fatal issue
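A minimal sketch of that mixing idea, assuming the dataset is just a list of clip paths. The name `mix_datasets` and the default cap of 0.25 synthetic clips per real clip are illustrative assumptions, not values from any paper or from RVC:

```python
import random


def mix_datasets(real, synthetic, synth_per_real=0.25, seed=0):
    """Return all real clips plus a capped random sample of synthetic clips.

    `synth_per_real` is the maximum number of synthetic clips allowed per
    real clip (an arbitrary illustrative cap, not a recommended value).
    """
    if synth_per_real < 0:
        raise ValueError("synth_per_real must be non-negative")
    max_synth = int(len(real) * synth_per_real)
    rng = random.Random(seed)
    # Never ask for more synthetic clips than actually exist.
    picked = rng.sample(synthetic, min(max_synth, len(synthetic)))
    mixed = list(real) + picked
    rng.shuffle(mixed)
    return mixed
```

The cap is tied to the real dataset size, so a bigger real dataset can absorb more synthetic clips, which is the point being made above.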
yeah
do you mean mode collapse? in RVC context, it shows up as a few downward spikes in the loss/g graph, dipping roughly 30% or less below the usual level. possible causes are a poor-quality or improperly processed dataset.
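A rough sketch of how such dips could be flagged, assuming the loss/g values have been exported as a plain list of floats. The rolling-median baseline, the window size and the 20% threshold are all assumptions made for illustration; none of them are part of RVC:

```python
from statistics import median


def find_downward_spikes(losses, window=20, drop=0.20):
    """Return indices where the loss falls more than `drop` (a fraction)
    below the median of the preceding `window` values."""
    spikes = []
    for i in range(window, len(losses)):
        baseline = median(losses[i - window:i])
        if baseline > 0 and losses[i] < baseline * (1 - drop):
            spikes.append(i)
    return spikes
```

Using the median rather than the mean keeps a single earlier spike from dragging the baseline down.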
i'm not talking about mode collapse, no. just generally shitty results, and more so applied to generative models as a whole than specifically RVC
my results aren't bad either, was in relation to a mention of stupid ai company TOSes earlier
ty regardless tho
mb if it sounds a bit ambiguous to me, anyway here's another thought: #1220364005034561628 message
the conclusion is to avoid including synthetic/augmented data, ideally
No really ai is good for messing with images to make them better for training and audio also just not full ai currently.
Guys I have a really simple question: when editing music in external apps, what's the best way to cut a large part of the song out without leaving a blatant split that everyone will notice?
Hello everyone, trying to learn w-okada here, does anyone have a microphone they suggest using. Preferably at a cheap price since I'm just getting started really. Any help would be appreciated, please and thank you. 
decent two-digit buck headsets are good enough if you don't care on audiophile grade quality
I mean... trying to learn the software here, and if possible download the resulted audio cause I wanted to use it for a story thing (Fanfiction really) so... does that work well enough (My headset doesn't have a microphone so)?
I'm sure there are cheap headset mics you can find
Actually question, can you possible record yourself with the Ai voices and if so how to
Ayo? @idle sandal level 1 !!! 
obs or audacity with w-okada
convert the recording using non-realtime RVC
Mhmm, oki going to try Audacity, if I have questions can I ask it here?
yeah that is the best way to do record voice then use non realtime.
Ayo? @glass copper level 1 !!! 
Audacity or any audio editors (izotope RX, adobe audition, etc)
what was the website name with the old rvc models?
Blavlavka
Yes
actually?
Asd
what
-docs
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
check the docs for tutorials everyone
Hi
Your pee is so red that it's green
Hello dev bros. Having an interview in few days.
NEED HELP!
I have 3 years of experience in AI/ML. I have mostly worked with classical algorithms & neural networks (ViTs, CNN hybrids, EfficientNet, GNNs, PyTorch, Capsule Networks, ONNX, etc.). I will be interviewing for an ML Research Specialist role. The team is working on ML applications for face detection and recognition on drone cameras.
Ayo? @neat hare level 2 !!! 
Native means that it can actually use models instead of having to use an input file which is not the best as RVC is made for speech to speech not text to speech
yea here its not about this, its about wokada & rvc, most people are are prolly underage lol
You can search rvc ai voice model at:
- #1175430844685484042
- #🔍│find-models
- https://weights.gg/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://applio.org/models
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.aihub.wtf/essentials/how-to-make-voice-models/
Yeah but text to speech using Elevenlabs then that to the voice you want using rvc
yea but not native as what the guy wanted to do
every promo out of #1159290752195633273 will get deleted.
im just saying its not the best in quality thats all compared to an actuall tts program
yeah what the guy wanted would never happen you can use Elevenlabs to clone a voice but not rvc.
Yea its 2 different programs
Yeah I get that
👍
👍
👍
Hi everyone,
I am working on document classification (for the legal domain) on large PDFs of 100 pages or more.
Each large PDF is a single file containing several different types of legal documents.
How could I classify the different documents inside a single PDF?
Please suggest any approach you have.
sorry if this is a dumb question but what exactly does index do?
its the file containing the accent of the voice model
I suppose OpenAI came out and crushed the competition for tts
I mean, it's exactly what Bark dreamt of being.
gpt-4o-realtime
a speech model that you can exactly prompt it the way you want it to say things
is this legit??
i mean there are also good open source tts
hi
anyone know of a good deepfake server? If I'm not wrong deepfacelab still produces the best results, but I haven't found many tutorials on how to get it to work, and I've had issues with the ones I have found.
its all on their site btw https://www.deepfakevfx.com/downloads/deepfacelab/
if u want a more easier way theres facefusion which requires just an image and not actual training, but the results arent as good
e
I am sorry but none can come close to an actual voice actor that you can give directions to
which is where the new model is at at the moment
My discord is the best deepfacelab discord
Deepfakevfx is not good: their facesets are not the best, their pretrained models are not that good, and their guide is bad.
Thats the official deepfacelab site afaik
i mean, gpt so vits is good quality tho
it's not the official one that is just a website someone made for deepfacelab
Ilaria RVC: CLICK HERE 🤗
Guide on how to use it: CLICK HERE 📝
Don't forget to thank Ilaria if you find it useful! 💖
hello everyone
how are you all
will Fenerbahçe be champions this year
I think they will
do you think they will? I need to reach level 2
come on
yes
no
Ayo? @flint yacht level 1 !!! 
we'll get there, we'll be level 2
almost there, I'll make it
I came here to achieve my goals
I'll make it
brother Hasan, I'll make it
my mom will say "my son made it"
right now, to reach level 2, I'm talking with what I'm guessing are South Korean kids
I'll reach level 2, mom, don't worry
don't be afraid, your kid will make it
I'll make songs like Khontkar, I made it mom
come on already bot, please make me level 2
I've got things to do and I'm waiting on you, this is rude
there's also the 5-second rule, I'm waiting on that too
friends
this is no time for jokes
guys, I want to be level 2 already, please
I'm begging you
guys, lend a hand, let me get to 2
come on already, I'm really sick, I've got the flu
my throat hurts and I'm waiting on you, this isn't like you
I want to be level 2222
come lie down in my heart
Stop doing wall of text not in english
Ok
Guys what is wrong with my w-okada voice changer? It is not working properly, there is so much noise even though there's no sound
.
is your mic bad and picks up a bunch of noise
When i normally speak it works very good
so when you are not talking it's bad but once you start you are saying it's fine
enable Sup2 and raise threshold as needed
No like when i use mic normal way it sounds good but when i use okada it is going berserk
Greetings
yeah because background noise probably
ah
ask in help channels
ye just took care of it ty
Ayo? @bold gazelle level 1 !!! 
hey. i would like to know how i can train and make voice models.
Training is making models
What's ur PC gpu?
@hexed gust I deleted the promo post you did in #1192011222023950368 , use only #1159290752195633273
rtx 3060 12gb
oh okay
Alr that's good for local (on ur PC)
how do i train a model?
Ayo? @idle urchin level 1 !!! 
i have no experience in training models.
In the context of RVC, the dataset is an audio file containing the voice the model will replicate. It can be either speaking or singing.
alright
thanks!
5gb?!?!
what even is VoiceConversionWebUI
it will take me weeks to download that
The program name
RVC = Retrieval-based-Voice-Conversion
-rl
Ayo? @proven python level 2 !!! 
-rt
This interaction has expired, use the command
/guides realtime if you wish to see it again.
hello everyone
does anyone know why that whenever i try to use the voice changer or just talk, i cannot hear anything come out?
Why are my parents so on my ass for smoking, I'm 19 bro 😭
monkey
do your parents also smoke
My dad does but my mom quit
idk where i was really going with it but sounds like they probably dont want you to be like them lol and also smoking isnt good for you but do whatever
I'm not "starting" I used to smoke at 16, so... I'm just being open about it now, and not hiding it
hope i dont ever smoke
cigarettes that is, i dont think i could care about something mild
True. If you for some reason just want to try nicotine out, and are dead set on it, just use snus or chew tabacoo instead. It's thousand times healthier
Do not smoke or vape
I used to smoke like half a pack per day but now I'm back down to just 2. So im making progress at least
my parents are chronic cigarette smokers
Ayo? @tacit falcon level 3 !!! 
LIke my comment if you are not gay

hello
How did tyga ever live down "stimulated"
Rapper with 21 million monthly streamers making songs, bragging about having sex with underage girls while 25
wgat
It's as bad as it sounds

I feel like he should be investigated or something, idk
Damn, shits kinda beautiful ngl
real
can anyone give a good explanation on how to use this
Ayo? @winter comet level 1 !!! 
weed?
such a problem, the voicechanger processes the voices of everyone who speaks in the discord, how can this be fixed?
oo
Yes, it's resin afaik
The Vicki leekx version is so much better than the one on matangi. Just hits much harder & is much more gritty
why is miss chuyin weilai lowkey serving
Nah this one tho yes
wgat
<
anyone knows how to get amanda silvera model?
what vn is this?
asking for a friend 
hi
:3
Support the dev and get it from patreon
The itch io version is outdated by multiple years
It's super, super, super great, play it
Recently, I built Paint Chatbot using RAG AI, next.js and Python
If anyone need my help, please ping me.
And play yume 2kki too. It's not a furry vn but it's genuinely the only game I enjoy playing
Game has no goal or objective or anything, it's just about exploring and finding cool stuff
i don't have a bank account to use on patreon yet lol
Well, there's a certain site that I can't mention that's a archive of paywalled content
Hey guys, does anyone here have experience with Replika AI?
oh yea. "that"
how come the catbug voice model doesnt let you click on it to download?
There's a catbug voice model now? Lol
well it wont work though
when i click it says invalid password or name
its old though so idk what happened
Ah gotcha. I'd assume either support for it ended or some code broke and the dev is working to fix it maybe? I'm unsure who the dev is/and the status of the app
Ayo? @left cedar level 1 !!! 
someone named fiction is the dev i believe
Ayo? @forest elbow level 1 !!! 
They haven't been active here since early August this year so I'd assume the support probably ended for the bot
but then how am i supposed to become catbug
You can't ;-;
thats not true and i wont believe it
if i make a model request, what are the odds someone who knows how to do it actually makes a catbug one?
Probably a decent chance
I can't imagine it's hard to code
if you want someone to do it for free there's a small chance someone might help (not guaranteed that someone will help tho)
i hope someone does the old one sounded so good
that model is still on weights.gg if you want it
fucking furryfuckers
are you sure?
Ayo? @forest elbow level 2 !!! 
Pls make a ecco2k model
Ecco2k shook his ass at a concert onetime and everyone thought it was playboi carti for some reason
thank you so much
yw
iirc the old models were archived onto weights a while back, so some old models should still be on there (unless the creator removed them or they weren't archived)
Also you have a cool name :)
hey man thank you alot, yours is sick as well
My actual name is Kafka, I just go by Kavade online :)
Would you like to be friends?
dude kafka is also a sick name, where is that name from?
Czechia
KAFKA HAHAHA
im sorry bro but i find that hilarious
not because of the name itself
What's wrong with it? Lol
there's nothing wrong with it per se
but there was a famous novelist/writer named kafka
Franz Kafka
pretty sure he was born in prague actually
Oh fr?
Ayo? @left cedar level 2 !!! 
yea
i only find it funny because in school, they had us analyse one of his most infamous novellas, called "metamorphosis" over and over
fever dream of a story
I don't love that novel
Omg carti reference
It's..weird
Carti is overrated
Straight to my blocked list

Ayo? @floral pasture level 1 !!! 
His only "decent" songs are Magnolia really
To each their own really tho
👋
no? 😂
That's my opinion lol
I just don't really like his style or voice all that much. He's definitely a talented rapper
I like NBA YB more tho
okay thats fair
Ayo? @final rivet level 1 !!! 
tell me the best female voice, so that it looks like a real one
Oh lord
omg, wut's wrong?
subjective af!
you understand how ridiculous of a question this is right
Ayo? @vernal abyss level 3 !!! 
hi
How do you make the covers? I already got the model but I don't know how to do it
Hi! How are you?
my bio 
-hf ilaria zero
Suggestions for @earnest pagoda
- UVR5 UI, by Eddy and Ilaria Huggingface Spaces
- Ilaria RVC Zero, by thestingerx Huggingface Spaces
- RVC⚡ZERO, by r3gm Huggingface Spaces
- Applio, by IA Hispano Huggingface Spaces
- 🆕 FaceFusion UI, by Nick088 Huggingface Spaces
and then weights.gg or this discord for models
whats ur pc gpu ?
Server more fucked up than a whore's anus
hi does anyone have any recs for websites with ai voices for yt voice overs? i've tried 11labs already, but yall know of other alternatives that dont require any payment
There are different Text To Speech (TTS) AIs:
GPT So Vits: better than RVC for tts. It's few-shot (models need just a lil training), but it can't use rvc models (and vice versa), and it's limited to English, Chinese & Japanese. If you wanna check it out, read https://docs.aihub.wtf/tts/gpt-sovits/
Freemium 11labs: An easy way to do TTS is https://elevenlabs.io/. You can't use RVC models on it, but it's an easy, mostly premium way to get good quality TTS
FishSpeech: a 0-shot (no explicit training needed) TTS. If you got a good pc you can use it locally, else use their site
With RVC Models:
RVC is natively for Speech To Speech, but forks such as ilaria rvc mainline & applio have built-in tts: they use Microsoft Edge TTS to generate the tts audio and then convert it with rvc. I suggest choosing a tts voice with the same gender and language as the rvc model you wanna use
If you wanna do tts locally with RVC Voice Models (if you got a good pc):
If you don't got a good pc you can do tts with RVC Voice Models on cloud:
- Ilaria RVC Zero (running on an A100 GPU, the fastest free rvc on cloud) and the guide
- Use Applio UI Colab (with the google colab free T4 gpu, daily limit)
- if you don't wanna use edge tts, you could try another tts ai from our tts index and use the output as an input in rvc
-realtime
No CUDA GPUs are available
Ayo? @rigid moat level 1 !!! 
on gradio. what do I do to fix it?
@gray rover sorry for the ping but are u porting your RVC fork to colab / lightning.ai someday? :>
Anyone have a good working colab link for rvc?
Ayo? @iron patrol level 11 !!! 
training or inference?
thats called inference
id suggest using ilaria rvc zero, a zerogpu huggingface space which is faster than google colab free gpu
if u really want to use google colab instead, use https://docs.aihub.wtf/rvc/cloud/applio-colab/
Last update: June 15, 2024
Is the huggingface one better than colab?
yes, its a zerogpu (A100 gpu paid by ilaria, u wont pay anything and dont need to duplicate it) huggingface space, A100 is 11 times faster than google colab free gpu (T4)
Oh right on and is it regularly updated?
Cause the colabs iv used seem to stop working within a month or more
yea its all working and updated
Right on thanks guys appreciate you both
its normal, the rvc hype is kinda dead, and there can be somethings that stay out of date, also because google colab changes internal things sometimes and there are some old devs
yw
Yeah and well I rarely do ai covers anymore just because my last youtube channel that was full of them got terminated for copyright so I'm doing ai covers rarely to avoid it
yea unfortunately copyright doesnt allow ai covers
as u see there arent as many as it used to
The influence of meritocracy on institutional governance in Tunisia: structural limits and social realities
Interesting
very 
don't have any plans
okie dokie, i'll wait until someone does it :p
Hello guys 👋
I've been wondering if OpenAI does end up going bankrupt, will this have any major consequences or will that just leave a vacuum for shit like Grok & Gemini to fill?
hi
asking for the impossible to happen
Don't Click!|| if you are reading this, it is already too late. You have been infected by the curse of pee pee poo poo man. If you don't copy and paste this on 5 different servers, you will face the consequences. I was a victim like you, trying to be free.|| i warn you bro
whats the best way to train song models and how do i make the datasets (they use a lot of autotune)
i usually use rvc
are ckpts safe
Hi everyone, I had an artist do a painting of me and my kids (single dad) as DC superheroes a few years ago.
I want to see if there's an ai tool I can use that can draw each of us, full body as DC heroes that I could then port into Photoshop and merge into one image? They have grown up now so id like a new family portrait of us as heroes
Any suggestions on how to tackle this without needing to subscribe to a service?
bro come on 😭 shits so corny
Don't act like you're not scared of pee pee poo poo man
ji
Hola
.
Do udio/Suno songs also get copystriked?
Iv made ai music songs with suno about random things and I haven't got a copyright strike
No because they are allowed to be used I think for only the paid tier for suno but don't know about udio
what settings do i need for my program? is rmvpe_onnx the best?
Hello, I don't know if you guys remember me, probably from about a year ago I think
I got hacked that year
It wasn't me who was doing stuff like this
real
Hello, new to the server. I just discovered the world of AI Generated Music, I normally use Suno and have created two EPs with my AI Band Zombie Dust, with lyrics written by me. Nice to meet you.
sweaty men pumping iron

So when training a voice model, what are some things to keep in mind? I've got about 15 mins of adr Inverview recordings from a specific actor. However if trained on lets say 500 epochs or 750 epochs, will the model be able to sound less flat or monotone which often is the case with many voice models?
Ayo? @pastel pawn level 2 !!! 
More epochs don't mean more quality. The right number of epochs depends on the dataset, so use TensorBoard to check
and be sure to have a good quality dataset
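To make the "check the graph instead of guessing an epoch count" point concrete, here is a small sketch that picks the epoch with the lowest smoothed loss. It assumes you have one loss value per epoch in a list; `best_epoch` is a hypothetical helper, and the exponential moving average is only meant to resemble the smoothing slider in TensorBoard, not reproduce it exactly:

```python
def best_epoch(losses, smooth=0.9):
    """Return the 1-based epoch with the lowest EMA-smoothed loss."""
    if not losses:
        raise ValueError("losses must not be empty")
    ema = losses[0]
    best_idx, best_val = 0, ema
    for i, loss in enumerate(losses[1:], start=1):
        # Exponential moving average: higher `smooth` means a flatter curve.
        ema = smooth * ema + (1 - smooth) * loss
        if ema < best_val:
            best_idx, best_val = i, ema
    return best_idx + 1
```

If the returned epoch is well before the last one, the extra epochs were probably not helping.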
hmm interesting, i thought the more epochs the better the quality. Like i said i have about 15 mins of interview recordings, which is just the actor speaking into the same mic, same audio quality. All from one source, which is how i prefer to work, because i imagine that if you have a dataset which jumps around in quality it's going to be very noticeable in the final model
Last update: Feb 10, 2024
yeah just found the link myself, thanks for the guidance though, it's really appreciated 😉
yw
Dont promo here, use #1159290752195633273
Hi, noob here what is good local ai image generator. looking for something working great with rtx 40 series gpu
im curious about this too
im about to play this shit
Prolly AUTOMATIC1111 or ComfyUI
Not really a local user but that's what most ones use
do i need pytorch to install
im getting this permission denied on the cmd
oh wait i fixed it
i just moved it from program files to any other files
the game in the picture you just sent
hi this is haru nice to meet u all
I can finally hunt preds in vcs 🙏
Hi everyone, do you know any AI tool that can process data from images or scans and extract specific information? OCR struggles with photos. It can handle scanned documents, but when it comes to photos, it can't read them.
Like a image to text?
I'm specifically looking for a solution to recognize PDF documents that contain scans or images of ID cards, and I need to extract the PESEL number, date of birth, first name, and last name from them.
In my job, we have to do this manually, but such a bot would greatly streamline my work instead of searching through 20 pages of PDF for client data.
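Once some OCR step has turned the pages into plain text, the PESEL part can be checked mechanically, which helps filter out misread digits. The sketch below assumes the text is already extracted (the OCR itself is not covered), and the helper names are hypothetical. It uses the public PESEL rules: 11 digits, a weighted checksum, and a month offset that encodes the century of birth:

```python
import re
from datetime import date

WEIGHTS = (1, 3, 7, 9, 1, 3, 7, 9, 1, 3)
# The month field is shifted by these offsets depending on the century.
CENTURY_BY_OFFSET = {80: 1800, 0: 1900, 20: 2000, 40: 2100, 60: 2200}


def is_valid_pesel(pesel):
    """Check length, digits and the control digit of a PESEL string."""
    if not re.fullmatch(r"\d{11}", pesel):
        return False
    digits = [int(c) for c in pesel]
    checksum = sum(w * d for w, d in zip(WEIGHTS, digits))
    return (10 - checksum % 10) % 10 == digits[10]


def birth_date(pesel):
    """Decode the date of birth stored in the first six digits."""
    yy, mm, dd = int(pesel[0:2]), int(pesel[2:4]), int(pesel[4:6])
    offset = (mm // 20) * 20
    return date(CENTURY_BY_OFFSET[offset] + yy, mm - offset, dd)


def find_pesels(text):
    """Return every standalone 11-digit run in `text` that passes the checksum."""
    candidates = re.findall(r"(?<!\d)\d{11}(?!\d)", text)
    return [c for c in candidates if is_valid_pesel(c)]
```

Since the date of birth is encoded in the number, it can be cross-checked against the date printed on the card. Names still have to come from the OCR text itself.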
do u guys know how to like remix songs with a ai voice of the original singer with different lyrics?
You'd need to sing the song yourself, convert the voice, then apply the instrumentals
hello
I would like to know if it would be possible to use an AI to translate voices in games for free (in the form of an audio file or a YouTube link to translate the voices, not directly in the game) and recreate them in another language
not accurate but a method that im thinking for that is grab the transcript and translate it, then using something like gpt sovits or elevenlabs maybe to make a realistic sounding translated audio, problem is, it wont match the mouth movement or length of talking
i forget creators also put nsfw in there
hello
mm true
ola
thanks
Uati
Ayo? @icy breach level 1 !!! 
Hi, i'm new here nice to meet ya all

come for resources and go
all the people i remember talking with before on here left lol
or most of them more like
hello
There's #1159290752195633273
oh srry, didn't see it
lol that's not a bad idea realecco2k, but i think nick088 might be joking about the 69 cents an hour thing 😂 what do you guys think a fair rate would be for chatting on here?
-# AI-generated responses may be inaccurate; please verify important information.
what the hell the applio bot can read from the replied message too
Didn't know that either tbh
Resources change tho, many colabs got deleted after time or either they are not updated anymore
yeah but thats what i would think the people that join and dont talk much are here for; updates and new resources, but idk i havent talked in this server in a year i wouldnt know what they were doing
Yea ig for models too
I remember there was alot of like diffrent voice models, where did they go?
Server got deleted once for copyright so super old models may not be found here, they should be in weights tho
You can search rvc ai voice model at:
- #1175430844685484042
- #🔍│find-models
- https://weights.gg/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://applio.org/models
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.aihub.wtf/essentials/how-to-make-voice-models/
Ah i see, thanks
I sent u all links where u can find models
Yeah ill look into those, wanted to find a few russian ones, and was surprised it was empty
Yea
i forgot id joined this server tbh, lol
😭
i checked back in recently tho, because im tryna make my own simple speech to speech voice converter from scratch
are there any good tortoise-tts inference and training gradio web ui's available? everything i have seen has been abandoned, or is xtts better?
and part of me feels like im way in over my head
so your RVC fork?
rvc is the program used for that which is the best
is that the fundamental algorithm behind voice conversion shit, like if someone wants to talk as snape from harry potter or something?
for the past month id been playing around with and learning audio signal processing and deep learning to pull this off
xtts2 is better btw, its built on tortoise tts, but both of those are discontinued btw
If u care about quality, use gpt so vits (few-shot, so it needs a bit of actual training)
If you care about speed, use fishspeech or xtts2 (0-shot, no explicit training)
oh ok i have gpt-sovits installed, but i think it's a chinese version
this is the program yea https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI
this is the original one
maybe try https://docs.aihub.wtf/tts/gpt-sovits/ but idk of local things. Whats ur pc gpu btw?
so like, re make it?
4070 12GB
Ayo? @viral jacinth level 1 !!! 
ig u mean a fork (modified version)
ye good enough, u could try that guide
idk how exactly RVC works, like in terms of the step by step algorithm, so i was gonna reinvent the wheel for learning purposes essentially
and see if i wind up remaking something with the same fundamental functionality
but i have no clue how deep thats gonna go. all ik is that i need the software to take in vocal input, extract the audio features and feed them into a neural network that'll have been trained off of a lossless audio dataset from (insert fictional character here)
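As a toy illustration of the "extract the audio features" step only, here is a sketch that cuts a signal into overlapping frames and computes two very simple per-frame features. Everything here is an assumption for learning purposes: the function name, the frame sizes, and the choice of features. Real voice conversion systems feed the network far richer features, so this shows the framing idea and nothing more:

```python
import math


def frame_features(samples, frame_size=400, hop=160):
    """Split `samples` (floats in [-1, 1]) into overlapping frames and
    return one (rms, zero_crossing_rate) pair per frame."""
    features = []
    for start in range(0, len(samples) - frame_size + 1, hop):
        frame = samples[start:start + frame_size]
        # Root-mean-square energy of the frame.
        rms = math.sqrt(sum(x * x for x in frame) / frame_size)
        # Fraction of adjacent sample pairs whose sign differs.
        crossings = sum(
            1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0)
        )
        features.append((rms, crossings / (frame_size - 1)))
    return features
```

At a 16 kHz sample rate the defaults correspond to 25 ms frames with a 10 ms hop, which is a common choice in speech processing.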
i've stared at and tried so many voice cloning programs the past few days, my poor hard drive is hating me right now
you are looking for text to speech right?
just checking, cus if u want speech to speech its another program
Thats not how it works
want to do text to speech with models that i train
then gpt so vits should be good
try to check the guide out
what is the retrieval based voice conversion program for?
@covert lake stop trying bro
rvc, its for speech to speech
?
ah ok cool
Hey i've been doing this for 2 days, mainly did comfyui stuff. I appreciate any tips and advice
hi
monkey
Anyone know why w-okada repeats itself when talking into applications like discord?
ask in help channels and be specific
hi, can i ask you a question
whats a good video tutorial to merge rvc models?
Guys who wants to join our clan
if its related to help, there's help channels for that
what do you guys think of my new model?
hi guys
sounds great
does it sound believable
I think it does
sounds good, im probably stupid for saying this but you'd probably have to change your speaking mannerisms a little for it to sound more convincing when converted
)
Wtf is that bios 💀
Ayo? @light kiln level 2 !!! 
alr
wdym
.
ye ik
hi im a new guy here
heyyy, I have a request: a friend challenged me to get subs for his channel. The sub doesn't have to last long, but it would mean a lot to me. tt: AnarchicznyBanan
Beam
I tested out Just way you are by Bruno miles on 64kb mp3 and use AI to enhance the frequencies to make it 'lossless' it sounded really good
👋🏽
omg which ai is this
Ayo? @wind stratus level 9 !!! 
Mvsep AI enhancer
Bruno Miles sounds like a nightmare combination of Bruno Mars and Yuno Miles
Idk if that instrumental counts, but generally, no instrumentals in samples in chat
I think that's fine tho idk
I like that instrumental because it's nostalgic
There's some problems, But it removes the lossy effect
Shit I got it mixed up
but you have 2 iq, because you're asking people who write in English while you write in Polish, how are they supposed to understand that...
can ai do cold call in french
ai?
down talk here :_:
Hello!
Web3 platform is expanding our team and looking for: developers, moderators, beta testers, analysts, and designers. The salary depends on the role.
You can write without experience. I'm waiting for you in DM.
Hey all, I have been working on a very intriguing and promising project for the past month, and I am looking for other AI-fellows to join and collaborate! The project is related to easy, use-case specific chatbots & virtual assistants using Retrieval-Augmented Generation (RAG). You can find more details on https://voiai.io .If you are interested, please DM me so I can explain the details of the project and how we can collaborate:)
i need the best settings for W Okada newest release
hello 🙂
- UVR5 UI, by Eddy and Ilaria Huggingface Spaces
- Ilaria RVC Zero, by thestingerx Huggingface Spaces
- RVC⚡ZERO, by r3gm Huggingface Spaces
- Applio, by IA Hispano Huggingface Spaces
- 🆕 FaceFusion UI, by Nick088 Huggingface Spaces
Hello, I'm Anny, hope i'm welcome?
Ok general question, is an AMD gpu not compatible with W-Okada?
you have to download the amd version
How do I know which version is the AMD version?
Onyx version
Mhmm, oki thanks
Ayo? @idle sandal level 2 !!! 
I'd suggest fork wokada that uses less cpu for your AMD gpu
https://rentry.co/forkvoicechangerguide#download-amd-intel-and-cpu
Guide style is in the same as Blanc_dot's. Thanks Blanc_dot for corrections. Most technical information comes from deiteris.
Last update August 30th, 2024: New b2309 version
Translations added for:
German: https://rentry.co/ForkVoiceChangerGuide_de
Turkish: https://rentry.co/ForkVoiceChangerGuide...
it says how
i already moved it to a dif folder
hi
Oki so, got this working, and honestly thank you very much for the help.
Issue now is just me talking too fast and my voice being a bit incompatible with Jp voices
directml is so bad performance-wise
No it isn't. What is the alternative? There really isn't an alternative.
That and the voice I got was a bit choppy, so... should I talk slower? I assume that should help but honestly I'm unsure...
That and I wanna record audio with it which... I don't think it can do?
patch zluda into nvidia build
that's the alternative
I mean, I could try to redownload the AMD version, issue is well... I dunno which one that is,
Like my GPU is an AMD Radeon RX 6800M if that helps, but yea at least it's workin so it's better than nothing

give me a sec
Oh and one more question, when I was using the windows version I got this thing where I could upload files? Just out of curious but I can just upload a vocal file or something and have the ai repeat it back?
depend on the model quality
Right so, the model I wanna use is well the Kurumi Model... that's Jp only so... F
I just tried the Kurumi300 thing
try the fork wokada as I mentioned above that is less CPU-bound
do some other jp models work?
Although yea, are we sure it's the model, I honestly think it might be the mic
cause... well I'm using the mic from my laptop so...
i'm gonna do a pro-gamer move
But yea... do you guys recommend like, an actual mic? Cause honestly I think I need that instead of just my laptop audio
but yea, I guess I'm going to call it a day for now? I mean it is working at the very least so, good enough for the moment, I might just use Audacity to do the recording so, thanks for the help @chilly lake @polar flax
get the fork version not the original
oh?
some decent two-digit buck headset mic
No, zluda would do worse because there's no cudnn support and stuff. You would have to go the applio way and it would not be as good as directml
use non-realtime RVC for just recordings, you can try this easy space
https://huggingface.co/spaces/TheStinger/Ilaria_RVC
Ok so I downloaded the one called "Download AMD, INTEL and CPU
AMD & INTEL ARC / CPU
Latest as of 24th August 2024: dml-b2309 (click here to download)
After the download, you run MMVCServerSIO.exe"
Did I download the wrong thing?
oh... oh thanks
Yeah, the Sennheiser PC37X or a ModMic Wireless are good choices for headset mics for 100 or under
Just a question about this one btw, how do I add models to it?
you unzip the models and then add the index and pth
paste the model's huggingface link here
oki, thanks
Ayo? @idle sandal level 3 !!! 

yeah but for weights.gg models you have to do the unzip way
there's also the upload option

Got it working, thanks very much
going to go, pair some audio and make some strange songs
Yeah that's the unzip way
is the original pre train good?
Just use titan instead of the original, there is no point
There is no point of using rvc at all
when running the applio launch for the first time how long does it take
it open a browser tab
hello
Hello!
im trying to load a ai voice model that i put into the voice changer
but every time i do, this happens: it crashes
I've been out of the AI cover game for a while, what's currently the best link/app for AI voice conversion for songs/
Hey i got a question,
Currently i can change voice in PC using RVC and VB audio cable software .
But if i wanted to do the same in PS5(video game console) while i call someone? How do i do that?
I think with RVC you mean wokada which is the specific one for realtime
I don't think you can do it on PS5 tho
Yes wokada
Ayo? @bitter wagon level 1 !!! 
Ahhh :((
Afaik it wouldn't be really possible
I was thinking another approach.
How about if i can use remote play to connect ps5 to pc and play the game in pc, through remotely then i can change the default microphone settings
Dunno about that could ask in #🔍│help-w-okada but not sure if possible
I know how to do it with dongles to get voice changer into ps5
Ayo? @earnest dragon level 31 !!! 
Yeah but the dongle way is the better way to do it
How?could u guide me
PS5 is a separate device that can only be streamed
do you mean the opposite, play in ps5 and streamed on the pc?
In ps5, when i play games i can call someone in ps5 itself, but i want to use a voice changer
Could u elaborate
hi
Hey everyone so let me know if this is right but if i understand it correctly, i have a 15 min audio file, which is 900 seconds, at a sampling rate of 44.1khz. Which would then give me the formula 44,100 samples per second x 900 sec = 39,690,000 samples, is this correct? Because im trying to find the right batch size and the amount of epochs to train my model on
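For the sample-count question above, here is a quick check, assuming mono audio. 44.1 kHz means 44,100 samples per second, so 15 minutes comes out in the tens of millions of samples. The function name is made up for illustration and is not part of RVC.

```python
def total_samples(duration_s: float, sample_rate_hz: int, channels: int = 1) -> int:
    """Number of PCM samples in a clip of the given length."""
    return int(round(duration_s * sample_rate_hz * channels))


clip_seconds = 15 * 60                # 15 minutes = 900 seconds
n = total_samples(clip_seconds, 44_100)
print(f"{n:,} samples")               # 39,690,000 samples
```

As the reply below points out, this number is not needed to pick a batch size or epoch count; it is only a sanity check on the arithmetic.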
Ayo? @pastel pawn level 3 !!! 
too much math. batch 4, max epoch 200, check tensorboard for best epoch
with a default pretrain even 50 epoch model should be fine
Yeah so how do you know which epoch is the best to use? when viewing the tensorboard i've noticed the graph is steady but i've seen numerous references to validation loss to check the amount of epochs to train on. but im unsure on where to find that data. that and i see how many steps there are taken but again im unsure on what to do with that
RVC v2
simplest way is loss/g/total, which sums the generator loss terms (adversarial, feature matching, kl, mel) - they are metrics for how much generated audio differs from real one
Ayo? @chilly lake level 4 !!! 
you just get 2 usb headset splitters and 2 aux cables and then voicemeeter potato
with enough data g curves down and more or less becomes flat without much improvement, at some point it may go up
that's usually way beyond the point where anything useful can be extracted from the source data
is there any tutorial on how to set this up
okay so just set a number of epochs check the graphs and stop it where its overtraining. And the graph to look for is not G/total but g/loss right?
will it be efficient? or would there be any lag
loss/g/total, loss/d/total, loss/g/mel, loss/g/kl - if any of them start to climb up that's overtraining
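A minimal sketch of that "stop where it starts to climb" check, assuming you have exported one loss value per epoch (for example loss/g/total) from TensorBoard into a plain list. The function names, the smoothing window and the patience value are all hypothetical choices for illustration, not something RVC or TensorBoard provides.

```python
def smooth(values, window=5):
    """Trailing moving average; early points average over what is available."""
    out = []
    for i in range(len(values)):
        chunk = values[max(0, i - window + 1): i + 1]
        out.append(sum(chunk) / len(chunk))
    return out


def best_epoch(losses, window=5):
    """1-based epoch where the smoothed loss is lowest."""
    s = smooth(losses, window)
    return min(range(len(s)), key=s.__getitem__) + 1


def is_climbing(losses, window=5, patience=10):
    """True if the smoothed loss has not hit a new minimum for `patience` epochs."""
    return len(losses) - best_epoch(losses, window) >= patience


# Toy curve: falls for 20 epochs, then creeps back up for 15.
losses = [float(30 - i) for i in range(20)] + [11.0 + 0.5 * i for i in range(1, 16)]
print(best_epoch(losses, window=1))   # 20
print(is_climbing(losses, window=1))  # True
```

Real curves are noisy, so a larger window is usually more useful than window=1; the trade-off is that a trailing average reports the minimum a few epochs late. Listening to checkpoints around the reported epoch is still the final test.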
Thank you for the help, really appreciate it, i will see how far i'll get before calling for aid again 🙂
hey give me a workflow (the flow process)\
Anyone know the easiest way to use flux and run it locally?
for regular speech even with a 10 minute set I can't really tell much difference even between 10 and 20 epochs - they are good enough, of course more data produces better results
depends on GPU VRAM and regular memory
12GB + 32GB Comfy is probably the best one, minimal effort to get it going
was using stable diff web ui, but I wanna try flux to see the difference
unless you want to fiddle with quantized models and separate encoders
gpu is 12, and 32 mem
Ayo? @elder willow level 1 !!! 
wasn't comfy ui like compromised or something earlier this year?
there was a malicious node
but anyway, to use Flux without much hassle 64GB is really a must have
and 20-24GB VRAM
some memory management tricks work, but usually you have to really close every other app and pray it does not run out of memory
huge decoder
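A rough back-of-the-envelope for why the VRAM numbers above get so large and why quantized models help. The parameter count below is an assumption (roughly 12 billion for the main image model), and the estimate covers weights only; text encoders, activations and framework overhead come on top.

```python
def weights_gib(num_params: float, bytes_per_param: float) -> float:
    """Memory needed just to hold the weights, in GiB."""
    return num_params * bytes_per_param / 2**30


FLUX_PARAMS = 12e9  # assumption: about 12 billion parameters

for name, nbytes in [("fp16/bf16", 2), ("8-bit", 1), ("4-bit", 0.5)]:
    print(f"{name}: {weights_gib(FLUX_PARAMS, nbytes):.1f} GiB")
```

At 16-bit precision the weights alone land in the low twenties of GiB, which is consistent with the 20-24GB VRAM figure in the chat, while 8-bit or 4-bit quantization is what makes a 12GB card workable at all.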
just have the output going into the input of each and then set accordingly in Voicemeeter Potato. there might be a tutorial but I figured it out myself, back in 2017 when voicemod was the only thing, and I was going into games playing soundboard audio through the voice changer and stuff.
That's cool. Would it be the same process for android as well? If i wanted to do the same process
Ayo? @bitter wagon level 2 !!! 
no
might be better to just use fal then lol
would it be the best to have the dataset include yelling, crying, other sounds that the actor makes in order to be able to express emotion with the finished model? I want to use the model to create ADR dialogue for a film, but i want to prevent it sounding monotone and robotic you know?
a good variety is always the best option
And for android, if i want to use okada voice changer, then?
this flux lora needs 24 GB vram to use locally
https://huggingface.co/spaces/Raumkommander/train-flux-lora-ease
tbh SD3 may need even more spec to do
emotions maybe not, but low voice, normal voice, loud voice, singing, etc
Okay got it, and will the finished model be able to emote or will that always be difficult and be monotone?
and as usual, reading from a dictionary is better than the same repeated phrases
You can't really and why just use pc
oh yeah totally, no the dataset is dialogue from the film isolated, where only the main character speaks and recorded on the same mic and boom as the interview snippets of the actor
a good test is "you can't handle the truth" piece
not even through winlator
if it does not turn the drama into a comedy with a squeaky voice, your model is all right 🙂
if i have any other questions is it okay to dm you personally? or would you prefer to keep things in this chat? just so i know the boundaries you know?
k
.
well currently I'm using Automatic1111
Comfyui is the way to go no one really uses automatic1111 that much anymore.
ah fair, time to learn comfy ig
for flux the best I have seen is pullid flux
@zenith hinge happy birthday
is it even there's
Vote Booster: Vote now for a 10% boost. https://arcane.bot/vote
✨ Tip: Use /card to modify your rank card
i eat concrete for breakfast
@earnest dragon could u check dm for a min? I have some questions regarding aux cables
@earnest dragon Also before i buy the cables just wanna confirm once with you if thats the one, so can u check dm pls
yo
Hello
👋
fuck
From the SCREEN 📺 to the RING 🥊 to the PEN 🖋️ to the KING 🫅 yeah wheres my CROWN 👑 thats my BLING ⛓️ causing drama when I RING 📞
NOT THE KSI SONG
What's the best voice cloning system for sound effects.
My use case is wanting to transform male combat exertion SFX to female (I have a dataset, full of female combat exertion SFX)
I know RVC ain't ideal for this, however i have trained models on TITAN and Original pretrains and inferred to some success (but, most breathy sounds don't come through well), that was almost a year ago.
So my question is, has anyone tried this with any success? With RVC or maybe other systems?
Ayo? @quartz solar level 1 !!! 
if you mean like screams, taunts, etc. that is still called voices
Pretty much yeah. No words
!rvc
vc be like
this channel is full
Hi

Ayo? @mighty locust level 1 !!! 
Model training and inference
Or
Voice Changer
idk, i wanna make ai covers locally
not exactly voice changer since i have an audio i wanna convert
Thats inference, that should work just fine yeah
Get RVC Mainline from the docs
-docs
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
okay thanks!!
i don't listen to them lmfao
why spice had to make all her bars about shit and poop
Ayo? @elder willow level 1 !!! 
pooping yourself seems pretty funny, until it happens to you
does the rvc thing support rvc v2?
Yes
letting my friend use my genshin acc, I JUST WANTED HER TO DO AN ARCHON QUEST FOR ME WHY IS SHE DEFEATING THE OCEANID BOSS
yes
Ayo? @dense heart level 24 !!! 
cornco\🅱️
monkey
🙏
man, this flux 1.1 is really creative smth ngl, the whole prompt was snap.jpeg, now tf is this
i only now realized vencord sent a link emoji my god
Ayo? @tacit falcon level 4 !!! 
ohh my godd lmao
hi
hello bro
Why so serious?
-help
-rvc
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- 🆕 FaceFusion UI, by Nick088 Google Colab
- 🆕 FaceFusion NO UI, by Nick088 Google Colab
- 🆕 EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
hi
hello
hi
One message removed from a suspended account.
for some reason i don't understand, when i start training with 250 epochs it only makes 250 steps xd what can i do?
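One possible reading of the "250 epochs, 250 steps" question, offered as a guess rather than a diagnosis: if a step means one batch, then one step per epoch means the whole dataset fits in a single batch. The sketch below assumes that definition of a step and assumes the trainer keeps a final partial batch; the function names and segment counts are hypothetical.

```python
import math


def steps_per_epoch(num_segments: int, batch_size: int) -> int:
    """Batches needed to see every training segment once."""
    return math.ceil(num_segments / batch_size)


def total_steps(num_segments: int, batch_size: int, epochs: int) -> int:
    """Steps taken over the whole run."""
    return steps_per_epoch(num_segments, batch_size) * epochs


print(total_steps(num_segments=4, batch_size=4, epochs=250))    # 250
print(total_steps(num_segments=300, batch_size=4, epochs=250))  # 18750
```

Under those assumptions, seeing exactly as many steps as epochs would point to a very small dataset (or a batch size at least as large as the number of segments), which is worth checking before retraining.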
I bought the a100 plan, it's soooo much faster for training
My model went from 300 to 450 in 10 minutes
yea many do that tbh, especially in cloud gpu, which people usually use runpod or other things
i thought it was common to just rent gpus for training real models or just very large algorithms
yea people rent a lot of gpus for ai
not really a weird thing
like for ComfyUI which is used also in PaperSpace if im not wrong
ye exactly
its just people here are used to only google colab mostly
but ofc with paid gpus u can get more performance and dont have to worry about restrictions
i guess its kind of weird if ur renting just to train quick rvc models from a pretrain and you have a good enough pc to do it
people do it for pre-trains
thats what i was thinking
Everyone gpt 4o canvas is out
I tested it and it's brilliant at coding
what are some good female voice changers?
Ayo? @bright gust level 1 !!! 
Hello
Perchance
"By integrating phono-cognitive autoencoders with cross-domain GAN-driven prosodic fusion (GDPF), we achieve a 500% increase in audio fidelity when modulating speech synthesis over multi-threaded parallel NLP pipelines (MT-PNLPs). This is amplified through recursive VAE-infused harmonic oscillators, which recalibrate the tonal imprint of dynamic voice signatures in sub-neural nets, while minimizing tensor collapse in bidirectional transformer feedback loops (TFLs)."
RVC 3 is looking epic
galactic
Hey everyone! I'm Bola, a software developer, startup founder, and passionate advocate for innovative technology. With a background in philosophy and a unique life journey, I'm always exploring ways to create impactful solutions while sharing motivating stories.
👍
Ayo? @lean bough level 1 !!! 
yea y u concern?
A
What is the best yet easiest free way to make ai voice covers
without having to do any of the filtering myself
I am too lazy to filter stuff rn
The update that is cuming since 2023 ?

looking at the last V3 repo update date... it's dead, Ben
RVCGUI by tiger14n can do it pretty well
I like using the app Replay for AI covers
It basically has everything built in
-gui
what? is there something better?
dam
how do you use that lmao i keep getting an error
Hi
Lagged so hard my laptop crashed
.wk
Ayo? @pine mural level 6 !!! 
👑 adiiose - 5228 plays
** 2.** #igrok - 4000 plays
3. remy - 3999 plays
4. exan - 3167 plays
5. luminal - 2823 plays
6. yeresin - 2555 plays
7. sqwab - 1875 plays
8. vice - 1771 plays
9. wg - 1324 plays
10. osc - 1182 plays
11. Markus - 1020 plays
12. buga - 1018 plays
13. Teckmek - 967 plays
14. gtemq - 847 plays
Hi, does anyone have any links to guides/videos or advice of their own on how to sound more natural when using RVC? So far I either sound robotic with low pitch, like a gnome with high pitch and index makes the voice sound less natural, with "tone" being "capped", if that makes sense
Hello, does anyone have an RVC model of Pri Forsythe or Quadra from Quadstrike? Because we're all waiting for her.
👋 Hello everyone, I'm new here!
#voicemodell Hello my dear friends, I need help with something: I want voice models of artists, in Arabic only, whether paid or free. Anyone who can help me with this, please contact me in DM.
thats not how you learn coding
like the voice model?
You can search rvc ai voice model at:
- #1175430844685484042
- #🔍│find-models
- https://weights.gg/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://applio.org/models
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.aihub.wtf/essentials/how-to-make-voice-models/
its better u ask in #🔍│help-w-okada
use what and what error be more specific, and would be better u say it in #✨│ai-help
its better u always ask for the user pc gpu btw, cus i seen cases of people with an rtx using cloud, also bc cloud got limits
Thanks
making a rkelly ai. 25 epochs sounds damn good for only a 12 minute data set
rvc3? link?
idk just wanted to know
Rate
Nah bri
Btw where did those 70mb of ram go
There is no official v3 😭
Rvc 3 release date: 999 years
Imagine rvc v3 before gta 6
It has the same chance of x getting renamed twitter again
You mean 𝕏??
top 3 still active member 🔥
Ayo? @polar flax level 42 !!! 
release date:
never
Got to be the hardest community to network with
there's V3 repo that has not been touched for a year or more
but anyway, that statement above was a meaningless AI word salad
IP: 250.164.54.154
IPv6: 9119:c230:bc3c:78ed:f82c:cefc:b7d4:c5cc
MAC Address: 77:2F:3D:E2:C4:CA
Address (not exact): 311 Zeke Center
Used: /doxx @green panther
Wtf never seen this
U sure that's official
Hello !!!!
who knows, there's nothing else
U tried asking in the RVC server?
Where did u even find it
RVC Dev server is a bit more alive than that repo, but barely
Weird
Do others know Abt it
?
Well, i did try my best. It's almost my birthday. I want an RVC model.
h
yes
https://huggingface.co/PriQuadra/PriForsythe/tree/main
This is my problem, no RVC ZIP of Pri Forsythe.
Ayo? @zinc viper level 2 !!! 
@covert lake anyway, by the author's own words that project is dead because it did not work
the whole RVC thing is a madman's creation
starting with a file path to .wav being a textual representation of the content
you want someone to train a model of ya?
Well, yes and no. Yes, i tried; no, it failed.
If I'm not wrong u are engineer in applio server
How does it work
That is a gated model, doesn't even look like an RVC model
As I said in my message:
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.aihub.wtf/essentials/how-to-make-voice-models/
AI HUB Docs
