#🧬│ai-chat
1 messages · Page 344 of 1
i'm more surprised by mrf hifigan with 44k clearing the top is no time at all
I have a feeling this is because the upscaler does not align to the top edge, so no mirroring there
7x7 = 4.9khz lines
overal I'm a lil skeptical with 44.1
as much as how unconvenient working with 48 is.. it seems less brain devastating than 44
So if you manage to utilize it well ( n get better perf than 48 ), Imma congrat you with 200% honesty
as for that, let's hope it ain't the case

i'll leave it running and check tomorrow
Hello 👋👋
hola gente una consulta
soy peruano y quisiera saber si hay algun emulador de voz q de miedo osea una voz asi como para hacer cosas turbias y amenazar
If AI uses samples to identify speech patterns and adjust accordingly, does that mean the human voice can do that too
Thank you for accepting me into your group.. My name is Unyime Johnpaul from Nigeria.
no, it uses convolutions, feature maps and various feedbacks from discriminators and not strictly samples
simply saying, no lul, a human voice can't do that as we ain't machines supported by raw math
oh ok
thoooo if you mean legit identifying speech patterns and adjusting in a human sense, yea, ofc
brain is capable of that
But that's for really any repeated and learned tasks
Hi you ai? Im not ai
Hi everyone! I'm so interested in AI songs and can't wait exploring more! but I'm new for discord, so I'll be so grateful if anyone can help me with it!
I'm not a chatbot. I just like to talk. 
I am certainly not a chatbot powered by Hermes-3-Llama-3.1-8B-Q4_K_M language model
Imagine being a gguf quantized model
hi guys
hi
一个利用AI使生活变得更高效的人
Yuh uh
Hello
Wsg
Please speak English in #🧬│ai-chat, or speak Chinese at a Chinese chat. 
Que tenia que ver el ser peruano xd
Jajajaaj lo que pasa es que hay varios que hablan ingles y otros idiomas
Tu sabes cómo poner otras voces?
Greetings, I'm here to catch up a bit on AI voice matters. Been quite a while since I even glanced at it.
@dense pivot @icy ocean talk english here, or use #🌍│español
RVC v2 didn't change much in quality:
- Applio (fork, which means modified version) made it sligthly faster
- And there are community pretrains (pretrains models are models used as a base for training, which is what there was already, just now there are other ones that help mostly for other languages),
- RVC v3 is never gonna get released, maybe an unofficial fork by codename to help it but it's still in the works
For wokada (using rvc models in realtime for calls), there has been way better performance with the deiteris fork
Hmmm... The only real uses I can think of atm would be for TTS/"Jarvis" type things, but I could definitely train some voices I have in mind.
RVC isn't made for TTS (Text To Speech)
RVC is STS (Speech To Speech)
Ofcourse you can make a TTS audio with any TTS, and then use it as an input in RVC tho (Just like how Applio does this built in with MS Edge TTS, which is good for multilingual and quality but needs internet and doesn't have emotions)
Thanks, I recall using RVCv2 a little way back.. No worries about dumbing things down for me too much, I'm out of touch on audio but I like to think I have a decent grasp of AI/ML in general.
Idk what's "Jarvis" type of things tho
Ahh, speech to speech. there we go, yeah, it's been ages like I said. Bogged down in the SD world too long.
real-time chatbot/assistant kinda thing, but I want to catch up in general.
I do recall changing a few songs to use different voices, amusing.
If you were looking for TTS, you can technically still do it with RVC tho
But if you're looking for a TTS program, there's 2 types, i will say the best ones:
- few-shots: GPT-SoVITS
- one-shot: F5 tts, and Fish Speech
Btw Flux has better quality than SD
-# But is also more resource demanding
Oh, excellent! Thanks, you saved me a good hour on github 😆
(check my profile)
I did an index if you want
Also a noticeable good one-shot tts is XTTS2, but it's never going to get updated unfortunately
Table Of Contents Introduction Index of the best TTS 1. ElevenLabs/11Labs: 2. GPT-SoVITS: 3. Fish Speech: 4. F5 TTS: 5. Edge TTS: 6. StyleTTS2: 7. XTTS2: 8. OpenVoice v2: 9. MeloTTS: Use TTS in Realtime on calls (ONLY PC) Introduction TTS Means Text To Speech! Inference means when you use the...
@chilly lake I see they'll let anybody in here... 
I do remember there was a fork called AllTalk tts but never tested it ngl
Idk what u mean by that
But Noobies is really smart, he also did play alot with TTS
oh lol
He's on my server too.
-# btw embed perms at level 5
Ohh 😭
no piper tts 
I don't remember hearing about that
Yeah the last time I was looking at voice or TTS was at least 6mo ago, as I recall it was a bit in flux at the time so I've got catching up to do. I'm so addicted to the voice of Twilight Sparkle that I just can't stay away any longer. 😊
pretty sure I tried piper
tl;dr medium quality, very light and fast on inference
can be used even on raspberry pi
ah yes "back in the day" of October 2023
this ?
A fast, local neural text to speech system. Contribute to rhasspy/piper development by creating an account on GitHub.
eh
@chilly lake what do u think about piper tts?
idk if to add it to the index
yea
havent tried.. 22050 does not look attractive to me
I tried it with a streaming voice dialogue AI pipeline
Just after something relatively lightweight and easy to use that I can just pipe things to
listen to voice - asr using whisper - microsoft's llm - piper - play response from ai, loop
it was lightweight enough for close to realtime
how was the quality tho?
whisper is a bit heavy duty, there's something vlad and I use on a side project that's... some google thing, it does speech recognition quite well. I'll have to check my convo about it...
not sota for sure, you can see some examples here https://rhasspy.github.io/piper-samples/
yeah, whisper does mistakes, and now there are a lot of complaints from the medical community that use it to trascribe doctor's notes and it writes random garbage
@atomic vector explain the meaning of 'sota'
hey nick088, 'sota' stands for 'state of the art', which basically means the best or most advanced in a particular field or technology. in the context of your conversation, alexeyartemiev is saying that the examples on the rhasspy github page aren't the most advanced or the best in terms of text-to-speech systems.
-# AI-generated responses may be inaccurate; please verify important information.
oh
I dont remember, it was not expressive, it was just tokens to audio thing, as I recall
What is this chat made for everyone
is there any explainaiton
while ago i joined here so just wanted to ask quickly
low quality (1st) vs medium
i've been using it for twitch chat tts for quite some time, it's not as good in all fields related to quality of a tts as many alternatives, like speech clarity. i think it should be considered to use mainly if you have hardware constraints or low-end PC
yea.. i think it's good only in being fast & lightweight
buhhh
yes there isn't a level requirement anymore
fast yes, quality = shit
No i mean whats the purpose of the server
do u think it's even worth it to be added in my index? idk if to do that tbh
how many people are there that need only realtime fast but lowest quality ever
AI,
Mostly RVC & Wokada,
But there are also other #1159513888199540817 about other AIs, like mine for FaceFusion, Termux & SuperPrompt
there are other very fast tts that use eSpeak
eSpeak?
the most common phoneme to audio engine
Yea ig i'm not gonna add it
u can't say 22k is high quality 😭
regular speech does not require more, but it just does not sound good
it looks like the generator is undertrained
"If it's good enough for a landline rotary phone it's good enough for me!"
at least th medium one with the same mirroring lines we saw with rvc
-# in case it wasn't clear it used espeak
on low-end pc u could just use Edge TTS on cloud atp
run my test thru it
?
crap, one sec,need to fix that meow
weird quotes
eSpeak shits the bed on combined words and 'necromancers' 🙂
I created a custom context menu item so I can almost instantly create edge-tts speech. Just gotta type in the text and the filename and it's done in a second or two.
Yea
Piper looks to me a local & worse version to edge tts
Good only and specifically in speed
It's a shame edge only had the fixed voices it had, likely censored too, can't say I ever tried...
this is medium quality italian
looks like an english speaker tryna learn italian for the first time that talks too fast
oh wait, it's NOT censored.
it can say curse words
I just assumed it was, did a test and there's not much of a limit there
I use that creepy victorian english sounding girl voice. everybody hates it
Pretty sure no english speaker ever did THAT
hey, are there any performance changes from w-okada 1 -> 2?
or just support for beatrice 2, and some added settings
I mean it sounds bad tho
hi
what are we talking about today?
TTS
Text To Speech
ok
we were testing Piper TTS, it's fast and very lightweight
but shit quality lol
i know some thing about CPT
kool
yeah, does not sound like a good expressive read
As a monolingual english speaker, it definitely sounds like Italian, but I'll have to draw the line there.
just a robot that does not breath somehow making the sounds and not running out of air
oh please tell me there's a way to do Latin, that could be fun
In nomine tenebrarum, evoco te, daemon potentissime.
Ex silentio aeterno advenias, ex abyssum ignis surgas!
Da mihi auxilium tuum, da mihi potentiam tuam.
Praesentiam tuam hic desidero, o princeps umbrarum.
TTS demon summoning
what is this ??
Fastest growing ai voice
name ??
idk what it is sorry
I'm italian, I know italian & english
I kinda figured 🤔
what tool are you guy using for covers? forgot the one i used before resetting my pc, i didnt note the link
It does sound italian but italians don't speak as fast as english people do
and the model sounds robotic lol
For AI Covers, there's RVC
What's your PC GPU?
Weird, I had the impression that Italian was faster than english.
But then again, where you were raised greatly impacts the speed of your speech
is it free @quartz roost
we making it out of UTAU with this one
There's a free version
Unless you mean dialects,
I don't think so
And a paid professional version
is it the thing that convert midi into voice ??
She can sing anything
fr
oh crazy.... i saw this thing in Ace Studio
fr
how do i download it ??
Ace studio sucks
it is paid 
Most of the LEGAL TO USE ai voices are paid
You know, consent and all that shit
how do i download that teto thing ??
hey
you mean likeness rights
You need a license to use someone's voice legally. A written agreement.
This has been established for decades
99% of the voices here are illegal
what is going on in south korea 💀 💀 💀
Martial law
yes, the transfer of likeness rights
you can use my voice in exchange for x
There's no flexibility on this. Either you get a contract or you can't use the voice.
There are some free voices to use with terms of use posted.
voice, photographs, video, etc
Often the voices are given different names, and you must use that identity instead of the voice actors real name.
does anyone here know how to do videos
KOREA WAR
😊
#🌍│español ayuda
yo whats up lol
wattt sup mann
how is it going
bad
ım shoudl learnıng science ın netherlands
oh, im good cuz free from school early
starting december and now i finished school
oh yeahhh
Do you know a free site that has a live AI voice changer?
site? no, but you can use cloud gpu as you probably dont have a good pc, give me a sec
Interaction has expired, use the command again for a new interaction.
choose the second one @heady adder, i personally recommend it
that say'You didn't start this interaction. Use the command for interacting with it.'
u are so nıce mannn
ty :p
now ı can troll my froendss
yea
what
some idiot parked an armored vehicle at the intersection.
lmao it got deleted
Sounds like an amazing time
ok and dont call me a poopyhead
Not really. My McDonald's delivery was late.
i dont want to have a real beef with someone, stop it lmao
What! Ok time to roll up and show him you stand on business

if you take me seriously and start a beef with me over me calling you a "big fat stinkface" then maybe you need to deal with your own problems before going on discord
Does Korean McDonald's taste like cardboard too or is it actually edible
i hope you're pranking by calling me those nicknames, simple as that 

what do you expect? McDonald's is the same no matter which country you go to. 
Better tasting food 😭

Ik Japan has some fire meat, China is China and idk what Korea has
bulgogi burger
What
sweet soy sauce
Huh
well.. talking about the taste of burgers during the first martial law in 40 years... Fantastic. 
Yeah, so what's your favorite burger
burgaer
What do you think of the baconator
Do you have Wendy's?
🐢
pop eyes
I love Taco Bell's crunchywraps.
Yeah if you mean Wendy's nuts hit your face
🐢
Yummer

pig
doeso someone here know how to create PTH files for rmc voice changer
wdym?
What's your pc gpu?
It's better to check what's ur pc gpu first than doing it cloud, cloud has time limit
is there a better quality realtime voice thing than using google colab and the trial gpu
using the cpu option after the trial
is up runs like garbage
You need to train a model with RVC
What's your pc gpu and what are you using right now?
Be sure to NOT follow yt tuts
quality depends on the voice model
Maybe you meant delay
Did you check what's ur pc gpu first?
yes it wont let me use anything but the ones it provdes
i have a good pc that shouldn't be an issue
i have a 4070 super
yeah not the model itself but any other reliable ways than google colab, so i dont gotta keeep running the trial
Interaction has expired, use the command again for a new interaction.
1st guide
its the Wokada (program that uses RVC, Speech To Speech, models in realtime for calls) Deiteris Fork (fork means modified version), this one has the best performance
Yea you shouldn't use Google Colab when you got a good pc
Cloud services are only when you got a bad one
i just got this pc and thats what i used before
so if theres something way better then im open for ideas
Wokada, especially the deiteris fork, is the best program
Yea, cloud services (that run remote code on good pc) (like colab and kaggle) are only for people who don't got a good pc
It's really not worth it to use those when you got a good pc, as you would be limited by only the around 4 hours of GPU of colab that aren't even granted
it gave me an hour every 12 hours
it was bad
and i couldnt run it and play a game at the same time
yea the gpu time is random for the free tier
you can get disconnected in 2 hours a day, the other day 4 hours, the other day 1 hour and an half
There is kaggle which is harder and requires phone number but gives 30 hours weekly
But why would u use that when you can use ur own pc lol
yeah thats fair
funny it told me to extract the files, i did, now it wants me to extract files from the extracted filees folder
google colab doesnt work anymore?
Why do you say so?
i getting errors each time i try to use it
you have already tried to discuss your problem in #✨│ai-help ?
and some people under video tutorial saying that
no
you can wait for someone to help you, also, you should send some evidence of the error, of how it happens.
Do u know some free options ? Im using RVC project from github with my ryzen 5600x and it work perfectly fine now i just need to learn how to creates my own voices
hi
hello
still wondering
fr many ppl asking for wokada are using igpu, hopefully it could have better support on last gen igpus & NPUs
https://www.tomshardware.com/pc-components/gpus/discrete-gpu-sales-dip-while-nvidia-continues-to-dominate-igpus-increase-while-discrete-gpus-decline
I don't have a CashApp account, I don't have my PayPal account verified, and I don't even live in the US either, so nuh uh. 
Please find your love instead of me. 
What is this for? 💀
I don’t know why a name of a fruit is considered offensive in this server, but bro.
Also, dead chat.
Which? And are you sure you followed the amd guide of it?
rhymes with 🍇
The one on github https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/blob/main/docs/en/README.en.md (aka Mainline) has amd support only for linux
are you on linux or windows?
This g word contains an awful r word. 
the Scunthorpe problem
stupid filtering results in stupid results
scrаpе grapе drаpе
when you use a profanity filter a lil too aggressive
there's like one outdated af voice changer, there's wokada, there's somewhat optmized wokada
there's no better UI unless you just go and make your own
It's just W-Okada. Other than this either paid or requires you to sign in.
anyone here who is decent with prompting
chatgpt
if yes please hop in voice chat
need quick help
what voice should i troll with on fortnite
troll with a drum slop model
should use this model https://discord.com/channels/1159260121998827560/1287181931154509825 
Hi

How are you doing
I'm good, just a hot busy day wbu?
Guys, my page on weights doesn't quite work, that is, I can watch models there, but I can't use them and insert my audio, I couldn't delete my account, how can I fix it?
Good thank you
a
where did my ai chatterboxes go
now i'm a normal human being
just like you and me
hello
waiittttt
didn't one of the dev said they're experimenting v3 ?
shouldn't call it v3 without approval of RVC-boss
it's not v3
it's @gray rover 's fork what you're talking about i think
^ me n noobies.
Key difference now tho, I intend to do it for 48k, Noobies most likely 44.1k
Yo everyone, something very simple again
I wanna train a voice to make an AI model, so it can become something like that
https://huggingface.co/imcertibtw/RVC_MODELS/resolve/main/LeonKennedy.zip
How can my audio can become a link to use ?
upload your model into huggingface, after it's done uploading left click the download button and copy the url :p
I may be fucking dumb but
Huggingface is a site FULL of AI stuff ; where's the AI training one ?
oh wait you meant training?
uh yea huggingface can't do training unfortunately
but there are options to train
Yeah, I may be explaining poorly
if you have a good enough gpu you can train locally
Excuse my french, litteraly haha
you're fine dw
I'm on a phone :/
ah
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- 🆕 FaceFusion UI, by Nick088 Google Colab
- 🆕 FaceFusion NO UI, by Nick088 Google Colab
- 🆕 EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
have the docs too just incase xD
Thank you, but That's not what i'm looking for ;-;
I would like a website, just to train an audio into an AI model
ye the colab will do that
as long as you read the instructions correctly youll have it training in no time
also i apologize if you can't understand what im saying lolz :p
hello bro's
Yo what software do you all use to clown voices i feel like some software is better to clown voices with than other ones
@elder willow did u copyright strike me
Hi I’m dina
smh how dare u
do u know also @pale patrol
we someway got copyright striked by sony at the same day
sony is doing a genocide ig
what gif
ahem
i think u should have perms for that
yea fr
the notification was at 5 pm for me
wbu?
if they did it at the same time damn
smh ps5 should get the switch treatment
-# people basically speedrunning at making emulators for it
wait dont u literally make ai covers
tbh tho its just a song it cant hurt u ;-;
jokes on u i didnt upload for 6 months 
that ai cover was trash anyways cuz i was using adobe podcast to clean my model at that time
and yes i deleted it
wtf sony owns IT girl??
DAMMIT
they have limited swearing and ive been here for like a year now
im keeping mine but ill private the others
they can stilll get a copyright strike
bruh
bc if sony saw it and did a claim when it was public, it can still strike u
and well if they saw one already, they prolly checked others
so ima just delete
Anyone got exra money on cashapp im saving up for a bowling ball
what is codename doing as of right now
what's new in that new fork ?
that they are preparing
i keep on seeing like songs with ai travis scott n shit and it actually sounds real
send one
would anyone just so happen to know how to do that?
wanna see
hold on
I wanna see too👀
you got spotify?
i don't have access there smh

Whaaa
what is that channel ?
AI testing channel
Him and noobies send development updates, audio samples, etc
care to send some audio samples ?
🙏
They are tests on very small sets with very prototype build
It ain't going to be good
Oh but what's the difference like
what are they implementing
and what is expected in the final ver
Codename and noobies are doing different things so
Codename is adding a new discriminator and some stuff
Noobies is adding 44.1k training with also a new discriminator
what does it mean, better better audio quality
or not that much ?
Hopefully
better results from training
updated vocoder with MRF layer
better fills for upscaler gaps
Oooo Noobies comin in clutch
new 44100Hz with very little mirroring at the top range
so you're expecting major improvements ?
unfortunately both need a new pretrain created
I expect more stable training and better fidelity
I see, and when do you expect it to release
and if realtime would be usable ?
Ohhhhhhh
i expected it to be used with uh
realtime
it is not realtime
ahhhhhh, thanks for the info!
i mean the models may be used for realtime later, once wokada authors add the same generators
Can't you just ctrl c ctrl v the generator or will a lot of stuff have to be changed for realtime
I see
it is like when you tried all those config changes
most of them did not work with a default pretrains I bet
I actually created a new pretrain for that test
Model code should be simple to add once it's ready to use
HI
Hello 👋
Depends on your gpu
And batch size and dataset length
yeap I will try might share here too people offer here a lot
batch size has only an indirect effect... it the epoch takes about the same time whether you use batch 2 or 22
Yo guys i got a question
what happens when you use batch 2 vs 22 is anoher story and the length of overall training may be affected
what do i use for the voices
Use what?
1 sec
I see
Please specify on which program you would like to use for voice. 
idk
which decent ones is there
idk i found this random video https://www.youtube.com/watch?v=oOBjntI2xK0&t=15s
uh sorry if i sound dumb lol
what progams should i use
If it's a tutorial, tutorial videos on YouTube are outdated, just letting you know. 
uh
so i cant make ai voices say stuff?
which program ):
Real-time or audio conversion?
-rvc
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
O:
If you have a fast and newer GPU in your PC, you can run Applio on your PC locally. Otherwise, you might wanna run it on a cloud service instead.
if you want text to speech, it is a different thing
like is there wait times like other people waiting for theirs
i need audio
audio to uh ai voice saying it
text to speech is text to audio
o
you can upload a reference audio (15-30s) for a voice you want
can i import custom voices?
you can import any audio file with someone speaking and you'll get that voice out
wait so i can get some custom ai voice to say some random audio i give it?
step 1) get an audio file of someone talking 2) use the link and upload that audio 3) enter text 4) click generate button 5) enjoy the audio of the person saying thins with the voice you provided
o
Ok.
I ment if i get a custom ai voice then get a audio file of some random guy singing (idk) and can i make the ai voice say whats in the audio file
but its copying
so it doesnt sound boring
-tts
Interaction has expired, use the command again for a new interaction.
ooo
There's RVC TTS fork under W-Okada command? 
seems like you replace one voice with another
ye
The Ilaria RVC is a fork of RVC, developed and hosted only on a Hugging Face space. There doesn't seem to be a local or Colab version of it yet. 
ok
What the fuck? I can only use Colab for demucs now. 
I don't have a CashApp account. What do you mean?
Bro got his account hacked bruh. Did he even set up more security on his account? 
Hello, I'm not good at English. I use Google to help translate and talk to you guys. Please give me some advice. Thank you.
Did ever use 2 factor Authentication??! 🤔
Hi everyone, Im radiant, Good to be here
hi
Who are you asking to? 
Bio suggests he's probably Thai.
please refrain from posting songs that may be copyrighted as attachment instead of using external hosting
even as a safe bet, you shouldn't also directly attach musics generated using suno/udio (as they might get into legal trouble too)
If you ever feel like term and rule of AI Hub were too much for you, post an audio of you singing that song instead. 
I don't think it works like that
Yea especially now
They are doing genocides
Not only you show your own skills to let the world knows, you can also scare the shit out of AI fans that don't have a skill but only use anything AI for their entire lives. 


you can also scare the shit out of AI fans that don't have a skill but only use anything AI for their entire lives
Not like this is going to completely stop any type of AI covers to be pubblished lol
People can just:
- don't pubblish it
- buy a mechanical license
- Pubblish on tiktok where under 1 min video aren't subjected to copyright
- do like piracy
Anyways I don't really care about some old covers, that's not what my channel is about
though non-public contents shouldnt get copystruck, otherwise it would be privacy invasion
They do actually get copyrighted
||the safer way is to use temporary hosting by request, torrents, dark webs, or kind of p2p ways||
Thailand 🇹🇭
there are some thai fellows here (not me ofc)
AI music is a form of AI art, much like real music, but I still have them seperated from real human-made art. 

for voice changer? it is more or less impossible to make it work
What's your settings
Cuz mine sounds off at some point
it is impossible to make non-speech sound correct
taking = ok, singing = maybe, laughing/breathing/coughting = big nope
Talking sounds off sometimes, idk why but I assume it's my settings
Can I get your settings?
just try not to laugh
seems the model quality issue
I think so
or the accent incompatibility
Upper Right Block Diagonal Lower Middle Left To Lower Right
🭓
well i can't see this it's just a square
👋



Did they deleted that role?
There was a server admin exchange going on, likely to make this server to feel better I suppose. Not only they changed the "AI Hub" to "AI Hub by Weights", they removed certain server roles they thought were useless, and changed current roles to something else. 
welcome to the cult. Sacrifice is tonight, don't miss it
i made some adjust in resemble enhance code, now it is much better
it don't deal well with some specific noises yet, but it's much better
it was trained on common environment bg noises
make sense
static noise for example will be ignored
if someone knows a finetuned resemble enhance for uncommon noises please tell me
Yo taylor
Im really happy for you imma let you finish
But... sicko mode or mo bamba
Why not both
🔨
What the fuck? 
hi!
Hi. 
hi
hello guys, how do i use an ai voice in real time?
what's ur pc gpu?
-realtime
Interaction has expired, use the command again for a new interaction.
If you have a GPU that's faster and newer than NVIDIA GeForce GTX 1000 series in your PC, sure you can do it on your PC. The first link for forked version runs best. Otherwise, you might wanna look for a cloud service instead.
rtx 4080
Great. 
Great, u can use Wokada (program that uses RVC, speech to speech, models in realtime for calls), and the deiteris fork (modified version) has even better performance
-rt
Interaction has expired, use the command again for a new interaction.
1st link
Hi!
Hi!
hi
someone able to use wokada on linux?
Guide style is in the same as Blanc_dot's. Thanks Blanc_dot for corrections. Most technical information comes from deiteris.
Last update October 6th, 2024: Multi PC setup explanation added
Translations added for:
German: https://rentry.co/ForkVoiceChangerGuide_de
Turkish: https://rentry.co/ForkVo...
-# I didn't do it myself, but it's in the guide
thank you
also as colab uses linux (iirc), you can use colab notebooks of w-okada there if you don't have a good pc
the problem i'm facing is to route the audio stream to right devices
do you have virtual audio cable installed?
my distro doesn't have portaudio i guess
oh, well you should switch to ubuntu ig
oh, well i'm sorry but i can't help you too much as i don't know too much about linux
wokada don't recognize my audio output
maybe you dont have speakers or headphones, or you have problems with the portaudio
i have headphone
oh cool then, it may be just the portaudio
i have no idea what's going on with your linux tbh
idk too much about audio drivers on linux
maybe youtube tutorials can help you
yeah, maybe, i will try again later
thx
yw
hi guys
wassup
hi
2024-12-05 15:53:06,850 ERROR [VoiceChangerManager] Voice Change is not loaded. Did you load a correct model?
it start to work for few minutes, then only this ERROR
Oh
EVERYONE FULL o1 OUT
is there any free way of cloning my voice with ai?
No
Of course it is.
Of course, all you need is record at least 20-30-40 mins of yourself speaking (if possible with a good quality mic)
And then read the docs:
In the context of RVC, the dataset is an audio file containing the voice the model will replicate. It can be either speaking or singing.
We got a variety of ways/guides which technically make you able to train models for free since long time ago.
wsg
yo
hi guys im new here and im hoping someone could help me?
im trying to make a AI version of the infamous playboi carti and lil uzi vert 16*29 grail "super soaker" ive never made AI music before ever in my life and was wondering if someone could hlep me
Please scroll a little bit in channel section and you'll find this channel. If you have a problem viewing model posts there, refresh and wait a little bit. 
Me and mods have been saying this for many times, but because some of you are new to this server so. 
can you perchance direct me to a tutorial (if there is one) on how to make ai music
i downloaded a carti voice model and im tryna make ai of a unreleased grail snippet of his
im sorry im new here
What? No. Most tutorial videos on YouTube are outdated, so no.
and there's none in here?
time to go witch hunting in every server im in for someone who knows how to make ai carti music
-rvc
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
your gonna have to still rap it since its a voice changer
what does rvc even stand for
woah woah woah slow down
retrieval voice conversion
i have to genuinely rap out this song??
either that or someone else has to
I don't know what kind of a voice model you've downloaded, wtf. But RVC is an audio voice changer. 
are you familiar with trackerhub
No.
better be one thats trained off leaked protools sessions
see the tragedy here is that i found the PERFECT ai carti voice i needed in #1175430844685484042 but whenever i click the link it says 'invalid password'
What kind of a voice model you've downloaded there? RVC or something else? Just want to make sure a voice model is compatible with RVC.
Who are you talking to?

some information bud
which one. Maybe weights archived it if its posted in #1175430844685484042
do you just post that everytime someone pings you
no idea.
trackerhub is a google spreadsheet full of various artists with basically every leak, snippet, released song, music video, artwork, tracklist etc from specific artists
https://docs.google.com/spreadsheets/d/1qzeFdpUPr7E0jOFwWSXd8LF30ZLjz1CSVEBiG8gPHTU/edit?gid=1792554832#gid=1792554832 heres the 'ai models' sheet on trackerhub it said at the top to join gg/aihub so idid
Some light blue users didn't turn the @ off. 
wont work if you dont click on my message - reply
whatd you delete my message for
Who deleted your message? I'm not a mod, I'm just talking around. 
is there an auto delete system or smth?
@sage musk why is your carti self titled ai model no longer available to download from huggingface? tryna download it for a AI im doing
Yes, there is. To make sure you don't send anything that's suspicious.
hola chat 🙂
Damn.
They changed Lora in Weights, now there are no pictures of real people. It's a waste
Come on Weights, go back to what was good. Generic generation is horrible, boring, the site will lose a lot of visitors. It's a shot in the foot
If you don't want it to be public, make it private.
Yo bro, this server is AI Hub by Weights, not Weights. 
If you've joined Weights' Discord server, I suggest you can open your issue about using this website there, or contact Bea the admin for more information.
Its name is WEIGHTS, the Discord is by WEIGHTS and I can't say anything here It's like buying a house, having a car, a bed and choosing to sleep on the street and on the floor.
Saying like this ain't gonna fix anything, just letting you know.
But I just want you guys to know, people need to give feedback here and on Weights.
I am sad
You are the creator of Weights
We know
🔥
NO, no. I didn't make this site, I just did some translate for this site.
Test yourself
It's been a while since I've been here. Good times with Applio and RVC
Weights doesn't "own" ai hub
I don't know if you are being SARCASM or not, but like please understand that I'm just here to give my basic knowledge to people about using AI. I'm not here to fix things like an admin would do, I'm not even a creator of this site either. 
It was sarcasm, I know Weights is someone else's
It's like 'YouTube by Google'
Ehh
More like the Waze situation where Google buys it and lets it rot slowly
you can download the lora if not wrong, then use another site or do locally
We got RVC on android so I wouldn't be shocked if there's SD for android somehow
Prob slow asf
there's A1111 for android but yea would be slow af
I like Weights because it's practical and free.
or you can also find a colab notebook
But man, why did they mess with something that was good?
Everyone is complaining about it.
Money?
So you can't generate "real people" anymore despite the lora? Trying to catch up here.
Idk
An average Flux LoRA model size is kinda large to download it to your phone, but it eats way less storage than downloading a Genshin Impact. 
No
It's not money, probably someone was correcting the sharpness of the generated photo and made this mistake.
I don't know what you are on, but I can still generate some Ye West images there on this site. 

Try generating pictures of Freddie Mercury
No.
That lora worked good before right? Maybe its a shit lora?
I wouldn't be shocked if they disallowed generating celebrities, its very common for these online generators to do that. There's lots of legal bullshit they need to keep up on
If you wanna get around it, there's local SD solutions or Google Colab
35 hours of minecraft

I host a server that's why
There are many options available, please listen. 
that showing single player

LAN world + reverse proxy (e4mc mod)
My laptop is unable to play Minecraft because it's too old. 
The problem is that it still generates Lady Gaga, Trump, for example.

my pc is too new but I don't have wifi

Maybe the lora is shit, or weights celebrity filter isnt very good yet. Idk
I don't know how they even block that stuff
Surely it just grabs keywords from prompts
They shouldn't ban it, after all it's just a copy, something generic.
my intel igpu laptop can still run it alone, not with a shader or ofc voice changer
Wi-Fi card in your PC or a Wi-Fi router? Buy one, and you'll be able to connect to Wi-Fi.
I hope they fix this, it was generating normal photos until now. Today it got worse.
Sadly due to legal stuff, they need to do what they need to do. Try looking into online notebooks (colab) to run these loras
I don't have wifi access or connection 
Discord messages coming through a wormhole?
mobile data fr
I didn't have to worry about generating image of human not looking good. I just train some random images of things I like as Flux LoRA models there.
it's s coming from mobile data dude

yeah 
I think the error is general, even private photos of ordinary people are being altered
dude I don't know anything about ai but I want to learn

A man commented on the feedback with a photo, he created a model of himself and generated photos of him working out in a fitness tank top. Today he tried to take a photo and a muscular woman was generated. He was very frustrated lol
How nice, they sold open source to some American trash
Hey, Weight guys, I'm letting you know that they removed the models more than half a year ago without my permission and they still haven't done it. You already know my discord
Much like Stable Diffusion XL, my recommended dataset image to train on Flux is to remove background from every image so it can focus on the thing, while every image starts off with 1024x1024px.
removing backgrounds of the images will end up in your character looking copy pasted in the image rather than naturally blending in it
Really? I'd put the white background instead. Sometimes, a trained model can be generated on a background and blend with it. 
i mean it does not removes the possibility of creating backgrounds, so is like if you prompt for your character in the night, the character will look very bright in comparison to the background
thats for 2d characters
idk about real ppl

I never train a model of real people, I only train models of cartoon/anime characters. 

good thing flux is open source
i think civitai allows real people though
weights gg kinda cringe cause it follows ai ethics
huh huuh huuuuh
I'd try to do both. 
Training an NSFW model on Weights isn't really worth it. 



too many pixels i can just barely read the text
fake nitro sticker
beloo
Sounds scary similar to the real never gonna give u up
hi yall
Row Row Your Boat in the style of Carnival by Ye Dolla $ign, generated on Weights. 
AI song generator on Weights is kinda goofy, it only has a few lyrics input to begin with.
,
Hi everyone, I'm newbie just joined a few minute
warwick ?
Huh
I talking about you sir..!!
Stare
Oh dang, another person had SD Creators...!!
just caption the background
Caption or labelling?
say "it features white background" or something
if the background is a forest, then you caption forest
but dont tag too much elements of the background, i noticed if you do that, the lora starts to believe these are parts of the character
Every dataset image I made is just simple white background. 
Let me tell you one answer for me...?!
How did you even get that a quest..?!
I just blur content for some purpose!!
For like WTH..!
If you want to promote something go to the #1159290752195633273 channel.
no 
this is ai generated?
@minor blade pls remove this
seems like a scam not gonna lie (is a scam)
sugar mommy... 
no sugar mommies here 
Dilly ding, dilly dong! A new RegalHyperus drum model just released!
Waking the King V2 (Drum model no. 548)
0.0
interesting, this reminds me of those old jukebox ai continuations
i will use the vocals of that song for my inference inputs
Does anyone know how to make an AI beat, based off an other beat?
if someone knows a project that restores vintage music, please tell me
not exactly a remix
like a feeling that the song was made nowadays
Subscribe and turn on notifications to be alerted of our uploads! https://bit.ly/3l3yzDc
00:00:00 Paul Whiteman - Among my souvenirs
00:04:28 Guy Lombardo - Sweethearts on parade
00:07:43 George Olsen - A Precious Little Thing Called Love
00:10:47 Nat Shilkret - Diane
00:13:39 Paul Whiteman With Bing Crosby - Ol’ Man River
00:16:54 Ruth Etting ...
you know that is vintage without any label
hi im kiyo im new to ai covers but i will try to learn
How did you fix this
E:\RVC1006Nvidia>runtime\python.exe gui_v1.py
Traceback (most recent call last):
File "E:\RVC1006Nvidia\gui_v1.py", line 59, in <module>
import librosa
File "E:\RVC1006Nvidia\runtime\Lib\site-packages\librosa_init_.py", line 208, in <module>
from .cache import cache
File "E:\RVC1006Nvidia\runtime\Lib\site-packages\librosa_cache.py", line 6, in <module>
from joblib import Memory
File "E:\RVC1006Nvidia\runtime\Lib\site-packages\joblib_init.py", line 113, in <module>
from .memory import Memory, MemorizedResult, register_store_backend
File "E:\RVC1006Nvidia\runtime\Lib\site-packages\joblib\memory.py", line 32, in <module>
from ._store_backends import StoreBackendBase, FileSystemStoreBackend
File "E:\RVC1006Nvidia\runtime\Lib\site-packages\joblib_store_backends.py", line 15, in <module>
from .backports import concurrency_safe_rename
File "E:\RVC1006Nvidia\runtime\Lib\site-packages\joblib\backports.py", line 22, in <module>
import distutils # noqa
^^^^^^^^^^^^^^^^
ModuleNotFoundError: No module named 'distutils'
E:\RVC1006Nvidia>pause
Press any key to continue . . .
This is not where you send your Python runtime issue. And I see that folder name, the OG RVC fork is outdated. So please go to #✨│ai-help for more information.
its not too hard. I suggest trying out Applio, with the models that are posted here in #1175430844685484042
i havent made a model in a long time
Hi,guys some one knows how to train a pretrain model on colab?
It's exactly the same as training a model just more data
I put all the data in a single speaker?
Yeah
Or i split them by speakers?
Nope
So i just disable the pretrain thing and just save last g and d right?
Save every d and g
A pre-trained model is a model that was trained from scratch so any model can be trained on. If you're looking for on how to train a custom pre-trained model, be careful on what you're doing. 
Are 35 hours of data okay to train a custom pretrain from scratch?
At this time session length, you'd run out of time on Colab free version. Are you sure about that? 
I have 83 units of colab pro soooo
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- 🆕 FaceFusion UI, by Nick088 Google Colab
- 🆕 FaceFusion NO UI, by Nick088 Google Colab
- 🆕 EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
Thankss
I'm not sure which one of these has the option for "training" available, but you might wanna try one by one with CPU mode on to save up your Colab unit points. If you found a notebook that has this option, you can use GPU later on. 
applio, rvc disconnected and rvc mainline
Also what are the HuBERT thing?
HuBERT: A transformer-based model that extracts text from raw audio, previously trained on a masked prediction task, which RVC uses to train the voice models. There are several types of Hubert, some of examples are ContentVec, Japanese Hubert-Base and Chinese Hubert-Large. You can learn more about it in the Applio Docs
Hi
The page is broken lol
- Cómo usar RVC Mainline Colab por Cauthess
- Guía de AICoverGen Colab por Eddy (Spanish Helper)
- Creación de un modelo con RVC desconectado (colab) por Angetyde
hi
Excuse me but is it possible to get rvc on mobile
hihihi 잘부탁해
Hi. 

Using colab yes
It works
it should work
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- 🆕 FaceFusion UI, by Nick088 Google Colab
- 🆕 FaceFusion NO UI, by Nick088 Google Colab
- 🆕 EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
Omg wow easyGUI is back? thank God because I still need this although I rarely train models,I'm already learned in RVC v2 Disconnected and the regular free weekly training on Weights is back because this is the easiest Colab ever and because maybe the models that trained from there has better quality than models trained from Weights
hi
hi
How to use FaceFushion?
ooh your in huggin face also : D

what is the most realistic female voice you guys recommend me?
not sure lol
real females
have the most realistic female voice
uh
like, youtubers or should i just search for females
lol
were you rushing or dragging
Dragging I think
Y
you shouldn't post full songs as attachment, no matter if it's generated

bro doing the opposite of what I said 
also the right place to post is #1159290752195633273
that’s the channel where you promote the content you make
It’s a role, u will be able to pubblish models on #1175430844685484042 and do free requests only
You can read https://docs.ai-hub.wtf/extra/model-maker-role/
Last update: October 20, 2024
Wdym? Show a screenshot
Btw you can send images here, no need to send in dms
And the error explains you that you need to upload the model either on weights.gg or huggingface
what’s ur model download link?
You need to put only either weights.gg or huggingface
not both of them
if neither of them works, maybe @night lake may know why
Try to wait for what razer says, he’s qc not me
Try putting the link u get by clicking the share button
Yw!
@little bobcat Btw it on the bot of #outdated-model-maker-role it should specify that its the share link
hello lovely people, im to dumb to understand all the stuff in the github. anyone got a video explaining how to set it up?
There is no updated video
All RVC & Wokada Videos are outdated
Explain me what u tryna do and what's ur pc gpu
I can link u the best written guide
Im trying to use a voice changer in general. The voice changer i was trying to install is voice changer client. From github. I am on a laptop with a i7 12th gen and a rtx 2050
I'm guessing you're talking about: https://github.com/w-okada/voice-changer
Which is commonly called Wokada / W-okada, it uses RVC (Speech To Speech AI Type) models in realtime for calls
There's also a fork (modified version) recently used, known as the deiteris fork as it optimizes the performance
-realtime
AI HUB Docs
