#🧬│ai-chat
yea
-rt
This interaction has expired, use the command
/guides realtime if you wish to see it again.
earlier we were supposed to build a knowledge graph (KG) in Neo4j, so there was no hierarchical structure. we tried to incorporate levels, attributes and classes (entities) as properties in our structure, and now we would like to turn it into a class/subclass structure.
1st link, it's a wokada fork, modified version with better performance
Do NOT watch yt tuts
hello
can i use voicechanger in windows 11?
That's a bot, and here it's mostly about RVC and wokada
Yes, what's ur PC GPU? And do u mean realtime?
no, in windows 3.1
i have laptop. with rtx 3060ti. and yes realtime
what?
im trying to follow it but what it says doesnt match whats happening
me using rvc on windows xp:
i should try that, unironically
Rtx 3060
Ayo? @cursive mirage level 1 !!! 
How many guys here with a MacBook? I'm having trouble with my AI tools on my Mac.
Yeah should be good
-rt
1st link, wokada, especially a fork which is a modified version that is optimized
If u have issues ask in help channels
It's a joke
Yk people believe that right 😭
Alright, explain what u are using and the issue better in #✨│ai-help pls
Mac isn't the best for AI, are you using RVC or wokada?
You would be able to run wokada and rvc inference (using models) fine, but not training (making models)
Hi @covert lake
????????????????????????????????
What's the best for AI you'd say?
wokanda
peak
4ever
For sure, for sure, always
Ayo? @real hazel level 1 !!! 
SİKSİNLER
true
hello air
Ayo? @echo bough level 2 !!! 
Is there a 48k Beatzforge PTH file?
hey
u don't need it, u need a .pth and added.index for rvc models
unless ur using another ai
here its mostly about rvc and wokada
Do you know coqui xtts?
yea, its discontinued btw
KIZ KARDEŞİNİ SİKİM
Whyyy
-rvc
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
-svc
So-Vits-SVC
It was making better tts than elevenlabs
in terms of quality gpt so vits is the best rn
so vits is old, rvc is better
yeah and ı am sorry my brother is... Freaky?
is there a rvc inferrence from huggingface that is free!?
Oh ok
Np
k
yes, ilaria rvc zero, did you try what i told you?
im going to try today
wish me some luck
reminder to show me the error on top right
i need to see the error to help u out
ok
goodluck lol
their product was, while good in quality, not hands off... so they lost to other competitors that were many
Oh
I see
you cant put hallucinating tts into production without any supervision
Is there a tut video for gpt so vits
because it either starts repeating things or speaks demonic voices, dont want some company scare a grandma with that
why does my final model sound robotic!? @covert lake
But
I do want
Your dataset is noisy
you can still use it for personal projects
get a forked repo
hi
Ayo? @junior pollen level 1 !!! 
that depends on the model quality, is it a model u trained urself?
i trained it myself
2 min 250 epochs
i dont really think so, u could check https://docs.ai-hub.wtf/tts/gpt-sovits/
did u use tensorboard?
no?
u need a CLEAN dataset
more epochs dont mean more quality, check https://docs.ai-hub.wtf/rvc/resources/epochs-tensorboard/
Last update: Feb 10, 2024
u need to use the tensorboard
@covert lake What Index influence should i use?
the index influence is the setting that specifies how much of the index is used; the index is the file that contains the model's accent
id leave it default
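a rough way to picture what the slider does: as far as I understand it, the features extracted from your input get mixed with the nearest matches looked up in the index, and index influence is the mixing weight. This is only a sketch under that assumption; `blend_features` and the toy numbers are made up for illustration and are not the actual RVC code.

```python
def blend_features(original, retrieved, index_rate):
    """Linearly mix input features with features retrieved from the index.

    index_rate = 0.0 ignores the index, 1.0 uses only the retrieved features.
    (Hypothetical helper for illustration, not an RVC function.)
    """
    if not 0.0 <= index_rate <= 1.0:
        raise ValueError("index_rate must be between 0 and 1")
    return [index_rate * r + (1.0 - index_rate) * o
            for o, r in zip(original, retrieved)]


original = [0.0, 1.0, 2.0]    # toy features from the input voice
retrieved = [1.0, 1.0, 0.0]   # toy nearest-neighbour features from the index

print(blend_features(original, retrieved, 0.0))  # index ignored
print(blend_features(original, retrieved, 0.5))  # half and half
print(blend_features(original, retrieved, 1.0))  # index only
```

so a higher value pulls the output toward what the model learned from its dataset (accent included), which is why a noisy dataset plus a high index influence can make things worse.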
u would need to retrain ur model with the tensorboard if u want a higher quality one, and also be sure the dataset is cleaned
What is the best vocal remover on MVSEP? Any suggestions
BS Roformer
tell me some of them who play ranked league, apex, valorant, etc. and what kind of issue it could be
-_-
-_-
-_-
I haven't really seen those types tbh
Unless u mean those ai who just play the game and take alot of time to finish a level like for super mario
yea didn't see those type lol
Hello
hi
Yo
someone know how can i change the name of the .pth?
does anyone have a good 2010 era eminem model pth i can use (love the way you lie sounding)
just change it lol
i cant
How to use this?;)
HI
h
Ayo? @gray dagger level 1 !!! 
a
can i see a link to a guide on how to train an rvc mode
Ayo? @elder willow level 2 !!! 
Any chance someone can do one free ai voice if that not problem?
for models request, ask in #1159289738314919936 or #1191429836321849435
Did it
will reply in #✨│ai-help as i seen u talked there too
Can I ask you something about gpt so vits
I can't really help you as I never used it, sorry
.fm
hello where am i? it's RVC group here?
i'm new to RVC, came from youtube. how can i start up?
RVC & Wokada yes
youtube vids are old. What are u looking for and whats ur pc gpu?
level 1: embed perms in help channels
5: embed perms in other channels
I don't know whats my GPU (someone told me that this GPU can't work)
Ayo? @glad forum level 1 !!! 
can I use my CPU?
I just want a voice transformer (Chinese), and I want to learn to train my models
I'm sorry my English is poor...
You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
You can't like convert your English voice to Chinese, i mean u could but would sound bad
Yeah I know, but my computer is not here...
U mean like convert ur voice to another voice in realtime for calls? Or like translate
Yeah, Chinese to Chinese (mandarin)
CPU is slow, would be better to use cloud (remote good pc) than locally (on ur PC)
Alright, could be possible if u get a Chinese model ofc
For Realtime Voice Changing for Calls on Cloud (remote good pc for those who don't have a good one, YOU CANT DO THIS ON MOBILE):
- Google Colabs (4 hours daily of free T4 gpu, easy to use, require only a google account) :
- Kaggles (30 hours weekly of better GPUs, T4x2 & P100, harder to use, requires an account and a phone number)
I remember, it's A2000
A2000?
I'm not sure, my computer is not here
I haven't really heard of that one, it's better u come back when u have ur PC there
Also bc u can't use it even on mobile wokada
U need AT LEAST a pc
For wokada
How can I get a sound model of a certain person (for example 郭德纲)
gotit
now I have a 郭德纲's solo 相声 about 20min, can I train a model with it?
Ayo? @glad forum level 2 !!! 
You can search rvc ai voice model at:
- #1175430844685484042
- #🔍│find-models
- https://weights.gg/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://applio.org/models
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.aihub.wtf/essentials/how-to-make-voice-models/
I don't know that language, and Wokada is just for using models in realtime for calls, for training you need RVC
It's another program, the same one used to make models for wokada
As you dont got a good PC, its better you use cloud (remote good pc) for training an RVC Voice Model:
- Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
- Applio (ui)
- Mainline (UI)
- RVCDISCONNECTED (no ui)
- Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
- Mainline (UI)
- Applio by Vidal (UI)
- Applio by Shirou (UI, no guide as of right now)
- Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
is wokada something like RVC? I only know RVC yet
Google colab = easy but low GPU time
Kaggle = hard but more GPU time
Wokada is a program specified for using RVC models in realtime for calls
Alr yw
Hey, is the guide on how to download TensorBoard on Applio from the AIHub website working for you? I’m getting a 404 error and can’t figure it out. Could someone send a link to the download page? On the official site, I can only install it through CMD, but that's not working for me.
Ayo? @fresh aurora level 2 !!! 
solved problem thx
Ayo? @main robin level 1 !!! 
@covert lake "Wokada" means the project "voice changer" by W-Okada, right?
and is called "日版RVC" (Japanese RVC) in China
I'm not a nick but yes that's true
I'm just using it but meeting some problem
Wao! I see 3q
heyy
has anyone made an ai voice for the luigi from that one video where he says "hello mario"
Yes
I deleted your yt video link, use #1159290752195633273
You can search rvc ai voice model at:
- #1175430844685484042
- #🔍│find-models
- https://weights.gg/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://applio.org/models
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.aihub.wtf/essentials/how-to-make-voice-models/
How long till I can use X-Minus again?
Hi all - Is there anything that'd prevent me in theory from training a YoloV8 model directly on an OpenCV Mat instead of NCHW blob data?
For the past 6-7 months I've been making a full body tracker for VR use and was thinking of ways to optimize inferencing process - currently around half of inferencing run time is just preparing the image and converting it to a blob
I'm looking at having 3-4 cameras all inferencing at 90FPS simultaneously while a game is running so performance is quite important - will likely be done on a companion compute card as not to hit game FPS too hard
I mean, I know it should work, but are there any special architectural considerations that were made that would mean NCHW works better?
Ayo? @glossy hill level 1 !!! 
maybe there are hardware differences with memory access speed that make NCHW ideal
but then, wouldn't it still be quicker to access the memory slower and not full-copy the image when making a blob??
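for context on why the blob step costs a full copy: an OpenCV Mat stores pixels interleaved (height, width, channel), while a blob is planar (batch, channel, height, width), so every element has to move to a new offset. Below is a small pure-Python sketch of the index mapping for a single image; the helper names are made up for illustration, and it says nothing about whether a network trained directly on the interleaved layout would perform as well.

```python
def hwc_offset(h, w, c, W, C):
    """Flat offset of element (h, w, c) in an interleaved HWC buffer."""
    return (h * W + w) * C + c


def chw_offset(c, h, w, H, W):
    """Flat offset of element (c, h, w) in a planar CHW buffer (batch of 1)."""
    return (c * H + h) * W + w


def hwc_to_chw(flat_hwc, H, W, C):
    """Copy an interleaved image into planar layout, one element at a time."""
    out = [0] * (H * W * C)
    for h in range(H):
        for w in range(W):
            for c in range(C):
                out[chw_offset(c, h, w, H, W)] = flat_hwc[hwc_offset(h, w, c, W, C)]
    return out


# 2x2 image with 3 channels; values are just the interleaved offsets 0..11
H, W, C = 2, 2, 3
image = list(range(H * W * C))
planar = hwc_to_chw(image, H, W, C)
print(planar)  # channel 0 plane first, then channel 1, then channel 2
```

the reads jump by `C` elements while the writes are contiguous (or the other way round), which is the strided access pattern that makes the conversion slower than a plain memcpy.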
whats the best AI for music
they not like us, they not like us, they not like us.
hey, is it possible to remove background music from a video in order to hear a character's voice?
feel like that sort of thing should be possible in this day and age
if the music is faint enough, regular inference will remove it
or you can use UVR
does anyone have a google colab to put an AI model on the acapella to make my AI sing the song?
@covert lake Are there colabs for inference?
Ayo? @elder willow level 3 !!! 
Yes
using colab for inference kinda not worth the hassle
either run local on cpu, it is not terribly bad
or use huggingspace
I wonder if you could just run it thru nvidia broadcast
yo
Hi
I had this idea of a ai cover album tbh
Ayo? @ionic flower level 1 !!! 
yes you can if you have background noise when using w-okada instead of using the not as good w-okada noise suppression
-rt
hi and hello, this is my into msg lol
and that removes BGM?
and what will it sound like when you add ai to the vocal split??
they not like us, they not like us, they not like us.
fein, fein, fein, fein!
eating my lunchly right now! #inthethickofit
what in the fuck?
what in the hawk tuah!
i cant wait for the new episode of talk tuah. preordered the new prime drink and stocking up on feastables to go with my lunchly as we speak! #hawktuahtuesday! #spitonthatthang!
i dont like that they give you such a small size of feastables and prime so i ordered the party size!
who wants some?
no
I feel like you should get muted for this
from the hawk tuah to the lunchly to the
what in the hawk happened tuah freedom of speech?!
Nah you sound under 13
Reversed Demoman Crying
im 29
i kinda agree with cxsmo
then that's even worse imagine being that old and still saying that stuff.
im not under 13! that would be violating discords terms of service which i abide by dearly my friend
2 words. red flags.
Ayo? @ionic flower level 2 !!! 
what yall know about the talk tuah podcast
but a lot of people lie
everything! i cant wait for her to reveal whom pookie is 👀
Cringe
yeah you're under 13
anny tips?
nah i got nothin
yeah that will happen what gpu do you have
youre so cool
Ayo? @solar agate level 2 !!! 
do I need a 4090 or something? lol?
hawk 1: what's 4-2
hawk 2: uhh
how much did it cost for that?
LMAO
bad joke detected
cool
yeah, running w-okada while gaming isn't really something you can do on a 3070 unless you mess with the settings, but then you'll have delay and you'd also need to turn the game settings down
tell one right now mr funniest guy in the world
So I need a 4090 then basically?
no you brainrotted F A C K.
yeah or you can just get a Intel ARC gpu as a secondary gpu just for gaming or the other way around.
Uhh which one is cheaper, how much does it cost?
Why did the bird start a podcast about nature?
Because it wanted to share all its insights on the "Hawk Tuah Podcast!"
u can still play games like valorant, not demanding ones
Im playing valorant but its still laggy
just to reopen your pc build and add it plus setting it up?
so sad
260 for a a770
better try fork wokada, and ur cpu wont suffer as well
I'm using fork wokada actually, but im lagging more using that
even at 1000 ms im lagging, but I use 2.7s extra
who wants to be my hawk-tuah the bird festival (im the pigeon)
get out.
i am not in violation of the rules actually
lower game settings as needed, enable DLSS, and fps cap to 60 or according to monitor refresh rate
intjs! i love those!
reminds me of when i went tuah the eagle-only party as a hawk! (hawk tuah) @polar flax
Is drowsy a bot?
Ayo? @weak scarab level 3 !!! 
im thinking the same thing or he might be a tos breaker.
ayo?! erik passed up level tuah! congratulations!
if u only trash talk and act like a kid who wastes ur parents' credit card, why are u here?
trash talk?
when? ive only been nice and kind!
Damn I tried lowering to lowest, I think I just need a better PC
god damn! pure comedy!
not that hard as long as your cpu has the PCIe lanes
i am here, simply to share my interest in AI and download models that the kind members of this community has provided! and of course make friends along the way!
and also you are 12
erik, you have some thermal paste to keep it cool?
nope! i am 36
Yeah I applied new thermal paste a day ago actually
lies
ok, feel free to share ur AI creations in #1159290752195633273
I have AMD cpu, but thats fine right
good.
i dont have any AI creations
Should I use server instead of client?
no one believes anything you say not even your parents or your cat or dog if you have one.
ryzen 5600 should be fine
your choice
then make one
go make it
i am not obligated to! that is not a part of the server's TOS! i will pass, thank you!
Loud Wheezing Intensifies
imagine liking ai but never making anything with ai
many people do, actually!
they are the ones that don't have good pcs
AI doesnt only exist on PC, it exists as a separate entity i.e. Artificial intelligence is the science of making machines that can think like humans
to correct you, @earnest dragon. cant argue with logic!
but if you like ai and have a good pc then you should be at least making something with ai
@solar agate pop in vc and square up cuh
that isn't true! In essence, while creating with AI can be fulfilling, it’s not a requirement for everyone who enjoys the technology. Each person's relationship with AI can be unique and valid, regardless of their creative output.
Ayo? @solar agate level 3 !!! 
drowzy pop in vc right now
this response was made from AI, something i enjoy whilst having an android!
cant argue with logic @earnest dragon !
drowzy u suck
get in bro
heyo! i leveled up from level tuah- tuah level 3!
drowzy are u mentally afk?
Ayo? @dull night level 1 !!! 
heyo! congratulations on the level up @dull night ! next level is looking like youre going tuah level tuah!
fuck you
@earnest dragon which gpu do u recommend to use 2nd only for processing the voice?
do you think its possible you can do a small request? ^^
Ayo? @chilly tartan level 1 !!! 
get in vc rn
YO MATE WE GOT Lunchly in the chat
Ayo? @ionic flower level 3 !!! 
get in fucking vc right now before i get angry
im playing val bro
ayo!! congratulations on the level up @chilly tartan next level is level tuah!
join vc
ooooo! @ionic flower with the level 3 👀 big upgrades i see!
join vc
who has tried the fiesta nacho luncly? its my favvvvvvorite!
seems like ur system issue, try update nvidia driver and debloat ur windows/disable unneeded background processes
alisa
@polar flax you wish to be a streamer someday? i know a thing or two about content!
what request?
Alisa I need your help
basically to become a successful streamer you can:
- make videos
- alternatively, you can stream
pls stfu
drowzy is sexist cannonical
secksist
I need to change my bio there
im trying to get two voices out of these clips but the music is in the way
change of plans?
drowzy
If you could have dinner with any fictional character, who would it be and what would you want to talk about?
and me
we just wanna talk with you, you're so cool
I just updated my driver yday, how do I debloat? I have 4tb free space and no programs almost
me personally, im more of a harry potter kinda dude! would love to hear thoughts and opinions!
I reset my entire pc yday
if there are any AI harry potter voice models i would LOVE to try them!
drowzy, I wanna talk tuah you
i wanna hear your voice~
thats it for tonight! im going tuah go through the motions and head to bed! hope to talk-tuah you guys tuah-morrow!
stfu you fucking basic bitch @solar agate
how is the valorant performance alone?
its good
@drowzy stfu dont even say a word tuah everyone in here
Ayo? @dull night level 2 !!! 
its weird because the regular w-okada worked better than the new one (forgot name)
maybe I have a setting that is wrong?
heyo! congratulations on the level up @dull night you managed tuah get tuah level tuah! 3 tuahs in a row! holy cow!
shut up
the regular one is smooth? well go use it
If you could instantly become an expert in any skill or hobby, what would you choose and why?
iirc 3080 users might have similar experience on it
alisa do you use voice changers urself? do other people notice you using it?
I feel like some people can tell
@polar flax Fair enough! I’ll share my ‘expert’ tips after I wow you with my voice-changing skills. What character should I channel first?
scamming indian
a arc a770 for the price
I have tried the original one on a borrowed 4070 pc before, now I have no idea with just a potato igpu laptop
ah, the classic scammer! Let me grab my best Indian accent and we’ll see who can pull off the best impression. ready?
You have no tips or anything only people that have made stuff or are well knowledgeable with the stuff are the people that can give good tips and stuff
Ok thanks you two, appreciate it
are you ai
hello! you have won a great prize today! just give me your bank details, and I will send you all the money! haha, just kidding! but seriously, let's talk about something fun!
I mean honestly its surprising how well voice changers sounds nowadays though
Nah he's probably 12
ill send my bank details
or worse
but you have to call me with face time @solar agate
I think if you use voice changer for YouTube videos people wont be able to tell, if you use some post editing
If you could travel anywhere in the world for free right now, where would you go and what’s the first thing you’d do there?
just trying to make friends! haha
your house
gen chat moment
yeah and one thing no one has done is surround sound rvc
what do you mean?
If you could swap lives with any anime character for a day, alisa, who would it be and what adventure would you want to have?
stop yapping DrowzyGPT
drowzyGPT moment
haha, I guess that makes me the ultimate sleepy sidekick! What’s your superhero name?
Ayo? @solar agate level 4 !!! 
heyo! just reached level four! perfect time for the lunchly break!
surround sound rvc? what do you mean by that?
man KSI's new song is so grrrrrrrrrreat says tony the tiger
Hi
typical social engineering using chatgpt
unfortunately not this moment
ur not cool enough
lol i keep leaving this server just to rejoin
https://github.com/Tiger14n/RVC-GUI/
is there any newer versions of this? Its archived
yea im too nerdy on my things
just dont wanna learn another tool if its out of date
ill keep doing research, ping me if anyone would like to respond (:
check the recent rvc and guides in https://docs.ai-hub.wtf/
Last update: Mar 10, 2024
gptcore
Thanks!
About the vocals, is it possible to remove specific instruments as well?
I don't update rvc cause im so cool 🥸
i use mainline lol
UVR (local), uvronline.app, mvsep.com
thanks!
hi
surround sound upmixing, so that it sounds more spacious and like you're talking to them in their room, just like surround sound normally does
oh is that a good thing?
actually ima start a new YT channel and im planning on using a voice changer so any advice is good
yes it would make it sound higher quality because of the positioning and Soundstage and stuff
interesting
Ayo? @weak scarab level 4 !!! 
how do you do that? is it easy to do?
@dire summit que haces acá putita, quieres ser mujer?
not really that easy you need to install a bunch of stuff
there is no tutorial because the setup is something I put together using some vsts a daw and vb matrix coconut.
rvc datasets and inference outputs are always mono since dry voices/sounds like used in 3D games are so.
surround sound, reverb and shit are just post-processing
Ok makes sense
tbh im noob when it comes to sound
im just impressed by how quickly the tech is evolving, its crazy
do u have like an example of how it sounds, im kinda curious
yea if u want to be a streamer and play games, ideally get a second gpu/laptop
Ayo? @polar flax level 43 !!! 
I dont think I will stream, I'll just release YT videos but I do play a lot of games and its fun to use different rvcs sometimes
u dont need realtime voice changer for that, just use offline rvc to add on it
offline rvc? how does that work?
is SVC the superset of RVC?
it is regular rvc duh
Yeah but like, would I record a video with my voice and then add it in post editing?
I basically want to mix my real voice with a voice changer to have a balanced voice sort of
just to make it a bit 'better'
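if the goal is to blend the real voice with the converted one, the usual idea is a dry/wet mix done in post. A minimal sketch, assuming both tracks are mono float samples in the range -1 to 1, time-aligned, and at the same sample rate; `mix_dry_wet` is a made-up name, and in practice you'd do this with two tracks in a video editor or DAW rather than in code.

```python
def mix_dry_wet(dry, wet, wet_amount):
    """Blend the original (dry) voice with the converted (wet) voice.

    wet_amount = 0.0 keeps only the real voice, 1.0 only the converted one.
    (Hypothetical helper for illustration.)
    """
    if len(dry) != len(wet):
        raise ValueError("tracks must have the same number of samples")
    if not 0.0 <= wet_amount <= 1.0:
        raise ValueError("wet_amount must be between 0 and 1")
    return [(1.0 - wet_amount) * d + wet_amount * w
            for d, w in zip(dry, wet)]


dry = [0.0, 0.5, -0.5]   # toy samples of the real mic recording
wet = [1.0, -0.5, 0.5]   # toy samples of the RVC output for the same clip

print(mix_dry_wet(dry, wet, 0.25))  # mostly real voice, a bit of RVC
```

keep in mind the two tracks have to line up sample for sample, otherwise the mix will sound like an echo.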
if u mean so vits svc that was before rvc apparently??
really? seems svc has better quality, idk
while RVC model has only 50mb the SVC is like GB
-svc
the mic voice (post process with denoise and stuff) and gameplay video with system sound
got it
Ayo? @tired jasper level 1 !!! 
Hi
Damn I feel dumb af ngl, I think I need an explanation that even a 10 year old understands
So umm... I was searching for a Gura AI voice model on weights gg and found these comments. So does anybody know who Felt is and whether it's true he has a custom Gura model that he shares via DMs ? (looks like I can't attach screenshot here)
Screenshot on imgur bc I can't directly attach here https://imgur.com/a/So2hufn
felts not in this server no more 
Sorry, I'm pretty new to all this, just looking for a good Gura model, so is Felt's Gura model publicly archived anywhere for download ?
useful info: RVC exaggerates autotune af. Even when the autotune in the original audio is very discreet, in the RVC output it's obvious it's autotune
I am the Globgogabgalab, The shwabble-dabble-wabble-gabble flibba blabba blab.
I'm full of shwibbly liber-kind,
I am the yeast of thoughts and mind,
Ohohoh, Hahahaha.. Splendid! Simply delicious!
Im the gobalabagla
dude what

So what's the best Gura model out there rn ? The one with highest number on weights gg I tried didn't work well with a song (Legacy from DMC 5) because of cracked voice in a certain part
Ayo? @snow briar level 1 !!! 
dude rvc is awesome
I have a example of what a song sounds using it but it's on the older settings so not as good because didn't have time to record the new setup. I can send a screenshot of what the setup looks like
Yee would be sick man
another tip: the cleaner the input, the better the output
not always because some models can't deal with all voice noises
i recorded the Grover laugh with my voice, no voice model can deal with better than grover model
this is what the setup looks like
Tried several different Gura models in the #1175430844685484042 so far haven't found a one that wouldn't voice-crack while singing this song yet...
cubase
what RVC index file do exactly?
.
Ayo? @tacit falcon level 10 !!! 
is there any trick to generate output that is less likely to have voice cracks ? It looks like all the Gura models have the same issue with this one song. I don't know about the settings like Voice conversion options and Audio mixing options so I would really appreciate it if anyone can help me out in this part. Thank you
clear input
Ayo? @tired jasper level 2 !!! 
try put input in adobe podcast before RVC
So I should get the acapella version of the song ?
for sure
the cleanest possible acapella
any noise in your input will negatively affect the output
Also which Gura model in your opinion is the best one rn, in an ocean of countless Gura models out there ?
idk who is Gura, let me search
Sorry, I mean Gawr Gura (Hololive Virtual youtuber)
but one thing is true: bad input gave usable outputs for me with well-trained models, and very bad outputs with badly trained models. When i put in good input, the badly trained models became usable
models with more epochs should be better if not overfit
so a weeb
@snow briar what music are you trying?
"Legacy" ( DMC 5 song)
yeah
There is always a part near 1:05 that cracks inevitably
inputs with background noise, sounds, polyphonic, reverb, and other distortions can ruin the output results
In my case it's just the original song audio and not a cover or something
can't find an official acapella version of it anywhere on the internet though
u need to separate from instruments, back vocals, reverb, noise. with good separation models in mvsep.com, UVR, etc can produce good enough outputs
also ideally avoid duet songs as there's no perfect model enough for that
This requires strong GPU right ? If so rip for me because all I have is a weak laptop that can't even run AI art models
do online in mvsep.com
there are also huggingface spaces for UVR, and also Flux space for AI art
Thank you, I was under the impression that those online AI services that require GPU usage would be locked behind some sort of subscription paywall Lol
hi skibidi rizzlers!
huggingface spaces use shared ZeroGPU, if u hit usage limit, try again in another hour
hi skibidi ohio gyatts!
Sorry, I just found out that it was an issue with the input, not of the AI models
sorry sending DM before asking
i can't send files here, and even in DM without compressing it and split
it's 16MB large
i'm surprised there's no channel to share audio
how should i post this?
solved
Hii how do I use the AI links ( in this group) to make some people sing the song I want?
so, make an ai cover right? Whats ur pc gpu?
I can't use pc so I'm using my phone
alright, you could technically do it locally on ur phone (directly running on ur device) but would be very slow on CPU
Its way better you use cloud (remote good pc)
use ilaria rvc zero which is the fastest for inference (using models)
Ilaria RVC: CLICK HERE 🤗
Guide on how to use it: CLICK HERE 📝
Don't forget to thank Ilaria if you find it useful! 💖
follow the ilaria rvc zero guide
Ah Tysmm
yw
Did I just got Rick rolled at the same time 😭💀
Yes, I had to add that too 
ofc its a real guide, i just added a rickroll to be silly, u can see below
Last question I think,how do I change the model?
Lol 😆
you should download it in the model loader tab first, then in the inference tab click refresh models and select ur model
What's the difference between gemini vs gpt4 for coding, say a range from 1-10
if you were to rate them
yw
depends on which gemini, on the 1.5 pro id give it a 8, i use it mostly for coding as it got long text context, i use the google ai studio
Idk what im on but I just installed it's plugin on android studio, so that's prolly the free one
you used gpt4o? would you say gemini 1.5 pro is better? or they are around the same
Ah thank you so much I did it 🙏🏻I only couldn't find an AI voice for one singer that I needed
Tysmm
yea i used that too, but the bad thing is the context length isn't as long, at least in the free tier, but its good too
id say for short thing use gpt4o, for long ones use gemini pro
oh i used the one in the google ai studio
Has anyone noticed how easy it is to bypass chatgpt filters?
All you need is a thesaurus tbh
which one
4o and 4o mini,
Sometimes it just goes off the rail into AO3 level stuff
I don't know if it's just my account, but why is it so horned out. The trick is to give it an inch. It will take it 10 miles
What’s the best video ai? Preferably an image to video sort.
-hf
- UVR5 UI, by Eddy and Ilaria Huggingface Spaces
- Ilaria RVC Zero, by thestingerx Huggingface Spaces
- RVC⚡ZERO, by r3gm Huggingface Spaces
- Applio, by IA Hispano Huggingface Spaces
- 🆕 FaceFusion UI, by Nick088 Huggingface Spaces
why is it that WITH ALL voice changer when i speak at the end it kinda twitches/echoes (the voice)
ask in an help channel #🔍│help-w-okada
hello
hi
hi
.getkey
Ayo? @grand mountain level 1 !!! 
quick question whats the best ai tool to generate voices, i want engaging voices tha sound super realistic
7r mom
You want to make voice models? What's ur PC GPU?
no i want online tools that convert text into voice
Ayo? @royal laurel level 1 !!! 
but i want realistic ones
then GPT-SoVITS
Online tools depend, it would be better you tell me ur PC GPU
If ur PC is good enough u can do it locally so have to not worry about GPU time
is it easy to use? id prefer an online site thats quick i dont want to download anything
1650ti
Yeah nvm
Realistic and emotional?
any online tools yk?
you to trash for ai
paid there is elevenlabs
something that sounds like a real human and is engaging
Gpt so vits is the best then, u want custom voices or just random tts?
hi
depends on the voice
Gpt is for voice models
eleven labs or hugging face?
It's the best out there
chat gpt?
no
Hey guys I am looking for someone who is interested in working on AI Models so we might work together.
and there is this one paid zero shot tts I have that even works if the audio has background noise but the background noise sounds low quality but speech sound good.
what ai models
Gpt so vits
Lemme share u all ways to tts
There are different Text To Speech (TTS) AIs:
GPT So Vits: RVC isn't as good as GPT So Vits for TTS, but GPT So Vits (few-shot TTS, meaning it needs just a little training for models) can't use RVC models (and vice versa), and it's limited to English, Chinese & Japanese. If you wanna check out GPT So Vits instead, read https://docs.aihub.wtf/tts/gpt-sovits/
Freemium 11labs: An easy way to do TTS is https://elevenlabs.io/. You can't use RVC models on it, but it's a mostly premium, easy way to get good quality TTS
FishSpeech: FishSpeech is a 0 shot (no explicit training needed) TTS, if you got a good pc you can use it locally else use their site
With RVC Models:
RVC is natively for Speech To Speech, but forks such as ilaria rvc mainline & applio have built in tts (using Microsoft Edge TTS to make a generated tts audio, which i suggest you to choose a tts model that is the same gender and language of the rvc model you wanna use, and then convert it with rvc)
If you wanna do tts locally with RVC Voice Models (if you got a good pc):
If you don't got a good pc you can do tts with RVC Voice Models on cloud:
- Ilaria RVC Zero (running on an A100 GPU, the fastest free RVC on cloud) and the guide
- Use Applio UI Colab (with Google Colab's free daily-limited T4 GPU)
- if you don't wanna use Edge TTS, you could try another TTS AI from our TTS index and use the output as an input in RVC
@covert lake bark tts listed as best TTS is a big question
whats the difference between 11labs and fishspeech?
11labs is better but paid, fish speech is an open source 0 shot tts, easy but not as good as gpt so vits
i dont mind paying, Thanks
It's good for emotions
Idk if I might post video of my work here?
but it is not good for any text
Can't listen rn not home
1- 'to take a.... " then a sound turns into a propeller plane flying overhead for 5 seconds '... then the text continues
2 - random music cutting in
yeah I cant post my vid so..
Ayo? @old seal level 1 !!! 
if XTTS is simply hallucinating from time to time, then bark is an LSD trip from start to finish
CogX 2b, not even 5 - 06:48:47-256343 ERROR CogVideoX: CUDA out of memory. Tried to allocate 35.31 GiB. GPU
trying with less frames
Wtf never had those type of issues
Did you use a random preset?
I'm not sure.. I've never managed to get anything useful from bark
I remember it had issues with random presets
That's why it's better to use one
hm.. not sure about presets at all, I used it from coqui
and using a reference file
I installed it using their package
I mean Google colab lol
Uhh not sure about how coqui works, but it has presets voices
i need a free adobe podcast alternative (opensource if possible), already tried Apollo and audiosr, neither performs as well as Podcast
Oh yeah u might mean 0 shot as in https://github.com/Nick088Official/bark-gui-fix/
I mostly used the presets voices which worked fine
lemme try again
from coqui that is
preset or 0-shot should not matter, it would just compute the speaker latents and embeddings
Other than resemble enhance and hifigan + bwe (https://discord.com/channels/1159260121998827560/1205637385245958215) I don't think there's anything else to try
performance? u mean speed or quality?
i had no luck with resemble enhance, it cropped all my audio
both if possible, but i'm focusing in quality
note that Apollo (mp3 enhancer) was literally trained on original lossless dataset and their corresponding mp3-compressed one
isn't hifigan the base model for any other?
well no perfect solutions so far, but u can denoise/clean the result audio from noise/artifacts (though still not ideal for making RVC datasets)
sure, all restoration models are trained like that, but the bad news is that the model learns one pattern of compression and becomes very bad on uncommon patterns that weren't in the dataset
Weird tho
I hope the Apollo devs keep improving it, since it's still new
i'm trying to avoid denoise audio manually 😦
Well rip then
Apollo trains on wavs that it damages itself by compressing them into mp3
There is no other open source upscale option
but I did not see any specific compression levels it uses
oh oh, synthetic dataset 💀
CogXVideo2b
Ayo? @chilly lake level 19 !!! 
that's what i'm talking, any other way of damage will be ignored
i guess Apollo can be finetuned to remove noise
the problem is to finetune by myself
lol
I remember they said it was memory efficient
seems more like it may need a hundred day length of dataset
maybe not too much
Pirate adobe podcast 🔥
mission: find the most varied dataset of bad quality speech
with ground truth
💀
could be synthesized from adobe podcast itself if i had paid version
Ayo? @tired jasper level 6 !!! 
i will need to merge noises and normal speech to create the LQ
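A rough sketch of that "merge noise into clean speech" idea, in case it helps anyone trying it: mix a noise clip into a clean clip at a chosen signal-to-noise ratio, so each (clean, noisy) pair can serve as ground truth / LQ. Everything here is an assumption for illustration (the `mix_at_snr` name, plain float lists instead of real audio files, the sine and random hiss as stand-in data); it is not how Apollo or Adobe Podcast build their datasets.

```python
import math
import random


def rms(samples):
    """Root-mean-square level of a list of float samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def mix_at_snr(speech, noise, snr_db):
    """Return speech plus scaled noise so the mix has the requested SNR in dB.

    Both inputs are lists of floats at the same sample rate. The noise is
    looped or trimmed to match the speech length.
    """
    if not speech or not noise:
        raise ValueError("speech and noise must be non-empty")
    # Loop/trim the noise so it covers the whole speech clip.
    looped = [noise[i % len(noise)] for i in range(len(speech))]
    noise_rms = rms(looped)
    if noise_rms == 0:
        return list(speech)
    # SNR_dB = 20 * log10(rms_speech / rms_noise), solved for the noise level.
    target_noise_rms = rms(speech) / (10 ** (snr_db / 20))
    gain = target_noise_rms / noise_rms
    return [s + gain * n for s, n in zip(speech, looped)]


if __name__ == "__main__":
    random.seed(0)
    # Stand-in data: a 220 Hz tone as "speech" and white noise as "hiss".
    clean = [math.sin(2 * math.pi * 220 * t / 16000) for t in range(16000)]
    hiss = [random.uniform(-1, 1) for _ in range(4000)]
    noisy = mix_at_snr(clean, hiss, snr_db=10)
    print(len(noisy))
```

Varying `snr_db` and the noise clips per example is one way to get a more varied LQ set than a single fixed degradation.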
someone want to join me?
hi
can anyone help me get the right version of MMVCServerSIO for my rtx 3060 !!
Thx u
@covert lake ran another test with 8 different references. 1 - female voice instead of reference male, 2 - stuttering and repeated parts of the phrase, wrong voice, 3 - wrong voice, 4 - wrong voice, 5 - wrong voice, radio broadcast noise, 6 - female voice again, 7 - wrong, 8 - wrong
yo guys. how do i make an ai voice on huggingface
anyone
You can't train voice models on HuggingFace spaces, the spaces are made for inference (using models)
Weird, might be it's not good for 0 shot, I will try with presets in some mins when I get home
how do i get the model
Do you want to search for already pre made models or make one yourself?
i wanna make one myself
Ayo? @gaunt sable level 1 !!! 
What's your PC GPU?
You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
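If task manager doesn't show it clearly, a small script can do the same check. This is only a sketch and only covers NVIDIA cards: it assumes the NVIDIA driver is installed so the `nvidia-smi` tool is on PATH, and the helper names `parse_gpu_csv` and `list_nvidia_gpus` are made up for this example.

```python
import shutil
import subprocess


def parse_gpu_csv(text):
    """Parse 'name, memory' CSV lines into (name, vram_mib) tuples."""
    gpus = []
    for line in text.splitlines():
        if not line.strip():
            continue
        # Split on the last comma: the name comes first, memory last.
        name, mem = [part.strip() for part in line.rsplit(",", 1)]
        gpus.append((name, int(mem.split()[0])))
    return gpus


def list_nvidia_gpus():
    """Return [(name, vram_mib), ...], or [] if nvidia-smi is unavailable."""
    if shutil.which("nvidia-smi") is None:
        return []
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=name,memory.total", "--format=csv,noheader"],
        capture_output=True, text=True, check=False,
    )
    if result.returncode != 0:
        return []
    return parse_gpu_csv(result.stdout)


if __name__ == "__main__":
    found = list_nvidia_gpus()
    if not found:
        print("no NVIDIA GPU found (or nvidia-smi is missing)")
    for name, vram in found:
        print(f"{name}: {vram} MiB")
```

The name and memory size are the two things people ask for here, since they decide whether local training is realistic.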
that's a cpu
be sure to check the GPU tab
i dont see GPU
you don't see any GPU 0 or GPU 1 in task manager?
could u send a screenshot in #✨│ai-help
Hello, my friend! [laughs] how are you?
Voice_Preset:
v2/en_speaker_6
but yea added that 'umm'
so it hallucinates too
give me the plain banana business
```
I want to take the first steps towards establishing my own banana business.
The plan is straightforward: find a suitable location along the coast with ample sunlight and fertile soil, invest in high-quality banana plants, and set up the necessary infrastructure for irrigation and maintenance.
Marketing will be key: I'll focus on spreading the word through various channels like social media, local markets, and forming strategic partnerships.
While there will undoubtedly be challenges along the way, such as pest management and navigating market fluctuations, I'm committed to facing them head-on with resilience and determination.
Above all, this venture is about pursuing something I'm truly passionate about and building a legacy that reflects that passion.
```
yea just noticed
u want me to inference that?
im inferencing another test rn
666 
God, 'm so bored today
v2/en_speaker_7
okay that hallucinated badly
will add a hallucination warning to bark tts, thx for telling me
bark is the worst of them all
tortoise is slow and 0-shot does not match the voice at all
nah the worst is gtts, i only added it cus that index was just the tts i played around with
oh ye i should deffo delete that, xtts is a better fork of it
that guide is like a year old tbh
gonna remove gtts too while at it
Hi
Hey ppl
can some one turn https://www.youtube.com/watch?v=lpmTabF4FVA&list=PLQU2xjVsgNWtlq7maEKmdR3RVi-1wBpdE&index=3 in to a mp3 please
Ayo? @hazy storm level 1 !!! 
can some one turn https://www.youtube.com/watch?v=lpmTabF4FVA&list=PLQU2xjVsgNWtlq7maEKmdR3RVi-1wBpdE&index=3 in to a mp3 please
sup
here i can have ONNX
Ayo? @copper quiver level 4 !!! 

don't talk about NSFW here.
don't ask the same question multiple times, just use yt-dlp or any other youtube video downloading site
stop asking the same thing over and over 3 times
it's not even a help channel
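For the yt-dlp route, a minimal sketch of the command for "video to mp3". The flags used are standard yt-dlp options, but the helper name `ytdlp_mp3_command`, the output template default, and the placeholder URL are just assumptions for the example; the mp3 conversion also needs ffmpeg installed.

```python
import shlex


def ytdlp_mp3_command(url, out_template="%(title)s.%(ext)s"):
    """Build a yt-dlp command that extracts audio and converts it to mp3."""
    return [
        "yt-dlp",
        "--no-playlist",          # only the single video, even if the URL has &list=
        "-x",                     # extract audio only
        "--audio-format", "mp3",  # convert to mp3 (needs ffmpeg)
        "-o", out_template,       # output file name pattern
        url,
    ]


if __name__ == "__main__":
    cmd = ytdlp_mp3_command("https://www.youtube.com/watch?v=VIDEO_ID")
    # Print a shell-safe string to paste into a terminal,
    # or pass the list straight to subprocess.run(cmd).
    print(shlex.join(cmd))
```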
ok
good evenin
no ??
morning? afternoon? noon
#1159290752195633273 , or else your promos will be deleted in every other channel
what software do you guys use for ai covers?
RVC,
What's ur pc GPU?
Hi guys, someone please help me with how to change webcam settings in facefusion. it's stuck on my inbuilt webcam but i have a better one already connected, works fine in other apps @covert lake
rtx 4050
Ayo? @elder willow level 1 !!! 
laptop gpu? and how much memory
laptop 16 gb
tysm
yw
gm
get better monitor, kb/mouse
supp
It's been a year since I last talked on an AI server.
welcome back bro
I've been remastering the entire brainrot nursery rhymes EP by the Rizz Records since yesterday, and it's done.
Thank you. 
with what ai voices?
or did you use your voice ig
or...idk
Eh, what I meant is to remix the track stems to get louder. The AI cover is another project.
Ayo? @solar torrent level 1 !!! 
oh ok, well, i have some plans for my ai covers
in training I don't understand the dataset step, it tells me to place the training folder but I don't have one, do I have to create it or use one from the rvc?
I've also finished getting like 72 people (AI models) to sing each bar of BBL Drizzy by Metro Boomin on one single track.

fake scam ad
Hey may you help me with the accents?
how?
an accent is what happens when a voice model has no features that match the source feature exactly
index search=0 eliminates most of it
The second one sounds like some voice TTS from 20 years ago. 
I am training a model on my voice but I have Indian accent, is there a way to remove an Indian accent and add american/british accent?
same way
use an audio with british accent, infer it with 0 index use
it will be your voice with british accent
there ya go
or...
alternatively get a british model index
here ya go
guys is astra labs down
@earnest dragon May I dm you?
I drive
hey may I send you my model so you can check it?
okay
I guess I need to explain the workflow I am looking for:
I have trained a model on my voice with an Indian accent.
now I am recording my own voice like I am narrating something or singing a song,
now I am trying speech-to-speech where if possible I am looking for a way to change the accent of my voice model, so I can use my voice but with an accent.
but I guess it is not possible
the accent comes from an index.. like in the example above I used an english text and an english speaker, but an index from a russian model with 1.0 index search, so the output has a russian accent even though neither original or voice model had an accent
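To make the index explanation a bit more concrete, here's a toy sketch of what the index search ratio roughly does to each feature frame: look up the closest vectors from the model's index and blend them with the source feature. This is a simplified illustration and not RVC's actual code; the function name `blend_with_index`, the inverse-distance weighting, and the tiny 2-number vectors are all assumptions for the example.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def blend_with_index(source_feat, index_feats, index_rate, k=2):
    """Blend one source feature vector with its k nearest index vectors.

    index_rate = 0.0 keeps the source feature untouched (source accent),
    index_rate = 1.0 replaces it with the retrieved one (model's accent).
    """
    if not 0.0 <= index_rate <= 1.0:
        raise ValueError("index_rate must be between 0 and 1")
    if index_rate == 0.0 or not index_feats:
        return list(source_feat)
    # Pick the k index vectors closest to the source feature.
    nearest = sorted(index_feats, key=lambda v: sq_dist(source_feat, v))[:k]
    # Closer vectors get more weight (small epsilon avoids division by zero).
    weights = [1.0 / (sq_dist(source_feat, v) + 1e-9) for v in nearest]
    total = sum(weights)
    retrieved = [
        sum(w * v[i] for w, v in zip(weights, nearest)) / total
        for i in range(len(source_feat))
    ]
    # Linear mix between the retrieved feature and the original one.
    return [index_rate * r + (1.0 - index_rate) * s
            for r, s in zip(retrieved, source_feat)]


if __name__ == "__main__":
    source = [0.0, 0.0]
    index = [[1.0, 1.0], [5.0, 5.0]]
    for rate in (0.0, 0.5, 1.0):
        print(rate, blend_with_index(source, index, rate, k=1))
```

Read this way, a British input with index use at 0 keeps the British pronunciation features, while a high ratio pulls every frame toward whatever the index was built from.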
for testing sake may you please give it a try if I send you my voice model?
okay
hi
this is awesome, the results are quite impressive
hey guys
hello
Anyone here make music or know of someone who does? I want to make videos but I just can't with copyright on every video
Can someone teach me how to use the ai?
why is it that WITH ALL voice changer when i speak at the end it kinda twitches/echoes (the voice)
Ayo? @tropic carbon level 1 !!! 
I make a bit of music using Suno; you don't really need training per se you just need a good ear for what is good output and what isn't. Also don't just take the gen as is. Be picky. Extend it at the moment you stop liking the gen and keep it up until you find a good gen, then extend some more. You may look at the lyrics for each song in my public list for a good idea of what you can prompt engineer to produce many styles: https://suno.com/@chemicalor9
anyone know an ai tool i can use that will take an mp3 and put an ai pth voice over it
so, inference right (use models, in this case rvc models)?
or you mean making models (training)
U using too many big words for my small brain to understand
Ayo? @green pebble level 1 !!! 
for which program me say settings
hello everybody
what
for help ask in help channels
and be specific
yes, using models, i already have the trained model i want to use, but i want to upload like a clip of me talking or something and get an mp3 back of the ai talking, i know theres live voice changers and i have one, but i want to know if what im describing exists
what's ur pc gpu?
and ye i understand, using models on pre-recorded audios
in case ur pc gpu is good enough, u could do it locally (on ur pc), else, u have to use cloud (remote good pc)
I've become interested in TTS, is there any software I can use to create custom TTS voice models (no RVC over TTS files)
GPT So Vits is the best one in quality, its few-shots
while fishspeech is easier but its 0 shot, so its not as good
any colab links?
@twin fractal are all the gpt so vits colabs updated ?
would be cool a kaggle or lightning.ai port
thank you
i dont really do gpt so vits
im not sure if those colabs are updated, could give it a try tho
bc i dont see that there are any other colabs of gpt so vits 😭
last update seems 6 months ago
i train my own ai models if that answers your question (mangio rvc)
Ayo? @cunning estuary level 1 !!! 
@covert lake
my goal is to do it locally
mangio is outdated
well it works
You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
also yea mangio is outdated, id suggest mainline or applio
use applio instead which isn't outdated and has all the stuff you want and also use titan pretrain when making models
im trying to create a text to speech program that uses real voices
and I really need help
IAHispano applio?
also i have an rtx 3050
do u remember the gpu memory? and if its a laptop gpu
3050 not good enough
yeah is his the 6gb or 8gb or what
8gb
mm, is it a desktop or laptop?
laptop version is shit
desktop could be good enough
desktop
yea technically should be good enough
yeah but speed won't be the best
thats fine
i usually do this stuff overnight
he said his goal is to do it locally
yeah I know
btw don't ask me why but i have a feeling u use linux
what do you suggest
Ayo? @cunning estuary level 2 !!! 
honestly, they both are the same quality, the main difference is the UI (Interface), applio interface may seem easier
alr thanks
you're welcome
no because most of the ai stuff and audio stuff I use is windows only
dunno why u gave me linux vibes then 😭
I use w11 too, even if id give linux a try too
i mean used linux for cloud computing services anyways
I use windows 10 enterprise ltsc and for cloud I still use windows
oh lol
.
hello everybody
someone has a good german ai voice ? egirl?
Hello everybody
hi everyone
@gloomy linden sup
Does anyone know why when I try to make an ai cover on huggingface it gives me the error "'NoneType' object has no attribute 'setdefault'"?
hello everyone
hello , where can i find an ai service without all the "i cant help you with that" ?
there really isn't any unless local or maybe freedom gpt but not the best and paid
okay
send the huggingface space link u are using
guys wheres the website
what website
sup guys
it doesn't use rvc
do you know what they use
anyone can provide me some AI voice model generator?
so, train (make voice models)? whats ur pc gpu
what website
yeah, I have a macbook. I already tried audiomodify but no luck
Ayo? @little pelican level 1 !!! 
dont use audiomodify, all those scam sites just use a paywalled RVC
RVC is the open source program
however, macs arent powerful enough to train rvc models locally (on ur pc)
As you dont got a good PC, it's better to use cloud (remote good pc) for training an RVC Voice Model:
- Google Colabs (4 hours of daily gpu for free, not many hours, but easy to use):
  - Applio (UI)
  - Mainline (UI)
  - RVCDISCONNECTED (no UI)
- Kaggles (a bit harder to use and needs a phone number, but gives 30 hours weekly of better gpus):
  - Mainline (UI)
  - Applio by Vidal (UI)
  - Applio by Shirou (UI, no guide as of right now)
- Lightning.ai (kinda hard, needs login, no issues with web UIs or anything, but only 15 free credits monthly):
Google Colab = Easy but low gpu time
Kaggle = Hard but wayyy more gpu time, much more suggested
thanks sir
Yw
So I'm thinking of commissioning a new model of my voice, I have 2 questions:
How long should my sample audio be?
And should I use post editing to remove stuff like reverb and etc?
So paying someone to make it for you? #1159289738314919936 or #1191429836321849435
The sample audio could be around 10 mins, and yea u should remove that stuff, unless the person ur paying does that for u
Been using this one https://huggingface.co/spaces/r3gm/AICoverGen
yea its broken
use ilaria rvc zero instead
Ilaria RVC: CLICK HERE 🤗
Guide on how to use it: CLICK HERE 📝
Don't forget to thank Ilaria if you find it useful! 💖
good evening everybody
Okay bet thanks bro
yw
isn't applio zero gpu version better
why would it be better
all rvc versions have the same quality whatsoever
I was wondering if someone can show how to get started with text to speech or whatever that gets done here
I was saying split audio and also the mp3 enhance
There are different Text To Speech (TTS) AIs:
GPT So Vits: better than RVC for tts. It's few shot tts (needs just a lil training for models), but it can't use rvc models (and vice versa), and it's limited to English, Chinese & Japanese. If you wanna check gpt so vits instead, read https://docs.aihub.wtf/tts/gpt-sovits/
Freemium 11labs: an easy way to do TTS is https://elevenlabs.io/. You can't use RVC models on it, but it's a mostly premium, easy way to get good quality TTS
FishSpeech: a 0 shot (no explicit training needed) TTS. If you got a good pc you can use it locally, else use their site
With RVC Models:
RVC is natively Speech To Speech, but forks such as ilaria rvc mainline & applio have built in tts. They use Microsoft Edge TTS to generate a tts audio and then convert it with rvc, so I suggest picking a tts voice with the same gender and language as the rvc model you wanna use
If you wanna do tts locally with RVC Voice Models (if you got a good pc):
If you don't got a good pc you can do tts with RVC Voice Models on cloud:
- Ilaria RVC Zero (running on A100 GPU, free, fastest rvc on cloud) and the guide
- Use Applio UI Colab (with google colab T4 gpu, free daily limit)
- if you don't wanna use edge tts, you could try another tts ai from our tts index and use the output as an input in rvc
and there is this other tts not listed that is free to use, but free users can't make voices; paid users like me can create the voice for you to then use on the website
this link isnt working: https://docs.aihub.wtf/tts/gpt-sovits/
yea the docs are temporarily down, its https://docs.ai-hub.wtf/tts/gpt-sovits/ for now
is this on youtube?
gpt sovits is a model you have to train like rvc
is it normal for an Applio training model to eat up disk space until its full? (60gb logs folder)
which can't work bc it slows the inference so much that it goes over the 1 min max inference time (time spent inferencing, not the length of the input audio) set in the zerogpu duration
so u cant upscale anything
zerogpu spaces by default can only take 60s to inference smth, that value could be changed by the creator but will eat ur acc quota faster tho
no only mp3 enhance breaks but split works perfectly fine
split vs no split
i never really heard of chunk inference making the quality better tbh
i dont seem to hear any difference, i used some random model and audio that was already inputted
I think the pitch and stuff works better when doing that
you really sure? i'm not able to see any difference between those files
its even the same size
it mostly depends on batch size and vram constraint, in this case up to batch 8 for 8 GB vram (assuming in default fp16), though 3060 12 GB is better value given his budget range
batch size is not always equal to vram, 8gb gpus can usually run batch size 10
yea im talking about amount of vram usage for particular batch size, though for instance using batch more than 8 on 8 GB vram capacity may bottleneck the performance, but not throw OOM error (similar to using ultra texture and other demanding game settings that may use more than 8 GB on a 4090 and cause stuttering/texture popping on 8 GB/less card)
I wonder if pink guy song covers are ok here (for the rules to be exact)
yes but only allowed in #1159290752195633273 and must be uploaded to YouTube or whatever.
cuz im doing one song called "ramen king" and it has some questionable lyrics
upload to youtube and add content warning
its still a wip
who knows how to create video (easy) to show: power and it's going up with percentage from 0 to 100%, strength, stamina etc with some background?
what