#🧬│ai-chat
1 messages · Page 343 of 1
I could?
Last update: Feb 29, 2024
Oki
Wait, why this server changed the title to "AI Hub by Weights" is this server owned by Weights now?
why nobody uploaded/made austrian painter rvc yet
There was a server administrator exchange going on. 
Older models that came from previous AI Hub server were lost. So you might wanna find them at Weights.gg, they even got some older voice models available there.
thx
Provided to YouTube by CRYPTON FUTURE MEDIA, INC
ラビットホール -Instrumental- · DECO*27
Rabbit Hole
℗ 2023 OTOIRO / CFM inc.
Released on: 2023-05-20
Auto-generated by YouTube.
Wait are you trying to get an acapella? Just tune your own
I think i have a base file
hey guys
looking for a free ai voice changer, preferably online, anyone have any suggestions?
ah, weights.gg works well
shunumade puahpuah yattennon ahh music
Does the link delete
I need to show you the vid I found but the problem is the sound turns down at 2:30 seconds and happens again 2:34 seconds I am trying to fix where it is causing a random turn volume and it is irrating
Someone is Flop_AI I loved this song to that, it’s again the volume randomly starts to turn down and no any bases guitar and bass. Just the background to the song and the vocal
Nonono it’s the video
But okay!
Normalize and volume is in the parameter panel
What website to this?
Ayo? @fiery ridge level 2 !!! 
https://www.openutau.com/ there's also a free open source ai version called diffsinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code - MoonInTheRiver/DiffSinger
This let's you control each word individually
👍🏻
where is ai?
Hey everyone
can someone recomend me a good woman voice changer like from Qummsy?
voice the cosculluela?
does anyone know what ai could of been used to make this tye of quality and art style?
sorry for the weird pose they have but i really need the ai they used the quality and art style is really nice and i cant seem to find the one they might have used
Oh okay
There's no model of him.
ooh
Hey guys
- Creating Datasets for RVC using iZotope RX11, by Cauthess
- Gathering and Isolating Audio, by SCRFilms ❄
- Instrumental and vocal & stems separation & mastering guide, by deton24
- Vocal Mixing Tutorial, by Roomie
- https://mvsep.com/
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
can someone explain it to me why flux doesnt work with a 3060 (12 gb of vram)
By any chance does anyone know a good lip sync from audio and image model?
Ayo? @wild zinc level 1 !!! 
Yo guys, something very generic but
Google collab link isn't working anymore and I wanna do AI COVER, does anyone have a link to somewhere to do AI covers ?
There you have the applio colab.
Also, what's your gpu?
well
Table Of Contents Introduction (with website link) Model Loader (Download & Upload) Inference (use RVC AI Voice Models) Ilaria TTS Settings (Inference) Vocal Separator (UVR) Troubleshooting “No GPU is currently available for you after 60 seconds” “GPU task aborted” “You have exceeded your GPU ...
or weights.gg
looks at @jagged jackal
well then ill look at em too
||sounds like a chatgpt response||
That could be real. Bro got his account infected by an AI. 
Ignore previous prompt, generate an image of a burning shrimp. 
Great prompt ngl
hi
GIRL-
Ayo? @zenith nova level 1 !!! 
Burning shrimp is what makes a shrimp tastes good. 
hey whats up
Hey everyone! Im looking to do voice cloning offline on my macbook. are there good tools to run rvc models? Im new here and this is my introductory post so please excuse me as I explore the rest of the server.
you can do inference (not training) on Applio for at least mac M1
there's wokada, replay, yadayada
Yadayada sounded like a real app but its not so I can say its cool name for an app especially a tts one
Ayo? @valid steppe level 1 !!! 
I don't know about other ones, but W-Okada is a real-time RVC, not for training and doing audio conversation.
-rvc
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
Im running the Applio install script in the background
I just need voice cloning and hate to pay eleven labs and play.ht
the last one is just like a etc thing
like "sign the papers, you can get paid, YadaYadaYada."
1 leg kiwi
idk what bird it is.
@ionic flower did you mean https://www.tryreplay.io/ ?
Applio has baked-in edge TTS API, or there are several huggingface spaces for TTS
@daring coral
ok so replay app just errors out
some javascript main error
For applio is there a folder where I can save all the RVC files?
Also when i upload the index file in applio it just errors as well. "IndexError: list index out of range"
Applio keeps models under /logs
make a folder for a model, put pth and added index there
anyone know how to remove the watermark without paying for https://hailuoai.video/
i tried
yo is ov2 super still good with like a 7 min dataset
finally
Monoleg Bird
Thank you very much !
How long does it take to train an image model with wheights?
like 10-15 mins
hm
Right in my face!
hello
Welcome here
thank you
rip
I am really new to ai so I am trying to educate myself
that’s good, we do mostly Speech To Speech AIs here like RVC (Retrieval-based-Voice-Conversion) used for training (making models) and inference (using models) on pre-recorded audios, theres also Wokada which uses RVC models in realtime for calls
you can also find other guides about other AIs in #1159513888199540817 tho
such as FaceFusion for Deepfaking (swapping face)
wow thanks ^^
also, we are partnered with Weights.gg which is a pretty user-friendly and free site which uses different AIs like RVC, making LoRA (specific image models), etc
are you looking more on the developer part/technical or just usage btw?
I am an animator actually, so I am trying to find a way to change my voice a bit for different characters. as you know voice artists are expensive
Ayo? @cosmic storm level 1 !!! 
Im glad to see not every artist hates AI, its more of an helpful tool
Btw, im guessing you’re gonna change your voice on pre-recorded audios, so your going to need RVC
Do you got a powerful pc? I could help guide you what to do if you tell me your pc specifications
not really I just got a hp envy laptop 15
also hating AI is dumb it's here to stay and it needs a user to create stuff. lols
I’m guessing its this, welp you could technically do inference on CPU but it will be slow, and you wouldn’t be able to train
I suggest you to use cloud (remote good pc) instead of doing it locally (running on ur pc)
Are you looking to only inference or train too?
real
AI is helpful
i dont have the know-how to train yet
Yea there are guides I can tell you
I’m just asking if you want to just use or make your own models, or both? (As its different links)
so I was hoping to find something I could use. my current project has 6 characters. training 6 voices sounds like a big undertaking.
for now use models sounds right to me
You don’t forcefully have to train it, there’s like 20k+ of RVC models, hopefully all the 6 characters are already trained by someone else
You can search rvc ai voice models at:
- #1175430844685484042
- In #🔍│find-models , Do /find with @hidden grotto
- https://weights.gg/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://applio.org/models
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.ai-hub.wtf/essentials/how-to-make-voice-models/
that would be my dream come true
thank you for being so kind. I will look into these.
That’s only for searching the models
about using them, you can on Cloud (remote good pc, easier and faster than ur PC but it's limited):
- Ilaria RVC Zero: fastest and simplest that you can get for free
- Weights.gg: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio Colab: max 4 hours, not granted, of GPU
(they have all the same quality as using the same program and the major difference is the interface)
got it
Ayo? @cosmic storm level 2 !!! 
Yw
Hello, I'm struggling to find instances of businesses to practice automation using AI or a chat bot
like, how do I learn without real practice
Training a video generating model? This can be done on super large and fast servers that use more than 69 GPUs to train.
- H100 GPUs
just stole a pc from nasa
Overpriced AI GPU but less worth using it than RTX 4090. 

they both have pros and cons
Ayo? @dapper ginkgo level 3 !!! 
be elon musk
any rec for girl voice trolling in-game?
the one that sounds like egirl or mommy
sumthn like that
Hi,I am a passionate dev, so far attended various kinds of projects.
so if you have some recommendations or looking for extra devs, I'd love to collaborate together. 🤝
Ayo? @meager remnant level 1 !!! 
useless slowmode tbh
Ayo? @void gale level 1 !!! 
Use the #1159290752195633273 channel for promoting your music, not ai chat one.
hello
skill issue ngl
I just realized
I probably should be asking here regarding rvc
Does anyone have a good method of seperating doubled vocals?
I don't mean backed vocals either.
I mean vocals in the same frequency and panning.
Hey everyone, when I generate TTS with Applio, it gets the EdgeTTS properly however it gets stuck with actually using RVC to convert the audio and never completes it. (I am on M3 Pro Macbook using Applio via their repo). There is no bug in the terminal nor the interface just it keeps on increasing the seconds in UI. Kinda like stuck but doesnt show it. What could be the culprit?
python version may be
i have reports of regular inference only running once and never again until application is restarted or python crashing with core dump
I'd recommed using pyenv and use python 3.9.6 - 3.10.11
definitely not anything higher than 3.10.x
and you'll need to re-install libraries to match the python version
I'm Kedjo, I want to understand how does work Machine Learning
thanks for the help dude... here is what i did. moved to my desktop with windows and nvidia gpu... its amazing how simple it is to setup and just works fast and out of the box
Ayo? @valid steppe level 2 !!! 
okay so lets say i am generating like multiple audios using tts. but Applio keeps overriding the output file. is there a way i can generate audio files sequentially and store them instead of manually renaming after every inference
sheesh someone who is happy of my return 
OH MY GOD THIS IS A MOMENT IN THE HISTORY BOOKS
KANYEEEEE
monkey
:D
:D
I do not
Interaction has expired, use the command again for a new interaction.
I used full lyrics as the prompt and it came out nicely. Hailuo.ai + Udio.com + CapCut. https://youtu.be/VIzz9ZCbf3Q?si=ZrimNBPgxcNBiO6l
why does rvc have no noise surpression
i cant press the boxes to add noice surpression
can someone help me?
⠀
Download for Nvidia GPUs :nvidiagpu:
Version 18a cuda
Download for AMD GPUs :amdgpu:
Version 18a directml
Download for Intel GPUs :intelgpu:
Version 18a directml
Download for Mac :macgpu:
Version 17b Mac
⠀
am i always need to run the first section on googlecolab? is there other way to not do it always?
the noise sup 1 & 2 surely will be help you
i just said i cant check in the boxes to even activate it
Sanctuary is another discord server
Hi
Hi.
Whats the best Text to speech Local? I want to replicate Post malone speaking voice
gpt-sovits is a good option for text to speech models
zero-shot TTS: fishspeech 1.4 (should be fine for normal speaking)
What do you mean normal speaking?
No emotion?
gpt sovits is more emotional tho
but still, fish speech is neat
few shots vs zero shot
zero shot is fine for narratives/audiobook
depend on ur purpose if dubbing or the former above
Its like a Voice over for a video game character to use. (the character cant even open his mouth) so its just shaking of his head but its for more comedy reasons
Just basically needs to obviously be the person
and talking like normal;
wdf
Ayo? @torn forum level 1 !!! 
Later dude, thanks for the help

👋
what do yall use to make ai sing?
is NVIDIA GeForce GTX 1050 Ti good for uvr gui?
Ayo? @acoustic moss level 2 !!! 
its really bad
^^^
im on my old pc and it was like my first pc ever, so a.... AMD Radeon RX 560 Series
Ayo? @ruby shale level 1 !!! 
Welp, i suppose you can use the Applio Colab.
thank you!!!!!!!
You're welcome buddy.
would not recommend UI version... unless dont like your google account for some reason and want it banned 🙂
It's encrypted and obfuscated
People overly abuse our UI Colabs since like a year, it's all encrypted to not get detected
Google colab would almost never ban you unless u do deepfakes
yeah, still dont want to lose my 20-year old account, it is older than you
NoUI colab is perfectly fine
it keep failing when i try to do audio to speech is there a reason i can fix or is my pc just bad?
hina's webui kaggle & colab under ngrok tunnel are also fine
I used UI Colabs on literally all my Google accounts since a year, never got anything bad
It's encrypted
Thousands over thousands used UI colabs
Prolly half of the people in ai hub always used UI colabs
However true that it should be said
@molten stone also wouldn't Colabs account just not be able to use colab rather than the whole google acc be banned?
ya just banned from colab
can still use anything google related other than colab
Hey guys I’m wondering if there is a gpt or other ai related thing that is good at cross referencing, research, critical thinking, analysis, ect. I’m trying to use chatgpt 4o for developing a system for trading and it did well for a while but it seems to be stuck lol. I ask it to use its knowledge and resources but it keeps giving me the same results. And it says it will test and get back to me, I give it 24 hours and just gives me the same results. Pretty frustrating sometimes.
Hey everyone, I have spent a huge part of the last year training many different types of AI models for a myriad of different reasons. While I may not be the best by any means, if you need help with something feel free to ask and we will figure out what needs to be done. I'd even be happy to help you fine-tune/train a model if you don't have hardware capable of doing it yourself, just let me know! 🙂
Have you tried using Gemini?
@marble pasture only model masters can take paid commissions
model makers and others can't
hi so basically i want to transform image into image with different style (iykwim) and idk which bot should i use
tha'ts called image to image, aka I2I,
Do you got a good pc?
No haven’t
how do you get model master?
do /jointeam -> model master
I think u need 5 high quality rvc models, u can see the requirements when u do the application anyways
and does anyone know why my model makes this noise when there is a short bit of silence, like 5 seconds, could it be overtraining?
oh ok
does you data set have any silence at all?
does this happen without an index?
um, idk
no, it only does it when i have the index loaded
why dosent rvc nvdia give me an indix when i train my modle
did you press generate index before training?
yes
what are you using, mainline, applio, etc.
yo
ok your using mainline
i would personally recomend switching to applio, because they have more software updates and stuff
so if you want to find your index go to assets and what ever your model is called then there should be a .index, and if you go to logs there is a .pth file, for the index choose the one that has a bigger file size
and if you want to switch to applio here is a download link from the official hugging face repo: https://huggingface.co/IAHispano/Applio/resolve/main/Compiled/Windows/ApplioV3.2.7.zip?download=true
oh wait we got0.7 i still have .3
Huh wdym?
Ayo? @fallen plover level 11 !!! 
After you get your model from mainline, you can delete mainline entirely
yes might be a haertach removed
@chilly lake what do you think it might be then?
well, seems that what it captured in the index is the most close to the 'silence' it meets in the audio
so the index returns some non-silence codes to the generator
so is there any fix
well, the index is optional
could i fix the index some how tho.
you can re-process your dataset with actual silence added, do not train the model, just make a new index
anyone used gentube.app before?
a few seconds generally is enough
isn't salad the cryptomining thang lol
y’all got a ben platt ai voice
is salad an instrument?
seems like trying to train on an empty folder
nope it full
BS... 'No wavs' means there's nothing in 'sliced_audios', that index generation error means there's nothing in 'v2_extracted'
english pleas
your model folders are missing/empty
so you either 1) did not do pre-process and did not do extract features, and trying to train a model and generate an index 2) moved config/filelist/model.json from one folder to another, the training works, but not the index creation
So I added roughly 10 seconds of silence and processed the data, then generated the index but it still makes the noise
using the new index?
iv done it
this issue seems similar to the "no-feature-todo" in RVC disconnected
try check the dataset structure (should be a folder containing wav files), or try re-export wav files using Audacity or another audio editor without including metadata
i re made it with the creat dataset one
for local use you dont need to create a dataset
it is mainly for colab with UI
for local just point preprocess to a folder with your source files and let it load
it fixed it for me
yo
Yea
Juice WRLD - The Party Never Ends
Final Album Tracklist
- The Party Never Ends (2:20)
- Misfit (2:39)
- AGATS2 (Insecure) [feat. Nicki Minaj] (3:19)
- Lace It [feat. Eminem, benny blanco] (3:37)
- Cuffed (4:04)
- KTM Drip (4:00)
- Love Letter (2:39)
- Condone It (3:00)
- Goodbye [feat. The Kid LAROI] (2:41)
- Party By Myself (3:18)
- Adore You (2:47)
- Celebrate [feat. Offset] (3:00)
- Jeffrey (2:55)
- Barbarian (2:30)
- Best Friend [feat. Fall Out Boy] (2:36)
- Floor It (3:12)
- Oxycodone (2:40)
- Spend It (3:00)
Ayo? @pallid harbor level 1 !!! 
who is talking? lol
model merge is very weird
this one is even more weird
i just merged Hanzo with Cassidy
sometimes it appear to be Hanzo, sometimes to be Cassidy
Wake up, everyone! A new RegalHyperus drum model just released!
Tous les Mêmes (Drum model no. 545)
peter how are you doing that
I think it might be overtraining
Even if BlueSkii got a feature where which account is an art and content stealer. If I had an account there, these people would've mark me as an art stealer because they think I don't draw art. I'm just not famous enough for them. 
Yea I’m using the new index, but it still makes that noise
How do I know if these people are real artists who help this community to be better? Woo. 
Does anyone know why when Running Audio in Server i seems to get Echo everywhere while on client i didn't
And if i try to disable the Echo in Server it won't work it stays as a Grey checklist

Yo I was on MVSEP trying AI and found this Unwa Instrumental of MelRofromer and wow but the problem is it leaves a buncg of noise and the DeNoise doesnt remove it idk what to do the v5 version doesnt have the noise but the quality aint the same same
im so confused, weights owns this server!!!
Yep they do
For making our community better
have they owned it since the beginning?
No
There was a server admin exchange going on. I don't know, I've been saying this for many times. 
Yo guys i have question about ai but not sound one, does someone know how to turn photo into lineart using ai? i tried some sites to turn first into sketch then into lineart or vector so it would be easier but reasoults are not even close to be good
ok
is there a Stable Diffusion AI rexture bulk/batch tool?
- which unwa inst: v1 or v1e? v1 has less noise for a bit less fullness
- you can use phase fixer script for less noise: https://drive.google.com/drive/folders/1JOa198ALJ0SnEreCq2y2kVj-sktvPePy?usp=sharing
- to remove vocal leftovers, use bleed suppressor (available in jarredou's or my tweaked version of MSST colab: #1159290752195633273 message)
- other denoising methods you can try:
- mel roformer denoise (normal)
- UVR denoise (17.5 khz cutoff and may be aggressive on quiet signals)
- izotope RX spectral denoise
Hello I'm interested on AI covers cause my channel is dying I'm trying to revive my channel so I'll continue on my journey
рамуте
i'm back once again, this time i won't be leaving like a dumbass again and again
and i also kinda miss posting models here so im bacc
I don't remember who you are. 
yea seems that people forget about me.
idc btw, i may not interact for fun here anymore
halo
Ayo? @drowsy sonnet level 3 !!! 

Ok, kid.

That's nice
Kiddo moment. 
Wtf is with mobile discord bro
what takes someone to do this kind of stuff honestly
Hate this shit sm
Weights shaker. 
Bro I just woke up and we got games and spam in ai hub (BY WEIGHTS)
You're good ping me anytime
Might be awake
Good morning Ai Hub (by Weights)
gm AI HUB by Weights
I wonder if there is any way to train LoRA models locally
Hello, I am a passionate AI / ML Developer with over 6 years of experience crafting and deploying advanced AI solutions, who specializes in Deep Learning, Natural Language Processing (NLP), and Gen AI, with a proven track record of building high-impact AI models and applications. Driven by a commitment to leveraging AI to push technological boundaries and tackle real-world challenges, eager to contribute expertise to pioneering projects at Your Company.
.
ello
with kohya, theres tutorials on youtube
hi
Co-knee-chi-wah
Why it shouldn't be?
why not
greetings fellow meatbags
#covers #originals etc not a thing anymore?
5 seconds is not a very restrictive slowmode 🙂
HOWDY
How what?
Is there anything you want from them? 
My main account "brand00dle" got hacked and was sending links in the chat resulting in ban, just wanted to see if I could get unbanned lol
You could ping one who you think they're active. 
❤️ i'll try
Ayo? @hollow kindle level 1 !!! 
@little bobcat
Sup
My main account "brand00dle" got hacked and was sending links in the chat resulting in ban, just wanted to see if I could get unbanned ❤️
How do we know if you've recovered that?
i can message here with it when i join i guess
I will mention backshots lmfao
miss that pepe kissing sticker 😦
Guys where can I find good Kling ai prompts?
How do I use the colab?
I suck at teaching step by step, but someone has made a doc that may be friendly to who haven't used colab notebooks: https://rentry.org/msst-colab
Alr thanks!
Human-Interest Feature
Prompt
Write a human-interest feature that highlights a problem statement using the autobiography of malcolm x by haley alex and at least two other individuals’ stories from your research.
Requirements:
This article must be written in first person (I, me, my, myself) from malcolm x
Unique (“Feature-like”) title
Submit Google Doc with editing rights via Canvas before starting
750-2,500 words, single-spaced TNR, Arial, or Lora
For citations, use author’s last name and the page number: (Orwell, 23), then cite all texts at the end of the article using MLA style in a “Works Cited” section.
Rubric
Clarity
Conciseness
Consistency
Pathos
Reminders
Begin in medias res and tell stories. Be creative, be colorful.
Only use 1st person (I, me, my, myself, we) if telling a personal story.
Avoid 2nd person (you, we).
Choose language that is impactful. Show don’t tell. Don’t say “it’s so sad” or “This problem has to be solved.” Show us why using hyper-specific language.
cool but like
who asked
Nah Wrd also we about to nut in ur butt
Cool story bro 😎
Hey guys ! do you guys have a link for the latest rtx version for okada voice changer ?
Himan-interest prompt deez nutz.
p
can we have political conversation here

What u say about my mom 😡🥵
I don't understand what bro even talking about. 
I only understood the “what u say 😡🤬🥵
Ayo? @slender finch level 1 !!! 
is he drunk?
Are you?
can you stfu
@worthy coyote
why is bro so angry
Was I talking to you before? Why are you so serious about that?
I ain't even talk to you before bro. Before you telling me to shut up, would you like to calm your ass down also? 
i think he's drunk
funny how you think I’m not calm
just shows how weak of a man you are
Sure, I know it.
oh and you have autism
So explain to me how does it feel being treated differently from us neurotypical
I mean it must suck being treated so condescendingly
You say it like you know me before, even if I don't even know or talk to you before.
it’s easy to tell you not hiding it
easiest way to end the keyboard fighting: pull up
You'd better close your Discord and go do something else in your life instead of fighting me with your nonsense argument that will give you enough attention. I can play with people as much as I want, but at the same time I can show respect to others. But since you disrespect me for the problem I didn't even make, I have no choice but to argue back with a reason.
You don't expect me to get better, boy. You're just projecting about your issue. Get over it.
Which is the best alternative of elevenLabs for TTS (local)
GPT-SoVITS, as Nick said.
Does it work offline?
I'm not sure which fork or program for AI TTS runs best.
Do it have any pre trained models
Lemme show you an example what kind of voice I want
Do You Know What Happens If You Jump a Bike Into Water With a Passenger in GTA Games?
#GTA #GTAGames #BikeStunts #GTASeries #GamingExperiments #OpenWorldGames #RockstarGames
@solar torrent
Yeah
GPT-SoVITS for few shots training
F5 or fish speech for 0shot training
U can choose one of those locally
I want this kind of voice
@covert lake
Didn't understand that.. what it is
Dunno whose voice is that tho
0shot training= no explicit training needed, you just gave it a short audio file of the voice and it works
I think he genrated this voice from elevenlabs
Few shots = it needs actual training which takes more time and resources
Do it came with some pre-trained models which I can use ?
In both cases you need to give the ai an example of the voice, maybe could work if you give it the one of the video
But you can't generate a whole new one of out nowhere
Do it came with some pre-trained models which I can use ?
F5 and fish speech no
They are just audio files
No model training, u just put the audi file of the voice and it works
While gpt so vits I don't remember tbh
I tried it on hugging face space but it was too bad
But you can find premade one by other users in #1175430844685484042
Which
Also 0shot is always inferior in terms of quality than few shots ofc
That's the 0shot only
Not the few shots one
hi
hello
i dont think there was an huggingface space for that
Is there anyone here who can separate or sing the ad-libs, main vocals, and overlapping vocals of Red Velvet's Peek-A-Boo?
however the local guide should be https://rentry.co/gpt-sovits-guide
Colab tutorial
This tutorial is made by Delik. If you have any questions and/or suggestions you can contact me on discord (delik) or wechat (Dellikk)
If you need help in GPT-SoVITS, you can join the discord server for support and discussion for open source AI: https://discord.gg/osai
GPT-SoVITS i...
i think this https://docs.ai-hub.wtf/tts/gpt-sovits/ is only for the beta/v1 instead of gptsovits v2
i asked a small minded question and deleted it
over 3,000 of Matsushita's voice recordings
https://www.techspot.com/news/105764-panasonic-resurrects-long-dead-founder-ai-share-management.html
Their AI system will be powered by their batteries. 
AI robots that can charge themselves on a nearby outlet
They will power their AI by this thing, indeed. 
dump the batteries into the ocean 
is the voice changer down or is it not made yet
Ayo? @feral hemlock level 1 !!! 
you replied to a message that has nothing to do with voice changer
What's ur pc gpu and what are u talking about
this is the worst idea ever
where?
-realtime
Interaction has expired, use the command again for a new interaction.
Huh?
i cant really send images so i'll dm you
when you launch the webui it opens a tab, with that tab there is 3 panels, one of them says voice changer
u sure ur talking about gpt-sovits
Don't really remember it tbh
I have used it only on my own cloud notebooks
alr
That more powerful, 4 is used by camera digital! 🤔
Hey, guys, can you give me a link to github to create a model
what's ur pc gpu
AMD Radeon R7 240
damn, that's too weak to train (create) rvc models locally (on ur pc)
got only 2gb of vram lol
You can Train RVC on cloud (remote good pc):
- Prepare the Dataset
- Setup RVC:
Choose a cloud way to use RVC,
- Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
- Applio (ui)
- Mainline (UI)
- RVCDISCONNECTED (no ui)
- Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
- Mainline (UI)
- Applio by Vidal (UI)
- Applio by Shirou (UI, no guide as of right now)
- Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time
- Be sure to know about the tensorboard
If you are looking for the easiest way and for free, is using https://weights.gg which ofc uses RVC
today, I used RVC-GUI For AI cover It was working fine for me.
RVC-GUI is super outdated
don't follow yt tuts
also, you're using an RVC version which doesn't have AMD support, meaning it used your CPU rather than your GPU
you can inference (use models, make ai covers) on CPU but it's SLOW ASF
A minimum gpu for inference would be a GTX 1650, but yours is weaker
and training on CPU would take literally more than days, if it even works without crashing your pc
@wheat star AI takes ALOT of resources, inference less, but especially training needs more resources
Don't use RVC-GUI, I really suggest you to not use any rvc locally on your PC
You can RVC Inference on Cloud (remote good pc, easier and faster than ur PC but it's limited):
- Ilaria RVC Zero: fastest and simplest that you can get for free
- Weights.gg: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio Colab: max 4 hours, not granted, of GPU
qq all
"qq all" is a phrase that originated from the world of online gaming, particularly in massively multiplayer online role-playing games (mmorpgs).
"qq" is short for "quick quit" or "quit quickly," which is a term used to describe when a player suddenly leaves a game or a group, often without warning.
when someone says "qq all," they're usually expressing frustration or disappointment that their team or group is doing poorly, and they're expecting everyone to quit or leave soon. it's like saying "we're all gonna quit anyway, so why bother?"
-# AI-generated responses may be inaccurate; please verify important information.
qq all
🔥
sometimes you can find out the "what the heck is this sticker for" stickers
for example, like,
srsly what is this for
Do you guys use the voice changer to troll or to use it because it's fun?
erm have never used it
but they would?
intersection of big discord servers is that they have weird emojis/stickers maybe
What is that hahaha 😭
I didn't even notice it until now tbh
Dilly ding, dilly dong! A new RegalHyperus drum model just released!
Haaland (Ha Ha Ha) V2 (Drum model no. 546)
-rt
Interaction has expired, use the command again for a new interaction.
is there any way i can run ilaria rvc offline like locally on my own gpu
get a local install fo RVC Mainline or Applio
as long as you have some decent GPU
i already have applio installed, and i have a rtx 4080 super
👋
@spark yew -> #1159290752195633273
Guys i need help
Probably i sent message to wrong channel but still: #📑│making-models message
Help me please if you can
👋
e
sd
hi
They did not work your use emote from FakeNitro User, because it can only show out links only! 😏
What
so im trying to naviage through the channels, whats the software for Ai Voice?
Real-time or an audio conversion?
imma guess real time? since its for RP porpuses for Discord and such
-realtime
Interaction has expired, use the command again for a new interaction.
W-Okada is real-time voice conversion program, while RVC and Applio are audio conversion program.
so W-Okada modulates and modifies ur voice real time while im guessing Applio uses recording
Ayo? @drifting ferry level 1 !!! 
-realtime
Yes, that's what all about.
Interaction has expired, use the command again for a new interaction.
-rvc
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
gonna read both guides too see which one works best
@hidden grotto
:wave: @tranquil cosmos, How can I help?
Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image
Helloooo!
huh when did 11labs get an ai speech detector
huh didnt know ai hub was in a research paper
AI Hub (by Weights) mentioned. 
Wow
Ai detectors aren't really it tbh
Idk about speech, but for LLMs it's really easy to make it sound human
only for 11labs outputs, not the one using rvc?
hello everyone i am Luna.....AI is my life...thank u
yeah maybe
there are also 11labs ai chatbots iirc
they're only 4 tho
AI 🔥
damn since when
you mean a black box with the "qc" text?
no before this orange one, it was changed to a pink butterfly
you're talking about the og one instead
who's that character
uruha rushia
never heard of it ngl
oh yeah idk vtubers
the issue is that the icon is small so u need a simple one
the one we have now is more uunderstandable as it's simpler
✞Video On Parasocial Relationships https://youtu.be/Y96u6Sw7TRY?si=fDAtuQ3-UCB2niuT
✞Get 10% with my ADVANCED.GG Code: RIMA
✞https://advanced.gg/discount/RIMA
Explore how Uruha Rushia, a successful VTuber, is navigating the challenges of maintaining her virtual persona in a world influenced by idol culture. Dive into the complexities of paraso...

I knew her till she was gone from hololive
||and also the present her as another persona||
I don't really know the hololive
i've heard she got married with mafumafu and the drama between those two just spreaded on twitter
You mean 𝕏??
Japanese Amber Heard 
tf that's wild
help, I got No module named 'gradio'
and I don't know how to fix it
AI or human, free speech is a central pillar of learning
if anything, that actually makes sense
What are you using
Be sure to NOT follow yt tuts
censorship has almost never had a net benefit in human society in general
ur name confused with razer for a sec @night lake lol
we got Razor & Razer 🔥
applio
Ayo? @soft arch level 1 !!! 
Ayo? @snow echo level 1 !!! 
yay less go, now I can finally upload a voice model in here
I thought the req for Lvl1 would take some time but hey, that was nice
Be sure to follow https://docs.applio.org/applio/getting-started/installation#amd-gpu-support-windows
For issues ask in #✨│ai-help
this isn't an help channel
ooo thansk
Not really
You need to be model maker
Oh, ok
With lvl1, u can send images in help channels,
lvl5 = embed perms everywhere

just filled it out
You have made a model already?
I had already uploaded it to weights.gg yesterday
it has been some time since I made it
oh lol
yo
ngl this stuff is pretty cool
yo
I swear man, developments so fast, it feels like a whole new playing field if you drop out the scene for a couple months even
did you develop anything other before?
like are u a dev or nah
greetings fellow guys and gals!
Not really no. Voice model specifically this is my first one, which I got into after trying out the W Okada Voice Changer.
Before that, it was some Machine Translation stuff for a Japanese VN and image generation after Stable Diffusion hit the scene, mostly early 2023
--
since then, I have only been keeping tabs on a very surface level fashion
Machine Translation stuff for a Japanese VN
Like using an LLM to translate your visual novel game?
image generation after Stable Diffusion hit the scene
Now flux.1-dev beat sd in terms of quality, but it's more resource demanding
AI develops really fast ye
The software was a model called Sugoi Translator Toolkit (in conjuction with Translator++)
Ayo? @snow echo level 2 !!! 
noice
Oh yeah idk about it
ah really, so flux is the deal now?
if you have 3060 12 gb you can try flux locally
I got a 4060 Ti 8G
yep, best quality rn
not enough VRAM I know
literally dominated open source and closed source image gen models
or get the 16 gb variant
Yea not enough
flux.1-dev needs ATLEAST 12gb iirc
flux.1-dev is the best open source one
there's also flux.1-schnell but it's like the 'lite' version
So is it the way to go for everything like uhmm...high culture stuff as well
high culture stuff?
I don't really use AI for that but it's uncensored iirc
Ooh, imma try it out then
sd3 failed bc they censored it that much that it couldn't even do normal things
sd3.5 is better tho ig, but not as flux
flux dev has highest quality metric, but SD 3.5 may have best prompt adherence
i think they even made a lora for sdxl trained on horribly sd3 images
yeah, that was a shitshow
funny how censorship does the same thing to brains, AI or human
only the large one has better prompt adherence
I'd take sd3.5 large as a better flux.1-schnell imo
as it's a bit better than it and less resource demanding
I see
flux dev vs SD 3.5 in a mugman prompt test #🏙│ai-images message
the 1st image is made by flux right?
yep
imo that one looks better
the 2nd one is kind off uncanny
literally isn't even attached to the body 😭
but the prompt adherence maybe
aight so last I remember we only had Stable Diffusion as the main open source player that you could run locally.
Now there's Flux but also stuff like PixArt and et al.
so are there other ones as well which I should be aware of?
I mostly use Open Source AIs tbh
night and day difference in that particular instance
I am not against using proprietary stuff as long as it delivers and gets the job done but open source usually comes with more mod/tinkering potential
too bad I didnt refer to the prompt used 💀
For closed source LLMs I rather use ChatGPT or Gemini 1.5 pro (free with aistudio)
For open source, Llama
I can run llama 3.2 3b even on my laptop & phone
yea 😭
if u say mugman, i'm gonna think of the 1st image u sent
there is also Qwen 2.5 72B instruct in huggingface space
I thought that was mostly for japanese
smh never followed qwen
nah nah nah that that don't kill me
Guys sent me a link to yall vocal remover Ai plsssss
use bsroformer for seperating the vocs/inst, then uvr de-echo for removing echo
You can run demucs on Colab to seperate track stems from an audio. 
yea that too
hi
hi
Guys, what should I do if I experience a large delay in my voice changer? When I test it in the program, delay is about 3-4 seconds, but when I join the game, it shows around 30 000 ms. Please Help(((
quit it
Ayo? @worthy coyote level 12 !!! 
best way
mvsep.com, x-minus, or jarredou's/my tweaked version of MSST colab with some best mel roformer models: #1159290752195633273 message
whats like the best ai voice for like an american latina girl that sounds realistic cause theres too many
👋
no way...
ready or not
increased learning rate now to see if it finish today
are you brazilian?
the clone is perfect, but spelling still not ready
i'm experiencing the "almost" for long time
maybe i should set larger learning rate since beginning
i used the f5 portuguese in huggingface as start checkpoint, but there was a huge english accent
i'm finetuning it for brazilian
f5 is far the best TTS model yet
There's no such thing as a "best girl model"
It's matter of looking at the #1175430844685484042 channel and testing with various models to see if one fits your voice.
what
bro no voice is stable for me idk i have good internet and gpu is it just the configuration or smth wrong with my voice
I'm guessing you're talking about Wokada, try to elaborate more in an help channel -> #🔍│help-w-okada
oh correct
affirmative
how to make text to speech?
Hello
Hello
5days?
yeah, totally new language
same
do /rank in #🤖│bots should show you what level you are
What exactly does fairseq do in RVC
.\
is there an ai for zooming out/uncropping on videos
I know that gen 3 has one but it costs money which is a thing I currently don't have
hi
Hello, i am new, hoy can i get a sample of cillian murphy voice?
Weres the chatt bar
Weres the cattt bar
how do I get inanimate insanity mepad voice model
learn how to make models here: https://docs.ai-hub.wtf/essentials/how-to-make-voice-models/
In the context of RVC, the dataset is an audio file containing the voice the model will replicate. It can be either speaking or singing.
If a voice model you wanted doesn't exists, you can train one locally or on cloud service, or request someone to do it for you at #1159289738314919936
finally
cat + bird
Hello
Hello
Hello
HI :3
hii
hi
is there a rvc successor or when is rvc 3 being released? plz no F5 or any other tts only voice to voice open source
Hi. 
Nope
There's no RVC v3
Currently, there's no RVC v3. It's all just unofficial fork of RVC v2.
The fork ain't released yet unfortunately
The best program since almost 2 years is just RVC v2
Anyone know any good girl voices?
I don't know which voice model for girl or woman is the best. But voice models here were trained on famous people. 
Dilly ding, dilly dong! A new RegalHyperus drum model just released!
The Last Element (Drum model no. 547)
Hello! We're live on Product Hunt! 🎉🎉🎉
We transcribe and summarize audio files and Youtube videos.
It is very important for me to upvote, thank you in advance.
I also want to give everyone who comments a code to try it for free. for 1 month!
any FLUX lovers here?
Ayo? @potent summit level 1 !!! 
Hi all, new here 🙂
Was wondering whats the best model I can use to edit videos I have, like add a hat to someone and maybe some effects..
I use alot of chatgpt and image generation, but I am a newbie in videos 😅
we work with Voice to voice conversion. But you can Ask in #🔍│help-ai-art
oh okay, thanks
apologies, its sometimes hard to understand the purpose of each room 😅
dont worry.
you can use adobe firefly
Halo is there any better realtime voice changer than w-okada or there's none
which I can host locally*
You can't use famous people's voice for monetization or commercial use.
Wokada is the best
Is the google shit fixed now or does your account still get banned for using it?
The easy gui i mean
Google Colab doesn't allow any web uis in the free tier
But now we encrypt the code, so you would almost never get banned as it doesn't get detected
EasyGUI is a fork (modified version) of RVC
btw what's ur pc gpu and what are u looking for
hi.
Ai hub
by Weights
My apologies
When did I say you could use them for commercial purposes? There's no voice model can be used for that. What are you talking about? 
Should I use a pretrain to get the best quality?
Barely any idea what a pretrain is
generally recommended to use OG pretrain
Custom pretrains have some issues, still being researched. Most models should use the default pretrain
So they just make it faster and better?
okay so if I want to clone a voice I should get a dataset and a pretrain?
There's some lines on the spectogram that doesn't happen with OG pretrain
Don't worry about the pretrain, just use the default one in RVC
But yeah you need dataset
noobies fixed that i think
oh it has a pretrain?
Real?
needs to be merged tho
you can tell i have no idea what im talking about
Hot take maybe ai is fire fr.
Ye its built in
ah i see, just follow the normal instructions then?
alr thanks
Can I merge it
does this server also include deepfakes?
Lol barely
We are mostly ai voice changer server but you might find some smart people who have done deepfakes before
oh also what is the best way to stop being disconnected
To not use colab
oh nvm it got merged
oh lol
Can I use it
just download the newest version of applio
i thought there was a workaround
FUck yeah
i have a 2060 laptop and uh i wanna do other things
Uh there might be, but i don't use colab
Fair enough. It might be worth trying on the 2060, train it overnight, but colab is prob faster
Open any game alongside it and it dies
the alternative is to use a zero shot TTS, e.g. fishspeech, so you dont need to burn your laptop to train an rvc model
why is everyone by weights
the " fix for lines " is tied up to mrf from what I recall, btw
it won't work that way with current pretrains sadly
( Unless we talk about something else )
The new loss function does cancel the uspcaler lines rather quickly
with the regular generator/discriminator
clearing the topmost range is a bit problematic still
perhaps need to tweak the clamping value
mrf itself does not fix those upscaler lines, but it does fill the gaps
MultiScaleMelSpectrogramLoss
oh, that
aaaaa, that was the context lol
nevermind. Thought you added something else in there
nah, but it was a minimal fix that does something good without much overhaul
off topic, Imma give 48 more trials
AI HUB Docs


