#🧬│ai-chat
1 messages · Page 359 of 1

Anyways yall have a good night. If anyone needs anything hit up ai support and I’ll look around
sorry for that if feel like that ..
i am from pakistan , there is 3.33 pm now😅
ohh 😅
guys i installed clownfish voice chanegr how to uninstall it now
No.
hii can anyone teach me how to use the latest UVR5 to make ai cover? i can't find tutorial right now 😭 plz
hello i need turkish girl ai model someone have?
To find a voice model, go to #1175430844685484042 or look up on Weights.com.
any free ai for voice cloning
i just had a spark of creativity and now want to clone an ai voice again but high key i forgot how to do it
elaborate:
- ur pc gpu
- what u want to do
elaborate:
- ur pc gpu
- what u want to do
idk im kinda new to this whole thing
im running on an HP evny x360 laptop
i found this server like a year ago through some video online and all i really need is the software to clone an audio clip
why did i just get paperclip reacted?
oh it runs integrated graphics, meaning you can't run it locally but can on cloud
all yt tuts are old
makes sense
RVC = Retrieval-based-Voice-Conversion, the best Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models
Wokada = uses RVC for realtime inference
you need rvc right?
also Nice pfp
ty
I just noticed its lancer from deltarune
yeah
rvc sounds like the one i want
im pretty sure the old software i used had rvc in the name so
alright, I will ping you about the available cloud options in #✨│ai-help
aight
I downgraded Espeak version to 15.0 and it's working fine.
guys is there any tool or algorithm for mp3 to text.
any ASR that supports the language.. whisper3
Try melobytes audio transcription
thanks
i'm sorry if it's obvious but what is pitch extraction? and why does almost every model have it?
the method of extracting the base harmonic
oh uhh how do you do it if i may ask? step by step
running by running 16000 SR file thru 'f0 predictor/extraction algorithm', built in into rvc
I have a feeling you're using wokada and getting confused thinking that model infos are settings like epochs
i am indeed using w-okada
lol most of the infos you see on models are just infos about how the model was trained, more epochs or less don't affect quality, but be sure that the model was trained with mangio-crepe, crepe or rmvpe as those are the best pitch extractors, also known as f0
u can share a screenshot of ur wokada in #🔍│help-w-okada
I can help u out with the settings
oh yes please
Hey! I was wondering, would you guys know of places where I could look around for (amateur/hobbyist) voice actors for a potential RVC fandub project? People that wouldn't mind just providing their performance without the sound of their voice (because it'd be used for voice conversion). Casting Call Club has some pretty good VAs but unfortunately most of them don't really seem too enthusiastic about AI. I thought an AI focused server like this might a good place to ask.
guys anyone had idea abt discord sevrer making?
i made a server everytime someone sends a msg in genral chat it becomes a ping
Has anybody used Magnus?
Magnus?
a new Chinese ai
yes
guys anyone had idea abt discord sevrer making?
i made a server everytime someone sends a msg in genral chat it becomes a ping
what
like if someone say hi in my discord server
it will automaticly show as a notification
but its not the case for other servers liek this oen
if i say
HI
nobdy will get notification
untill i @ him/her
do u get pinged or just notifications when you’re not using discord ?
notification ig
and the sound
maybe i mess up setting up teh server
how ot fix
go to notifications and make it just pinged
Okay so I've created an app... for the time being it's using edge-tts to generate the speech output, not too slow, but obviously there's not a lot of control over the voices (though between rate and pitch you can do a lot more than you think).
That being said, it's hard to beat the 2 seconds or so it takes to get the audio (depending on length).
I want to do it with cloned voices, but it has to be fast, either on CPU or a 3060 12GB.
What are my best options?
i did
but thats oinly for me right
how to make it for every otehr meber as well>
yes you won't receive any msgs
ik ill only get pinged for @ now
but that only works for me
how to make it for other members as wqell?
i think its cz its a small server
my friend's is like tht too
That's a personal matter usually ppl get that mode get @ pinged
but as u r server owner you get it
yes
yh
oh ok
wdym small server
likewhat other server do that i havnt done
u owner of that server?
naw
how many members are there ?
then u shouldnt get ping/notification by every msg
4
so yes
they have to set it like you did rn
there
ok
yep i’ve too
yes
okk
How long should my voice sample be for training? Want it to sound the best it can
It can be around 20-30-40 mins.
As long as it's clean enough.
But the bigger the dataset, the better
(Anything exceeding 1 hour of dataset is unnecessary, also take care on your audio's consistence)
Hello, I'm new in Weights, how can I have an accurate Ai cover? My Ai covers are mostly dry and they sometimes can't withstand high notes
What should I fix so that my Ai covers are mostly accurate
And what is the possible Ai cover outcome on settings? (Sry I have bad grammar, my second language is English)
yo im tryna make long videos that teach ap topics with ai. is there any free no watermark ai video that are like actually good i lowk cant find shi
use high-quality vocal datasets and adjust pitch guidance and formant shifting. add reverb, eq, and slight autotune to make it sound natural (u can do this in a lot of music softwares) try different voice models for better accuracy. tweak settings to avoid robotic sounds.
What should I fix on the settings?
Vocals:
Pitch?
Pre Stemmed?
Additional:
Remove Echo/ Reverb? (Yes/No)
Focus on Background Vocals? (Yes/No)
Instrumental Pitch?
Volume Envelope?
Constant Protection?
Most of the music softwares are premium
its different for every vocal recording so honestly js tweak around
just look
.
Invideo?
How can I make my Ai Covers more accurate? They only withstand rap songs...
just get better acapellas
idk im not genius ngl but
when i made cleaner acapellas it just came out better
some songs work better than others
Can you suggest me one? I can't find ones since most of them are premium
dm me rq
question, how do people ai remaster concert vocals? i have a song that was previewed at a concert but idk how ppl remaster the vocals using ai so well
Is there a channel to share ai music too? I only see video and image
guyys is there a voice chaanger model for hindi/urdu?
not quite always rap songs, but also anything that don't have BV/harmonies that may be difficult to separate
how do i use the voices on a changer? is there a video tutorial on it?
go to #🔍│help-w-okada and read the pinned guide, esp the fork voice changer
any checkpoint recommendations for ai art? looking to make this https://x.com/supreme_waifus/status/1898921781966020793?s=46&t=9BOKYseLAv4A6bIbKoDxug
there are some AI mastering services but they don't quite replace professional mastering jobs. btw there's a model to remove the concert crowd in a colab notebook #📰│dev-updates message
AI MODELS
Of which?
Wonder if I can give the weights premium to someone 🤔
Weights Premium from #🏆│live-leaderboard. 
Yeah, gonna give it to someone else, don't quite need it
Just not sure who'll be the lucky person, yet 🤔
I already have Weights Premium. Might as well gonna get another Nitro Classic if I reach that rank number.
Mh mh
🐢
🥬
is "FINE SHIT" from carti the most obvious ai/lawson cover of all time or am i stupid
I'm trying to make an ai cover of a metal song with plankton's voice, but doesn't seem like there's any model that even supports screaming for metal?
you could use plugins that make it sound like growling
Fine shyts. 
sounds really noisy but nothing an eq cant fix
Trained voice models are pretty much that. If they were trained with softie audio dataset, the voice models can do softie audio.
If you use a voice model with a metal/rock vocal, the audio would still sound soft. 
I need some sample data to practise creating a chat agent, such as a business, and I need a lot of them so I can create a lot of chatbots. Could you guys please help me with this?
AI recommendations for creating backing track for existing vocals?
🕌 🌙
ramadan mubarak to you too talha
Can anyone help me?
I need to make a music with ai song but in my language not english
Ramadan mubarak
Hey guys. There is a female Russian model without squeakiness, but also without a robo-voice?
cuz i cant find it
shit
You can search rvc ai voice models at:
- #1175430844685484042
- In #🔍│find-models , Do /find with @hidden grotto
- https://weights.com/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.aihub.gg/essentials/how-to-make-voice-models/
:wave: @covert lake, How can I help?
Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image
so, you need to translate the language of a song to another? I don't think there's any AI that does that, atleast for free
No i want to write my own lyrics in my language
How do i find voice models if they dont pop up for me? is there like a requirement preventing me from seeing it instantly?
I miss u
So then write the lyrics in ur language
Or use google translate
Rio
On #1175430844685484042 ? that might be a discord model, see if there's a n X near the search bar and click it, also try restarting discord
Thanks i found some models
It says my ai cover was started and in queue 174
But the weights website won’t play
💔
guys how do i get a voice changer
Download one off a website and put it on ur computer
and after
Set it up and use it
realtime voice changer for calls?
tell your pc gpu in #🔍│help-w-okada
Is there a setting on resemble enhance that does not mute some if not all of the input singing audio? 😭
Hey y'all know how to remove the reverb delay because I use every reverb model and even Bandit Plus and still present
wddym
hmm
hmmmmmmmmmmm
UVR de-echo if you dont mind 17.5khz cutoff
Are there models that can be used without graphics? Beatrice type.
you mean the thumbnail in voice changer? it is optional and not a part of the model
I think it would be the format, since I don't have a graphics card... and the RVCs are not working well for me.
you did mean the graphics card 
it doesn't perform well without that in realtime
If I want to update applio / gradio. I just download the new version and let it overwrite the old files?
nvm i guess not since now my install is broken
Hello in English
RVC programs can be run with only CPU. It won't be that fast compared to a very fast GPU. You can't use a Beatrice voice model with any RVC program, but the original W-Okada version of course it does.
The only easiest way if you want RVC/W-Okada to be that fast so much, when your PC doesn't have a GPU, you can try run them on a cloud service like Google Colab and Kaggle instead.
Hey i'm kinda new to this training thing. I got a quick question. I downloaded a .pth file from huggingface, and i wanna use it on W-Okada for realtime voice conversion. It sounds quite cool, but i want to do some fine tuning because emotions like laughing, anger, sadness sound really weird with this voice model. I downloaded RVC WebGui, and i tried to load the model in there but it says "error" because the model apparently was trained on different sizes etc., what can i do to fix this or how can i do finetuning with this model? I already recorded .wav files to load etc., but its just not working. If anyone can help me i'll donate them 50 dollars.
just DM me if you like 🙂
You don't have to pay me. You can go #📑│making-models or #✨│ai-help if you wanna get help about training a voice model.
Hey , is there anyhow i can use RVC on my android phone , I got a pc that can run RVC natively , It's just that i want to use it on my android as well , is there anyway related to cloud or smth ?
Hi everyone,
I wanted to share an article I recently wrote on how Artificial Intelligence is transforming the world of investing. AI is making a huge impact on financial markets, from speeding up trades to analyzing huge amounts of data that human investors simply can’t process as quickly.
In the article, I dive into how algorithmic trading and AI-driven investment strategies are changing the way decisions are made. I also compare the strengths of AI with the intuition that human investors bring to the table. Along with that, I discuss the challenges and ethical concerns that come with the rise of these technologies.
Who else tries to run RVC on an Android phone? That gotta be real slow.
Hi guys, I recently made a model by voice on one site, but after downloading it back I got pth and json files, but my program for changing the voice does not use json, it needs index, can you tell me what to do?
Rola dont means Hello in portuguese 🥶
@tepid basin ngl I’m thinking of making RVC a Switch homebrew app
Wish my switch was moddable
Did something big happen
even the latest
wdym? U can jailbreak latest firmware since months
I don't keep up
mod it if u don’t care about online
wait for switch 2, it could use the gpu I suppose
u could use amd with rocm
switch uses Nvidia gpu
AMD APU devices like ROG ally can already use it
anyone can give a good e girl voice for trolling
buh
does anyone own photoshop
i wanted to make a jd vance meme and i'd rather not edit the background myself
Are you sure about this?
whats the best self hostable tts model right now
and one that doesnt take up 300 million gb of vram
depends on the language
english
ok
yeah im making a discord bot that can actually have like conversations with you in discord vcs
i have whisper and llama set up to take in audio and make responses
raise AssertionError("Torch not compiled with CUDA enabled")
AssertionError: Torch not compiled with CUDA enabled
how do i fix this
how do i activate the enviornment
i used this command to install it
pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu126
if you did not activate the environment, then it did install in the global repo
but then when you run whatever you're trying to run, it probably has its own environment where there's torch
im trying to run insanely-fast-whisper
or, perhaps, you did not uninstall previous torch so your pip install did not install cuda
i uninsatlled previous torch
it likely said 'requirements already satisfied'
how do i check
Installing collected packages: torch
Attempting uninstall: torch
Found existing installation: torch 2.3.1+cu121
Uninstalling torch-2.3.1+cu121:
Successfully uninstalled torch-2.3.1+cu121
WARNING: The scripts torchfrtrace.exe and torchrun.exe are installed in 'C:\Users\htauk\AppData\Local\Programs\Python\Python312\Scripts' which is not on PATH.
Consider adding this directory to PATH or, if you prefer to suppress this warning, use --no-warn-script-location.
Successfully installed torch-2.6.0+cu126
3.12 python owww
should i update to 3.13?
hey
hi ai
Use Python 3.10 or 3.11 to pip install instead. Python 3.12 and newer versions have problems with certain packages. Make sure you have set PATH to the program at installation.
Or here's how you add folder path to PATH environment.
Machine, I will cut you down, break you apart, splay the gore of your profane form across the stars! I will grind you down until the very sparks cry for mercy! My hands shall relish ending you... here and now!
Hello, I don't know if this is the right place to ask about this, so sorry in advance. I want to ask if anyone knows where we can generate AI TTS that supports language switching? Because as far as I've searched, I only found AI TTS in English. I want to make AI TTS for memes circulating on the internet. So, does anyone know?
There are different Text To Speech (TTS) AIs:
GPT So Vits: RVC isn't as good as GPT So Vits for tts, but gpt so vits (few shot tts, which means needs just a lil training for models) can't use rvc models (and viceversa), and its only limited to: english, chinese & japanese, if you wanna check gpt so vits instead, read https://docs.ai-hub.wtf/tts/gpt-sovits/
Freemium 11labs: Easy way to do TTS is https://elevenlabs.io/, you can't use RVC model on this but its a mostly premium easy way for good quality TTS
FishSpeech: FishSpeech is a 0 shot (no explicit training needed) TTS, if you got a good pc you can use it locally else use their site
You can check TTS in our tts index
With RVC Models:
RVC is natively for Speech To Speech, but forks such as ilaria rvc mainline & applio have built in tts (using Microsoft Edge TTS to make a generated tts audio, which i suggest you to choose a tts model that is the same gender and language of the rvc model you wanna use, and then convert it with rvc)
If you wanna do tts locally with RVC Voice Models (if you got a good pc):
If you don't got a good pc you can do tts with RVC Voice Models on cloud:
-
Ilaria RVC Zero (Running on A100 GPU, free fasted rvc on cloud) and the guide
-
Use Applio UI Colab (with google colab T4 free daily limit gpu)
-
You could try another tts from our tts index and use the output as an input in rvc
U don't play ultrakill?
No
I don't
In fact, fps gamse are a no go for me, unless it's some good campaign titanfall / bf3 / cod / wolfenstein but nowadays you won't find such gems
You should 🙏
Thank you!
Hi, does anyone know what technology discord is using to apply voice filters?
Im really curious
Prob some version of voice mod
voice mod, you mean voicemod[.]net?¿
Ye
Mmm not sure i saw some RVC2 models uploaded by some user, and far as i know voicemod is not supporting RVC
I listened to some of these new built-in voice filters (at least the girl voice filters) and these sound kinda like AI, but mixed with voicemod.
huh
are they convincing at all?
Not at all.
Voicemod ones or Discord ones?
Discord ones.
lol as expected
I listened to them and these sound crappy as hell
oh discord
Voicemod sounds great? or meh?
I think voicemod is kinda regular.
what is toptier for you?
RVCv2 models?
its meh
rvc my beloved
RVCV2 models if trained/cleaned properly.
depends on your gpu
Welp, it will kinda depend on the model and your gpu.
раша есть халпаните
purple
What did he say razy?
yippi
Пожалуйста, используйте русский канал чата.
no idea
lol what
ima send this to my russian friend
put it through chat gpt
Ah, I see! The phrase "раша есть халпаните" is in Russian. When translated directly:
"раша" refers to "Rasha" or "Russia" (informally, often used as a slang term for Russia).
"есть" means "to eat" or "there is" (depending on context).
"халпаните" seems like a non-standard word or a misspelling, but it may be intended to mean something like "help me" or "bring it to me," possibly derived from a regional or slang usage.
The phrase likely means something like "Russia, eat (or help) me" or "Russia, bring it to me." However, "халпаните" isn't a common word in Russian, so it could be part of a dialect or informal slang.
If it's from a specific context or meme, the meaning could change! Could you provide any additional details on where you saw it?
some brain damaged russian-english mix
хелпаните = help me please
you gotta go an extra step instead of using shorter "помогите"
That explains it lmao
Thanks Nooby
you have not heard the classic from Brighton Beach grocery stores, I guess
Macaronic language is any expression using a mixture of languages, particularly bilingual puns or situations in which the languages are otherwise used in the same context (rather than simply discrete segments of a text being in different languages). Hybrid words are effectively "internally macaronic". In spoken language, code-switching is using ...
there's explanation
Runglish, Ruslish, Russlish (Russian: рунглиш, руслиш, русслиш), or Russian English, is a language born out of a mixture of the English and Russian languages. This is common among Russian speakers who speak English as a second language, and it is mainly spoken in post-Soviet states.
The earliest of these portmanteau words is Russlish, dating fro...
Does anybody use Google AI Studio?
Hi
I use in beging, but i realy no like Gemini, i no know but to me is the wost ai for me
Now the launch the gemma 3, but i think is only one Gemini using 32b (i no know what this Billions)
hola alguien tiene modelos de voces de artistas de rreggeaton
Modelos de regeneração ?
does anyone know how i would make a choir sound similar to on sight by Kanye
bro thx
just prepared a 30 minute dataset in 10 minutes 🗣️
very high quality very certain
chat
best singing english male pretrain
still OG?
KLM?
Verdade sua
Try KLM.
am i doing something wrong or is rmvpe just ass sometimes
especially when a male voice does a low to mid pickup for a phrase
hey guys
klm gives the model a cleaner sound compared to the original pretrain
I can confirm
i tested it with both noisy dataset and clean datasets
Una vez el KLM me ayudó bien para un modelo coreano
Así que supongo que debe servir bien para hacer datasets de cantantes
el rango de voz del klm final es mas bajo que los anteriores
Más bajo?
Bueno, OG superiority.
esta weno para ingles tambien
eso si no puede lidiar con ruidos randoms como clicks y golpes al mic
mas bajo que los klm anteriores, mas alto que el og
Con que tenga un rango de voz más alto que el OG está bien
De todos modos me iba mal con los KLM anteriores debido a que esos todavía estaban en beta
el final todo esta bien, 0 errores
gradients buenos, losses mejores que el og
Eso es lo que importa almenos
Sirve como alternativa provisional al OG para casos especificos
recomiendo usar el original si el dataset es muy ruidoso y tiene muchos sonidos random 🦈
Opino lo mismo.
Aunque soy demasiado tryhard y siempre limpio mis datasets de manera impecable hasta donde pueda.
hey guys, quick question, does w-okada work on android, or is rvc the only one that does?
only pc
okay, gotcha, thanks
do you have a example of this? 👀
Way lelena pory
-rt
Interaction has expired, use the command again for a new interaction.
-# The prefix for commands is !
Select a category from the menu down below to view all related commands
Unlock the world of LunaBotPrime, absolutely free! Dive into the realm of premium music quality, enjoy it to the fullest. LunaBotPrime - where the music never stops, and the magic continues!
You can invite LunaBotPrime with this link
LunaBot 🌙 is the perfect music bot! Feature rich with high quality music! And Custom Playlist
You can start listening music by just joinning a voice channel and typing: /play [song name or link] (Remove brackets).
We support only Spotify, soundcloud, bandcamp and more!
To view more help on a specific command or category, run
/help <command> or /help <category>
Important Links:
Support
Premium
Invite
Command Categories:
🎶: Music
💰: Premium
⚙️: Utility
📕: Admin
Select A Page From Dropdown Menu Below
You are not allowed to use this command.
leave it
if you have enough of it laughing can be good
ok so
insanely fast whisper,
it is not insanely fast
it takes a long time to set up before it actually transcribes
how do i remove that set up time
Either applio, weights or mainline kaggle
There you have a guide.
-rvc
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
if the dataset laughs too much in the end I'm training it with her and I'll see how it goes.
but ty for the help
i am new here and i was wondering, how do i get the ai models (pth files) running on my computer, i got here in the trackerhub ai model tracker
I already have a RVC model, I was just wondering how to run them.
I have a 8 minute dataset ready for training. If I copy paste the same dataset to make it 16 min will the result be better?
no, it'll make it worse actually
Nope it will make it worse
Because you're not feeding RVC with new info
Just artificially duplicating it
saddicus ?
Is more epochs better for a short talking dataset? like 8 min
Actually you gotta watch the tensorboard and train till the lowest point.
.
Good morning ! Does Voice changer works with AMD cards ?
go to #🔍│help-w-okada and read the pinned guide
that doesn't mean to discuss it here instead of the right channel as I said there
Since you asked in this channel and not going to #🔍│help-w-okada, I will answer you anyway. Yes, "DirectML" W-Okada works with AMD/Intel GPU.
The thing is that the original version of W-Okada, AMD/Intel GPU there is poorly supported, which makes it not detecting any AMD/Intel GPU in some situations. The fork version DirectML W-Okada is at more optimized for AMD/Intel GPUs.
Did I explain it right?
Does it working with nvidia cpu
🥺
Does NVIDIA even makes a CPU?
Yes but not for gaming pc 💀
Iirc its server cpu
it is also project digits
not a mini "supercomputer" but rather a mini workstation
Can I be your mini workstation
Im 5'3
NO.
Kinky
Bi and trans?!
Twinning!!
💋 🍑 💗 😝 ✈️ ✈️ 💥 🏢
Typical ai hub user
I'm not trans by choice
Closest thing to being a girl
I wish I was a girl but that's society for ya
HOW. to send gif
just be a latina e-girl
THANK. you
GET. gud
I wanna be your e-girl
YOU'RE. welcome
YOU'RE welcome.
I do my own vocal train
Ing
Like I speak in a weird pitch irl
To get used to that pitch
And slowly train my throat uwuwuwuwuwuwuwuwuwu
Let's be real. I don't know how W-Okada would even work on an NVIDIA CPU. I only see NVIDIA being interested on making RISC-V.
I was trying to rage bait
Sorry
I'm not experienced with nvidia CPU

Is there any character you want me to train for you
I have arounddd 4-5 years of experience of isolating vocals, way before I discovered RVC
I remember spleeter gui or whatever it was and I also saw UVR release
I thiiink I know a little about isolation uwu
Back in my day we used to call it extraction
Actually I bet I could train an anime DIO model
Go to #1159289738314919936 and pick one to train. 
Thank you
Hi, someone has already try the voice of Inoxtag ?
No. What's up with that?
No, it's just that I'm new here and I'm installing software so that I can use Inoxtag's voice with my own for a class project
Welp, check the #1175430844685484042 channel to see if there's a inoxtag model done.
There doesn't seem to be any voice model of Inoxtag available in #1175430844685484042. Maybe no one has made it yet.
Hmmmm
Here's a lil thingy if anyone needs rvc model blending standalone
it's run with:
py model_blender_gui.py
or:
python model_blender_gui.py
Opens up such a thingy
( it does support merging of models that used old dict keys for sr, such as 48k instead of 48000 etc. (( og rvc most likely? and older builds / obscure forks)) )
🐢 🎂
🦈 🎂

life safer
🙏
👀
i dont run rvc locally so i need it :3'
Tutel having birthday, catch even more greenies 🥬 🥬 🥬 🥬 🥬
Hope you're having a good day ~ and again, Happy birthday ✨
Thanks cody!!!!
✨
Dw it's fine buddy
(whisperx) C:\Users\htauk\proj>conda install pytorch==2.0.0 torchaudio==2.0.0 pytorch-cuda=11.8 -c pytorch -c nvidia
Fetching package metadata ...............
Solving package specifications:
PackageNotFoundError: Dependencies missing in current win-64 channels:
- pytorch ==2.0.0 -> intel-openmp
- pytorch ==2.0.0 -> libuv >=1.44.2,<2.0a0
- pytorch ==2.0.0 -> mkl >=2018
- pytorch ==2.0.0 -> python >=3.10,<3.11.0a0
- pytorch ==2.0.0 -> typing_extensions
- torchaudio ==2.0.0 -> python >=3.8,<3.9.0a0
- torchaudio ==2.0.0 -> pytorch 2.0.0 -> intel-openmp
- torchaudio ==2.0.0 -> pytorch 2.0.0 -> libuv >=1.44.2,<2.0a0
- torchaudio ==2.0.0 -> pytorch 2.0.0 -> mkl >=2018
- torchaudio ==2.0.0 -> pytorch 2.0.0 -> typing_extensions
(and similarly for the other packages)
(whisperx) C:\Users\htauk\proj>
how to fix
@chilly lake do you know some ai-based software that can transcribe and generate subtitles? something easy to use locally
okay thanks
lil bigger output
how fast is it
I have rvc training running at the moment, so I think it should be faster
hm
ok
btw, im having trouble setting up whisperx
its a conda problem
do you think you could help
or nah
^
i dont use conda, I make the virtual environment normally, python -m venv venv, acivate, then install requirements
usually needs to update torch to cuda afterwards
should not use torch 2.0.0, it is probably not available any more
whisperx requires it
you can use higher version, 2.3.1 should work
Hi all. Is there any chat here where its allowed to share AI music? (YT)
Nope, promotion isn't allowed on the server anymore.
Yes
And we don't allow direct file sharing here either
Ok, thanks for info
Promo channel is removed.
Can I somehow download this sound?
it is not a voice model
I made a system called Sunshine that tracks humans using Bluetooth and Wi-Fi adapters and triangulates with multiple devices
similar to this one
coll
It can accurately track humans by their disturbances to the frequency. So you can ideally see behind walls or know where people are in a building without seeing them
Thx @wmlx
@proud horizon
Because an old man named Mike keeps spamming in #✦│chat already, so I'll have to move here instead.
By the way, he's 32-33 years old, but acting like some 13 years old kid though. His actions aren't quite mature to go around.
Don't promote some random site here.
Pretty much he is. His ass doesn't seem to be employed.
I built Oblix. Not random site, would love if you can try and give feedback.

text/image to video AIs:
- Locally (runs on ur pc):
- pyramid flow (Image/Text to Video)
- cogvideox 1.5 5b: Image to Video, Text to Video
- Cloud (remote good pc, running on an online website for example, easier to setup):
- Weights.gg
- pyramid flow (Image/Text to Video) (HuggingFace Space)
- OpenAI Sora (paid only, in some countries)
- lumalabs
- Hailoua AI
@dusky pewter sorry but promos ain't allowed here
Nice gonna check it when I'm home
Hi! I’m trying to find the training file in tortoise tts and can’t find it… any suggestions?
Anyone knows a model that can "enchance" images? I have an image sketch and I want it made better. For example turning game art into a promotional art piece for the game. I have a sketch of the protagonist and some zombies, but I want AI to make them a bit better, improve the lightning, make them stand in cool poses and such. Is there a model for that?
which site are yall using to train rvc models now? that kaggle notebook stopped working
There are two options
- Use a built-in tool in Automatic1111 Stable Diffusion.
- Draw them by yourself more.
Other than RVC Kaggle notebook, there's Applio the RVC.
which one works? the reason I moved to kaggle is cuz the Applio didn't work months ago
Applio is available as locally program and Google Colab notebook. Not sure about Kaggle one.
Months ago? That was long.
talking about the colab
Google Colab banned web UI notebooks for free users.
what about a non web ui one?
that's what I used to train with until it stopped working
You can indeed use the command line one, but it would be hard to get around since you'll have to click each code cell to work.
is there a general pre processing problem with rvc?
when I do manage to get something to work it just doesn't pre process
Preprocess completed in 0.00 seconds on 00:00:00 seconds of audio.
did you point it to a folder or to a file?
point it to a folder
I pointed it to the zipped datset
so put you wav files into the folder on the drive, provide a folder name
since ever
RVC does not unzip anything
rvc disconnected has a cell 'Load Dataset'
it does the unzipping, but preprocess needs a folder name
Applio NoUI has no such thing, you need to just do it manually on google drive
I wonder why it shut down ugh
I'll try now, thanks
nobody wants to maintain it
and google breaks colab every week
any1 knows how to record with rvc on fl
thank you!! pre procesing finally worked🙏
Never heard of Applio being made into command line-only Colab notebook. Only some RVC notebooks that are non-web UI.
You can use all applio functions from command line, so why not
can anyone help me change the path of c++ so its in the directory im trying to use to install pyramid flow
Hi
true
Is removed
I no know, i think are removed because haved so many bad or offensives ia musics
Promo channel is removed from this server.
if it's really what you want, you can leave it in #1159516963014451302
Not really but close enough, AI covers are mostly copyrighted.
Promo channel lacked some good moderation, compared to other chat channels.
alguem tem aihubbrasil?

someone do rvc with fl studio
I don't know what to explain, but you may use an UVR5 model to remove reverb from audio.
What does this mean?
use vocoflex atp
use vocoflex atp
Don't try to evade automod here with slur words.
this is nice but smh lm studio
Still.
it is the easiest way to run a quantized llm model
orpheus is llm that does token processing
the token then get voiced by a generator
oh.. well, I have no idea
you use lm studio?
for this
what do u use usually for llms?
also that's really interesting
I haven't heard before an llm that does tts too
i dont
xtts, fish, most of modern ones are using GPT/LLM to expand the text prompt
that's where the expressiveness is coming from
what an excuse
excuse for
Hi
hekkio gyuts
How can i use This chat for sintaxis
which settings you use guys to play games but not lag the ai?
No one knows?
guys is there any model to hair color change?
what?
use #🔍│help-w-okada and elaborate
promos ain't allowed
can't you do that with photoshop ?
I couldn't do it because I had to change blue hair to blond like light brown
This server is SFW, we don't allow topics on how to make sexually suggestive accounts to get views
I don't think there's any other thing that can do it other than:
- photoshop skills
- image to image ai, being able to change a part with generative fill
ok thanks, appreciate it
So we can't share music videos we made on here?
does anyone have access to an AI tool that makes all pictures/boxes for a strip/comic?
promos aren't allowed
Hey guys I’m gonna buy a pc mic headset from GameStop today. How do I set up the vb audio cables to the live local real time voice changer app to talk in discord etc
don't ask in multiple channels
Do you know anything i can use to improve the room noise from the background from a person talking before training? Theres a small distance between the person and mic.. and you can hear it when i talk in rvc
I used x-minus to this
could you elaborate:
- your pc gpu
- your pc os
- the issue more detailedly
Thats the issue. I use mimicpc.
I too, i use Android in a dex mode 🤤
mimicpc?
we have a documentation for the best audioo cleaning models
ohh cloud, well I dunno that site, but you could check https://docs.aihub.gg/rvc/resources/dataset-isolation/#vocal-isolation--cleaning
Last update: Dec 24, 2024
I checked and Im not sure whats the best option for the room background noise
maybe you could try Mel denoiser
What is ''mimic pc'' ?
Easier if u google
Ok..
I didn´t find nothing in the google
mic recordings always have background noise, so you should use that denoise model
Só true bestie
um br no meio de gringos, pqp
- A PC case that looks like a chest monster with teeth named mimic.
- A PC that mimics another PC but sometimes with different specs.
That's what I know.
bob esponja caca y pipi
Legitimo
Yes
Anyone really good with LSTMs? I just have some questions
`1
my tupac model so good how would you rate it
hello
Worked. I used De reverb mono to remove the room echo
saddicus
hello
Why is that there owo
mac 
i've ventured into 'what is activation function' field
And they have that shit in there owo
I will never find peace....
fp16 training makes it very weird
there's high precision around 0, but step away and it is multipliers of 64
im back once again
i havent made an ai using colab in around... 1 year maybe
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- Hina's Mod AICoverGen WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Hina's Modified Original W-Okada's Realtime Voice Changer, Google Colab
- FaceFusion UI, by Nick088 Google Colab
- FaceFusion NO UI, by Nick088 Google Colab
- EasyGUI, by Rejekts Google Colab
- 🆕 Music Source Separation Training (Inference), by Jarredou & Makidanye Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
does anyone even remember me lol
ok im gonna try to train a model a whole year after not using colab
Moved to /faq command.
Overtraining, or overfitting, in machine learning is when a model learns too much from the training data, including noise or irrelevant details. As a result, it performs well on the training data but struggles to make accurate predictions on new, unseen data.
💡 Tip: An effective way to detect overtraining is checking if the TensorBoard Graph starts rising and never comes back down.
promos aint allowed and will be deleted
do u need any help
who is good with making ai songs (like fake songs)
ai songs ?
Hi guys ! I'm new here... I wrote a song a couple months ago and I'd like to hear it with my own voice but unfortunately, I can't sing. Anyone has a free app or website that makes AI singing voices ? I mean... I saw many of you know how to clone celebrities' voices so maybe you know how I can clone my own voice ?
Hello everyone I want share my creations with you, where can I do it ?
We don't allow promos
Rvc does that, but you need to train your own model by signing and talking
Wouldn't that be bit hard? 🤔 If they "can't" sing
Meaning they can't really train their own voice module for the VC
I mean they can train it based on their voice but not based on their singing voice
technically you can make a speech model sing but yeah the results aren't the best
some very expressive speech models are able to sing almost like a singing model
but still is not as good as a true singing model
Hi, guys
yo
i need some help with ai
i'm looking for the best ai
like for learning
as a student
Learning about? I would just talk to Gemini, Claude, deepseek, Chatgpt and grok until your brain grows bigger
comp sci
i need to insta learn concepts and units
need it to explain everything like a teacher
just tell me any ai that could be useful
whay
Is it rvc or w-okada
He's already getting help in #🔍│help-w-okada. It's just that he didn't know where to ask about it.
i see, you are very helpful
Thank you
hello. what app or site should i use to change voice?
i tried pretty hard to get animate anyone working to no avail
just irresolvable depend issues all around
What's your PC GPU? What do you want to do?
There are two different programs you use to "change voice" as what you said:
- RVC, uses to do AI cover
- W-Okada the realtime voice changer
Pick one.
What's up?
i did way too much to be able to provide any useful info lol
also found an unnoficial local version but when installed with uv it returns
File "/home/user/AnimateAnyone/app.py", line 5, in <module>
import gradio
ModuleNotFoundError: No module named 'gradio'
even though i have it installed
Never heard of this one.
heard of what
AnimateAnyone 
ah i see
basically it takes a photo of someone and uses a video to map movements of another person and animate the photo
im fucking nuking pyenv because it wont stop pointing my system verison to itself in 3.10.8 lol
saved all my envs to a text file so i can set them back up later
not like a lot of them mattered anymore anyway
finallly got the local verison running
had to leave some reqs versionless in the file, had to do a very minor debug in the python verison its using, make a dir in project dir and add a specifically named image to it
nah lol, more issues still
but i technically got it up
missed a step, actually
still more problems
what is it
voice changer dose not work
what program are you using though
ok wait
yes the same
for me when i test the sound dosent change and itrepeats the same voice as mine
idk
are you sure you properly loaded the model
also dont ping me, please
it freezes discord because stupid linux issue
but normally i wouldnt care
lol
it has several default models
i use them
okay i wont ping anymore
tnx
well i also know nothing about your setup either
on another note it seems like uv has problems running a bunch of scripts and will output the same error about not finding gradio even though it was used to pip install it
it's fine though because calling to python3 still works
the one time i used w-okada the output was much worse than the voice changer from the original RVC-webui repo, despite being told it's supposed to be better, apparently
what else do you use ?
yoo
can some one help me out
is there any live translator ai model
<@&1159293204038955078>
no we dont really have any resources for that on here
the only one I heard is https://github.com/Sharrnah/whispering
you could google some stuff tho
thhanks buddy
not really, tried working on that once.
already suggested a program to him
suggested a program to him* (i cant help it)
wrong grammar again
yup
good
Sounds scammy
the deal is legit
but he is offering one of his websites it seems
and he is also offering to do it for you for 10$

guys is realtime ai canvas still a hype or nah
NVIDIA realtime AI terrain generator thing is so long dead. Now there's Stable Diffusion, where it can be used to generate image in realtime.
For W-Okada, go to #🔍│help-w-okada, and let's not asking some random member about it here.
alr thank you! I thought there might be more
You want him to sing Certified Lover Man or something? 
nah gang lmao im being deadass
Do you mean to do "RVC" AI cover? Nah, I think you can do that by yourself.
Hi everyone, who want 1 year sub perplexity- have simply guide to activate - no promotion , its is free
Tell me about your PC specs
There are many ways to generate an AI image:
- use Weights bot in #🤖│bots, but you'll have to connect your Weights.com account first.
- any website that offers AI image generation, both free and paid.
- a Stable Diffusion program like Automatic1111, Forge and ComfyUI.
There are plenty of image generator websites you can visit in this list. #🏙│ai-images message
promos ain't allowed
Can i speak on russian in this chat? I need help with one programm
we don't allow advertising and promotions
this server is mostly english only except for 3 channels, either ask in channels in the help category or https://discord.com/channels/1159260121998827560/1159346439424573440
you will probably find more help in english
thank
also remember to elaborate:
- your pc gpu
- the issue
- what u want to do
when asking for help
don't just say one program, as there's thousands of programs
hie
detroit become human foreshadowing years before wow
I think there are much earlier examples... like Star Trek
Data creating paintings as AI art too XD
Does anyone know of any free software for automatic subtitling?
You can use any ASR model, an example of a project I made: https://huggingface.co/spaces/Nick088/Fast-Subtitle-Maker
Does it work for Spanish and Portuguese languages?
i fixed it
But the pronunciation is not accurate
For example, I say hello and she says hellioa
😂😂
_>
It depends on the model, the whisper large v3 supports both of those
what happen for X? i cant find it
Can you elaborate
How do I do ai vocal changer
You need to elaborate
What's your PC GPU? What do you want to do?
I want to convert vocal track into a different voice
What's your PC GPU?
Can I do it on mobile?
Oh you're on phone,
Well phones could technically do it locally (running on the hardware) via termux but it's not suggested since its slow
It's better you use cloud (remote good PC)
RVC = Retrieval-based-Voice-Conversion, the best Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models
Wokada = uses RVC for realtime inference
Train (make) RVC Models on cloud:
- Prepare the Dataset
- Setup RVC:
Choose a cloud way to use RVC,
- Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
- Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
- Mainline (UI)
- Applio by Vidal (UI)
- Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
- Be sure to know about the tensorboard
Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time
If you are looking for the easiest way and for free, is using https://weights.com/ which ofc uses RVC
RVC Inference (use models) on pre-recorded audio on Cloud
You can use either:
- Weights.com: Easiest Possible Ever Automatic
- Ilaria RVC Zero: Fastest free on cloud
- Applio UI Colab: RVC Fork with some extra features like TTS
- RVC AI Cover Maker UI: Automatically Separates the Vocals and Instrumentals, converts the voice and mixes them back
For any issues, ask in #✨│ai-help
I’ve done the Google collab thing before but I can’t remember which one is was
I gave you already all the working current one
And if you're talking about one of like 2 years ago, it's prob an outdated one
It's better you use one that I gave you
im searching for actual like vocalists
like artists type shit you know
cause i know where to look for the model
i just want somebody that can hop on a song as a drake feature
oi
baka
baaaaaaaaaaaaka
no promos allowed
Heya
I have a problem
I know it will be a bit funny but I want to play a woman on fivem but my voice doesn't sound female enough. That's why I need to support myself with tools. Do you recommend any voice changer that will also sound good in Polish?
chat why cant i find the applio option anymore on RVC. I wanna use RefineGAN. There's no option on the main branch
Hi guys, I want to train a model based on my voice. Kind of torn on how to make good data. I want the model to include singing, rapping, and talking from me. How much of each should I use and how many minutes of data as a whole? The model will mainly be used for music.
if i wanted to clone my voice and make it sound like it was me on an mp3 file how would i do that
they called it lol
detroid androids were incredibly cheap
like $8000?
even 80k would be ridiculously low
It takes place in 2038... whatever Nvidia GPU is being sold at that time will probably cost more than those androids
still with 24GB VRAM
Introducing the new Nvidia 8070 TI Super Ultra Omega... with 8GB VRAM and 200x frame generation
neuro sama become human
AMD 9070 XT is laughing at this
elon might have shadowbanned you
No one here trained Beatrice model, only RVC voice models are available from this server.
go make your own
Beatrice voice models are faster than RVC, but also give lower quality at the same time.
I've never seen any program that can train a Beatrice model. I only see RVC. 
e
How do I make ai songs? And could you show me the website if you guys want to?
There are Suno and Udio.
What's that?
You don't know? These websites are AI song generators.
uwu~
yep
I paid attention to your message just because there was the word "Suno" in it, I already used that. 😅😅
hey i just joined and i want some info or help on which AI is the best for coding or website building?
(free one)
Hey, has anyone here worked with MCP servers.
Just like how cursor allows the users to add MCP servers to their IDE to add functionality over their AI agent,
I have my own tool with an LLM for which i want to add the MCP servers, how is this possible
can anyone give me some insights
Guys, please help! Where did this table with the program installation disappear? How can I download it now?
disappeared on github
elaborate:
- your pc gpu
- what u want to do
- what program are u talking about
voice changer program. I could install it before. The program's website on github just disappeared.
you prob followed a 2 years old yt tut much time ago
programs change and adapt over time
also, you need to elaborate still:
- your pc gpu
- if you want the voice changer to be realtimw oe pre-recorded audios
4.20
voice changer to be realtimw
4.20? that's not a gpu
for realtime u will need wokada, check ur pc gpu in task manager and tell in #🔍│help-w-okada
3701
For local, QwQ 32B is amazing for its size
oh ok
Are you sure that's a right NVIDIA GeForce or AMD Radeon RX GPU number?
its Rysen
ryzen*
yeah that's not a pc gpu
You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
yeah..
AMD Ryzen is a CPU, not GPU.
amd ryzen is a cpu


AI HUB Docs

