#🧬│ai-chat
1 messages · Page 352 of 1
but
So what's up with that? available anywhere or, still premium
they are pretty noisy
the buzzy / after-sep noise?
sort of yeah
Need to build a set for rvc so ye
Any you guys know that don't have aftereffects but still outperform mvsep's bsrofo?
think I'd be satisfied enough with such
but they are insanely good at fully extracting vocals from the song
Hmmm.. guess I'll have to test stuff around then
you could try syhft v3
what are they run on? uvr? or some other framework?
if uvr, which ver
uvr and msst script
got any links?
gui-wx.py launches the gui where you can set up the models and their configs
and it has download models thingy
Bet
for uvr you will need to look into audio sep server
oh yea, cause for uvr I see there's 5.6
I still run on first beta that supports bsrofo lol
as anjok has released new patches only in the server currently but he's preparing a big new update
np
i personally recommend to try big beta 4 by unwa if you want "full" vocals but without the static noise that are created by models that are trained with emphasis on some metric
ye actually wanted to but
Got a lil lost
I suppose it's not this one
or is it, but uses different internal naming? / ver indexing ?
I suppose not cause of the date
so
he has big beta models those are for vocals, the models with "e" are models with emphasis
he also made kim ft models which are basically slightly improved versions of kim's melband roformer
the server?
for unwa models?
big beta
oh, there
that i linked
Got it. That should be all
Thanks once again
( ps, sorry I asked so much. I just happen to now have limited bandwidth so, gotta be careful with what I potentially get rip me
So better ask than be sorry later
also if you want to dereverb vocals use anvuew melband dereverb v2
it's the best dereverb model for singing, it literally eliminates 99.9% of the reverb somehow
and it even cleans up the bleed
you think It'll work for speech too? ( artificial reverb's artificial reverb but ye, guess I'll check
yeah no problem, man, ask whenever you want, i'm always glad to help
✨ 🙏
sadly no, for speech better to use dialogue dereverb
but it's the best for singing vocals
oh yea, in that case Imma stick to my ai vst
dialogue derev genuinely sucks ass ( all in rx that's " machine learning " pretty much
tho ye. Imma go off for some testing now, wish me luck
i like it in rx11
what vst do you usually use?
For me personally it's too damaging
but then, I am quite sensitive for that
wave's dereverb pro
ah
i tried it, didn't like it, leaves too much echo for my taste
and if you give it a lil bit of your input in rx then it shines
else ye, for users not willing to work a lil on the output, might not be the best
as you pointed it out
leaves a bit of trails to remove manually
But good thing about it is, it doesn't damage the spectrum on it's own so, for audio geeks like me that like to play around, it's perfect
i usually try acon's deverbarate first, imo it does a pretty good job of being balanced between aggressiveness and accuracy, and if it fails i just do rx 11 dialogue dereverb with low sensitivity and it works for me
sensitivity put to 10 is pretty bad most of the time, so i put 4-5
yeah
alr
msst script supports all architectures and models
the gui is just bas curtiz's lil script that makes the usage of the script a lil bit more convenient
got it
have you ever looked into apollo models?
not really
past the time when bsrofo was the king and mel roformer was just barely usable, I stopped using separators tbh
so, got tons of new things to try really
it's really cool, maybe not super useful for rvc, but still
basically an apollo model aims to improve low quality (32kbps-128kbps) audio into lossless quality
the original model was already doing pretty well, but it was small and trained on low amount of data
but the guy called lew on audio sep trained 2 different apollo models
first one is vocal enhancer, which tries to fix the muddiness of bs roformer vocals
ah that
I remember it now
is it based on the premise of audio super res? / upon it's core ?
as in, diffusion-based ?
and it does it pretty well but is pretty noise, the second version is better though
i have zero clue, but it works slightly in a different way than audio super resolution
i think it's some kind of generative model, i'm not sure
and the second model that lew made is called universal enhancer, and it's so fucking amazing
it handles low quality compression artifacts really well
works for vocals too but also adds noise, but you can denoise it easily
Hmmm.. I could see how it does one time when enhancing the visual novel based audios
some novels are sadly.. well yea, using trashy compression
some obscure crap codecs
basically that model has bigger dataset, more params and aims at a wider variety of different lossy formats
aaa
I suppose folks that wanna revive some old collections of mp3s or aac can thrive now
absolutely
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- FaceFusion UI, by Nick088 Google Colab
- FaceFusion NO UI, by Nick088 Google Colab
- EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
Onuro AI > Cursor AI
What is the best way to isolate vocals, and leave it clean (without mvsep, it takes too long)
you can use a website for this lemme get it rq
you can do this
also has other options
ew spleeter in 2025
eh gets the job done
personally i use it for insturmentals
I'd recommend this colab notebook for best roformer models, much much better than that spleeter model
unwa's mel roformer inst v1/v1e is the best for instrumental stem
i....have no idea what half of those words mean
you typed when I forgor to include link
if this link gives me a virus ill send an airstrike your way
god i hate mac
why is windows just so much better
it's literally a colab link, why do you suspect it?
im a noob/have next to no experience when it comes to technology
literally only joined this server so i could get ai voice models to do a funny bit for a video of mine
but imam try and see if i can actually learn this stuff
lowkey soudns interesting
It helped so much!
THATS SO CLEAN THANK SOU SO MUCJ
Hey @polar flax , can u say me whats the model that separate lead vocals and background vocals?
And a model that remove reverb and delays
If u can pls :3
yeah use this instead
you helped me to thanks
wft why my autocorrect keeps correcting "helped" to "help-desk"
https://github.com/Chenglin-Yang/1.58bit.flux @covert lake
We present 1.58-bit FLUX, the first successful approach to quantizing the state-of-the-art text-to-image generation model, FLUX.1-dev, using 1.58-bit weights (i.e., values in {-1, 0, +1}) while maintaining comparable performance for generating 1024 x 1024 images. Notably, our quantization method operates without access to image data, relying sol...
this paper says that it will basically reduce the size and memory usage of flux dev by 7.7 and 5.1 respectively without losing much of the quality
3 gb flux models coming soon ig
that may produce good quality
Ye I have heard of it
I feel like there will be deffo a big quality loss tho
maybe, but if it'll be like q8 quality or even better then it's massive
If it's less than that, prob not worth it
i mean, q4 and q6 still look decent, so if it'll match the quality of q8 it'll totally be worth it
who knows maybe it'll also improve inference speed
We just gotta wait
I'm glad people are finding ways to run good ai on not super expensive cards
i mean, not only that, but it also might give more room for further improvements
imagine someone just makes gigantic model with huge amount of params and then just scales it down with that thing
Yea
SLM are getting good too
Btw, did u ever train a LoRA locally?
no
AI Hub by Weights. 
- melroformer karaoke (by aufr)
- mvsep male/female separation (for duets, better than Sucial's but may not be perfect)
- melroformer dereverb (by anvuew)
btw u can try the tweaked version by me: https://colab.research.google.com/drive/1IC6Q1hLF55_tK6mhky0SWYKGVF9T5WsY?usp=drive_link#scrollTo=vKOCPJkyw9yh
@barren kiln -> #🌏│русский
ok
aah
I've already written there.
or u can write #🔍│help-w-okada
so i saw sowwy
- Maybe you have "monitor" turned on or something like that + You need a virtual cable
that's all i know 😅
should I put a virtual cable in the "monitor" option?
no monitor for you to listen to yourself
Honestly, I'm not talking about this 😅
I'm not a master
Like there's a dash in the "monitor" option?
(
Hm..
I think I had instructions somewhere
wait
Guide Written by:
Github - VTArcelia
Discord User - https://discord.com/users/824922747423031359 aka VTArcelia
Thanks to the following people : lusbert, poopmaster, felt, fazemasta, antasma, shadictl, x_hina, sushi
thanks are for anything added to guide, taken from any talks, settings added when...
what should I do if I have already clicked on the "passthru" button?
ok
The "passthru" button outputs your actual microphone audio instead of the converted voice audio.
can you download ur weights.gg models you trained? thanks for now answering, yes you can
Does anyone know if you can train a UVR model for a specific artist?
no
Ah dang
ofc you can do that but its super complex
your better off asking the audio separation server though yeah you can
but it will be shit
theres different archetypes though
i tried doing a vocaloid one like almost half a decade ago and it was decent lol
because youre skilled
too bad i lost the data. probably would be better off trained on newer archetypes anyway
v6
thank you, you do have a link for it?
AVbr9KV3kw
just put that on here
is this main chat
yes
k
maybe
Do you guys know how to Change the sound files in Minecraft
with a pack
i am wondering, currently ai voice models doesn't do good on non-vocal things (cough, sneeze, moans, etc). Is it because the lack of data on those sounds? Will they be able to do it if we provided enough data of those on the training model? like 5-10 minutes dedicated to only noises like those?
maybe its too much, but also depends on the pretrain, probably not many data present in that
so around 2-3 mins?
so like 10 mins of speaking
maybe even less
huh
for my experience
do you have any examples that i can listen to if i may ask?
not public, sorry
ahh okay
i usually dont give people models away
oh its a comms?
yup
nono im not asking for the models, i was asking for a sample short output (mp3s?)of the models coughing/sneezing but i think thats still private property hahah
ohhhhhh
yes
no i dont save those
ahh
okioki
then can i ask about your training settings? is mangio crepe 32 hop length still the best?
or did the community discover something better?
nah use rmvpe
rmvpe is better?
is it better for talking only or in general use model like singing too
how many steps usually and how long is the data you usually use?
depends on how much data i have
less than 10 minutes 300 epochs, less than 20 500 and then i check if everything is okay
optimally how much / how many minutes do you prefer
because i usually do 10 mins 600 steps
both usage but mostly for inference not realtime
do you have different settings for both?
or is there a good setting for both usage at once?
for normal inference id say 10 minutes is enough for a good result, sure not PERFECT but you still have a good result
sorry for the many questions, the way i use these are the old ways like around a year ago , i don't know if there are much findings hahah
no prob i love answering when i can :)
10 mins with 500 steps?
300 otherwise theres overtraining issue, also epochs not steps
i want to make a model where it is good for both singing and speaking(inference, not realtime) , and is capable of non-vocal noises like coughs
and btw if you want really better results you should learn to use the tensorboard
to see the overtraining?
important thing is to check if the model is clean, the more clean it is the more you can do basically
or undertraining
isnt epoch like, 10 epoch 200 steps meaning 1 epoch is 20 steps?
i think it changes everytime, i dont check steps tbh
again, im not sure, i suffer from severe memory loss :)
yall know how to give villagers guns?
i used to code rvc but now i dont remember anything basically
not the right server for that
oh
mods
Believe it or not, I have the entire repository of Ilaria RVC Hugging Face space downloaded from Hugging Face. 

why
I thought this repo was the only repo of Ilaria RVC available. 
I tried to launch it with run.bat, but it won't launch. 
LMFAO
you need to create a venv, install the dependencies and then python launch the app
Oh wait, I think I have an idea on how to launch this Ilaria RVC repo with the already installed Python instead of the one from C:/Program Files/Python311. 
update us
ilaria rvc is made by me
@covert lake
ugh, i would ban but i'm so lazy to pull off that mods ban guideline thingy
Finally, real.
wow
basically now we should copy and paste official looking reasoning from the pins of specific channel
Inspired by that environment.bat file inside Automatic1111 folder, so this is how I get the virtual environment path to Python done. 
dang thats lame now i understand why reasonings are "not professional"
ilaria rvc zero local is real
Huh what happened

- Search for it in AI HUB Docs or Applio Docs. You will probably find your answer there 📚
- Ask for help in #🔍│help-w-okada if it's related to real time voice changing but make sure to read #1297207135469305866 first
- Ask for help in #✨│ai-help for general help, but use the command
!howtoaskfirst to learn how to structure your question properly and increase your chances of getting a reply - Last but not least, ask for help in #🔍│help-ai-art if it's related to AI images.
try to make it a precompiled 
Lmao. 
Hi guys, im trying to do something in rvc but i dont want to install again the program and im using mimicpc and to use the voice that i want it says that I should paste the link to a voice model to download, does anyone know how to do this? I paste one for example from github and it didnt work
if you want to do it online i suggest ilaria rvc
Table Of Contents Introduction (with website link) Model Loader (Download & Upload) Inference (use RVC AI Voice Models) Ilaria TTS Settings (Inference) Vocal Separator (UVR) Extra Model Fusion Troubleshooting “No GPU is currently available for you after 60 seconds” “Where can I see my ZeroGPU ...
I just got that Ilaria RVC done locally lmao. 
THANK YOU!!!!!! I will try this
no problem!
So not only I downloaded the Hugging Face one, I've downloaded another one from GitHub where everyone here called it the mainline Ilaria RVC. This repo was made to run locally, yet, it has been long outdated.
rest in development hell
So after getting an error at first and then reloading the page it finally worked, Thanks 👍
happy to hear that! no problem and have fun!
Then I run two different Ilaria RVC GUIs at the same time. 
i dont see that ui since forever
is ilaria rvc still have the rmvpe+ for infrence?
normal rmvpe
ah ok ok
I just revived the Ilaria RVC project back. 
resurrecting the deads
If you're wondering how I got those files from Hugging Face even if there's no download the entire repo option to be found, I used huggingface_hub in Python to achieve this. 
is this not ila rvc old wtf\
Yeah, that's why I said the mainline Ilaria RVC repo is currently outdated.
older than you
a
b
c
d
E
f
g
h
guys, any good AI agent that can read e-mails and sort/respond them with trained content ? (i am not very proficient in this field lmao)
the content would be, previous e-mail
@covert lake sorry for ping but i think this is kinda important
HF only takes away the time the inference took, even if the GPU request time is 60 seconds. My inferences takes just 30s
yoo that’s good
yep
🔥
hi
The maximum file length to separate for UVR5 UI on HF seems to be 21-22 min (with model already loaded) That takes 57 - 59 seconds to separate, file size doesn't matter (my test was 250MB)
we aint chat gpt
It was the lvl i think
i
j
How much does RVC take in disk space , I dont remember I think it was like 30gb (?
mainline is 8GB iirc
Are we talking about the same RVC from GitHub? the one with 25k stars?
i think so
Well here we go again 😔
k
I put a movie reference into a project I'm working on.
Output:
Audio shape: (32000,)
Transcribing audio...
Wake word detected. Ready to assist.
Listening...
Listening...
Transcribing audio...
You said: What do you see when you close your eyes?
Stop phrase detected. Exiting.```
Does anyone know how to use Dolby Atmos stems to separate vocals? I searched on YouTube and couldn't find any tutorials.
where i can find a programm to train my own model?
what gpu do you have?
rtx 3060ti 12 vram
use applio
thanks ;>
no problem
Is there a place I can send my AI vídeos to you guys?
have fun :)
DIlly ding, dilly dong! A new RegalHyperus drum model just released!
Such Gossip (Drum model no. 565)
For anyone that thinks AGI is not available to us right now. Please tell me how if we give for example Copilot: Current internet access, a tesla bot body and a yottabyte (YB) of storage how within a few years that would not obtain AGI?
What are some of the road blocks that would inhibit?
Its not that we don't "think", it's that "its literally not possible" 😭
Hi! I have a question: is there any way to separate two voices that are singing together at the same time, like in a small two-person choir? I really wanted to do this for a specific song. I tried using a backing vocals separation model, imagining that one person's voice would be on the lead vocals stem and the other on the backing vocals, but I realized that this only works well when there is a clear separation between the main and backing vocals. When both people are singing together, with almost equal emphasis, I couldn't separate them. Do y'all know if there is any tool or technique that works in this case?
@ancient swan https://www.reddit.com/r/StableDiffusion/s/5dkRwW9WX3
If anyone can help me just reply to this message
guys does anyone know anything about crayo ai
anyone here ever used openrouter? does it have a minimum paymen amount or can i add like a single dollar to it?
openrouter wont tell me and from that im assuming i could pay like a cent
haven't heard that random service
~~3840x2160 with DP2.1 UHBR20 is better than 5120x2880 with DP1.4 lmao https://www.tomshardware.com/monitors/gaming-monitors/acers-predator-x323qx-elevates-16-9-gaming-to-5k-at-144hz~~
yeah, that's why i'm saying that it's better to wait till youtubers do their benchmarks
Yup
you can try uvr5
or uvr ui
it seems the most lazy thing ever
To promote your YouTube, go to #1159290752195633273.
hey guys I am trying my luck with AI influencing but am pretty new to all the AI and Stable diffusion stuff. If anyone has worked around this, can you dm me ? I need some help
Hi. I can help you @jagged juniper
does anyone know how many songs you can make on weights.gg for free or is it unlimited?
Does anyone have a collection of celebrity/character TTS weights for sale or anywhere to download them? Not voice to voice but Text to voice
when it comes to training RVC models do i want to use the Mangio-RVC fork or some other software?
Applio, unless you really want an outdated and unsupported Mangio fork
C
I hate you
@polar flax sorry for pinging, but does the male/female separation model only apply to duets where one voice is male and the other is female? Or can it also be applied to a duet where both voices are female or both voices are male?
@radiant canyon #1159290752195633273 please
To get some help about using Stable Diffusion, you can go to #🔍│help-ai-art
phew i took the 3 good usernames
yea dont use inappropriate names
-rvc
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
guys where to put epochs in voice changer?
same place you put 87 gas in a car engine

but actually where?
epoch is a property of a model.. how long it was trained
voice changer does not give a f
it is for you to decide whether the model is good to use or not
"300 epochs" means nothing without knowing how big the dataset was. 300 epoch on 10 hour set is crazy high, 300 epoch on 20 sec set is polishing a turd
did any changes happen for AMD gpu users?
or should i still use the Online Realtime one
last time i tried to do realtime ai voice changer it went horrible xd or not that great
chat, wtf is huggingface ?
The Hugging Face is a website service that stores code repositories similar to GitHub.
got it, tks
facehugger
its been so long since ive seen a real queue in weights.gg
what happend? did some of the servers get put offline?
Lol me too, today i created a model, and later show me this same queques for the another ai covers, i though was my fault for create an model before, but now i can appreciate that i not the only with the same problem
Ohh just in case Weight was remove his shop option and the option for the vocal remover, they will come back?
hi
chat, do you know models that are able to generate product photos ? like levels photoai, but focused in products ?
chatgpt LOL
it works with existing products ? cause everytime i give it something it hallucinates and return some variation of what i gave
Hlo
Uhh
Hmm
@hidden grotto
:wave: @safe flume, How can I help?
Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image
alguien tiene la IA de cosmic kid?
@hidden grotto
:wave: @trim pecan, How can I help?
Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image
gracias, ty
Stable Diffusion
I made the villager sing "fly me to the moon" 😭
hey, just to be clear, okada is the real time voice changer and rvc is speech to speech? sorry if this is dumb im new to all of this
@hidden grotto
:wave: @sharp lotus, How can I help?
Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image
so i want do AI Cover again and then i found UVR Roformer, what reccomend to use for separated vocal?
UVR 5.6 i use MDX stuff
RVC also has realtime VC, but RVC just like have both VC & Inference (not Applio)
so i need to find a rvc app where i can locally transfer a mp3 file of my orignial voiceover to the new v/o?
Last update: Dec 24, 2024
you can also consider:
- vocals: unwa's beta 5e or becruily's vocals
- inst: v1/v1e or becruily's
would someone here help me de reverb something?
Becruily's models needs the updated bs/mel roformer scripts?
nothing different from other roformer models
is there any model that can split songs that have multiple singers now in UVR? like 2 lead vocals?
Hello again, i tried installing roformer model to UVR but there is no Roformer Model check button to click when choosing the model param.
I have installed Anjok's https://github.com/Anjok07/ultimatevocalremovergui/tree/v5.6.0_roformer_add%2Bdirectml?tab=readme-ov-file
I have tried reinstalling too @.@
i tried installing the patch and it resulted in an error, cannot import name rename_privateuse1_backend
ill try fully installing
hlo my name is madhav i want to lern about ai things can you help me please
I want to learn inserts a broad topic without specifying details can you help me please
Wdym by ai things 💀
hello, i have problem with Anvuew mel dereverb v2 where it gives an error that says
RuntimeError: "The size of tensor a (352768) must match the size of tensor b (352800) at non-singleton dimension 1"
and where can i find Mel roformer karaoke model?
someone covers songs from a live song to a sunoAI cover
so im stuck with the file for the voice changer
Stuck in what way?
You gotta provide more details
In case you're not using it, you should definitely go for the fork;
https://rentry.co/ForkVoiceChangerGuide
The website has all the information on how to use it and so on.
Guide style is in the same as Blanc_dot's. Thanks Blanc_dot for corrections. Most technical information comes from deiteris.
Last update December 12: NEW UPDATE VERSION b2332
Translations added for:
German: https://rentry.co/ForkVoiceChangerGuide_de
Turkish: https://rentry.co/ForkVoiceChangerGuid...
Other than that, you should check #🔍│help-w-okada if it's about real-time voice changer support.
DIlly ding, dilly dong! A new RegalHyperus drum model just released!
Skyfall (Drum model no. 566)
Minecraft Fandom over Minecrart Wiki moment. 
does anyone know how to turn long form scripts into an AI voiceover?
Maybe break down the parts of the script and then use aj on it
i never used ai btw in development so idk
You meant text to speech (TTS) or some kind of AI program that generates AI voice?
i usually use elevenlabs but the text is just so long so its not feasible to break down to parts or anything like that - so im looking for ways or softwares that can make this work.
Just need my script to be readout by an AI voice
c'mon that's not so hard to break it down into each voiceline, and there's nothing better than doing it manually
its 3 hr script 🙂
Do u know about that update on bs/mel roformer scripts? What's it for?
perhaps compability to the phantom center model or something
I see.
if you're talking about .py files then there are some new features to it, but so far no models that are trained with them yet
Oh, I thought new models needed to update those files to work
Hey guys, nice to meet you! My name is Ruan, and I'm from Brazil. I'm not a programmer, but I'd like to create a female voice to be my virtual assistant on WhatsApp. Can you help me figure out how to get a voice that speaks Portuguese without sounding like an AI?
anyone sunoai ?
hey, anyone in here wants to connect in a call this evening? I'm a full time marketer looking to get into coding (nextjs/ts)
Hello
Hii
@opal marsh
thatrandomdude.exe, My prefix is g.
I can't read messages here
wat
?
What wrong with this code
from datetime import datetime
class Task:
def init(self, title, description, due_date):
self.title = title
self.description = description
self.due_date = datetime.strptime(due_date, '%Y-%m-%d')
self.completed = False
def mark_completed(self):
self.completed = True
def __str__(self):
status = "Completed" if self.completed else "Pending"
return f"Task: {self.title} | Due: {self.due_date.date()} | Status: {status}"
class TaskTracker:
def init(self):
self.tasks = []
def add_task(self, task):
self.tasks.append(task)
def remove_task(self, task_title):
for task in self.tasks:
if task.title == task_title:
self.tasks.remove(task)
break
def get_pending_tasks(self):
return [task for task in self.tasks if not task.completed]
def get_overdue_tasks(self):
today = datetime.today()
# Deliberate mistake: Should be `task.due_date < today`
return [task for task in self.tasks if task.due_date > today and not task.completed]
def display_tasks(self):
if not self.tasks:
print("No tasks available.")
else:
for task in self.tasks:
print(task)
Example usage
if name == "main":
tracker = TaskTracker()
# Adding tasks
tracker.add_task(Task("Finish Project", "Complete the pending project module", "2025-01-15"))
tracker.add_task(Task("Team Meeting", "Discuss project updates with the team", "2025-01-10"))
tracker.add_task(Task("Submit Report", "Send the project report to the manager", "2025-01-05"))
print("All Tasks:")
tracker.display_tasks()
print("\nPending Tasks:")
for task in tracker.get_pending_tasks():
print(task)
print("\nOverdue Tasks:")
for task in tracker.get_overdue_tasks():
print(task)
ask chatgpt, sheesh
is weights.gg a real time or just a generator?
chat legit deader than my dog
no grammar
ahem
my bad
Weights.gg uses RVC audio conversion. It is not a real-time voice changer service.
not yet


any voicechanger suggestions?
Fork W-Okada.
i dont rly understand where to go directly tho once im on the site i usually dont use github
Uh.
-realtime
Interaction has expired, use the command again for a new interaction.
First link is fork W-Okada. This version runs better than the second link.
oh ok where do i download
What? Download links and recommended settings are all said on the guide.
If you have any problem about W-Okada, you can go to #🔍│help-w-okada.
oh ok ty for your time!
somebody make me a blair waldorf voice model and ill paypal u
You just gotta commission someone #1191429836321849435
You can #1159289738314919936 here.
#1159289738314919936 put a request here and make sure to get the model master to work on it
okay, thanks everyone
How can i use these models for text to speech locally?
they're not text to speech models
but applio has built in edge tts
Can i use gpt sovits model in this
no
gpt sovits is it's own tts
how to use it for tts tho
DIlly ding, dilly dong! A new RegalHyperus drum model just released!
TV Off (Drum model no. 567)
does anyone know a decently fast tts model that sounds decently realistic that i can use with ollama or smtg
how do i use the voice models
W-Okada the realtime voice changer or RVC the audio conversion program?
so i downloaded the model
but unusre how to use it from there
Hi
Realtime voice changer or the audio conversion program?
uhh
Answer one.
if i need to talk to have it converted that's fine aswell
Applio can do TTS.
We need something like the photo model on weights.gg, where you can upload photos to get an exact likeness, but for music. You’d upload a singer’s songs, and the model would create new tracks that sound exactly like their style. Imagine hearing brand-new music from artists who’ve been gone for years. It’s just an idea, and there might be copyright stuff to figure out, but it would be incredible!
that would create copyright issues
There are different Text To Speech (TTS) AIs:
GPT So Vits: RVC isn't as good as GPT So Vits for tts, but gpt so vits (few shot tts, which means needs just a lil training for models) can't use rvc models (and viceversa), and its only limited to: english, chinese & japanese, if you wanna check gpt so vits instead, read https://docs.ai-hub.wtf/tts/gpt-sovits/
Freemium 11labs: An easy way to do TTS is https://elevenlabs.io/, you can't use RVC model on this but its a mostly premium easy way for good quality TTS
FishSpeech: FishSpeech is a 0 shot (no explicit training needed) TTS, if you got a good pc you can use it locally else use their site
With RVC Models:
RVC is natively for Speech To Speech, but forks such as ilaria rvc mainline & applio have built in tts (using Microsoft Edge TTS to make a generated tts audio, which i suggest you to choose a tts model that is the same gender and language of the rvc model you wanna use, and then convert it with rvc)
If you wanna do tts locally with RVC Voice Models (if you got a good pc):
If you don't got a good pc you can do tts with RVC Voice Models on cloud:
-
Ilaria RVC Zero (Running on A100 GPU, free fasted rvc on cloud) and the guide
-
Use Applio UI Colab (with google colab T4 free daily limit gpu)
-
if you don't wanna use edge tts, you could try another tts ai from our tts index and use the output as an input in rvc
this could help 
well, maybe could try fish speech or F5
you can also check the index above ^^
There's also PiperTTS which is fast and lightweight, but wasn't added to the index since it's not really good quality
weights.gg uses RVC for inference (use models) on pre-recorded audios and Training (make) Models, so no realtime yet
are you looking for realtime?
I could help you setup realtime if u tell me your PC GPU
Hiii I’m looking for the best option to voicechange pre-recorded audio
im very new to this
but would be so cool to be able to sing something with my own voice and then just change the voice to any voice model
and is there a way to do it directly in Ableton Live? like as a VST
Nope, that's not possible.
But in fact this is possible with RVC.
Lemme hand you the docs.
Okay thankk you so much
A question, what's your GPU?
also would love to know where is the best place to find models for it
are they called RVC models?
Yep.
You can use the #1175430844685484042 channel.
what
aint it a ai pic generator?
Last update: Oct 21, 2024
There you have the guides. You got either the option of installing RVC locally (if you got a nice GPU) or just using online colab/kaggle versions.
okay thank you
Welp, i guess you can try installing RVC locally.
Either install mainline or applio.
RVC forks. Welp, the og version and a alt version with various extra features.
Yes.
amazing
Thanks!
is there a way to search somewhere for specific types of voice models?
like search keywords
like “cute”
female vocal
Umm.. nop.
Welp you can try.
If there isn't a model of the voice you're looking for, you can just request it making a post on the #1159289738314919936 or commission it to any model master on the #1191429836321849435
Thank you! This helps alot
hi
and is there a way to do all these things when im only on my iPhone?
no
Oki
I don't think so..
what is this and what do I click
I have never seen this before
anyone can help with the rvc on how to use on discord?\
Yo tb quiero saber esto
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
si es muy complicado, yo iria directamente a weights.gg a hacer el modelo, es gratuito
pero el modelo que necesito es pa cantar
con mi voz
no para hablar
alguien sabe?
le pago al que sepa
las guias y la opcion en weights es justamente para eso

Spanish chat 
@elder willow @regal gate revisen las guias que les pasaron
Allí explica los pasos para hacerlo local o en la nube
ok gracias
Es mejor Applio o RVC ahora mismo, caballo gordo
?
Applio ig
Si tienes alguna duda igual me dices
guessing isnt enough, i need my rvc cover to be clean enough to caress and lick
does anyone know where to find really high quality rvc voice's i just need the .pth files, i've tried hugging face rvcmodels and this discord but i cant find one like david attenborough if anyone were to try to use a similar voice to david attenboroguh or any of the famous voice speakers for documentary's how would they go about it without obviously using eleven labs.
Currently i have my voice but i couldnt find any good voices with australian accents to combine with mine
That's too long to read. But I think you wanna like find the best voice model here? 
sort off by voice model do you mean like mangio, illaria, or voice specifically im looking for a particular voice that is just high quality but yes i guess if there is a good voice model i could use that instead
I don't know any generic voice model that sounds better for your needs. Voice models here are full of fictional and famous people, all of them are fanmade.
oh right yeh well i was trying to find one that was high quality in the voice models section but a lot of them were bad and had glitches and bg audio i thought there would be like a area that has more high quality but if not i guess i can just keep looking to hopefully find a good qualtiy one
what is the best ai for video. is there any that do it locally through some kind of software? i just feel like they all cost too much
If you want local it can get expensive as fuck
But there are free sites that do video generation but you have a very long queue and mostly only limited per day
it is too demanding for consumer gpus, RTX 3090/4090 may be bare minimum to do with reduced settings but workstation gpus over 24 GB are recommended
so better option is to rent some A100/H100, or yea even video gen services have long queue cuz too many ppl using it
Some NVIDIA (Quadro) RTX GPU models have more VRAM than GeForce GPUs, can be used for high-performace AI tasks, but they usually sold more expensive. However, NVIDIA A100 and H100 are even more expensive compared to Quadro RTX GPUs. These GPUs are too over your budget, so it would be better if you have GeForce RTX 4090 or wait for RTX 5090 to generate video locally.
Not real time i just want to do tts, not Audio to audio
also why isnt there an open source of elevenlabs, eleven labs is around for more than a year, still we have the shitty turtle tts
i did mochi through comfyui. pretty good but takes a long time to process
U could have specified that since Generator is vague
Turtle tts? Tortoise is old ASF and already has better forks
There are different Text To Speech (TTS) AIs:
GPT So Vits: RVC isn't as good as GPT So Vits for tts, but gpt so vits (few shot tts, which means needs just a lil training for models) can't use rvc models (and viceversa), and its only limited to: english, chinese & japanese, if you wanna check gpt so vits instead, read https://docs.ai-hub.wtf/tts/gpt-sovits/
Freemium 11labs: An easy way to do TTS is https://elevenlabs.io/, you can't use RVC model on this but its a mostly premium easy way for good quality TTS
FishSpeech: FishSpeech is a 0 shot (no explicit training needed) TTS, if you got a good pc you can use it locally else use their site
With RVC Models:
RVC is natively for Speech To Speech, but forks such as ilaria rvc mainline & applio have built in tts (using Microsoft Edge TTS to make a generated tts audio, which i suggest you to choose a tts model that is the same gender and language of the rvc model you wanna use, and then convert it with rvc)
If you wanna do tts locally with RVC Voice Models (if you got a good pc):
If you don't got a good pc you can do tts with RVC Voice Models on cloud:
-
Ilaria RVC Zero (Running on A100 GPU, free fasted rvc on cloud) and the guide
-
Use Applio UI Colab (with google colab T4 free daily limit gpu)
-
if you don't wanna use edge tts, you could try another tts ai from our tts index and use the output as an input in rvc
why can help me in promlem RVC
Can you elaborate in detail?
I say the program does not output sound
If you have any problem using W-Okada, go to #🔍│help-w-okada. I don't think RVC would do "output sound" like that.
hey there, i'm trying to build a text-to-song generator, but i'm confused about where to start. should i focus on implementing research papers like musicgen, or should i use apis from services like suno or elevenlabs? one problem i've noticed with udio and suno is that they don't offer official apis, and the unofficial ones aren't very reliable. also, please keep in mind that my pc's specs are quite poor and it doesn't even have a gpu. it would be really helpful if anyone could offer some suggestions or guidance.
why can help me in promlem RVC
Is index rate in weights 0?
It says when uploading a model that the index file is optional
Or does it use a different index rate when the index file is given?
index file is optional and missing index is the same as setting index rate to 0
also you can use any index with any model for extra fun
Yeah but what if I upload my index file
go ahead
There's no index rate setting
probably default 0.75
Hi
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
@covert lake https://www.youtube.com/watch?v=_qQwSVzYNpA have you tried veo 2?
❤️ Check out Lambda here and sign up for their GPU Cloud: https://lambdalabs.com/papers
Try Veo2 here (Notes: likely USA only so far and there may be a waitlist):
https://deepmind.google/technologies/veo/veo-2/
📝 My paper on simulations that look almost like reality is available for free here:
https://rdcu.be/cWPfD
Or this is the orig. Natur...
Nope I didn't
Oh veo 2
I was looking at smt to run locally tbh
Кто русский помогите с проблемой
can i pay someone to do it?
is that possible
yes u can do a paid request or check one of the model masters shop
and how ik who trusted?
or are they all?
model masters, they did an application to be able to get paid commissions
okay thanks
yw
guys
is RTX 4070Ti enough to train models?
i mean small models
or fine tuning with my dataset?
yeah dw
DIlly ding, dilly dong! A new RegalHyperus drum model just released!
Walking the Wire (Drum model no. 568)
DIlly ding, dilly dong! A new RegalHyperus drum model just released!
Dancing on the Moon (Drum model no. 569)
Taking free model requests, if you have a dataset ready already DM
( human vocals only no instruments)
where i can get the models
im trying to make my own model and it keeps saying this zip needs a .pth file
what is that and how do i get that
You can search rvc ai voice models at:
- #1175430844685484042
- In #🔍│find-models , Do /find with @hidden grotto
- https://weights.gg/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.ai-hub.wtf/essentials/how-to-make-voice-models/
:wave: @covert lake, How can I help?
Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image
what's ur pc gpu
Your Nvidia GPU is good enough to do inference (use models) locally (on ur pc), not the best to train (make models) even if still possible
You can:
- Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides):
- Cloud (remote good pc, easier and faster than ur PC but it's limited):
- Ilaria RVC Zero: fastest and simplest that you can get for free
- Weights.gg: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio Colab: max 4 hours daily, not granted, of GPU
Easiest possible (automatically separates vocals & instrumentals) : weights.gg
easiest cloud: Ilaria rvc zero
easiest local: Applio
U could do it locally but small models, idk how much to suggest that
Train (make) RVC Models on cloud:
- Prepare the Dataset
- Setup RVC:
Choose a cloud way to use RVC,
- Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
- Applio (ui)
- Mainline (UI)
- RVCDISCONNECTED (no ui)
- Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
- Mainline (UI)
- Applio by Vidal (UI, no guide as of right now)
- Applio by Shirou (UI, no guide as of right now)
- Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
- Be sure to know about the tensorboard
Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time
If you are looking for the easiest way and for free, is using https://weights.gg which ofc uses RVC
RVC Inference (use models) on pre-recorded audio on Cloud
You can use either:
- Weights.gg: Easiest Possible Ever Automatic
- Ilaria RVC Zero: Fastest free on cloud
- Applio (ui)
I would suggest kaggle training
Do anyone here know how to make the lo fi text to speech that’s been used a lot in the gungame servers on counter strike source ?
just ur usual TTS with low pass filter as post processing
but do u know which voice im talking about?
bc yes I know I can put effects on and make smth similar, but I’m looking for that specific tts
bc I’m almost sure that they just used a tts where u can choose that specific robotic voice
try clone it using zero shot tts like fishspeech 1.4, F5, etc
Thing is I cannot find it now
but i think it says like “welcome to gungame server blablablab”
and also “someone is on knife level”
that’s just what I remember
crazy nobody actually dmed me
Taking free model requests, if you have a dataset ready already DM
( human vocals only no instruments)
( i think nobody who wants a free model has a dataset ready, lol )
how do i fix delay
if your talking about w-okada, you cant fix the delay, you an reduce chunk which reduces the delay but the voice quality at the same time
hey there, i'm trying to build a text-to-song generator, but i'm confused about where to start. should i focus on implementing research papers like musicgen, or should i use apis from services like suno or elevenlabs? one problem i've noticed with udio and suno is that they don't offer official apis, and the unofficial ones aren't very reliable. also, please keep in mind that my pc's specs are quite poor and it doesn't even have a gpu. it would be really helpful if anyone could offer some suggestions or guidance.
guys what's the best voice separator to use?
i recommend ultimate vocal remover
does it use a lot of gpu?
tell me what tutorial link and pc gpu u have in #🔍│help-w-okada
what's ur pc gpu
There are different Text To Speech (TTS) AIs:
GPT So Vits: RVC isn't as good as GPT So Vits for tts, but gpt so vits (few shot tts, which means needs just a lil training for models) can't use rvc models (and viceversa), and its only limited to: english, chinese & japanese, if you wanna check gpt so vits instead, read https://docs.ai-hub.wtf/tts/gpt-sovits/
Freemium 11labs: An easy way to do TTS is https://elevenlabs.io/, you can't use RVC model on this but its a mostly premium easy way for good quality TTS
FishSpeech: FishSpeech is a 0 shot (no explicit training needed) TTS, if you got a good pc you can use it locally else use their site
With RVC Models:
RVC is natively for Speech To Speech, but forks such as ilaria rvc mainline & applio have built in tts (using Microsoft Edge TTS to make a generated tts audio, which i suggest you to choose a tts model that is the same gender and language of the rvc model you wanna use, and then convert it with rvc)
If you wanna do tts locally with RVC Voice Models (if you got a good pc):
If you don't got a good pc you can do tts with RVC Voice Models on cloud:
-
Ilaria RVC Zero (Running on A100 GPU, free fasted rvc on cloud) and the guide
-
Use Applio UI Colab (with google colab T4 free daily limit gpu)
-
if you don't wanna use edge tts, you could try another tts ai from our tts index and use the output as an input in rvc
the most robotic one would be google translator ig
i use a laptop..
u could also try adding effects, not sure
UVR5, RVC and AI tasks benefit from GPU for their faster processing speed and such.
You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
hopefully it has gpu 0 and gpu 1
meaning it doesn't have only integrated gpu but also dedicated
HD Graphics 520
Only that?
yea dude it's like a 9 years old computer
ur laptop could run it on cpu locally, but it would take many many hours if it even runs man
8-9 years old
i'm 14 you think i can easily ask my dad to change my gpu 😭
It would be better to buy a laptop that was made four years ago from now.
What the fuck even is that age?
The fuck are you talking about
Hell, the fuck is your PFP?
Screw it, the fu-
alright got your point
wdym by that tho you tryna groom me or smth🤨
Unlike desktop, not all laptops are upgradable.
CYou're just looking to separate 2 voices in a song right?
Multi-voices
I don't groom a child. I like women better.
alr no need to say that man
I wanna separate voices like in RnB
rnb songs have a lot of voices overlapping so
You wanna seperate some harmonies chorus going on an audio, right?
There are some UVR5 models that can do that, but I don't remember the name. 
You could try using Cloud UVR https://docs.ai-hub.wtf/rvc/resources/dataset-isolation/#colab and using the 6_HP-Karaoke model
Last update: Dec 24, 2024
basicaly, separate the vocals and instrumentals, then use the vocals as input, use that model and you should be able to seaparate the voices
Mel Karaoke could work too
This should be pinned
There is an updated one
Found it
I'll replace it in a bit
and also this https://docs.google.com/spreadsheets/d/1pPEJpu4tZjTkjPh_F5YjtIyHq8v0SxLnBydfUBUNlbI/edit?usp=sharing
Okke
I am looking for good software that let's me accurately transcribe video files to text. Any good/free suggestions?
Thank you!
Is there any good free AI video generator
text/image to video AIs:
- Locally (runs on ur pc):
- pyramid flow (Image/Text to Video)
- cogvideox 1.5 5b: Image to Video, Text to Video
- Cloud (remote good pc, running on an online website for example, easier to setup):
- Weights.gg (paid only)
- pyramid flow (Image/Text to Video) (HuggingFace Space)
- OpenAI Sora (paid only, in some countries)
- lumalabs
- Hailoua AI
Do you know of any good free video to text transcribers that are free?
like, subtitles?
whisper
though extracting the voices might give better results
https://huggingface.co/spaces/Nick088/Fast-Subtitle-Maker could help too, it uses whisper, however it's made for subtitles rather than normal txt output
can you do this on a macbook
do what
what do you mean by "extracting voice" would work better?
Nope. I need it to transcribe audio from video files to text
i know my windows pc is super bad and my mac has a m2 chip
i want to try make a rvc model of myself so i can do text to speech but its hard
what program can run LLM models that has text to ai
what
what do you mean by text to ai?
ai voice. i want to take a model from #1175430844685484042 and type out a prompt
and have it speak whatever i type
it's voice to voice not text to speech
wait tf did the tos change
Nah
you used to get a strike before rather than just a warning
it was.
it isn't a strike
mk, do you know one then?
I used polish singers yet.
elevenlabs but it's paid
I never did western songs.
one thats locally run
What the heck happen?
fishspeech maybe, but i haven't used it
you could check out whisper, maybe u can run it locally https://github.com/jhj0517/Whisper-WebUI?tab=readme-ov-file
u can prob find a lot of web uis of whisper
could you tell me your windows pc gpu just to check?
because mac isn't the best for rvc training either
I don't think I ever saw someone sucessfully train a model on mac since the speed is similar to CPU
Did anyone have UMG strike on youtube?
Yes charmander!
btw RVC is Speech To Speech natively
even if it can be used for TTS
yes basically everyone
AI Covers published on youtube without permission nor a license, can get you strikes and get your account terminated. Do it at your own risk
#aicover #aicoversongs #aicovers #ai #artificialintelligence
I got 2 strikes
which is why I deleted all my covers
also, no privating/unlisting won't work
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- FaceFusion UI, by Nick088 Google Colab
- FaceFusion NO UI, by Nick088 Google Colab
- EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
curreent colab link of what
RVC Disconnected V2
what u tryna do? and could u tell me ur pc gpu first to see if it's powerful enough?
Intel i7
that's a CPU
yeah
google colab is a cloud computing service only for people with a bad pc btw, I'm asking your GPU since some people use colab because they don't even know their GPU
You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
CPU = Central Processing Unit
GPU = Graphics Processing Unit
nvm it's a bad pc
Train (make) RVC Models on cloud:
- Prepare the Dataset
- Setup RVC:
Choose a cloud way to use RVC,
- Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
- Applio (ui)
- Mainline (UI)
- RVCDISCONNECTED (no ui)
- Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
- Mainline (UI)
- Applio by Vidal (UI, no guide as of right now)
- Applio by Shirou (UI, no guide as of right now)
- Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
- Be sure to know about the tensorboard
Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time
If you are looking for the easiest way and for free, is using https://weights.gg which ofc uses RVC
RVC Inference (use models) on pre-recorded audio on Cloud
You can use either:
- Weights.gg: Easiest Possible Ever Automatic
- Ilaria RVC Zero: Fastest free on cloud
- Applio (ui)
I also linked RVC Disconnected ^
And this is why i never did sanah AI Covers!
It's like I wanted my computer to be bad. Anyway
I loved to being sobbed.
u can't do much about it
I had to check it, there were some people with a 2k pc literally using colab 😭
people need to always check their gpu first before using cloud to see if it's powerful enough
btw u could also upgrade to a good desktop pc if u want
What? She Joined UMG?
Great thank you! I will use gpt sovits as the model sounds much better than rvc
You guys code?
I do a little bit😅
Same. JavaScript
Yes javascript
idk her
Hello, I'm trying to create a model file, but when I run the RVC v2 Disconnected Google Colab that I usually use, the training ends without the epochs running. Even though I set the epochs to 1000, the training stops after only 35 seconds. I thought the Colab might be blocked, so I tried using a Colab from a YouTube video, but the training also ends without running any epochs in that Colab as well. What should I do in this case?
The Colabs I used are RVC v2 Disconnected and Bahaa AI. Both of them end without running any epochs.
Hola1
Hi, do anyone here know some of the early stage tts used in like around 2008
simple free ones
did u try checking ur pc gpu first if it's good enough
that let you upload videos and turn it into text?
I think u have to convert the video to an audio
Heck, and I rarely even publish AI covers!
I almost lost my channel
yeah, got 1 strike active, 2 total
i feel ya
I haven't gotten the orange card... yet
Any good software to do that?
U can just google "MP4 to FLAC" or "mp4 to MP3" and find hundreds of sites who convert file types
@covert lake It will sound stupid, but will it change the situation if the Voice is from the video game? :3
-# I think your answer will be no 💀
Huh what situation
ai covers on yt
!
thing is, my first strike expired a long time ago
I'm not sure
What mostly is the problem is the instrumentals
If the melody only resembles the original And the lyrics and the melody are not 100% similar to the original? 😮
is there a program version instead of a website for nvidia-b2332?
Hello good evening. I have a question. What program do you recommend using to play voices? I currently use voice ai but it's not very good.
nvm you dont have to do it if not sure
you still have to buy mechanical license of the music itself depending on its copyright holder, same thing goes for remixes
:3
guys can anyone help me how to download voice changer client and use it
hi everyone
anyone know how to make consistent face while generating image?
i tried using seed number but its still random
using an image prompt
i think thats everyone do
ouch such a silly question how i use this? i usually using leonardo ai 
Question: is there anyway to make dubbing from one language to another with ai? like, english audio + voice model = audio in another language.
Going to bed rn, good night y'all
hmmm
.
is there any good website to train AI voice model
what's ur pc gpu first
You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
What is undressing AI? You mean like some AI image generator tool that makes a character undressing itself?

bunch of school kids are getting arrested every so often for 'nudifying' their classmates, very naughty
better ignore those perverts
DIlly ding, dilly dong! A new RegalHyperus drum model just released!
That Is How I Roll! (Drum model no. 570)
Huh
just woke up, got back for some answers, got disappointed
I thought about gpt-sovits, but it's basically just for english and chinese
elevelabs has dubbing, merlin clone as well
any accurate RVCs to make cover songs?
DIlly ding, dilly dong! A new RegalHyperus drum model just released!
Dancing in the Flames on the Moon (Hybrid) (Drum model no. 571)
I was looking for something local and that had support for rvc models, but thanks anyways
I dont think there's anything close for local. The products I mentioned are big $$$
hey
https://www.youtube.com/watch?v=A_kLk-bEKSA
is it possible to do it with other programs like coqui and whisper? if so, how?
In this video we dive into real time speech to speech translation, speaking in one language, and having your own voice speak in a different language!
Resources -
Github Code: https://github.com/ALucek/speech2speech-translation
AssemblyAI Documentation: https://www.assemblyai.com/docs/guides/real-time-streaming-transcription
Elevenlabs Document...
thats crazy
wtf
yay
Is there a tool to batch convert multiple files with a RVC Model of choice??
Dosnt there exists a Astorian Voice Mod/Filter?
You can search rvc ai voice models at:
- #1175430844685484042
- In #🔍│find-models , Do /find with @hidden grotto
- https://weights.gg/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.ai-hub.wtf/essentials/how-to-make-voice-models/
:wave: @covert lake, How can I help?
Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image
AI HUB Docs
