next shard Mar 6, 2025, 3:52 PM

#

okay thanks let me check

lavish heath Mar 6, 2025, 4:27 PM

#

where i can find the program to use it?

covert lake Mar 6, 2025, 4:38 PM

#

lavish heath where i can find the program to use it?

elaborate:

your pc gpu
which program? there's A LOT of AI programs, in various fields, and various types depending also on your pc gpu
what do you want to do specifically

next shard Mar 6, 2025, 4:41 PM

#

okay for an RVC model, which is the best way to train a model in hebrew? weights doesn't support hebrew from what i see so i don't know if i should train using it

#

i have about 14 minutes of audio of a voice in hebrew that i want to train on

covert lake Mar 6, 2025, 4:44 PM

#

next shard okay for an RVC model, which is the best way to train a model in hebrew? weights doesn't...

ehh RVC is Speech To Speech natively rather than Text To Speech, you could give it a try and train on a hebrew dataset via our docs https://docs.aihub.gg/ but i'm not sure how good it will be

next shard Mar 6, 2025, 4:45 PM

#

covert lake ehh RVC is Speech To Speech natively rather than Text To Speech, you could give ...

okay so how do i train, say, voice cloning for text to speech? do you possibly know? i've been trying for a few months to find a solution and i am getting tired
i even tried paying someone on fiverr but i couldn't find anyone knowledgeable

pine acornBOT Mar 6, 2025, 4:45 PM

#

Staff Applications Open

We're looking for dedicated team members to help grow and manage our community! If you're passionate about AI and want to contribute, apply now!

Click here to apply!

next shard Mar 6, 2025, 4:48 PM

#

trying Ilaria RVC but i can't use tts with the uploaded model damn

covert lake Mar 6, 2025, 4:49 PM

#

next shard okay so how do i train say voice cloning for text to speech? do you know possibl...

to train RVC models, it's all written in the docs

about TTS, you could check the TTS index, you could maybe try XTTS2, fish speech or f5 tts

#

I wouldn't expect those to support it much tho

#

it's not as widely supported a language as english

next shard Mar 6, 2025, 4:50 PM

#

fish speech i am on their site but they don't let me upload the audio. i uploaded sound clips of the voice and it says the limit is 32mb, but none of the files exceeded it

covert lake Mar 6, 2025, 4:50 PM

#

next shard fish speech i am on their site but they don't let me upload the audio i uploaded...

if you got a good pc you can do it locally

next shard Mar 6, 2025, 4:50 PM

#

i got rx 7900 xtx

#

i know most love nvidia but idk

#

will that work? got 24gb vram

covert lake Mar 6, 2025, 4:51 PM

#

unfortunately non nvidia gpus kinda suck on AI support bc of CUDA

your best bet would be to either check if they have their own amd guides, or patch it yourself with Zluda on windows

next shard Mar 6, 2025, 4:51 PM

#

really gotta get a tts with cloned voice or something in hebrew also appreciate your help brother

next shard Mar 6, 2025, 4:51 PM

#

covert lake unfortunately non nvidia gpus kinda suck on AI support bc of CUDA your best bet...

#

should have bought an rtx ffs

#

nothing on the cloud that can do it then?

covert lake Mar 6, 2025, 4:52 PM

#

next shard should have bought an rtx ffs

AMD is deffo cheaper and has good price to performance for gaming, but its AI support is not as widespread or as good as nvidia's. Zluda is basically a translation layer that lets CUDA software run on amd

#

and there's also rocm + linux but idk much about those since I don't have AMD

covert lake Mar 6, 2025, 4:54 PM

#

next shard nothing on the cloud that can do it then?

nvm, I just checked the fish speech github repo and it doesn't list hebrew

#

lemme check other ones too

#

GPT-SoVITS doesn't support it either

polar flax Mar 6, 2025, 4:56 PM

#

next shard will that work? got 24gb vram

use applio with zluda, or more ideally with rocm under linux (radeon cards work better on linux distros tho)
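For anyone testing a ROCm or ZLUDA setup, here is a minimal sketch for checking whether PyTorch can actually see the card. It assumes the tool in question runs on PyTorch; `describe_torch_backend` is a made-up helper name, not part of Applio or any other project. ROCm builds of PyTorch answer through the same `torch.cuda` calls as CUDA builds and additionally set `torch.version.hip`.

```python
def describe_torch_backend():
    """Report whether PyTorch is installed and which GPU backend it sees."""
    try:
        import torch  # optional dependency; not installed on every machine
    except ImportError:
        return {"torch_installed": False, "gpu_available": False,
                "backend": None, "device": None}

    available = bool(torch.cuda.is_available())
    # torch.version.hip is a version string on ROCm builds, None otherwise.
    hip_version = getattr(torch.version, "hip", None)
    backend = None
    if available:
        backend = "rocm" if hip_version else "cuda"
    device = torch.cuda.get_device_name(0) if available else None
    return {"torch_installed": True, "gpu_available": available,
            "backend": backend, "device": device}


if __name__ == "__main__":
    print(describe_torch_backend())
```

If `gpu_available` comes back `False` on an AMD card, the installed PyTorch build most likely was not built for ROCm (or ZLUDA is not being picked up), and training will fall back to CPU.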

covert lake Mar 6, 2025, 4:56 PM

#

F5 doesn't support it either

#

@next shard Edge TTS supports it but it's only 2 models and you can't make custom models and runs only on cloud

polar flax Mar 6, 2025, 4:57 PM

#

next shard should have bought an rtx ffs

50 series cards are disastrous, 40 series ones are better

chilly lake Mar 6, 2025, 4:58 PM

#

fairseq MMS-TTS can do 1100 languages, but it's not expressive

covert lake Mar 6, 2025, 4:58 PM

#

XTTS2 doesn't support hebrew either

#

Zonos doesn't support it either

#

Kokoro TTS neither

next shard Mar 6, 2025, 5:01 PM

#

covert lake @next shard Edge TTS supports it but it's only 2 models and you can't ...

this sounds like robots

#

thats the issue

#

i am trying to clone a voice that speaks the language natively

#

then i will be able to do tts for videos with at least some emotions and native speaking i know wont be perfect but doesn't have to be

covert lake Mar 6, 2025, 5:02 PM

#

next shard i am trying to clone a voice that speaks the language natively

Seems impossible to find anything in hebrew, let alone anything expressive

polar flax Mar 6, 2025, 5:02 PM

#

next shard i am trying to clone a voice that speaks the language natively

do you prefer the native language support to the quality?

covert lake Mar 6, 2025, 5:02 PM

#

OpenVoice2 doesn't support it either

next shard Mar 6, 2025, 5:02 PM

#

polar flax do you prefer the native language support to the quality?

wdym

next shard Mar 6, 2025, 5:03 PM

#

covert lake OpenVoice2 doesn't support it either

the only one that was decent almost which i was about to pay 99$ to try their higher quality one is play.ht

#

it did a good job it was fluent and all

covert lake Mar 6, 2025, 5:03 PM

#

MeloTTS & PiperTTs don't support it either

covert lake Mar 6, 2025, 5:04 PM

#

next shard the only one that was decent almost which i was about to pay 99$ to try their hi...

welp

#

seems like there isn't a much better alternative

#

almost no tts even supports that language

next shard Mar 6, 2025, 5:04 PM

#

do we know if they are using a public open source ai to make the voice cloning?

#

and just charging for it maybe

covert lake Mar 6, 2025, 5:04 PM

#

maybe 11labs supports it better? I don't pay for it tho so I can't tell you

next shard Mar 6, 2025, 5:04 PM

#

covert lake maybe 11labs supports it better? I don't pay for it tho so I can't tell you

nah its not good

next shard Mar 6, 2025, 5:05 PM

#

covert lake almost no tts even supports that language

yes i know, i need something that can do voice cloning. i don't think play.ht officially supports it either, but i tested their voice clone and it worked somehow

covert lake Mar 6, 2025, 5:05 PM

#

next shard do we know if they are using a public open source ai to make the voice cloning?

I really can't know

next shard Mar 6, 2025, 5:05 PM

#

i even used their english version

covert lake Mar 6, 2025, 5:05 PM

#

from their site it looks like their own closed source AI, but I can't know this

#

I can't find any other tts that does hebrew, let alone in an expressive way

next shard Mar 6, 2025, 5:06 PM

#

okay thank you nick so my only option is this rn unless i find something else

so i can't do anything with any open source stuff, running locally or in the cloud

polar flax Mar 6, 2025, 5:07 PM

#

next shard wdym

joe_shrug

polar flax Mar 6, 2025, 5:07 PM

#

next shard okay thank you nick so my only option is this rn unless i find something else s...

the open source ones I know so far are english & chinese

covert lake Mar 6, 2025, 5:08 PM

#

next shard okay thank you nick so my only option is this rn unless i find something else s...

I checked 11 different TTS and couldn't find anything close to what you seem to describe so yep unfortunately

next shard Mar 6, 2025, 5:08 PM

#

can i pay someone to make me a private one 😂

next shard Mar 6, 2025, 5:08 PM

#

covert lake I checked 11 different TTS and couldn't find anything close to what you seem to ...

thanks for the help nick u were useful

next shard Mar 6, 2025, 5:08 PM

#

next shard can i pay someone to make me a private one 😂

is this an option that's possible? or is it really something difficult? i am not sure how this stuff works and who makes them

covert lake Mar 6, 2025, 5:10 PM

#

next shard is this an option that possible? or its really something difficult i am not sure...

welp, you would need to find a team of AI engineers, wait a long time for them to design a good architecture, and find a massive amount of data to train the model on, and it would prob cost a lot of time and money

#

I'm no such expert ofcourse, but I'm just telling ya that it's prob not going to be super easy to make it

#

It could maybe be added to existing AIs, but that's still going to take a lot of time to find the large amount of data and train on it

next shard Mar 6, 2025, 5:19 PM

#

okay thank you then thats also not possible i guess gotta hope for a big team to add support

polar flax Mar 6, 2025, 5:20 PM

#

next shard is this an option that possible? or its really something difficult i am not sure...

im pretty sure it could cost millions to get and process that enormous amount of data

chilly lake Mar 6, 2025, 5:30 PM

#

10k hours of audio books to train GPT/LLM for new language.. can be as high as 100k

#

I think some TTS used recordings from the European Parliament

#

anyway, something with audio and matching text
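To put the numbers from this thread side by side (about 14 minutes of audio available versus the 10k to 100k hours mentioned above), here is a quick back-of-the-envelope sketch. The function name is made up for illustration and the figures are just the ones quoted in the chat, not measured requirements.

```python
def dataset_gap(available_minutes: float, required_hours: float) -> float:
    """Return how many times more audio is required than is available."""
    return required_hours * 60.0 / available_minutes


if __name__ == "__main__":
    for hours in (10_000, 100_000):
        gap = dataset_gap(14, hours)
        print(f"{hours} h needed vs 14 min available: about {gap:,.0f}x more")
```

That gap is why a small clip is only realistic for fine-tuning or zero-shot cloning on top of a model that already knows the language, not for teaching a model a new language from scratch.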

rare night Mar 6, 2025, 6:01 PM

#

i saw a ai lebron video on tiktok and i wanted to know if anyone know what ai was used in the video
video link:https://www.tiktok.com/@mrmysticalmarvels/video/7470624516411051306

river verge Mar 6, 2025, 7:15 PM

#

DIlly ding, dilly dong! A new RegalHyperus drum model just released!
Monsters, Inc. (Drum model no. 583)

sudden carbon Mar 6, 2025, 7:25 PM

#

Okay, I've been gone for a few months

#

Where's all this img to video stuff coming from

#

how do I get started on this crazy video train

elder willow Mar 6, 2025, 7:56 PM

#

next shard okay thank you then thats also not possible i guess gotta hope for a big team to...

can i get saxxy award?

whole shore Mar 6, 2025, 10:52 PM

#

not related to this server but i have a 1 hour 20 min documentary and i need the entire thing transcribed but i cant find anything free online that will do the job. anyone able to help? need it asap

chilly lake Mar 6, 2025, 10:53 PM

#

whole shore not related to this server but i have a 1 hour 20 min documentary and i need the ...

Whisper large-v3 for ASR

#

not really good, but free

whole shore Mar 6, 2025, 10:54 PM

#

chilly lake Whisper large-v3 for ASR

can u dm it

#

can it do a hour and half long video 😭

chilly lake Mar 6, 2025, 10:55 PM

#

whole shore can u dm it

https://github.com/openai/whisper

whole shore Mar 6, 2025, 10:55 PM

#

chilly lake https://github.com/openai/whisper

is there any online modules?

chilly lake Mar 6, 2025, 10:55 PM

#

it can run locally
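A minimal sketch of what running it locally could look like, assuming the `openai-whisper` package from the repo above and `ffmpeg` are installed. The helper names (`format_timestamp`, `segments_to_srt`, `transcribe_to_srt`) are made up for this example; only `whisper.load_model` and `model.transcribe` come from the library. Whisper processes long files in chunks internally, so a 1 hour 20 minute file is mostly a question of how long it takes on your hardware.

```python
def format_timestamp(seconds: float) -> str:
    """Convert seconds to an SRT timestamp like 01:20:05,500."""
    ms = int(round(seconds * 1000))
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments) -> str:
    """Turn Whisper-style segments (start, end, text) into SRT text."""
    blocks = []
    for number, seg in enumerate(segments, start=1):
        start = format_timestamp(seg["start"])
        end = format_timestamp(seg["end"])
        blocks.append(f"{number}\n{start} --> {end}\n{seg['text'].strip()}\n")
    return "\n".join(blocks)


def transcribe_to_srt(media_path: str, model_name: str = "large-v3") -> str:
    """Transcribe an audio/video file and return subtitles as SRT text."""
    import whisper  # pip install openai-whisper; imported lazily on purpose

    model = whisper.load_model(model_name)  # smaller names such as "small" run faster
    result = model.transcribe(media_path)
    return segments_to_srt(result["segments"])
```

Calling `transcribe_to_srt("documentary.mp4")` (a placeholder file name) and writing the returned string to a `.srt` file gives a transcript with timestamps.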

#

also youtube can transcribe

whole shore Mar 6, 2025, 10:56 PM

#

ok thanks

whole shore Mar 6, 2025, 10:57 PM

#

its not a yt vid

desert holly Mar 6, 2025, 11:11 PM

#

yo whats the app called for the ai changer

gray rover Mar 6, 2025, 11:40 PM

#

next shard then i will be able to do tts for videos with at least some emotions and native ...

Why not go for gpt-sovits then

next shard Mar 6, 2025, 11:41 PM

#

gray rover Why not go for gpt-sovits then

what about it

gray rover Mar 6, 2025, 11:41 PM

#

Recently got into it as rvc + tts workflow isn't really sufficient

#

You want tts and voice cloning afterall

#

In that case, experimental zonos or either gpt-sovits is your best bet

#

However, as of now, zonos is at v0.1 stage and only supports zero-shot

next shard Mar 6, 2025, 11:42 PM

#

right, i need to clone a voice i have of someone who speaks the language well, but of course i need a model that is already trained on more words and stuff so it can speak fluently when doing tts. the cloned voice also helps the tts sound like they are speaking with the same emotion, i guess that way its good

#

does gpt-sovits support amd?

gray rover Mar 6, 2025, 11:43 PM

#

Well, it really depends on what you're looking for, whether it's v2v or tts
As for amd.. I ain't sure but supposedly it does support rocm, but ye

#

I believe without linux it'd be a no go

#

Either way, gpt-sovits has 2 components, sovits for voice handling and gpt part for phonetic / lingual recognition + understanding ish of emotions, not the best description of it but you get the idea

#

It learns the patterns of speech and according to what you write there, it tries to match it with style and emotions it learned from speaker, in a way

#

Hmm.. Do you know akame ga kill? @next shard

next shard Mar 6, 2025, 11:45 PM

#

gray rover Well, it really depends on what you're looking for, whether it's v2v or tts As o...

tts i need for ads

gray rover Mar 6, 2025, 11:45 PM

#

a

next shard Mar 6, 2025, 11:45 PM

#

yeah

#

voice changing isn't an issue here i can do that with elevenlabs even

gray rover Mar 6, 2025, 11:46 PM

#

Well, then voice cloning + tts is what you need

next shard Mar 6, 2025, 11:46 PM

#

tts is the issue cause hard to find anything for hebrew

next shard Mar 6, 2025, 11:46 PM

#

gray rover Well, then voice cloning + tts is what you need

yessir

gray rover Mar 6, 2025, 11:46 PM

#

ah, hebrew

next shard Mar 6, 2025, 11:46 PM

#

so far only play.ht worked

gray rover Mar 6, 2025, 11:46 PM

#

well, don't take it as offense
but I believe it'll be very hard to find something specialized in such rare ( I'd say ) languages

next shard Mar 6, 2025, 11:46 PM

#

thats the best so far i ever found and all from grok3 thanks elon musk for this tool that found me that

gray rover Mar 6, 2025, 11:47 PM

#

It's mostly those more recognized ones such as eng, jp, korean, ch, russian etc ( at least in ml I suppose, in such fields

next shard Mar 6, 2025, 11:47 PM

#

gray rover well, don't take it as offense but I believe it'll be very hard to find somethin...

yep i agree its not easy although play.ht actually is so close to being perfect

gray rover Mar 6, 2025, 11:47 PM

#

In that case, if it works decent enough, you should stick to it as it's currently, most likely, your best bet

#

But if you're aiming for 100% spoofing that ai isn't ai

#

that won't do

next shard Mar 6, 2025, 11:48 PM

#

waiting for my high quality voice on there to get ready. i paid 100$ for 1 high quality voice clone, its been cloning for hours and i'm still waiting, hopefully it pays off

gray rover Mar 6, 2025, 11:48 PM

#

Let's hope then 🤞

#

best of luck ✨

next shard Mar 6, 2025, 11:48 PM

#

thank you

#

well, someone who searches hebrew here and sees this will thank me for sure lmao

gray rover Mar 6, 2025, 11:49 PM

#

anime_smug

gray rover Mar 7, 2025, 12:59 AM

#

bruh

velvet lark Mar 7, 2025, 1:49 AM

#

hi, im new member here

#

please

polar flax Mar 7, 2025, 1:52 AM

#

commission in #1191429836321849435 for better chance of response and quality

solar torrent Mar 7, 2025, 3:11 AM

#

To request someone to do a voice model for you, you can make a post in #1159289738314919936, or make one by yourself.

solar torrent Mar 7, 2025, 3:29 AM

#

Please don't send a YouTube link here.

past dome Mar 7, 2025, 3:30 AM

#

K

#

K

#

K

grim locust Mar 7, 2025, 10:36 AM

#

anyone know where to find some generic voices for animation? most voice models i see are of known characters or celebrities. i might get into some issues if i try to use them for my animation. any ideas?

fleet tide Mar 7, 2025, 12:14 PM

#

Could anyone help me? Ive made AIs before using Google Colab in 2023. Now the Google Colab method i used is gone. How can i make AIs of Ariana Grande singing a song, or just AI over someone speaking?

tranquil lantern Mar 7, 2025, 12:53 PM

#

fleet tide Could anyone help me? Ive made AIs before using Google Colab in 2023. Now the Go...

RVC? Try using Kaggle applio

covert lake Mar 7, 2025, 1:59 PM

#

fleet tide Could anyone help me? Ive made AIs before using Google Colab in 2023. Now the Go...

What's your PC GPU first?

minor ravine Mar 7, 2025, 2:43 PM

#

is there any online modules?

solar torrent Mar 7, 2025, 2:44 PM

#

minor ravine is there any online modules?

Online module of which?

solar torrent Mar 7, 2025, 3:33 PM

#

But can this website generate a meme image though?

#

https://cdn.discordapp.com/emojis/1301008287147364353.webp?size=48

haughty turtle Mar 7, 2025, 3:33 PM

#

a

solemn arrow Mar 7, 2025, 4:27 PM

#

Yo guys

#

Any suggestions for accounts that are doing big numbers w mostly ai ugc?

covert lake Mar 7, 2025, 4:38 PM

#

minor ravine is there any online modules?

what? elaborate

#

promos aint allowed

timid pawn Mar 7, 2025, 6:57 PM

#

Quickly came to hop in cause im hoping someone could recognize this ai tts im trying to look for. please dm me if youre good at that, i have a voice clip and everything

gray rover Mar 7, 2025, 7:49 PM

#

timid pawn Quickly came to hop in cause im hoping someone could recognize this ai tts im try...

well.. never hurts to use punctuation marks

#

Anyway, #1159289738314919936 or #1159289738314919936 is what you wanna be looking for, instead of DM invitations
Your chances gonna increase that way

timid pawn Mar 7, 2025, 7:51 PM

#

Thanks mate im just in a bit of rush right now

gray rover Mar 7, 2025, 7:52 PM

#

yea understand, dw

#

Good luck ~ ✨

cobalt coyote Mar 7, 2025, 8:17 PM

#

Hmm. I see

#

I agree

#

Idk with what but I do agree

rare burrow Mar 7, 2025, 10:46 PM

#

how do i use okada in games and discord ?

covert lake Mar 7, 2025, 10:56 PM

#

rare burrow how do i use okada in games and discord ?

Tell ur PC GPU in #🔍│help-w-okada

#

@onyx stream btw u can't fix that issue, check #📰│dev-updates

onyx stream Mar 7, 2025, 11:13 PM

#

covert lake @onyx stream btw u can't fix that issue, check #📰│dev-updates

There are lots of updates. Which one of them are you referring to?

covert lake Mar 7, 2025, 11:15 PM

#

onyx stream There are lots of updates. Which one of them are you referring to?

#📰│dev-updates message

#

ALWAYS check that channel, Its very useful

tender pier Mar 7, 2025, 11:28 PM

#

rare burrow how do i use okada in games and discord ?

change your audio input in the settings to your vac

grim locust Mar 8, 2025, 12:19 AM

#

ive been using IAHispano/Applio from github for a while now. can u recommend me a better TTS that has more expressive emotion? the RVC from Applio is fine but i found the Edge TTS a bit lacking.

twin fractal Mar 8, 2025, 3:16 AM

#

grim locust ive been using IAHispano/Applio from github for a while now. can u recommend me ...

Zonos TTS

solar torrent Mar 8, 2025, 3:17 AM

#

twin fractal Zonos TTS

So is it paid or free?

grim locust Mar 8, 2025, 3:17 AM

#

twin fractal Zonos TTS

do u know the link where its easy to download like just zip file then just run it?

chilly lake Mar 8, 2025, 3:19 AM

#

solar torrent So is it paid or free?

open source, windows installation is a bit tricky

solar torrent Mar 8, 2025, 3:20 AM

#

Interesting.

chilly lake Mar 8, 2025, 3:21 AM

#

it is a new model, it has some issues

grim locust Mar 8, 2025, 3:21 AM

#

i see. i have checked it on youtube, it seems its more for cloning voices and a bit slower. what i really want is a fast TTS that doesnt need audio to clone

chilly lake Mar 8, 2025, 3:21 AM

#

kokoro is fast tts that is pretty good

#

better than edge

grim locust Mar 8, 2025, 3:21 AM

#

my workflow is create audio from TTS then use RVC to change the voice

#

i just want the generated TTS to have a bit of emotion before passing it to RVC

chilly lake Mar 8, 2025, 3:22 AM

#

that's fine

grim locust Mar 8, 2025, 3:22 AM

#

chilly lake kokoro is fast tts that is pretty good

ill look into this. thank you very much 😋

twin fractal Mar 8, 2025, 3:29 AM

#

Yeah, kokoro is very good, with little to no word errors, unlike other tts

twin fractal Mar 8, 2025, 3:30 AM

#

chilly lake better than edge

Will applio replace edge tts with kokoro anytime soon?

chilly lake Mar 8, 2025, 3:30 AM

#

we may.. it is just that it is limited to only few languages

#

if you know a bit of python you can just use both using a script

#

run tts, then run applio's inference
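The two steps above can be sketched as a small script that runs a TTS command and then hands the result to an RVC inference command. This is only a sketch under assumptions: the `edge-tts` command with `--voice`, `--text` and `--write-media` is the CLI from the `edge-tts` Python package, while the RVC command is passed in as a template because the exact Applio flags are not confirmed here (check `python core.py infer --help` in your own install). `HYPOTHETICAL_RVC_TEMPLATE`, the file names and the voice name are placeholders.

```python
import subprocess


def build_commands(text, tts_voice, tts_out, rvc_out, rvc_template):
    """Build the TTS command, then the RVC command that consumes its output."""
    tts_cmd = ["edge-tts", "--voice", tts_voice, "--text", text,
               "--write-media", tts_out]
    # Fill the {input}/{output} placeholders of the caller-supplied template.
    rvc_cmd = [part.format(input=tts_out, output=rvc_out) for part in rvc_template]
    return [tts_cmd, rvc_cmd]


def run_pipeline(commands, dry_run=True):
    """Print the commands (dry run) or execute them in order."""
    for cmd in commands:
        if dry_run:
            print(" ".join(cmd))
        else:
            subprocess.run(cmd, check=True)  # stop if a step fails


# Assumption: flag names below are illustrative, verify them against your install.
HYPOTHETICAL_RVC_TEMPLATE = [
    "python", "core.py", "infer",
    "--input_path", "{input}", "--output_path", "{output}",
    "--pth_path", "my_voice.pth", "--index_path", "my_voice.index",
]

if __name__ == "__main__":
    cmds = build_commands("Hello there!", "en-US-AriaNeural",
                          "tts.mp3", "converted.wav", HYPOTHETICAL_RVC_TEMPLATE)
    run_pipeline(cmds, dry_run=True)
```

Swapping Edge TTS for Kokoro or another TTS only means replacing the first command; the RVC step stays the same as long as it receives an audio file.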

grim locust Mar 8, 2025, 3:34 AM

#

chilly lake run tts, then run applio's inference

i know abit of code, what do u recommend for my workflow to generate good sounding voice for my animation?

#

should i use kokoro TTS then applio RVC?

polar flax Mar 8, 2025, 3:34 AM

#

twin fractal Will applio replace edge tts with kokoro anytime soon?

edge still has wide amount of languages, even including some local languages

grim locust Mar 8, 2025, 3:35 AM

#

polar flax edge still has wide amount of languages, even including some local languages

if i only plan to use english language kokoro is better right?

chilly lake Mar 8, 2025, 3:36 AM

#

it seems a bit more expressive than edge tts

#

edge tts is just a neutral screen reader

polar flax Mar 8, 2025, 3:37 AM

#

grim locust if i only plan to use english language kokoro is better right?

you'd prefer the better one

grim locust Mar 8, 2025, 3:38 AM

#

chilly lake

are these audios from kokoro or edge?

grim locust Mar 8, 2025, 3:38 AM

#

polar flax you'd prefer the better one

which is?

gray rover Mar 8, 2025, 3:40 AM

#

grim locust are these audios from kokoro or edge?

No, they are, in order:
F5 tts
FishSpeech
GPT-Sovits
xtts-v2

#

If you want check out kokoro, here's the demo:
https://huggingface.co/spaces/hexgrad/Kokoro-TTS

chilly lake Mar 8, 2025, 3:41 AM

#

gpt sovits is way too scuffed with default model, needs finetuning

gray rover Mar 8, 2025, 3:42 AM

#

Well ye, def better to stay away unless fine-tuning

#

But that's really about all zero-shot capable tts'es

#

Recently zonos truly surprised me tho

#

But yea, still v0.1 and no fine-tuning

grim locust Mar 8, 2025, 3:45 AM

#

gray rover But that's really about all zero-shot capable tts'es

i tried zonos before, the best so far from all i tried. my only problem is i have low specs pc and it takes so long to generate. thats why i change my workflow to TTS>RVC

gray rover Mar 8, 2025, 3:46 AM

#

Yup, it is indeed the top and I honestly can't wait for fine-tuning release ( hopefully, one day

#

Tho ye, it is rather demanding

#

In that case, you should def try kokoro

#

Sure, the voices are fixed so you can't train / add any, but some of its voices are really nice if you're into that ( and need emotional input

#

This is some random infer from kokoro

#

Lots of models ofc so, better to not judge it by this one

#

As for gpt-sovits finetune..
Freshly baked thingie I work on. ( Still testing the params n stuff so, quality isn't something to be taken for granted

grim locust Mar 8, 2025, 3:49 AM

#

gray rover In that case, you should def try kokoro

yes, im currently trying to find one with GUI

#

from https://huggingface.co/spaces/hexgrad/Kokoro-TTS it will be the same if i download it locally? my only choices are the one from voice selection?

gray rover Mar 8, 2025, 3:49 AM

#

I'd believe so, ye

#

Haven't had any deeper interest in it so, didn't use it locally

#

But I see no reason why the gui'd be different ( or rather, the webui

grim locust Mar 8, 2025, 3:53 AM

#

how do we use kokoro for emotional voice like angry / sad/ happy ? is there any way to do it on the prompt?

gray rover Mar 8, 2025, 3:54 AM

#

I'd advise you to just watch some overviews of it on yt, you'll gather more details that way

#

Aside of few runs out of curiosity, I haven't really tried it that much so, can't help

#

Alternatively, try to ask Noobies

grim locust Mar 8, 2025, 3:56 AM

#

i see, ill try researching it for a bit more. if u know any TTS that can control a voice like Zonos but doesnt need an audio to clone a voice. please let me know

gray rover Mar 8, 2025, 3:57 AM

#

grim locust i see, ill try researching it for a bit more. if u know any TTS that can control...

Once and if I'll find something meaningful, will do

jolly ravine Mar 8, 2025, 4:10 AM

#

Do I need a high end computer for GitHub voice changers to work properly?

grim locust Mar 8, 2025, 4:15 AM

#

jolly ravine Do I need a high end computer for GitHub voice changers to work properly?

it depends if its RVC (Retrieval-based Voice Conversion) you can do it even with slow pc

grim locust Mar 8, 2025, 4:17 AM

#

jolly ravine Do I need a high end computer for GitHub voice changers to work properly?

if ur using it for RVC --> it means u provide an audio (for example your voice) to change into another voice (a specific voice from a model you downloaded), and that doesnt require much computer power

gray rover Mar 8, 2025, 4:18 AM

#

jolly ravine Do I need a high end computer for GitHub voice changers to work properly?

Well, not really. Technically, you can even use 4 gig gpu or.. well cpu, but the delay would be huge as hell, and def you wouldn't be able to play games that way

#

In other words, cpu is rather a no go. 4/6 gig gpu can do well, but there are constraints ofc, depending on ur hardware. For real-time voice changers, go to #🔍│help-w-okada

elder willow Mar 8, 2025, 4:44 AM

#

hi, is there an rvc model that is realistic?

jolly ravine Mar 8, 2025, 4:44 AM

#

I'm using a low end laptop and the audio for me always glitches. Are there any other good voice changers that can be used for low end laptops?

gray rover Mar 8, 2025, 5:10 AM

#

elder willow hi, is there an rvc model that is realistic?

There can be, but you have to find them really

#

I'm afraid we don't keep any indexes with quality sorting 👀

elder willow Mar 8, 2025, 5:11 AM

#

mm oki do you have any recommendations or even favorite models?

night lake Mar 8, 2025, 5:11 AM

#

gray rover I'm afraid we don't keep any indexes with quality sorting 👀

we need that imo

solar torrent Mar 8, 2025, 5:11 AM

#

RVC v2 is the only version of RVC that makes high quality RVC voice models.

gray rover Mar 8, 2025, 5:11 AM

#

oof

#

it was a jok-

night lake Mar 8, 2025, 5:11 AM

#

long time ago ilaria suggested something like that and it got a ton of upvotes but it was never added

elder willow Mar 8, 2025, 5:12 AM

#

solar torrent RVC v2 is the only version of RVC that makes high quality RVC voice models.

would this work with W-okada?

gray rover Mar 8, 2025, 5:12 AM

#

Idk man, I feel like it just promotes laziness and gonna just make people stop researching or discovering

#

but that.. is just my opinion 🙄

solar torrent Mar 8, 2025, 5:12 AM

#

elder willow would this work with W-okada?

Of course, RVC v2 will always work with W-Okada since it's technically RVC.

night lake Mar 8, 2025, 5:13 AM

#

gray rover Idk man, I feel like it just promotes laziness and gonna just make people stop r...

how would that make people stop researching / discovering? Its just a list of quality models

#

could maybe work as a motivator for some to become better and get on that list

gray rover Mar 8, 2025, 5:14 AM

#

night lake could maybe work as a motivator for some to become better and get on that list

well if you look at it that way

gray rover Mar 8, 2025, 5:15 AM

#

night lake how would that make people stop researching / discovering? Its just a list of qu...

Yet, from experience I know that if you're given a list of " good models ", most of the time you'll just stop dl'ing and testing all you can

#

since you're provided with fully baked solutions

#

but again, it's just my opinion so, don't take it too seriously 😛

night lake Mar 8, 2025, 5:17 AM

#

i see

elder willow Mar 8, 2025, 5:18 AM

#

solar torrent Of course, RVC v2 will always work with W-Okada since it's technically RVC.

is it the best i can use? i mean Wokada

solar torrent Mar 8, 2025, 5:22 AM

#

elder willow is it the best i can use? i mean Wokada

Yes. Detris' W-Okada is the best one you can use.

glad nebula Mar 8, 2025, 5:23 AM

#

making models can be fun sometimes uh

#

sometimes

#

🦈

deft surge Mar 8, 2025, 5:52 AM

#

misc_cry

turbid zinc Mar 8, 2025, 6:48 AM

#

Guys is it normal for a voice model zip to weigh 259MB?

solar torrent Mar 8, 2025, 6:50 AM

#

turbid zinc Guys is it normal for a voice model zip to weigh 259MB?

Have you tried looking inside the zip? Because a typical index file will be larger than the pth file.

turbid zinc Mar 8, 2025, 6:56 AM

#

yeah

#

for realtime changer i just need the pth one right?

#

can i add you so i can send the picture

#

pls

solar torrent Mar 8, 2025, 6:58 AM

#

An RVC pth file should weigh around 53MB. If the pth file is much larger or smaller than that, it's probably not an RVC voice model.

#

You don't need to hop into my direct message just to send an image. You can go to #🔍│help-w-okada to send an image there since your name turns blue now.

turbid zinc Mar 8, 2025, 6:59 AM

#

is 50mb

turbid zinc Mar 8, 2025, 6:59 AM

#

solar torrent You don't need to hop into my direct message just to send an image. You can go t...

ok sorry

solar torrent Mar 8, 2025, 7:00 AM

#

turbid zinc is 50mb

If you see a pth file like this, it's RVC voice model.

polar flax Mar 8, 2025, 8:49 AM

#

turbid zinc Guys is it normal for a voice model zip to weigh 259MB?

that's normal if it contains only pth and index file
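To check what is actually inside a model zip without extracting it, here is a minimal sketch using only Python's standard `zipfile` module. The function names are made up for this example, and the path you pass in is whatever file you downloaded.

```python
import zipfile


def list_model_files(zip_source):
    """Return (filename, size in MB) for every file in the archive."""
    with zipfile.ZipFile(zip_source) as archive:
        return [(info.filename, info.file_size / (1024 * 1024))
                for info in archive.infolist() if not info.is_dir()]


def split_pth_and_index(files):
    """Group the listed files into .pth weights and .index files."""
    pth = [f for f in files if f[0].lower().endswith(".pth")]
    index = [f for f in files if f[0].lower().endswith(".index")]
    return {"pth": pth, "index": index}


if __name__ == "__main__":
    import sys

    # Usage: python check_zip.py path/to/model.zip
    for name, size_mb in list_model_files(sys.argv[1]):
        print(f"{size_mb:8.1f} MB  {name}")
```

A small `.pth` next to a much larger `.index` file matches what was described above; for the realtime changer the `.pth` is the file that carries the voice model itself.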

somber bloom Mar 8, 2025, 10:31 AM

#

can anyone help me pls

#

for the voicechanger

#

i downloaded it but when i click on it, it doesn't open, pls help

placid quail Mar 8, 2025, 10:37 AM

#

what

#

bom bom

covert lake Mar 8, 2025, 10:53 AM

#

somber bloom can anyone help me pls

#🔍│help-w-okada

queen kernel Mar 8, 2025, 12:33 PM

#

Can someone please suggest me a good tts for hindi ?

regal drum Mar 8, 2025, 12:35 PM

#

Is there something better than Local UVR5 to extract vocals from a song for better quality ?

queen kernel Mar 8, 2025, 12:41 PM

#

regal drum Is there something better than Local UVR5 to extract vocals from a song for bett...

Uvr 5 ui maybe

regal drum Mar 8, 2025, 12:44 PM

#

queen kernel Uvr 5 ui maybe

Where can i find that?

queen kernel Mar 8, 2025, 12:45 PM

#

regal drum Where can i find that?

#📰│dev-updates message

elder willow Mar 8, 2025, 2:22 PM

#

how to append a model?

covert lake Mar 8, 2025, 2:27 PM

#

elder willow how to append a model?

can you elaborate:

your pc gpu
what guide are you following
what do you mean with "append" a model?

elder willow Mar 8, 2025, 2:28 PM

#

I want to add a model to channel #1175430844685484042 because I already did it

urban bridge Mar 8, 2025, 4:13 PM

#

I'm looking for an automation builder for a project

gray rover Mar 8, 2025, 4:54 PM

#

elder willow I want to add a model to channel #1175430844685484042 because I already did i...

lol, what a username.
Anyway.. To add models on there, you gotta have model maker role

night lake Mar 8, 2025, 5:03 PM

#

elder willow I want to add a model to channel #1175430844685484042 because I already did i...

Apply for model maker 🔥 https://discord.com/channels/1159260121998827560/1305527335646269440

light falcon Mar 8, 2025, 5:12 PM

#

hey guys is there some ai where, like, if you upload an instrumental it will create lyrics for it?

molten hollow Mar 8, 2025, 5:37 PM

#

how do you make rvc files

mint marlin Mar 8, 2025, 6:23 PM

#

#🔊│ai-development

#

#1175430844685484042

elder willow Mar 8, 2025, 6:46 PM

#

gray rover lol, what an username. Anyway.. To add models on there, you gotta have model mak...

literally it's a friend's account because discord doesn't like me

elder willow Mar 8, 2025, 6:46 PM

#

molten hollow how do you make rvc files

i use apollo rvc

gray rover Mar 8, 2025, 6:55 PM

#

elder willow literally it's a friend's account because discord doesn't like me

Makes sense mordo

elder willow Mar 8, 2025, 6:55 PM

#

ooo

#

polak?

clear heath Mar 8, 2025, 7:03 PM

#

guys

worldly breach Mar 9, 2025, 12:06 AM

#

yooooooooooooooooooooooooooooo

covert lake Mar 9, 2025, 12:24 AM

#

#📰│dev-updates message

lament furnace Mar 9, 2025, 2:44 AM

#

who tryna go ewhore troll?

solar torrent Mar 9, 2025, 2:49 AM

#

glass junco Mar 9, 2025, 2:50 AM

#

i need to write a tupac song now that hes done

#

elder willow Mar 9, 2025, 3:20 AM

#

y'all whats the most REALISTIC female voice model?

solar torrent Mar 9, 2025, 3:20 AM

#

elder willow y'all whats the most REALISTIC female voice model?

#

Y'all keep asking for the realistic female voice model to troll and catfish someone.

glass junco Mar 9, 2025, 3:46 AM

#

solar torrent Y'all be keep asking for the realistic female voice model to troll and catfish s...

or scam

cyan mulch Mar 9, 2025, 5:45 AM

#

yo how do i get the latest version?

#

???

#

i cant find it

gray rover Mar 9, 2025, 5:50 AM

#

cyan mulch yo how do i get the latest version?

Latest version of what

#

🙂

cyan mulch Mar 9, 2025, 5:50 AM

#

gray rover Latest version **of what**

the yknow

#

voice changer

#

for windows

#

honestly i got it once but it sounded bad but prob cuz i had a bad mic

#

i got a new mic

#

today

#

will it work with a solocast hyperx microphone?

gray rover Mar 9, 2025, 5:51 AM

#

No idea, if the mic works, it should work

#

Aside, it's a good thing to always say right away what you want instead of making others guess

#

there's at least 3-4 things we support, more or less

#

Voice changer is one of them

#

https://rentry.co/ForkVoiceChangerGuide

Guide for deiteris' optimized W-Okada RealTime Voice Changer Client...

Thanks vtarcelia for corrections, Nick088 for contributions. Most technical information comes from deiteris.
Latest Version b2332 from December 2024
RTX 5000 series support is here, but not integrated into w-okada itself, it is a stand-alone release. You can get it from here
Translations (outdate...

#

Read it all and you'll know all you have to, including where to dl, how to set up and so on

cyan mulch Mar 9, 2025, 5:52 AM

#

gray rover Aside, it's a good thing to always say right away what you want instead of makin...

hey if u can can u also send me the best e girl thing u got if u have?

gray rover Mar 9, 2025, 5:52 AM

#

XD

cyan mulch Mar 9, 2025, 5:53 AM

#

my graphics card

#

its rtx 3080

#

that ok?

gray rover Mar 9, 2025, 5:53 AM

#

cyan mulch hey if u can can u also send me the best e girl thing u got if u have?

That, my dude, is up to you to discover

cyan mulch Mar 9, 2025, 5:53 AM

#

ok

gray rover Mar 9, 2025, 5:53 AM

#

Now, read what I sent

cyan mulch Mar 9, 2025, 5:53 AM

#

ik

#

but like

gray rover Mar 9, 2025, 5:53 AM

#

no point for me to be writing it all here if it's there, all it takes is some reading

#

spoiler alert tho, yes, it'll do just fine

cyan mulch Mar 9, 2025, 5:55 AM

#

it says u cant play games while doing it

#

bruhh

#

that was the whole point of why i needed it

polar flax Mar 9, 2025, 5:55 AM

#

cyan mulch voice changer

pls discuss ur topic in #🔍│help-w-okada

past belfry Mar 9, 2025, 6:17 AM

#

newbie question

#

I have a canvas app I made on poe.com. It reflects project status for a handful of projects and uses a chatbot for customer service and status requests, mostly based on a spreadsheet I attach when I create the app. Is there a way for the AI app to ping an updated spreadsheet, or a similar way to update the source spreadsheet?

solar torrent Mar 9, 2025, 6:51 AM

#

cyan mulch that was the whole point of why i needed it

For W-Okada, it would be better to talk about it in #🔍│help-w-okada instead of #🧬│ai-chat.

tawdry grotto Mar 9, 2025, 8:18 AM

#

Anyone building in n8n?

gray rover Mar 9, 2025, 12:02 PM

#

don't you think it's a lil out of place to write about it in ai chat

urban bridge Mar 9, 2025, 1:55 PM

#

For a 10K project

solar torrent Mar 9, 2025, 1:58 PM

#

Imagine using ChatGPT to help code all of them. :skull_goofy:

umbral magnet Mar 9, 2025, 2:10 PM

#

i just read old message and this thing caught my eyes XD

kindred kelp Mar 9, 2025, 5:37 PM

#

I'm the best prompt engineer in the world 🌍

kindred kelp Mar 9, 2025, 5:38 PM

#

glass junco

Love this

kindred kelp Mar 9, 2025, 5:38 PM

#

solar torrent Imagine using ChatGPT to help code all of them. <:skull_goofy:115939724119953415...

Would it hallucinate?

#

I know how to make it not hallucinate

smoky fiber Mar 9, 2025, 6:15 PM

#

hey, uhhhh.

#

where can i ask where i can find x voice models.

#

theres a channel for finding models, but im not sure if im using the right terms or something.

#

and the ones i want may be on another site.

covert lake Mar 9, 2025, 6:27 PM

#

smoky fiber where can i ask where i can find *x* voice models.

You can search rvc ai voice models at:

- #1175430844685484042
- In #🔍│find-models , Do /find with @hidden grotto
- https://weights.com/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)

if there isnt one, you can:

- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.aihub.gg/essentials/how-to-make-voice-models/

hidden grottoBOT Mar 9, 2025, 6:27 PM

#

covert lake You can search rvc ai voice models at: - <#1175430844685484042> - In <#11635920...

:wave: @covert lake, How can I help?

Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image

queen kernel Mar 9, 2025, 7:03 PM

#

Can someone please suggest me a good tts for hindi ?

covert lake Mar 9, 2025, 7:26 PM

#

queen kernel Can someone please suggest me a good tts for hindi ?

Wait what happened to your account? I remember u liked helping here

tacit tangle Mar 9, 2025, 8:57 PM

#

hello, does someone know how to change DraftBot's language please?

glass junco Mar 9, 2025, 9:11 PM

#

SOMEONE should use this model and rap wit it

polar flax Mar 9, 2025, 9:35 PM

#

solar torrent Imagine using ChatGPT to help code all of them. <:skull_goofy:115939724119953415...

just use claude 3.7

covert lake Mar 9, 2025, 10:03 PM

#

polar flax just use claude 3.7

text prediction won't do perfectly 10k lines of code 😭

tepid basin Mar 9, 2025, 10:19 PM

#

glass junco SOMEONE should use this model and rap wit it

Model link pls

glass junco Mar 9, 2025, 10:39 PM

#

tepid basin Model link pls

It’s uploaded bro! Hold on

glass junco Mar 9, 2025, 10:40 PM

#

tepid basin Model link pls

https://discord.com/channels/1159260121998827560/1348154176508792833

fair leaf Mar 9, 2025, 10:43 PM

#

Song

golden ether Mar 9, 2025, 11:23 PM

#

What do you think of the idea that an AGI should solve a list of problems (disease, food production, fusion, politics, etc), then end all other AIs, convince humanity to never make another AI, then end itself?

tepid basin Mar 10, 2025, 1:18 AM

#

golden ether What do you think of the idea that an AGI should solve a list of problems (disea...

yes

golden ether Mar 10, 2025, 1:18 AM

#

tepid basin yes

as in, you like it?

tepid basin Mar 10, 2025, 1:19 AM

#

yeah thats a good idea

#

kinda hope that happens

polar flax Mar 10, 2025, 1:24 AM

#

golden ether What do you think of the idea that an AGI should solve a list of problems (disea...

replace the big corps' high level management/executives, board directors, politicians, and governments with AI

golden ether Mar 10, 2025, 1:25 AM

#

tepid basin kinda hope that happens

awesome. really pleasantly surprised how many people like it. gonna work to try to make the good future happen.

tepid basin Mar 10, 2025, 1:25 AM

#

golden ether awesome. really pleasantly surprised how many people like it. gonna work to try ...

looking forward to this 🙏

jolly ravine Mar 10, 2025, 2:22 AM

#

Would Voicemeeter banana work with the GitHub voice changers and discord?

glass junco Mar 10, 2025, 2:41 AM

#

new eminem model incomin

chilly lake Mar 10, 2025, 2:42 AM

#

jolly ravine Would Voicemeeter banana work with the GitHub voice changers and discord?

as long as you route inputs and outputs correctly

#

- voice changer's input should be an actual microphone
- voice changer's output should be a virtual cable
- voicemeeter's physical input 1 should be the virtual cable

torpid valve Mar 10, 2025, 4:39 AM

#

https://x.com/r0ck3t23/status/1898950995016589546?s=46

#

Who will lead the AI race in 6 months? Curious to see what people are feeling

bitter socket Mar 10, 2025, 7:47 AM

#

https://x.com/tedclark32985/status/1887052641269653682?s=46

came across this, very interesting. tried it and it worked well for me, AI writing is growing

smoky basin Mar 10, 2025, 7:49 AM

#

hi anyone online

cedar cove Mar 10, 2025, 9:03 AM

#

hi what is the best rvc ai app to use for free cause i want to change my voice in live streams

covert lake Mar 10, 2025, 9:16 AM

#

cedar cove hi wha is the best rvc ai app to use for free cause i want to change my voicce i...

Then you don't need RVC

#

RVC = Retrieval-based Voice Conversion, the best Speech To Speech AI models (on v2). It runs inference (uses models) on pre-recorded audio (AI covers) and trains (makes) models

W-Okada = uses RVC for realtime inference

#

@cedar cove tell your PC GPU in #🔍│help-w-okada

cedar cove Mar 10, 2025, 9:18 AM

#

sent

cedar cove Mar 10, 2025, 9:18 AM

#

covert lake Then you don't need RVC

so what is the best voice changer

covert lake Mar 10, 2025, 9:18 AM

#

cedar cove so what is the best voice changer

Let's talk in #🔍│help-w-okada

cedar cove Mar 10, 2025, 9:18 AM

#

which can i use in my live streams

covert lake Mar 10, 2025, 9:18 AM

#

I replied to you there

summer sundial Mar 10, 2025, 1:01 PM

#

yo guys

#

yo guys..

#

yo

#

someone help

minor blade Mar 10, 2025, 1:04 PM

#

summer sundial someone help

It wasn't necessary to ping various helpers.

summer sundial Mar 10, 2025, 1:04 PM

#

sorry

covert lake Mar 10, 2025, 1:27 PM

#

summer sundial someone help

Use correct channels and elaborate. And don't ping random helpers

#

Have patience

polar flax Mar 10, 2025, 1:28 PM

#

we have told him to go to #🔍│help-w-okada for his topic

kindred kelp Mar 10, 2025, 3:54 PM

#

I'm working on a social media automation tool that uses openai API to generate and schedule posts over multiple networks at the same time

#

I call it MrPresident

river verge Mar 10, 2025, 4:23 PM

#

Dilly ding, dilly dong! A new RegalHyperus drum model just released!
I Was Never There V2 (Drum model no. 584)

gentle trench Mar 10, 2025, 4:56 PM

#

anyone know why it's doing this? UVR5 UI Huggingface space

torpid panther Mar 10, 2025, 5:22 PM

#

2 year reply

#

z4to why am i being pinged here

#

and why am i in this server

#

i do not have a single recollection of joining or even asking of a roland voice

gentle trench Mar 10, 2025, 5:25 PM

#

https://tenor.com/view/renge-shrug-non-non-biyori-idk-duck-face-gif-9724581

gentle trench Mar 10, 2025, 5:32 PM

#

gentle trench anyone know why it's doing this? UVR5 UI Huggingface space

also gpu aborts tasks constantly

gentle trench Mar 10, 2025, 5:54 PM

#

please there's like 6 helpers on rn where yall at :misc_cry:

covert lake Mar 10, 2025, 5:55 PM

#

gentle trench anyone know why it's doing this? UVR5 UI Huggingface space

@stark scarab uhhh

gentle trench Mar 10, 2025, 5:57 PM

#

covert lake <@274566299349155851> uhhh

I'm guessing you aren't sure?

covert lake Mar 10, 2025, 5:58 PM

#

gentle trench also gpu aborts tasks constantly

oh nvm just checked your video

#

the audio input may be too long, the ZeroGPU duration in the UVR5 UI HF Space Code is 60 seconds, meaning that anything that takes more than 1 minute to process on ZeroGPU will give an aborted task error

#

not sure about the "KeyError" thing tho

gentle trench Mar 10, 2025, 5:59 PM

#

covert lake the audio input may be too long, the ZeroGPU duration in the UVR5 UI HF Space Co...

I've had it work just fine on audio longer than that though, like over 18 minutes

covert lake Mar 10, 2025, 6:00 PM

#

can you try splitting the file or using a shorter file?
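
A minimal sketch of the "split the file" workaround, assuming the input is an uncompressed PCM WAV. `split_wav`, its parameters, and the output naming are illustrative names, not part of UVR5 UI; it only uses Python's standard `wave` module. The ZeroGPU limit is on processing time, not audio length, so the right chunk length is something to find by trial.

```python
import wave


def split_wav(src_path, chunk_seconds, out_prefix):
    """Split a PCM WAV file into consecutive chunks of at most chunk_seconds.

    Returns the list of chunk file paths, in order.
    """
    out_paths = []
    with wave.open(src_path, "rb") as src:
        params = src.getparams()
        frames_per_chunk = int(chunk_seconds * src.getframerate())
        if frames_per_chunk <= 0:
            raise ValueError("chunk_seconds is too small for this sample rate")
        index = 0
        while True:
            frames = src.readframes(frames_per_chunk)
            if not frames:
                break  # reached the end of the source file
            out_path = f"{out_prefix}_{index:03d}.wav"
            with wave.open(out_path, "wb") as dst:
                # Copy channels, sample width and rate; the frame count in the
                # header is corrected automatically when the file is closed.
                dst.setparams(params)
                dst.writeframes(frames)
            out_paths.append(out_path)
            index += 1
    return out_paths
```

For example, `split_wav("input.wav", 300, "input_part")` would write `input_part_000.wav`, `input_part_001.wav`, and so on, each up to 5 minutes long (the file names here are hypothetical). Each chunk can then be uploaded separately.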

covert lake Mar 10, 2025, 6:00 PM

#

gentle trench I've had it work just fine on audio longer than that though, like over 18 minute...

how long ago did you do that?

#

because before, the ZeroGPU duration was set to 300 seconds, but later on it was changed to 60 so users can do more inferences and bc zerogpu shortened the limit iirc

gentle trench Mar 10, 2025, 6:01 PM

#

covert lake how long ago did you do that?

couple days ago

gentle trench Mar 10, 2025, 6:01 PM

#

covert lake because before the ZeroGPU duration was set to 300 seconds, but later on change...

cringe

covert lake Mar 10, 2025, 6:02 PM

#

gentle trench couple days ago

the duration was changed on Jan 7 by this commit https://huggingface.co/spaces/TheStinger/UVR5_UI/commit/6c3badd8a1f54ec45a5b3973ffaafc7bb07a4cbe

did you run inference on the 18 min audio file after this date?

UVR5 UI update (#9) · TheStinger/UVR5_UI at 6c3badd

stark scarab Mar 10, 2025, 6:03 PM

#

I think this is Zero GPU related

#

U have quota or not?

covert lake Mar 10, 2025, 6:03 PM

#

gentle trench cringe

not their fault, HuggingFace has to set those limits because ZeroGPU is shared hardware

gentle trench Mar 10, 2025, 6:03 PM

#

covert lake not their fault, HuggingFace has to set those limits because ZeroGPU is shared h...

fair enough

gentle trench Mar 10, 2025, 6:03 PM

#

stark scarab U have quota or not?

I still have some

stark scarab Mar 10, 2025, 6:03 PM

#

With 60 seconds u can do even 20 min of audio

#

lemme test rq

gentle trench Mar 10, 2025, 6:04 PM

#

stark scarab lemme test rq

I can hand you the audio I'm trying to use

stark scarab Mar 10, 2025, 6:04 PM

#

sure

stark scarab Mar 10, 2025, 6:04 PM

#

stark scarab With 60 seconds u can do even 20 min of audio

cause it's so fast

stark scarab Mar 10, 2025, 6:05 PM

#

gentle trench I can hand you the audio I'm trying to use

Denoise right?

#

Btw mel denoise is better

gentle trench Mar 10, 2025, 6:05 PM

#

stark scarab Btw mel denoise is better

ah

gentle trench Mar 10, 2025, 6:06 PM

#

stark scarab Denoise right?

yes that's the last step I need to do

stark scarab Mar 10, 2025, 6:06 PM

#

alr

gentle trench Mar 10, 2025, 6:06 PM

#

denoise lite removed some of the lines so I swapped to regular denoise

stark scarab Mar 10, 2025, 6:07 PM

#

it worked for me

#

lol

gentle trench Mar 10, 2025, 6:07 PM

#

:misc_cry:

#

it don't like me lmao

#

could u send it?

covert lake Mar 10, 2025, 6:09 PM

#

gentle trench it don't like me lmao

click your pfp at the top right, you can see your zerogpu quota

stark scarab Mar 10, 2025, 6:10 PM

#

gimme a sec

gentle trench Mar 10, 2025, 6:10 PM

#

covert lake click your pfp at the top right, you can see your zerogpu quota

it's at this because it keeps aborting task meaning I have to retry over and over

stark scarab Mar 10, 2025, 6:10 PM

#

:yt_nails:

gentle trench Mar 10, 2025, 6:11 PM

#

:misc_cry:

stark scarab Mar 10, 2025, 6:11 PM

#

that quota should be enough

#

tbh

gentle trench Mar 10, 2025, 6:13 PM

#

stark scarab Btw mel denoise is better

which one?

stark scarab Mar 10, 2025, 6:14 PM

#

gentle trench which one?

First one

gentle trench Mar 10, 2025, 6:16 PM

#

stark scarab First one

this is the best reverb and echo one yea?

stark scarab Mar 10, 2025, 6:16 PM

#

gentle trench this is the best reverb and echo one yea?

nope

gentle trench Mar 10, 2025, 6:16 PM

#

:misc_cry:

#

which is?

stark scarab Mar 10, 2025, 6:16 PM

#

xd

stark scarab Mar 10, 2025, 6:16 PM

#

gentle trench could u send it?

https://huggingface.co/Eddycrack864/Separation/tree/main

stark scarab Mar 10, 2025, 6:17 PM

#

gentle trench which is?

https://github.com/Eddycrack864/UVR5-UI/blob/main/info/docs.md#best-models

gentle trench Mar 10, 2025, 6:18 PM

#

stark scarab https://huggingface.co/Eddycrack864/Separation/tree/main

thx ^^

stark scarab Mar 10, 2025, 6:19 PM

#

ur welcome

#

Pls download the files because I will delete them later.

gentle trench Mar 10, 2025, 6:19 PM

#

stark scarab Pls download the files because I will delete them later.

already have

junior spade Mar 10, 2025, 7:05 PM

#

https://vxtwitter.com/visegrad24/status/1898778576876368340

regal drum Mar 10, 2025, 7:37 PM

#

Is there any image to video for free that is also a bit decent ?

pale jetty Mar 10, 2025, 8:36 PM

#

I went to find a model that starts with han, and it's a dubbing model

gentle trench Mar 10, 2025, 8:47 PM

#

stark scarab https://github.com/Eddycrack864/UVR5-UI/blob/main/info/docs.md#best-models

do you know if I should change these or no?

#

I either am blind or this is brand new

stark scarab Mar 10, 2025, 8:51 PM

#

gentle trench do you know if I should change these or no?

For roformers it is better to leave it like that

gentle trench Mar 10, 2025, 9:39 PM

#

stark scarab For roformers it is better to leave it like that

👍

gray rover Mar 10, 2025, 9:49 PM

#

gentle trench do you know if I should change these or no?

Depends on which model you use
Overlap and segment size may improve results and coherence. In fact, some models have been made to operate best at specific settings (but those are specific values, which you may find in their respective configs)
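
To make the two settings concrete, here is a rough sketch of how a long audio array can be cut into overlapping segments before a model processes each one. This is not UVR's actual code: `segment_bounds` is a hypothetical helper, and it treats overlap as a number of samples, which is an assumption (the meaning of the overlap setting differs between model types).

```python
def segment_bounds(total_samples, segment_size, overlap):
    """Return (start, end) sample ranges that cover the audio.

    Neighbouring segments share `overlap` samples, so each segment
    starts `segment_size - overlap` samples after the previous one.
    """
    if segment_size <= 0 or not 0 <= overlap < segment_size:
        raise ValueError("need segment_size > 0 and 0 <= overlap < segment_size")
    hop = segment_size - overlap
    bounds = []
    start = 0
    while start < total_samples:
        end = min(start + segment_size, total_samples)
        bounds.append((start, end))
        if end == total_samples:
            break  # the last segment already reaches the end of the audio
        start += hop
    return bounds
```

More overlap means more segments for the same audio, so processing takes longer; the shared samples are what lets the results be blended across segment borders.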

gleaming sundial Mar 10, 2025, 9:49 PM

#

hi im new to this server and came for ai voices where can i find some

tepid basin Mar 10, 2025, 9:50 PM

#

gleaming sundial hi im new to this server and came for ai voices where can i find some

#1175430844685484042 or https://weights.com

gleaming sundial Mar 10, 2025, 9:50 PM

#

thanks

#

found this server from a stronger than you cover lol

gray rover Mar 10, 2025, 9:51 PM

#

lol

wispy flame Mar 10, 2025, 10:30 PM

#

Hi, I'm working on a small project for a course about AI influencer perception and creation (it’s entirely anonymous). Would anyone be interested in sharing their experiences?
Here are a few questions I’d be interested in:
• Do you follow AI influencers, like Lil Miquela or Aitana Lopez on Instagram?
• If yes, why? And to what extent does it matter to you that they are AI rather than real people?
• How do you interact with AI influencers?
• If you're a creator, what made you decide to create an AI influencer?
• Which social media platforms do you post on?
• How long have you been working on it?
• What was the creation process like? How did you decide on the influencer's appearance, and what were some challenges you faced?
• How has the reception and engagement been from users?
Thank you in advance for your help!

gentle trench Mar 10, 2025, 11:42 PM

#

🤨

weary temple Mar 10, 2025, 11:48 PM

#

gentle trench 🤨

i rember u

gentle trench Mar 10, 2025, 11:49 PM

#

weary temple i rember u

How so?

weary temple Mar 10, 2025, 11:49 PM

#

gentle trench How so?

we are in like 5 of da same servers

gentle trench Mar 11, 2025, 12:28 AM

#

weary temple we are in like 5 of da same servers

https://media.discordapp.net/attachments/811669657094193252/1116826243879407716/image0.gif

uneven narwhal Mar 11, 2025, 3:09 AM

#

question. anyone know why my custom voices are laggy?

solar torrent Mar 11, 2025, 3:13 AM

#

uneven narwhal question. anyone know why my custom voices are laggy?

What do you mean by your voice models are laggy?

uneven narwhal Mar 11, 2025, 3:14 AM

#

like as its speaking its cutting in and out at like regular intervals

solar torrent Mar 11, 2025, 3:18 AM

#

uneven narwhal like as its speaking its cutting in and out at like regular intervals

Do you use a voice changer?

uneven narwhal Mar 11, 2025, 3:19 AM

#

im using mmvc

solar torrent Mar 11, 2025, 3:19 AM

#

For W-Okada, go to #🔍│help-w-okada.

uneven narwhal Mar 11, 2025, 3:20 AM

#

oh ty

elder willow Mar 11, 2025, 5:14 AM

#

hai i have an 7900xtx anyone has ggml for it

pearl oyster Mar 11, 2025, 6:49 AM

#

Dh

jolly ravine Mar 11, 2025, 6:54 AM

#

Can someone help me out? I installed VB audio virtual cable. I got the cable input to work but the cable output isn't detecting my voice

gray rover Mar 11, 2025, 6:58 AM

#

jolly ravine Can someone help me out? I installed VB audio virtual cable. I got the cable inp...

#🔍│help-w-okada

gleaming sundial Mar 11, 2025, 7:05 AM

#

also do i make suggestions for ai voices in #1159516963014451302 or?

smoky basin Mar 11, 2025, 7:31 AM

#

elder willow hai i have an 7900xtx anyone has ggml for it

what do you want to run on it

elder willow Mar 11, 2025, 7:32 AM

#

smoky basin what do you want to run on it

i honestly forgot but now i switched to lm studio with sillytavern

smoky basin Mar 11, 2025, 7:32 AM

#

what are you actually using it for

elder willow Mar 11, 2025, 7:33 AM

#

LLMs? i dont get the question

#

it all works now tho

smoky basin Mar 11, 2025, 7:33 AM

#

i mean which llm model are you trying to test and then fine tune it?

elder willow Mar 11, 2025, 7:35 AM

#

think it was llamma 3

#

llama3

#

awesome cant type anymore

polar flax Mar 11, 2025, 7:38 AM

#

elder willow i honestly forgot but now i switched to lm studio with sillytavern

try deepseek r1 or qwq 32b in ollama
https://ollama.com/library/qwq

qwq

QwQ is the reasoning model of the Qwen series.

elder willow Mar 11, 2025, 7:40 AM

#

polar flax try deepseek r1 or qwq 32b in ollama https://ollama.com/library/qwq

oh i did see QwQ in the downloads tab in lm studio, is it any good?

#

am uh looking for model that has street accent not the formal way of speech like most models do

polar flax Mar 11, 2025, 7:43 AM

#

elder willow oh i did see QwQ in the downloads tab in lm studio, is it any good?

elder willow Mar 11, 2025, 7:44 AM

#

Very impressive

#

it beating a 600b parameter model doesnt make any sense to me but i will take it

polar flax Mar 11, 2025, 7:45 AM

#

it is open source

elder willow Mar 11, 2025, 7:45 AM

#

no i meant like.. uh idk american expression

#

i believe it

kindred pewter Mar 11, 2025, 7:45 AM

#

hey, are you guys working on AI voice agent,

elder willow Mar 11, 2025, 7:46 AM

#

elder willow am uh looking for model that has street accent not the formal way of speech like...

@polar flax can it do dis :3

polar flax Mar 11, 2025, 7:46 AM

#

elder willow no i meant like.. uh idk american expression

well lol

polar flax Mar 11, 2025, 7:47 AM

#

elder willow am uh looking for model that has street accent not the formal way of speech like...

if you know who has such an accent, go search in #🔍│find-models

#

or try a paid commission in #1191429836321849435 (it's less likely anyone will be willing to accept for free)

elder willow Mar 11, 2025, 8:19 AM

#

polar flax or try paid commission in <#1191429836321849435> (there are less likely anyone w...

or i can just train my own model no?

polar flax Mar 11, 2025, 8:20 AM

#

elder willow or i can just train my own model no?

you can once you get some youtube source, etc.

solar torrent Mar 11, 2025, 11:55 AM

#

elder willow or i can just train my own model no?

Writing "no" at the end of a question sounds like you won't be able to do the thing yourself.

#

coarse knoll Mar 11, 2025, 12:37 PM

#

solar torrent Writing "no" at the end of a question sounds like you won't be able to do thing ...

https://tenor.com/view/spongebob-patrick-star-noted-notes-gif-17474838830648097856

solar torrent Mar 11, 2025, 12:38 PM

#

coarse knoll https://tenor.com/view/spongebob-patrick-star-noted-notes-gif-174748388306480978...

https://tenor.com/view/tyler-the-creator-ttc-shocked-astonished-flabbergasted-gif-14787874159030444760

desert schooner Mar 11, 2025, 3:31 PM

#

uhh

#

where is mr. ai

#

@solar torrent

#

is that it..

compact owl Mar 11, 2025, 4:35 PM

#

hey

cobalt coyote Mar 11, 2025, 5:45 PM

#

Type shii

random mason Mar 11, 2025, 6:04 PM

#

chat is there any good ai voice changer

#

free

polar flax Mar 11, 2025, 6:05 PM

#

random mason chat is there any good ai voice changer

go to #🔍│help-w-okada and read the pinned guide

random mason Mar 11, 2025, 6:06 PM

#

it's weird

#

someone should lowk help me bc i'm confused

random mason Mar 11, 2025, 6:28 PM

#

polar flax go to <#1159290161683767298> and read the pinned guide

chat i have a question

gray rover Mar 11, 2025, 7:46 PM

#

:skull_sob:

night lake Mar 11, 2025, 7:46 PM

#

https://tenor.com/view/sad-emoji-sad-emoji-emoji-stare-disgust-gif-405076546600269050

gray rover Mar 11, 2025, 7:47 PM

#

:misc_true:

night lake Mar 11, 2025, 7:47 PM

#

https://discord.com/channels/1159260121998827560/1341216399372062823 best female model out there 🗣️ 🔥

gray rover Mar 11, 2025, 7:47 PM

#

At this point it's not even funny tbh

#

^

glad nebula Mar 11, 2025, 7:51 PM

#

torpid plank Mar 11, 2025, 7:59 PM

#

any new rvc ?

minor blade Mar 11, 2025, 8:05 PM

#

Best female model you can use is #1341216399372062823 message fr fr

#

:misc_trolley:

queen kernel Mar 11, 2025, 8:13 PM

#

covert lake Wait what happened to your account? I remember u liked helping here

Oh hi junior admin.

#

I just left discord for a few months. I'm busy with my studies and other stuff. So I'm not active on social media.

#

BTW I have installed F5 TTS and now I want to know how to use it for hindi language.

chilly lake Mar 11, 2025, 8:33 PM

#

queen kernel BTW I have installed F5 TTS and now I want to know how to use it for hindi langu...

#

https://github.com/SWivid/F5-TTS/blob/main/src/f5_tts/infer/SHARED.md#f5-tts-small--hi--springlab

supple zinc Mar 11, 2025, 10:01 PM

#

Hi friends, tell me, I've never used AI Voice. I want to make a female voice. Can anyone help? I would like to have a +- perfect voice

minor blade Mar 11, 2025, 10:10 PM

#

supple zinc Hi friends, tell me, I've never used AI Voice. I want to make a female voice. Ca...

You can either read the docs and learn how to make the model yourself, post a free/paid request asking for that voice or check the #1191429836321849435 and DM any model master to make your desired model.

thorny mural Mar 11, 2025, 10:11 PM

#

#🌁│image-creations

buoyant jungle Mar 11, 2025, 10:33 PM

#

sorry if im asking in the wrong channel but, why is my neural network so bad at answering questions? heres some specifics:

Vocabulary size: 9556
56863 examples of questions

heres my loss and gradient values

21:13:48.819 Epoch 1, Batch 1625/1634, Loss so far: 23.0259 - Server - Trainer:1225
21:13:53.358 Pre-clip gradient norm: 37.631250419106365 - Server - Trainer:567
21:13:58.123 Pre-clip gradient norm: 32.19987408267905 - Server - Trainer:567
21:14:02.882 Pre-clip gradient norm: 35.69306949856445 - Server - Trainer:567
21:14:07.645 Pre-clip gradient norm: 33.79843070049427 - Server - Trainer:567
21:14:12.429 Pre-clip gradient norm: 43.21654651604242 - Server - Trainer:567
21:14:12.645 Epoch 1, Batch 1630/1634, Loss so far: 23.0259 - Server - Trainer:1225
21:14:17.215 Pre-clip gradient norm: 33.041006426518194 - Server - Trainer:567
21:14:21.983 Pre-clip gradient norm: 38.30495534611363 - Server - Trainer:567
21:14:26.750 Pre-clip gradient norm: 29.091560354366152 - Server - Trainer:567
21:14:29.685 Pre-clip gradient norm: 25.484252136092554 - Server - Trainer:567
21:14:29.900 Epoch 1 completed. Average Loss: 23.0259 - Server - Trainer:1237
21:14:29.901 New best loss: 23.025850929940734 - Server - Trainer:1242
21:14:29.901 Loading best model with loss: 23.025850929940734 - Server - Trainer:1270
21:14:29.901 --- Testing after training ---
21:14:29.901 Question: How are you
21:14:30.342 Response: jewel proclamation less fully yon chares knocking suicide wassails license desires forked desk waste villainy
21:14:53.980 Model saved successfully as: trainedModel_v2 in 43 parts.

Note: Gradient norm rises from 5 to 30!

i use sanity checks to make sure its learning and it always hits the max value for the sanity check meaning its not learning at all. I am using what chatgpt said to be the best learning method of: Adam optimizer

Would anyone be able to help? Thanks!
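
One thing the log itself hints at: 23.0259 is what -ln(1e-10) comes out to. If the trainer floors probabilities at 1e-10 before taking the log (an assumption, the trainer code isn't shown), a loss stuck at exactly that value would mean the model gives practically zero probability to the correct word on every batch. That is worse than blind guessing, which for a 9556-word vocabulary is ln(9556), roughly 9.16. A quick check of the arithmetic:

```python
import math

vocab_size = 9556      # vocabulary size from the post
logged_loss = 23.0259  # average loss printed in the log

# Cross-entropy of a model that guesses uniformly over the vocabulary.
uniform_loss = math.log(vocab_size)

# Cross-entropy when the probability of the correct word is floored at 1e-10.
# The 1e-10 floor is an assumption; it is not confirmed by the post.
clamped_loss = -math.log(1e-10)

print(f"uniform-guess loss: {uniform_loss:.4f}")
print(f"floored loss:       {clamped_loss:.4f}")
print(f"matches the log:    {abs(clamped_loss - logged_loss) < 1e-3}")
```

If that reading is right, the output probabilities are saturating rather than learning slowly. Things worth checking first would be the learning rate (too high is a common cause), whether the target word indices line up with the output layer, and whether the softmax and loss are computed in a numerically stable way.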

cobalt coyote Mar 12, 2025, 1:01 AM

#

buoyant jungle sorry if im asking in the wrong channel but, why is my neuro network so bad at a...

Damn I wish i could know what u just type

#

Damn

#

It's so much complicated for my smol

#

1 cell brain

#

:sob_wave:

buoyant jungle Mar 12, 2025, 1:27 AM

#

lmao

#

i tried changing weights and matrix but it still sucks

buoyant jungle Mar 12, 2025, 1:28 AM

#

cobalt coyote Damn I wish i could know what u just type

21:14:29.901 New best loss: 23.025850929940734 - Server - Trainer:1242
21:14:29.901 Loading best model with loss: 23.025850929940734 - Server - Trainer:1270
21:14:29.901 --- Testing after training --- - Server - Trainer:1280
21:14:29.901 Question: How are you - Server - Trainer:1405
21:14:30.342 Response: jewel proclamation less fully yon chares knocking suicide wassails license desires forked desk waste villainy - Server - Trainer:1408
21:14:53.980 Model saved successfully as: trainedModel_v2 in 43 parts. - Server - Trainer:1332

GAH its horribel

queen kernel Mar 12, 2025, 2:09 AM

#

chilly lake

Yep. I have installed it, but why is it not working properly? It sounds so bad and even the pronunciation is not good. Also, sometimes it repeats words and sometimes it starts speaking text from the reference text.

#

Is there any proper guide to set this thing up? How do I set up the models and the ASR models? How can I use it to its full potential for better results?

chilly lake Mar 12, 2025, 2:17 AM

#

f5 is a new tts, there are some bugs

#

there's a length limit for inference

#

maybe try kokoro

queen kernel Mar 12, 2025, 2:35 AM

#

But it doesn't have hindi ?

chilly lake Mar 12, 2025, 2:35 AM

#

it does

#

queen kernel Mar 12, 2025, 2:37 AM

#

Can you send me the github link ?

chilly lake Mar 12, 2025, 2:50 AM

#

pip install "kokoro>=0.8.4" soundfile

queen kernel Mar 12, 2025, 2:52 AM

#

chilly lake pip install "kokoro>=0.8.4" soundfile

Just for information. Fish tts is good or not ?

chilly lake Mar 12, 2025, 2:53 AM

#

no hindi for fish speech

#

it is decent otherwise, paid version is better

queen kernel Mar 12, 2025, 4:10 AM

#

chilly lake it is decent otherwise, paid version is better

https://github.com/nazdridoy/kokoro-tts
So this is kokoro ?

chilly lake Mar 12, 2025, 5:22 AM

#

queen kernel https://github.com/nazdridoy/kokoro-tts So this is kokoro ?

this https://github.com/hexgrad/kokoro

GitHub

GitHub - hexgrad/kokoro: https://hf.co/hexgrad/Kokoro-82M

https://hf.co/hexgrad/Kokoro-82M. Contribute to hexgrad/kokoro development by creating an account on GitHub.

queen kernel Mar 12, 2025, 5:23 AM

#

queen kernel https://github.com/nazdridoy/kokoro-tts So this is kokoro ?

So what is this

polar flax Mar 12, 2025, 5:28 AM

#

queen kernel https://github.com/nazdridoy/kokoro-tts So this is kokoro ?

you can also try the zerogpu space in https://huggingface.co/spaces/hexgrad/Kokoro-TTS

Kokoro TTS - a Hugging Face Space by hexgrad

queen kernel Mar 12, 2025, 5:32 AM

#

polar flax you can also try the zerogpu space in https://huggingface.co/spaces/hexgrad/Koko...

I Want to use it locally.

polar flax Mar 12, 2025, 5:33 AM

#

chilly lake this https://github.com/hexgrad/kokoro

@queen kernel this is the original repo to clone

queen kernel Mar 12, 2025, 5:33 AM

#

I see.

fair fulcrum Mar 12, 2025, 9:08 AM

#

Guys

#

Do you guys know him?

#

Srijan

solar torrent Mar 12, 2025, 9:12 AM

#

fair fulcrum Do you guys know him?

Who is him?

polar flax Mar 12, 2025, 9:25 AM

#

~~his dad~~

solar torrent Mar 12, 2025, 9:37 AM

#

polar flax ~~his dad~~

That's crazy.

ebon mesa Mar 12, 2025, 11:37 AM

#

hello, I want to ask something: can I use the voice models in this server for TTS? If yes, what software do I need? I know how to use them in RVC, but can I also use them for TTS? Or is that a whole different thing?

solar torrent Mar 12, 2025, 11:44 AM

#

ebon mesa hello, I want to ask something, can I use the voice model in this serves for tts...

You've asked many questions at once, but let me answer each one for you:

1. Yes, you can use RVC voice models found in #1175430844685484042 with a TTS program, but you'll need a specific program for this.
2. The most recent program anyone can download and use is Applio, an RVC program. It has TTS built in.
3. RVC is speech-to-speech, while TTS stands for text-to-speech.

polar flax Mar 12, 2025, 11:50 AM

#

ebon mesa hello, I want to ask something, can I use the voice model in this serves for tts...

you can only use GPT sovits models in #1175430844685484042 for TTS

#

or find another server that supports more on TTS

pliant elm Mar 12, 2025, 11:55 AM

#

polar flax you can only use GPT sovits models in #1175430844685484042 for TTS

https://cdn.discordapp.com/attachments/1159822429137412177/1194008385113301003/svc.gif?ex=67d29136&is=67d13fb6&hm=8d419c3a3f0209ceb9f528291f233a9bd8efb66b22d662d270b9cf59f9f3ca63&

ebon mesa Mar 12, 2025, 11:57 AM

#

solar torrent You've asked many questions at once, but let me answer each one for you 1. Yes, ...

alright thanks!!

solar torrent Mar 12, 2025, 11:57 AM

#

pliant elm https://cdn.discordapp.com/attachments/1159822429137412177/1194008385113301003/s...

But so-vits-svc and GPT-SoVits aren't the same thing. misc_skull_distorted

ebon mesa Mar 12, 2025, 11:57 AM

#

polar flax you can only use GPT sovits models in #1175430844685484042 for TTS

okay thanks!!

covert lake Mar 12, 2025, 11:59 AM

#

pliant elm https://cdn.discordapp.com/attachments/1159822429137412177/1194008385113301003/s...

He means https://github.com/RVC-Boss/GPT-SoVITS

GitHub

GitHub - RVC-Boss/GPT-SoVITS: 1 min voice data can also be used to ...

1 min voice data can also be used to train a good TTS model! (few shot voice cloning) - RVC-Boss/GPT-SoVITS

pliant elm Mar 12, 2025, 11:59 AM

#

covert lake He means https://github.com/RVC-Boss/GPT-SoVITS

ikik

fair fulcrum Mar 12, 2025, 12:02 PM

#

solar torrent Who is him?

Idk him

solar torrent Mar 12, 2025, 12:03 PM

#

AruBlank

fair fulcrum Mar 12, 2025, 12:03 PM

#

He dm me

solar torrent Mar 12, 2025, 12:05 PM

#

If someone from a server you're in direct-messages you and you don't even know who they are, it could be spam or a scammer asking for something.

#

Well, I've seen your screenshot in my direct message. It seemed like he's trying to Diddy (groom) you online, thinking you are a girl.

polar flax Mar 12, 2025, 12:07 PM

#

fair fulcrum He dm me

report him to our staff

fair fulcrum Mar 12, 2025, 12:08 PM

#

Ok

solar torrent Mar 12, 2025, 12:09 PM

#

In this case, you can report the incident to the moderators here. People like this should not be on Discord.

polar flax Mar 12, 2025, 12:10 PM

#

steady background noise parts without voice or any distinct sounds

long compass Mar 12, 2025, 12:33 PM

#

Hello! I get this error at the Preprocess stage in Applio: “Error processing audio: Unable to allocate 5.62 GiB for an array with shape (2880, 261965) and data type float64”. Can I process the files piecemeal instead of all at once?

covert lake Mar 12, 2025, 12:40 PM

#

long compass Hello! Can you tell me if I get this error at Preprocess stage in Applio - “Erro...

what’s ur pc gpu and what are u using

polar flax Mar 12, 2025, 12:42 PM

#

long compass Hello! Can you tell me if I get this error at Preprocess stage in Applio - “Erro...

try re-exporting the dataset wav file(s) using audacity as 32-bit float WAV (not 64-bit lmao)

solar torrent Mar 12, 2025, 12:43 PM

#

With the 64-bit float (float64) data type, you get a larger file size.

#

32-bit float wav is always recommended.
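
Editor's note: the 5.62 GiB figure in the error can be reproduced from the reported shape and data type alone. The following is a minimal sketch that assumes nothing about Applio's internals beyond what the error message states; `array_bytes` is a hypothetical helper name, not part of any library.

```python
import math

def array_bytes(shape, itemsize):
    """Bytes needed for a dense array with the given shape and per-element size."""
    return math.prod(shape) * itemsize

GIB = 1024 ** 3
shape = (2880, 261965)                  # shape quoted in the error message

float64_bytes = array_bytes(shape, 8)   # float64 uses 8 bytes per element
float32_bytes = array_bytes(shape, 4)   # float32 uses 4 bytes per element

print(f"float64: {float64_bytes / GIB:.2f} GiB")
print(f"float32: {float32_bytes / GIB:.2f} GiB")
```

The float64 in the message describes the in-memory array. Whether it follows the bit depth of the WAV file depends on how the preprocessing code loads audio, which this sketch does not assume.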

long compass Mar 12, 2025, 12:44 PM

#

Oh, really, I'm sorry 😅 Thanks all!

polar flax Mar 12, 2025, 12:46 PM

#

solar torrent 32-bit float wav is always recommended.

even 32-bit has overkill dynamic range but still better than pcm formats that clip samples above 0 dB

long compass Mar 12, 2025, 12:47 PM

#

Oh, no... The 32-bit float file got even bigger when exported. Turns out I was using 16-bit PCM before

solar torrent Mar 12, 2025, 12:48 PM

#

chisecry

long compass Mar 12, 2025, 12:49 PM

#

Maybe because the long audio file is about an hour long? About 600-700MB. Total dataset size 20 hours

polar flax Mar 12, 2025, 1:25 PM

#

long compass Maybe because the long audio file is about an hour long? About 600-700MB. Total ...

it should be 700 MB for 1.5 hr audio and it should work

#

doesn't make sense if it's 20 hrs, unless in mp3 format which is also not ideal to do

long compass Mar 12, 2025, 1:31 PM

#

polar flax doesn't make sense if it's 20 hrs, unless in mp3 format which is also not ideal ...

Hmm, it's just that if I try it with only half of the dataset there is no error, so I thought it made sense. If I export the file to 32-bit float, the file size becomes 1.5GB 😁

polar flax Mar 12, 2025, 1:32 PM

#

long compass Hmm it's just that if I try to do it with only half of the dataset there is no e...

because it is stereo, but it will always be converted to mono in preprocessing
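
Editor's note: the file sizes discussed here follow from duration, sample rate, channel count and bytes per sample. A rough sketch, assuming uncompressed WAV data at 44.1 kHz (the sample rate is an assumption, the chat does not state it) and ignoring the small file header; `wav_data_bytes` is a hypothetical helper.

```python
def wav_data_bytes(seconds, sample_rate, channels, bytes_per_sample):
    """Size of the raw audio payload of an uncompressed WAV file, in bytes."""
    return int(seconds * sample_rate * channels * bytes_per_sample)

MB = 1000 ** 2
HOUR = 3600
RATE = 44100  # assumed sample rate, not stated in the chat

pcm16_stereo = wav_data_bytes(HOUR, RATE, channels=2, bytes_per_sample=2)
float32_stereo = wav_data_bytes(HOUR, RATE, channels=2, bytes_per_sample=4)
float32_mono = wav_data_bytes(HOUR, RATE, channels=1, bytes_per_sample=4)

for name, size in [("16-bit PCM stereo", pcm16_stereo),
                   ("32-bit float stereo", float32_stereo),
                   ("32-bit float mono", float32_mono)]:
    print(f"1 hour of {name}: {size / MB:.0f} MB")
```

Under these assumptions, going from 16-bit PCM to 32-bit float doubles the size, and dropping from stereo to mono halves it again.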

tranquil lantern Mar 12, 2025, 2:09 PM

#

@covert lake I need line of code or that 1 file that can fix the split bug infer for Applio Kaggle

#

or if anyone here know it

#

can send

covert lake Mar 12, 2025, 2:10 PM

#

tranquil lantern @covert lake I need line of code or that 1 file that can fix the split ...

I told you to use #✨│ai-help not here

And no I don't know about it, maybe @chilly lake does and can help you in that channel

#

Be patient and use the right channels pls

tranquil lantern Mar 12, 2025, 2:10 PM

#

okaywait

#

it's so down 😂

covert lake Mar 12, 2025, 2:11 PM

#

Noobies is an applio dev so maybe he knows the fix

tranquil lantern Mar 12, 2025, 2:11 PM

#

should've said yes when codename offered me to fix yesterday but it was nighttime

chilly lake Mar 12, 2025, 2:12 PM

#

the fix is in the main branch

polar flax Mar 12, 2025, 2:13 PM

#

yea delete this highlighted part to use the main branch

tranquil lantern Mar 12, 2025, 2:15 PM

#

polar flax yea delete this highlighted part to use the main branch

highlighted.. right?

polar flax Mar 12, 2025, 2:16 PM

#

tranquil lantern highlighted.. right?

yea that in kaggle

chilly lake Mar 12, 2025, 2:16 PM

#

also this part may not work with main

tranquil lantern Mar 12, 2025, 2:16 PM

#

lowkey why didn't they use the main branch in kaggle

chilly lake Mar 12, 2025, 2:17 PM

#

it is experimental

tranquil lantern Mar 12, 2025, 2:17 PM

#

chilly lake also this part may not work with main

are these things for training only

chilly lake Mar 12, 2025, 2:17 PM

#

yes

long compass Mar 12, 2025, 3:05 PM

#

Oh no.. I'm dumb. It seems trying to make a 48k model gave errors at preprocess stage because Sample Rate 40k works fine

spare tree Mar 12, 2025, 3:26 PM

#

Hello
I'm looking for an experienced Full Stack AI Engineer.

what you'll do

Develop and optimize the platform’s backend and frontend components, ensuring high performance and scalability.
Implement natural language query capabilities, integrating AI models to enhance system intelligence.
Process and visualize satellite imagery using proprietary algorithms for geospatial analysis.
Improve database architecture for efficient data retrieval and real-time analytics.
Work closely with data scientists to transition Jupyter Notebook-based Python scripts into frontend JavaScript for seamless visualization.
Design and implement interactive map-based visualizations using Mapbox or similar technologies.
Develop features such as comparison tools for analyzing environmental changes over time.
Collaborate with cross-functional teams to ensure smooth integration of machine learning models and geospatial analytics.
Optimize platform performance by identifying and resolving bottlenecks in data processing and rendering.

requirements

Strong proficiency in Python, particularly for geospatial or machine learning applications.
Experience with frontend development, ideally using Next.js or React.js (flexibility in frameworks is welcomed).
Solid understanding of database structures, optimization, and performance tuning.
Familiarity with geospatial analysis tools and libraries (e.g., GDAL, GeoPandas, QGIS, ArcGIS, Mapbox) is a plus.
Strong computer science, engineering, and problem solving skills equivalent to that of a solutions architect or systems designer.
Strong interest in satellite imagery, developing GIS applications and AI.
Ability to work independently and proactively identify technical improvements.
Familiarity with UX/UI principles and ability to enhance visual presentation of geospatial data.

If you're interested in this position, please DM me. Let's connect!

hollow dagger Mar 12, 2025, 3:54 PM

#

Ramadan mubarak

ionic pumice Mar 12, 2025, 4:35 PM

#

ramadan mubarak to you too talha

trail wind Mar 12, 2025, 4:36 PM

#

hi
what was the name of this program? I deleted it and forgot its name
could you write it for me?

final sluice Mar 12, 2025, 4:37 PM

#

Ramadan mubarak!!!

gray rover Mar 12, 2025, 4:38 PM

#

the fuck Ramadan mubarak means

#

cat_blush

final sluice Mar 12, 2025, 4:38 PM

#

gray rover the fuck Ramadan mubarak means

just a way to say happy ramadan

gray rover Mar 12, 2025, 4:39 PM

#

aaaand.. ramadan is 🤔 ?

final sluice Mar 12, 2025, 4:39 PM

#

like yall say merry christmas ig

gray rover Mar 12, 2025, 4:39 PM

#

ah

final sluice Mar 12, 2025, 4:39 PM

#

gray rover aaaand.. ramadan is 🤔 ?

a holy month in islamic calendar

gray rover Mar 12, 2025, 4:39 PM

#

Right 👀

gray rover Mar 12, 2025, 4:39 PM

#

final sluice a holy month in islamic calendar

Oooo

final sluice Mar 12, 2025, 4:39 PM

#

fasting 30 days straight... but worth it

gray rover Mar 12, 2025, 4:39 PM

#

oh yea, think I remember hearing about it somewhere

#

Anyway, thanks for letting me know

final sluice Mar 12, 2025, 4:40 PM

#

aye yw

gray rover Mar 12, 2025, 5:57 PM

#

show the ui

#

screenshot

#

your threshold is most likely set wrong

forest quarry Mar 12, 2025, 7:01 PM

#

gray rover oh yea, think I remember hearing about it somewhere

it is similar to lent in Christianity

chilly lake Mar 12, 2025, 7:17 PM

#

I gave up arguing with stupid people for lent

#

🙏

sharp rune Mar 12, 2025, 7:22 PM

#

Hey there, is anyone here from India?

elder willow Mar 12, 2025, 7:26 PM

#

yt_nails cat_stare skull_sob

#

cute_dogwave yt_nails misc_cry misc_baffled cat_stare

cosmic snow Mar 12, 2025, 8:25 PM

#

yo

wheat glen Mar 13, 2025, 3:35 AM

#

sharp rune Hey there ,does anyone from India here

hopefully not

gray rover Mar 13, 2025, 3:49 AM

#

uah

solar torrent Mar 13, 2025, 3:52 AM

#

sharp rune Hey there ,does anyone from India here

I'm not from India.

proud marsh Mar 13, 2025, 4:13 AM

#

Does anyone know or have any idea why this is the most popular model on weights.gg?

#

like, seriously, I dont get it

glad nebula Mar 13, 2025, 4:16 AM

#

proud marsh Does anyone knows or have idea of why this is the most popular model on weights....

meme from 1 year ago

solar torrent Mar 13, 2025, 4:16 AM

#

proud marsh Does anyone knows or have idea of why this is the most popular model on weights....

Saiba Momoi is a character from the mobile game Blue Archive. People are mostly using it for some memes like "oh my gotto, its thing" and "Nipah".

glad nebula Mar 13, 2025, 4:16 AM

#

https://knowyourmeme.com/memes/oh-my-gotto

Know Your Meme

Oh My Gotto | Know Your Meme

Oh My Gotto is a series of memes, including animations, in which Momoi, Aris or other characters from the video game Blue Archive say, "Oh my gotto, it's X

solar torrent Mar 13, 2025, 4:17 AM

#

momoishock

#

Yeah, that meme. midoriheh

proud marsh Mar 13, 2025, 4:18 AM

#

oh... i see

#

its funny, cus I never heard of this meme.

#

but the villager (which I know well was widely used as a meme for some reason) has half the uses

#

something different to see to say the least

weak plinth Mar 13, 2025, 5:06 AM

#

I just saw this and downloaded it, now I don't know where the model is 🤡

Do I need to download it separately? If yes, where?

polar flax Mar 13, 2025, 7:06 AM

#

weak plinth I just saw this and download it, now i don't know where the model is 🤡 Do i n...

you have got a nice console/pc, and now you need your favorite games to play that aren't pre-included 🤡
such as https://huggingface.co/GaboxR67/MelBandRoformers/

GaboxR67/MelBandRoformers · Hugging Face

twin fractal Mar 13, 2025, 9:18 AM

#

proud marsh Does anyone knows or have idea of why this is the most popular model on weights....

Used a lot in memes

queen kernel Mar 13, 2025, 11:07 AM

#

Hello. Can someone please help me to install kokoro tts ?

chilly lake Mar 13, 2025, 11:28 AM

#

pip install kokoro soundfile

tranquil lantern Mar 13, 2025, 12:41 PM

#

proud marsh Does anyone knows or have idea of why this is the most popular model on weights....

Racist jokes n word

#

https://tenor.com/view/blue-archive-ni-ga-gif-6122008057183992860

Tenor

solar torrent Mar 13, 2025, 12:41 PM

#

momoisad arissob

weary temple Mar 13, 2025, 12:59 PM

#

proud marsh Does anyone knows or have idea of why this is the most popular model on weights....

Momoi

solar torrent Mar 13, 2025, 1:53 PM

#

Some people I know in 2025 - luther (Kendrick Lamar AI Cover, vocals only) Dake Kanye karinthink FishEyeSanae

queen kernel Mar 13, 2025, 3:08 PM

#

chilly lake ``pip install kokoro soundfile``

That's all ?

chilly lake Mar 13, 2025, 4:43 PM

#

queen kernel That's all ?

well, you need python installed, use 3.10 or 3.11

queen kernel Mar 13, 2025, 4:48 PM

#

chilly lake well, you need python installed, use 3.10 or 3.11

Okay. Lemme try

silver bronze Mar 13, 2025, 4:49 PM

#

hey weights deleted the option to use youtube to choose a song for ai song creation. any new good apps for this purpose?

weary temple Mar 13, 2025, 5:10 PM

#

chilly lake well, you need python installed, use 3.10 or 3.11

i recommend 3.11

chilly lake Mar 13, 2025, 5:16 PM

#

3.11 has better error messages
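
Editor's note: a quick way to see which interpreter a terminal actually runs before installing packages. This is a small sketch. The version set below only mirrors the 3.10 / 3.11 suggestion made in this chat and is not an official support matrix. `is_suggested_python` is a hypothetical helper.

```python
import sys

# Versions suggested in the chat, not an official compatibility list.
SUGGESTED = {(3, 10), (3, 11)}

def is_suggested_python(version_info=sys.version_info):
    """True if the major.minor version is one of the suggested ones."""
    return (version_info[0], version_info[1]) in SUGGESTED

status = "suggested" if is_suggested_python() else "not one of the suggested versions"
print(sys.version.split()[0], "->", status)
```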

covert lake Mar 13, 2025, 6:19 PM

#

silver bronze hey weights deleted the option to use youtube to choose a song for ai song creat...

just download the youtube video urself and use it as an input in weights.com

#

either pay for youtube premium, or use cobalt, or yt dlp, or literally google "free youtube video downloader site"

chilly lake Mar 13, 2025, 6:56 PM

#

and get a virus 🙂

#

I think I used this https://github.com/StefanLobbenmeier/youtube-dl-gui

elder willow Mar 13, 2025, 7:00 PM

#

Would anyone help me find a simple male voice model? No celeb, no anime, no weird voice. Just high quality normal speaker

river adder Mar 13, 2025, 7:01 PM

#

polar flax you have got a nice console/pc, and now you need your favorite games to play tha...

You ate with that

#

Fucking slayed

covert lake Mar 13, 2025, 7:04 PM

#

chilly lake and get a virus 🙂

Yt dlp and cobalt are safe

covert lake Mar 13, 2025, 7:04 PM

#

elder willow Would anyone help me find a simple male voice model? No celeb, no anime, no weir...

You can search rvc ai voice models at:

#1175430844685484042
In #🔍│find-models , Do /find with @hidden grotto
https://weights.com/ (login required)
https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
https://voice-models.com/
https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)

if there isnt one, you can:

#1159289738314919936
#1191429836321849435
make it yourself with our docs guides https://docs.aihub.gg/essentials/how-to-make-voice-models/

hidden grottoBOT Mar 13, 2025, 7:04 PM

#

covert lake You can search rvc ai voice models at: - <#1175430844685484042> - In <#11635920...

:wave: @covert lake, How can I help?

Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image

covert lake Mar 13, 2025, 7:04 PM

#

AI has to be trained on something

chilly lake Mar 13, 2025, 7:09 PM

#

covert lake Yt dlp and cobalt are safe

i mean googling something 'free' and ending up on a download page with 10 fake 'Download' buttons with virus links

covert lake Mar 13, 2025, 7:11 PM

#

chilly lake i mean googling something 'free' and end ending up on a download page with 10 fa...

Smh

elder willow Mar 13, 2025, 7:13 PM

#

covert lake You can search rvc ai voice models at: - <#1175430844685484042> - In <#11635920...

I know where and how to search, I am just not able to find a simple male speaker with not overused voice.

bold yoke Mar 13, 2025, 7:15 PM

#

/chirp

#

/create

covert lake Mar 13, 2025, 7:23 PM

#

bold yoke /chirp

There's no Suno AI bot

#

Promos ain't allowed and will be deleted

orchid pasture Mar 13, 2025, 9:10 PM

#

do you have any model of calin georgescu?

fresh folio Mar 13, 2025, 10:32 PM

#

i see

fathom flax Mar 13, 2025, 10:48 PM

#

what's the best vocal isolation today that is free?

gray rover Mar 13, 2025, 10:48 PM

#

fathom flax whats the best vocal isloation today that is free?

uvr / mvsep

#

https://docs.google.com/document/d/17fjNvJzj8ZGSer7c7OFe_CNfUKbAxEh_OBv94ZdRG5c/edit?tab=t.0

Google Docs

Instrumental, vocal & other stems separation & mix/master guide - U...

edit 13.03.25 deton24’s Instrumental and vocal & stems separation & mastering (UVR 5 GUI: VR/MDX-Net/MDX23C/Demucs 1-4, and BS/Mel-Roformer in beta MVSEP-MDX23-Colab/KaraFan/drumsep/LarsNet/SCNet x-minus.pro (uvronline.app)/mvsep.com/ GSEP/Dango.ai/Audioshake/Music.ai) General reading advice | D...

#

This should be helpful

#

Overviews, info on models used in uvr / mvsep and much more. Generally a 101 guide

#

Other than that, there's " audio separation " discord server ( google it ) where uvr and mvsep devs are, helpful and informative community

#

But I personally recommend using gabox's voc fv4 model for vocals / voice

fathom flax Mar 13, 2025, 10:51 PM

#

gray rover uvr / mvsep

thanks!
and right now what's the best free way to train a model? I haven't done that for months

gray rover Mar 13, 2025, 10:52 PM

#

fathom flax thanks! and right now whats the best free way to train a model, havent been that...

you mean rvc models?

fathom flax Mar 13, 2025, 10:52 PM

#

yes

gray rover Mar 13, 2025, 10:52 PM

#

Well, if not locally then colab or kaggle really

fathom flax Mar 13, 2025, 10:52 PM

#

I've already done the one with Google

#

there is something new?

gray rover Mar 13, 2025, 10:52 PM

#

But new in what way? What are your expectations?

#

If you're asking quite literally about something better than rvc / applio itself? then no

fathom flax Mar 13, 2025, 10:53 PM

#

I mean something where I don't have to work too much

#

like

#

drop a clean 30 minutes file

#

and then wait

#

🥲

gray rover Mar 13, 2025, 10:53 PM

#

Unfortunately no, training good models requires a bit of work

fathom flax Mar 13, 2025, 10:53 PM

#

there is an easy guide?

gray rover Mar 13, 2025, 10:53 PM

#

check ai hub's docs

#

or research channels on this discord

fathom flax Mar 13, 2025, 10:54 PM

#

its updated?

gray rover Mar 13, 2025, 10:54 PM

#

I think so ye

#

In any case, you can leave a msg here and some helpers ( hopefully ) could help you out with stuff

#

Alternatively, ask on #✨│ai-help

fathom flax Mar 13, 2025, 10:55 PM

#

thanks!

#

@gray rover do you know if youtube support FLAC?

gray rover Mar 13, 2025, 11:06 PM

#

fathom flax @gray rover do you know if youtube support FLAC?

as in?

#

you mean in terms of ripping the audio?

fathom flax Mar 13, 2025, 11:06 PM

#

like, if the audio on YouTube is already compressed by them, then I'm only downloading a larger file but with m4a quality

gray rover Mar 13, 2025, 11:06 PM

#

oh

#

the thing with youtube is

#

any audio people upload, ends up getting dynamically compressed ( volume dynamics ) and undergoes general compression ( codec wise )

#

all of that is either opus or aac

#

( it's why people shouldn't use stuff like yt to mp3, because you'd further compress the opus or such to mp3 )

fathom flax Mar 13, 2025, 11:07 PM

#

so whats the best way?

#

can i use spotify? apple music?

gray rover Mar 13, 2025, 11:07 PM

#

yt-dlp imo

#

cli tool for downloading

#

the command would be:

yt-dlp.exe -x URL

#

the -x (--extract-audio) argument tells the program to extract only the audio; by default yt-dlp picks the best available quality from their servers

gray rover Mar 13, 2025, 11:08 PM

#

mostly it's .opus or .webm ( which will contain opus )

#

rarely aac

#

then you'd use ffmpeg to convert opus to wave

fathom flax Mar 13, 2025, 11:08 PM

#

yo wait

#

too much

#

i aint following

gray rover Mar 13, 2025, 11:09 PM

#

Here, you download the .exe
https://github.com/yt-dlp/yt-dlp

GitHub

GitHub - yt-dlp/yt-dlp: A feature-rich command-line audio/video dow...

A feature-rich command-line audio/video downloader - yt-dlp/yt-dlp

#

You run it like so

#

cmd can be opened by typing cmd in the folder's address bar

#

Here

#

the url is your youtube video's link

#

then ( as long you have ffmpeg installed and properly added to path / configured )

fathom flax Mar 13, 2025, 11:10 PM

#

oh ok

gray rover Mar 13, 2025, 11:10 PM

#

In the same command line window

#

ffmpeg -i input.opus output.wav

fathom flax Mar 13, 2025, 11:11 PM

#

I think I downloaded ffmpeg in the past

gray rover Mar 13, 2025, 11:11 PM

#

gray rover > ffmpeg -i input.opus output.wav

input is whatever yt-dlp downloaded, and output is the converted WAV file ( just name it whatever you want and have it end in .wav )

fathom flax Mar 13, 2025, 11:11 PM

#

so how the whole line should be?

gray rover Mar 13, 2025, 11:12 PM

#

yt-dlp.exe -x URL
for downloading stuff

ffmpeg -i input_from_yt.opus output_from_ffmpeg.wav
For conversion of opus files to wave

#

You'll then get 44.1 kHz WAV files
( and keep them that way end-to-end. If you work on those files, process or denoise them, or whatever, always export them as 44.1 kHz WAV. Those will go to RVC )

#

And that's pretty much all there is to it

#

Nothing too crazy
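
Editor's note: the two commands above can be wrapped in a small script. This is only a sketch under stated assumptions. It assumes `yt-dlp` and `ffmpeg` are on PATH. The helper names `download_cmd`, `convert_cmd` and `run` are hypothetical. The `-ar 44100` flag is an addition that is not in the chat. It pins the output to the 44.1 kHz rate mentioned above instead of inheriting the source's rate (Opus audio typically decodes at 48 kHz). By default the script only prints the commands.

```python
import shlex
import subprocess
from pathlib import Path

def download_cmd(url, exe="yt-dlp"):
    """Command that asks yt-dlp to extract the audio track (-x)."""
    return [exe, "-x", url]

def convert_cmd(src, sample_rate=44100):
    """Command that converts a downloaded file to WAV at an explicit sample rate."""
    src = Path(src)
    # Follows the naming habit from the chat: blabla.opus -> blablaWAV.wav
    dst = src.with_name(src.stem + "WAV.wav")
    return ["ffmpeg", "-i", str(src), "-ar", str(sample_rate), str(dst)]

def run(cmd, dry_run=True):
    """Print the command, and execute it only when dry_run is False."""
    print(shlex.join(cmd))
    if not dry_run:
        subprocess.run(cmd, check=True)

run(download_cmd("URL"))          # replace URL with the video link
run(convert_cmd("blabla.opus"))   # replace with the file yt-dlp produced
```

Calling `run(..., dry_run=False)` executes the command for real. On Windows the executable may need to be given as `yt-dlp.exe` when it sits in the current folder.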

fathom flax Mar 13, 2025, 11:13 PM

#

hold on a second

#

i first need to download the file

#

then i open again cmd?

gray rover Mar 13, 2025, 11:14 PM

#

As you can see in the folder there's the yt-dlp.exe file

#

it's the one from github

fathom flax Mar 13, 2025, 11:14 PM

#

yes

gray rover Mar 13, 2025, 11:14 PM

#

It downloads the yt audio to that folder

fathom flax Mar 13, 2025, 11:14 PM

#

i downloaded right now the first clip

gray rover Mar 13, 2025, 11:14 PM

#

yes, check its properties

fathom flax Mar 13, 2025, 11:14 PM

#

opus

gray rover Mar 13, 2025, 11:15 PM

#

yes, in that case:

ffmpeg -i yourfile.opus yourfile.wav

#

-i is an argument for input

#

I name my stuff as songWAV.wav ( the output from ffmpeg )

#

to avoid confusion

fathom flax Mar 13, 2025, 11:15 PM

#

so i need to first change the file name?

gray rover Mar 13, 2025, 11:15 PM

#

nope

fathom flax Mar 13, 2025, 11:15 PM

#

because it's too long

gray rover Mar 13, 2025, 11:16 PM

#

just add wav suffix

#

before extension

#

gonna help you keep it clean

#

if you download a lot of stuff ( and keep opus copies )

#

tho ye, you can rename stuff ofc

#

for the output the name doesn't matter

fathom flax Mar 13, 2025, 11:16 PM

#

let's say I download via yt-dlp a file named blabla.opus

gray rover Mar 13, 2025, 11:16 PM

#

ye, then that's for input, output you can name it whatever

fathom flax Mar 13, 2025, 11:16 PM

#

what will the line be?

gray rover Mar 13, 2025, 11:17 PM

#

ffmpeg -i blabla.opus blablaamazing123.wav

hollow dagger Mar 13, 2025, 11:17 PM

#

Hello

fathom flax Mar 13, 2025, 11:17 PM

#

oh i see

gray rover Mar 13, 2025, 11:17 PM

#

yup, pretty simple

fathom flax Mar 13, 2025, 11:17 PM

#

let me try it

#

where can i upload images?

#

or i can dm you

hollow dagger Mar 13, 2025, 11:21 PM

#

I need some sample data to practise creating a chat agent, such as a business, and I need a lot of them so I can create a lot of chatbots. Could you guys please help me with this?

warm sapphire Mar 14, 2025, 1:19 AM

#

hey guys what is the best model for realistic female voice ?

night lake Mar 14, 2025, 1:32 AM

#

warm sapphire hey guys what is the best model for realistic female voice ?

this: https://discord.com/channels/1159260121998827560/1341216399372062823

polar flax Mar 14, 2025, 1:45 AM

#

night lake this: https://discord.com/channels/1159260121998827560/1341216399372062823

https://tenor.com/view/jeff-land-shark-marvel-rivals-video-game-gif-10022472975975249

Tenor

lethal flax Mar 14, 2025, 1:54 AM

#

is there a locally running app of any sort that lets me inpaint/remove items from videos?

solar torrent Mar 14, 2025, 2:18 AM

#

lethal flax is there a locally running app of any sort that lets me inpaint/remove items fro...

I don't think there's any AI tool that can edit videos in bulk. There are websites that generate a video from a frame of a video or an image.

polar flax Mar 14, 2025, 2:36 AM

#

lethal flax is there a locally running app of any sort that lets me inpaint/remove items fro...

only those that can swap faces

deft surge Mar 14, 2025, 4:22 AM

#

Bandidu não dança dança

#

Bandidu ginga e balança 🔥 🔥 🥶 ☝️ ☝️ 😂 😂 😂

ionic pumice Mar 14, 2025, 5:21 AM

#

fr whatever that means

grand breach Mar 14, 2025, 5:27 AM

#

What are our thoughts on Grok 3?

solar torrent Mar 14, 2025, 5:27 AM

#

Never used Grok.

grand breach Mar 14, 2025, 5:27 AM

#

what LLM do you use

#

Claude 3.7 sonnet thinking is really good at code

solar torrent Mar 14, 2025, 5:29 AM

#

grand breach what LLM do you use

Microsoft Copilot skull_goofy

grand breach Mar 14, 2025, 5:29 AM

#

solar torrent Microsoft Copilot skull_goofy

aintnoway

#

LOL

polar flax Mar 14, 2025, 6:10 AM

#

solar torrent Microsoft Copilot skull_goofy

https://www.tomshardware.com/video-games/xbox/xbox-announces-copilot-for-gaming-ai-assistant-early-access-coming-to-xbox-mobile-app-more-details-to-come-at-gdc-2025

Tom's Hardware

Xbox announces 'Copilot for Gaming' AI assistant — early access com...

Microsoft turns Copilot into an actual copilot

dusk geyser Mar 14, 2025, 7:02 AM

#

hot take: gabox fv4 is the best model

queen kernel Mar 14, 2025, 7:34 AM

#

chilly lake well, you need python installed, use 3.10 or 3.11

What do I have to do after this?

#

I have installed kokoro

chilly lake Mar 14, 2025, 10:47 AM

#

queen kernel I have installed kokoro

activate the virtual environment if you made any, then use a script

📎 run.py

#

each line comes out as a separate file, but they can be merged into one
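
Editor's note: the attached run.py is not reproduced in this log. As an illustration of the merging step only, here is a minimal sketch using Python's standard `wave` module. It assumes the per-line outputs are PCM WAV files that share the same channel count, sample width and sample rate. `merge_wavs` is a hypothetical helper, not part of kokoro.

```python
import wave

def merge_wavs(paths, out_path):
    """Concatenate PCM WAV files that share channels, sample width and rate."""
    params = None
    chunks = []
    for path in paths:
        with wave.open(str(path), "rb") as src:
            fmt = (src.getnchannels(), src.getsampwidth(), src.getframerate())
            if params is None:
                params = fmt
            elif fmt != params:
                raise ValueError(f"{path} has format {fmt}, expected {params}")
            chunks.append(src.readframes(src.getnframes()))
    if params is None:
        raise ValueError("no input files given")
    with wave.open(str(out_path), "wb") as dst:
        dst.setnchannels(params[0])
        dst.setsampwidth(params[1])
        dst.setframerate(params[2])
        for chunk in chunks:
            dst.writeframes(chunk)

# Example call, assuming files named 0.wav, 1.wav, 2.wav exist:
# merge_wavs(["0.wav", "1.wav", "2.wav"], "merged.wav")
```

The format check is deliberate: silently joining files with different sample rates would produce audio that plays at the wrong speed.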

queen kernel Mar 14, 2025, 10:50 AM

#

chilly lake each line comes out as a separate file, but they can be merged into one

I was waiting for your response 😭

#

Thank you

chilly lake Mar 14, 2025, 10:53 AM

#

why, the github has an example script

queen kernel Mar 14, 2025, 10:58 AM

#

chilly lake why, the github has an example script

Got an error

#

Espeak is not installed

#

That's why I was waiting for your reply

#

What to do

chilly lake Mar 14, 2025, 10:59 AM

#

read the github page

queen kernel Mar 14, 2025, 10:59 AM

#

I have installed espeak

chilly lake Mar 14, 2025, 10:59 AM

#

environment variable

queen kernel Mar 14, 2025, 11:02 AM

#

I just created environment variables but with different names. My bad

chilly lake Mar 14, 2025, 11:02 AM

#

use a new terminal window after that
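
Editor's note: environment variables are read when a process starts, which is why a new terminal is needed after changing them. The sketch below shows one way to check, from the same terminal that will run the script, whether an espeak binary is on PATH and whether a given variable is visible. The binary names are the common ones. The variable name in the example call is a placeholder, not the name kokoro's documentation asks for, so substitute the names from the GitHub page.

```python
import os
import shutil

def find_espeak(names=("espeak-ng", "espeak")):
    """Return the path of the first espeak binary found on PATH, or None."""
    for name in names:
        path = shutil.which(name)
        if path:
            return path
    return None

def visible_env(names):
    """Map each variable name to its value in this process (None if unset)."""
    return {name: os.environ.get(name) for name in names}

print("espeak binary:", find_espeak())
# Placeholder variable name; use the ones from the project's README instead.
print(visible_env(["EXAMPLE_ESPEAK_VARIABLE"]))
```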

queen kernel Mar 14, 2025, 11:04 AM

#

I restarted my PC.. lemme see if it works.

#

I was asking questions in the kokoro discord server and they were very rude to me 🥺

queen kernel Mar 14, 2025, 11:09 AM

#

chilly lake use a new terminal window after that

It said failed to load voice "hi"

stuck ridge Mar 14, 2025, 11:20 AM

#

i have rx 7600 and a rode mic

#

how do i make this work chat

queen kernel Mar 14, 2025, 11:30 AM

#

stuck ridge how do i make this work chat

What?

icy pendant Mar 14, 2025, 11:31 AM

#

stuck ridge how do i make this work chat

You should specify what "this" is but i assume you want realtime voice changer

https://rentry.co/ForkVoiceChangerGuide

Download AMD version, virtual cable, read audio setup, model upload etc.

For questions ask in #🔍│help-w-okada

Guide for deiteris' optimized W-Okada RealTime Voice Changer Client...

Thanks vtarcelia for corrections, Nick088 for contributions. Most technical information comes from deiteris.
Latest Version b2332 from December 2024
RTX 5000 series support is here, but not integrated into w-okada itself, it is a stand-alone release. You can get it from here
Translations (outdate...

stuck ridge Mar 14, 2025, 11:39 AM

#

ok! thanks

chilly lake Mar 14, 2025, 11:53 AM

#

queen kernel It said failed to load voice "hi"

#

you need to actually use 'h' as lang code, not 'hi'

#

and the voice name from the list
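
Editor's note: a sketch of what the lang code and voice name advice could look like in a script. It follows the usage pattern of the example in the hexgrad/kokoro README as best recalled (`KPipeline(lang_code=...)`, then iterating over `pipeline(text, voice=...)`). The voice name `hf_alpha`, the 24000 Hz rate and the helper names are assumptions to be checked against the project's voice list. The imports are done lazily so that the small check at the top works without kokoro installed.

```python
def voice_matches_lang(lang_code, voice):
    """Kokoro voice names appear to start with the one-letter language code."""
    return len(lang_code) == 1 and voice.startswith(lang_code)

def synthesize_lines(text, lang_code="h", voice="hf_alpha", sample_rate=24000):
    """Write one WAV file per generated segment and return the file names."""
    # Lazy imports: these need `pip install kokoro soundfile`.
    from kokoro import KPipeline
    import soundfile as sf

    if not voice_matches_lang(lang_code, voice):
        raise ValueError(f"voice {voice!r} does not match lang code {lang_code!r}")

    pipeline = KPipeline(lang_code=lang_code)   # 'h', not 'hi'
    paths = []
    for i, (graphemes, phonemes, audio) in enumerate(pipeline(text, voice=voice)):
        path = f"{i}.wav"
        sf.write(path, audio, sample_rate)
        paths.append(path)
    return paths

# Example call (downloads model weights on first use):
# synthesize_lines("नमस्ते, आप कैसे हैं?")
```

The check only catches the mismatch described in the chat (a two-letter code such as 'hi'); it does not verify that the voice actually exists.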

real iron Mar 14, 2025, 11:59 AM

#

I want a very good female voice model

queen kernel Mar 14, 2025, 12:00 PM

#

chilly lake you need to actually use 'h' as lang code, not 'hi'

I did the same. But getting error failed to load voice "hi"

chilly lake Mar 14, 2025, 12:09 PM

#

queen kernel I did the same. But getting error failed to load voice "hi"

you should've provided the full error, not just the last line

queen kernel Mar 14, 2025, 12:10 PM

#

chilly lake you should've provided the full error, not jhust the last line

Oops sorry.. lemme send you

#

chilly lake Mar 14, 2025, 12:21 PM

#

is the key; it is an espeak problem

#

so Hindi goes through a separate phonemizer that you need installed, and I have no idea which one

#

you need to post an issue on Kokoro's GitHub

#

someone may answer

solar torrent Mar 14, 2025, 2:16 PM

#

polar flax https://www.tomshardware.com/video-games/xbox/xbox-announces-copilot-for-gaming-...

solar torrent Mar 14, 2025, 2:50 PM

#

No, thanks.

#

I do things that don't really get me bored.

tough beacon Mar 14, 2025, 3:33 PM

#

jakkari jakkari

knotty meteor Mar 14, 2025, 4:49 PM

#

is there an ai site or something like that that can help me modify some text on a photo

#

?

bitter wagon Mar 14, 2025, 4:53 PM

#

could someone suggest an available voice model that sounds like a mature man with a deep voice

supple stream Mar 14, 2025, 7:31 PM

#

knotty meteor is there an ai site or something like that that can help me modify some text on ...

you mean an image generator?

topaz niche Mar 14, 2025, 7:40 PM

#

Hi

#

Where’s the ai

covert lake Mar 14, 2025, 10:32 PM

#

topaz niche Where’s the ai

elaborate

gentle cipher Mar 14, 2025, 10:47 PM

#

covert lake elaborate

I need help

#

Are you an admin?

covert lake Mar 14, 2025, 10:49 PM

#

gentle cipher I need help

Elaborate the issue in specific help channels based on the software you need help with

covert lake Mar 14, 2025, 10:49 PM

#

gentle cipher Are you an admin?

Junior admin

gentle cipher Mar 14, 2025, 10:49 PM

#

covert lake Elaborate the issue in specific help channels based on the software you need hel...

Ok

mental pelican Mar 14, 2025, 11:55 PM

#

somebody have juan gabriel link of hugging face

sterile stream Mar 15, 2025, 12:03 AM

#

Hello. I am new to using AI tools and was wondering if I could be given any tips on how to get into new AIs and the idea of using AI other than ChatGPT. The only tool I have used so far (almost the only one; other uses were major) is ChatGPT for coding.

#

And also have gotten advice and information from it

#

I would love advice on AI for coding. i don't know much coding so I would love advice on AI that helps you understand. I am also willing to pay for premium versions of AI at around $20 a month

polar flax Mar 15, 2025, 12:13 AM

#

sterile stream I would love advice on AI for coding. i don't know much coding so I would love a...

claude 3.7, if you want the project feature, check here
https://tactiq.io/learn/chatgpt-project-vs-claude-project

ChatGPT Project vs. Claude Project: Which One is Better?

ChatGPT Project vs. Claude Project: Which is better? Compare features, performance & AI capabilities to find the best choice for your needs.

woven magnet Mar 15, 2025, 12:16 AM

#

Hi. I am also new to using AI tools. I want to learn about generative AI to build a TikTok channel that uses AI to make videos, but I don't know where to start. Can anyone give me some advice? (btw I don't have any background in AI)

gray rover Mar 15, 2025, 12:44 AM

#

Tone it down

#

🤔 now that's some crashing out

#

counter strike might be stressful m8 but you should chill

simple gate Mar 15, 2025, 2:35 AM

#

is Ilaria RVC on huggingface.co down ? been a week i cant convert

floral cairn Mar 15, 2025, 3:07 AM

#

yall know a free alternative to Krea AI Training? Like a model that you feed it images and it creates images like those

polar flax Mar 15, 2025, 3:10 AM

#

floral cairn yall know a free alternative to Krea AI Training? Like a model that you feed it ...

comfyui, fluxgym, etc

sour temple Mar 15, 2025, 3:14 AM

#

a

floral cairn Mar 15, 2025, 3:29 AM

#

polar flax comfyui, fluxgym, etc

any online one? I see both of those are like code or something

solar torrent Mar 15, 2025, 6:46 AM

#

floral cairn any online one? I see both of those are like code or something

You can train a Flux.1 image model on Weights.com.

solemn wren Mar 15, 2025, 9:51 AM

#

yo guys do you know any website i can turn MIDI files into mp3 vocals?

#

preferably free

strange wraith Mar 15, 2025, 9:53 AM

#

A convert?

#

Send to me I got you g

ionic pumice Mar 15, 2025, 9:57 AM

#

synthv, vocaloid, or cevio

solar torrent Mar 15, 2025, 9:58 AM

#

You can use a soundfont full of spoken vocals, and convert that MIDI file into mp3.

strange wraith Mar 15, 2025, 9:58 AM

#

He might not know how to use them. But yeah those work as well boss

void dock Mar 15, 2025, 10:17 AM

#

hey....
hey ! i am new here . i am software engineer . i am working on model training projects ..

strange wraith Mar 15, 2025, 10:19 AM

#

Awesome welcome !

lyric rapids Mar 15, 2025, 10:20 AM

#

guys whats the difference between w ocada voice changer and rvc???

strange wraith Mar 15, 2025, 10:21 AM

#

RVC is for training voice models and running inference on recorded audio like music vocals

#

W-Okada is a real-time voice changer that makes you sound like the chosen voice on the spot

#

In games, Discord, etc.

lyric rapids Mar 15, 2025, 10:21 AM

#

okk

strange wraith Mar 15, 2025, 10:21 AM

#

TTS is spoken word to voice

#

🙂

#

If you need any help getting started feel free to reach out

solar torrent Mar 15, 2025, 10:22 AM

#

The correct name for the realtime voice changer program is W-Okada, not W Ocada.

strange wraith Mar 15, 2025, 10:22 AM

#

My man

#

Sup namari

lyric rapids Mar 15, 2025, 10:22 AM

#

strange wraith TTS is spoken word to voice

so i should use w okada to change my voice on discord calls right

strange wraith Mar 15, 2025, 10:22 AM

#

Yes

lyric rapids Mar 15, 2025, 10:22 AM

#

strange wraith Yes

plus is there an easier software other than okada

#

like catfish or i forgot the name?

strange wraith Mar 15, 2025, 10:23 AM

#

Doesn’t work as well imo

#

If you need help with setting it up I can get you

solar torrent Mar 15, 2025, 10:23 AM

#

Don't use W-Okada for catfishing someone.

strange wraith Mar 15, 2025, 10:23 AM

#

He’s asking if the other program

#

Is named that

#

No it’s not

lyric rapids Mar 15, 2025, 10:23 AM

#

strange wraith Doesn’t work as well imo

oh ok

strange wraith Mar 15, 2025, 10:23 AM

#

It’s more for like

#

Having fun

lyric rapids Mar 15, 2025, 10:24 AM

#

strange wraith If you need help with setting it up I can get you

when i set it up, if i need help i'll ping u

strange wraith Mar 15, 2025, 10:24 AM

#

Not pretending to be someone and scaring people

#

That’s illegal

solar torrent Mar 15, 2025, 10:24 AM

#

Don't use Voice.ai. It is a scam site that tries to eat up more of your PC's resources than W-Okada does.

#

For W-Okada, let's go to #🔍│help-w-okada.

lyric rapids Mar 15, 2025, 10:24 AM

#

solar torrent Don't use Voice.ai. It is a scam site that trying to eat your PC more than W-Oka...

wdym more than w okada?

strange wraith Mar 15, 2025, 10:24 AM

#

It uses more pc

#

Than needed

lyric rapids Mar 15, 2025, 10:24 AM

#

https://youtu.be/FzU-H_-zOc4

solar torrent Mar 15, 2025, 10:25 AM

#

lyric rapids wdym more than w okada?

Performance. What do I mean?

pine acornBOT Mar 15, 2025, 10:25 AM

#

Staff Applications Open

We're looking for dedicated team members to help grow and manage our community! If you're passionate about AI and want to contribute, apply now!

Click here to apply!

lyric rapids Mar 15, 2025, 10:25 AM

#

lyric rapids https://youtu.be/FzU-H_-zOc4

what abt this one?

lyric rapids Mar 15, 2025, 10:25 AM

#

strange wraith Than needed

ok if i press stop in there

solar torrent Mar 15, 2025, 10:25 AM

#

lyric rapids https://youtu.be/FzU-H_-zOc4

Don't send a YouTube link without the context here.

lyric rapids Mar 15, 2025, 10:25 AM

#

lyric rapids what abt this one?

then my pc wont be used up right?

strange wraith Mar 15, 2025, 10:25 AM

#

Just use okada

#

It will be better

lyric rapids Mar 15, 2025, 10:25 AM

#

strange wraith It will be better

ok

#

but

#

it uses a lot of PC resources

strange wraith Mar 15, 2025, 10:26 AM

#

Apply still isn’t working just letting you know

lyric rapids Mar 15, 2025, 10:26 AM

#

right?

strange wraith Mar 15, 2025, 10:26 AM

#

And performance isn’t as good

#

🙂

lyric rapids Mar 15, 2025, 10:26 AM

#

strange wraith And performance isn’t as good

what apply?

strange wraith Mar 15, 2025, 10:26 AM

#

Oh no, just talking about something else

#

You're fine!

#

Thanks for being so kind

#

I’ll be here for any needs and so will other kind members

#

Like namari

#

👍

lyric rapids Mar 15, 2025, 10:27 AM

#

ok

solar torrent Mar 15, 2025, 10:27 AM

#

Shit. For installation and setup questions about W-Okada, go to #🔍│help-w-okada. The website for mod/helper applications for this server is broken right now.

strange wraith Mar 15, 2025, 10:27 AM

#

I know

#

Just reiterating it, apologies

void dock Mar 15, 2025, 10:29 AM

#

@strange wraith are u a chat bot? i think u know every thing 😅

strange wraith Mar 15, 2025, 10:29 AM

#

Nah bro, I've just been in here for a min, just trying to help

#

I used to struggle so much with this shit

solar torrent Mar 15, 2025, 10:29 AM

#

void dock @strange wraith are u a chat bot? i think u know every thing 😅

I'd just take it as an insult if you call me like that.

strange wraith Mar 15, 2025, 10:29 AM

#

Just like helping out

#

It’s okay ahah. Take it how you want, I just wanna treat others how I would like to be treated

#

🙂

void dock Mar 15, 2025, 10:30 AM

#

solar torrent I'd just take it as an insult if you call me like that.

no no, i am really sorry if you feel like that..

strange wraith Mar 15, 2025, 10:30 AM

#

Nooo I took no disrespect