#✨│ai-help | AI HUB | Page 253

simple ore Jul 3, 2025, 7:44 PM

#

it has onnx in its name

tight bane Jul 3, 2025, 7:50 PM

#

How can I look for a voice model?

forest vector Jul 3, 2025, 8:04 PM

#

anyone know how to fix this ?
2025-07-03 22:02:27,643 ERROR [VoiceChangerManager] CUDA error: an illegal memory access was encountered
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1
Compile with TORCH_USE_CUDA_DSA to enable device-side assertions.

running deitris' w-okada voice changer, on a gtx 1070.
happens at a random point in time, after 2-7 mins.

simple ore Jul 3, 2025, 8:09 PM

#

running out of VRAM most likely

forest vector Jul 3, 2025, 8:10 PM

#

I thought so too, but via task maneger it does not use more than 3/8 gb of dedicated memory

#

i feel like it might be inhibited by something, but cant tell why or how

simple ore Jul 3, 2025, 8:13 PM

#

anything else running with hardware acceleration? discord / browser / some 3d game?

forest vector Jul 3, 2025, 8:13 PM

#

discord, 1 browser with a video, and voicemeeter

simple ore Jul 3, 2025, 8:19 PM

#

any overclocking/undevolting?

forest vector Jul 3, 2025, 8:19 PM

#

none

uncut eagle Jul 3, 2025, 8:28 PM

#

can i get help for voice changer ?

outer frigate Jul 3, 2025, 8:33 PM

#

i want to thank every one who helped me it works

#

@simple ore @hallow thistle

steady otter Jul 3, 2025, 8:52 PM

#

I used voice.ai it was good enough for what I needed

past juniper Jul 3, 2025, 9:40 PM

#

do i just leave the embedder on default (hubert_base_112) i also see contentvec and whisper?

#

I have a 4060 and a AMD Ryzen 9 7950X3D

kindred fern Jul 3, 2025, 10:41 PM

#

My mic is picking up the audio but it says "pipeline is not initliazied"

keen coyote Jul 3, 2025, 10:50 PM

#

any specific settings that make a huge difference in the voice model? cant tell if the models bad or if its my settings

smoky solstice Jul 3, 2025, 11:05 PM

#

Trying to run Deiteris' W Okada on an M1 Macbook Pro and getting the following error even after doing the proposed fix of using "xattr -dr com.apple.quarantine" to fix it. On Sequoia 15.2. Anyone have any ideas on what the isuse is?

simple ore Jul 3, 2025, 11:13 PM

#

smoky solstice Trying to run Deiteris' W Okada on an M1 Macbook Pro and getting the following e...

it is to disable quarantine for downloaded stuff

#

I think

#

"
This attribute is added so that it can ask for user confirmation the first time the downloaded program is run, to help stop malware. Upon confirmation the attribute should be removed automatically, and then the program will run normally.
"

smoky solstice Jul 3, 2025, 11:16 PM

#

simple ore it is to disable quarantine for downloaded stuff

Sorry I'm a little confused. You're talking about the command to disable the apple quarantine right? I already did that through the "xattr -dr com.apple.quarantine" command they provided in the tutorial.

simple ore Jul 3, 2025, 11:17 PM

#

okay

#

but you did not post anything else

smoky solstice Jul 3, 2025, 11:18 PM

#

I think the issue might be an outdated MacOS version as a previous person has posted here but I'll post an update after ive updated.

smoky solstice Jul 3, 2025, 11:38 PM

#

yeah that didnt work i fear 😞

opaque scarab Jul 3, 2025, 11:47 PM

#

-colab

patent trellisBOT Jul 3, 2025, 11:47 PM

#

opaque scarab -colab

📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**

Google Colab

• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

knotty moth Jul 4, 2025, 12:07 AM

#

keen coyote any specific settings that make a huge difference in the voice model? cant tell ...

chunk only affects the performance delay, if it's lower than what the gpu spec is capable of, it may cut off

#

extra and "force fp32" in advanced settings may do fine grain quality improvement but dont expect if the model itself sounds not good

#

check some models in mvsep.com like some multi stem SCnet & BS roformer, then the drumsep models

placid holly Jul 4, 2025, 12:35 AM

#

do i need download that 3 file (zip) for RTX 5000 W-Okada?

i only know 7z usually use 01 02 03, but dont know about Zip

simple ore Jul 4, 2025, 12:36 AM

#

placid holly do i need download that 3 file (zip) for RTX 5000 W-Okada? i only know 7z usual...

you need all 3 files, it is simply split to allow upload to github

placid holly Jul 4, 2025, 12:37 AM

#

simple ore you need all 3 files, it is simply split to allow upload to github

ok thank you 🤝

viral mason Jul 4, 2025, 12:46 AM

#

it's peak

fleet cedar Jul 4, 2025, 2:14 AM

#

how do i install the voice changer

#

i use an rtx 5070

edgy topaz Jul 4, 2025, 2:27 AM

#

How do I make AI voice unnoticeable not like in real time like record in weights

simple ore Jul 4, 2025, 2:29 AM

#

fleet cedar i use an rtx 5070

download the 5000 series version

viral mason Jul 4, 2025, 2:45 AM

#

edgy topaz How do I make AI voice unnoticeable not like in real time like record in weights

you don't use weights misc_trolley

edgy topaz Jul 4, 2025, 2:45 AM

#

viral mason you don't use weights <:misc_trolley:1159468147133395025>

Oh what do I use?

viral mason Jul 4, 2025, 2:46 AM

#

anything else :3

reef yacht Jul 4, 2025, 2:46 AM

#

can someone tell me why is the voicechanger super delyed?

hasty gust Jul 4, 2025, 2:48 AM

#

I had realtime voice changer client for 2 years now. is there a new update or a new client??

viral mason Jul 4, 2025, 2:49 AM

#

reef yacht can someone tell me why is the voicechanger super delyed?

what are ur settings in the voice changer, which download link did u use

reef yacht Jul 4, 2025, 2:49 AM

#

viral mason what are ur settings in the voice changer, which download link did u use

watched a youtube video

stoic bronze Jul 4, 2025, 2:49 AM

#

Hey, I’m using AMD and I’m wondering what I should be downloading. VCC is really acting slow on my pc and I don’t know how to fix this - it lags

#

Previously I downloaded a light and working VCC but I can’t remember where it’s at

fleet cedar Jul 4, 2025, 2:51 AM

#

simple ore download the 5000 series version

send link

simple ore Jul 4, 2025, 2:51 AM

#

fleet cedar send link

https://github.com/IllIlIlIllIl/voice-changer/releases/tag/b2335

#

Download all 3 files, then extract the .zip file, it will automatically extract ALL 3 FILES into one. Then open the MMVCServerSIO folder and run MMVCServerSIO.exe (or called MMVCServerSIO if you don't have extensions activated).

viral mason Jul 4, 2025, 2:53 AM

#

reef yacht watched a youtube video

send a screenshot of the settings u have

edgy topaz Jul 4, 2025, 2:53 AM

#

viral mason anything else :3

What do I use?

knotty moth Jul 4, 2025, 4:36 AM

#

viral mason send a screenshot of the settings u have

bro left cat_wtf

knotty moth Jul 4, 2025, 4:37 AM

#

edgy topaz How do I make AI voice unnoticeable not like in real time like record in weights

what do u mean "in weights"

viral mason Jul 4, 2025, 4:39 AM

#

knotty moth bro left <a:cat_wtf:1359901919665197223>

https://tenor.com/view/nickola-viraj-gif-4864148932422811000

Tenor

edgy topaz Jul 4, 2025, 4:50 AM

#

So Like I use the voice in weights

knotty moth Jul 4, 2025, 4:55 AM

#

edgy topaz So Like I use the voice in weights

then what do you expect of it?

edgy topaz Jul 4, 2025, 5:22 AM

#

knotty moth then what do you expect of it?

No, they say that I don't need to use weights

#

So what else do I do to not make Ai recorder or anything unnoticeable

knotty moth Jul 4, 2025, 6:24 AM

#

edgy topaz No, they say that I don't need to use weights

nah I dont think that's the point

knotty moth Jul 4, 2025, 6:25 AM

#

edgy topaz So what else do I do to not make Ai recorder or anything unnoticeable

so you want the results sound like recorded in average mic?

#

there are some post processing effects you could do

#

so try searching it

naive glen Jul 4, 2025, 6:28 AM

#

Hello, sorry for bothering and excuse my English but it is not my first language, but I have a question about how parrots are made in an alternative case with collab, why I tried to install kohya locally using pinokio, which did work but I don't know why an error occurs that I put all the parameters all the necessary folders to create a Lora but it tells me that the folder has not been found where one puts the images or that path does not exist I tried to do it in a thousand ways to verify that it existed And if it exists but it does not I know why it doesn't take it, so I don't know if anyone knows how to fix that error or in the worst case the truth is I don't know how to make loras in collab, why didn't I find updated links or links that currently work because at least all the ones I looked for gave me an error or something like that, so I would like to know if someone could help me or know something

edgy topaz Jul 4, 2025, 6:34 AM

#

YASSSSSSSS

earnest forge Jul 4, 2025, 8:12 AM

#

viral mason it's peak

it could be, but now i have to figure getting that final voice into a game joe_mad

blissful lily Jul 4, 2025, 9:11 AM

#

steady otter I used voice.ai it was good enough for what I needed

Hey can you check my dm? I got a question about it

steady otter Jul 4, 2025, 9:11 AM

#

blissful lily Hey can you check my dm? I got a question about it

Buddy atleast send the dm 😭

blissful lily Jul 4, 2025, 9:13 AM

#

steady otter Buddy atleast send the dm 😭

Sent misc_burger_plead

pliant lily Jul 4, 2025, 10:48 AM

#

Can someone help me? whenever I have to mix two models, I get an error

pliant lily Jul 4, 2025, 11:20 AM

#

but I tried, with two 48k models, I tested with several models

#

Can you send me links to some that work?

#

I've already tried running it locally and via Google Collab

knotty moth Jul 4, 2025, 11:34 AM

#

pliant lily but I tried, with two 48k models, I tested with several models

it's because Applio treats "48k" ≠ "48000" due to models prob trained using different fork/version

#

so try using mainline rvc

pliant lily Jul 4, 2025, 11:40 AM

#

knotty moth so try using mainline rvc

mainline rvc?

pliant lily Jul 4, 2025, 12:11 PM

#

gg

hallow loom Jul 4, 2025, 12:28 PM

#

'VoiceChanger' object has no attribute 'resampler_in' what does this mean?

wanton lion Jul 4, 2025, 12:57 PM

#

everytime i try to launch the start_http.bat file its crashes

hearty dome Jul 4, 2025, 1:40 PM

#

guys which ai voice changer is good most ive seen are 1 year old are there any up to date ones

low shard Jul 4, 2025, 1:48 PM

#

wanton lion everytime i try to launch the start_http.bat file its crashes

This is a General AI Server, we won't be focused on voices anymore

Elaborate:

your PC GPU
your operative system
what you want to do
what tutorial link are you using
a screenshot of the program

#

start_http.bat is apart of original wokada, ALL VIDEO TUTORIALS ARE OLD, DONT TRUST THEM, wokada deiteris fork is better

low shard Jul 4, 2025, 1:49 PM

#

hearty dome guys which ai voice changer is good most ive seen are 1 year old are there any u...

there's no updated video tutorial, only written guides, tell your pc gpu and what you want to do

low shard Jul 4, 2025, 1:49 PM

#

hallow loom 'VoiceChanger' object has no attribute 'resampler_in' **what does this mean?**

This is a General AI Server, we won't be focused on voices anymore

Elaborate:

your PC GPU
your operative system
what you want to do
what tutorial link are you using
a screenshot of the program

glacial charm Jul 4, 2025, 2:14 PM

#

Hello i just want to make a pokemon song but i want to change the lyrics how can i do that

jaunty marten Jul 4, 2025, 2:25 PM

#

Hello, would it be possible to make a voice like this?
https://youtu.be/JupFhvq36PA?si=ZGWF3U1_RYvYcOrq

simple ore Jul 4, 2025, 2:26 PM

#

glacial charm Hello i just want to make a pokemon song but i want to change the lyrics how can...

i imagine using something like ace-step with an instrumental track + new lyrics for audio2audio

low shard Jul 4, 2025, 2:27 PM

#

jaunty marten Hello, would it be possible to make a voice like this? https://youtu.be/JupFhvq3...

You can search rvc ai voice models at:

https://discord.com/channels/1159260121998827560/1175430844685484042
In https://discord.com/channels/1159260121998827560/1163592055830880266 , Do /find with @earnest musk
https://weights.com/ (login required)
https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
Suggested Models for Realtime Voice Changing (Wokada)
https://voice-models.com/
https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)

if there isnt one, you can:

make it yourself with our docs guides
Ask a free request in https://discord.com/channels/1159260121998827560/1159290139609137264
Be aware that we don't allow any paid comms, so don't fall for any "pay me 20 dollars and i will make the model for you" dm

earnest muskBOT Jul 4, 2025, 2:27 PM

#

low shard You can search rvc ai voice models at: - https://discord.com/channels/1159260121...

:wave: @low shard, How can I help?

Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models

edgy topaz Jul 4, 2025, 3:00 PM

#

knotty moth there are some post processing effects you could do

Really?

#

What's the name?

viral mason Jul 4, 2025, 3:37 PM

#

edgy topaz So Like I use the voice in weights

Just download it and use it in okada 💔

edgy topaz Jul 4, 2025, 3:38 PM

#

viral mason Just download it and use it in okada 💔

Ohh I mean on mobile sorry!

#

I am buying an computer soon

viral mason Jul 4, 2025, 3:39 PM

#

earnest forge it *could* be, but now i have to figure getting that final voice into a game <:j...

U can dm me if u want bc idk what you're talking about

viral mason Jul 4, 2025, 3:39 PM

#

edgy topaz Ohh I mean on mobile sorry!

Ohhh you use mobile

edgy topaz Jul 4, 2025, 3:39 PM

#

Yes

viral mason Jul 4, 2025, 3:39 PM

#

Yeah you're pretty limited on options until u get the computer

edgy topaz Jul 4, 2025, 3:39 PM

#

viral mason Yeah you're pretty limited on options until u get the computer

Oh I see

viral mason Jul 4, 2025, 3:41 PM

#

I don't know of any websites besides weights to record your voice and have it output as an ai voice model

#

Sorry.-.

#

Maybe some helpers or mods know tho

edgy topaz Jul 4, 2025, 3:41 PM

#

Oh is okay

viral mason Jul 4, 2025, 3:41 PM

#

Or some QC (idk what it stands for)

analog obsidian Jul 4, 2025, 3:47 PM

#

viral mason Or some QC (idk what it stands for)

qc aren't helpers

viral mason Jul 4, 2025, 3:47 PM

#

They're smart tho

#

Like uh

#

Noobies

#

Idk if they're even a QC lol

analog obsidian Jul 4, 2025, 3:48 PM

#

viral mason Idk if they're even a QC lol

he's a engineer

#

jeffthelandshark_3

viral mason Jul 4, 2025, 3:48 PM

#

Red or Blu side

#

What's his load out

fair oasis Jul 4, 2025, 3:53 PM

#

guys may i ask: my regular mic works fine but my virtual cable mic somehow has my computer audio bleeding into my mic. its been messing up my voice ai as a result, what can I do to fix it?

solid sequoia Jul 4, 2025, 4:35 PM

#

i have installed a model from #1175430844685484042 , no matter what i do though the model won't use my rx 6600 xt instead uses my cpu which kills the performance a lot any way i can fix this?

toxic zenith Jul 4, 2025, 4:41 PM

#

How an I make a custom decadriver sound?

#

Like the announcer

low shard Jul 4, 2025, 4:59 PM

#

solid sequoia i have installed a model from <#1175430844685484042> , no matter what i do thoug...

This is a General AI Server, we won't be focused on voices anymore

Elaborate:

your PC GPU
your operative system
what you want to do
what tutorial link are you using
a screenshot of the program

low shard Jul 4, 2025, 4:59 PM

#

toxic zenith How an I make a custom decadriver sound?

what? u mean model?

#

what's ur pc gpu?

solid sequoia Jul 4, 2025, 6:07 PM

#

low shard This is a **General AI Server, we won't be focused on voices anymore** Elaborat...

rx 6600 xt
windows 11
utilizing gpu for the model to perform better. right now it only uses cpu
can't upload screen shots due to missing permissions

low shard Jul 4, 2025, 6:17 PM

#

solid sequoia rx 6600 xt windows 11 utilizing gpu for the model to perform better. right now i...

you didn't elaborate everything

what do you want to do? what tutorial link are you using?

#

!give-meida-perms 1h @solid sequoia

solid sequoia Jul 4, 2025, 6:18 PM

#

low shard you didn't elaborate everything what do you want to do? what tutorial link are ...

i want to use my gpu instead of cpu for the model so it performs better

#

and for tutorial link are you refering to youtube links?

low shard Jul 4, 2025, 6:20 PM

#

solid sequoia i want to use my gpu instead of cpu for the model so it performs better

i want to use my gpu instead of cpu for the model so it performs better

how do you want to use the model? what are you planning to do? ai covers? realtime voice changer for calls?

solid sequoia Jul 4, 2025, 6:21 PM

#

realtime voice changer for calls and ingame voice chat

low shard Jul 4, 2025, 6:21 PM

#

theres many different ai programs

#

RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.

Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)

#

do you need wokada or rvc?

low shard Jul 4, 2025, 6:21 PM

#

solid sequoia and for tutorial link are you refering to youtube links?

yes, or the download link of the program

solid sequoia Jul 4, 2025, 6:21 PM

#

i came from this tutorial, used the links given in the desc
https://www.youtube.com/watch?v=SxdnGxicJOg&ab_channel=novision

low shard Jul 4, 2025, 6:22 PM

#

solid sequoia i came from this tutorial, used the links given in the desc https://www.youtube....

all video tutorials are outdated

#

that tutorial uses an over year old original wokada lmfao

#

I wrote it in the comments

#

you just wasted time using that tutorial basically

#

delete the program, and delete vb audio cable too

solid sequoia Jul 4, 2025, 6:23 PM

#

it seemed to work perfectly fine though, i just need it to use gpu instead of cpu

solid sequoia Jul 4, 2025, 6:23 PM

#

low shard delete the program, and delete vb audio cable too

alright

#

which link can i find a newer version on

low shard Jul 4, 2025, 6:23 PM

#

solid sequoia it seemed to work perfectly fine though, i just need it to use gpu instead of cp...

it's outdated, dont bother using it

#

plus that version sucks for amd

low shard Jul 4, 2025, 6:24 PM

#

solid sequoia which link can i find a newer version on

-realtime

patent trellisBOT Jul 4, 2025, 6:24 PM

#

low shard -realtime

💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

low shard Jul 4, 2025, 6:24 PM

#

read the 1st link

solid sequoia Jul 4, 2025, 6:24 PM

#

alright thank you

low shard Jul 4, 2025, 6:24 PM

#

wokada deiteris fork

low shard Jul 4, 2025, 6:24 PM

#

solid sequoia alright thank you

let me know

solid sequoia Jul 4, 2025, 6:39 PM

#

low shard let me know

it works so much better now both low cpu and gpu usage, more responsive as well thank you, 1 last question though my "echo" "sup1" "sup2" settings seem to be disabled i couldn't find anything about it in the audio setup page you sent me

simple ore Jul 4, 2025, 6:51 PM

#

solid sequoia it works so much better now both low cpu and gpu usage, more responsive as well ...

they are disabled in server mode

pine star Jul 4, 2025, 7:10 PM

#

anyone had issues where they followed this guide https://docs.aihub.gg/rvc-voice-changer/realism/
it broke their youtube?

#

youtube just keeps saying "Audio renderer error. Please restart your computer"

crude flame Jul 4, 2025, 7:11 PM

#

pine star youtube just keeps saying "Audio renderer error. Please restart your computer"

Have you tried restarting your pc

pine star Jul 4, 2025, 7:12 PM

#

yeah it works whenever i dont have voicemeeter potato on

#

im pretty sure that i followed every step it told me

crude flame Jul 4, 2025, 7:12 PM

#

Try clicking a1 on both the things in voicemeeter

pine star Jul 4, 2025, 7:13 PM

#

like this?

#

i clicked on a1 for voicemeeter input and aux1

crude flame Jul 4, 2025, 7:14 PM

#

And in the first one if that doesn't work

pine star Jul 4, 2025, 7:14 PM

#

it still dosen work

#

i have light host on too, idk if thats the issue

crude flame Jul 4, 2025, 7:15 PM

#

Huh that usually fixes it

pine star Jul 4, 2025, 7:15 PM

#

should i remove the b1 and mono in stero input 1?

crude flame Jul 4, 2025, 7:15 PM

#

Try turning off the denoiser you have on in the first column

pine star Jul 4, 2025, 7:16 PM

#

#

wait it picks up sound

#

but youtube is still broken lol

crude flame Jul 4, 2025, 7:17 PM

#

So it's picking up sounds just not outputting them?

pine star Jul 4, 2025, 7:17 PM

#

yeah

#

i have my system input and out put my headset

crude flame Jul 4, 2025, 7:17 PM

#

In hardware out a1 you have that set to your headphones right

pine star Jul 4, 2025, 7:17 PM

#

yeee

#

wait how does the imput work?

#

cuz when i talk it dosent input anythign

#

i dont see any instructions to put my headset mic input anywhere

crude flame Jul 4, 2025, 7:18 PM

#

Not in voicemeeter

#

In w-okada you use your mic input

pine star Jul 4, 2025, 7:19 PM

#

oh so the wokada output will be the line 1 (virtual cable)?

crude flame Jul 4, 2025, 7:19 PM

#

Then the output gets put into voicemeeter then into light host then outputs into discord or whatever

pine star Jul 4, 2025, 7:20 PM

#

Like this?

#

"Once you have completed all of the above steps you can now go into anything and set the mic input to "Voicemeeter Out B2"."

crude flame Jul 4, 2025, 7:21 PM

#

No

#

Input is your headphones

#

Wait

pine star Jul 4, 2025, 7:21 PM

#

oh i got confused with the instructions

crude flame Jul 4, 2025, 7:21 PM

#

I'm confusing myself lol

pine star Jul 4, 2025, 7:21 PM

#

this is where i got it from

crude flame Jul 4, 2025, 7:22 PM

#

So input is your mic output is line 1 and monitor you can leave empty

#

I'm going to redo that guide when I get home

pine star Jul 4, 2025, 7:22 PM

#

oh ok

pine star Jul 4, 2025, 7:23 PM

#

crude flame I'm going to redo that guide when I get home

is it possible to add more details on what changing the setting actually do

#

so people know what they are actually changing

#

maybe have a bracket next to instuctions saying what it does

crude flame Jul 4, 2025, 7:29 PM

#

pine star is it possible to add more details on what changing the setting actually do

Yeah def

viral mason Jul 4, 2025, 7:41 PM

#

are these still the best settings for the t-de-esser 2

pine star Jul 4, 2025, 7:53 PM

#

@crude flame just realised that when i downloaded virtual cable lite it automatically set everything to line 1 and thats why everything is breaking lol

#

apprently on windows 11, there system -> sound input output

#

and theres also system -> sound -> volume mixer input output

#

anime_pray

viral mason Jul 4, 2025, 7:56 PM

#

that's confusing

pine star Jul 4, 2025, 7:58 PM

#

yeah

#

so much inputs and outputs

carmine mica Jul 4, 2025, 8:11 PM

#

program to use the templates?

low shard Jul 4, 2025, 8:11 PM

#

carmine mica program to use the templates?

elaborate

low shard Jul 4, 2025, 8:11 PM

#

carmine mica program to use the templates?

This is a General AI Server, we won't be focused on voices anymore

Elaborate:

your PC GPU
your operative system
what you want to do
what tutorial link are you using
a screenshot of the program

carmine mica Jul 4, 2025, 8:13 PM

#

for voice models to create TTS with rvc

swift thunder Jul 4, 2025, 8:28 PM

#

Does anyone know of a software or method to remove audience noise and get results like this?
https://youtu.be/0xj7UiwVOa0?si=aHo1Igg_cCklyMHp

steady otter Jul 4, 2025, 8:41 PM

#

swift thunder Does anyone know of a software or method to remove audience noise and get result...

yeah xminus has a decrowd feature

swift thunder Jul 4, 2025, 8:47 PM

#

steady otter yeah xminus has a decrowd feature

I tried it but it is not very stable, there is noise in the instrumentation

#

like that of Mvsep

steady otter Jul 4, 2025, 8:51 PM

#

swift thunder I tried it but it is not very stable, there is noise in the instrumentation

cant you replace it with the oficial instrumental?

swift thunder Jul 4, 2025, 8:55 PM

#

steady otter cant you replace it with the oficial instrumental?

But the drums and guitar? I'm impressed by the intro, since everything is identical to the original Live instrumental. I've been trying to get such results for a year.

#

https://youtu.be/0r3GpS1hy_Y?si=ucNGeRqmlDU0IsDX

or this

brittle wing Jul 4, 2025, 9:07 PM

#

Hey i have a question, is there an AI tool for generating subtitles? I want to show something to my friend but he doesn't understand it cuz it's in my native language and not english

crude flame Jul 4, 2025, 9:55 PM

#

pine star yeah

updated the guide anime_pray also found a way to not route system audio to voicemeeter so you dont have to deal with missing audio

i just tested it on a fresh version of voicemeeter so it should work

simple ore Jul 4, 2025, 9:55 PM

#

brittle wing Hey i have a question, is there an AI tool for generating subtitles? I want to s...

you can run the audio thru ASR like Whisper

#

depending on the quality of the audio and language you may get something decent... or not

#

you can upload the audio to youtube and let me make a transcript

pine star Jul 4, 2025, 9:57 PM

#

crude flame updated the guide <:anime_pray:1159685390156967936> also found a way to not rou...

Thank yiu!

fleet cedar Jul 4, 2025, 10:48 PM

#

simple ore Download all 3 files, then extract the .zip file, it will automatically extract ...

i kept getting this error help

simple ore Jul 4, 2025, 10:54 PM

#

norton antivirus?

fleet cedar Jul 4, 2025, 10:55 PM

#

simple ore norton antivirus?

no

#

i dont have any antiviru

simple ore Jul 4, 2025, 11:01 PM

#

weird

brisk grove Jul 4, 2025, 11:02 PM

#

@simple ore what do u recommend for me to use for the ai voice in games?

fleet cedar Jul 4, 2025, 11:02 PM

#

simple ore weird

i already uninstalled a antivirus like 2 days ago

#

bitdefender

#

help

fleet cedar Jul 4, 2025, 11:03 PM

#

fleet cedar i kept getting this error help

when i click this "ok" it would say this error

simple ore Jul 4, 2025, 11:04 PM

#

fleet cedar when i click this "ok" it would say this error

why are you copying?

fleet cedar Jul 4, 2025, 11:04 PM

#

simple ore why are you copying?

im not copying..

simple ore Jul 4, 2025, 11:04 PM

#

you need to download all 3 files, use 7-zip to unzip

#

dont use windows BS

#

it is the worst

fleet cedar Jul 4, 2025, 11:04 PM

#

simple ore you need to download all 3 files, use 7-zip to unzip

not winrar?

#

can i use winrar

brisk grove Jul 4, 2025, 11:05 PM

#

what do i download for the voice changer?

simple ore Jul 4, 2025, 11:05 PM

#

winrar should be able to handle the split archive too

#

or winzip

fleet cedar Jul 4, 2025, 11:05 PM

#

ill get winrar

#

or 7zip

#

@simple ore now it works after i installed 7zip

brittle wing Jul 4, 2025, 11:07 PM

#

#

how do i fix

#

this

brisk grove Jul 4, 2025, 11:07 PM

#

@fleet cedarwhat do u have that u put the voice to?

fleet cedar Jul 4, 2025, 11:07 PM

#

brisk grove <@688409015436443651>what do u have that u put the voice to?

wym

brisk grove Jul 4, 2025, 11:08 PM

#

fleet cedar wym

like the soundboard

#

to be able to use the voice

#

like voicemod

fleet cedar Jul 4, 2025, 11:08 PM

#

@simple ore can u give me the right settings for my GPU

simple ore Jul 4, 2025, 11:09 PM

#

https://rentry.co/forkvoicechangerguide

brisk grove Jul 4, 2025, 11:11 PM

#

fleet cedar <@155030383648440320> can u give me the right settings for my GPU

@fleet cedarwhats that?

fleet cedar Jul 4, 2025, 11:11 PM

#

simple ore https://rentry.co/forkvoicechangerguide

is this correct

#

idk

fleet cedar Jul 4, 2025, 11:11 PM

#

brisk grove <@688409015436443651>whats that?

bro cant u clearly see

#

its obviously the AI rvc voice

brittle wing Jul 4, 2025, 11:11 PM

#

brittle wing

how to fdix

brisk grove Jul 4, 2025, 11:13 PM

#

fleet cedar its obviously the AI rvc voice

can u send a link for the download

#

@fleet cedar

simple ore Jul 4, 2025, 11:15 PM

#

fleet cedar is this correct

read the guide

ionic oak Jul 4, 2025, 11:25 PM

#

Hello, I don’t know if this is the right place to ask, but I don’t know how to change my voice in real time. Which app should I use? Because I have several models that people sent me, but I don’t know how or where to use them?

simple ore Jul 4, 2025, 11:32 PM

#

-rt

patent trellisBOT Jul 4, 2025, 11:32 PM

#

simple ore -rt

💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

brisk grove Jul 4, 2025, 11:42 PM

#

@fleet cedaralr i got it setup now how can i get it to actually work in game and in discord?

amber verge Jul 5, 2025, 12:06 AM

#

some reason it doesn't open

dapper rain Jul 5, 2025, 12:11 AM

#

Is this channel for dedicated RVC assistance?

low shard Jul 5, 2025, 12:17 AM

#

dapper rain Is this channel for dedicated RVC assistance?

it's about any general ai help, rvc included

low shard Jul 5, 2025, 12:17 AM

#

amber verge some reason it doesn't open

This is a General AI Server, we won't be focused on voices anymore

Elaborate:

your PC GPU
your operative system
what you want to do
what tutorial link are you using
a screenshot of the program

dapper rain Jul 5, 2025, 12:18 AM

#

Gotcha but you guys will still keep up RVC troubleshooting support still?

low shard Jul 5, 2025, 12:20 AM

#

dapper rain Gotcha but you guys will still keep up RVC troubleshooting support still?

yeah ofc, we just merged the channels to have less channels

#

if you need help, pls elaborate

plain pumice Jul 5, 2025, 12:52 AM

#

Yo

#

My voice changer is stuttering

#

It sounds so bad

#

Its like tweaking

#

My words are transfering but its static

low shard Jul 5, 2025, 12:54 AM

#

plain pumice My voice changer is stuttering

This is a General AI Server, we won't be focused on voices anymore

Elaborate:

your PC GPU
your operative system
what you want to do
what tutorial link are you using
a screenshot of the program

plain pumice Jul 5, 2025, 12:54 AM

#

Oh

low shard Jul 5, 2025, 12:54 AM

#

plain pumice Oh

voice changer is too generic, there could be over 100 programs classified like that lol

#

you need to elaborate more to get help, else we dunno even how to help

plain pumice Jul 5, 2025, 12:55 AM

#

XD

#

I cant screenshot here

#

Can I just dm?

low shard Jul 5, 2025, 12:55 AM

#

!give-media-perms 1h @plain pumice

low shard Jul 5, 2025, 12:55 AM

#

plain pumice I cant screenshot here

now u can

#

also it would be better u elaborate everything

#

all the infos i asked are crucial

plain pumice Jul 5, 2025, 12:55 AM

#

#

So basically

low shard Jul 5, 2025, 12:55 AM

#

plain pumice

lemme guess, youtube tutorial?

plain pumice Jul 5, 2025, 12:55 AM

#

Im trying to use a voice changer on

#

Yes

low shard Jul 5, 2025, 12:56 AM

#

you're using an over year old version of original wokada

#

its the same as using windows xp in 2050 basically

plain pumice Jul 5, 2025, 12:56 AM

#

LOL

low shard Jul 5, 2025, 12:56 AM

#

also vb audio cable has been reported to use issues on windows

low shard Jul 5, 2025, 12:56 AM

#

plain pumice Yes

all video tuts are outdated

plain pumice Jul 5, 2025, 12:56 AM

#

How do I still use it

low shard Jul 5, 2025, 12:56 AM

#

you can simply uninstall everything and forget you even watched it basically

low shard Jul 5, 2025, 12:56 AM

#

plain pumice How do I still use it

simply, you can't, its a shitty version

#

you need a better one

plain pumice Jul 5, 2025, 12:57 AM

#

Like?

#

Whats the new wakada

low shard Jul 5, 2025, 12:57 AM

#

plain pumice Like?

what's your pc gpu? what's your operative system?

plain pumice Jul 5, 2025, 12:57 AM

#

Gpu?

low shard Jul 5, 2025, 12:57 AM

#

plain pumice Gpu?

that's crucial

plain pumice Jul 5, 2025, 12:57 AM

#

Intel core i5

low shard Jul 5, 2025, 12:57 AM

#

plain pumice Intel core i5

thats a cpu

#

cpu = central processing unit

plain pumice Jul 5, 2025, 12:57 AM

#

Oh

low shard Jul 5, 2025, 12:57 AM

#

gpu = graphics processing unit

#

this isn't chatgpt, it runs on your hardware and its way more intensive and complex

plain pumice Jul 5, 2025, 12:57 AM

#

Oh

low shard Jul 5, 2025, 12:57 AM

#

gpu does all the complex tasks like gaming, 3d, and AI

low shard Jul 5, 2025, 12:58 AM

#

plain pumice Oh

You can check your pc gpu on Windows via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU

#

so yeah, don't expect a 1 click experience, AI is not really user friendly

#

chatgpt runs on everything bc it runs on cloud, remote good pc

#

while this is a program that runs on your hardware, not a rich hardware by someone else

plain pumice Jul 5, 2025, 12:59 AM

#

Intel (r) uhd graphics

low shard Jul 5, 2025, 12:59 AM

#

plain pumice Intel (r) uhd graphics

do you have any other gpus?

#

check gpu 1 and gpu 0

#

the one you mentioned is integrated graphics, it's literally too weak to do any type of AI and to even get recognized as a GPU lol

plain pumice Jul 5, 2025, 1:00 AM

#

Oh

low shard Jul 5, 2025, 1:00 AM

#

don't expect AI to run on bad hardware, it's more intensive than gaming

low shard Jul 5, 2025, 1:00 AM

#

plain pumice Oh

soo, you got another GPU?

plain pumice Jul 5, 2025, 1:00 AM

#

How do I check?

low shard Jul 5, 2025, 1:00 AM

#

plain pumice How do I check?

just send a screenshot of your task manager

#

you should have gpu 0 and gpu 1 maybe

plain pumice Jul 5, 2025, 1:00 AM

#

Alr

#

low shard Jul 5, 2025, 1:01 AM

#

plain pumice

the whole task manager

plain pumice Jul 5, 2025, 1:01 AM

#

Ok

#

\

low shard Jul 5, 2025, 1:03 AM

#

plain pumice \

the performance tab

plain pumice Jul 5, 2025, 1:03 AM

#

#

I know I may be wasting ur time

#

(I definetly am but not on purpose)

#

Im just if you call it.. A caveman when it comes to checking ur pc and allat stuff

#

@low shard

low shard Jul 5, 2025, 1:05 AM

#

plain pumice

yep you don't got got any other GPUs

You got 3 options:

Buy a better pc
Run it locally (on ur pc) using the CPU mode of the wokada fork which has better performance https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/ (but this isn't suggested as it could be unstable)
Use **cloud **(remote good pc):

About Cloud, there are different services:

Google Colabs (4 hours daily of free T4 gpu, easy to use, require only a google account) :
- W-Okada's Deiteris' Fork Voice Changer Google Colab (currently works only on google colab PAID tier)
- How to use Original W-Okada's Voice Changer Google Colab (has a Guide) (currently broken)
Kaggles (30 hours weekly of better GPUs, T4x2 & P100, harder to use, requires an account and a phone number):
- W-Okada's Deiteris' Fork Voice Changer Kaggle (the best and only working one currently for free)
- Original W-Okada's Voice Changer Kaggle (currently broken)

#

so yeah that's why it was laggy, other than being an old version with worse performance, you also got bad hardware, so not a good combo

#

the best options are just either using cloud or buying a better pc

plain pumice Jul 5, 2025, 1:06 AM

#

How do I use cloud?

low shard Jul 5, 2025, 1:07 AM

#

plain pumice How do I use cloud?

click this link and read the guide

#

reminder that it's more complex and you got limited free time btw

#

and you also need to give your phone number, as it's a google service and they dont want you to use alt accs

plain pumice Jul 5, 2025, 1:09 AM

#

Uj

#

Ik

#

Google is always like that

#

Like a really secure software

low shard Jul 5, 2025, 1:13 AM

#

This is a General AI Server, we won't be focused on voices anymore

Elaborate:

your PC GPU
your operative system
what you want to do
what tutorial link are you using
a screenshot of the program

#

rvc and wokada do 2 different things

#

RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.

Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)

#

they do not want u to use alt accs lol

exotic elm Jul 5, 2025, 1:27 AM

#

hey guys, how i use a model from the #1175430844685484042 ?

low shard Jul 5, 2025, 1:37 AM

#

exotic elm hey guys, how i use a model from the <#1175430844685484042> ?

Elaborate:

your PC GPU
your operative system
what you want to do

#

-realtime

patent trellisBOT Jul 5, 2025, 1:37 AM

#

low shard -realtime

💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

low shard Jul 5, 2025, 1:37 AM

#

1st link, wokada deiteris fork

long obsidian Jul 5, 2025, 3:51 AM

#

Anyone know realistic ai photo generator which can use your face to make a picture

toxic zenith Jul 5, 2025, 4:10 AM

#

low shard what? u mean model?

yeah, I mean custom text to speech, I see decade driver models here but i don't know how to convert text to speech into my style

simple ore Jul 5, 2025, 4:12 AM

#

long obsidian Anyone know realistic ai photo generator which can use your face to make a pictu...

https://github.com/VectorSpaceLab/OmniGen2?tab=readme-ov-file

long obsidian Jul 5, 2025, 4:19 AM

#

Thank youu

long obsidian Jul 5, 2025, 4:21 AM

#

simple ore https://github.com/VectorSpaceLab/OmniGen2?tab=readme-ov-file

Also when creating images of people do they look genuinely

#

And can i feed it data to create pictures from it

runic thunder Jul 5, 2025, 6:32 AM

#

hii

#

how can I create an ai voice?

long obsidian Jul 5, 2025, 7:50 AM

#

@simple ore and also which out of these do u think is the best omnigen2/fluc/SDXL

raven barn Jul 5, 2025, 8:19 AM

#

literally have no idea how to train with a 5070

#

it just doesnt wanna work

toxic zenith Jul 5, 2025, 9:05 AM

#

how can I make my own text to speech?

knotty moth Jul 5, 2025, 9:06 AM

#

raven barn literally have no idea how to train with a 5070

make sure you're using the latest Applio

#

and do manual install with latest pytorch (2.7) and cuda 12.8

#

the original rvc and even mangio won't work at all

raven barn Jul 5, 2025, 9:09 AM

#

i did that

#

like so many times

#

i dont know what i did wrong

knotty moth Jul 5, 2025, 9:13 AM

#

raven barn i dont know what i did wrong

a bit correction

#

if not the latest release

#

clone the Applio repo itself https://github.com/IAHispano/Applio

GitHub

GitHub - IAHispano/Applio: A simple, high-quality voice conversion ...

A simple, high-quality voice conversion tool focused on ease of use and performance. - IAHispano/Applio

#

then just double click run-install.bat

#

#

it should include torch 2.7.1 which is needed for RTX 50-series

raven barn Jul 5, 2025, 9:14 AM

#

okay ill try this

long obsidian Jul 5, 2025, 9:14 AM

#

@knotty moth can u check dms for a sec i asked u a question there i can send it here too if u want

knotty moth Jul 5, 2025, 9:14 AM

#

long obsidian <@681186927151546397> can u check dms for a sec i asked u a question there i can...

no just ask here

long obsidian Jul 5, 2025, 9:17 AM

#

knotty moth no just ask here

Do you know a good ai photo generator which can use a face to generate pictures

#

I saw flux and onnigen2 are good ones but which would you recommend me

raven barn Jul 5, 2025, 9:25 AM

#

knotty moth and do manual install with latest pytorch (2.7) and cuda 12.8

do u mind sending links for those, i dont want download the wrong thing

#

just incase i may have

knotty moth Jul 5, 2025, 9:32 AM

#

raven barn do u mind sending links for those, i dont want download the wrong thing

raven barn Jul 5, 2025, 9:33 AM

#

knotty moth

no for the cuda and pytorch

#

i think i found it

toxic zenith Jul 5, 2025, 9:37 AM

#

where can I train AI voice using google colab

long obsidian Jul 5, 2025, 10:02 AM

#

@knotty moth so what do you think

raven barn Jul 5, 2025, 10:15 AM

#

currently trying it rn

#

downloaded pytorch 2.7 and cuda 12.8

#

didnt do anything

#

proof

#

idk but im tempted on just getting out my 4070 and using that

#

cuz ik it will work

#

is it more or less the same thing?

#

ill look into it when i wake up

#

been trying to get this to work for ab 10 hours

#

rvc doesnt work

#

appolio doesnt work

#

i hope but thx sob_pray

simple ore Jul 5, 2025, 10:31 AM

#

long obsidian <@155030383648440320> and also which out of these do u think is the best omnigen...

to do image edits, like replacing characters/merging images, both omnigen2 and flux kontext

simple ore Jul 5, 2025, 10:32 AM

#

raven barn didnt do anything

full screenshot how you installed it

long obsidian Jul 5, 2025, 10:32 AM

#

simple ore to do image edits, like replacing characters/merging images, both omnigen2 and f...

For creating people or changing small features (making ai influencer) which would u suggest me to use

simple ore Jul 5, 2025, 10:33 AM

#

it also requires a small fix to add "50" to "infer-web.py"

simple ore Jul 5, 2025, 10:33 AM

#

long obsidian For creating people or changing small features (making ai influencer) which woul...

omnigen2 then

#

all in one

simple ore Jul 5, 2025, 10:34 AM

#

raven barn appolio doesnt work

you're messing something up then

raven barn Jul 5, 2025, 10:34 AM

#

simple ore you're messing something up then

Can I dm u the screenshots tmr I just got off

simple ore Jul 5, 2025, 10:34 AM

#

raven barn Can I dm u the screenshots tmr I just got off

https://github.com/IAHispano/Applio/issues/1020

raven barn Jul 5, 2025, 10:35 AM

#

simple ore https://github.com/IAHispano/Applio/issues/1020

That’s exactly what I did

#

I got that line straight from their website

#

Even installed cuda 12.8 or wtv

simple ore Jul 5, 2025, 10:36 AM

#

i require a screenshot of how you done it

#

because there are 20+ who said it works fine

raven barn Jul 5, 2025, 10:36 AM

#

The only version of pyhton I could get to work with that version of PyTorch was 13.11.9

silver lynx Jul 5, 2025, 10:37 AM

#

hey is there still like a list of what settings to use with which gpus

simple ore Jul 5, 2025, 10:37 AM

#

silver lynx hey is there still like a list of what settings to use with which gpus

https://rentry.co/forkvoicechangerguide#known-working-settings-for-chunk-and-extra

raven barn Jul 5, 2025, 10:38 AM

#

simple ore i require a screenshot of how you done it

I’ll send it to u tmr

silver lynx Jul 5, 2025, 10:41 AM

#

simple ore https://rentry.co/forkvoicechangerguide#known-working-settings-for-chunk-and-ext...

seems like a bad website, for amd 6xxx XT gpu's it says its MAX settings are 128 + 2.7s and then like a few sentences below the tabel it says the 6650 XT can do 60-80 ms

simple ore Jul 5, 2025, 10:48 AM

#

silver lynx seems like a bad website, for amd 6xxx XT gpu's it says its MAX settings are 128...

read the section above

#

knotty moth Jul 5, 2025, 11:01 AM

#

raven barn downloaded pytorch 2.7 and cuda 12.8

torch 2.7.1 exactly

#

you were trying on 2.7.0

raven barn Jul 5, 2025, 11:01 AM

#

Oh

knotty moth Jul 5, 2025, 11:03 AM

#

raven barn Oh

if you run through run_install.bat it should have installed torch 2.7.1

raven barn Jul 5, 2025, 11:03 AM

#

knotty moth if you run through run_install.bat it should have installed torch 2.7.1

I found the hugging face version

#

I’ll look at it again when I wake up later

simple ore Jul 5, 2025, 11:06 AM

#

knotty moth you were trying on 2.7.0

does not matter, both work

#

RVC1006Nvidia requires a small manual fix

low shard Jul 5, 2025, 11:08 AM

#

runic thunder how can I create an ai voice?

This is a General AI Server, we won't be focused on voices anymore

Elaborate:

your PC GPU
your operative system
what you want to do
what tutorial link are you using
a screenshot of the program

hybrid flame Jul 5, 2025, 12:24 PM

#

Hello, can you please advise if there is an up to date guide to install and use RVC on a pc with AMD graphics card (7800xt) for real time voice changing? Thanks

limber siren Jul 5, 2025, 12:30 PM

#

Hi, did Weights remove the option to log in with github?

hybrid flame Jul 5, 2025, 12:32 PM

#

I don't understand, since I'm not a developer, I'm just here to use it.

knotty moth Jul 5, 2025, 12:37 PM

#

hybrid flame Hello, can you please advise if there is an up to date guide to install and use ...

get the AMD one here

#

https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#download-amd-intel-and-cpu-on-windows

Deiteris' W Okada Fork

Last update: May 5, 2025

hybrid flame Jul 5, 2025, 12:38 PM

#

knotty moth get the AMD one here

thanks

willow cipher Jul 5, 2025, 12:49 PM

#

why does echo and suppresion can't be turned on?

#

nvm figured out

long obsidian Jul 5, 2025, 1:54 PM

#

@simple ore hi i got a question about the installation my gpu is nvidia 5060ti and my cuda version is 12.9 on the pytorch the newest one is 12.8 but the flash attention doesnt support the pytorch version of 2.6.0 with this cuda version what should i do? Should I download the 12.4 one and the standard flash attention? (this is about the omnigen2), will it work like that despite my gpu being newer version

simple ore Jul 5, 2025, 2:01 PM

#

long obsidian <@155030383648440320> hi i got a question about the installation my gpu is nvidi...

you dont need cuda toolkit

#

as long as you have torch cu128 either 2.7.0 or 2.7.1 it is fine

#

you can find flash attention 2 wheels here https://huggingface.co/lldacing/flash-attention-windows-wheel/tree/main

#

download cp version that matches your python install (3.10, 3.11, 3.12)

#

long obsidian Jul 5, 2025, 2:07 PM

#

@simple ore so i need to install with this: pip install torch==2.7.1+cu128 --index-url https://download.pytorch.org/whl/cu128 and install one of the flash attn versions corresponding to my python (3.11) uve listed?

simple ore Jul 5, 2025, 2:08 PM

#

you install torch and torchvision pip install torch torchvision --upgrade --index-url https://download.pytorch.org/whl/cu128

#

then you download cp311 flash wheel and use pip install the_name_of_the_downloaded_file.whl

long obsidian Jul 5, 2025, 2:10 PM

#

i just found out that the python version ive downloaded is 3.13 if i download 3.11 will it work?

simple ore Jul 5, 2025, 2:10 PM

#

nope

#

3.13 is not recommended for anything, too many incompatible libraries out there

long obsidian Jul 5, 2025, 2:11 PM

#

how can i swap it to 3.11, just delete it?

simple ore Jul 5, 2025, 2:11 PM

#

install 3.11, make sure you select 'add to path' checkbox on the install screen

long obsidian Jul 5, 2025, 2:11 PM

#

alr imma do this rn thanks

simple ore Jul 5, 2025, 2:11 PM

#

then remaking the virtual envrionment using 3.11

#

py -3.11 -m venv venv

long obsidian Jul 5, 2025, 2:13 PM

#

there are multiple versions of 3.11 (after the 11 which version should i get)

simple ore Jul 5, 2025, 2:13 PM

#

https://www.python.org/ftp/python/3.11.9/python-3.11.9-amd64.exe

long obsidian Jul 5, 2025, 2:23 PM

#

ty im downloading the stuff rn

long obsidian Jul 5, 2025, 2:40 PM

#

is it normal to be this slow for a simple image

#

also @simple ore how can i train models for it (can i actually do it)

#

im seeing this for the past 5 min

simple ore Jul 5, 2025, 2:47 PM

#

did you make sure you have cuda torch installed?

#

check device manager/performance / memory and vram use

long obsidian Jul 5, 2025, 2:49 PM

#

sinful mango Jul 5, 2025, 2:49 PM

#

Hello all !
I am a newbie here and not a developper at all, I dabble a bit and mostly just surf, read, and do lots of trials & errors.
I am currently working on mods for Cyberpunk 2077 for my private usage (not for sharing on nexusmods or else for copyright issues).
I saw a lot of "voice ai" swaps for the main character and wanted to create my own with a voice actor I really appreciate (a french dubber for a character in a tv show).
I recently tried Zonos to create TTS audio with a sample of the french dubber and the result is quite good.
But that is just the beginning, now I am in front of the hardest part :
Take all the audio files of the character in the game , and create modified audio files of those source files but with the cloned voice I get on Zonos.
And so I have two options :
Either I get all the text of those audio files and script something with python to batch generate the audio files using Zonos.
Either I find a tool allowing audio-to-audio by using a cloned audio reader (is that even possible and does it exist ?)
My configuration is : i9 14900 / 64gb ram / RTX 4090 / Windows 11 Pro

Any help/pointers would be deeply appreciated ^^ (and I repeat : I am not a developer, I can dabble and am willing to learn but consider me an utter noob)

long obsidian Jul 5, 2025, 2:49 PM

#

simple ore did you make sure you have cuda torch installed?

ughh i followed ur steps

#

from here i have done the steps till 2

#

then i deleted the pip files and followed ur instructions

#

i didnt do the 3.2 tho

#

i will try doing that rn

long obsidian Jul 5, 2025, 2:51 PM

#

simple ore you install torch and torchvision `pip install torch torchvision --upgrade --ind...

should i change --upgrade here or leave it like this

#

also this is the requirments txt file

torch==2.6.0
torchvision==0.21.0
timm
einops
accelerate
transformers==4.51.3
diffusers
opencv-python-headless
scipy
wandb
matplotlib
Pillow
tqdm
omegaconf
python-dotenv
ninja
ipykernel
wheel
triton-windows; sys_platform == "win32"

simple ore Jul 5, 2025, 2:55 PM

#

install the requirements, then upgrade torch

viral mason Jul 5, 2025, 2:58 PM

#

sinful mango Hello all ! I am a newbie here and not a developper at all, I dabble a bit and m...

Hey there, I can help you create a voice model of the person you're talking about ^^

long obsidian Jul 5, 2025, 2:58 PM

#

i changed the text file to

torch==2.7.1
torchvision==0.22.1
timm
einops
accelerate
transformers==4.51.3
diffusers
opencv-python-headless
scipy
wandb
matplotlib
Pillow
tqdm
omegaconf
python-dotenv
ninja
ipykernel
wheel
triton-windows; sys_platform == "win32"

and i will check if it works now

#

i copied the torch and torchvision from the cmd from the installing step before it

#

its still sitting on 0/50

#

and doesnt move

#

can u type step by step what i should do to fix this (sorry if im being to annoying

simple ore Jul 5, 2025, 3:03 PM

#

You likely have cpu torch

#

reinstall cu128

long obsidian Jul 5, 2025, 3:03 PM

#

with this command? pip install torch torchvision --upgrade --index-url https://download.pytorch.org/whl/cu128

simple ore Jul 5, 2025, 3:04 PM

#

activate the environment

#

then run that

long obsidian Jul 5, 2025, 3:04 PM

#

okey

#

#

it says this

simple ore Jul 5, 2025, 3:04 PM

#

or venv\scripts\python -m pip install torch torchvision --upgrade --index-url https://download.pytorch.org/whl/cu128

#

i dunno why you're using conda

long obsidian Jul 5, 2025, 3:05 PM

#

should i delete it

#

and start anew

simple ore Jul 5, 2025, 3:05 PM

#

regular python venv

long obsidian Jul 5, 2025, 3:05 PM

#

and if i delete conda and the file itself (conda create -n omnigen2 python=3.11
conda activate omnigen2) i change the conda part here to venv?

simple ore Jul 5, 2025, 3:07 PM

#

long obsidian Jul 5, 2025, 3:07 PM

#

oh okey

#

and after making this enviroment do i have to activate it or just go on?

simple ore Jul 5, 2025, 3:08 PM

#

#

after that is done, pip install torch torchvision --upgrade --index-url https://download.pytorch.org/whl/cu128

#

and pip install flash.whl

long obsidian Jul 5, 2025, 3:09 PM

#

i will try that rn

sinful mango Jul 5, 2025, 3:12 PM

#

viral mason Hey there, I can help you create a voice model of the person you're talking abou...

Would love that ! I'm all ears !

viral mason Jul 5, 2025, 3:13 PM

#

Alrighty I can help out in dms since these two are doing nerd stuff 👍

toxic zenith Jul 5, 2025, 3:19 PM

#

low shard This is a **General AI Server, we won't be focused on voices anymore** Elaborat...

how can I make custom text to speech

#

I mean using custom models, not pretrain models

viral mason Jul 5, 2025, 3:32 PM

#

I think they're talking about something like uberduck

#

Like how it had TF2 tts

long obsidian Jul 5, 2025, 3:35 PM

#

simple ore and pip install flash.whl

i did what u told me and it still stays like this

toxic zenith Jul 5, 2025, 3:36 PM

#

yeah, for text to speech

#

i want to make my own sound on decade driver

viral mason Jul 5, 2025, 3:39 PM

#

I dunno how the rvc works I just know how to clean datasets and how to read graph

#

Trust

#

Whoever is typing RN your name is all rectangles lmao

#

It's scary

#

I am proof that anyone can make a voice model as long as they try

analog obsidian Jul 5, 2025, 3:44 PM

#

TTS are zero shot, no dataset needed, only a few seconds
rvc requires training, 10 mins minimum for increased consistency

#

fun fact: rvc core component (vits) is 'hacked' in order to do speech to speech conversion instead of tts

#

yup actually rvc first "name" was just vits

toxic zenith Jul 5, 2025, 3:51 PM

#

ok

#

so how can I create an AI voice model?

#

rvc pls

analog obsidian Jul 5, 2025, 3:52 PM

#

toxic zenith so how can I create an AI voice model?

https://docs.aihub.gg/essentials/how-to-make-voice-models/

How to Make Voice Models

In the context of RVC, the dataset is an audio file containing the voice the model will replicate. It can be either speaking or singing.

long obsidian Jul 5, 2025, 3:52 PM

#

@simple ore i tried reinstalling and following your order again but it stays the same just 0/50 0%

hallow thistle Jul 5, 2025, 3:52 PM

#

toxic zenith rvc pls

You mean RVC voice model and not TTS? Alright, there are ways to train a voice model.

analog obsidian Jul 5, 2025, 3:53 PM

#

https://www.bilibili.com/video/BV1A14y1a75R/ this is probably the very first rvc model ever made, when it was internally just named "vits" (since rvc is a modified vits)

simple ore Jul 5, 2025, 3:55 PM

#

long obsidian <@155030383648440320> i tried reinstalling and following your order again but it...

check venv/lib/site-packages folder and see what torch is installed

long obsidian Jul 5, 2025, 3:56 PM

#

simple ore check venv/lib/site-packages folder and see what torch is installed

torch-2.7.1+cu128.dist-info its this

#

rn im trying running it without the flash-attn to see if it will work

simple ore Jul 5, 2025, 3:56 PM

#

would it be easier for you to just use comfyUI?

tribal cosmos Jul 5, 2025, 3:56 PM

#

for live voice changers, is w okada still the best or is there something new?

simple ore Jul 5, 2025, 3:57 PM

#

tribal cosmos for live voice changers, is w okada still the best or is there something new?

-rt

patent trellisBOT Jul 5, 2025, 3:57 PM

#

simple ore -rt

💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

long obsidian Jul 5, 2025, 3:57 PM

#

simple ore would it be easier for you to just use comfyUI?

idk how to set it up and also still doesnt move without the file

#

i will try rq to reboot pc and see if it will work

tribal cosmos Jul 5, 2025, 3:57 PM

#

simple ore -rt

??

simple ore Jul 5, 2025, 3:57 PM

#

read the guide above

tribal cosmos Jul 5, 2025, 3:57 PM

#

oh thank you

simple ore Jul 5, 2025, 3:57 PM

#

long obsidian idk how to set it up and also still doesnt move without the file

https://docs.comfy.org/tutorials/image/omnigen/omnigen2

long obsidian Jul 5, 2025, 3:59 PM

#

simple ore https://docs.comfy.org/tutorials/image/omnigen/omnigen2

can u also send a guide for comfyui

tribal cosmos Jul 5, 2025, 3:59 PM

#

simple ore read the guide above

whats the difference bteween the top one and the 2nd one?

simple ore Jul 5, 2025, 3:59 PM

#

tribal cosmos whats the difference bteween the top one and the 2nd one?

Wokada Deiteris Fork

#

use that

tribal cosmos Jul 5, 2025, 4:00 PM

#

simple ore Wokada Deiteris Fork

okay thx

hallow thistle Jul 5, 2025, 4:00 PM

#

tribal cosmos whats the difference bteween the top one and the 2nd one?

Deiteris' fork W-Okada and "original" W-Okada are both versions of W-Okada, but they are developed separately by different authors. Deiteris' W-Okada is better. What are differences? Performance and some bug fixes, especially.

tribal cosmos Jul 5, 2025, 4:02 PM

#

hallow thistle Deiteris' fork W-Okada and "original" W-Okada are both versions of W-Okada, but ...

oh i see thanks

#

https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/

Deiteris' W Okada Fork

Last update: May 5, 2025

tribal cosmos Jul 5, 2025, 4:04 PM

#

tribal cosmos https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/

does anyone know where the download button is i cant find it 😭

toxic zenith Jul 5, 2025, 4:05 PM

#

hallow thistle You mean RVC voice model and not TTS? Alright, there are ways to train a voice m...

ok i know

#

how about tts

hallow thistle Jul 5, 2025, 4:06 PM

#

Don't.

hallow thistle Jul 5, 2025, 4:06 PM

#

tribal cosmos does anyone know where the download button is i cant find it 😭

What is your PC GPU? NVIDIA GeForce or AMD Radeon RX?

tribal cosmos Jul 5, 2025, 4:07 PM

#

hallow thistle What is your PC GPU? NVIDIA GeForce or AMD Radeon RX?

nvidia 3080

hallow thistle Jul 5, 2025, 4:07 PM

#

tribal cosmos nvidia 3080

https://cdn.discordapp.com/attachments/1159290139609137264/1371371778181431328/image.png

tribal cosmos Jul 5, 2025, 4:07 PM

#

hallow thistle https://cdn.discordapp.com/attachments/1159290139609137264/1371371778181431328/i...

thank you

toxic zenith Jul 5, 2025, 4:08 PM

#

ok

#

may be i try to tts in another app

tribal cosmos Jul 5, 2025, 4:09 PM

#

hallow thistle https://cdn.discordapp.com/attachments/1159290139609137264/1371371778181431328/i...

should i uninstall my old w okada??

toxic zenith Jul 5, 2025, 4:09 PM

#

then use rvc to emulate decadriver

hallow thistle Jul 5, 2025, 4:09 PM

#

tribal cosmos should i uninstall my old w okada??

Yes.

toxic zenith Jul 5, 2025, 4:09 PM

#

I think none of u all know about kr decade

hallow thistle Jul 5, 2025, 4:10 PM

#

toxic zenith I think none of u all know about kr decade

Yes, you guessed it right. I have no idea what Kr Decade even is.

toxic zenith Jul 5, 2025, 4:11 PM

#

hallow thistle Yes, you guessed it right. I have no idea what Kr Decade even is.

I tried to recreate custom decade driver sound for my decade driver

hallow thistle Jul 5, 2025, 4:12 PM

#

While Applio has TTS feature built-in, it's edge-tts, the RVC itself isn't TTS.

toxic zenith Jul 5, 2025, 4:12 PM

#

by training models using sounds from csm toys

#

and tv series using UVR5

hallow thistle Jul 5, 2025, 4:13 PM

#

toxic zenith I tried to recreate custom decade driver sound for my decade driver

Sound driver? I only know Intel/Realtek HD Audio and Creative Sound Blaster as sound card drivers.

toxic zenith Jul 5, 2025, 4:13 PM

#

hallow thistle Sound driver? I only know Intel/Realtek HD Audio and Creative Sound Blaster as s...

no

#

"toy" sound ok

hallow thistle Jul 5, 2025, 4:13 PM

#

cat_seriously

toxic zenith Jul 5, 2025, 4:14 PM

#

i don't mean audio driver fr

#

i mean changing sound in my decade driver bootleg toy model

tribal cosmos Jul 5, 2025, 4:26 PM

#

hallow thistle Yes.

do i have to run any of these bat files?

#

or do i just straight do the exe

viral mason Jul 5, 2025, 4:27 PM

#

The exe

tribal cosmos Jul 5, 2025, 4:27 PM

#

oh ok thx

#

also

#

how do i uninstall my old one

#

i had w okada

#

like 2 years ago

viral mason Jul 5, 2025, 4:27 PM

#

Just delete all files related to it

tribal cosmos Jul 5, 2025, 4:27 PM

#

thats it??

viral mason Jul 5, 2025, 4:27 PM

#

Should just be in that folder unless they got replaced by the new stuff

viral mason Jul 5, 2025, 4:27 PM

#

tribal cosmos thats it??

Yup

tribal cosmos Jul 5, 2025, 4:29 PM

#

viral mason Should just be in that folder unless they got replaced by the new stuff

when i run the new exe is it js gonna replace the old one?

viral mason Jul 5, 2025, 4:29 PM

#

I don't think so

tribal cosmos Jul 5, 2025, 4:30 PM

#

oh wtf

viral mason Jul 5, 2025, 4:30 PM

#

Pretty sure the files would've already been replaced by default unless you chose to skip or smth or didn't get that option after extraction

tribal cosmos Jul 5, 2025, 4:31 PM

#

viral mason Pretty sure the files would've already been replaced by default unless you chose...

oh i mean like the acutal w okada

viral mason Jul 5, 2025, 4:32 PM

#

Wdym

#

I'm slow

#

:3

tribal cosmos Jul 5, 2025, 4:32 PM

#

nahh its okay lmaooo

#

yo i have another quesiton

viral mason Jul 5, 2025, 4:32 PM

#

Yah?

tribal cosmos Jul 5, 2025, 4:32 PM

#

i havent doen any ai voice stuff in years

#

but when i was making models

#

i used to use appolio

#

is there a new one ppl are using now or is it still apollio

#

becuase i feel like alot has probbaly changed

viral mason Jul 5, 2025, 4:34 PM

#

There's applio and mainline

#

And local rvc stuff which idk anything about

#

But applio still exists ya

#

Nothing new I know about

long obsidian Jul 5, 2025, 4:36 PM

#

@simple ore i tried doing the comfyui one u suggested but im always getting this error

simple ore Jul 5, 2025, 4:37 PM

#

long obsidian <@155030383648440320> i tried doing the comfyui one u suggested but im always ge...

read the link, download the files, place them into right places

long obsidian Jul 5, 2025, 4:37 PM

#

i did that

tribal cosmos Jul 5, 2025, 4:38 PM

#

viral mason There's applio and mainline

ohhh ok thanks

long obsidian Jul 5, 2025, 4:39 PM

#

simple ore read the link, download the files, place them into right places

tribal cosmos Jul 5, 2025, 4:39 PM

#

long obsidian Jul 5, 2025, 4:39 PM

#

here for an example

tribal cosmos Jul 5, 2025, 4:39 PM

#

is this bad?

viral mason Jul 5, 2025, 4:39 PM

#

Uhh

simple ore Jul 5, 2025, 4:39 PM

#

sweeet mother of god

viral mason Jul 5, 2025, 4:39 PM

#

Try running it again

tribal cosmos Jul 5, 2025, 4:40 PM

#

okk imma try

long obsidian Jul 5, 2025, 4:41 PM

#

simple ore sweeet mother of god

what 😭

simple ore Jul 5, 2025, 4:43 PM

#

#

works fine

long obsidian Jul 5, 2025, 4:44 PM

#

simple ore

simple ore Jul 5, 2025, 4:44 PM

#

with the files placed into right places

long obsidian Jul 5, 2025, 4:44 PM

#

i have same settings same stuff

silver lynx Jul 5, 2025, 4:46 PM

#

simple ore https://rentry.co/forkvoicechangerguide#known-working-settings-for-chunk-and-ext...

oh right for this, im kinda stupid but the highest amd card on it is 7xxx XT

im getting a 9070 XT, so im wondering what applies to that

tribal cosmos Jul 5, 2025, 4:48 PM

#

viral mason Try running it again

okay i think it finished, is it supposed ot open in ur broswer right?

viral mason Jul 5, 2025, 4:48 PM

#

tribal cosmos okay i think it finished, is it supposed ot open in ur broswer right?

Yup!

#

Btw if u wanted we could move the convo to dms

tribal cosmos Jul 5, 2025, 4:48 PM

#

ohh okayyy

viral mason Jul 5, 2025, 4:48 PM

#

Ye

long obsidian Jul 5, 2025, 5:00 PM

#

@simple ore is it possible for me to screenshare for us to fix the omnigen2 local version in vc here?

#

pls i wanna kym at this point 😭

simple ore Jul 5, 2025, 5:14 PM

#

long obsidian <@155030383648440320> is it possible for me to screenshare for us to fix the omn...


cd OmniGen2

py -3.11 -m venv venv

venv/scripts/activate

pip install -r requirements.txt

pip install torch torchvision --upgrade --index-url https://download.pytorch.org/whl/cu128

pip install https://huggingface.co/lldacing/flash-attention-windows-wheel/resolve/main/flash_attn-2.7.4.post1%2Bcu128torch2.7.0cxx11abiFALSE-cp311-cp311-win_amd64.whl

python inference.py --model_path "OmniGen2/OmniGen2" --num_inference_step 50  --height 1024 --width 1024 --text_guidance_scale 4.0 --instruction "The sun rises slightly, the dew on the rose petals in the garden is clear, a crystal ladybug is crawling to the dew, the background is the early morning garden, macro lens." --output_image_path outputs/output_t2i.png --num_images_per_prompt 1```

#

long obsidian Jul 5, 2025, 5:15 PM

#

i will try it now

long obsidian Jul 5, 2025, 5:34 PM

#

simple ore ```git clone https://github.com/VectorSpaceLab/OmniGen2 cd OmniGen2 py -3.11 -...

its still sitting on 0

simple ore Jul 5, 2025, 5:38 PM

#

show vram use

#

add --enable_model_cpu_offload parameter to inference call

long obsidian Jul 5, 2025, 5:43 PM

#

simple ore show vram use

simple ore Jul 5, 2025, 5:43 PM

#

that parameter should help

long obsidian Jul 5, 2025, 5:43 PM

#

simple ore add `--enable_model_cpu_offload` parameter to inference call

let me try this, so i add it at the back of this python inference.py --model_path "OmniGen2/OmniGen2" --num_inference_step 50 --height 1024 --width 1024 --text_guidance_scale 4.0 --instruction "The sun rises slightly, the dew on the rose petals in the garden is clear, a crystal ladybug is crawling to the dew, the background is the early morning garden, macro lens." --output_image_path outputs/output_t2i.png --num_images_per_prompt 1

simple ore Jul 5, 2025, 5:44 PM

#

at the end

long obsidian Jul 5, 2025, 5:44 PM

#

alr

long obsidian Jul 5, 2025, 5:47 PM

#

simple ore at the end

its working now, will it work if i do python app.py --enable_model_cpu_offload

simple ore Jul 5, 2025, 5:50 PM

#

yes

#

#

can try the other too

long obsidian Jul 5, 2025, 5:50 PM

#

can u send it

#

as code

shadow orbit Jul 5, 2025, 5:51 PM

#

help

#

mic is working but i can't hear myself

#

on okada voice changer

#

:((

forest vector Jul 5, 2025, 5:52 PM

#

for some reason, the "2025-07-03 22:02:27,643 ERROR [VoiceChangerManager] CUDA error: an illegal memory access was encountered
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1
Compile with TORCH_USE_CUDA_DSA to enable device-side assertions.
"- error is fixed when I run a game of sorts on the same PC.

forest vector Jul 5, 2025, 5:53 PM

#

shadow orbit mic is working but i can't hear myself

you mean, you cannot hear yourself, as in your real voice or your ai-converted voice ?

simple ore Jul 5, 2025, 5:54 PM

#

long obsidian can u send it

python app.py --enable_model_cpu_offload

#

and/or --enable_sequential_cpu_offload

shadow orbit Jul 5, 2025, 5:56 PM

#

forest vector you mean, you cannot hear yourself, as in your real voice or your ai-converted v...

both

#

no sound at all

#

when i hit passthru i hear myself tho

forest vector Jul 5, 2025, 5:57 PM

#

probably the voice isnt converting

shadow orbit Jul 5, 2025, 5:58 PM

#

forest vector probably the voice isnt converting

how do i fix that 😭

#

i sent a screenshot of my options in #1192011222023950368

long obsidian Jul 5, 2025, 6:07 PM

#

simple ore `python app.py --enable_model_cpu_offload`

thanks it works perfect now

hushed thicket Jul 5, 2025, 6:12 PM

#

how do i add a voice model

low shard Jul 5, 2025, 6:14 PM

#

hushed thicket how do i add a voice model

Elaborate:

your PC GPU
your operative system
what you want to do
what tutorial link are you using
a screenshot of the program

#

This is a general AI server, we can't know which program you're talking about, so that's why we need more information on the questions I asked you please

hushed thicket Jul 5, 2025, 6:15 PM

#

oh i downloaded the realtime voice changer and a voice that I want and im confused on how to add it

#

is there a tutorial video?

low shard Jul 5, 2025, 6:15 PM

#

hushed thicket oh i downloaded the realtime voice changer and a voice that I want and im confus...

please reply to the questions I asked you, there's thousands of different programs

#

and your pc gpu and operative system is crucial too

low shard Jul 5, 2025, 6:16 PM

#

hushed thicket is there a tutorial video?

also, there's no updated video tutorial for realtime voice changing, they mostly use old programs, did you follow one?

hushed thicket Jul 5, 2025, 6:17 PM

#

yea i followed a old one

low shard Jul 5, 2025, 6:17 PM

#

hushed thicket yea i followed a old one

if you followed a video tutorial, you can just delete everything you got off it, you probably got original wokada like version 1.5.3.8 and vb audio cable

#

AI runs at sonic speed, youtube tuts aren't the best for ai programs, they get outdated easily

hushed thicket Jul 5, 2025, 6:18 PM

#

low shard Elaborate: - your PC GPU - your operative system - what you want to do - what t...

4070, windows, use a voice, https://www.youtube.com/watch?v=We5oYpCR3WQ, I cant upload images

low shard Jul 5, 2025, 6:18 PM

#

it's best you also forget everything they tell you in it, they also tell outdated info like using "crepe"

low shard Jul 5, 2025, 6:18 PM

#

hushed thicket 4070, windows, use a voice, https://www.youtube.com/watch?v=We5oYpCR3WQ, I cant ...

yeah, that youtube tutorial is outdated asf lol

#

also, we don't endorse anything that duckus does

#

duckus is an horrible person

hushed thicket Jul 5, 2025, 6:19 PM

#

ooh i didnt know that

#

i just wanted a simple tut

low shard Jul 5, 2025, 6:19 PM

#

hushed thicket ooh i didnt know that

duckus describes himself as a "certified catfisher"

#

he makes money off catfishing people for the pure fun of it

#

AI should be used for good and fun, not for catfishing

hushed thicket Jul 5, 2025, 6:20 PM

#

yeah i agree

#

I want to use it to troll my friends and some randoms

low shard Jul 5, 2025, 6:20 PM

#

hushed thicket yeah i agree

-realtime

patent trellisBOT Jul 5, 2025, 6:20 PM

#

low shard -realtime

💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

low shard Jul 5, 2025, 6:21 PM

#

read up the 1st link, wokada deiteris fork

#

this is the only updated tutorial

#

wokada deiteris fork got various improvements in performance and quality

hushed thicket Jul 5, 2025, 6:23 PM

#

ok i installed it

#

and the virtual cable

low shard Jul 5, 2025, 6:24 PM

#

hushed thicket and the virtual cable

nice, let me know for any issues

hushed thicket Jul 5, 2025, 6:24 PM

#

how do i input a voice

low shard Jul 5, 2025, 6:25 PM

#

hushed thicket how do i input a voice

like add a model?

hushed thicket Jul 5, 2025, 6:25 PM

#

yeah

low shard Jul 5, 2025, 6:25 PM

#

hushed thicket yeah

https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#adding-models

Deiteris' W Okada Fork

Last update: May 5, 2025

#

also, if you share a screenshot of your wokada, i can help you with settings

shadow flax Jul 5, 2025, 6:25 PM

#

can someone give me a simple video on how to install RVC :,) ?
(NVIDIA, WIN11)

low shard Jul 5, 2025, 6:26 PM

#

shadow flax can someone give me a simple video on how to install RVC :,) ? (NVIDIA, WIN11)

please tell me your pc gpu and what you want to do

shadow flax Jul 5, 2025, 6:27 PM

#

low shard please tell me your pc gpu and what you want to do

5070ti, use realtime voice changer (RVC), the part as to how i can use it in games/calls i know myself i just cant find the right RVC to download

hushed thicket Jul 5, 2025, 6:27 PM

#

low shard https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#adding-mode...

everytime I download on the thing it says to create a covere

low shard Jul 5, 2025, 6:27 PM

#

shadow flax 5070ti, use realtime voice changer (RVC), the part as to how i can use it in gam...

realtime voice changer (RVC),
Yeah that's why I asked what you want to do, RVC doesn't mean that, it means Retrieval-based-Voice-Conversion

#

RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.

Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)

#

I think what you want is actually wokada deiteris fork, right?

low shard Jul 5, 2025, 6:28 PM

#

hushed thicket everytime I download on the thing it says to create a covere

share the link of the model you're trying to download

hushed thicket Jul 5, 2025, 6:29 PM

#

https://www.weights.com/models/cmc8m0p4p00uon915fz7qukh3

shadow flax Jul 5, 2025, 6:30 PM

#

low shard I think what you want is actually wokada deiteris fork, right?

oh sorry seems i was mistaken then, i just remember having used it on AMD like a year ago from "okada".
whichever you think fits best ill get then, heres a picture of the UI i remembered

low shard Jul 5, 2025, 6:30 PM

#

shadow flax oh sorry seems i was mistaken then, i just remember having used it on AMD like a...

that one is a pretty outdated version of original wokada, wouldn't be suggested anymore

#

-realtime

patent trellisBOT Jul 5, 2025, 6:30 PM

#

low shard -realtime

💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

low shard Jul 5, 2025, 6:30 PM

#

read up the 1st link, wokada deiteris fork

#

it has better quality and performance

#

and it supports the rtx 50 serie

shadow flax Jul 5, 2025, 6:31 PM

#

low shard it has better quality and performance

oh thats cool, never seen it in tutorials like you said. thanks

low shard Jul 5, 2025, 6:31 PM

#

shadow flax oh thats cool, never seen it in tutorials like you said. thanks

because all youtube tutorials are outdated lol

#

AI moves at sonic speed, dont trust yt tuts for everything

hushed thicket Jul 5, 2025, 6:32 PM

#

low shard share the link of the model you're trying to download

https://www.weights.com/models/cmc8m0p4p00uon915fz7qukh3

low shard Jul 5, 2025, 6:33 PM

#

hushed thicket https://www.weights.com/models/cmc8m0p4p00uon915fz7qukh3

make a weights.com account, click the 3 dots, then download

hushed thicket Jul 5, 2025, 6:36 PM

#

low shard make a weights.com account, click the 3 dots, then download

it still says select a model first after I uploaded the pth

golden walrus Jul 5, 2025, 6:36 PM

#

low shard make a weights.com account, click the 3 dots, then download

Sir Nick. Can i ask if there is a way to compress my voice effectively? Cuz when i use Wokada or Vonovox, discord will "bonk" it and make it pretty robotic and unnatural somehow

low shard Jul 5, 2025, 6:36 PM

#

hushed thicket it still says select a model first after I uploaded the pth

show a screenshot of your wokada

hushed thicket Jul 5, 2025, 6:37 PM

#

golden walrus Jul 5, 2025, 6:37 PM

#

I have EQ band and compressor enable

low shard Jul 5, 2025, 6:37 PM

#

hushed thicket

click close
click the model slot
click start

golden walrus Jul 5, 2025, 6:37 PM

#

POPOcat

shadow flax Jul 5, 2025, 6:37 PM

#

do i need to download all 3?

low shard Jul 5, 2025, 6:37 PM

#

also, you didn't set up the settings, show an entire screenshot

shadow flax Jul 5, 2025, 6:37 PM

#

low shard Jul 5, 2025, 6:37 PM

#

golden walrus Sir Nick. Can i ask if there is a way to compress my voice effectively? Cuz when...

like, there's cracking?

golden walrus Jul 5, 2025, 6:38 PM

#

ye, sharp decline in quality

low shard Jul 5, 2025, 6:38 PM

#

shadow flax do i need to download all 3?

yes

analog obsidian Jul 5, 2025, 6:38 PM

#

golden walrus I have EQ band and compressor enable

bitcrush

golden walrus Jul 5, 2025, 6:38 PM

#

emoji_141 i know it has something to do with the bitrate

hushed thicket Jul 5, 2025, 6:38 PM

#

what settings do u recomend?

golden walrus Jul 5, 2025, 6:39 PM

#

analog obsidian bitcrush

cat_blush

#

on it, sire

low shard Jul 5, 2025, 6:41 PM

#

hushed thicket what settings do u recomend?

chunk: 80ms
extra: 2.7
f0: rmvpe
input: microphone
output: line 1
monitor: headphones, optional to hear urself

golden walrus Jul 5, 2025, 6:41 PM

#

wait, is it the same with bad mic effect?

#

cat_pawbite

mystic sky Jul 5, 2025, 6:42 PM

#

What is the most reliable or update version of this AI in real-time?

analog obsidian Jul 5, 2025, 6:42 PM

#

golden walrus wait, is it the same with bad mic effect?

yea, bitcrushing simulates how the audio would sound if it were of low quality

low shard Jul 5, 2025, 6:44 PM

#

mystic sky What is the most reliable or update version of this AI in real-time?

This is a General AI Server, we won't be focused on voices anymore

Elaborate:

your PC GPU
your operative system
what you want to do
what tutorial link are you using
a screenshot of the program

golden walrus Jul 5, 2025, 6:46 PM

#

analog obsidian yea, bitcrushing simulates how the audio would sound if it were of low quality

do voicemeeter have it built-in? or do you have any recommendation for the external app? POPOcat i really hope to get a cleaner voice in low quality

analog obsidian Jul 5, 2025, 6:47 PM

#

golden walrus do voicemeeter have it built-in? or do you have any recommendation for the exter...

idk i dont use filters in realtime

golden walrus Jul 5, 2025, 6:47 PM

#

emoji_150

analog obsidian Jul 5, 2025, 6:47 PM

#

but the docs recommends one app

#

1 sec

shadow orbit Jul 5, 2025, 6:47 PM

#

whaaa

analog obsidian Jul 5, 2025, 6:48 PM

#

golden walrus do voicemeeter have it built-in? or do you have any recommendation for the exter...

https://docs.aihub.gg/rvc-voice-changer/realism/
this

Realism

Last update: July 4, 2025

golden walrus Jul 5, 2025, 6:48 PM

#

ah Kiloheart

analog obsidian Jul 5, 2025, 6:48 PM

#

yea just get kiloheart bitcrusher

golden walrus Jul 5, 2025, 6:49 PM

#

cat_pawbite hmmm, i hope i can listen playback so i can adjust the stat to get cleaner

shadow orbit Jul 5, 2025, 6:49 PM

#

can someone help with models upload..

shadow orbit Jul 5, 2025, 6:49 PM

#

shadow orbit whaaa

that thing comes up when i upload anything from #1175430844685484042

shadow flax Jul 5, 2025, 6:50 PM

#

low shard yes

its not letting me extract the zip 😭

golden walrus Jul 5, 2025, 6:50 PM

#

POPOcat damn, the playback sound so good but discord just bonk me hard

shadow flax Jul 5, 2025, 6:51 PM

#

whats this even meaaan 😭

#

im actually gonna crashout

#

why can i unzip like everything but not this particular zip

#

dw i got it extracting using a different tool

viral mason Jul 5, 2025, 7:05 PM

#

hushed thicket how do i add a voice model

#

that simple

#

no need to read

#

just

#

do that

#

in fact

#

that video should be pinned

#

it's easy

#

videos like these should be in the server for simple people like me

raven barn Jul 5, 2025, 7:10 PM

#

simple ore i require a screenshot of how you done it

#

i already have it installed

#

but it still doesnt work

analog obsidian Jul 5, 2025, 7:10 PM

#

viral mason

how many models u have lmao

viral mason Jul 5, 2025, 7:10 PM

#

analog obsidian how many models u have lmao

uhhh

raven barn Jul 5, 2025, 7:10 PM

#

raven barn

viral mason Jul 5, 2025, 7:10 PM

#

lemme look

shadow flax Jul 5, 2025, 7:12 PM

#

patent trellis

whys nordvpn telling me this aint safe 😔 the WebUI

simple ore Jul 5, 2025, 7:12 PM

#

raven barn

looks good

viral mason Jul 5, 2025, 7:12 PM

#

shadow flax whys nordvpn telling me this aint safe 😔 the WebUI

you replied to a bot lol

simple ore Jul 5, 2025, 7:12 PM

#

altough I dont like your ( ) folder

shadow flax Jul 5, 2025, 7:12 PM

#

viral mason you replied to a bot lol

i knoe

raven barn Jul 5, 2025, 7:12 PM

#

simple ore altough I dont like your ( ) folder

whats wrong with it

viral mason Jul 5, 2025, 7:12 PM

#

analog obsidian how many models u have lmao

currently 115 models

shadow flax Jul 5, 2025, 7:13 PM

#

where can i find the command line?

analog obsidian Jul 5, 2025, 7:13 PM

#

viral mason currently 115 models

misc_baffled

viral mason Jul 5, 2025, 7:14 PM

#

analog obsidian <:misc_baffled:1159397286833557566>

https://cdn.discordapp.com/attachments/936087908631343105/1278441206451273748/what_did_bro_expect.gif

raven barn Jul 5, 2025, 7:14 PM

#

simple ore altough I dont like your ( ) folder

when i try to launch appolio it says this

#

ive never had this issue

shadow flax Jul 5, 2025, 7:14 PM

#

shadow flax where can i find the command line?

#

whys everything not worky

#

idk if i leak my IP by sending that or not lol

viral mason Jul 5, 2025, 7:16 PM

#

let's go to this guy's house and fix it for him personally

raven barn Jul 5, 2025, 7:16 PM

#

okay i got it to work

simple ore Jul 5, 2025, 7:17 PM

#

raven barn when i try to launch appolio it says this

it tell you what the problem is

raven barn Jul 5, 2025, 7:17 PM

#

i fixed it

simple ore Jul 5, 2025, 7:17 PM

#

shadow flax

the other window with a bunch of text

raven barn Jul 5, 2025, 7:19 PM

#

is this even doing anything wtf

shadow flax Jul 5, 2025, 7:19 PM

#

simple ore the other window with a bunch of text

simple ore Jul 5, 2025, 7:20 PM

#

raven barn is this even doing anything wtf

show the preprocess step screenshot

simple ore Jul 5, 2025, 7:20 PM

#

shadow flax

you did not add models or something

shadow flax Jul 5, 2025, 7:20 PM

#

simple ore you did not add models or something

hmm i thought i did

raven barn Jul 5, 2025, 7:20 PM

#

simple ore show the preprocess step screenshot

all my wav and audio files i just put in there

simple ore Jul 5, 2025, 7:21 PM

#

raven barn all my wav and audio files i just put in there

remove "

raven barn Jul 5, 2025, 7:21 PM

#

both of them?

#

or just hte last one

simple ore Jul 5, 2025, 7:21 PM

#

or just use a different path c:\training\data

#

put your files there

shadow flax Jul 5, 2025, 7:21 PM

#

why does it not want to take the model? (does it have to be in the actual WebUI/app directory?)

#

it literally just does not want to take the model

#

its even in the model -> 0 something directory

#

simple ore Jul 5, 2025, 7:29 PM

#

shadow flax its even in the model -> 0 something directory

close the app, find the model folder, and nuke it

viral mason Jul 5, 2025, 7:29 PM

#

raven barn all my wav and audio files i just put in there

why does it say diddy

simple ore Jul 5, 2025, 7:29 PM

#

shadow flax

illegal combination is using WASAPI input and MME output

#

use both WASAPI

shadow flax Jul 5, 2025, 7:30 PM

#

simple ore close the app, find the model folder, and nuke it

delete?

raven barn Jul 5, 2025, 7:30 PM

#

viral mason why does it say diddy

Heh

shadow flax Jul 5, 2025, 7:30 PM

#

simple ore illegal combination is using WASAPI input and MME output

how do i know whats what

simple ore Jul 5, 2025, 7:31 PM

#

input and output devices

#

use both WASAPI type

shadow flax Jul 5, 2025, 7:34 PM

#

simple ore input and output devices

btw what rest of the settings should i use?

#

also how can i delete the "saved" audio

long obsidian Jul 5, 2025, 7:40 PM

#

if i have 32k sample voice can i use text to speach and save the wav file to extract the voice and train it to higher samples?

simple ore Jul 5, 2025, 7:42 PM

#

long obsidian if i have 32k sample voice can i use text to speach and save the wav file to ext...

tts are usually 24k at best

long obsidian Jul 5, 2025, 7:43 PM

#

simple ore tts are usually 24k at best

is there a way i can increase the samples of a voice then

long obsidian Jul 5, 2025, 7:50 PM

#

simple ore tts are usually 24k at best

if i upload a wav file to applio which has a little bit of background sounds from games lets say will it do the extraction good

simple ore Jul 5, 2025, 7:50 PM

#

just use 32k audio

#

no, use a proper denoise / vocal extraction

long obsidian Jul 5, 2025, 8:11 PM

#

whats the best app that can isolate podcast sounds and make it pure voice audio

#

and whats the best app to download youtube videos as audio files

brittle wing Jul 5, 2025, 8:33 PM

#

how can I fuse 2 models in applio ?

narrow sun Jul 5, 2025, 8:34 PM

#

I have so much ms can someone help me

neat grove Jul 5, 2025, 9:04 PM

#

ok so will gtx 1650 sup + ryzen 3 3100 will be good on w-Okada cause i hear a lot of background even if i put steelseries gg mic or when i talk on my native language i don't think it's trained on my language cause like there is some word that ai can't say it and it will be obvious to anyone that it's ai

forest vector Jul 5, 2025, 9:11 PM

#

neat grove ok so will gtx 1650 sup + ryzen 3 3100 will be good on w-Okada cause i hear a lo...

voice changers only fool a very small portion of the active gaming population at best

#

maybe if you do male to male conversion, and u already sound a bit like the model itself

pine star Jul 5, 2025, 9:13 PM

#

where do you guys usually get sources from to train ai for rcv?

neat grove Jul 5, 2025, 9:15 PM

#

forest vector voice changers only fool a very small portion of the active gaming population at...

when you have like a good device you can change small thing and no one will notice, ofc the best will be like the creative sound blaster SB0490 but these getting like hella expensive

neat grove Jul 5, 2025, 9:16 PM

#

forest vector maybe if you do male to male conversion, and u already sound a bit like the mode...

even if i do i will have some weird ai sound like crashing lol

forest vector Jul 5, 2025, 9:16 PM

#

what setting do you do your crossfade at

torpid turtle Jul 5, 2025, 9:17 PM

#

hii all

#

just found a perfect live walpaper but its not well looped

#

you can clearly see that the vid replays each time

#

is there an ai tha can help with this?

shadow flax Jul 5, 2025, 9:20 PM

#

could someone share with me what the optimal settings are for Wokada Deiteris Fork?

viral mason Jul 5, 2025, 9:27 PM

#

kaggle applio keeps doing this

#

it's pissing me off

neat grove Jul 5, 2025, 9:28 PM

#

forest vector what setting do you do your crossfade at

like ?

forest vector Jul 5, 2025, 9:30 PM

#

id say 0,12-0,15s

#

depending on the results u get

umbral frigate Jul 5, 2025, 9:30 PM

#

forest vector voice changers only fool a very small portion of the active gaming population at...

You sure?

forest vector Jul 5, 2025, 9:31 PM

#

pretty sure, or everything I heard till now wasnt the best there is

umbral frigate Jul 5, 2025, 9:32 PM

#

forest vector pretty sure, or everything I heard till now wasnt the best there is

Yeah, i've trolled many times and if I give them a heads up that its a voice changer they will pick up on it but 90% of the time it goes undetected lol

#

Tbf i do think the model sounds realistic

forest vector Jul 5, 2025, 9:33 PM

#

are breathing, and other misc sounds also realistic ?

umbral frigate Jul 5, 2025, 9:33 PM

#

forest vector are breathing, and other misc sounds also realistic ?

Yeah normal speech is very realistic you can kinda hear the breath too ig?

#

But anything other than that is cooked LOL

forest vector Jul 5, 2025, 9:34 PM

#

ahh ... okay thought I missed something

umbral frigate Jul 5, 2025, 9:34 PM

#

So in my experience ive learned to just adapt to it

forest vector Jul 5, 2025, 9:34 PM

#

that helps alot ye

umbral frigate Jul 5, 2025, 9:34 PM

#

Bc ive been using the same one for a while i kinda know how to speak with it

neat grove Jul 5, 2025, 9:34 PM

#

forest vector id say 0,12-0,15s

ill send you photo so i can know whats your meaning, i'm not the best in eng but i can understand atleast very well

forest vector Jul 5, 2025, 9:35 PM

#

umbral frigate Bc ive been using the same one for a while i kinda know how to speak with it

if you dont mind, id like to hear a short sample as im curious

umbral frigate Jul 5, 2025, 9:35 PM

#

forest vector if you dont mind, id like to hear a short sample as im curious

No problem, but im trying to get it to work rn for some reason my whole thing stopped working

forest vector Jul 5, 2025, 9:35 PM

#

stage fright huh

umbral frigate Jul 5, 2025, 9:36 PM

#

forest vector stage fright huh

Nope im fine with sharing it but it wont go through voicemeeter

crude flame Jul 5, 2025, 9:36 PM

#

forest vector voice changers only fool a very small portion of the active gaming population at...

retruthed

forest vector Jul 5, 2025, 9:36 PM

#

umbral frigate Nope im fine with sharing it but it wont go through voicemeeter

I was referring to the voice model itself lmao

umbral frigate Jul 5, 2025, 9:36 PM

#

forest vector I was referring to the voice model itself lmao

Oh wdym

forest vector Jul 5, 2025, 9:37 PM

#

crude flame retruthed

ehh ... still dont think that it is true, it sounds obvious to me up until now

crude flame Jul 5, 2025, 9:37 PM

#

forest vector ehh ... still dont think that it is true, it sounds obvious to me up until now

rvc is very easy to spot

analog obsidian Jul 5, 2025, 9:39 PM

#

obviously people that dont know about ai will not immediately fall for this, but give them a couple of minutes talking with u and they're gonna spot its rvc quickly

pastel oak Jul 5, 2025, 9:40 PM

#

neat grove ok so will gtx 1650 sup + ryzen 3 3100 will be good on w-Okada cause i hear a lo...

gtx 1650 is ok, not great but ok, depends on your use case
dont bother trying to fool anyone in any language other than english its too obvious
"i hear a lot of background" if you mean noise then its microphone related, your mic is picking up every sound you make or that can be heard in the background

brittle wing Jul 5, 2025, 9:40 PM

#

does anyone know if this is a good guide for rx that I should follow for my models https://rentry.co/RVC-dataset-RX11#spectral-denoising-the-audio

analog obsidian Jul 5, 2025, 9:40 PM

#

that was literally how i learned about rvc, some random dude was talking with me and i started to notice his voice was weird

pastel oak Jul 5, 2025, 9:40 PM

#

i can giggle and moan with ai u guys got nothing on me

umbral frigate Jul 5, 2025, 9:41 PM

#

Idk why but my microphone works fine but it wont convert my voice everythings silent

torpid turtle Jul 5, 2025, 9:41 PM

#

chat

#

help please

umbral frigate Jul 5, 2025, 9:41 PM

#

pastel oak i can giggle and moan with ai u guys got nothing on me

Like wdym giggle

torpid turtle Jul 5, 2025, 9:41 PM

#

just found a perfect live walpaper but its not well looped
you can clearly see that the vid replays each time
is there an ai tha can help with this?

umbral frigate Jul 5, 2025, 9:41 PM

#

like hahaha?

neat grove Jul 5, 2025, 9:42 PM

#

pastel oak gtx 1650 is ok, not great but ok, depends on your use case dont bother trying to...

yeah, make sense for the language i think

crude flame Jul 5, 2025, 9:42 PM

#

analog obsidian obviously people that dont know about ai will not immediately fall for this, but...

vewy easy

forest vector Jul 5, 2025, 9:42 PM

#

pastel oak i can giggle and moan with ai u guys got nothing on me

if its rvc id reckon it sounds really obvious

low shard Jul 5, 2025, 9:42 PM

#

narrow sun I have so much ms can someone help me

This is a General AI Server, we won't be focused on voices anymore

Elaborate:

your PC GPU
your operative system
what you want to do
what tutorial link are you using
a screenshot of the program

umbral frigate Jul 5, 2025, 9:42 PM

#

crude flame vewy easy

an indicator is if the mic is too good ngl

analog obsidian Jul 5, 2025, 9:42 PM

#

crude flame vewy easy

both sounds very ai to me

umbral frigate Jul 5, 2025, 9:42 PM

#

ive been told if you make the mic shittier it sounds much more believable

low shard Jul 5, 2025, 9:42 PM

#

long obsidian and whats the best app to download youtube videos as audio files

yt-dlp

analog obsidian Jul 5, 2025, 9:42 PM

#

younger me would also notice it's ai

crude flame Jul 5, 2025, 9:43 PM

#

umbral frigate an indicator is if the mic is too good ngl

yea

but you can just bitcrush and that solves the problem

crude flame Jul 5, 2025, 9:43 PM

#

analog obsidian both sounds very ai to me

well one is real

umbral frigate Jul 5, 2025, 9:43 PM

#

crude flame yea but you can just bitcrush and that solves the problem

or that yea

low shard Jul 5, 2025, 9:43 PM

#

long obsidian whats the best app that can isolate podcast sounds and make it pure voice audio

https://docs.aihub.gg/rvc/resources/dataset-isolation/

Dataset & Isolation

Last update: May 5, 2025

umbral frigate Jul 5, 2025, 9:43 PM

#

crude flame well one is real

Its just placebo dude

analog obsidian Jul 5, 2025, 9:43 PM

#

crude flame well one is real

i mean the first one yeah

umbral frigate Jul 5, 2025, 9:43 PM

#

Shes speaking like a voice actor

analog obsidian Jul 5, 2025, 9:43 PM

#

i only heard the second one and i assumed the first one was also rvc but unedited lmao

crude flame Jul 5, 2025, 9:43 PM

#

analog obsidian i mean the first one yeah

the one with the dash at the end?

brittle wing Jul 5, 2025, 9:43 PM

#

is this the best guide to rx I should follow https://rentry.co/RVC-dataset-RX11#spectral-denoising-the-audio

forest vector Jul 5, 2025, 9:44 PM

#

analog obsidian i mean the first one yeah

the breath gave it away on the first one

neat grove Jul 5, 2025, 9:44 PM

#

crude flame vewy easy

for better result for no one to notice just make the base low

analog obsidian Jul 5, 2025, 9:44 PM

#

crude flame the one with the dash at the end?

never mind, both are rvc

crude flame Jul 5, 2025, 9:44 PM

#

analog obsidian never mind, both are rvc

no

umbral frigate Jul 5, 2025, 9:44 PM

#

LOL

crude flame Jul 5, 2025, 9:44 PM

#

one of them is real

#

and dont skip to the laugh

#

that will make it to easy

forest vector Jul 5, 2025, 9:45 PM

#

if the 2nd one was rvc id say we actually should gib up trying to recoqnize

pastel oak Jul 5, 2025, 9:45 PM

#

brittle wing is this the best guide to rx I should follow https://rentry.co/RVC-dataset-RX11#...

Most likely yes

analog obsidian Jul 5, 2025, 9:45 PM

#

crude flame well one is real

okay the second one is real

#

sorry i cheated

#

XD

brittle wing Jul 5, 2025, 9:45 PM

#

pastel oak Most likely yes

ok ty

crude flame Jul 5, 2025, 9:45 PM

#

analog obsidian sorry i cheated

😡

#

smhhh

analog obsidian Jul 5, 2025, 9:45 PM

#

forest vector if the 2nd one was rvc id say we actually should gib up trying to recoqnize

just look the spectograms lmaooo

crude flame Jul 5, 2025, 9:45 PM

#

forest vector if the 2nd one was rvc id say we actually should gib up trying to recoqnize

wrong

#

second one is real

pastel oak Jul 5, 2025, 9:46 PM

#

crude flame vewy easy

dunno if you pay attention to it like i do but can tell the 2nd is the real one based on the pronounciation of "I have" in the beginning joe_weird

#

and if the first one is real then i question my existence

analog obsidian Jul 5, 2025, 9:47 PM

#

people have to learn rvc can't be realistic no matter what u do

forest vector Jul 5, 2025, 9:47 PM

#

analog obsidian just look the spectograms lmaooo

if one needs to take one of these out, then id say we can fool 95% of the population

analog obsidian Jul 5, 2025, 9:47 PM

#

forest vector if one needs to take one of these out, then id say we can fool 95% of the popula...

it didnt fool me, the thing is i didnt bother in hearing them closely

#

razer always share that audio

crude flame Jul 5, 2025, 9:47 PM

#

forest vector if one needs to take one of these out, then id say we can fool 95% of the popula...

skip to 36 sec and listen

forest vector Jul 5, 2025, 9:47 PM

#

analog obsidian it didnt fool me, the thing is i didnt bother in hearing them closely

as with most conversations

analog obsidian Jul 5, 2025, 9:48 PM

#

rvc is 2023 tech

#

its not a voice cloning ai

#

what the ai is trying to do is literally reproducing mel specs and pitches

#

is not trying to clone expressions or shit because its not meant for that

#

all results are flat asf

crude flame Jul 5, 2025, 9:49 PM

#

vits2 rvc when?

analog obsidian Jul 5, 2025, 9:49 PM

#

good rvc models can fool the more casual side of the internet

#

but they will find out it's ai, rvc will glitch in any moment

forest vector Jul 5, 2025, 9:50 PM

#

analog obsidian people have to learn rvc can't be realistic no matter what u do

If I were you, I wouldnt bet on that

analog obsidian Jul 5, 2025, 9:50 PM

#

i have been training models since 2023

low shard Jul 5, 2025, 9:50 PM

#

crude flame vits2 rvc when?

when they make a good discord competitor

analog obsidian Jul 5, 2025, 9:51 PM

#

i know whats inside rvc, i know how it does the conversion

#

and i know it cant do everything

forest vector Jul 5, 2025, 9:51 PM

#

maybe not that specific way u have in mind it cant, I agree.

crude flame Jul 5, 2025, 9:51 PM

#

pretty much anything non verbal rvc cant do

analog obsidian Jul 5, 2025, 9:52 PM

#

the embedder is trash

forest vector Jul 5, 2025, 9:52 PM

#

but somewhere down the line I believe it will become hard to recoqnize

crude flame Jul 5, 2025, 9:52 PM

#

forest vector but somewhere down the line I believe it will become hard to recoqnize

eventually yes voice cloning will be that good but now its not

forest vector Jul 5, 2025, 9:52 PM

#

now its ... meh

analog obsidian Jul 5, 2025, 9:53 PM

#

forest vector but somewhere down the line I believe it will become hard to recoqnize

i agree with this, flat inferences are usually the most realistic sounding because those don't have expressions, so rvc doesnt struggle

umbral frigate Jul 5, 2025, 9:53 PM

#

crude flame eventually yes voice cloning will be that good but now its not

How would they even go abt that

forest vector Jul 5, 2025, 9:53 PM

#

but really impressive compared to what we had back in the day

crude flame Jul 5, 2025, 9:53 PM

#

no one cares about ai audio so nothing is happening to it 😔

forest vector Jul 5, 2025, 9:53 PM

#

but when I hear real voice actors im like "what am I doing 💀"

analog obsidian Jul 5, 2025, 9:53 PM

#

we do have real voice cloning ai

narrow sun Jul 5, 2025, 9:53 PM

#

low shard This is a **General AI Server, we won't be focused on voices anymore** Elaborat...

Yes i have bad pc.. i dont have gpu

analog obsidian Jul 5, 2025, 9:54 PM

#

analog obsidian we do have real voice cloning ai

but they're tts

crude flame Jul 5, 2025, 9:54 PM

#

umbral frigate How would they even go abt that

better embedder, vits2, better pretrains, better GAN/vocoder

umbral frigate Jul 5, 2025, 9:54 PM

#

analog obsidian we do have real voice cloning ai

elevenlabs?

analog obsidian Jul 5, 2025, 9:54 PM

#

umbral frigate elevenlabs?

eleven, chatterbox, yeah they're real voice cloning ai because the ai is actually learning and reproducing expressions

#

rvc doesn't learn expressions

umbral frigate Jul 5, 2025, 9:54 PM

#

analog obsidian eleven, chatterbox, yeah they're real voice cloning ai because the ai is actuall...

So you mean if elevenlabs became like sts?

#

Would be pretty nuts

analog obsidian Jul 5, 2025, 9:54 PM

#

when you give rvc an audio, it'll extract the mel spec and the pitch data alongisde the features of it

#

then it'll try to reproduce them afterwards

analog obsidian Jul 5, 2025, 9:55 PM

#

umbral frigate So you mean if elevenlabs became like sts?

no idea

umbral frigate Jul 5, 2025, 9:55 PM

#

Is there even an estimate for how far in the future until we get updates on rvc

analog obsidian Jul 5, 2025, 9:55 PM

#

rvc is SOTA

umbral frigate Jul 5, 2025, 9:55 PM

#

SOTA?

analog obsidian Jul 5, 2025, 9:56 PM

#

like the best sts

umbral frigate Jul 5, 2025, 9:56 PM

#

like no updates

analog obsidian Jul 5, 2025, 9:56 PM

#

the reason why it doesn't learn emotions is because is using a pretty old architecture named vits

crude flame Jul 5, 2025, 9:56 PM

#

umbral frigate like no updates

the og dev team left rvc to do tts

umbral frigate Jul 5, 2025, 9:56 PM

#

Why tf

analog obsidian Jul 5, 2025, 9:56 PM

#

because tts are superior

#

arch wise

#

they can learn emotions and non verbal sounds better

umbral frigate Jul 5, 2025, 9:57 PM

#

yeah but that means sts is neglected

#

if everyone on rvc went to tts

crude flame Jul 5, 2025, 9:57 PM

#

no one cares enough about sts to update it

umbral frigate Jul 5, 2025, 9:57 PM

#

i think sts is cool

analog obsidian Jul 5, 2025, 9:57 PM

#

updating sts it's a really hard task

analog obsidian Jul 5, 2025, 9:57 PM

#

umbral frigate i think sts is cool

me too

#

rvc interally it's a hack of a tts architecture actually

#

rvc-boss took vits, and did a couple of changes in order to "convert" it to sts

#

if we remove those changes, it'll be regular tts vits

umbral frigate Jul 5, 2025, 9:58 PM

#

So if there were to be strides in sts would they have to build it from the ground up?

#

Or can rvc as it is rn be improved

#

and its just shitty architecture

analog obsidian Jul 5, 2025, 9:59 PM

#

yea no the arch is too shit to be updated

umbral frigate Jul 5, 2025, 9:59 PM

#

yeah damn

analog obsidian Jul 5, 2025, 9:59 PM

#

some devs of here have tried