#✨│ai-help | AI HUB

dim needle Apr 10, 2026, 2:44 AM

#

how do i download the voice changer

hallow thistle Apr 10, 2026, 2:46 AM

#

dim needle how do i download the voice changer

What is your PC GPU? And what do you use the voice changer for?

dim needle Apr 10, 2026, 2:47 AM

#

5070ti

#

nvidia

#

nvidia gpu
im trying to use the voice changer
theres no issue i havent started

hallow thistle Apr 10, 2026, 2:56 AM

#

dim needle nvidia gpu im trying to use the voice changer theres no issue i havent started

Roleplay, girl voice or something? Sakithink

dim needle Apr 10, 2026, 2:57 AM

#

hallow thistle Roleplay, girl voice or something? Sakithink

for video/ fun not really girl voice

hallow thistle Apr 10, 2026, 2:58 AM

#

dim needle for video/ fun not really girl voice

There are Vonovox and Tg Develop's W-Okada fork, these are known voice changers that can work with GeForce RTX 50 series.

#

-realtime

patent trellisBOT Apr 10, 2026, 2:59 AM

#

hallow thistle -realtime

🔊 Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Tg-Develop Fork and extra features, but it supports only Nvidia GPUs on Windows 10/11, unlike other options that have wider support, and it has no cloud options.

Local Guide

• Wokada Tg-Develop Fork

A personal fork (modified version) of Wokada Deiteris Fork. It just adds some quality-of-life improvements, like support for Spin Embedder and Audio Effects. Don't expect too much from it, since the creator originally made it as a personal project.

• Applio Realtime

A Realtime Voice Changer with similar performance to Vonovox & Wokada Tg-Develop Fork, with extra features.

Local Guide

• Wokada Deiteris Fork

Deiteris' fork (modified version) of wokada that doesn't get updates anymore.

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested; the older versions shown in YouTube tutorials are even worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

dim needle Apr 10, 2026, 3:00 AM

#

hallow thistle There are Vonovox and Tg Develop's W-Okada fork, these are known voice changers ...

which ones the best?

hallow thistle Apr 10, 2026, 3:00 AM

#

dim needle which ones the best?

Vonovox gives better audio quality, while Tg Develop's W-Okada is easier to use. There are trade-offs.

viral mason Apr 10, 2026, 3:00 AM

#

dim needle which ones the best?

for nvidia I suggest vonovox

dim needle Apr 10, 2026, 3:01 AM

#

in the vids i was watching it said they were free are they?

viral mason Apr 10, 2026, 3:02 AM

#

dim needle in the vids i was watching it said they were free are they?

anything off yt for voice changers is outdated, but yes they're free

dim needle Apr 10, 2026, 3:04 AM

#

yeaa ik one of them showed this dc so im asking in here cuz theyre all 2 years old

#

how can i go about downloading vonovox?

viral mason Apr 10, 2026, 3:06 AM

#

dim needle how can i go about downloading vonovox?

1 moment I'll get the download

hallow thistle Apr 10, 2026, 3:09 AM

#

dim needle how can i go about downloading vonovox?

Download Vonovox. https://huggingface.co/dr87/vonovox/resolve/main/Vonovox_beta_17_11.zip

viral mason Apr 10, 2026, 3:11 AM

#

you need this too
https://software.muzychenko.net/freeware/vac470lite.zip

dim needle Apr 10, 2026, 3:14 AM

#

hallow thistle Download Vonovox. https://huggingface.co/dr87/vonovox/resolve/main/Vonovox_beta_...

you said vonovox wasnt really simple so is there any videos

hallow thistle Apr 10, 2026, 3:15 AM

#

dim needle you said vonovox wasnt really simple so is there any videos

https://docs.aihub.gg/realtime-voice-changer/local/vonovox/

Vonovox

Last update: March 30, 2026

dim needle Apr 10, 2026, 3:19 AM

#

"Many Effects are Premium (paid), such as Low Quality Mic" is it mostly like this? i just want to use models from the models channel not really any effects will i be fine

viral mason Apr 10, 2026, 3:25 AM

#

dim needle "Many Effects are Premium (paid), such as Low Quality Mic" is it mostly like thi...

yea

hallow thistle Apr 10, 2026, 3:25 AM

#

dim needle "Many Effects are Premium (paid), such as Low Quality Mic" is it mostly like thi...

Most core features in Vonovox are free, like using RVC voice models. The "paid" effects are optional and not really needed. Ado

viral mason Apr 10, 2026, 3:27 AM

#

those r optional

glass smelt Apr 10, 2026, 3:46 AM

#

voice changer

dim needle Apr 10, 2026, 3:53 AM

#

i downloaded it and its open but how do i use it just to hear myself

dim needle Apr 10, 2026, 4:29 AM

#

how can i delete vonovox and voice cable so i can redownload it

hallow thistle Apr 10, 2026, 4:32 AM

#

dim needle i downloaded it and its open but how do i use it just to hear myself

Be patient. Unlike W-Okada, though, Vonovox has only one output device. You either set the output device to your speakers in Vonovox or hear yourself through Discord. https://cdn.discordapp.com/attachments/1159290139609137264/1446358776587489351/image.png?ex=69d92654&is=69d7d4d4&hm=d4e3afdbe81daa1f372c0a8f4f8294c7f48fa662c501236c19e47a9253260a25&

dim needle Apr 10, 2026, 4:33 AM

#

hallow thistle Patient. Unlike W-Okada though, Vonovox only has one output device. You either s...

i set it to my speaker and line 1 and still couldnt hear myself on discord

hallow thistle Apr 10, 2026, 4:34 AM

#

I don't know, I can't identify the issue from your words alone. Send a screenshot here.

dim needle Apr 10, 2026, 4:35 AM

#

hallow thistle I don't know, I can't identify an issue from your words alone. Send your screens...

hallow thistle Apr 10, 2026, 4:36 AM

#

dim needle

Did you know? "Exclusive mode" is an audio mode in WASAPI/ASIO that makes one program (like Vonovox) the only program allowed to output sound on a device, muting any other programs that use the same device in the meantime. It's better to turn this mode off.

dim needle Apr 10, 2026, 4:37 AM

#

when i go on discord and set my speaker to line 1 i see it moving but cant hear anything and i turned exclusive off

hallow thistle Apr 10, 2026, 4:38 AM

#

You should do this.

#

In Vonovox, press the "Start" button to start converting.

dim needle Apr 10, 2026, 4:39 AM

#

dim needle Apr 10, 2026, 4:40 AM

#

hallow thistle You should do this.

shows thiss

hallow thistle Apr 10, 2026, 4:41 AM

#

Why does Vonovox work for others, though? Did you at least follow the guide I sent you?

dim needle Apr 10, 2026, 4:41 AM

#

hallow thistle Why Vonovox works for others though? Did you follow the guide I sent to you at l...

yes

hallow thistle Apr 10, 2026, 4:45 AM

#

Send your full screenshot of Vonovox.

dim needle Apr 10, 2026, 4:47 AM

#

hallow thistle Send your full screenshot of Vonovox.

#

i fixed it

hallow thistle Apr 10, 2026, 4:48 AM

#

You set the input device in Vonovox wrong.

dim needle Apr 10, 2026, 4:48 AM

#

thats what you said to put it to

#

it only works when i set it that way and then do the opposite on discord

hallow thistle Apr 10, 2026, 4:50 AM

#

On Vonovox: input is microphone, output is Line 1.
On Discord: input is Line 1, output is speaker.
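A minimal sketch of the routing above, in case it helps to see the chain spelled out (mic -> Vonovox -> Line 1 -> Discord -> speakers). The `Routing` class, the `check_routing` helper, and the device labels are hypothetical names made up for illustration; nothing here is part of Vonovox or Discord.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Routing:
    """Device choices in both apps (hypothetical labels, not a real API)."""
    vonovox_input: str
    vonovox_output: str
    discord_input: str
    discord_output: str


def check_routing(r: Routing, cable: str = "Line 1") -> list[str]:
    """Return a list of problems; an empty list means the chain looks right."""
    problems = []
    if r.vonovox_input == cable:
        problems.append("Vonovox input should be the microphone, not the cable")
    if r.vonovox_output != cable:
        problems.append("Vonovox output should be the cable")
    if r.discord_input != cable:
        problems.append("Discord input should be the cable")
    if r.discord_output == cable:
        problems.append("Discord output should be speakers/headphones, not the cable")
    return problems


# The setup recommended in the message above.
recommended = Routing("Microphone", "Line 1", "Line 1", "Speakers")
# The reversed setup: Discord's sounds would flow into Vonovox instead.
reversed_setup = Routing("Line 1", "Speakers", "Microphone", "Line 1")

print(check_routing(recommended))
for problem in check_routing(reversed_setup):
    print("-", problem)
```

The recommended setup produces an empty list, while the reversed one is flagged on every device choice.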

#

sad_cat

dim needle Apr 10, 2026, 4:50 AM

#

hallow thistle On Vonovox: input is microphone, output is Line 1. On Discord: input is Line 1, ...

it only works for me when i do opposite of this

hallow thistle Apr 10, 2026, 4:50 AM

#

Elaborate?

dim needle Apr 10, 2026, 4:51 AM

#

when i do line 1 as input on vono and then line 1 as speaker in discord it works kinda

hallow thistle Apr 10, 2026, 4:51 AM

#

That's not how it works.

dim needle Apr 10, 2026, 4:53 AM

#

its the only way i can hear myself with the voice changer for me

#

the pitch is off though, is there any way to fix it? it sounds high pitched, i tried 2 different ones

hallow thistle Apr 10, 2026, 5:07 AM

#

dim needle its the only way i can hear myself with the voice changer for me

To hear yourself with Vonovox, set the output device to your speakers, or go to Windows' Sound settings and do this.

dim needle Apr 10, 2026, 5:12 AM

#

how can i delete everything so i can restart

hallow thistle Apr 10, 2026, 5:16 AM

#

dim needle how can i delete everything so i can restart

dim needle Apr 10, 2026, 5:17 AM

#

what about voice cable

hallow thistle Apr 10, 2026, 5:17 AM

#

If you make the same mistake again, you should question yourself. I gave you the most widely agreed-upon approach, and you're literally doing the opposite.

dim needle Apr 10, 2026, 5:21 AM

#

hallow thistle If you made the same mistake for another time, you should question yourself. I w...

i did exactly what you said, it didnt work again, the only thing that worked was doing the opposite

hallow thistle Apr 10, 2026, 5:25 AM

#

#

If you set "Line 1" as the output in Vonovox and "Line 1" as the input in Discord, that is correct. But if you set "Line 1" as the speaker in Discord and "Line 1" as the input in Vonovox, that is incorrect, because you're going to send all of Discord's sounds (including ping sounds) to Vonovox through Line 1, instead of sending Vonovox to Discord as intended. You're just confused, bud.

#

If you don't believe me, you can ask fellow members here who have used a voice changer. misc_shrug

dim needle Apr 10, 2026, 5:36 AM

#

hallow thistle If you don't believe me, you can ask fellow members who used voice changer here....

i do just dosent work for me

hexed ruin Apr 10, 2026, 5:52 AM

#

One message removed from a suspended account.

hallow thistle Apr 10, 2026, 6:06 AM

#

hexed ruin One message removed from a suspended account.

This AMD Athlon CPU (released in 2018) isn't really that old, though it is positioned below AMD Ryzen 3. AMD Radeon Vega 3 is an integrated GPU, so probably skip that. Do you mean you want to run the voice changer CPU-only? Because of course it's going to be slower.

hexed ruin Apr 10, 2026, 6:36 AM

#

hallow thistle This AMD Athlon CPU (released in 2018) isn't really that old, though positioned ...

One message removed from a suspended account.

low shard Apr 10, 2026, 7:35 AM

#

which rvc related program?
Elaborate:

your pc os
what are you trying to do: AI Covers, TTS, E Girl Trolling / Catfish or Roleplay

brittle wing Apr 10, 2026, 9:29 AM

#

can someone help me'

low shard Apr 10, 2026, 9:34 AM

#

brittle wing can someone help me'

This is a General AI Discord Server, elaborate:

your pc gpu
your pc os
what are you trying to do: LLMs, AI Covers, TTS, E Girl Trolling / Catfishing or Roleplay
the tutorial link used

severe fiber Apr 10, 2026, 10:12 AM

#

ive downloaded everything and tried to open MMVCServer and the cmd prompt comes up to download stuff but ive done that 3 times now and the actual rvc prompt hasnt come up yet

#

4060 gpu

#

i5 8400

#

windows

tawny bane Apr 10, 2026, 10:15 AM

#

hey are EaseUS Voicewave and Voice ai good?

#

I want to have different voices that sound good and not 2022 choppy. both male and female

hallow thistle Apr 10, 2026, 10:19 AM

#

tawny bane I want to have differnet voices that sound well and not 2022 choppy. both male a...

Why male and female voice models? BocchiPlushStare

tawny bane Apr 10, 2026, 10:19 AM

#

FivemRP

severe fiber Apr 10, 2026, 10:19 AM

#

ive downloaded everything and tried to open MMVCServer and the cmd prompt comes up to download stuff but ive done that 3 times now and the actual rvc prompt hasnt come up yet
4060 gpu
i5 8400
windows

tawny bane Apr 10, 2026, 10:19 AM

#

we don't really do anything but humans

hallow thistle Apr 10, 2026, 10:21 AM

#

severe fiber ive downloaded evberything and tried to open MMVCServer and the cmd prompt comes...

Check out Vonovox. This voice changer gives better audio quality than any W-Okada version.

tawny bane Apr 10, 2026, 10:21 AM

#

@hallow thistle

severe fiber Apr 10, 2026, 10:21 AM

#

hallow thistle Check out Vonovox. This voice changer gives better audio quality that any W-Okad...

is there a guide for this?

hallow thistle Apr 10, 2026, 10:21 AM

#

<@&1159293140440723499> Hacked account in help channel.

severe fiber Apr 10, 2026, 10:21 AM

#

so should i upgrade from wokada to vonovox?

hallow thistle Apr 10, 2026, 10:21 AM

#

severe fiber is there a guide for this?

https://docs.aihub.gg/realtime-voice-changer/local/vonovox

Vonovox

Last update: March 30, 2026

severe fiber Apr 10, 2026, 10:21 AM

#

is it the same delay orrr

tawny bane Apr 10, 2026, 10:22 AM

#

tawny bane hey are EaseUS Voicewave and Voice ai good?

so are they good? are there any better alternatives? using windows 11 and gpu doesn't do well

hallow thistle Apr 10, 2026, 10:22 AM

#

severe fiber is it the same delay orrr

Am I supposed to answer these trivial questions or something?

hallow thistle Apr 10, 2026, 10:23 AM

#

tawny bane so are they good? are there any better alternatives? using windows 11 and gpu do...

Vonovox and Tg Develop's W-Okada are better. I'm not sure about the EaseUS one, but Voice.ai is a scam.

tawny bane Apr 10, 2026, 10:23 AM

#

understood ty

severe fiber Apr 10, 2026, 10:24 AM

#

hallow thistle Am I supposed to answer these trivial questions or something?

huh

#

like whats the downside to changing

hallow thistle Apr 10, 2026, 10:24 AM

#

Vonovox looks like this. https://cdn.discordapp.com/attachments/1159290139609137264/1446358776587489351/image.png?ex=69d9cf14&is=69d87d94&hm=6014e2861815b5c9e77cb975eb5cb5abf9c414f6bb8d860851f04fc7ec22ce0e&

low shard Apr 10, 2026, 12:22 PM

#

severe fiber ive downloaded evberything and tried to open MMVCServer and the cmd prompt comes...

what tutorial link are you using? Are you trying to do TTS, AI covers, E Girl Trolling / Catfishing or Roleplay?

low shard Apr 10, 2026, 12:22 PM

#

tawny bane hey are EaseUS Voicewave and Voice ai good?

we don't suggest those, it's better you elaborate if you want to hear the alternatives used here

indigo python Apr 10, 2026, 12:22 PM

#

Give a deep voice model

low shard Apr 10, 2026, 12:23 PM

#

indigo python Give a deep voice model

there are tons of #1175430844685484042 , which one are you looking for? like ben10 or e boy deep RVC Voice model to troll?

sage flume Apr 10, 2026, 12:59 PM

#

is it open-sourced model?

hallow thistle Apr 10, 2026, 1:13 PM

#

sage flume is it open-sourced model?

Behind the MVSEP website, many of the separation models and software (like UVR5) are open source.

untold marten Apr 10, 2026, 2:24 PM

#

Hola,

GPU: rtx4070 ti super 16GB vram
OS: Fedora KDE Plasma 43
What I am trying to do: I installed the wokada TG-develop fork and it works with the model. I want to send the output of the fork to another program like discord or anything else.
The Issue: I can select my mic as input, but when it comes to selecting the output device, I see no virtual cable showing despite having portaudio installed (did I miss anything from the docs?).
The tutorial link: The link from the docs on the TG-develop fork (Realtime Voice Changer > Local > TG Develop's)

I used the deiteris fork on windows before with vac and it all worked fine, but this is my first time trying the TG fork, and on linux with portaudio. From what I heard, portaudio doesn't create a virtual cable? And that u may need to use pipewire? If anyone knows better how to set this up, I would appreciate it a lot ^^

tender hedge Apr 10, 2026, 2:25 PM

#

can you give me the link for download the realtime voice changer

untold marten Apr 10, 2026, 2:28 PM

#

well it's in the docs, literally. Also Sapphire just gave the link for docs above

tender hedge Apr 10, 2026, 2:38 PM

#

uhm can you teach me how to download it

nocturne mural Apr 10, 2026, 3:00 PM

#

tender hedge can you give me the link for download the realtime voice changer

-rt

patent trellisBOT Apr 10, 2026, 3:00 PM

#

nocturne mural -rt

🔊 Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Tg-Develop Fork and extra features, but it supports only Nvidia GPUs on Windows 10/11, unlike other options that have wider support, and it has no cloud options.

Local Guide

• Wokada Tg-Develop Fork

A personal fork (modified version) of Wokada Deiteris Fork. It just adds some quality-of-life improvements, like support for Spin Embedder and Audio Effects. Don't expect too much from it, since the creator originally made it as a personal project.

• Applio Realtime

A Realtime Voice Changer with similar performance to Vonovox & Wokada Tg-Develop Fork, with extra features.

Local Guide

• Wokada Deiteris Fork

Deiteris' fork (modified version) of wokada that doesn't get updates anymore.

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested; the older versions shown in YouTube tutorials are even worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

nocturne mural Apr 10, 2026, 3:00 PM

#

cat_doom

untold marten Apr 10, 2026, 3:11 PM

#

GPU: rtx4070 ti super 16GB vram
OS: Fedora KDE Plasma 43
What I am trying to do: I installed the wokada TG-develop fork and it works with the model. I want to send the output of the fork to another program like discord or anything else.
The Issue: when opening the web interface, audio processing is locked onto server (the first time it was working fine, but after I tried to create a virtual cable with pipewire it went haywire) and I cannot start the server and run the voice changer at all. Also, I set the Sample Rate to 48000 Hz but it gives errors and changes to 44100 while saying the input/output/monitor supports only 48000 Hz... I don't get what is wrong here... And I tried 2 browsers: firefox & opera gx + tried the troubleshooting from the TG fork about this issue with audio processing locked onto server. If only the server was working, at least...
The tutorial link: The link from the docs on the TG-develop fork (Realtime Voice Changer > Local > TG Develop's)

sharp jungle Apr 10, 2026, 3:29 PM

#

sage flume is it open-sourced model?

Yes, many of them are, but on mvsep u can just use those models directly and for free... use UVR5 if u want to run them locally

toxic perch Apr 10, 2026, 3:42 PM

#

wall of shame asap

#

<@&1159293204038955078>

low shard Apr 10, 2026, 3:56 PM

#

tender hedge can you give me the link for download the realtime voice changer

This is a General AI Discord Server, elaborate:

your pc gpu
your pc os
what are you trying to do: LLMs, AI Covers, TTS, E Girl Trolling / Catfishing or Roleplay
the tutorial link used

viral mason Apr 10, 2026, 4:37 PM

#

tawny bane hey are EaseUS Voicewave and Voice ai good?

Idk what the first one is, but voice.ai is bad and it's paid. If you have an AMD gpu use Wokada Tg fork, if you have Nvidia use Vonovox

viral mason Apr 10, 2026, 4:38 PM

#

severe fiber is it the same delay orrr

It depends on what you have the chunk size set to, like always
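A rough sketch of why chunk size matters for delay: the changer has to collect a full chunk of audio before it can convert it, so the buffering part of the delay grows with the chunk. This assumes the chunk size is expressed in samples, which may not match the units either program shows in its UI, and it ignores model inference time and any extra buffering, so treat the result as a lower bound. `chunk_latency_ms` is a hypothetical helper, not a function from Vonovox or W-Okada.

```python
def chunk_latency_ms(chunk_samples: int, sample_rate_hz: int = 48000) -> float:
    """Time needed to fill one chunk of audio, in milliseconds."""
    if chunk_samples <= 0 or sample_rate_hz <= 0:
        raise ValueError("chunk size and sample rate must be positive")
    # Multiply first so round numbers stay exact in floating point.
    return chunk_samples * 1000.0 / sample_rate_hz


# Example chunk sizes (assumed to be in samples) at 48000 Hz.
for chunk in (2048, 4096, 8192):
    print(chunk, "samples ->", round(chunk_latency_ms(chunk), 1), "ms")
```

Doubling the chunk size doubles this part of the delay, so the same setting should give a similar buffering delay in either program if the units match.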

viral mason Apr 10, 2026, 4:39 PM

#

untold marten **GPU:** rtx4070 ti super 16GB vram **OS:** Fedora KDE Plasma 43 **What I am try...

What is this fedora KDE thing? Do you not use Windows?

untold marten Apr 10, 2026, 4:40 PM

#

viral mason What is this fedora KDE thing? Do you not use Windows?

it's a linux distro. I did use windows before and, well, it was working fine

viral mason Apr 10, 2026, 4:41 PM

#

Anyways since you have a 4070 use Vonovox, idk how well it works on Linux but it's worth a try

https://huggingface.co/dr87/vonovox/resolve/main/Vonovox_beta_17_11.zip

https://software.muzychenko.net/freeware/vac470lite.zip

#

First download is the voice changer, second one is a virtual audio cable that connects it to both discord and any game you play using it

untold marten Apr 10, 2026, 4:42 PM

#

I've seen vonovox and wish I could have tried it, but from what I understood in the requirements or "pros/cons", it works only on nvidia gpus with windows

viral mason Apr 10, 2026, 4:42 PM

#

untold marten it's a linux distro. I did use before windows and well it was working fine

Ngl I thought it was from Mars lol

viral mason Apr 10, 2026, 4:43 PM

#

untold marten I've seen vonovox and wish I could have tried but from what I understood in requ...

You could try, but I'm unsure if it specifically only works on windows, I just know it's Nvidia only

#

In that case tho you'll need to switch to Wokada tg fork, idk which one is the Nvidia Linux version tho

arctic musk Apr 10, 2026, 4:43 PM

#

I would like to know if there's a good voice changer, and also a good woman's voice, as I run a VTTRPG and would like to stop hurting my throat by using a voice changer instead. I already have some male voices that I like, but I don't feel VCC from the Okada branch is working well for me. I used it for a while but was not able to configure a good output for it. I have an NVIDIA RTX 4060 with 6 GB, 16 GB of RAM, and an Intel Core i9 14900HX. I'm running Win 11.

untold marten Apr 10, 2026, 4:44 PM

#

viral mason You could try but I'm unsure if it specific only works for windows, I just know ...

will try

viral mason Apr 10, 2026, 4:44 PM

#

arctic musk I would like to know if there's a good voice changer, also if there's a good wom...

Try Vonovox!

https://huggingface.co/dr87/vonovox/resolve/main/Vonovox_beta_17_11.zip

https://software.muzychenko.net/freeware/vac470lite.zip

viral mason Apr 10, 2026, 4:44 PM

#

untold marten will try

Lemme know if there's any issues ^^

hardy yew Apr 10, 2026, 4:45 PM

#

untold marten I've seen vonovox and wish I could have tried but from what I understood in requ...

yeah, vonovox is windows-only

untold marten Apr 10, 2026, 4:46 PM

#

misc_sob

hardy yew Apr 10, 2026, 4:46 PM

#

the way the interface jumps from 48000 to 44100 is normal in the Windows release of tg too, IDK why it happens but it has never caused any issue for me so I didn't really care

arctic musk Apr 10, 2026, 4:47 PM

#

viral mason Try Vonovox! https://huggingface.co/dr87/vonovox/resolve/main/Vonovox_beta_17_1...

Second link is a lite version?

untold marten Apr 10, 2026, 4:47 PM

#

I have no idea why...

hardy yew Apr 10, 2026, 4:47 PM

#

sadly can't help with the virtual cable issue, I haven't ever run w-okada on Linux so I lack experience here

untold marten Apr 10, 2026, 4:47 PM

#

also wanted to ask, how would I install that VAC470lite on linux as it's, from what I can see, a windows installer :))

#

I mean didn't try yet with wine so not sure how it works

untold marten Apr 10, 2026, 4:48 PM

#

hardy yew sadly can't help with the virtual cable issue, I haven't ever run w-okada on Lin...

yeah it's ok.

hardy yew Apr 10, 2026, 4:48 PM

#

Doubt it would work via Wine tbh

#

I would just search for a Linux alternative

arctic musk Apr 10, 2026, 4:48 PM

#

untold marten I mean didn't try yet with wine so not sure how it works

I was going to suggest wine, but I'm not sure it would work as it's a "virtual machine"

untold marten Apr 10, 2026, 4:48 PM

#

I tried to make a virtual cable with pipewire but seemed to break client audio processing

viral mason Apr 10, 2026, 4:48 PM

#

arctic musk Second link is a lite version?

It's a virtual audio cable that connects the voice changer to games and discord etc

arctic musk Apr 10, 2026, 4:49 PM

#

viral mason It's a virtual audio cable that connects the voice changer to games and discord ...

OH its virtual audio cable, I have it already installed. Thanks!

viral mason Apr 10, 2026, 4:49 PM

#

Do you have VB cable or the one I sent? The one I sent you is recommended over VB cable

#

VB causes odd issues on windows sometimes

#

This one doesn't

arctic musk Apr 10, 2026, 4:50 PM

#

It's the same, Just checked the one I have already installed

viral mason Apr 10, 2026, 4:50 PM

#

Ah

arctic musk Apr 10, 2026, 4:50 PM

#

Yea, VB caused a lot of issues for me

#

I will give vonovox a try! Also, where can I get pre trained models? I'd prefer them in Spanish but I think I can work around with English ones XD

viral mason Apr 10, 2026, 4:54 PM

#

arctic musk I will give vonovox a try! Also, where can I get pre trained models? I'd preffer...

Right here good sir
https://discord.com/channels/1159260121998827560/1175430844685484042

#

And also here: https://voice-models.com

Voice Models

Voice Models: Over 27,900+ Unique AI RVC Models

arctic musk Apr 10, 2026, 4:55 PM

#

Thanks for the help!

viral mason Apr 10, 2026, 4:55 PM

#

You're welcome!

#

If you need hell or have questions just ask me or a helper ^^

arctic musk Apr 10, 2026, 4:56 PM

#

hell? cat_doom XD

viral mason Apr 10, 2026, 4:57 PM

#

My bad

#

Typo plus still waking up

#

If you need help

#

Lol

arctic musk Apr 10, 2026, 5:08 PM

#

God, this sounds a lot better tan VCC anime_pray

viral mason Apr 10, 2026, 5:12 PM

#

arctic musk God, this sounds a lot better tan VCC anime_pray

Never heard of Tan before but I'd think it would lol

arctic musk Apr 10, 2026, 5:13 PM

#

viral mason Never heard of Tan before but I'd think it would lol

Voice Changer from okada branch, I was using that one but, this is better XD

viral mason Apr 10, 2026, 5:21 PM

#

<@&1159293140440723499> weird account

viral mason Apr 10, 2026, 5:21 PM

#

arctic musk Voice Changer from okada branch, I was using that one but, this is better XD

I only know of the original wokada, Wokada deiteris fork, Wokada tg fork, and Vonovox

#

As well as Applio real-time

#

That's somewhat newer like Vonovox

arctic musk Apr 10, 2026, 5:26 PM

#

I see, might take a look at that one too

marsh galleon Apr 10, 2026, 7:31 PM

#

hi i got a nvidia gpu 5090 does anyone have the fork okada i used this one before but lost it

viral mason Apr 10, 2026, 7:34 PM

#

marsh galleon hi i got a nvida gpu 5090 does anyone have the fork okada i used this one be...

you should use Vonovox

#

what are you planning on using it for btw just curious

marsh galleon Apr 10, 2026, 7:36 PM

#

hanging with friends i like using solo leveling voices and sh

#

they sound so realistic i used vonovox but its just not like tgfork

viral mason Apr 10, 2026, 7:40 PM

#

marsh galleon hanging with freinds i like using solo leveling voices and sh

cool!

#

I'll get u the downloads rq

#

I don't understand how people lose stuff like this, do you randomly delete it or what

marsh galleon Apr 10, 2026, 7:41 PM

#

nah i needed to reset my pc aand sh i had to many files

#

sorry for taking up ur time to get the files im really thankful tho!

viral mason Apr 10, 2026, 7:42 PM

#

https://huggingface.co/dr87/vonovox/resolve/c8034f5f6d50648a8109bb4f847182362e2b779b/Vonovox_beta_17_11.zip

https://software.muzychenko.net/freeware/vac470lite.zip

#

here ya go

viral mason Apr 10, 2026, 7:43 PM

#

marsh galleon nah i needed to reset my pc aand sh i had to many files

I could never, everything I have on my pc is too important to losee

marsh galleon Apr 10, 2026, 7:43 PM

#

sirrr

#

i have vonovox im asking for tg fork aha sorry for the confusion

viral mason Apr 10, 2026, 7:44 PM

#

ohh

#

how come?

#

vonovox gives much better quality and is better in general

marsh galleon Apr 10, 2026, 7:45 PM

#

i had that one before, its way easier. vonovox is so confusing to me

viral mason Apr 10, 2026, 7:45 PM

#

but the beta is easier than tg fork

viral mason Apr 10, 2026, 7:45 PM

#

viral mason https://huggingface.co/dr87/vonovox/resolve/c8034f5f6d50648a8109bb4f847182362e2b...

all you need to do is change block size around and pitch

#

everything is done for you

marsh galleon Apr 10, 2026, 7:46 PM

#

wait can u show me what fork looks like because im lowk dont know if were talking abt the same thing

viral mason Apr 10, 2026, 7:48 PM

#

this is vonovox

#

this is Wokada tg fork

#

Vonovox isn't complicated at all

#

neither are

#

I wouldn't sacrifice quality just for one to be "easier"

#

that's just me tho

marsh galleon Apr 10, 2026, 7:51 PM

#

oh wait now im weirded out so with wokada it would be on the browser and it be like sounding so nice but ig that must be fan made or smth

viral mason Apr 10, 2026, 7:51 PM

#

?

#

what??

#

I'm confused 💔

marsh galleon Apr 10, 2026, 7:51 PM

#

yea same w me

#

so i had one

#

that looked like okada the normal one

#

but it was on browser

#

sm guy gave it to me

viral mason Apr 10, 2026, 7:52 PM

#

yea wokada tg is on browser too

#

vonovox tho no

marsh galleon Apr 10, 2026, 7:52 PM

#

i just use vonovox thank you fuck me i mst be confusing aha

#

is there a website for voice models

#

??

viral mason Apr 10, 2026, 7:53 PM

#

marsh galleon sm guy gave it to me

most likely it was outdated then if some rando gave it to you

viral mason Apr 10, 2026, 7:53 PM

#

marsh galleon is there a website for voice modles

https://discord.com/channels/1159260121998827560/1175430844685484042

#

there's plenty here but also a site that has them too

marsh galleon Apr 10, 2026, 7:53 PM

#

ohh what siteee

viral mason Apr 10, 2026, 7:53 PM

#

https://voice-models.com

Voice Models

Voice Models: Over 27,900+ Unique AI RVC Models

#

this one!

#

I'd check here first as this place has a lot more quality control over good models

marsh galleon Apr 10, 2026, 7:54 PM

#

thank you alot

#

ur really helpful

#

lowk should be a mod

viral mason Apr 10, 2026, 7:55 PM

#

if I was, the egirl models would have their download links removed to stop dirty scammers, like those weirdos in this channel https://discord.com/channels/1159260121998827560/1420775879759630448

#

people joining just because some random old yt video said they have them here makes me feel some kinda way

grand plinth Apr 10, 2026, 8:18 PM

#

w-okada not working on rtx5060, a little help?

low shard Apr 10, 2026, 8:27 PM

#

grand plinth w-okada not working on rtx5060, a little help?

elaborate:

your pc os
what are you trying to do: TTS, AI Covers, E Girl Trolling / Catfishing or Roleplay
the tutorial link

grand plinth Apr 10, 2026, 8:29 PM

#

low shard elaborate: - your pc os - what are you trying to do: TTS, AI Covers, E Girl Trol...

windows 11
real time voice changer (RVC)
tutorial link?

low shard Apr 10, 2026, 8:31 PM

#

grand plinth * windows 11 * real time voice changer (RVC) * tutorial link?

RVC doesn't mean realtime voice changer, it means Retrieval-based-Voice-Conversion.

this is a General AI Discord Server and many people confuse it, that's why I'm asking what you are trying to do, since there are different tools: AI Covers, E Girl Trolling / Catfishing or Roleplay?

Also, did you use any tutorial or download link for whatever you are using right now? What brought you here?

grand plinth Apr 10, 2026, 8:49 PM

#

low shard RVC doesn't mean realtime voice changer, it means Retrieval-based-Voice-Conversi...

i know what RVC is and what it stands for, i've used w-okada before so it isn't my first experience with it. I recently upgraded to a new PC with an RTX 5060 and by the looks of it, its PyTorch hasn't been updated. i'm here because of the second to last option

viral mason Apr 10, 2026, 8:56 PM

#

grand plinth * i know what RVC is and what it stands for, i've used w-okada before so it isn'...

Use Vonovox it's the current best

viral mason Apr 10, 2026, 8:56 PM

#

viral mason https://huggingface.co/dr87/vonovox/resolve/c8034f5f6d50648a8109bb4f847182362e2b...

The downloads you need are here

grand plinth Apr 10, 2026, 8:57 PM

#

viral mason Use Vonovox it's the current best

thanks 💛

viral mason Apr 10, 2026, 8:57 PM

#

You're welcome!

wicked bane Apr 10, 2026, 9:33 PM

#

Anyone know where I can find a feminine but not too feminine voice (femboy)

past rune Apr 10, 2026, 10:16 PM

#

viral mason The downloads you need are here

hi do you know how i can hear myself while using vonovox?

viral mason Apr 10, 2026, 10:42 PM

#

past rune hi do you know how i can hear myself while using vonovox?

I do yes! one second

#

viral mason Apr 10, 2026, 10:43 PM

#

wicked bane Anyone knows where can I find a feminine but not too feminine voice (femboy)

https://tenor.com/view/tole-cat-cute-gif-12080171459357821404

#

why?

#

what's your pc gpu? (Nvidia or AMD) and what do u plan on using it for? just curious ^^

#

better not be with egirl models cat_seriously

#

https://tenor.com/view/cat-eyes-eyeballs-cat-eye-emoji-eye-emoji-cat-looking-gif-9406810929029867845

#

u promise you're gonna use normal stuff like Goku or Darth Vader etc.

#

ok

#

peak

#

https://github.com/tg-develop/voice-changer/releases/download/b2397/voice-changer-windows-amd64-dml.zip

https://software.muzychenko.net/freeware/vac470lite.zip

#

here's the two downloads you need ^^

#

first is the voice changer, second is a virtual audio cable to use it in games etc.

#

nah it's really easy

#

just extract both zip files, for vac lite (the virtual audio cable) just run the file called setup64 and then install driver

#

and for the voice changer run MMVCServerSIO

#

yea no weird setup like the old one

#

and it runs on browser

#

for the first time it could take a bit but it shouldn't take long

#

are you able to send a screenshot?

#

you have your settings wrong

#

input should be mic output should be line 1

#

you're using a different virtual cable but yea should be fine

slim mantle Apr 11, 2026, 12:05 AM

#

Argument #4: Padding size should be less than the corresponding input dimension, but got: padding (0, 26) at dimension 2 of input [1, 128, 6] tf is this 😭

brave quartz Apr 11, 2026, 1:34 AM

#

I see many options, but which is the best free one that can do TTS with an RVC model, the same as Applio?

wild forge Apr 11, 2026, 1:40 AM

#

i am setting up a full off grid property(for when shit hits the fan) and need help with the ai aspect to control (cameras,hydroponics,gates,water) i have been diving into it with ai and they recommend i start with MS-01 but i also want to run 120b models and just would like someone to talk to who knows a little more than me...

shadow cave Apr 11, 2026, 2:49 AM

#

can i get help? every time i run the start this pops up

fleet marsh Apr 11, 2026, 3:01 AM

#

-colab

patent trellisBOT Apr 11, 2026, 3:01 AM

#

fleet marsh -colab

📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**

Google Colab

• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

fleet marsh Apr 11, 2026, 3:52 AM

#

patent trellis

i got this on rvc mainline, what do i do?

#

its the third time this happens

viral mason Apr 11, 2026, 4:01 AM

#

fleet marsh i got this on rvc mainline, what do i do?

that's a bot you're replying to lol

#

you should probably be using Applio btw on Kaggle, google colab kinda stinks

hallow thistle Apr 11, 2026, 4:01 AM

#

fleet marsh i got this on rvc mainline, what do i do?

Why are you using the mainline RVC?

fleet marsh Apr 11, 2026, 4:02 AM

#

viral mason you should probably be using Applio btw on Kaggle, google colab kinda stinks

kaggle?

#

rn i was trying on colab, the link that should send me to the ui didnt work

fleet marsh Apr 11, 2026, 4:03 AM

#

hallow thistle Why are you using the mainline RVC?

i wanted to do some covers

fleet marsh Apr 11, 2026, 4:06 AM

#

fleet marsh rn i was trying on collab, the link that should send me to the ui didnt work

is there any reason for it?

#

narrow coyote Apr 11, 2026, 4:42 AM

#

Anyone able to help me with an ai voice model?

#

Curious what ai voice trainer thingy it is

clear depot Apr 11, 2026, 4:49 AM

#

hello !! I've been using this and it stopped working so I was wondering if there was a new version <3 vcclient_win_cuda_2.1.4-alpha

viral mason Apr 11, 2026, 5:04 AM

#

clear depot hhello !! I've been using this and it stoppped working so I was wondering if the...

Hi! this is a veryyy old version

frigid spindle Apr 11, 2026, 5:04 AM

#

jai

viral mason Apr 11, 2026, 5:04 AM

#

what is your pc gpu (Nvidia or AMD) and what do u plan on using it for

frigid spindle Apr 11, 2026, 5:04 AM

#

hai

viral mason Apr 11, 2026, 5:04 AM

#

frigid spindle hai

https://klipy.com/gifs/yippee-creature-autism-creature-2

clear depot Apr 11, 2026, 5:05 AM

#

viral mason what is your pc gpu (Nvidia or AMD) and waht do u plan on using it for

5070

viral mason Apr 11, 2026, 5:06 AM

#

you should use Vonovox, I have to go very soon so I'll get you the downloads

#

https://huggingface.co/dr87/vonovox/resolve/c8034f5f6d50648a8109bb4f847182362e2b779b/Vonovox_beta_17_11.zip

https://software.muzychenko.net/freeware/vac470lite.zip

#

here ya go

#

first link is for the voice changer second one is a virtual audio cable (it's recommended to use it over vb cable)

clear depot Apr 11, 2026, 5:07 AM

#

yessss I have vb cablee !!

viral mason Apr 11, 2026, 5:08 AM

#

the second link tho is recommended to use instead of vb cable, it does the same thing but sometimes vb cable is buggy for no reason

torn edge Apr 11, 2026, 6:31 AM

#

so im switching out of deiteris fork to a different program

#

which ones the better option performance wise

#

tg-develop fork or vonovox

viral mason Apr 11, 2026, 6:35 AM

#

torn edge tg-develop fork or vonovox

If you have Nvidia use Vonovox, if you have AMD use tg fork

#

Since Vono is Nvidia only

hardy yew Apr 11, 2026, 9:00 AM

#

damn

#

I thought this was a funny joke, repeating after each other

#

but instead it turns out they're all the same bots cat_doom

#

@low shard triple kill here

#

https://tenor.com/view/wawa-cat-wawa-fast-discord-oh-the-misery-cat-gif-25805989

low shard Apr 11, 2026, 10:10 AM

#

viral mason The downloads you need are here

why not suggest the docs btw?

low shard Apr 11, 2026, 10:10 AM

#

wicked bane Anyone knows where can I find a feminine but not too feminine voice (femboy)

are you trying to do e girl / e boy trolling / catfishing?

#

This is a General AI Discord Server and there are many voice changers, elaborate:

your pc gpu
your pc os
what are you trying to do: TTS, AI Covers, E Girl Trolling / Catfishing or Roleplay
the tutorial link used

low shard Apr 11, 2026, 10:12 AM

#

brave quartz I see many but what is the best like free and do tts with rvc model same appolio...

RVC is STS Only, no TTS can natively use RVC models

low shard Apr 11, 2026, 10:13 AM

#

shadow cave can i get help? everytime i run tthe start this pops up

This is a General AI Discord Server and there are many voice changers, elaborate:

your pc gpu
your pc os
what are you trying to do: TTS, AI Covers, E Girl Trolling / Catfishing or Roleplay
the tutorial link used

low shard Apr 11, 2026, 10:13 AM

#

clear depot hhello !! I've been using this and it stoppped working so I was wondering if the...

This is a General AI Discord Server and there are many voice changers, elaborate:

your pc gpu
your pc os
what are you trying to do: TTS, AI Covers, E Girl Trolling / Catfishing or Roleplay
the tutorial link used

low shard Apr 11, 2026, 10:14 AM

#

fleet marsh i got this on rvc mainline, what do i do?

there is no RVC Mainline Cloud port suggested, all are abandoned, I will remove them

low shard Apr 11, 2026, 10:19 AM

#

hardy yew <@911742715019001897> triple kill here

they got already killed

brave quartz Apr 11, 2026, 11:31 AM

#

low shard RVC is STS Only, no TTS can natively use RVC models

So there's no TTS that works the same as Applio, where you have an RVC model and a TTS whose voice gets converted to the RVC model you have?

fallow heron Apr 11, 2026, 11:39 AM

#

is there any free website alternative to weights gg that allows the use of custom rvc models?

ember tapir Apr 11, 2026, 1:18 PM

#

hi, question: what is the group's stance on the limits of AI assistance when it comes to writing in research papers? Is it the provenance of the ideas or the style of writing and prose of the human ideas that are being written?

Basically, where do you see the limits of what AI assistance should not cross?

low shard Apr 11, 2026, 1:39 PM

#

brave quartz So tts can't do sam's appolio when you have rvc model and tts that can convert t...

RVC is Speech To Speech

Applio is an RVC Fork (modified version)

To do "TTS", Applio first uses Edge TTS to make the input audio (the tts model you see), then runs the RVC model over it
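
The two-stage flow described here can be sketched as a plain function composition. This is only an illustration of the ordering: `synthesize_speech` and `convert_voice` are hypothetical placeholders standing in for Edge TTS and the RVC model, not Applio's actual API, and audio is modelled as a simple list of float samples.

```python
from typing import Callable

# Audio is represented as a plain list of float samples, for illustration only.
Audio = list[float]


def tts_with_rvc(
    text: str,
    synthesize_speech: Callable[[str], Audio],  # hypothetical stand-in for Edge TTS
    convert_voice: Callable[[Audio], Audio],    # hypothetical stand-in for the RVC model
) -> Audio:
    """Text -> base TTS voice -> speech-to-speech conversion with the RVC model."""
    base_audio = synthesize_speech(text)   # stage 1: any TTS voice produces speech
    return convert_voice(base_audio)       # stage 2: RVC re-voices that speech
```

The point of the sketch is that the RVC model never sees text: it only converts audio that some other TTS has already produced.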

low shard Apr 11, 2026, 1:39 PM

#

fallow heron is there any free website alternative than weights gg that allows the use of cus...

there's no unlimited free site, it would just go bankrupt

it's better you tell your pc gpu and os

brave quartz Apr 11, 2026, 2:07 PM

#

low shard RVC is Speech To Speech Applio is an RVC Fork (modified version) Applio to do ...

Yeah, my problem is I can't use my model with Applio's speech model because of Microsoft, that's why I want a program that works the same as Applio

carmine palm Apr 11, 2026, 2:09 PM

#

Help me

brittle wing Apr 11, 2026, 2:29 PM

#

-colab

patent trellisBOT Apr 11, 2026, 2:29 PM

#

brittle wing -colab

📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Tg-Develop Fork**

by Tg-Develop
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

left gale Apr 11, 2026, 2:39 PM

#

Hi everyone! Does anyone know how to make endless streams?

finite wind Apr 11, 2026, 2:43 PM

#

hey what do I do with the D and G pretrain thingys from my training?

#

I thought an index and my model pth would be the only result if I'm being honest

low shard Apr 11, 2026, 2:58 PM

#

brave quartz Yeah my problem is I cant use my model with applio speech model bc of Microsoft ...

you can, you just need to use both an edge tts and rvc model

#

there isn't any tts that can use rvc models other than the way i explained

low shard Apr 11, 2026, 2:59 PM

#

carmine palm Help me

This is a General AI Discord Server and there are many voice changers, elaborate:

your pc gpu
your pc os
what are you trying to do: TTS, AI Covers, E Girl Trolling / Catfishing or Roleplay
the tutorial link used

low shard Apr 11, 2026, 2:59 PM

#

finite wind hey what do I do with the D and G pretrain thingys from my training?

those are needed only for pretrains or to continue training, you don't need to include them when posting a normal rvc model

finite wind Apr 11, 2026, 3:00 PM

#

low shard those are needed only for pretrains or to continue training, you don't need to i...

gotcha, I gotta keep them when I want to resume training oki

viral mason Apr 11, 2026, 4:14 PM

#

low shard why not suggest the docs btw?

It's easier to get the people what they need rather than giving them stuff that they'll probably not read

low shard Apr 11, 2026, 4:27 PM

#

viral mason It's easier to get the people what they need rather than giving them stuff that ...

i mean it's like giving them a car without instructing them how to drive

#

the guides are made to be read to understand how a program works, else why even spend so much time making them

viral mason Apr 11, 2026, 4:28 PM

#

Both programs are very easy to use and setup, I should probably tell them though each time how to run each

low shard Apr 11, 2026, 4:29 PM

#

viral mason Both programs are very easy to use and setup, I should probably tell them though...

i mean if you want to manually step by step give them everything everytime, but that's not going to help them when they need an update or want to know what setting does what

viral mason Apr 11, 2026, 4:30 PM

#

Fair, but Vonovox specifically has only 2 settings that ever need to be touched: block size and pitch

#

I guess for Wokada tg fork it's a little bit more, just chunk size and extra time

storm holly Apr 11, 2026, 4:48 PM

#

OH SHIT NEW APPLIO COLAB

craggy brook Apr 11, 2026, 5:09 PM

#

How can we create the sound we want here?

viral mason Apr 11, 2026, 6:02 PM

#

storm holly OH SHIT NEW APPLIO COLAB

just use Kaggle, it's better than colab as it gives 30 hours a week for free

#

colab gives like 4 at max

#

why would you do that, that's weird

#

<@&1159293140440723499>

brave quartz Apr 11, 2026, 6:08 PM

#

low shard you can, you just need to use both an edge tts and rvc model

Can you tell me the way again and how because I'm new to these stuff

fleet marsh Apr 11, 2026, 6:17 PM

#

low shard there is no RVC Mainline Cloud port suggested, all are abandoned, I will remove ...

what about applio, it doesnt work anymore either?

viral mason Apr 11, 2026, 6:18 PM

#

applio works fine

#

I use it everyday

fleet marsh Apr 11, 2026, 6:18 PM

#

viral mason applio works fine

it didnt work yesterday

viral mason Apr 11, 2026, 6:20 PM

#

this is how I use it, I use it on Kaggle

fleet marsh Apr 11, 2026, 6:25 PM

#

viral mason this is how I use it, I use it on Kaggle

whats with the dataset?

#

is it for training?

viral mason Apr 11, 2026, 6:25 PM

#

yea that's for training you don't have to add any datasets

fleet marsh Apr 11, 2026, 6:26 PM

#

oka

#

niceee

viral mason Apr 11, 2026, 6:26 PM

#

to import a model tho should be the same

#

just go to download section then paste the link from huggingface

fleet marsh Apr 11, 2026, 6:27 PM

#

viral mason just go to download section then paste the link from huggingface

like on google colab?

viral mason Apr 11, 2026, 6:27 PM

#

idk how to use Applio on colab

#

but applio's interface is the same on all softwares

#

local, kaggle, colab

#

should be the same

fleet marsh Apr 11, 2026, 6:28 PM

#

o

#

ok

#

btw, are there any new models to train my datasets with? or im ok with Ov2?

#

that was the newest one last time i made one

viral mason Apr 11, 2026, 6:30 PM

#

please don't use OV2

#

it's like

#

bad

fleet marsh Apr 11, 2026, 6:30 PM

#

why?

#

it was really good back then

viral mason Apr 11, 2026, 6:30 PM

#

titan, ov2, Ren3, any of those are super old and bad because they cause harmonic distortions that we didn't know about back when we first used them

fleet marsh Apr 11, 2026, 6:31 PM

#

viral mason titan, ov2, Ren3, any of those are super old and bad because they cause harmonic...

pffffffff

#

dont tell me i have to do some models again? D:

#

like

viral mason Apr 11, 2026, 6:31 PM

#

yea 😭

fleet marsh Apr 11, 2026, 6:31 PM

#

10 of my models use ov2

fleet marsh Apr 11, 2026, 6:31 PM

#

viral mason yea 😭

NOOOO

#

also what harmonic distortions, do u have an example?

viral mason Apr 11, 2026, 6:32 PM

#

use this pretrain it's brand new and honestly for me from testing it's great https://discord.com/channels/1159260121998827560/1492203850747216083

inland pagoda Apr 11, 2026, 6:32 PM

#

Hi! What coding LLM is best for 12 GB VRAM atm?

fleet marsh Apr 11, 2026, 6:33 PM

#

viral mason use this pretrain it's brand new and honestly for me from testing it's great htt...

does kaggle have a limit of how much gpu can i use?

carmine siren Apr 11, 2026, 6:33 PM

#

I am looking for something like the Real-ESRGAN model, is there any alternative for upscaling images?

carmine siren Apr 11, 2026, 6:34 PM

#

fleet marsh does kaggle have a limit of how much gpu can i use?

30 hours per week of combined GPU time

fleet marsh Apr 11, 2026, 6:34 PM

#

carmine siren 30 hours per week of combined GPU time

how much did google colab have?

carmine siren Apr 11, 2026, 6:38 PM

#

fleet marsh how much did google collab had?

Frequently limited to T4 GPUs with ~12-hour max sessions (but often interrupted sooner) and 90 minutes of idle timeout. You may be restricted for days if you abuse resources.

fleet marsh Apr 11, 2026, 6:53 PM

#

what number of gpu should i be putting here?

white pasture Apr 11, 2026, 7:02 PM

#

Why won’t it let me generate, bruh

#

It won’t lemme send pic wth

#

I’m trying to generate an image yet it says it’s not permitted no matter what I delete

white pasture Apr 11, 2026, 7:26 PM

#

I’m trying to make an image of Maxie and Mega who are two Pokemon characters

#

Gives me this “Content that violates our community guidelines was detected in your generation. Your gems have been refunded. Please try again with different parameters.”

#

Even though there’s no nsfw

#

I just put “Maxie from pokemon ORAS with a younger guy with black hair, red eyes, fluffy black collar, black and red shirt uniform, red cape”
Ain’t nothing wrong with that

low shard Apr 11, 2026, 7:29 PM

#

brave quartz Can you tell me the way again and how because I'm new to these stuff

If you explicitly want to use RVC models, use Applio

If you actually want better TTS, try other TTS programs

RVC isn't the best for TTS

low shard Apr 11, 2026, 7:30 PM

#

storm holly OH SHIT NEW APPLIO COLAB

huh

storm holly Apr 11, 2026, 7:30 PM

#

I wasn't here for a couple months

low shard Apr 11, 2026, 7:30 PM

#

fleet marsh what about applio, it doesnt work anymore either?

Applio Colab works fine

carmine siren Apr 11, 2026, 7:56 PM

#

The VibeVoice TTS model, which is developed by Microsoft, is one of the best.

abstract comet Apr 11, 2026, 8:28 PM

#

storm holly OH SHIT NEW APPLIO COLAB

where

storm holly Apr 11, 2026, 8:29 PM

#

abstract comet where

https://colab.research.google.com/github/iahispano/applio/blob/master/assets/Applio.ipynb

Google Colab

abstract comet Apr 11, 2026, 8:29 PM

#

storm holly https://colab.research.google.com/github/iahispano/applio/blob/master/assets/App...

what's been updated?

viral mason Apr 11, 2026, 8:29 PM

#

just use kaggle for applio 💔

wanton parrot Apr 11, 2026, 8:49 PM

#

I use a dual nvidia gpu setup, is it possible to make Vonovox use a specific gpu?

low shard Apr 11, 2026, 8:50 PM

#

wanton parrot I use a dual nvidia gpu setup, is it possible to make vovonox use a specific gpu...

https://docs.aihub.gg/realtime-voice-changer/local/vonovox/#opening-on-multi-gpu-systems

Vonovox

Last update: March 30, 2026

wanton parrot Apr 11, 2026, 8:52 PM

#

Thanks!

viral mason Apr 11, 2026, 10:08 PM

#

what are you talking about? you should specify

#

I cannot call at the moment sorry

#

whatever you have is outdated then

#

what is your pc gpu (Nvidia or AMD) and what are you using the voice changer for?

#

super outdated yea

#

right here. first link is for the voice changer second one is a virtual audio cable which will connect the voice changer to games and discord
https://huggingface.co/dr87/vonovox/resolve/c8034f5f6d50648a8109bb4f847182362e2b779b/Vonovox_beta_17_11.zip

https://software.muzychenko.net/freeware/vac470lite.zip

viral mason Apr 11, 2026, 11:53 PM

#

No, just run setup64 for the virtual audio cable then install driver
And for Vonovox just run setup

#

Any yt tutorials are outdated for voice changers

#

Sure

viral mason Apr 11, 2026, 11:55 PM

#

viral mason No, just run setup64 for the virtual audio cable then install driver And for Von...

It's not that difficult, just follow what I said here

#

Extract both after download and run the files I said there

#

Pitch at 0 works for most models if you're a guy, but if you're using a female voice pitch it up some until it sounds right

#

I personally have my block size at 0.50 but it works well at 0.30 which is default

#

It's alright

#

Excuse me?

#

I don't do that

#

<@&1159293140440723499> weirdo

#

I won't be helping you further

narrow coyote Apr 12, 2026, 12:57 AM

#

what ai is ran on terminal?

feral saffron Apr 12, 2026, 1:03 AM

#

i cant run the start_http.bat file for the voice changer, any tips

viral mason Apr 12, 2026, 1:09 AM

#

feral saffron i cant run the start_http.bat file for the voice changer, any tips

did you get it from a yt tutorial?

small violet Apr 12, 2026, 1:24 AM

#

anyone know why, after following the audio cable and mic steps for input and output, it won't work? Like on roblox i cant hear anything through my mic?

misty marlin Apr 12, 2026, 1:57 AM

#

why the hell did the creator change it like this now i cant choose an index

#

nvm its text to speech

misty marlin Apr 12, 2026, 2:28 AM

#

the ai vc client doesnt work at all it doesnt make any sounds i checked all configs

#

the old versions worked fine but i dont wanna use old version i want new ones

#

im on win_ cuda 2.1.4 alpha

#

i have amd cpu

#

and nvidia graphic card

#

i am trying to speak

#

well

#

yea

viral mason Apr 12, 2026, 3:29 AM

#

misty marlin and nvidia graphic card

use Vonovox, what were you using the old voice changer for btw, just curious

swift thunder Apr 12, 2026, 4:02 AM

#

Does anyone here use Kaggle who can help me? I trained a model, everything was going perfectly, 275/300, then it started throwing errors and stopped training, and everything started throwing errors.

#

this?

ember nymph Apr 12, 2026, 4:25 AM

#

yo

tired aspen Apr 12, 2026, 4:59 AM

#

so im tryna set up the vcclient from w-okada after a while of not using it [i had deleted it] and im on a new version trying to set it up with voicemod, i genuinely cant figure it out. i already have the cable stuff and whatnot

abstract comet Apr 12, 2026, 5:08 AM

#

SOMEONE PUT FLOWMATCHING INTO RVC

#

PLEASE

viral mason Apr 12, 2026, 5:18 AM

#

tired aspen so im tryna set up the vcclient from w-okada after a while of not using it [i ha...

that's really old, if u have an Nvidia gpu u should swap to Vonovox but if you have AMD u should use Wokada tg fork

#

what's ur pc gpu?

#

I'll get u the download but I have to leave soon

tired aspen Apr 12, 2026, 5:18 AM

#

nvidia

#

thinks its a 3060

viral mason Apr 12, 2026, 5:19 AM

#

@swift thunder idk how to fix your error but look at this short tutorial I made in case you did something wrong

viral mason Apr 12, 2026, 5:20 AM

#

tired aspen nvidia

alright! first link is for the voice changer second one is a virtual audio cable which will connect the voice changer to games and discord
https://huggingface.co/dr87/vonovox/resolve/c8034f5f6d50648a8109bb4f847182362e2b779b/Vonovox_beta_17_11.zip

https://software.muzychenko.net/freeware/vac470lite.zip

tired aspen Apr 12, 2026, 5:20 AM

#

pretty sure i already have the cable stuff

#

unless it got a few updates since ive downloaded it

viral mason Apr 12, 2026, 5:22 AM

#

are you using Vac lite or VB cable, they're two different softwares but do the same thing just VB cable causes issues sometimes

tired aspen Apr 12, 2026, 5:22 AM

#

i honestly dont know, it just says cable, and its been a few years since ive downloaded it

#

nevermind its vb

viral mason Apr 12, 2026, 5:23 AM

#

I'd recommend the one I sent then just in case

tired aspen Apr 12, 2026, 5:23 AM

#

yeahhh

#

k both folders are done downloading

#

do i unzip both and install the new cable?

viral mason Apr 12, 2026, 5:25 AM

#

Yep

#

For vac lite just run setup64 (not as admin)

#

And most likely you won't need to restart your pc either

tired aspen Apr 12, 2026, 5:25 AM

#

oo

#

in that case should i try to find a way to delete vb cable?

viral mason Apr 12, 2026, 5:27 AM

#

If you like you can uninstall it the same way you installed it

#

But you don't have to

tired aspen Apr 12, 2026, 5:27 AM

#

ah

#

ok i have what i need i think, how do i set it up with voicemod now?

viral mason Apr 12, 2026, 5:51 AM

#

tired aspen ok i have what i need i think, how od i set it up with voicemod now?

Same setup just using the other cable

#

What are you using?

#

There are no default voice models that come with it, whatever you're using is outdated

#

What's your PC gpu? (Nvidia or AMD)

#

And what do you want to do with the voice changer, just curious

viral mason Apr 12, 2026, 6:16 AM

#

Wdym by this?

#

joe_weird

edgy quiver Apr 12, 2026, 7:26 AM

#

I need help T-T, i am looking for some good male voice models for realtime, do u guys know any good ones

finite wind Apr 12, 2026, 8:09 AM

#

man, the hard truth: clipping audios on my own into 1 to 10 second pieces wasn't giving me good results at all

#

same goes for letting applio cut audios on its own 😔

#

can def say og 48k pretrain is giving noticeably worse results than legacy 1.5 48k pretrain

#

are there any documents on what needs to be avoided or kept when I isolate a dataset?

#

like, for example, regardless of whether it's true or not

#

if audio utilizes stereo heavily (sound from left or right), you should turn it into mono (just an example not confirmed to be true)

#

if you can cut audios on your own from 1 to 5 seconds(RVC limitation), it's better to cut on your own to make better quality dataset (just an example not confirmed to be true)

#

why are there so few to no documents on how you SHOULD process a dataset?

hardy yew Apr 12, 2026, 8:22 AM

#

finite wind if audio utilizes stereo heavily (sound from left or right), you should turn it ...

RVC works with mono audio only

finite wind Apr 12, 2026, 8:22 AM

#

WHY IT WASN'T ON THE AIHUB DOC

#

AHHHHHHHHH

#

MY 8 HOURS

hardy yew Apr 12, 2026, 8:24 AM

#

Your stereo data was simply downmixed to mono at preprocessing step xd

finite wind Apr 12, 2026, 8:25 AM

#

yeah....

hardy yew Apr 12, 2026, 8:25 AM

#

I thought there was a warning about this but maybe not

finite wind Apr 12, 2026, 8:25 AM

#

I was wondering why I got irregular volume sometimes with my model

#

apparently model also studied the lowest volume part of the audio when the music was coming from only left or right

hardy yew Apr 12, 2026, 8:26 AM

#

It just takes an average of both channels
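
A minimal sketch of the downmix described here, assuming a plain per-sample average of left and right (the actual preprocessing code may differ). It also shows why hard-panned audio can come out quieter: when one channel is silent, the average is half the original level.

```python
def downmix_to_mono(stereo_frames):
    """Average the left and right sample of each frame into one mono sample."""
    return [(left + right) / 2 for left, right in stereo_frames]


# A voice panned fully to the left: the right channel is silent.
panned_left = [(0.8, 0.0), (-0.6, 0.0), (0.4, 0.0)]
print(downmix_to_mono(panned_left))  # [0.4, -0.3, 0.2] -> half the original level

# The same voice centered: both channels carry it, so the level is preserved.
centered = [(0.8, 0.8), (-0.6, -0.6), (0.4, 0.4)]
print(downmix_to_mono(centered))  # [0.8, -0.6, 0.4]
```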

#

What about normalization?

finite wind Apr 12, 2026, 8:27 AM

#

I let applio do the normalization but

#

wasn't able to tell huge difference on my own when I heard the dataset after the auto normalization from applio

hardy yew Apr 12, 2026, 8:28 AM

#

There's also some debate on pre vs post normalization

#

If your data has noisy silence removed then post should be quite good

finite wind Apr 12, 2026, 8:29 AM

#

afaik, pre is normalization before cutting and post is after cutting

hardy yew Apr 12, 2026, 8:29 AM

#

Yeah

finite wind Apr 12, 2026, 8:29 AM

#

I just see no reason to go for pre

#

unless your dataset is suffering from low quality audio issues

hardy yew Apr 12, 2026, 8:30 AM

#

If there was lots of dirty silence in your dataset, post would blow it up

#

But yeah, other than that it's seemingly better
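
A toy comparison of the two orders, assuming simple peak normalization and fixed-length slicing (both are assumptions made for illustration, not Applio's actual implementation). With "pre" a single gain is applied to the whole recording, so faint noise stays faint; with "post" each slice gets its own gain, so a slice containing only dirty silence is boosted to full scale.

```python
def peak_normalize(samples, target_peak=1.0):
    """Scale samples so the loudest one reaches target_peak (pure silence is left alone)."""
    peak = max((abs(s) for s in samples), default=0.0)
    if peak == 0.0:
        return list(samples)
    gain = target_peak / peak
    return [s * gain for s in samples]


def slice_fixed(samples, slice_len):
    """Cut the recording into consecutive slices of slice_len samples."""
    return [samples[i:i + slice_len] for i in range(0, len(samples), slice_len)]


def pre_normalize(samples, slice_len):
    """'Pre': normalize the whole recording once, then cut it."""
    return slice_fixed(peak_normalize(samples), slice_len)


def post_normalize(samples, slice_len):
    """'Post': cut first, then normalize every slice on its own."""
    return [peak_normalize(chunk) for chunk in slice_fixed(samples, slice_len)]


# Toy recording: one slice of speech followed by one slice of faint noise.
recording = [0.5, -0.25, 0.01, -0.01]
print(pre_normalize(recording, 2)[1])   # noise slice stays quiet
print(post_normalize(recording, 2)[1])  # noise slice is boosted to full scale
```

If the noisy silence is replaced with true zeros first, the all-zero slice has no peak to scale, which matches the point that post normalization works well on data with dirty silence removed.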

finite wind Apr 12, 2026, 8:32 AM

#

idk how you deal with the brief silences between lyrics or speech though

#

like 0.1 to 0.4 seconds dirty silences

#

someone who knows def should put it on the docs

modern hornet Apr 12, 2026, 8:32 AM

#

why did the mmvc file stop opening for the voice thing it was opening yesterday now gotta reinstall

hardy yew Apr 12, 2026, 8:33 AM

#

a) ignore them and deal with it
b) manual cutting
c) smartcutter (my go-to)

#

Though I mostly train with video game voiceover. It's clean out of the box.

finite wind Apr 12, 2026, 8:34 AM

#

see I do either
a) manually cutting them to close the silence gap
b) completely silence that dirty silence part without closing the gap

patent plover Apr 12, 2026, 8:34 AM

#

hello

finite wind Apr 12, 2026, 8:34 AM

#

but I can't tell which is better or should be avoided

patent plover Apr 12, 2026, 8:34 AM

#

i have problems with mmvcservice

#

it doesn't work :,v

#

and i have all

#

the VB- virtual cable input, input 16inch and output

hardy yew Apr 12, 2026, 8:35 AM

#

finite wind see I do either a) manually cutting them to close the silence gap b) completely ...

Ideally replace dirty silence with pure zeros and leave just a bit of silence between words/sentences (e.g. 0.1s)
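
One possible way to do this, sketched with a simple amplitude gate. The gate, the `threshold` value and the function name are assumptions for illustration only; this is not how smartcutter or Applio work, and real noise usually needs something smarter than a fixed threshold.

```python
def clean_silence(samples, sample_rate, threshold=0.02, min_silence_s=0.1, keep_s=0.1):
    """Replace long quiet runs with pure zeros, shortened to at most keep_s seconds."""
    min_len = round(min_silence_s * sample_rate)  # shortest run treated as silence
    keep_len = round(keep_s * sample_rate)        # silence left between words
    cleaned = []
    i, n = 0, len(samples)
    while i < n:
        if abs(samples[i]) < threshold:
            # Find the end of this quiet run.
            j = i
            while j < n and abs(samples[j]) < threshold:
                j += 1
            run_len = j - i
            if run_len >= min_len:
                # Long quiet run: treat it as dirty silence and emit digital zeros.
                cleaned.extend([0.0] * min(run_len, keep_len))
            else:
                # Short quiet run: likely part of the waveform itself, keep it as is.
                cleaned.extend(samples[i:j])
            i = j
        else:
            cleaned.append(samples[i])
            i += 1
    return cleaned


# Toy example at 20 samples per second, so 0.1 s equals 2 samples.
noisy = [0.5, -0.4, 0.01, -0.01, 0.01, 0.0, 0.01, 0.3, 0.01, -0.3]
print(clean_silence(noisy, sample_rate=20))
# [0.5, -0.4, 0.0, 0.0, 0.3, 0.01, -0.3]
```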

patent plover Apr 12, 2026, 8:36 AM

#

from one day to the next it stopped working

hardy yew Apr 12, 2026, 8:36 AM

#

How you're gonna do that is a separate thing

patent plover Apr 12, 2026, 8:36 AM

#

i reinstalled but it doesn't work :,vv

finite wind Apr 12, 2026, 8:36 AM

#

hardy yew Ideally replace dirty silence with pure zeros and leave just a bit of silence be...

thank you so much for the clear answer

latent kraken Apr 12, 2026, 8:36 AM

#

anyone wanna help me make a AI voice website

I can't pay anyone but I kinda wanna see how hellish this could be

I don't know how tf to do anything ;-;

finite wind Apr 12, 2026, 8:36 AM

#

really gotta put it on AIhub documents though

hardy yew Apr 12, 2026, 8:37 AM

#

I think the main problem is there's lots of uncertainty around dataset preparation

#

Lots of aspects for which people have different approaches

finite wind Apr 12, 2026, 8:38 AM

#

at least the ones that are generally good to do should be listed up on the doc

hardy yew Apr 12, 2026, 8:38 AM

#

So it's hard to tell "this is the way. This is the only and right way"

hardy yew Apr 12, 2026, 8:38 AM

#

finite wind at least the ones that are generally good to do should be listed up on the doc

Perhaps, yeah

finite wind Apr 12, 2026, 8:38 AM

#

that should be better than having nothing

#

the info of something generally good + the reason why it's generally good = an easy step for anyone, can logically think from there to guess and try some better ways to do things

#

rather than shooting themselves in the foot

patent plover Apr 12, 2026, 8:40 AM

#

cat_doom

hardy yew Apr 12, 2026, 8:41 AM

#

Definitely, agree

patent plover Apr 12, 2026, 8:41 AM

#

:c

hardy yew Apr 12, 2026, 8:42 AM

#

patent plover i reinstall but dont works :,vv

It just doesn't open or what happens?

#

This is weird

#

Especially that another person above just had the same issue

finite wind Apr 12, 2026, 8:43 AM

#

just one favor to ask, can you share a screenshot of any of your processed audio file because I want to see how you processed it?

#

preferably with spectrum

#

like this is a random pic from online but I usually edit out these clicking or tearing parts of the spectrum to process but NOTHING else because I have no info for anything else

#

just the pure silencing out part like you and I discussed a little bit earlier

hardy yew Apr 12, 2026, 8:46 AM

#

this is one of my datasets (from a game, too)

#

other than concatenating all of it and silence truncation i didn't do much here

hardy yew Apr 12, 2026, 8:47 AM

#

finite wind like this is a random pic from online but I usually edit out these clicking or t...

i certainly don't do any precise adjustments like that (although they can sure be beneficial, depending on the dataset and things you adjust)

finite wind Apr 12, 2026, 8:48 AM

#

yeahhhh

#

yours looks vastly different from mine I think I got a better idea now

#

thanks again

hardy yew Apr 12, 2026, 8:49 AM

#

they don't always look that similar, I guess various timbre might turn out quite different

#

though obviously clean human voice will have lots of shared properties in the spectrograms

finite wind Apr 12, 2026, 8:49 AM

#

this is from the very first dataset I did, I think that higher frequency parts needs to be cleared out? or is it just only in your screenshot case idk

#

some dirty silence parts can be seen in here too

hardy yew Apr 12, 2026, 8:50 AM

#

it might also be my spectrogram settings TBH, not exposing too much dirt in the highs

finite wind Apr 12, 2026, 8:50 AM

#

only major difference is just I added silences in between sentences

hardy yew Apr 12, 2026, 8:50 AM

#

those settings are what I mainly use when looking at the harmonics

finite wind Apr 12, 2026, 8:51 AM

#

ahhh

hardy yew Apr 12, 2026, 8:51 AM

#

#

this is how the same data looks with default amplitude range

finite wind Apr 12, 2026, 8:52 AM

#

maybe mine was showing 48k spectrum I think

#

I figure because yours is showing til only 15k but it's just my guess

hardy yew Apr 12, 2026, 8:53 AM

#

my data is 32 kHz so the spectrum can only go up to 16 kHz (half the sample rate), hence the range

#

in your case it can go up to 24k
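
The ceilings mentioned here follow from the Nyquist limit: a spectrogram can only show content up to half the sample rate. A minimal sketch of that arithmetic, where `nyquist_hz` is a hypothetical helper name and not something from Applio or RVC:

```python
def nyquist_hz(sample_rate_hz: int) -> float:
    """Highest frequency a signal sampled at sample_rate_hz can represent."""
    return sample_rate_hz / 2


# The two sample rates discussed in the chat: 32 kHz and 48 kHz.
for sr in (32_000, 48_000):
    print(f"{sr} Hz sample rate -> spectrum up to {nyquist_hz(sr):.0f} Hz")
```

So two spectrograms of equally clean audio can look different simply because one file was resampled to 32 kHz and the other was left at 48 kHz.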

finite wind Apr 12, 2026, 8:55 AM

#

yeppp

#

oh and just one more thing before I go

#

I let applio cut my one long audio file for a test

#

and it chopped some of the last parts from the first sentence and then put it into the second sentence's first part

#

is that a problem or not at all?

#

or abruptly cutting it mid sentence?

hardy yew Apr 12, 2026, 9:06 AM

#

finite wind is that a problem or not at all?

good question

#

TBH not sure how it affects the final model

finite wind Apr 12, 2026, 9:07 AM

#

damn

hardy yew Apr 12, 2026, 9:07 AM

#

Personally I don't mind it and just do use the autoslicing on my concatenated data

#

but

#

slicing phonemes in half

#

definitely can have some negative impact compared to when the samples would simply go from silence, to audio, to silence again

finite wind Apr 12, 2026, 9:07 AM

#

ohhhh

hardy yew Apr 12, 2026, 9:08 AM

#

Haven't ever done any research on this but that's what I would expect

finite wind Apr 12, 2026, 9:08 AM

#

at least it's much better than having no answer

hardy yew Apr 12, 2026, 9:08 AM

#

Whether it has a massive impact or little-to-no impact at all? No idea, maybe someone else knows

finite wind Apr 12, 2026, 9:08 AM

#

I will go for manual cutting from 1 to 5 on my own and see if it helps any better

#

apparently 5 is the max for applio

#

to process while training

hardy yew Apr 12, 2026, 9:09 AM

#

There is one more problem with it though

#

(and i guess the main reason for equal-length 3s clips)

finite wind Apr 12, 2026, 9:09 AM

#

hmm?

hardy yew Apr 12, 2026, 9:10 AM

#

The training pipeline utilizes 3s segments and cuts off the rest. So if you provide it a sample of e.g. 4s, it will still only use the 3s and ignore the 1s.
If you provide a sample of 8s, it will process 2x 3s samples and discard the remaining 2s

#

(or at least that's how I understand it, recently saw a discussion on this)
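
The arithmetic described above can be sketched as follows. This only illustrates the behavior as hardy yew understands it (fixed 3 s segments, remainder dropped); it is not taken from the Applio or RVC source, and `usable_audio` is a hypothetical helper:

```python
def usable_audio(clip_seconds: float, segment_seconds: float = 3.0):
    """Return (full_segments, discarded_seconds) for one clip.

    Assumes the pipeline keeps only whole segments of segment_seconds
    and throws away whatever is left over, as described in the chat.
    """
    full_segments = int(clip_seconds // segment_seconds)
    discarded = clip_seconds - full_segments * segment_seconds
    return full_segments, discarded


for length in (4.0, 8.0):
    segments, lost = usable_audio(length)
    print(f"{length:.0f}s clip -> {segments} segment(s), {lost:.0f}s discarded")
```

If that understanding holds, clips whose length is close to a multiple of 3 s would waste the least audio.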

finite wind Apr 12, 2026, 9:11 AM

#

ohhhh

hardy yew Apr 12, 2026, 9:11 AM

#

So eventually the outcome is often similar - cutting words in half and discarding some info

finite wind Apr 12, 2026, 9:11 AM

#

I swear I saw it somewhere in this discord that applio can process up to 5 sec hmm

#

gotta go for 3s to be safe I guess

hardy yew Apr 12, 2026, 9:11 AM

#

This needs further verification I suppose 🤔

#

Don't want to state i'm 100% sure of something when i'm not

finite wind Apr 12, 2026, 9:13 AM

#

gotcha thanks a lot

astral tangle Apr 12, 2026, 9:57 AM

#

hello everyone im new to ai stuff, i wanted to change a voice to another to make some ai song covers, i've tried using RVC but i have an AMD GPU (RX 6700XT) and cant get it to work, could someone help me getting it to work, or maybe guide me towards another ai i could use to change one voice to another? any help would be appreciated.

My specs are:
Rx 6700XT gpu
Windows 10

lone smelt Apr 12, 2026, 10:22 AM

#

hi, anyone know why google colab keeps crashing or disconnecting when i’m generating roblox assets? not sure if it’s a GPU limit thing or what.

shrewd nest Apr 12, 2026, 11:51 AM

#

Hi, anyone know how to create realistic TTS with human nature voices? Like breathing laughing?

thick patrol Apr 12, 2026, 1:32 PM

#

Hello! evening, morning to everyone! im just curious about why my W-Okada starts to slow down and genuinely starts being unresponsive, is it an internet thing?

it only started to act like this after a few seconds tops

low shard Apr 12, 2026, 2:25 PM

#

This is a General AI Discord Server and there are many voice changers, elaborate:

your pc gpu
your pc os
what are you trying to do: TTS, AI Covers, E Girl Trolling / Catfishing or Roleplay
the tutorial link used

low shard Apr 12, 2026, 2:26 PM

#

feral saffron i cant run the start_http.bat file for the voice changer, any tips

This is a General AI Discord Server and there are many voice changers, elaborate:

your pc gpu
your pc os
what are you trying to do: TTS, AI Covers, E Girl Trolling / Catfishing or Roleplay
the tutorial link used

low shard Apr 12, 2026, 2:26 PM

#

small violet anyone know why after follow the audio cable and mic steps for input and output ...

This is a General AI Discord Server and there are many voice changers, elaborate:

your pc gpu
your pc os
what are you trying to do: TTS, AI Covers, E Girl Trolling / Catfishing or Roleplay
the tutorial link used

low shard Apr 12, 2026, 2:27 PM

#

misty marlin im on win_ cuda 2.1.4 alpha

This is a General AI Discord Server and there are many voice changers, elaborate:

your pc gpu
your pc os
what are you trying to do: TTS, AI Covers, E Girl Trolling / Catfishing or Roleplay
the tutorial link used

low shard Apr 12, 2026, 2:27 PM

#

ember nymph yo

do you need help?

low shard Apr 12, 2026, 2:27 PM

#

tired aspen so im tryna set up the vcclient from w-okada after a while of not using it [i ha...

This is a General AI Discord Server and there are many voice changers, elaborate:

your pc gpu
your pc os
what are you trying to do: TTS, AI Covers, E Girl Trolling / Catfishing or Roleplay
the tutorial link used

low shard Apr 12, 2026, 2:27 PM

#

abstract comet SOMEONE PUT FLOWMATCHING INTO RVC

what

abstract comet Apr 12, 2026, 2:28 PM

#

low shard what

I need it 🙁

low shard Apr 12, 2026, 2:28 PM

#

edgy quiver I need help T-T, i am looking for some good male voice models for realtime, do u...

what models? are you trying to be like ben10 or e girl / e boy / trolling catfishing?

low shard Apr 12, 2026, 2:28 PM

#

patent plover no works :,v

This is a General AI Discord Server and there are many voice changers, elaborate:

your pc gpu
your pc os
what are you trying to do: TTS, AI Covers, E Girl Trolling / Catfishing or Roleplay
the tutorial link used

low shard Apr 12, 2026, 2:29 PM

#

abstract comet I need it 🙁

need what? an rvc voice model? check #1175430844685484042 or #1159289738314919936 , or make it yourself

low shard Apr 12, 2026, 2:29 PM

#

astral tangle hello everyone im new to ai stuff, i wanted to change a voice to another to make...

https://docs.aihub.gg/rvc/local/applio/

Applio

Last update: April 4, 2026

low shard Apr 12, 2026, 2:29 PM

#

lone smelt hi, anyone know why google colab keeps crashing or disconnecting when i’m genera...

This is a General AI Discord Server and there are many voice changers, elaborate:

your pc gpu
your pc os
what are you trying to do: TTS, AI Covers, E Girl Trolling / Catfishing or Roleplay
the tutorial link used

low shard Apr 12, 2026, 2:30 PM

#

thick patrol Hello! evening, morning to everyone! im just curious about why my W-Okada starts t...

This is a General AI Discord Server and there are many voice changers, elaborate:

your pc gpu
your pc os
what are you trying to do: TTS, AI Covers, E Girl Trolling / Catfishing or Roleplay
the tutorial link used

#

I should make the Sapphire Message about the Guidelines and Elaborating more visible, not sure why it's so ignored

edgy quiver Apr 12, 2026, 2:32 PM

#

Yeah, trying to get a good eboy one, im kinda tired of getting catcalled and all that. I currently have wokada set up (my gpu is a 3060)

ember nymph Apr 12, 2026, 3:18 PM

#

low shard do you need help?

Oh yeah

low shard Apr 12, 2026, 3:28 PM

#

edgy quiver Yeah, trying to get a good eboy one, im kinda tired of getting catcalled and all...

are you like trying to troll / catfish people or trying to like get privacy in games because you get harassed as a girl?

wanton parrot Apr 12, 2026, 3:56 PM

#

Hello, im trying to run tg-develop w okada fork on ubuntu server but when i run the server it does not accept arguments

kat@kat-server:~/Voice-Changer/MMVCServerSIO$ ./MMVCServerSIO --launch-browser false --https true
usage: MMVCServerSIO [-h] [--log-level {debug,info,warning,error,critical}] [--launch-browser]
MMVCServerSIO: error: unrecognized arguments: false --https true
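
The usage line lists `--launch-browser` with no value after it, which is how a plain on/off switch is normally displayed, and `--https` does not appear in it at all. A minimal sketch of how such a parser would produce this error, assuming the server uses Python's `argparse`; the parser below only mirrors the usage text shown above and is not the real MMVCServerSIO code:

```python
import argparse

# Hypothetical parser rebuilt from the usage line in the error output.
parser = argparse.ArgumentParser(prog="MMVCServerSIO")
parser.add_argument(
    "--log-level",
    choices=["debug", "info", "warning", "error", "critical"],
)
# A switch: present means True, and it consumes no value.
parser.add_argument("--launch-browser", action="store_true")

argv = ["--launch-browser", "false", "--https", "true"]

# parse_known_args returns the leftovers instead of exiting with an error.
known, leftover = parser.parse_known_args(argv)
print("launch_browser:", known.launch_browser)
print("unrecognized:", leftover)
```

Under that assumption, `false` is not read as a value for the switch, so it ends up unrecognized together with `--https true`. Leaving `--launch-browser` off entirely would be the form to try for disabling it, and whether this build has any HTTPS option cannot be told from the output shown.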

finite wind Apr 12, 2026, 4:16 PM

#

hardy yew this is one of my datasets (from a game, too)

did you run this through some sort of noise removing process? if you did, I'd like to know how you did it and why

#

because I certainly didn't make my dataset to have voids all around the spectrum

#

not in spacing between audios but spaces inside of them like swiss cheese

hardy yew Apr 12, 2026, 4:17 PM

#

well, as i said, it was already clean as it's from a game so it's kind of an "easy" dataset

#

so lots of samples concatenated with 100ms breaks in between

#

smartcutter on top of that to possibly clean up silences from within the samples
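
The concatenation step described above can be sketched like this. It is only an illustration of joining samples with 100 ms of silence between them, not hardy yew's actual tooling; `concat_with_gaps` is a hypothetical helper and clips are plain lists of float samples to keep the example dependency-free:

```python
def concat_with_gaps(clips, sample_rate, gap_ms=100):
    """Join clips into one list of samples with digital silence in between."""
    gap = [0.0] * (sample_rate * gap_ms // 1000)
    out = []
    for index, clip in enumerate(clips):
        if index > 0:
            out.extend(gap)  # silence only between clips, not at the edges
        out.extend(clip)
    return out


# Two tiny fake clips at a 1 kHz sample rate, so 100 ms equals 100 samples.
joined = concat_with_gaps([[0.5] * 10, [0.25] * 20], sample_rate=1000)
print(len(joined))
```

The silence-trimming pass mentioned afterwards (smartcutter) is a separate tool and is not reproduced here.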

low shard Apr 12, 2026, 4:18 PM

#

wanton parrot Hello, im trying to run tg-develop w okada fork on ubuntu server but when i run ...

This is a General AI Discord Server and there are many voice changers, elaborate:

your pc gpu
your pc os
what are you trying to do: TTS, AI Covers, E Girl Trolling / Catfishing or Roleplay
the tutorial link used

finite wind Apr 12, 2026, 4:18 PM

#

#

so these are from the original samples or smartcutter did that for you?

analog obsidian Apr 12, 2026, 4:19 PM

#

finite wind

rvc fills the voids when the model upscales the audio, it doesnt matter

finite wind Apr 12, 2026, 4:20 PM

#

analog obsidian rvc fills the voids when the model upscales the audio, it doesnt matter

I know it does, because I've already read you saying it a couple of times in the discord

#

BUT

#

do I want to make those voids on purpose is the question

analog obsidian Apr 12, 2026, 4:20 PM

#

no

finite wind Apr 12, 2026, 4:21 PM

#

hmm okay

#

so voids on only spacing for now

#

I thought it could make clearer voice output idk

analog obsidian Apr 12, 2026, 4:21 PM

#

there will be always quality loss due to how rvc works, it will never be 1:1 with the dataset

finite wind Apr 12, 2026, 4:22 PM

#

yep and we're just trying to mitigate the losses as much as we can

#

and this is the part of the thing too but it's a shame I guess

hardy yew Apr 12, 2026, 4:22 PM

#

finite wind so these are from the original samples or smartcutter did that for you?

good question, i would have to compare this part from samples to before smartcutter

finite wind Apr 12, 2026, 4:23 PM

#

yeah somehow I thought those voids inside of the audio spectrum not the ones from spacing could enhance outcome quality

hardy yew Apr 12, 2026, 4:23 PM

#

#

this is smartcutter

#

and this is original

#

(slightly different scale too because the second one is before downsampling to 32k)

finite wind Apr 12, 2026, 4:24 PM

#

ah so original already had those voids in

hardy yew Apr 12, 2026, 4:24 PM

#

yeah, looks like it

finite wind Apr 12, 2026, 4:24 PM

#

oki

#

lyery also confirmed it does not enhance outcome so

hardy yew Apr 12, 2026, 4:24 PM

#

I admire the attention to details, I don't look that precisely usually xD

finite wind Apr 12, 2026, 4:24 PM

#

gotta do what I was doing just now

#

hey I just wanna make my dataset processing worth a while

analog obsidian Apr 12, 2026, 4:27 PM

#

the only trick to enhance the quality of a rvc model is to get a better dataset, recorded with a decent mic, low noise, and no editing at all

hardy yew Apr 12, 2026, 4:28 PM

#

what i usually do is slap the data i got into training and filter it later, if the model develops some flaws

finite wind Apr 12, 2026, 4:28 PM

#

there are more questions to ask after I realized what a better dataset is in general

analog obsidian Apr 12, 2026, 4:28 PM

#

raw wav audio files

finite wind Apr 12, 2026, 4:28 PM

#

like hmm

analog obsidian Apr 12, 2026, 4:29 PM

#

like a recording of yourself, without any editing to that audio clip

#

no mp3 compression

#

no "voids" in the spectrum

finite wind Apr 12, 2026, 4:29 PM

#

should I avoid putting chest voiced audio when the majority of the audio is modal voiced? kind of stuff

finite wind Apr 12, 2026, 4:30 PM

#

analog obsidian no "voids" in the spectrum

this just saved me an hour or two tbh

analog obsidian Apr 12, 2026, 4:30 PM

#

yep you want consistency, in both timbre and audio quality

finite wind Apr 12, 2026, 4:30 PM

#

yeeahhh

#

I was confused because I read you need diverse audios but at the same time you can't put drastically or moderately different styled audio (in terms of speaking / singing)

analog obsidian Apr 12, 2026, 4:31 PM

#

yea with diverse they mean pitch variety, not monotone audio

finite wind Apr 12, 2026, 4:31 PM

#

gotcha

analog obsidian Apr 12, 2026, 4:31 PM

#

you have to teach the ai the whole voice range of your speaker

finite wind Apr 12, 2026, 4:32 PM

#

I just gotta undo my voids on spectrums I did for the past 30 minutes

#

and I put too much trust on index since I tested out british accent model as a realtime model

hardy yew Apr 12, 2026, 4:33 PM

#

analog obsidian no "voids" in the spectrum

why not actually? I mean if it was to be added manually then sure, perhaps not worth the hassle. But assuming it's inserted automatically, isn't it better to discard the noisy silence?

analog obsidian Apr 12, 2026, 4:33 PM

#

analog obsidian yea with diverse they mean pitch variety, not monotone audio

ah and also word variety, you dont want the speaker to repeat the same words often

hardy yew Apr 12, 2026, 4:33 PM

#

(even if the noise is low anyway)

finite wind Apr 12, 2026, 4:33 PM

#

and it only sounded somewhat decent when I mimicked the british accent decently

#

so a lil disappointment there but we carry on

analog obsidian Apr 12, 2026, 4:33 PM

#

hardy yew why not actually? I mean if it was to be added manually then sure, perhaps not w...

with voids you're talking about the empty space in the spectrogram? like the ones caused by compression?

#

or you're talking about the silences between samples? lol

finite wind Apr 12, 2026, 4:34 PM

#

not the gaps but like holes as if it's swiss cheese

hardy yew Apr 12, 2026, 4:34 PM

#

analog obsidian with voids you're talking about the empty space in the spectogram? like the ones...

oh i thought you meant the silence

#

mb

analog obsidian Apr 12, 2026, 4:34 PM

#

ah ok compression

#

yes avoid them

#

rvc dont like that

finite wind Apr 12, 2026, 4:34 PM

#

ffghhhhh

#

I gotta undo

analog obsidian Apr 12, 2026, 4:34 PM

#

i mean, it fills them up with random shit but it's not ideal

#

better to have real data there

finite wind Apr 12, 2026, 4:36 PM

#

we really gotta update AIHUB documents

hardy yew Apr 12, 2026, 4:37 PM

#

analog obsidian yes avoid them

hmm in theory i could fill those with RX's spectral reconstruction. Wonder if it's better or worse than leaving it untouched.
Interesting thing to check I guess, not sure how that reconstruction thing performs

finite wind Apr 12, 2026, 4:37 PM

#

probably better off leaving it untouched unless your dataset is low quality in those terms

analog obsidian Apr 12, 2026, 4:38 PM

#

hardy yew hmm in theory i could fill those with RX's spectral reconstruction. Wonder if it...

i'd rather train the original data, tho i have tried upscaled audio before as dataset and it came out fine

hardy yew Apr 12, 2026, 4:38 PM

#

Yeah, just wondering, might try it

#

I mean, this model turned out nice the way it is

analog obsidian Apr 12, 2026, 4:38 PM

#

i know pretrains dont like upscaled data tho

#

but finetuning is different... soo

#

misc_shrug

analog obsidian Apr 12, 2026, 4:40 PM

#

hardy yew I mean, this model turned out nice the way it is

yea tbh i think it's fine as long as it's not for pretrains

finite wind Apr 12, 2026, 4:43 PM

#

what's your take on non-verbal audio for a dataset though

#

like sighing, laughing(moderate not high pitched), humming sounds like hmm or etc

hardy yew Apr 12, 2026, 4:44 PM

#

out of those three laughing is the worst i think

#

wouldn't expect occasional sighing/humming to break the model

finite wind Apr 12, 2026, 4:45 PM

#

oh now it makes sense

#

my british accent model had A LOT of laughing or giggling

#

f the singer ig

hardy yew Apr 12, 2026, 4:46 PM

#

adding lots of noises like that will probably cause the model to insert them into normal speech

#

which is rather undesired xD

finite wind Apr 12, 2026, 4:46 PM

#

yeeeep

hardy yew Apr 12, 2026, 4:47 PM

#

one of my lazy trainings was Ellie from TLOU which had lots of shouting/screaming/growl-ish angry voicelines, beside normal speech

#

and it got quite audibly rendered into speech in the model

finite wind Apr 12, 2026, 4:47 PM

#

I can see that

#

why it could happen ye

hardy yew Apr 12, 2026, 4:48 PM

#

especially "heavier" speech with stronger emphasis turned out raspy like the screams

#

"soft" speech was more-or-less unaffected

#

but yeah, it was an experiment to see how it affects the model and turned out as expected, it's rather bad

finite wind Apr 12, 2026, 4:48 PM

#

now I think the most difficult thing to do in the dataset processing step is deciding which kind of audio you want to mainly use as a dataset

#

you can't just put everything and hope for the training to turn out good at everything right?

hardy yew Apr 12, 2026, 4:49 PM

#

finite wind now I think the most difficult thing to do in the processing dataset step is whi...

well, ideally if the data is consistent, then this is a non-issue

finite wind Apr 12, 2026, 4:49 PM

#

so I gotta choose what kind of audio I mainly want to train

#

like, for someone who doesn't know what a consistent data is

#

I, myself would put shouting, crying, singing, grumping, sarcastical speech, screeching, mocking etc in the same dataset

#

and it wouldn't be able to make generalized voice like I expect it to

#

for example like the one you've just said from TLOU

hardy yew Apr 12, 2026, 4:51 PM

#

finite wind I, myself would put shouting, crying, singing, grumping, sarcastical speech, scr...

ideally that would be great, but for a better and more flexible architecture than RVC

finite wind Apr 12, 2026, 4:51 PM

#

that's the thing

#

we can't do that YET

hardy yew Apr 12, 2026, 4:51 PM

#

yeah, hopefully some day

finite wind Apr 12, 2026, 4:52 PM

#

RVC limitation

#

is uh

hardy yew Apr 12, 2026, 4:52 PM

#

although a part of me doesn't want it

#

due to how common catfishing and other stinky use cases are

finite wind Apr 12, 2026, 4:52 PM

#

if I have a dataset of crying, angry, annoyed then I have to choose one

#

oh it's just an example

#

even singing style can vary for the same person

#

and we have to choose only one of them to make a model for now

#

at least realtime or retrieval tech isn't going anywhere to be developed further at the moment

#

TTS industry is going to be advancing just fine and at least it's not for catfishing

viral mason Apr 12, 2026, 4:56 PM

#

finite wind I, myself would put shouting, crying, singing, grumping, sarcastical speech, scr...

So far I know humming, grunts, yelling (kinda), singing, coughing etc. works

#

Especially when used with a good pretrain like Legacy core 1.5 or 1.6

finite wind Apr 12, 2026, 4:57 PM

#

viral mason So far I know humming, grunts, yelling (kinda), singing, coughing etc. works

I think, in practice, audios that are drastically different produce unstable outcomes because it's not just the pitch that differs

viral mason Apr 12, 2026, 4:57 PM

#

I usually keep singing out of a dataset if it's mostly a talking model

finite wind Apr 12, 2026, 4:58 PM

#

that I gotta agree

#

so any model that sounded less robotic with fewer artifacts had this monotone-like feeling in my experience

hardy yew Apr 12, 2026, 4:58 PM

#

finite wind I think, in practice, audios that are drastically different produce unstable out...

i think those things also will depend on the ratio between the types of data

finite wind Apr 12, 2026, 4:59 PM

#

maybe

#

my mind says 9:1 can disrupt a lot than 5:5 ratio

viral mason Apr 12, 2026, 4:59 PM

#

What does that mean

#

I just woke up and in general I'm kinda slow

hardy yew Apr 12, 2026, 5:00 PM

#

screaming only = bad
screaming just a bit = not as bad

viral mason Apr 12, 2026, 5:00 PM

#

Good example is my model of Doey

finite wind Apr 12, 2026, 5:01 PM

#

for simple explanation would come from talking models

#

let's say you want a model to sound like a specific game character in general

#

but you have a dataset that consists of just talking, shouting, being grumpy, or idk mocking

#

ratio wise idk what ratio that can f up the model

cedar rock Apr 12, 2026, 5:03 PM

#

finite wind for simple explanation would come from talking models

i am new on this and i want to know how to have a solid base to start learning Ai....is coding must?

#

misc_sob

viral mason Apr 12, 2026, 5:04 PM

#

finite wind ratio wise idk what ratio that can f up the model

So can excessive coughing really mess up a dataset?

#

It's mostly for General Grievous from star wars

finite wind Apr 12, 2026, 5:04 PM

#

idk we will have to find out ourselves

#

both the ratio and whether having coughing in the dataset in the first place is a problem or not

viral mason Apr 12, 2026, 5:05 PM

#

Hmm

finite wind Apr 12, 2026, 5:05 PM

#

ideally it reproduces the same cough every time you cough

#

but I can imagine the model to blend that coughing into normal speech and f up the whole speech you're trying to say

#

idk if that's prevented during the training phase, I don't really know

hardy yew Apr 12, 2026, 5:07 PM

#

that's pretty much my Ellie case i think and for that the answer is right there

#

but i would expect that with not-so-much shouting it would be way better

finite wind Apr 12, 2026, 5:07 PM

#

that is, in this case a lot of coughing is in the dataset of that general grievous

hardy yew Apr 12, 2026, 5:07 PM

#

so maybe same with coughing

finite wind Apr 12, 2026, 5:08 PM

#

I think moderate tone differences that express happiness, sadness, and annoyance can be done, and I've seen a few

#

but above that, I don't think we can do that

hardy yew Apr 12, 2026, 5:08 PM

#

that for sure, nothing wrong with expressions

finite wind Apr 12, 2026, 5:09 PM

#

but people expect more than just little expressions to fall into the category of "oh I should put this in the dataset"

viral mason Apr 12, 2026, 5:09 PM

#

His voice is usually this really gravelly, somewhat robotic voice but for the most part it's pretty human sounding

#

Not sure how the sound of his voice could affect the training

finite wind Apr 12, 2026, 5:09 PM

#

so did I for one or two first models I trained

hardy yew Apr 12, 2026, 5:09 PM

#

viral mason Not sure how the sound of his voice could affect the training

one way to find out 8)

finite wind Apr 12, 2026, 5:09 PM

#

like glados and etc characters

#

I think his voice alone isn't a problem at all

#

it is that god damn non-verbal things are always the problem

hardy yew Apr 12, 2026, 5:10 PM

#

it's quite harsh at times

finite wind Apr 12, 2026, 5:10 PM

#

not to mention the "too much expressive" speech if you're training a character model

hardy yew Apr 12, 2026, 5:10 PM

#

i'd worry a bit that RVC could exaggerate this after training

#

but usually it does well with all kinds of funky voices

finite wind Apr 12, 2026, 5:11 PM

#

yeep

hardy yew Apr 12, 2026, 5:11 PM

#

whether clean human speech or something very artificial

#

e.g. both robotic-ish voices i tried training had some flaws resulting from RVC learning some parts of it too well and exaggerating them

#

one was resulting from low frequency content in the voice so that was more or less architectural limitation

#

the other was a 'cyborg' voice which is 90% human with slight electronic buzzing

#

and it affected the model a bit too much too

#

doesn't sound great even though it's actually not so far from the original

viral mason Apr 12, 2026, 5:14 PM

#

For this model he has some efforts and pure screaming in it but most of it is talking, the thing is his voice goes from talking in a deeper more serious voice to a more silly higher pitched voice
https://discord.com/channels/1159260121998827560/1349526562197995570

hardy yew Apr 12, 2026, 5:14 PM

#

example

viral mason Apr 12, 2026, 5:15 PM

#

I love making silly models

finite wind Apr 12, 2026, 5:15 PM

#

I heard it and it sounds like high pitch sounds are blended with his serious tone of voice

analog obsidian Apr 12, 2026, 5:15 PM

#

me too

viral mason Apr 12, 2026, 5:15 PM

#

finite wind I heard it and it sounds like high pitch sounds are blended with his serious ton...

That model is a bit older tho so I no longer use it

#

Still struggling to make a new one that I actually like

finite wind Apr 12, 2026, 5:16 PM

#

I just wanted to point that out because it might have been the very thing we were talking about the dataset ratio or the

#

too many diverse expressions produce a lower quality model

viral mason Apr 12, 2026, 5:17 PM

#

Makes sense

finite wind Apr 12, 2026, 5:17 PM

#

Doey has serious and silly voices which are drastically different so I figure

#

That might be the reason why e-girl models were thriving back then

analog obsidian Apr 12, 2026, 5:18 PM

#

rvc dont learn expressions, it learns to predict mel and features

#

misc_shrug

finite wind Apr 12, 2026, 5:18 PM

#

that sounds about right

viral mason Apr 12, 2026, 5:19 PM

#

finite wind That might be the reason why e-girl models were thriving back then

I'm feeling car sick

hardy yew Apr 12, 2026, 5:19 PM

#

https://tenor.com/view/cat-ai-pufferfish-cat-puffer-fish-ai-pufferfish-cat-ai-ai-gif-3219711934296360561

viral mason Apr 12, 2026, 5:20 PM

#

I find it strange tho how rvc can make a model like this somewhat where it can change pitch to match the random voices this character changes to

#

But struggles with a character that just has a lot of range in their voice like going from serious dark voice to higher more bubbly voice

finite wind Apr 12, 2026, 5:22 PM

#

The limitation is real and I'm coping

viral mason Apr 12, 2026, 5:22 PM

#

I hope some rich dude will show up and improve this stuff

hardy yew Apr 12, 2026, 5:22 PM

#

viral mason I find it strange tho how rvc can make a model like this somewhat where it can c...

Oh have you made a model like this?

viral mason Apr 12, 2026, 5:22 PM

#

hardy yew Oh have you made a model like this?

My friend has yes

#

I've tried many times but never was left satisfied

finite wind Apr 12, 2026, 5:23 PM

#

one day I can say unhinged shit as glados that sounds so natural to the point others might think it could've been from the actual game

viral mason Apr 12, 2026, 5:23 PM

#

finite wind one day I can say unhinged shit as glados that sounds so natural to the point ot...

I mean my model sounds pretty damn good as long as you add autotune to it when using live

finite wind Apr 12, 2026, 5:24 PM

#

I used one from here for a gartic phone session and it was fun

hardy yew Apr 12, 2026, 5:24 PM

#

viral mason I've tried many times but never was left satisfied

i think maybe it's achievable by separating the various voices by some criterion

#

splitting by features is probably not a way

#

but pitch maybe?

#

like, having some vastly different samples of a low voice and high-pitched completely different voice

finite wind Apr 12, 2026, 5:25 PM

#

could try with a deadpool model I was gonna try training

hardy yew Apr 12, 2026, 5:25 PM

#

and perhaps training with some low batch size to make it not generalize so well on purpose

finite wind Apr 12, 2026, 5:25 PM

#

since he got that normal unc voice and his silly voice lines

edgy quiver Apr 12, 2026, 5:25 PM

#

low shard are you like trying to troll / catfish people or trying to like get privacy in g...

Not for trolling, but just so that i can talk without people being weird

viral mason Apr 12, 2026, 5:26 PM

#

finite wind since he got that normal unc voice and his silly voice lines

My favorite Deadpool as of recently is the one from marvel rivals

hardy yew Apr 12, 2026, 5:26 PM

#

edgy quiver Not for trolling, but just so that i can talk without people being weird

that is a respectable use case

finite wind Apr 12, 2026, 5:27 PM

#

viral mason My favorite Deadpool as of recently is the one from marvel rivals

Exactly the one that I was gonna do

#

Say, does Kaggle Applio still work?

#

or I should just try that google colab

viral mason Apr 12, 2026, 5:31 PM

#

edgy quiver Not for trolling, but just so that i can talk without people being weird

As long as you're not using any of those e girl models you're still a person

edgy quiver Apr 12, 2026, 5:31 PM

#

Idk how i would do that as a woman but yeah

hardy yew Apr 12, 2026, 5:31 PM

#

xDD

viral mason Apr 12, 2026, 5:32 PM

#

finite wind Say, does Kaggle Applio still work?

Yup! Still works and it's the only version I know how to use anymore

finite wind Apr 12, 2026, 5:32 PM

#

imagine using female model as a female

#

see where that takes you

finite wind Apr 12, 2026, 5:32 PM

#

viral mason Yup! Still works and it's the only version I know how to use anymore

doesn't that charge you for a server or anything?

#

in the age of AI I can't imagine any provider that doesn't make you pay for a server

viral mason Apr 12, 2026, 5:32 PM

#

Nope, 30 hours free

#

A week

finite wind Apr 12, 2026, 5:32 PM

#

WHAT

#

what spec

#

I NEED TO KNOW

hardy yew Apr 12, 2026, 5:33 PM

#

#

one of those two

edgy quiver Apr 12, 2026, 5:33 PM

#

finite wind see where that takes you

"Mommy asmr" ahh voice

finite wind Apr 12, 2026, 5:33 PM

#

edgy quiver "Mommy asmr" ahh voice

wrong but good attempt

edgy quiver Apr 12, 2026, 5:33 PM

#

Wdym

finite wind Apr 12, 2026, 5:33 PM

#

hardy yew

not even that bad

hardy yew Apr 12, 2026, 5:34 PM

#

it's free 30h weekly

finite wind Apr 12, 2026, 5:34 PM

#

where are they pulling their cash from wth

hardy yew Apr 12, 2026, 5:34 PM

#

i'd say it's amazing xD

finite wind Apr 12, 2026, 5:34 PM

#

it's google level of flex in terms of a provider that is

viral mason Apr 12, 2026, 5:34 PM

#

hardy yew

-# I delete my acc every time my time is too low and just use one of my other emails and it resets the time each time forever giving me infinite training time

hardy yew Apr 12, 2026, 5:34 PM

#

https://tenor.com/view/msp-mass-state-police-helicopter-msp1000-gif-9232721672305839389

#

they're on the way

viral mason Apr 12, 2026, 5:35 PM

#

Totally don't do what I said

finite wind Apr 12, 2026, 5:35 PM

#

so they hadn't even blocked account refreshing yet

#

Kaggle is so unreal

viral mason Apr 12, 2026, 5:35 PM

#

Kaggle is W

#

Love it

#

Colab is Doo Doo because the time limit for anything is like 4 hours max

finite wind Apr 12, 2026, 5:35 PM

#

though how long does it usually takes for a single epoch to train?

#

my local training time for an epoch would be 24 seconds

hardy yew Apr 12, 2026, 5:36 PM

#

finite wind though how long does it usually takes for a single epoch to train?

very broad question, considering one epoch of 1min dataset is a bit different than one epoch with a 5h dataset xD

finite wind Apr 12, 2026, 5:36 PM

#

ok hm

#

let's say 1 epoch = 50 steps

#

how long did it take you on Kaggle

#

so it's around like

#

15 to 18 minutes of data

viral mason Apr 12, 2026, 5:36 PM

#

finite wind though how long does it usually takes for a single epoch to train?

Not sure but I'd say it trains quickly

#

Really depends on dataset length

hardy yew Apr 12, 2026, 5:37 PM

#

anran_klm_dc_32k_4b | epoch=139 | step=7367 | Current time: 23:50:47 | Time per epoch: 0:00:25

#

this is from epochs with 53 steps

#

25s
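A rough sketch of what that log line implies, assuming the 25 s per epoch stays constant for the whole run. The 250-epoch run length and the use of the 30 h weekly quota as a budget are assumptions for illustration, and the function names are made up here.

```python
def seconds_per_step(seconds_per_epoch: float, steps_per_epoch: int) -> float:
    """Average wall-clock time of one training step."""
    return seconds_per_epoch / steps_per_epoch


def total_training_hours(seconds_per_epoch: float, epochs: int) -> float:
    """Total run time in hours if every epoch takes the same time."""
    return seconds_per_epoch * epochs / 3600


def runs_per_quota(quota_hours: float, seconds_per_epoch: float, epochs: int) -> int:
    """How many complete runs fit into a weekly GPU quota."""
    return int(quota_hours * 3600 // (seconds_per_epoch * epochs))


# Numbers from the log line above: 53 steps per epoch, 25 s per epoch.
print(f"{seconds_per_step(25, 53):.2f} s per step")
# Assumed run length of 250 epochs.
print(f"{total_training_hours(25, 250):.2f} h per run")
# Assumed budget: the free 30 h weekly quota.
print(f"{runs_per_quota(30, 25, 250)} full runs per week")
```

This ignores preprocessing, feature extraction and download time, so the real budget is a bit tighter.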

finite wind Apr 12, 2026, 5:37 PM

#

so that's similar to my spec with acceleration option on

#

damn

#

really similar I tell you

#

3060ti with i5-14gen 32gb ram

viral mason Apr 12, 2026, 5:40 PM

#

Shhhh

hardy yew Apr 12, 2026, 5:41 PM

#

good bot

viral mason Apr 12, 2026, 5:42 PM

#

finite wind 3060ti with i5-14gen 32gb ram

I have a 5070ti but I refuse to train locally bc it's confusing, just takes a lot of room on my pc, and can cause my vr to do janky stuff like freeze etc.

finite wind Apr 12, 2026, 5:42 PM

#

okay do I install applio on Kaggle or

finite wind Apr 12, 2026, 5:42 PM

#

viral mason I have a 5070ti but I refuse to train locally bc it's confusing, just takes a lo...

yeah because it's hoarding your pc resources and VR is a heavy game

viral mason Apr 12, 2026, 5:42 PM

#

Kaggle is my favorite

#

Best option for non local

hardy yew Apr 12, 2026, 5:43 PM

#

i prefer to not train locally just because it's a waste of money when i can do the same in the cloud for free xd
and also no need to keep the PC running for additional hours and blocking me from doing something

viral mason Apr 12, 2026, 5:43 PM

#

Real

hardy yew Apr 12, 2026, 5:44 PM

#

the last time i trained a lot on my PC, my energy bill reflected it cat_doom

viral mason Apr 12, 2026, 5:44 PM

#

Yikes

hardy yew Apr 12, 2026, 5:44 PM

#

the only issues emerge in case of huge datasets that exceed the disk space available on kaggle for free xd

viral mason Apr 12, 2026, 5:45 PM

#

Good thing I've never gone over an hour of audio

#

Well there was that one time with Kratos

finite wind Apr 12, 2026, 5:45 PM

#

hey I know we're simping for Kaggle here but can I get a link or an explanation on how to set up applio on Kaggle

viral mason Apr 12, 2026, 5:46 PM

#

I have a whole video

hardy yew Apr 12, 2026, 5:46 PM

#

viral mason Good thing I've never gone over an hour of audio

one of my first nice models was the Witcher, dude has almost 11h of voiceover in the game cat_doom

finite wind Apr 12, 2026, 5:46 PM

#

on the contrary

viral mason Apr 12, 2026, 5:46 PM

#

This is how to do it

finite wind Apr 12, 2026, 5:46 PM

#

who is interested in limbus company character models

#

Imma do it

viral mason Apr 12, 2026, 5:47 PM

#

I've heard of the game but have no idea what it is

finite wind Apr 12, 2026, 5:47 PM

#

if you watched a video about it, it would be even more confusing ngl

viral mason Apr 12, 2026, 5:48 PM

#

😭

hardy yew Apr 12, 2026, 5:48 PM

#

viral mason This is how to do it

I like scripting, I just have a script with a couple variables that takes all the necessary configuration and it does all the magic with one click cat_sunglasses

finite wind Apr 12, 2026, 5:48 PM

#

viral mason This is how to do it

thanks a lot

hardy yew Apr 12, 2026, 5:48 PM

#

entire training procedure in one short string

viral mason Apr 12, 2026, 5:48 PM

#

hardy yew

?

#

Explain what you mean

finite wind Apr 12, 2026, 5:48 PM

#

set to 32k 250 epoch batch 4 and 1.6 legacy core pretrain bat file?

#

wowwie

hardy yew Apr 12, 2026, 5:49 PM

#

yeah, runs preprocessing, feature extraction, index generation and then runs the training

#

afterwards compresses all data into zip

viral mason Apr 12, 2026, 5:49 PM

#

No need for such specific epoch

hardy yew Apr 12, 2026, 5:49 PM

#

and then i just download it
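A minimal sketch of that kind of one-click script, not the actual script described here. It runs a list of commands in order, stops at the first failure, and zips the output folder at the end. `run_pipeline` and the commented-out step commands are hypothetical placeholders; the real preprocessing, extraction, index and training commands depend on the tool and are not shown in this chat.

```python
import shutil
import subprocess
from pathlib import Path


def run_pipeline(steps, output_dir, archive_base):
    """Run (name, command) steps in order, then zip output_dir.

    steps        -- list of (label, argv list) pairs, executed sequentially
    output_dir   -- folder whose contents end up in the archive
    archive_base -- archive path without the .zip suffix; keep it outside
                    output_dir so the archive does not include itself
    """
    for name, command in steps:
        print(f"[{name}] {' '.join(command)}")
        # check=True raises CalledProcessError on a non-zero exit code,
        # so a failed stage stops the pipeline instead of training on bad data.
        subprocess.run(command, check=True)
    archive = shutil.make_archive(str(archive_base), "zip", root_dir=str(output_dir))
    return Path(archive)


# Hypothetical usage; these script names are placeholders, not real commands:
# steps = [
#     ("preprocess", ["python", "my_preprocess.py"]),
#     ("extract", ["python", "my_extract.py"]),
#     ("index", ["python", "my_index.py"]),
#     ("train", ["python", "my_train.py", "--epochs", "250"]),
# ]
# run_pipeline(steps, "logs/my_model", "my_model_export")
```

Keeping the settings in a few variables at the top of such a script is what makes reruns a single command.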

finite wind Apr 12, 2026, 5:49 PM

#

I mean you can resume training from where you left off too right?

hardy yew Apr 12, 2026, 5:49 PM

#

viral mason No need for such specific epoch

i mean, in this case, 250 is when it ends

finite wind Apr 12, 2026, 5:50 PM

#

I guess 200, 300, 350 all good unless it's overtrained

hardy yew Apr 12, 2026, 5:50 PM

#

finite wind I mean you can resume training from where you left off too right?

in this case the 250 is almost surely much more than i need

finite wind Apr 12, 2026, 5:50 PM

#

you can resume from where you left off

hardy yew Apr 12, 2026, 5:50 PM

#

i can later reupload the data and restore it before continuing training

#

if needed

#

but i usually just pick a large number of epochs to "ensure" i won't need to resume training later

#

though sometimes i still do it later

finite wind Apr 12, 2026, 5:51 PM

#

sounds about right

hardy yew Apr 12, 2026, 5:52 PM

#

in my case i don't have data persistence so it's not like all the trained data stays there between runs

#

that's why i need to reupload the necessary files if i want to resume

#

not much work anyway

finite wind Apr 12, 2026, 5:52 PM

#

yep

hardy yew Apr 12, 2026, 5:53 PM

#

i prefer to avoid the GUI whenever i can

#

especially when the work is repetitive

finite wind Apr 12, 2026, 5:54 PM

#

isn't a GUI's whole point to make it less of a chore to navigate?

hardy yew Apr 12, 2026, 5:54 PM

#

each to their own, I guess

#

for me making a script and then just running one command is more convenient than opening a GUI and clicking through all the things

#

but GUIs are definitely convenient for lots of people

#

I'm spoiled by the linux world

finite wind Apr 12, 2026, 5:55 PM

#

ahh

#

I didn't get it at first because locally applio saves the latest settings you've used

viral mason Apr 12, 2026, 5:56 PM

#

finite wind I mean you can resume training from where you left off too right?

Ye

tame oracle Apr 12, 2026, 6:02 PM

#

pat

finite wind Apr 12, 2026, 6:03 PM

#

@viral mason oh yeah, when you put silences in both start and the end of a 3s clip

#

do you include those silences in the 3s total or exclude them from the 3s total

#

https://tenor.com/view/shh-cats-gif-10955612

viral mason Apr 12, 2026, 6:06 PM

#

finite wind @viral mason oh yeah, when you put silences in both start and the end ...

I'm not sure. I use one entire dataset and let Applio do the silent clips and all that; I just truncate the silence of the entire audio in Audacity before putting it in

finite wind Apr 12, 2026, 6:06 PM

#

hmmm oki

#

I was gonna manually slice clips into 3s

#

and I was wondering if silences in a clip should be counted as total seconds

viral mason Apr 12, 2026, 6:07 PM

#

Yea no need to manually do all that nonsense

analog obsidian Apr 12, 2026, 6:07 PM

#

the model learns silence thanks to the mute files, don't manually add silence to the samples

viral mason Apr 12, 2026, 6:07 PM

#

Applio automatically slices your audio pretty sure

#

That's why I put one entire audio file

finite wind Apr 12, 2026, 6:08 PM

#

yeah but I tested it and it cuts the sentence midway into two clips, so it just sounds weird

viral mason Apr 12, 2026, 6:08 PM

#

hmm

finite wind Apr 12, 2026, 6:08 PM

#

I wanted to manually slice into 3s myself

analog obsidian Apr 12, 2026, 6:08 PM

#

and during training it cuts that audio even more

#

by 0.36 secs

#

misc_shrug

finite wind Apr 12, 2026, 6:09 PM

#

got it

analog obsidian Apr 12, 2026, 6:09 PM

#

the model takes that 3s audio, then learns using segments of 0.36 secs, it doesn't learn the 3s at once
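A small sketch of that idea, assuming a 32 kHz sample rate and assuming the training window is cropped at a random offset. The real trainer may pick its windows differently, and the function names are made up for illustration.

```python
import random


def segment_length(sample_rate: int, segment_seconds: float = 0.36) -> int:
    """Number of samples in one training window."""
    return int(round(segment_seconds * sample_rate))


def random_segment(samples, sample_rate, segment_seconds=0.36, rng=random):
    """Return one window of segment_seconds from a clip (assumed random offset)."""
    seg_len = segment_length(sample_rate, segment_seconds)
    if len(samples) <= seg_len:
        return list(samples)  # clip shorter than one window: use it whole
    start = rng.randrange(0, len(samples) - seg_len + 1)
    return samples[start:start + seg_len]


clip = [0.0] * (3 * 32000)                 # a 3 s clip at the assumed 32 kHz
window = random_segment(clip, 32000)
print(len(window))                         # 11520 samples per window
print(len(clip) // segment_length(32000))  # 8 non-overlapping windows fit in 3 s
```

Under these assumptions the exact 3 s boundary matters little, since each step only sees a short window of the clip.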

finite wind Apr 12, 2026, 6:09 PM

#

no manual silence pauses in the dataset and no manual cutting

analog obsidian Apr 12, 2026, 6:09 PM

#

truncate the silence in audacity then use simple slicing

viral mason Apr 12, 2026, 6:10 PM

#

Simple slicing?

analog obsidian Apr 12, 2026, 6:10 PM

#

viral mason Apr 12, 2026, 6:10 PM

#

Ah

#

I've always used automatic

finite wind Apr 12, 2026, 6:10 PM

#

truncate the silences, and how long are they supposed to be after that?

#

30ms? 50ms? 100ms?

analog obsidian Apr 12, 2026, 6:10 PM

#

300 ms

finite wind Apr 12, 2026, 6:11 PM

#

300ms? sounds a lot more generous than I thought

analog obsidian Apr 12, 2026, 6:11 PM

#

finite wind 300ms? sounds a lot more generous than I thought

yea coz the model still needs to learn natural silence

#

i train pretrains, that's what i use and it works

#

misc_shrug

viral mason Apr 12, 2026, 6:12 PM

#

analog obsidian

You use 0.3 seconds? I have mine at 0.2

analog obsidian Apr 12, 2026, 6:12 PM

#

viral mason You use 0.3 seconds? I have mine at 0.2

0.2 is fine too

#

0.1, 0.2, 0.3
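To make the "truncate silence to 0.1 to 0.3 s" step concrete, here is a simplified stand-in written in plain Python. It is not Audacity's algorithm. It treats any sample below an assumed amplitude threshold as silence and shortens every silent run to at most `max_silence_s`; the threshold value and function name are assumptions.

```python
def truncate_silence(samples, sample_rate, max_silence_s=0.3, threshold=0.01):
    """Shorten every run of near-silent samples to at most max_silence_s.

    samples   -- sequence of floats in [-1.0, 1.0]
    threshold -- assumed amplitude below which a sample counts as silence
    """
    max_run = int(round(max_silence_s * sample_rate))
    out = []
    run = 0  # length of the current silent run
    for s in samples:
        if abs(s) < threshold:
            run += 1
            if run <= max_run:  # keep only the first max_run silent samples
                out.append(s)
        else:
            run = 0
            out.append(s)
    return out


# 1 s of silence between two short bursts, at a toy 1 kHz sample rate
audio = [0.5] * 10 + [0.0] * 1000 + [0.5] * 10
print(len(truncate_silence(audio, 1000)))  # 320: the pause shrinks to 0.3 s
```

Pauses shorter than the limit pass through untouched, so some natural silence stays in the dataset.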

viral mason Apr 12, 2026, 6:12 PM

#

Does it really affect it at all?

#

Training

analog obsidian Apr 12, 2026, 6:13 PM

#

yes

viral mason Apr 12, 2026, 6:13 PM

#

I'll try 0.3 then

viral mason Apr 12, 2026, 6:15 PM

#

analog obsidian

Do I need to enable truncate tracks independently if I'm using one audio file or nah

analog obsidian Apr 12, 2026, 6:15 PM

#

viral mason Do I need to enable truncate tracks independently if I'm using one audio file or...

idk, i leave that on always because im paranoid dog_laugh

viral mason Apr 12, 2026, 6:15 PM

#

Ah

#

🤫

finite wind Apr 12, 2026, 6:19 PM

#

when the audio is 48k but the spectrum only goes up to 19k on the chart, so it's effectively 38k joe_weird

#

gotta love mp3 bro
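The 19k to 38k arithmetic is the Nyquist relation: a signal with content up to f Hz needs a sample rate of at least 2f. A tiny sketch, with made-up function names:

```python
def effective_sample_rate(cutoff_hz: float) -> float:
    """Lowest sample rate that can still represent content up to cutoff_hz."""
    return 2.0 * cutoff_hz


def unused_bandwidth_hz(file_sample_rate: float, cutoff_hz: float) -> float:
    """Frequency range the file could hold but the audio never uses."""
    return file_sample_rate / 2.0 - cutoff_hz


print(effective_sample_rate(19_000))       # 38000.0
print(unused_bandwidth_hz(48_000, 19_000)) # 5000.0
```

So a 48k file with a 19k cutoff carries no more detail than a 38k one would.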

viral mason Apr 12, 2026, 6:19 PM

#

Use Wav

analog obsidian Apr 12, 2026, 6:19 PM

#

if the file is already mp3 he can't do much

finite wind Apr 12, 2026, 6:19 PM

#

yeah

viral mason Apr 12, 2026, 6:19 PM

#

Sad

analog obsidian Apr 12, 2026, 6:19 PM

#

converting it to wav is going to preserve the mp3 compression

finite wind Apr 12, 2026, 6:20 PM

#

it's a lost cause

viral mason Apr 12, 2026, 6:20 PM

#

Do you use YouTube dlp?

#

It's peak

finite wind Apr 12, 2026, 6:20 PM

#

I do but not often since they do that too

viral mason Apr 12, 2026, 6:20 PM

#

Do what?

finite wind Apr 12, 2026, 6:20 PM

#

youtube compression and all that

viral mason Apr 12, 2026, 6:20 PM

#

Ah

finite wind Apr 12, 2026, 6:20 PM

#

I often get lower quality ones

low shard Apr 12, 2026, 6:21 PM

#

edgy quiver Not for trolling, but just so that i can talk without people being weird

check either https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/ or https://docs.aihub.gg/realtime-voice-changer/local/vonovox/

don't use youtube video tutorials

Tg Develop's W Okada Fork

Last update: April 1, 2026

Vonovox

Last update: March 30, 2026

finite wind Apr 12, 2026, 6:21 PM

#

when people neatly edit an entire video of a character voice lines but download it with dlp

#

it's garbage

viral mason Apr 12, 2026, 6:21 PM

#

low shard check either https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-ok...

Do they have Nvidia?

finite wind Apr 12, 2026, 6:21 PM

#

I wish they could upload it somewhere else that can be lossless

low shard Apr 12, 2026, 6:22 PM

#

viral mason Do they have Nvidia?

viral mason Apr 12, 2026, 6:22 PM

#

Ah

#

Ok, was making sure since you sent Vonovox as an option