#✨│ai-help

1 messages · Page 285 of 1

stiff idol
#

I found it a bit baffling that CN instruments were forgotten

analog obsidian
#

u need to use those in the original audio with the instrumentals + vocals tho

stiff idol
#

You mean to load the original audio? No problem. lol I'm trying to learn audio editing

#

just so I cna make a good dataset because I will never have audio without SE or music and other stuff

analog obsidian
#

yea, if your original audio has harmonies you use becruily's, if not, you use fv4

stiff idol
#

I think I'll have to use an excel sheet like I had for LoRa models

#

too many models are coming my way

#

but it doesn't take long and it's quick even on my GTX 1650

#

I wanted to try Reaper instead of Audacity but it looks like a Spanish village to me (or idk how people say it 🤣 )

#

now only to make it a bit sharper and that'll be it, I don't think I can make the quality better 🧐

fossil sage
#

yes

#

i noticed anything above 90 epochs it has more noise

#

xtts

#

but its to hard to install

#

i was told xtts was the best at reading out loud

#

thats what im looking for

teal ferry
#

thats what my whispering model is but its outdated model

#

and has dumb quirks

#

i wouldnt use it

fossil sage
#

sound good to me

teal ferry
#

yeah but the effort to get to that was significant

fossil sage
#

😭

#

now how do i install xtts

teal ferry
#

if you installed the right version of cuda it should install for you

fossil sage
#

should i just use the google collab at this point

teal ferry
#

no colab will most likely kick you before the thing finishes

#

just try to install it again, pussy

waxen wigeon
#

Anyone know the AI that lets you combine rvc voice models to merge voices and where I can download it?

fossil sage
fossil sage
#

@royal kettle is it posssible to change the embedder for wokada

#

@teal ferry E:\xtts\alltalk_tts\alltalk_environment\env\Lib\site-packages\torch\cuda_init_.py:63: FutureWarning: The pynvml package is deprecated. Please install nvidia-ml-py instead. If you did not install pynvml directly, please report this to the maintainers of the package that installed pynvml for you.
import pynvml # type: ignore[import]
[AllTalk Startup] AllTalk startup Mode : Standalone mode
[AllTalk Startup] WAV file deletion : Disabled
[AllTalk Startup] DeepSpeed version : Not Detected
[AllTalk Startup] Model is available : Checking
[AllTalk Startup] Model is available : Checked
E:\xtts\alltalk_tts\alltalk_environment\env\Lib\site-packages\torch\cuda_init_.py:63: FutureWarning: The pynvml package is deprecated. Please install nvidia-ml-py instead. If you did not install pynvml directly, please report this to the maintainers of the package that installed pynvml for you.
import pynvml # type: ignore[import]
[AllTalk Startup] Current Python Version : 3.11.13
[AllTalk Startup] Current PyTorch Version: 2.8.0+cu128
[AllTalk Startup] Current CUDA Version : 12.8
[AllTalk Startup] Current TTS Version : 0.22.0
[AllTalk Startup] Current TTS Version is : Up to date
[AllTalk Startup] AllTalk Github updated : 19th March 2025 at 20:50
[AllTalk Startup] TTS Subprocess : Starting up
[AllTalk Startup]
[AllTalk Startup] AllTalk Settings & Documentation: http://127.0.0.1:7851
[AllTalk Startup]
E:\xtts\alltalk_tts\alltalk_environment\env\Lib\site-packages\torch\cuda_init_.py:63: FutureWarning: The pynvml package is deprecated. Please install nvidia-ml-py instead. If you did not install pynvml directly, please report this to the maintainers of the package that installed pynvml for you.
import pynvml # type: ignore[

teal ferry
#

You need to grab the location of python.exe again. In the env folder. Paste that location into a terminal then -m pip uninstall pynvml then paste it again -m pip install nvidia-ml-py

fossil sage
#

didn't work

#

@teal ferry IT FINALLY LAUNCHED

#

FINALLLLLLLLLLLLLY

viral mason
#

If u have Nvidia use Vonovox

fossil sage
#

o man who was the guy who just left the vc

teal ferry
fossil sage
stiff idol
fringe snow
#

Is Kaggle broken? The urls are all the same so Applio takes me to Tensorboard a few times before it takes me to Applio, then I try to use it and then I get an error that reads "Connection Errored Out"

fossil sage
#

this is horrible

#

and it told me to use 22050Hz, Mono, 16 bit to

#

the documentation said i can use a model

#

and imported it

#

but idk how to use it

teal ferry
#

id use 48khz for the reference

#

and stereo

fossil sage
#

ok imma try that

teal ferry
#

its 24khz to train

fossil sage
teal ferry
#

youd want to test on out of distribution data

fossil sage
#

whats that

teal ferry
#

or write some words that arent an exact match to the reference audio

teal ferry
#

sounds like it could be the same as reference audio

fossil sage
#

mman this is horrible

teal ferry
#

then make a better one

fossil sage
#

the reference audio is long enough and its high quality

#

i dont understand what else i could do

teal ferry
#

make a better model

fossil sage
#

i looked in the docu

teal ferry
#

do you have a model to import

fossil sage
#

yes i alredy imported it

teal ferry
#

are you implying that you intend to use an rvc model where an xtts model goes?

teal ferry
#

you cant

fossil sage
#

OH

#

THAT MUST BE WHY

teal ferry
#

models are not cross swappable

fossil sage
teal ferry
#

no

fossil sage
#

how do i tune it

teal ferry
#

follow the instructions

fossil sage
fossil sage
#

22050Hz

teal ferry
#

several reasons. mostly has to do with the size/shape of the tensor

fossil sage
#

imma train this one

fringe snow
#

bruh now it's just taking me to my files

stiff idol
teal ferry
stiff idol
#

🧐

teal ferry
#

if you can in some cases you cant

#

but xttsv2 has no problem with it

#

it depends on model type, i believe its the diffusion archetecture because of the latent space

stiff idol
#

I've just found a good model for vocals that removes a chinese instrument from vocals. finally (Mel-Band-Roformer-Karaoke-Gabox)

teal ferry
#

so youd be mapping the empty space with 48khz or a higher resolution

stiff idol
#

it can't deal with multiple vocals but that's fine, ... it's an experimental model, after all

teal ferry
#

if its just one thing of audio davinci resolve is kind of a hidden gem

#

its ai audio stuff is really good

stiff idol
#

Which AI stuff? I may not have caught on that.

#

I have yet to explore other types. For now only RVC and Stable Diffusion.

teal ferry
#

audio seperation from music and noise

stiff idol
#

I have a question, which model is the best for removing reverb and echo form audio?

teal ferry
#

i dont work with music often. when i denoise i just use mel band roformer

#

the original one by kim iirc is her name

stiff idol
#

Kim has many models

teal ferry
#

here i have it in my repo

stiff idol
#

thank you

fossil sage
#

Could not locate cudnn_ops_infer64_8.dll. Please make sure it is in your library path! its not in my libary path what do i do

pure obsidian
#

yo can anyone help

#

how to set up

#

some voice changer

tight halo
#

HI GUYS

#

IM AI

stiff idol
#

ok, so my test was: Mel Band Roformer Karaoke V2 (experimental) by Gabox is better for the audio where I tried separating vocals and isntruments

languid kernel
#

Ok

young halo
#

i just opened the damn kaggle

low shard
#

Cloud shouldn't be your first option

low shard
fringe snow
#

Damn rip

graceful igloo
#

Is there an alternative to weights.com? I want to train a voice model, but now weights.com is charging people to train voice models and I like to use ai for free.

young halo
#

is there any way to fix this? my screen turned off and then it was like this, it still training but not showing up on applio/kaggle

knotty moth
# stiff idol thank you

he was just showing his application repo that only pre-includes the single og Kim model. you should try this colab that lists many more models (including newer roformer models being better than the og Kim) or clone the MSST repo for local use: #📰│dev-updates message

knotty moth
young halo
#

How can i resume training on kaggle?

split warren
#

i dont understand why i can hear myself and is like 3 sec delay?
can someone help?

fringe snow
#

Besides for paying for more Ngrok benefits, there is a workaround you can do in #📰│dev-updates

young halo
#

i want to stop the training but when i press that button nothing happens and the training goes on

meager lichen
#

how do i uninstall okada voice changer?

#

i installed the original one but i want deitaris version

young halo
#

help, how do i stop it? the button wont work

sonic agate
#

who pinged

nocturne mural
knotty moth
teal ferry
#

correct

deft finch
#

I am trying to install Automatic1111 with an AMD graphics card. I followed all of the install instructions, but whenever I run the webui-user.bat file, I keep getting this error and I don't know how to resolve it. "RuntimeError: Torch is not able to use GPU; add --skip-torch-cuda-test to COMMANDLINE_ARGS variable to disable this check"

viral mason
#

never heard of it

deft finch
# viral mason never heard of it

Wdym, it's one of the most popular Stable diffusion uis. Or so I have heard idk. This ai stuff confusing fr. I don't blame you.

simple ore
#

use SD.Next

hard forge
#

Hi everyone,

I wanted to bring up something that’s been on my mind regarding Alignerr. I’ve been finding it difficult to actually secure projects on the platform, and I was wondering if others here are facing the same challenge.

If you’ve managed to get projects through Alignerr, could you share what worked for you? like how do you usually approach projects there?

I think it would be helpful if we could share our experiences and strategies so we can all figure out the best way to make Alignerr more fruitful for us.

Looking forward to hearing your thoughts!

simple ore
elfin cypress
#

Yo I need help

knotty moth
patent trellisBOT
# knotty moth !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a lega, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
chilly furnace
#

you can find someone linking resources every minute or so

tall merlin
#

index file not found

tame yacht
#

Hi
So i wanna develop a chatbot type ai which will be modeled after me and will communicate with my social media friends on my behalf and they won't notice whether it is me or ai

#

So
For that
I will need to learn python

#

But there are really many varieties in python
So i ask for you guys to help me

#

I would need to know which are the topics i need to learn in python to be able to start the project

tame yacht
#

Nah let it some times

brittle wing
#

i need help applio installation

stray pier
#

hello, is it possible to request the creation of a voice model that I need?

stiff idol
#

haven't heard of people using Automatic111, I tried it at the beginning too, but too many problems with it and it'ss not versatile

brittle wing
#

i need help

viscid topaz
patent trellisBOT
# viscid topaz !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a lega, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
oak adder
#

hei i wondeer is any open source ai like warp or cursor exists thank you .

low shard
low shard
# stray pier hello, is it possible to request the creation of a voice model that I need?

You can search rvc ai voice models at:

if there isnt one, you can:

earnest muskBOT
low shard
#

!howtoask

patent trellisBOT
# low shard !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a lega, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
subtle cedar
#

We can train voice models locally on Chromebook or ChromeOS?

low shard
lethal shale
#

heyy

#

i didnt use okada for 4 months and now I am back and it doesnt work

#

can someone pls help

low shard
#

!howtoask

patent trellisBOT
# low shard !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a lega, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
lethal shale
#

my mic works but it doesnt transfer my voice to line1 i guess

#

just when I speak I dont get a feedback

low shard
river dune
#

How can i use Stable Diffusion on AMD GPU?

lethal shale
#

i dont know why it doesnt work either it just doesnt work when I speak the app does nothing

low shard
low shard
lethal shale
river dune
lethal shale
#

sorry for hdr

low shard
# lethal shale

this is very low quality, but you seem to be on windows 11 with the rtx 4060 ti 16gb desktop on wokada deiteris fork b2332?

lethal shale
#

oh it works now

#

echo was cutting all my voice

low shard
lethal shale
#

omg lol

low shard
#

wokada deiteris fork latest update was b2332, the 7th december 2024

lethal shale
#

which okada should i use? i think that one is outdated right?

hallow thistle
# subtle cedar We can train voice models locally on Chromebook or ChromeOS?

"Chrome OS" refers to an operating system that made for Chromebook or any PC that supports it, while Chromebook refers to a type of a laptop that runs Chrome OS especially, which typically manufactured and sold by other PC brands. Unlike traditional laptops/desktops, there's no known Chromebook that is made for training AI locally, but for "online" training it sure can.

low shard
lethal shale
#

does it work better with 11 alreadyt

low shard
# lethal shale which okada should i use? i think that one is outdated right?

wokada is just one RVC AI Realtime Voice Changer program, which has 3 main current versions:

  • original: not suggested anymore as it doesn't have proper rvc performance updates
  • forks:
    • deiteris: a modified version which added some performance things, last update was in december 2024
    • tg-develop: a modified version off deiteris, added some quality of life updates but isn't considerated a maintained fork, last update was in august 2025

for windows nvidia, vonovox would be more suggested, which is another RVC AI Realtime Voice Changer for calls

#

-realtime

patent trellisBOT
# low shard -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE

• Wokada Tg-Develop Fork

A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE

• Wokada Deiteris Fork

Most suggested WebUI with the best general support for many platforms. GUIDE

⚔️ Wokada Deiteris Fork vs Vonovox

For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

low shard
#

here are the guides for each

#

I would ofcourse suggest Vonovox, it's your choice, you can read the pros and cons of each in each guides

lethal shale
#

Okay so I use it realtime in vr should I use vonovox and do I have to delete mine to download it

#

and is mine original?

low shard
low shard
#

vonovox gets more frequent updates and migh have better performance for your setup

low shard
#

you can read their specific pros&cons, that would be better if you want to understand more the difference, it's all ur choice tho

lethal shale
#

thanks bro do I have to delete my VC I have to use Vonovox

#

or can I keep both to compare

lethal shale
#

do they clash with each other I mean

#

like different versions of same app so one deletes when other gets in

viral mason
#

You can't use both at the same time, delete the current one you have and then download vonovox

lethal shale
#

Last question, Vonovox is better with performance and convert quality too right?

low shard
#

they are 2 different programs that do a similar goal with the same type of models

lethal shale
low shard
#

they aren't "updates" of the same program

viral mason
#

Vonovox is cool and has an auto update feature

lethal shale
#

Thanks Nick for helping me for 2 years lol

#

I always like people named Nick

lethal shale
low shard
#

You can guess I helped a lot of people here lol

lethal shale
#

you also helped me with voice training

#

2 years ago lol

#

If I make a website of voice model market I will send you gift lol

lethal shale
#

I am just sad I deleted my american accent models thinking they were bad

#

it is just that I got German accent

#

so they didnt work lol

low shard
lethal shale
#

I know that hahaha

#

Thanks for reminding

#

Well I didnt know but I could guess I think 😄

low shard
lethal shale
#

should I use index? I never use them

#

I feel like it causes inconsistency

low shard
# lethal shale Well I didnt know but I could guess I think 😄

Well yeah, we actually did allow people to promote here before some months ago (I think almost half a year ago) but there were just too many scams, scummy services and chaos that it wasn't worth it, even bc not many people were actually checking the promo channel

lethal shale
#

with my models I can sing, laugh, cry

#

they all work somehow

low shard
lethal shale
#

so my model is german and when I speak english I shouldnt use it

#

right?

#

Vonovox is realtime vc right?

viral mason
river dune
#

can somebody help me install stable diffusion for amd gpu ?

lethal shale
#

but i got few problems lol

#

where is echo cancellation

#

it is so complicated to me right now

lethal shale
#

nickkk

#

okay I use Steel Series Sonar smart noise reduction and it works well so far

simple ore
lethal shale
#

I got Nvidia yeah

#

but I dont know that one

#

Ill check now

simple ore
lethal shale
#

does it create another microphone like steel series

#

or just enhance the one you use

simple ore
#

you configure it with your real mic input

pure obsidian
#

yo

#

any expert in here that can help?

lethal shale
#

I will give it a shot now

pure obsidian
#

trying to download the vonovox thingy

simple ore
#

and then you pick nvidia broadcast output as an intput in the other app

lethal shale
#

i am launching

low shard
pure obsidian
#

can you please reply to my aihelpforum?

lethal shale
#

but I dont speak that language

low shard
lethal shale
#

there is no echo cancellation and sup 1 sup 2 like okada

#

and I hear myself 3 times lol

#

Steel Series Sonar made it better but now I will try Nvidia

simple ore
#

you may need to fix the audio path

pure obsidian
lethal shale
#

even echo echoes

simple ore
#

means your have some other inputs mixed in

lethal shale
#

I Use Voicemeter

simple ore
#

you dont need to route your actual mic thru voicemeeter

lethal shale
#

My Mic -> Line 1 -> Voicemeter

simple ore
#

stop that

pure obsidian
simple ore
pure obsidian
#

i cant send a photo

lethal shale
pure obsidian
#

can i tag u in the place where i posted the photo

lethal shale
#

can you stop interrupting and ask when this one is done

pure obsidian
#

yeah bro but u recently started i been asking since 2 hours

low shard
pure obsidian
low shard
#

@pure obsidian @lethal shale please don't argue about who gets helped first, there aren't many helpers (not all staffers are helpers, just the ones with the role "helper" are official ones), there are different timezones and sometimes might be multiple people asking at the time time, I hope yall understand we are volunteers :)

low shard
pure obsidian
low shard
#

!give-media-perms 1h @lethal shale

lethal shale
#

I can already send pics ig

low shard
#

you seem to be using 2 VACs at the same time

#

you only need Vac Lite

#

your audio setup should look like:

  • vonovox:

    • input: microphone
    • output: line 1
  • other programs:

    • input: line 1
    • output: headphones
simple ore
#

you really need to figure out the purpose of a1, b1 buses

lethal shale
#

I will try now

#

how will I listen to myslef

#

in vonovox

low shard
lethal shale
#

okay I will now try nvidia smart noise + Line 1 only

#

I will tell difference

#

then my own mic

#

and line 1

simple ore
#

A1, A2, A3 buses are hardware outputs, you send them to physical headphones, speakers, etc

#

B1, B2 are virtual outputs, you used the mix audio, like discord voice chat + game audio to push it to OBS

#

if you're using a noise canceling software like Nvidia broadcast, you do not select a real mic as any of the inputs

#

and you generally do not need to mix it to any bus either

#

you send the real mic to broadcast app, then from broadcast app to voice change, then from voice changer to virtual cable line 1, then you take that as input for discord

#

and output for discord is voicemeeter's virtual input

#

technically with banana you dont need virtual audio cable

#

you can use b1 to route audio from voice changer to discord

lethal shale
#

Nvidia broadcast made my voice like I speak in a cave

simple ore
lethal shale
simple ore
#

it should not add any effects

lethal shale
#

wym

simple ore
#

the broadcast app

simple ore
dusty delta
#

model isnt loading

#

and i cant hear myself

viral mason
simple ore
#

noise canceling

viral mason
#

ah

#

only added fx I usually use is de-esser and extra stuff to add fullness in fl studio

dusty delta
#

I NEED HELPP

simple ore
#

it is not for recorded audio

viral mason
#

pretty reasy actually

iron maple
#

Hi, not been in the space for a while but is RVC working with RTX 50 series cards yet?

low shard
iron maple
#

apoliges yes that

#

like applio

iron maple
#

give an audio sample and use a weight to change it locally

low shard
iron maple
#

last time I tried it, there was issues with pytorch or something on the 50 series

#

is that fixed?

low shard
low shard
iron maple
#

fantastic, thanks very much I appreciate it

dusty delta
#

dude

#

i cant bro

low shard
#

!howtoask

patent trellisBOT
# low shard !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a lega, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
dusty delta
#

okay so my full gpu name is NVIDA GeForce GT 1030 im windiows 11 im trying to start it by running start http bat but it keeps opening then closing

low shard
dusty delta
#

multiple?

low shard
# dusty delta multiple?

Yeah, this is a general AI server, it was previously RVC-Only but we expanded to multiple things

low shard
dusty delta
#

yes

#

i think so yes

low shard
dusty delta
#

kk

#

now what do i do

low shard
#

Don't expect it to work in intensive games like gta5 FiveM max graphics

dusty delta
#

sure

#

nah im using it for roblox

low shard
patent trellisBOT
# low shard -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE

• Wokada Tg-Develop Fork

A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE

• Wokada Deiteris Fork

Most suggested WebUI with the best general support for many platforms. GUIDE

⚔️ Wokada Deiteris Fork vs Vonovox

For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

low shard
#

Read up the 1st guide

dusty delta
#

i just have to ask a question

#

i have a intel r hd graphics 530

#

is that okay?

#

or no

low shard
# dusty delta is that okay?

Absolutely not that's 10 times worse, I'm guessing you have both the integrated GPU and the dedicated Nvidia GPU, the integrated GPU would be absolutely worse and 100% surely not recognized

#

AI needs power

dusty delta
#

so what do i do?

low shard
#

And let me know if it does or not

dusty delta
#

man i feel embarassed

#

i have an unc intel card

low shard
dusty delta
#

no im not

#

im not joking

low shard
dusty delta
#

correct

low shard
#

Would u want to at least try locally, or try cloud?

#

Hopefully you have a decent wifi connection

dusty delta
#

i do

#

full bars all the time

low shard
dusty delta
#

not yet

#

im about to

#

rocking a 30-35 mb

#

33 mb and 10.33 for upload

low shard
dusty delta
#

mb\s

low shard
#

MB = MegaByte
Mb = Megabit
Byte = 8 bit

dusty delta
#

this good?

low shard
# dusty delta rocking a 30-35 mb

Eh I mean, should be decent for cloud

Soo, would you rather try Vonovox locally to see if it can work? Or do you want to directly try cloud?

dusty delta
#

ill try locally

low shard
dusty delta
#

just gotta download it correct?

low shard
#

Be sure to not miss steps on the guide

dusty delta
#

vac lite kk

#

i downloaded vac right?

#

ran 64setup

#

and when i opened it

#

said it had to make changes

#

so i accepted

#

and it didnt work

viral mason
#

wait vonovox works on intel???

low shard
low shard
viral mason
dusty delta
#

SORRY BRO I WAS OUT

#

but uh

#

it didnt work

#

wait its working

#

nick

#

alright what now

#

nick i got it set up

mellow ermine
#

-rvc

patent trellisBOT
mellow ermine
#

-colab

patent trellisBOT
# mellow ermine -colab
📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**
• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

elfin cypress
#

So my voice changer is going high pitched and will cut out a lot is there a fix?

dusty delta
#

NICK

#

BROO

#

I GOT VONX AND DOWNLOADED ALL THE STUFF

coral kraken
#

Can someone please tell me if I can uplod a pth file model to Kits AI to convert my voice to one model I have in my computer? I got the starter upgrade version in kits AI but I can't fine where I can Upload the pth file like I use to do a year ago.

viral mason
#

do not use kits it's horrible

#

if anything use weights

last rivet
#

no fix it seems

subtle cedar
# low shard chromebooks are made mostly for just surfing the web and being cheap, most dont ...

Yeah but I rarely do train models and do an AI Covers nowadays because many K-pop fans using AI Covers for shipping opposite gender Kpop artists (well I dislike hetero ships because most of people who ship opposite gender celebrities are toxic that don't respect anyone's actual partner) and I can't train models too often anymore because of my Dad for some reasons so it's okay if my future laptop is Chromebook or ChromeOS or don't use a local training or unlimited GPU

knotty moth
subtle cedar
knotty moth
subtle cedar
knotty moth
#

it's like believing that my chicken can fly because I have trained him so hard, till you realize I was joking on it

hallow thistle
robust drum
#

so if i were to download this juice preset thing can it work for pro tools?

low shard
#

!howtoask

patent trellisBOT
# low shard !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a lega, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
low shard
#

!howtoask

patent trellisBOT
# low shard !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a lega, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
robust drum
#

where do i drag the file i downloaded? like the voice ai

#

do i drag it into models

#

in the RVC

low shard
#

ur only option is using a cloud (remote good pc) service with limited free gpu time

low shard
#

please elaborate as i asked

low shard
#

rvc doesn’t mean realtime voice changer, it measn Retrieval-based-Voice-Conversion

robust drum
#

do i drag the files into the RVC?

low shard
low shard
low shard
low shard
patent trellisBOT
# low shard !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a lega, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
robust drum
#

I downloaded the RVC off a yt video with a discord link to this discord am i cooked nick

low shard
knotty moth
low shard
robust drum
#

by cooked did i just download a virus?

low shard
low shard
#

i can’t help you if you don’t elaborate

robust drum
#

Ok so then whats the right RVC?

knotty moth
low shard
#

open source ai isn’t easy to use

hallow thistle
robust drum
#

Ok so i wanna use the an ai preset and I dont know how to start people are saying use a RVC? i dont even know what that is, is there something i gotta download

low shard
# knotty moth not with snapdragon 6xx or weaker

05:09 Settings

How to fix lag/stutters?
Launch the game then if you experience lag, go back to the home screen and open the game again in recent apps. Do it every time you experience lag, stutter or frame drops.

Device: Redmi 10c
Specs:
CPU: Snapdragon 680
GPU: Adreno 610 (1114 MHz)
ROM: 64gb (UFS 2.2)
RAM: 4gb
OS: MIUI 14.0.4 based on Androi...

▶ Play video
hallow thistle
robust drum
#

nah i just wanna know how to make my voice sound like a specific artist

low shard
hallow thistle
low shard
#

there isn’t a right program and version for everyone, it depends on your hardware and needs

robust drum
#

pre recorded, GE FORCE RTX, Windows 11 64

low shard
hallow thistle
#

RVC stands for Retrieval-based Voice Conversion, an AI program where you make AI cover and training voice models; it doesn't always mean realtime voice changer.

quick pine
#

is the voice changer supposed to be delayed

hallow thistle
quick pine
#

cause its delayed by like 20 secs

low shard
#

!howtoask

patent trellisBOT
# low shard !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a lega, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
low shard
#

be sure to not use youtube or video tutorials for ai realtime voice changers for roleplay in calls/games

hallow thistle
#

To check your PC GPU name, open Task Manager, go to Performance tab, and spot GPU 0 or GPU 1 in the left panel.

robust drum
#

GPU 1

hallow thistle
hallow thistle
robust drum
#

4050

#

RTX 4050

low shard
robust drum
#

so then i drag the file i want into this

low shard
hallow thistle
robust drum
hallow thistle
#

NVIDIA GeForce RTX 4050 is a mobile GPU for laptops. Applio will work with this GPU for inferencing (doing AI cover) but not very much for training.

robust drum
#

which one

low shard
hallow thistle
robust drum
#

this aint gonna hack me right

hallow thistle
#

Sometimes, it's too much to ask every step, I can focus on known settings and how to use the program.

robust drum
#

sometimes?

hallow thistle
robust drum
#

ok its downloaded

#

now extract all?

low shard
hallow thistle
#

If I tell you on "how to extract the zip", would you still want me to do it? osaka

robust drum
#

do i extract it or no

hallow thistle
#

This step is so simple. You open WinRAR or 7-Zip, you use the zip program to extract the internal files of "Applio" zip into somewhere like D:\Applio. No need to ask me for confirm like this.

low shard
hallow thistle
#

Yeah, I know you want me to tell you every step, even on how to run the program, so do I. koharuwheeze

robust drum
#

Ok im doing it

hallow thistle
robust drum
#

do u want me to extract it uhhh 6 to 7 times ? misc_cry

hallow thistle
#

After you extract the Applio zip, go inside Applio folder, and double click on run-applio.bat batch file.

robust drum
#

is it normal for the file to tell me my full name?

hallow thistle
#

After you double click on the batch file, wait for it until the program opens your browser.

knotty moth
robust drum
#

why is it stuck on this black screen

hallow thistle
robust drum
#

it wont let me lmao

#

its stuck on that

hallow thistle
robust drum
#

Ik i was saying its stuck on that

hallow thistle
#

Launching a Python-related AI program can take some time to load up files and resources, so don't expect it to launch within seconds. cat_vibe

robust drum
#

Ok

robust drum
#

i hit a key

#

and it does nothing

#

is that normal

low shard
hallow thistle
robust drum
#

so now what

hallow thistle
#

Run the batch file again.

robust drum
#

Ok

hallow thistle
#

This is what Applio might look like if launched successfully, but I'm using the older version.

robust drum
#

its saying file in one drive please move to a different file

hallow thistle
#

Don't extract anything on your desktop.

robust drum
#

Ok

#

you know @hallow thistle ur a legend did u know thatg

#

Ok got it working

#

now how do i add this vocal ai

#

???

#

dont die on me

hallow thistle
robust drum
#

i move it to logs?

hallow thistle
#

This is how you upload voice model files into \Applio\logs\ folder, so these voice models will appear on Applio GUI.

robust drum
#

good?

#

i just moved the two bottom files

hallow thistle
#

I won't be repeating the same step for another time.

robust drum
#

i love u

#

thank u

#

u deserve a raise

#

ur a goat

#

i love u

hallow thistle
#

I explain things in the most simplest I can do, and you're here to ask me for little more when it's too much for me. YuukaErm

hallow thistle
#

On Applio, click "Refresh" to load models.

robust drum
#

YESSS

#

THANK YOU SO MUCH

#

HOLY

hallow thistle
#

Now you got those models.

robust drum
#

YESS

#

i love u

hallow thistle
#

This is how you upload an audio file into Applio.

robust drum
#

😆

#

😩

hallow thistle
#

If you act like this to some other helpers, you might not only be getting mocked, by the way.

robust drum
#

wdym

#

im just saying thank u

elder coral
#

how do i make my models sound clean not robotic or artifactual creature than your bald friend's hair

fluid horizon
#

My Applio stopped working today. Can anyone help? I'm using Pinokio as my "launcher"
It can't do its RVC anymore today, so I figured to update it to the latest (which is a simple button Pinokio has)
But its the same problem. So I can't even see the Web UI. Had to open the local url to my browser to see (it is indeed running)
Any clues why? Also before it had no problem reading .m4a files and converting it. What is going on?

simple ore
fluid horizon
#

ah it was an older setup

#

like way back 2024

simple ore
#

the only things I've heard from pinokio is that it breaks shit for no reason

fluid horizon
#

that is true 🤣

#

i will migrate stuff out, i have my voice models in here

#

also im troubleshooting too! i think i found the new folder where the models go in

#

so yeah, pinokio broke it 🤣

#

may i have the link to that compiled version? 😄

#

i might just nuke pinokio entirely

low shard
fluid horizon
#

thanks!

low shard
simple ore
#

i mean, even Dione had f'd up the install

#

and it is from the same guys lol

fluid horizon
#

i did initially run applio in a google colab hahaha back when my rig was shittier

#

then i learned about pinokio lol they were gassing it up on youtube (last year)

#

right now, i cant even see it in their Web UI view. i think it started when i saw the themes ahha i wanted to try it

#

so yeah, im nuking this applio copy lol i just need to move out my trained models

#

like wow, pinokio must have broken it soooo much it cant even infer

#

thanks again for the guidance!

fluid horizon
stiff idol
#

Hi. I'm creating a dataset from videos on bilibili and other stuff. I've got a Chinese singer and wanted to ask someone with good hearing if 135Hz high-pass is fine or not. I think it's ok, 150Hz was too much, but there might be people with better hearing.

  1. HPF sample: high pass filter -> truncate silence -> loudness normalization
  2. no HPF: truncate silence -> loudness normalization

I'd like to ask if you hear any difference and whether it's good or bad, thank you.

stiff idol
#

yup, 135Hz was a bit too much, I needed to listen several times to notice when her voice goes lower, this one should be an okayish starting point

dusty delta
#

nick

#

i have vonox

#

but my file explorer wont open

knotty moth
thorn plinth
#

RuntimeError: The expanded size of the tensor (3850) must match the existing size (8192) at non-singleton dimension 0. Target sizes: [3850]. Tensor sizes: [8192]
//how to changer the size of the tensor

pure obsidian
#

guys

#

question

#

how to hear myself

#

from the vonovox

pure obsidian
#

bro my college wifi is embarassing as fck

#

its not opening

#

okay worked

oak edge
#

yooooo

#

what's the new spin-v2 @low shard

low shard
#

are u deadass running ai on college's pcs?

pure obsidian
#

no no i live in a dorm lol

low shard
#

idk how it works there but they might like track what u do

#

oh lol

pure obsidian
#

using my own laptop

pure obsidian
#

if u want i can send photos

#

of what i see

low shard
low shard
pure obsidian
#

yes

#

wait

#

can i open the forum

#

and show u

#

cuz i dont have perm photos to send here

analog obsidian
#

is not a pitch extraction either

#

its an embedder

knotty moth
analog obsidian
pure obsidian
#

like i need to use vpn to use steam and epic games and discord and so on

#

idk how to by pass it

#

so i use a vpn but they make wifi slower by 10 times

low shard
pure obsidian
#

nick bro see the forum i sent u the photos

low shard
pure obsidian
knotty moth
pure obsidian
knotty moth
pure obsidian
#

but when i use vpn

#

that shi drops to 20-30

low shard
pure obsidian
# low shard what vpn do u use btw?

tunnelbear but u get like 2gb a month / hotspot shield good one but free one takes u to america and fucks ur wifi / mullvad vpn its paid but a friend gifted me a sub for a month

low shard
#

Mullvad and ProtonVPN are prob the best privacy wise iirc

pure obsidian
#

and all are paid

low shard
#

it's unlimited, you just can't choose the region

pure obsidian
#

imma give it a try

#

thanks man

meager lichen
#

my voice changer was working well for abou 3 days now it stopped producing any output, any help?

pure obsidian
#

do like a forum

#

post

meager lichen
#

i didd

pure obsidian
#

oh alrr wait for some help mods

pure obsidian
meager lichen
#

ok

pure obsidian
#

bro lowkey nick is goated

#

promote that guy asap

viral mason
#

He is so cool

whole sky
#

My Play AI Hub account still shows the “Link Discord” button even though my Discord is already connected. When I press it, I get the error “Discord already linked” and the console shows multiple 401/500 errors from the /auth/discord and /user/discord/roles endpoints.

I already tried:

Removing Play AI Hub from Discord → Authorized Apps

Clearing cache and cookies

But the issue remains. It looks like my Discord is linked on the backend, but the UI still thinks it isn’t. Could someone manually reset or unlink my Discord connection from oficail site?

My Play AI Hub account Crypto Phan

pure obsidian
viral mason
low shard
hallow thistle
oak edge
#

@shy spruce @simple ore can you guys elaborate on the languages used for spinv2 (i'm thinking about using it to train tamil voice of mine)

meager lichen
viral mason
oak edge
#

@viral mason alsooo if i were to build a voice model from ground up without pretrains how long would it take

latent yacht
#

can i send an image here?

oak edge
#

and how the quality would be like? (because I'm training tamil voice-for which I don't see a pretrain anywhere)

oak edge
oak edge
viral mason
viral mason
#

You'd have to have a lotttttttttt of data tho

analog obsidian
#

24 hours minimum

oak edge
oak edge
oak edge
latent yacht
#

can one guys tell me if i should download these vcclients or just use the 1.5.3? like whats the difference are they better, new or what

simple ore
#

it should work with another language

viral mason
#

Outdated technology

oak edge
analog obsidian
#

if you train from scratch then you gotta find a way to fight rvc degradation

analog obsidian
#

coz rvc stock just blows up

latent yacht
knotty moth
oak edge
latent yacht
analog obsidian
viral mason
#

-rt

patent trellisBOT
# viral mason -rt
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE

• Wokada Tg-Develop Fork

A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE

• Wokada Deiteris Fork

Most suggested WebUI with the best general support for many platforms. GUIDE

⚔️ Wokada Deiteris Fork vs Vonovox

For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

analog obsidian
#

forgetting stuff it learned prior and overfitting on low frequencies

analog obsidian
#

so it will never clean the high ends

oak edge
viral mason
analog obsidian
oak edge
latent yacht
analog obsidian
#

50e is not enough for a pretrain either

#

im saying that it will improve up to that point then start to degrade

oak edge
oak edge
#

then how did these people manage to create pretrains?

#

like how are these spinv2 contentvec models were made?

viral mason
#

I'm kinda slow

latent yacht
#

Nvidia brother

analog obsidian
#

they finetune og pretrain

oak edge
#

then how were the og pretrains were made?

analog obsidian
#

retunes also degrade anyway

#

i noticed degradation in noobies model

latent yacht
analog obsidian
oak edge
#

@analog obsidian can you tell me what degree are you doing/did?

analog obsidian
#

i trained almost 1m steps of a failed pretrain

#

then did tons of tests

#

so i kinda know stock rvc cant train a pretrain

latent yacht
#

i checked vonovox guide , and it says that its features are paid

oak edge
#

you really really are a pro in this @analog obsidian

viral mason
oak edge
#

i wonder how many years of studies it's gonna take me to learn properly U_U

analog obsidian
#

why so interested in a new pretrain? the og one is great

oak edge
latent yacht
oak edge
#

which results as some specific kind of pronounciations never getting right

analog obsidian
#

spin v2 is also a english embedder

oak edge
analog obsidian
#

im sure spin and cvec were trained using the same dataset

oak edge
viral mason
analog obsidian
oak edge
viral mason
analog obsidian
analog obsidian
#

idk yt_nails

analog obsidian
oak edge
#

i think it will be available in it?

analog obsidian
oak edge
#

then how can I train with spin v2 Y_Y

analog obsidian
#

download noobies retune i guess

oak edge
#

also I train in google colab for free btw Y_Y

#

kaggle was uhh... tough Y_Y

analog obsidian
oak edge
#

ohh the sizes are relative low

#

i'd go with 40Khz i assume

hallow thistle
oak edge
#

but can you confirm where to put this in colab ? @analog obsidian

latent yacht
#

@viral mason which one of these

viral mason
#

-rt

patent trellisBOT
# viral mason -rt
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE

• Wokada Tg-Develop Fork

A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE

• Wokada Deiteris Fork

Most suggested WebUI with the best general support for many platforms. GUIDE

⚔️ Wokada Deiteris Fork vs Vonovox

For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

viral mason
analog obsidian
latent yacht
viral mason
oak edge
oak edge
#

wait

viral mason
oak edge
#

is there any ways to directly import them to colab rather than downloading and re uploading? as in uploading from a link ? @analog obsidian

analog obsidian
oak edge
#

wait can chat gpt 5 work in writing this code?

analog obsidian
#

im sure a llm can help you with this

#

lol

oak edge
#

i can create a cell with chat gpt

viral mason
#

Ew collab

oak edge
viral mason
#

Worst option for training

oak edge
oak edge
viral mason
#

I forget it has 1 l

latent yacht
latent yacht
#

it was right there i just needed to scroll a lil

#

lmao

oak edge
viral mason
oak edge
analog obsidian
oak edge
#

uhm idk how to use my 4060 Y_Y tbh i'm so used to training in colab

#

now idk how to do it locally

analog obsidian
viral mason
oak edge
#

and no way to save the epochs to drive

#

or any safe splace

viral mason
#

Just turn on the save options variables and files cat_seriously

hallow thistle
#

The download links from GitHub can be confusing, especially those zips that are named identically, looking duplicated but are actually splitted.

oak edge
viral mason
#

Uhmmm you got unlucky cat_seriously

oak edge
#

it just disappeared

oak edge
#

it saves my data properly in my drive

viral mason
#

Sometimes u just get unlucky but it's rare

oak edge
#

and i share my drive folder between 3 accounts

#

so 12 hours training

oak edge
viral mason
#

I will never ever ever use Google colab again for training because I don't wanna go back to switching accounts transferring, I'd rather have errors that make it impossible to train a model over the torture of 2 hour training sessions

#

It sucks ass

hallow thistle
#

You can try train a voice model on NVIDIA GeForce RTX 4060, but the VRAM won't be that much of it, so the best bet is to go for online services instead for much faster speed.

viral mason
oak edge
#

talk about patience

analog obsidian
#

eh it's enough for batch size 8 and fp16

viral mason
analog obsidian
#

i can do batch 8 and fp32 too

oak edge
oak edge
viral mason
#

One clean 15 hour training session over literally putting yourself through horrors beyond the mind is better for me

oak edge
oak edge
analog obsidian
#

why train for 15 hours lol a model can take 2 hours or 3

oak edge
oak edge
analog obsidian
#

my safest recommendation will be always 40-100 epochs lols

viral mason
oak edge
#

i think the electronic noises slowly subsided after 200

hallow thistle
#

"100 compute units" or Colab Pro monthly option on Google Colab both listed for $9.99, kind of exact to the price of Weights basic plan which is also $9.99.

analog obsidian
#

v2 is naturally more metallic, i asked to bring back v1 but as always i was not taken seriously lol

oak edge
#

yt_nails i mean chat gpt code worked to download stuff


URLS = [
    "https://huggingface.co/Aznamir/spin/resolve/main/spin-v2/f0G40k_spin-v2.pth",
    "https://huggingface.co/Aznamir/spin/resolve/main/spin-v2/f0D40k_spin-v2.pth",
]

DEST_DIR = "/content/Applio/rvc/models/pretraineds/custom"

import os, pathlib, requests

# Ensure destination exists
pathlib.Path(DEST_DIR).mkdir(parents=True, exist_ok=True)

def download_to(url: str, dest_path: str, chunk=1024 * 1024):
    with requests.get(url, stream=True, allow_redirects=True) as r:
        r.raise_for_status()
        with open(dest_path, "wb") as f:
            for part in r.iter_content(chunk_size=chunk):
                if part:
                    f.write(part)

saved = []
for url in URLS:
    filename = url.rsplit("/", 1)[-1]
    out_path = os.path.join(DEST_DIR, filename)
    print(f"→ Downloading {filename} ...")
    download_to(url, out_path)
    size = os.path.getsize(out_path)
    print(f"   Saved to: {out_path} ({size:,} bytes)")
    saved.append(out_path)

print("\n✅ Done. Files now in:")
for p in saved:
    print(" -", p)
#

W gpt Y_Y

analog obsidian
oak edge
hallow thistle
#

Colab Pro+ and "500 compute units", while both more expensive than Colab Pro, I think they typically not needed for most people unless someone needs to train something big all day and night or just wanna use A100 for something longer. cat_wtf

oak edge
#

@analog obsidian are there any other custom pretrains like the one you've mentioned?

#

holy hell nvm

oak edge
#

imma go to good old kaggle Y_Y

#

HOLY SHIT

#

mine never got deleted in kaggle

#

i just thought mine got deleted but it never was @viral mason u

#

VROOOO @analog obsidian i need help

oak edge
#

i don't see the spin v2

viral mason
#

I don't believe it's added yet

oak edge
oak edge
#

!!!

#

like spin v2 was there in colab applio

viral mason
#

Keep it to the original environment

oak edge
#

okie how do I go custom

#

how do I add custom embedder for spin v2 @viral mason

viral mason
#

@analog obsidian uhhh ^

#

Help

oak edge
#

cat_huh i mean there must be a way

analog obsidian
#

i did not made the notebook lol

#

most likely is downloading a stable version and not the main branch

viral mason
#

Who did make it?

analog obsidian
#

vidal

viral mason
#

Ah

oak edge
analog obsidian
#

i dont even like applio

oak edge
#

or should I click always use the latest environment?

oak edge
#

what if I press the use latest environment? bcz im damn sure the applio colab had spin v2

#

icant that's why i even asked in the first place

analog obsidian
#

just remove the line that downloads a specific branch

oak edge
analog obsidian
#

can you please at least send me the link of the thing youre using

analog obsidian
#

you're downloading an old version of applio

#

just git clone the main branch

oak edge
analog obsidian
#

but you have to encrypt everything because kaggle bans rvc usage

oak edge
oak edge
analog obsidian
#

show me a screenshot of your first cell

#

most likely says --branch 3.6.0 or whatever

oak edge
#

HOLY SHIT

#

IT'S 3.3.0

analog obsidian
#

change it to 3.5.0

oak edge
analog obsidian
#

yes

oak edge
#

wait what will happen to my training data

#

it didn't change anything ? @analog obsidian

analog obsidian
#

bro just reinstall everything im not the tech support of this server ask gpt idk 😭

oak edge
#

imma try somethin out

shell pewter
#

anyone got fixes for electrical background noises when speaking with a model? im using W-Okada

pure obsidian
#

i think thatll work