#✨│ai-help

1 messages · Page 284 of 1

fossil sage
#

yes it is

teal ferry
#

you see this?

fossil sage
#

yes

teal ferry
#

hold control and click the gradio dark link in the terminal

#

whatever it may say

#

or copy and paste it into your browser

#

if youre using brave browser tell me

fossil sage
#

im using firefox

fossil sage
teal ferry
#

Don't go to 7851

#

Screenshot your terminal

#

Show me

#

If you can hit the API at 7851 then you can go to the output at 7852

fossil sage
teal ferry
#

Ok you have a torch issue

#

We need to uninstall pytorch then reinstall the right version

viral mason
#

if u want to make covers sure, models? don't waste ur time or money on that garbage

teal ferry
#

E:\xtts\alltalk_tts\alltalk_environment\env\python.exe -m pip uninstall torch torch vision torchaudio

fossil sage
#

just finished uninstalling it

teal ferry
#

Ok then go

fossil sage
#

reinstall it got it

teal ferry
#

No wait

fossil sage
#

wait wtf it rquires python 3.9 or higher

teal ferry
#

Wait

fossil sage
#

i have cuda 12.1

teal ferry
#

Ok now it works

#

Don't worry about it

#

Just follow instruction

fossil sage
#

now i just gotta wait

teal ferry
#

The python version is specified by your environment

knotty moth
teal ferry
#

Cuda may be an issue but it may not 50-50 shot

#

Yeah 3.9 is really old. But it doesn't matter we're not using the system python

#

Once torch is done installing close the terminal and start all talk again

#

Should occupy port 7852 this time and therefore you'll see the link in std output

#

Also if you have to go to bed then tell me to shut up. It's fine

fossil sage
#

hahaha

#

not 3.9

#

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 0/4 [sympy] WARNING: The script isympy.exe is installed in 'E:\xtts\alltalk_tts\alltalk_environment\env\Scripts' which is not on PATH.
Consider adding this directory to PATH or, if you prefer to suppress this warning, use --no-warn-script-location.
━━━━━━━━━━╺━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1/4 [torch] WARNING: The scripts torchfrtrace.exe and torchrun.exe are installed in 'E:\xtts\alltalk_tts\alltalk_environment\env\Scripts' which is not on PATH.
Consider adding this directory to PATH or, if you prefer to suppress this warning, use --no-warn-script-location.
Successfully installed sympy-1.13.3 torch-2.8.0+cu128 torchaudio-2.8.0+cu128 torchvision-0.23.0+cu128

#

what wtf

teal ferry
#

tap windows key and type env hit enter

fossil sage
teal ferry
#

i dunno how else to say that

#

use brain

fossil sage
#

ohhhhhhhh

#

do i edit environment vriables

teal ferry
#

yeah then click path then edit

fossil sage
#

alr

teal ferry
#

system variables so the bottom box

#

click new

#

then put
E:\xtts\alltalk_tts\alltalk_environment\env\Scripts

fossil sage
#

and im browing an directory not a file

#

done

teal ferry
#

click ok and exit ALL of the windows we just opened.

#

ok three times

#

close all terminals

#

then start all talk again

fossil sage
#

👍

teal ferry
#

oh no

#

ok so this would require editing the source code of a couple of torch files. specifically the wirghts_only variable

#

there is no way around this because your GPU requires a certain version of torch

#

we could downgrade torch in theory but im guessing because the last error you got with compute score

#

it wont work

fossil sage
#

well looks like i should just stick to chatterbox

teal ferry
#

so you can open. E:\xtts\alltalk_tts\alltalk_environment\env\Lib\site-packages\TTS\utils\io.py and

fossil sage
#

and i've tried using chatpgt

teal ferry
#

i think thats it.

fossil sage
#

its an infinite loophole

teal ferry
#

this will work

knotty moth
# fossil sage wow

is yours RTX 50-series? if so it'd need cuda 12.8 and latest version of torch

otherwise GTX 10-series might need the legacy cuda

teal ferry
#

theres no such thing as an unsolvable problem

fossil sage
#

unfortunately it didn't

fossil sage
teal ferry
#

just open this file

#

E:\xtts\alltalk_tts\alltalk_environment\env\Lib\site-packages\TTS\utils\io.py

#

or send it to me

fossil sage
fossil sage
fossil sage
#

i am having the cuda 12.1 cuz i think chatterbox needs it ig

#

so far its the best out of index tts and vibevoice

knotty moth
fossil sage
#

chatgpt was messing everything up

teal ferry
#

i may have to add another argument

#

but i think you hit the else statement regardless

#

else within the if statement*

fossil sage
#

ok time to run it again

#

ALLL great

teal ferry
#

if you get the same error then use this one. i guess i couldve added the variable regardless to begin with

#

oh thats a new error

fossil sage
#

yes

viral mason
#

btw @tawny radish how bad are my settings in okada

tawny radish
#

absolutley

#

fucking horrible 😭

viral mason
#

gimme good ones

tawny radish
#

ur extra cant be higher then 2.7

fossil sage
tawny radish
#

put it at 2.7

#

ur chunks, idk about that just mess with it

viral mason
teal ferry
#

send me E:\xtts\alltalk_tts\alltalk_environment\env\Lib\site-packages\torch\distributed\elastic\agent\server\api.py

tawny radish
#

mainly i put my index at 0.5

#

or 0.7

teal ferry
#

actually if loop didnt exit you may be up and running

ashen patrol
#

If 2.7 it's good if 3.5 not great

teal ferry
#

no nvm it did

tawny radish
#

2.7 is the max you shoudl do

#

never go higher

#

or youll get problems

tawny radish
#

@fossil sage are u trying to make voicemodels

viral mason
tawny radish
#

idk

#

i dont use forked anymore

#

i do like 63 chunks but thats bc i have a good gpu

teal ferry
#

wrong one sorry i need this E:\xtts\alltalk_tts\alltalk_environment\env\Lib\site-packages\deepspeed\elasticity\elastic_agent.py

fossil sage
tawny radish
viral mason
tawny radish
#

so id say your goal would be so easy

tawny radish
#

i have a 4080 super

#

ur settings vary chunk wise on your GPU

viral mason
#

I meant 63 chunk

tawny radish
#

because ur gpu renders chunks

fossil sage
tawny radish
#

my first voicemodel was even good

#

the more you do it the more your datasets can improve - also meaning ur voicemodels will sound better

#

but you dont even need too much expirience to make GOOD voicemodels

#

id tell you now like male voicemodels are harder then female because female have a more high pitch voice so male ones can be difficult, but not impossible

teal ferry
teal ferry
#

just swap that out n run alltalk again

fossil sage
tawny radish
#

i havent made a voicemodel in a solid 2 months ngl

tawny radish
tawny radish
#

NAH LMAOO

teal ferry
#

no claiming its easy is though

fossil sage
#

i lost the model

tawny radish
#

😭

teal ferry
#

its not

#

most people make horrific models that vaguely sound like the target

#

if you call that easy then sure. but its bad

#

so i would call it a fail

tawny radish
#

😭

#

and i overtrained it

#

so

#

youll have mistakes but like if ur consistent in learning it youll make good ones in id say a month or two

teal ferry
#

so maybe its not so easy especially dealing with python environments

fossil sage
#

well this going to be a infinite loop hole

tawny radish
#

pretty sure it does

teal ferry
#

its gonna work now

tawny radish
#

its actually kinda crucial if u want ur voicemodel to sound more natural or realistic

fossil sage
#

well atleast it was able to detect deepseed

teal ferry
#

E:\xtts\alltalk_tts\alltalk_environment\env\python.exe -m pip uninstall put\the\path\to\deepspeed\whl\here

fossil sage
#

done

teal ferry
#

then either start alltalk or reinstall deepspeed. because we installed new torch deepspeed was invalidated. deepspeed installation uses the version of torch to create itself. so we end up with these two solutions. because you dont need deepspeed i would just start alltalk

fossil sage
# teal ferry then either start alltalk or reinstall deepspeed. because we installed new torch...

[AllTalk Startup]
E:\xtts\alltalk_tts\alltalk_environment\env\Lib\site-packages\torch\cuda_init_.py:63: FutureWarning: The pynvml package is deprecated. Please install nvidia-ml-py instead. If you did not install pynvml directly, please report this to the maintainers of the package that installed pynvml for you.
import pynvml # type: ignore[import]
[AllTalk Model] XTTSv2 Local Loading xttsv2_2.0.2 into cuda
ERROR: Traceback (most recent call last):
File "E:\xtts\alltalk_tts\alltalk_environment\env\Lib\site-packages\starlette\routing.py", line 734, in lifespan
async with self.lifespan_context(app) as maybe_state:
File "E:\xtts\alltalk_tts\alltalk_environment\env\Lib\contextlib.py", line 210, in aenter
return await anext(self.gen)
^^^^^^^^^^^^^^^^^^^^^
File "E:\xtts\alltalk_tts\tts_server.py", line 127, in startup_shutdown
await setup()
File "E:\xtts\alltalk_tts\tts_server.py", line 172, in setup
model = await xtts_manual_load_model()
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "E:\xtts\alltalk_tts\tts_server.py", line 244, in xtts_manual_load_model
model.load_checkpoint(
File "E:\xtts\alltalk_tts\alltalk_environment\env\Lib\site-packages\TTS\tts\models\xtts.py", line 783, in load_checkpoint
self.gpt.init_gpt_for_inference(kv_cache=self.args.kv_cache, use_deepspeed=use_deepspeed)
File "E:\xtts\alltalk_tts\alltalk_environment\env\Lib\site-packages\TTS\tts\layers\xtts\gpt.py", line 222, in init_gpt_for_inference
import deepspeed
ModuleNotFoundError: No module named 'deepspeed'

ERROR: Application startup failed. Exiting.
[AllTalk Startup] Warning TTS Subprocess has NOT

teal ferry
#

omg he did

#

ok install it again

fossil sage
teal ferry
#

id assume he had a good reason to exit the loop

#

but i dont know

fossil sage
#

E:\xtts\alltalk_tts\alltalk_environment\env\python.exe -m pip install C:\Users\yonshuk\Downloads/deepspeed-0.14.0+ce78a63-cp311-cp311-win_amd64.whl

teal ferry
#

yes

fossil sage
#

well imma run it again

#

and theres a 1 percent chance its gonna work

teal ferry
#

100%

fossil sage
teal ferry
fossil sage
#

yep and chatgpt is just telling me to go thorugh a infinite rabbit hole

teal ferry
#

ignore chaptgpt

#

humans are the superior intelligence

fossil sage
teal ferry
#

where did you get that version of deepspeed

fossil sage
#

from the github you sent me

teal ferry
#

uninstall it

#

its wrong for wrong version of cuda

fossil sage
#

i have cuda 12.1

#

and thats the one i installed

teal ferry
#

yeah but it has to work with torch to

#

o

fossil sage
#

and i already uninstalled it breh

teal ferry
#

E:\xtts\alltalk_tts\alltalk_environment\env\python.exe -m pip install deepspeed

fossil sage
#

alright now whre do i download it from

teal ferry
#

you dont

#

just input that

#

were using pypi

fossil sage
#

i just did

teal ferry
#

this may trickle down to a python version issue

#

ok when its done start alltalk

#

assuming you got no errors

fossil sage
#

errors usal

#

at this point im just do the steps back and foward back to

#

back

teal ferry
#

E:\xtts\alltalk_tts\alltalk_environment\env\python.exe -m pip uninstall deepspeed

then

E:\xtts\alltalk_tts\alltalk_environment\env\python.exe -m pip install deepspeed --no-cache-dir

teal ferry
#

yep

fossil sage
#

imma just install 12.8

teal ferry
#

i beleive its just these three. make sure they say 12.8 or above

fossil sage
#

basiclly the same thing i was gonna download expect its local

teal ferry
#

yeah either one

fossil sage
#

i should have created a system restore point

teal ferry
#

not necessary were not doing anything to core system files. but i mean its not going to hurt if you did

#

you should just reinstall alltalk after that. otherwise wed have to reinstall torch, deepspeed, and possible edit deepspeed source code again

#

well maybe not edit it because we install frpm latest

fossil sage
#

alright just installed it

teal ferry
#

i mean it should work like that but maybe deleting 12.1 for redundancy

fossil sage
#

[AllTalk Startup] AllTalk Settings & Documentation: http://127.0.0.1:7851
[AllTalk Startup]
E:\xtts\alltalk_tts\alltalk_environment\env\Lib\site-packages\torch\cuda_init_.py:63: FutureWarning: The pynvml package is deprecated. Please install nvidia-ml-py instead. If you did not install pynvml directly, please report this to the maintainers of the package that installed pynvml for you.
import pynvml # type: ignore[import]
[AllTalk Model] XTTSv2 Local Loading xttsv2_2.0.2 into cuda
ERROR: Traceback (most recent call last):
File "E:\xtts\alltalk_tts\alltalk_environment\env\Lib\site-packages\starlette\routing.py", line 734, in lifespan
async with self.lifespan_context(app) as maybe_state:
File "E:\xtts\alltalk_tts\alltalk_environment\env\Lib\contextlib.py", line 210, in aenter
return await anext(self.gen)
^^^^^^^^^^^^^^^^^^^^^
File "E:\xtts\alltalk_tts\tts_server.py", line 127, in startup_shutdown
await setup()
File "E:\xtts\alltalk_tts\tts_server.py", line 172, in setup
model = await xtts_manual_load_model()
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "E:\xtts\alltalk_tts\tts_server.py", line 244, in xtts_manual_load_model
model.load_checkpoint(
File "E:\xtts\alltalk_tts\alltalk_environment\env\Lib\site-packages\TTS\tts\models\xtts.py", line 783, in load_checkpoint
self.gpt.init_gpt_for_inference(kv_cache=self.args.kv_cache, use_deepspeed=use_deepspeed)
File "E:\xtts\alltalk_tts\alltalk_environment\env\Lib\site-packages\TTS\tts\layers\xtts\gpt.py", line 222, in init_gpt_for_inference
import deepspeed
ModuleNotFoundError: No module named 'deepspeed'

ERROR: Application startup failed. Exiting.

teal ferry
#

yeah install deepspeed again

#

not the whl

fossil sage
teal ferry
#

i know you have to do it again because all of it hinges on cuda

#

if you install them with the wrong version of cuda then it wont work

fossil sage
#

great now chatterbox doesn't work

#

anyways i gotta go to bed

teal ferry
#

youre close, alright

tawny radish
simple ore
heady plover
#

Trying to use the tg develop branch or any branch of w-okada's vc for that matter and all I get is this continuous tapping sound that sounds like the voice. On the Linux vers, does anyone know why this is happening

(3060 12gb)

edgy minnow
#

whats the replacement for okada w?

viral mason
#

could u share a screenshot of what the program looks like

#

btw @tawny radish is it cool if I add u if I haven't already

#

you seem chill

#

o

viral mason
#

did u get it from a yt link

knotty moth
#

!give-media-perms 30m @median quiver

viral mason
tawny radish
#

Put ur extra to 2.7

viral mason
#

first human being ever to get the voice changer from a credible source ^

tawny radish
#

Put ur input as ur mic and ur output as line 1 vb-audiocable

#

If u havent already download VAC

viral mason
#

these settings work pretty good for my gpu but I have nvidia

tawny radish
viral mason
#

the lowered delay is neat but still working on getting it lower without it doing anything funky

knotty moth
#

you can lower delay as long the perf stays green

#

but it may depend on if you're playing a kind of game/another application

viral mason
#

I'm trying to find the line between it still sounding fine with no cutting out or choppiness like that with as little delay as possible

analog obsidian
#

one important thing to keep in mind is that rvc is context based, lower chunks decrease delay yea but also decreases the context of the audio, and if its too low, the model is going to start to have very bad pronunciation

#

rvc cant predict what are you gonna say, so it actually needs that delay in order to properly say the words

#

lower extra decreases the delay and but also decreases context of the audio

onyx badge
#

anyone have this issue where after some time VAC lite just stops working

#

and I ned to reinstall it

stiff idol
#

Hi guys. I'd like to ask how do people fix weird glitch in a voice and what tool to use? I was listening to dataset I'm preparing, but the voice suddenly pitched into a robotic-like voice.

#

I was looking on some YouTube videos and tried some functions in audacity I know if, but wasn't successful.

hallow thistle
#

In REAPER, there's a built-in VST for tuning pitch of an audio track, named ReaTune. This one also has an option to do autotune-like effect, as in this screenshot. It's simple, but not quite the same level as other free and paid plug-ins.

hallow thistle
golden walrus
#

Hmmmm. Guys, can i ask how to improve my model volume? It sounds so small

spare tree
#

guys any tips to lower ai voice changer delay without lowering quality too much?

spare tree
#

guys I cant find my gpu in the gpu dropdown, I can only find "cpu"

simple ore
#

what's your gpu?

spare tree
#

as in voice model or app

spare tree
simple ore
#

what file did you download

#

iGPU is not a real GPU, so at best you can run the VC on CPU

short belfry
#

hey yall ive just got the w okada voice changer, and when im trying to use it its cutting off every 500ms or so then continuing with the sentence. id love a way to figure this out! im using an AMD GPU, using rmvpe_onnx

spare tree
#

are there any gpus i can download?

simple ore
short belfry
spare tree
#

is it a link to go to

simple ore
spare tree
#

uhuh

ruby idol
#

hey can someone help me set up MMVCServerSIO

stiff idol
strange grail
#

why was codename fork removed from docs

thorn narwhal
#

how to stop echo

ruby idol
#

how can i fix the delay?

fervent orbit
#

Is there any n8n automation experts that can help me? I need to do an n8n workflow for real estates

thin anchor
#

where to dowload voice changer

#

i need free one plz 🤑

viscid topaz
#

-realtime

patent trellisBOT
# viscid topaz -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE

• Wokada Tg-Develop Fork

A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE

• Wokada Deiteris Fork

Most suggested WebUI with the best general support for many platforms. GUIDE

⚔️ Wokada Deiteris Fork vs Vonovox

For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

winter seal
#

ts happened while tryting to use model in applio noui thing how do i fixit

fast yoke
#

can you help me coding?

edgy rapids
#

why is my voice changer is so delay?

tawny radish
bleak smelt
#

am i able to use any of these for rvc training? (on runpod) or does anyone know a workable one on there?

simple ore
viral mason
#

trying out codename's fork since applio is a shitshow, what is this

#

should I leave it

potent bone
viral mason
#

why did what get removed?

potent bone
#

codenane fork

viral mason
#

applio or codename fork

#

idk

potent bone
#

where can i download it from

#

you know where?

viral mason
#

I have the one that works for kaggle I guess

thick latch
#

Is this any useful to finetune the voice?

bleak smelt
#

does this not work anymore?

viral mason
bleak smelt
viral mason
thick latch
viral mason
# bleak smelt

I'm not sure if this is kept up anymore as in it being maintained or updated

#

I'd ask around maybe Nick or Lista may know

bleak smelt
viral mason
#

you could check these but other than that I have no real knowledge of the difference gcollabs out there for cover making

#

-collab

#

uh

#
  • collab
#

-colab

patent trellisBOT
# viral mason -colab
📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**
• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

viral mason
#

I spelled it wrong twice lol

rain nova
#

yo can anyone help with the voice changer download

winter seal
viral mason
#

Nvidia, AMD, or Intel

rain nova
#

2060rtx

viral mason
#

u can use any of these but I recommend Vonovox as it's the best

#

-rt

patent trellisBOT
# viral mason -rt
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE

• Wokada Tg-Develop Fork

A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE

• Wokada Deiteris Fork

Most suggested WebUI with the best general support for many platforms. GUIDE

⚔️ Wokada Deiteris Fork vs Vonovox

For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

viral mason
#

read the guide and download the two things, Vonovox and Vac lite

#

it's the first one

zinc thunder
#

is there a way to setting parameters on Applio when the original voice on a song have trembling (example: hinorobu kageyama) and the AI voice not? every time i try to make a cover, the result is not good, because the "new" voice came like he is trying to make funny noises with his voice

viral mason
zinc thunder
#

can i post a youtube link with the song? if not, the song is "CARELESS WHISPERS (Japanese Version 1984) - Hideki Saijo". and yes, of course i removed the melody, and only used "acapella" XD

royal kettle
viral mason
# viral mason

yo @simple ore sorry for the ping but was wondering if this also did the double batch thing if settings are left like this

viral mason
#

make sure u removed the reverb and echo as well

zinc thunder
#

uploading on mega, for example?

#

yep, i have a version without reberb (mostly of it) that i did through davinci

viral mason
#

what is davinci

zinc thunder
#

oh boy. sorry XD. heavy rain here. sadly im going to leave. but thanks for trying to help! oh, davinci resolve is a tool to make videos, similar to adobre premiere, kden live etc

viral mason
#

ahh ok

stiff idol
#

time for good old CHrome

#

Does Mel work in UVR app?

#

oh yeah, it's for UVR 🤦‍♂️ I only don't know where the models are stored

#

I also didn't see any of the preinstalled models that were supposed to be in UVR, I guess the guide doesn't have the newest info on UVR

stiff idol
#

fixed*

low shard
viral mason
#

as of now

low shard
viral mason
#

so I prefer kaggle

#

I got the tensorboard to open with gradio but applio won't open in lightning just a blank page

low shard
# viral mason that's so confusing and I had issues with it

It seems like you could say the same about codename's fork overall since you're asking Noobies about some experimental settings lol

Anyways, I was just warning you that if you mess up with it, especially with settings you don't know, it could just make things worse, I mean your choice tho

low shard
viral mason
#

just brought me to a blank page, it did some loading thing and then nothing

#

hold on the image has my name in it gotta edit it

low shard
viral mason
#

this is "applio" as it calls it

#

an empty page

#

pretty sure it was the cat guy

#

I forgot his name

carmine siren
#

--kaggle

#

-kaggle

patent trellisBOT
# carmine siren -kaggle
📘 Kaggle Notebooks

Kaggle is a Cloud (Remote Good PC) Service that offers 30 hours of GPU weekly, but needs a phone number verification

• **Applio Notebook**

by IAHispano
Kaggle

• **Hina Mod Original Wokada**

by Hina
Kaggle

• **Wokada Deiteris Fork**

by Hina & Deiteris
Kaggle

• **UVR5 UI**

by Eddy, ArisDev & Nick088
Kaggle

• **UVR5 NO UI**

by Eddy
Kaggle

• **RVC AI Cover Maker UI**

by Shirou & ArisDev
Kaggle

• **Music Source Separation**

by Shirou
Kaggle

thorny musk
#

what is the best training tool for SD images?

viral mason
low shard
analog obsidian
#

theres two gpu there so you know what do to lol

viral mason
viral mason
#

ok ty

solid pumice
#

what settings do i have to apply if my microphone aint it

viral mason
solid pumice
patent trellisBOT
# low shard !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a lega, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
solid pumice
low shard
# solid pumice thanks

It's a discord bot command to help users elaborate and understand how to ask for help (and what to not ask for help), please read it up so helpers know how to help you

viral mason
#

got stuff mixed up

#

but this is the one that gives me that issue after running it

#

the blank page

#

nvm it works fine now

#

😭

#

can't the same code for lightning ai be ported over to fit Kaggle

#

@low shard

stiff idol
#

so github has a shortlived links which don't alllow me to download it because it strangely fails

tawny radish
#

are u trying to get beta? or just regular UVR

#

i have it saved somewhere in my pc

stiff idol
#

I'm trying to get the beta with mel Roformer

low shard
viral mason
low shard
#

like I could use Gradio for Applio Kaggle, but it would work for only the Applio UI, not the filebrowser that is kind of needed (even tho not a very big necessity) for kaggle

viral mason
#

I mean all that's needed for applio is just applio and tensorboard

#

so if there's code that allows those two things to load with no issues

low shard
#

you can check that convo yourself if u wanna

viral mason
#

ngrok causes issues yea but just swap it for gradio or zgrok or anything but ngork

#

would that work orrrr

low shard
# viral mason would that work orrrr

yeah I tested it, Gradio does now (it has been recently fixed, as previously it didn't) work on Kaggle, I could use it with Applio UI but I don't think I can pull request a gradio tunnel option without including using a secondary tunnel for the filebrowser as @nocturne mural said

viral mason
low shard
# viral mason so in theory it could work but it'd be complicated?

If I would have to include filebrowser for the gradio tunnel: yes, because i'd have to use another tunnel for filebrowser since filetunnel is not made with gradio

If i would not use filebrowser (atleast not when using Gradio): technically it would be easier. EDIT: well not as easy, because vidal pointed out that the tensorboard would need a secondary tunnel too

#

tbh just learn applio lightning.ai, this issue and pull request seems like it will take some time, I'm also checking for other free tunnel alternatives too

#

cloud isn't as stable as local, it's normal to sometimes switch and modify things

nocturne mural
viral mason
#

well Kaggle is dead then

#

no use left for it

#

only due to Ngrok

#

screw them

#

a reasonable lowering from 4k would at least be like max 1k

low shard
viral mason
#

not whatever tiny number they changed it too

viral mason
#

I'm getting the true free user experience by never paying these services for their greed

nocturne mural
#

uh

viral mason
#

I dunno how to change code I'm stupid

low shard
low shard
#

I could commit that right quick

#

I was actually looking for other possible good free tunnels but that might take a while, doing as said above would fix the issue faster i guess

viral mason
low shard
brittle wing
#

hello

#

im using the uvr v5 hugging face ui

#

which model do i use to get the instrumental to not sound like shit

#

im seperating an instrumental from vocals

nocturne mural
nocturne mural
brittle wing
viral mason
#

use thishttps://colab.research.google.com/github/Eddycrack864/UVR5-NO-UI/blob/main/UVR5_NO_UI.ipynb?authuser=1#scrollTo=gmjUWmz8iecd

#

same thing but on google collab

dense cave
#

anyone familer with the lightweight text to video generaion model ?

viral mason
#

if I open tensorboard at all it fucks everything over as I have no way of returning to the lightning space without it stopping and ruining everything

stiff idol
#

so I downloaded a bad one? WHich is the newer one?

stiff idol
#

seems like the resources aren't up to date or I search wrong

stiff idol
#

Where do you get the info on these things from?

viral mason
stiff idol
#

I guess time to uninstall UVR and install it again

simple ore
ashen patrol
stiff idol
fleet marsh
#

hello, i was just gonna ask, does anyone know if this page is from here?https://www.vocalize.fm/voices

they are using a paid suscription to use ai models, some of those are created by me, i dont want them to use my models and getting paid for it, does anyone know how could i make them take down my models or smth?

Vocalize

AI Music Cover Generator

fleet marsh
viral mason
stiff idol
#

🤣

viral mason
#

Oh lol

#

Make a folder in your Google drive called Vocales, and make another one with whatever name you want, copy the path of the folder u put the audio you want to clean in (not Vocales)

stiff idol
viral mason
#

In your own Google drive

#

Not in the collab

stiff idol
#

ok 👍 thanks

fleet marsh
viral mason
#

Not sure tbh

fleet marsh
viral mason
#

Maybe could sue them?

#

But idk

fleet marsh
viral mason
#

Me either

#

Maybe look it up

viral mason
simple ore
# winter seal idk no maybe

you can only infer with small voice models, D/G are training weights, although G has a small model inside, but there's no config,so the application does not know how to set up the model

analog obsidian
#

imagine if applio had the extract small model from the g file feature, like mainline

fleet marsh
#

They probably do

#

They got my hollow knight models even when those models are not that famous

#

Even the one of the knight

analog obsidian
#

isn't weights also technically stealing our models and profit of them?

edit: grey area, read weights TOS

viral mason
#

I see a bunch of Ov2 models which are definitely shit

#

Dead pretrain

analog obsidian
#

calling them 'our' models is uhh not correct unless we voiced such characters and had the legal rights to create models based off them

viral mason
#

Which is the only function of the voice models on weights besides like tts

analog obsidian
#

what weights is doing is also illegal so

edit: grey area

viral mason
#

🤷‍♀️

analog obsidian
#

or i think it's gray area

viral mason
#

At least it tells you it uploads the models you make

#

When u post them here

fleet marsh
#

Yeah

viral mason
#

So it's worse

analog obsidian
#

both are the same thing tho

viral mason
#

One looks worse tho

#

And isn't popular or known

fleet marsh
#

And does without consent

analog obsidian
#

isnt the weights bot doing the same exact thing

viral mason
#

But it at least tells us about it

#

And we know it does that

fleet marsh
#

What was weights? I don't remember

analog obsidian
#

random corpo that bought this server

viral mason
#

Used to be a good website to do covers and stuff but now it's shit Bec greed

#

Their greed has literally destroyed them

#

It was better in 2023

analog obsidian
#

eh i also dislike weights but you gotta understand gpus cost money

viral mason
#

I get that but they did a lot of bad shit especially not telling people about updates randomly

#

Just running the users over with a sudden change like hitting a deer you couldn't see with a truck

fleet marsh
#

What dont people use applio?

analog obsidian
#

quite smart

fleet marsh
viral mason
analog obsidian
viral mason
#

That I know of

fleet marsh
#

People are dumb, they could do it for free and use Audacity to put it togheter

viral mason
#

Wdym audacity

#

Ohh

analog obsidian
viral mason
#

Nvm

#

That's interesting

analog obsidian
#

if not, its not yours

viral mason
#

So if weights gets in trouble we're to blame misc_trolley

analog obsidian
#

weights is grey area so yea they cant get in trouble

#

unlike the other site which is actually selling the models

viral mason
#

What about the users

analog obsidian
viral mason
#

So like every single model in existence

stiff idol
#

Does anyone have a direct link for the repository with this model? mel_band_roformer_denoise_debleed_gabox.ckpt I can't find it on huggingface. UVR downloader acts like a github but worse, it doesn't download the whole file (using curl).

fossil sage
#

@teal ferry im back

analog obsidian
fleet marsh
analog obsidian
#

they do have copyright

analog obsidian
#

but since there's no dmca claim, they're still there

#

unless scott just fills a dmca

fleet marsh
viral mason
#

Like Miku or Teto?

analog obsidian
#

or having the rights of that thing

#

you don't have the rights? then its not yours

#

even if its just a piano model lets say

fleet marsh
#

From fnf mods

viral mason
fleet marsh
#

Oh i ser

analog obsidian
#

weights was smart

#

but the other site is clearly dumb because they're uploading the models themselves

analog obsidian
#

it wont last long before they get sued lol

analog obsidian
analog obsidian
stiff idol
#

I'm not sure what they do.

viral mason
#

I have stopped De-noising my models unless they have a really bad mic

stiff idol
#

I've just come around a document that explains a bit.

analog obsidian
#

well, it removes noise

stiff idol
#

I mean debleed

analog obsidian
stiff idol
#

there's lots of new terms for me

#

thanks

fleet marsh
analog obsidian
stiff idol
#

it never downloads the whole model, the model has 913MB< what do you guys usually use?

analog obsidian
# fleet marsh Who does?

for example, the hatsune miku model, the legal owner of that is crypton, they have the legal rights to sue that site

#

not the person who made the model

#

actually the person who made the miku model is also breaking crypton TOS

stiff idol
#

I'll use local for now.

analog obsidian
#

1 sec lemme find the links

fleet marsh
analog obsidian
#

you own your voice (legally speaking)

#

xD

stiff idol
#

that's an old one btw

analog obsidian
stiff idol
#

1.8.4 is the newest

analog obsidian
#

no?

stiff idol
#

wait

analog obsidian
#

thats the official dl link

#

if u have something else then is custom

#

i do not trust forks

stiff idol
#

this is so confusing, I got the version 5.6, then removed it, then got the 5.6.1 (supports former models) btu then removed it for the web one

analog obsidian
#

i'd recommend trying what i sent (install them in order)

#

it's from the person who made uvr gui

#

this is gabox's official download link of his mel rofo karaoke

fleet marsh
analog obsidian
#

you just made it for fun

fleet marsh
#

Oh ok

stiff idol
#

I'll take a look tomorrow at it.

#

I wish there was actually a repository with the links.

analog obsidian
analog obsidian
#

its veery hidden in a serveryt_nails

stiff idol
#

Is there a way to know what is official and what not? actually just checking the author... welp

analog obsidian
#

trvlvr is the author of uvr gui

#

hes also known as anjok

analog obsidian
#

ai hub moment

#

xD

#

thats from illaria and eddy i think

stiff idol
#

I'm downloading the gaboke's model from there

#

Gabox's*

analog obsidian
#

yea so i always recommend using the official stuff first

#

forks later

#

the colab i sent is great tho, gets updated quite often

analog obsidian
stiff idol
#

I haven't checked what is the difference between .onnx and .ckpt models

#

so they are like safetensors... btu this many formats for the same thing

#

I heard Okada software converts ckpt to safetensor

analog obsidian
#

.onnx is an alternative for .pth files, in amd gpu systems is faster

#

but in nvidia gpu, onnx actually uses more cpu and potentially slower performance/higher delay

stiff idol
#

actually I'm missing something comprehensible, it's either too shallow or an academic paper on a whole algorithm

analog obsidian
#

.safetensors is an alternative for .pth/ckpt to make them safer and prevent them from executing arbitrary code

#

since you can actually inject code into .pth/ckpt, making them virus

stiff idol
#

can safetensors be trained too? actually never looked into that too much

analog obsidian
#

no idea.. i know in the sd community everyone uses .safetensors because is safer

#

ive heard is also faster than pth/ckpt

#

in rvc no one uses it, maybe because applio and w-okada already have a protection against arbitrary code

stiff idol
#

I read that ckpt can be trained with the config and have an executable code that can infect you, same with .pth, safetensors are safe because they omit the code and have only weights

analog obsidian
#

yeah exactly

#

in applio there's a line named "weights only"... pretty self explanatory what this does

#

deiteris fork also has it, no idea about regular w-okada

stiff idol
#

I can never look at those error messages in cmd, I see "separation failed" but it's still ongoing. My notebook is very sensitive to any process. I know when it is running and when it is not by sound.

#

if you stumble upon any repository that has the official links to the models or their creators I'd be glad if you shared them

analog obsidian
stiff idol
#

yeah

analog obsidian
#

google "audio separation discord"

#

join the server

stiff idol
#

or dereverb / denoise

analog obsidian
#

everything it's there

#

the authors, colab links, etc etc

stiff idol
#

gotta go, good night, see u tomorrow

analog obsidian
fossil sage
#

whats better Local Eddy's UVR5 UI or google collab uvr

viral mason
#

I figured it out nvm

low shard
#

also you can click Open to open it in a new tab

viral mason
#

nah I mean it loads fine but switching back from that particular page screws it up if u wanna go back to see training progress, figured it out tho just clicking the open button, then copy the link to a new tab on my browser

low shard
viral mason
#

I've basically already figured out how to do what I need

low shard
#

let's see to which cloud platform you will switch to when lightning.ai won't be enough for ya in 6 months misc_trolley

viral mason
#

6 months is good enough

low shard
#

are you even gonna use rvc in like 2 years?

#

i mean who knows what will happen in 2 years lol

viral mason
low shard
#

it will always be usable on cloud

#

there are a lot of renting gpu services

viral mason
#

my tensorboard did a loopdy loop

#

actually just noticed at the start too it had a stroke

#

never seen this before

low shard
viral mason
viral mason
#

not just this one all of them look like that lmao

low shard
#

maybe something got fucked up while you had that random crash, might be better to restart yt_nails

viral mason
#

nah it's fine

#

probably all that starting and stopping nonsense from me not understanding how to use it properly

#

btw can lightning run out of space in the notebook like kaggle?

fossil sage
karmic copper
#

is someone else having problems training a model??
I haven't trained a model for months (means I didn't reach the weekly limit) but It still says I need to upgrade my acc(purchase required) to train my model??

simple ore
heady plover
#

it worked fine before nvidia drivers updated and whatnot, and im not sure what else to do beyond using the install script/changing the env version to install newer packages

#

I'll forward this to forum too but I'm stumped myself.

viral mason
#

What's the best Nvidia GPU I could upgrade to, I want to replace my 1660

simple ore
#

well, 5060ti is also an option, comparing to 1660 it is a beast

fossil sage
#

guys im kinda of confused on what he applio built in normalization does

jaunty minnow
#

is there a way to run deiteris okada's as an application rather it opening a google chrome page?

viral mason
#

idk what it really does

fossil sage
#

So far I have a clean 31 minute Audio that I'm ready to train

analog obsidian
# viral mason idk what it really does

as you know rvc doesn't train the whole dataset .wav file at once, instead, it slices the dataset into 3s segments with overlap
post normalization applies a normalization to those generated segments

#

helps the model building frequencies

#

it's a setting that is enabled by default in mainline, but somehow it's off by default in applio

fossil sage
#

I'm confused

analog obsidian
#

it loads small segments of 3s

fossil sage
analog obsidian
#

yes, if you select post norm they get normalized

fossil sage
analog obsidian
#

once the model loads a slice, instead of "reading" it in a single pass, it will start by reading 0.35s segments until it has gone through the entire file

#

thats how the model learns the timbre and characteristics of the dataset voice

fossil sage
#

i used Eddycrack864 uvr locally

analog obsidian
fossil sage
#

for this server

analog obsidian
#

do u have spek?

fossil sage
analog obsidian
#

can u load the uncompressed audio so i can see it

#

just take a screenshot

fossil sage
#

i don't really know whats going on

#

i read the docs aihub on vocal seperation

analog obsidian
#

oh yea that still looks like 24k to me

#

everything above 12k is just noise

#

voice is around 22050

#

22k/24k

fossil sage
#

idk what that is but ok

analog obsidian
#

idk how rvc will react to a 22k dataset tho

#

minimum it needs a 32k one

fossil sage
#

💀

#

the audio is extracted from a show

analog obsidian
#

it should still work, the model in theory will learn the frequency cutoff

fossil sage
#

i exported it 16 bit btw not 32 bit wav

analog obsidian
#

just try

#

i dont think the model cares much about the missing high frequencies

fossil sage
analog obsidian
#

rvc never got an official doc lols... o wait it has one, but in chinese

fossil sage
analog obsidian
#

oh ok

fossil sage
spare tree
#

my rvc is bugging

#

it wont repeat everything I say and it has a 10 second delay

knotty moth
knotty moth
viral mason
#

voice changer? training?

stiff idol
# analog obsidian

Yesterday I tried using RVC's basic MDX models, tey were quite good, they just didn't catch a Chinese instrument. I have a problem with removing it. Noise reduction kinda doesn't work on that. misc_dead

spare tree
viral mason
#

can u send a ss

spare tree
#

@viral mason

#

or do u want me to send the application

viral mason
#

Is it deiteris, tg fork?

#

Did u get it from a yt link?

viral mason
#

Definitely outdated

#

What gpu do ya have

#

Nvidia, AMD, intel?

spare tree
#

AMD Radeon TDM Graphics i think

#

AMD

viral mason
#

AMD, cool

#

U can use either wokada deiteris which is what I use or wokada tg fork which is the same thing but slightly different but I never used it so I recommend deiteris

#

-rt

patent trellisBOT
# viral mason -rt
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE

• Wokada Tg-Develop Fork

A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE

• Wokada Deiteris Fork

Most suggested WebUI with the best general support for many platforms. GUIDE

⚔️ Wokada Deiteris Fork vs Vonovox

For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

viral mason
#

Third guide is deiteris fork

spare tree
#

are those voice changers

#

where can i get deiteris

knotty moth
# spare tree

many ppl following such youtube tuts are falling to this trap, and the worse thing you havent checked the system requirements that igpus are not capable at all

viral mason
spare tree
#

alr do i uninstall the old one

knotty moth
viral mason
knotty moth
viral mason
#

Ah

#

Yea he might be kinda screwed then

spare tree
#

oh

#

so i get a new gpu

fringe snow
#

Does anyone know why Kaggle's Applio URL is just taking to the files instead of applio?

#

nvm now it wants to work as soon as i hit send lol

viral mason
#

AMD isn't the best for ai realtime

#

Intel basically doesn't work at all

carmine siren
#

appoilio not working , I am encountering this issue when trying to execute third cell of the notebook in kaggle

knotty moth
low shard
low shard
humble cobalt
#

is there any good alternative ai voice changer thats like w-okada?

meager lichen
#

is rx 6600 enough for ai voice changer

plush reef
#

If you only want to download ready made videos that have been generated by AI, that can be different . most tools are now for creating the videos for you, rather than providing a video library where you can download for free. Some sites, like Pexels, Pixabay, have free stock style videos that look AI-generated, but they are not technically AI-generated on demand. if you want a realistic high quality videos as per you script including enhanced features like AI Natural voices and customs AI avatars , you need to go for the paid ones. There are some tools offers budget friendly plans.

meager lichen
carmine siren
#

-kagle

#

--kaggle

#

--kagle

hallow thistle
patent trellisBOT
# hallow thistle -kaggle
📘 Kaggle Notebooks

Kaggle is a Cloud (Remote Good PC) Service that offers 30 hours of GPU weekly, but needs a phone number verification

• **Applio Notebook**

by IAHispano
Kaggle

• **Hina Mod Original Wokada**

by Hina
Kaggle

• **Wokada Deiteris Fork**

by Hina & Deiteris
Kaggle

• **UVR5 UI**

by Eddy, ArisDev & Nick088
Kaggle

• **UVR5 NO UI**

by Eddy
Kaggle

• **RVC AI Cover Maker UI**

by Shirou & ArisDev
Kaggle

• **Music Source Separation**

by Shirou
Kaggle

hallow thistle
#

You mean this command? shino

meager lichen
#

is 8 vram enough

#

guys pls

hallow thistle
patent trellisBOT
# hallow thistle !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a lega, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
carmine siren
#

i am getting ngrok issues with the applio like its is asking verify with credit card credentials to use it further

meager lichen
#

@hallow thistle uh so same question, rx 6600 8000mb vram is ok right?

#

im ok to download the voice changer

deft condor
#

guys i downloaded okada and opened the file up, downloaded it. what should i open up afterward?

hallow thistle
patent trellisBOT
# hallow thistle !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a lega, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
meager lichen
#

ty @hallow thistle

#

so last question, the deitaris version is better than the original w-okada version right

deft condor
#

im using rtx2050, win11, i downloaded okada and opened MMVCServerSIO, downloaded it. what should i open up afterward? nothing pop up for me. im not following any tutorial

hallow thistle
hallow thistle
hallow thistle
deft condor
cyan pelican
#

guys can i use a v2 rvc model on the normal rvc app or will it sound bad

hallow thistle
hallow thistle
# deft condor deiteris

Is the MMVCServerSIO still in your folder? If the program gone after you run it, an antivirus might have interfered it.

deft condor
#

i dont have my firewall nor my antivirus on. if it helps i did download an wokada voice changer before too

knotty moth
nocturne mural
gleaming rapids
#

hey sorry to disturb, what epoch meaning ? and where can i set per exemple 300 epochs

low shard
# gleaming rapids hey sorry to disturb, what epoch meaning ? and where can i set per exemple 300 ...

epochs are a unit of measuring the training cycles of the AI model

basically the amount of times the model went over its dataset and learned from it

they don't mean how good is the model, it's just an info provided on how they trained the model by the model maker
More ≠ better
Less ≠ better

There's no way to determinate how good the RVC model is until you try it out or listen to the audio samples if there are

night dome
#

Heyy I have a problem with the AI Voice Changer/or the Virtual Cable, whenever I try to use or test it on discord Its not working. When mic testing on discord it says that "Discord is not detecting any input from your mic" or something like that.

patent trellisBOT
# hallow thistle !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a lega, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
hallow thistle
#

Make sure to read help guidelines before start asking. anime_pray

night dome
viral mason
thin anchor
glass meteor
#

When I say something it lagged in the middle of the sentences, can someone help me? i have RX6800

#

in the app its laggy but idk if in game it is

hallow thistle
glass meteor
#

i follow this

#

_v.1.5.3.18

hallow thistle
#

No, no, better not follow any tutorial video about W-Okada from YouTube for this time.

viral mason
#

is rx amd or nvidia

fossil sage
#

hows te graph

#

im training 10,000 epoch

#

when do i stop

viral mason
fossil sage
#

it says to stop when there is no longer any improvement

#

and when it stays flat and makes no progress stop

analog obsidian
#

is that from the ai hub docs?

fossil sage
#

yes

analog obsidian
#

i have told nick to rework the training section because it's so long and wrong

#

lol

#

tensorboard doesn't show you when the model is overtrained

fossil sage
#

WHAT

analog obsidian
#

600e is also crazy

#

i'd recommend maximum 100
but start hearing the epochs from 40e

fossil sage
#

what about the overtraing threshold

#

i have it enabled

analog obsidian
#

so train 100e, hear the model at 40e

#

if its sounds good then its done

#

if not hear 50e, etc etc

fossil sage
#

elborate

analog obsidian
#

rvc finetuning overtrains fast

#

thats just how the arch works

fossil sage
analog obsidian
#

try listening to 45e, and 60e

fossil sage
analog obsidian
#

anything past 105 is most likely overtrained

fossil sage
#

so i should stop the training correct ?

analog obsidian
#

yeah

fossil sage
#

ok

analog obsidian
# fossil sage ok

and in future trainings just ignore the tensorboard lmao, just train 100e max
99% of the time the 'good' epoch will be 40e, but sometimes can be 50e, 60e, etc

#

the tensorboard becomes useful when you train without a pretrain, while using a dataset of 55 hours

fossil sage
#

when the guide going to be updated

analog obsidian
# fossil sage oh ok

i forgot to say also, don't use a batch size below 8, anything below that makes the model learn painfully slow

analog obsidian
#

increase chunk size and leave extra at 2.7

#

decrease game graphics, limit fps to 60

#

1080p

analog obsidian
#

stop the conversion

analog obsidian
#

oh, is your index value at 0?

#

it's not 100% possible to erase a model's native accent tho, they will always have some leftover from the dataset

#

especially if the model is overtrained

#

sorry english is not my first language, but you want the model to use it's native accent or yours?

#

increasing the index blends more of the model's accent into the result

viral mason
#

drink water

analog obsidian
#

index 0 the model is going to use your accent instead

#

but it's not 100% perfect

#

yup! actually thats very good for the model

#

it'll have an easier job while doing the conversion

fossil sage
#

the probelm is that im not able to mimic his accent

analog obsidian
stiff idol
#

Guys, does anyone have a normal guide on removing echo and re-verb? And I can tell you I tried the denoise, de-echo and other such functions form various plugins, but nothing as I thought it'd. So I'm probablyj ust bad at it and want to get better. I extracted vocals through a model. (mel and MDX ...VF actually gets me the same result). So the only thing I want is to remove an instrument the model doesn't know at all (some kind of Chinese instrument) + echo and reverb that I hear with in the voice.

#

and tutorials are more about "how to lower the audio" instead of "removing the echo" (I also tried the Audacity tools denoise, noise gate and the other typical ones)

fossil sage
fossil sage
#

why does it have some artifacts doh

analog obsidian
stiff idol
fossil sage
analog obsidian
analog obsidian
stiff idol
#

sorry

#

read wrong xD

analog obsidian
stiff idol
#

there's lots of models in VR architecture

stiff idol
analog obsidian
#

screenshot

stiff idol
analog obsidian
#

try de echo aggressive

stiff idol
#

ok 👍

fossil sage
analog obsidian
#

in the same advanced settings also enable split audio

fossil sage
analog obsidian
#

after it converts the chunks, it will merge them into one audio file

fossil sage
#

now i need to read the appilo documentation

analog obsidian
#

and i personally noticed my results sound better with split audio on tbh

analog obsidian
#

nop

fossil sage
teal ferry
#

what model is it

fossil sage
analog obsidian
fossil sage
#

which is why you need something like xtts

fossil sage
analog obsidian
#

yeah rvc doesnt learn emotions or accent since its learning spectograms/mel

teal ferry
#

rvc?

stiff idol
teal ferry
#

if its just copying what the reference audio sounds like then yeah it will sound fine. theres no real use case for that though beyond real time live inference. or if youre going to want it to convert your own speech. or using it for like a secondary filter

#

for what you want to do you want to use it with tts

analog obsidian
#

but if your vocals have harmonies you have to use becruily's mel karaoke

#

for me they both work fine

stiff idol