#✨│ai-help

1 messages · Page 225 of 1

low shard
#

Oh nvm

urban fractal
#

lol

low shard
#

Might be a model issue

acoustic lion
#

I am curious, will AI RVC ever improve to the point it can mimic coughing and laughing (odd sounds that can be made from the human mouth in general) Cause if someone screams with a high pitch the model wouldn't comprehend what the user is trying to accomplish

urban fractal
#

Yea they all sound like robots, or russians. I try to set the voice but it doesnt exist or its the preset built in ones

#

for ARIANAGRANDE_ES_BY_SZAJEAN.pth
would I set the voice to something? or leave it empty? If i dont set it then shes a russian man

low shard
#

Ms edge tts is multilingual and nice quality but not emotional at all

urban fractal
#

Ohhhh

#

I getcha. lol, makes a lot of sense now. I'm making a bot to have a convo with and ms sam is trash so was looking to better. The built in voices don't seem to sound like that though if I change to a different en_* voice

knotty moth
#

Apolio is a good alternative to that kind of application

urban fractal
#

Thats a python lib?

low shard
#

It uses Microsoft edge tts API requests

urban fractal
#

Awww

low shard
acoustic lion
low shard
#

It does RVC have improvements, but you can't expect it to sound emotional in applio either

low shard
#

Unfortunately the original devs of RVC kinda left it to work with gpt so vits, which is a TTS AI

urban fractal
#

Are you sure I don't have to set the voice param to match the voice name in the model file? I just don't know what that name would be

low shard
#

Hopefully applio and our engineers will make improvements but who knows

acoustic lion
#

Ah, cause I do intend to do roleplay with AI Voice but not wanting to sound awkward in certain situations

low shard
urban fractal
#

ok

#

but she sure dont sing like in the samples I heard

low shard
#

Microsoft edge tts was mostly meant for just reading text on the browser, Soo yeah

knotty moth
low shard
#

That's very experimental though

#

Iirc hifigan was disabled in applio main branch for now not being stable enough

urban fractal
#

Ohh, yeah huge improvement by setting the voice to en-US-AvaNeural

acoustic lion
knotty moth
low shard
low shard
analog obsidian
knotty moth
analog obsidian
#

seoul finetuned contentvec and was able to make models scream

low shard
#

@urban fractal are you having any other issues ?

low shard
urban fractal
#

Nope

acoustic lion
trim sparrow
#

nick super quick question how should i know how much epoch to use ?

urban fractal
#

I'll share a video and code once I get it together

low shard
#

Use the tensorboard

analog obsidian
#

but rvc does not stand for realtime voice changer

acoustic lion
#

Oh, thought it does 🤔

trim sparrow
#

do you have a tutorial video or doc perchance for that?

analog obsidian
knotty moth
acoustic lion
#

Ah, what term do we use for Real time then?

#

RTVC?

analog obsidian
#

wokada

acoustic lion
#

Ah

acoustic lion
#

Then in terms of Wokada, has there been any noticable improvements lately?

analog obsidian
#

w-okada is merely a gui for realtime inference

trim sparrow
#

ah its just a file for applio

analog obsidian
#

we need to improve actual rvc

low shard
# acoustic lion Ah

Wokada is a program that basically focuses on the realtime rvc inference

There are 2 versions
Original made by Wok
And deiteris fork made by deiteris

The deiteris fork initially brought many advancements, which now are also in the latest original wokada, but deiteris fork is suggested still for having more options and less bugs, along with the removal of Beatrice models which were low quality compared to RVC and were experimental

analog obsidian
#

every change the gui may receive will be a optimization or qol change

low shard
#

There was also a go-realtime.bat rvc GUI for realtime inside of rvc mainline (og rvc), but that's really outdated compared to the ones I mentioned

analog obsidian
#

tier list for perfomance atm:

  1. deiteris
  2. mainline realtime
  3. original w-okada
trim sparrow
low shard
hallow thistle
orchid flint
#

I got a random question as well. Besides the new vocoders is there any team that is actively working on, for example a RVCv3?

analog obsidian
#

don't expect something from him

low shard
#

Rvc boss Is focusing on gpt so vits

orchid flint
#

Ok

analog obsidian
#

they tried rvc v3 but didn't liked the results

orchid flint
#

I see so new vocoders are the only way forward for now?

acoustic lion
sly furnace
#

i installed everything sir , can you tell me the settings

analog obsidian
sly furnace
low shard
# trim sparrow

Weird, could you try to open CMD and do env/python.exe -m pip install --upgrade gradio

lofty lichen
crude flame
orchid flint
#

What would a new embedded or look like? I’m not familiar with those

analog obsidian
hallow thistle
analog obsidian
#

he's trying to make models laugh in realtime

sly furnace
hallow thistle
crude flame
low shard
lofty lichen
#

is KLM 5 RefineGan best for talking female with medium high pitch voice?

sly furnace
#

i am completely new to this

#

no idea

analog obsidian
crude flame
sly furnace
#

any recommeneded settings would help

low shard
sly furnace
#

😭

low shard
#

It seems that Gradio needs to be updated

lofty lichen
low shard
acoustic lion
analog obsidian
#

our current pretrains are already pretty good

low shard
acoustic lion
#

And is there any sorts of logs regarding new updates, when it was ,what it introduces

analog obsidian
orchid flint
knotty moth
trim sparrow
#

if i got 17.1 minutes of datasets would a epoch of around 450 be okay?

hallow thistle
analog obsidian
#

4.9 is a hifigan pretrain

acoustic lion
hallow thistle
acoustic lion
analog obsidian
low shard
patent trellisBOT
# low shard -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

low shard
#

1st link

#

Wokada deiteris fork

analog obsidian
#

w-okada can't get any better by itself

#

it's just a gui for inference

#

the only thing that can be improved there is perfomance

low shard
orchid flint
#

Lmao

low shard
#

Wokada working on a NES

#

Let's make Wokada hackrom

acoustic lion
low shard
knotty moth
acoustic lion
#

true true

hallow thistle
#

W-Okada simulator. Damn

low shard
#

Let's make Wokada run on hopes and dreams

acoustic lion
#

NPU?

orchid flint
#

Yes

hallow thistle
versed edge
#

It's not a headset with a mic attached, the mic and headset are separate

lofty lichen
knotty moth
acoustic lion
#

Doesn't sound like a problem to me then, Towers all the way

orchid flint
orchid flint
#

Dam

#

Ok

low shard
low shard
#

GPU + CPU for ai laptops

#

Which is barely usable since no programs support it

orchid flint
#

They can I’ll just grab the link to the file

low shard
#

I mean there are, but pretty few if you compare it to Nvidia GPUs

acoustic lion
acoustic lion
#

But AI laptops sound pretty cool

low shard
#

You can also do it on AMD and Intel but it's a much more pain

acoustic lion
#

Though looking at the site I was provided, last update was December 2024, so I do have the latest build for it

low shard
#

AMD in most cases needs zluda, a cuda emulator

orchid flint
#

Eh, more like it was just set to false in the train.py file

pastel oak
crystal dove
#

Can someone help me transfer music to the lyrics of a singer? My PC is a mess now and it's urgent

acoustic lion
#

Hmm, but does the RVC inside of Wokada come updated everytime you launch the program?

crystal dove
#

pls if anyone can help me dm me

analog obsidian
#

if rvc gets a new important change (like a new embedder) the w-okada dev has to update the gui to use the new embedder

low shard
acoustic lion
#

Ah

orchid flint
acoustic lion
#

Well since it says december 2024, I guess we haven't reached that goal

analog obsidian
orchid flint
#

Oh, ok

gray cobalt
#

i deleted the model files but they're still visible and usable for some reason

#

and trying to delete causes a problem

analog obsidian
low shard
crystal dove
gray cobalt
trim sparrow
#

@low shard when clicking the generate index it does this

gray cobalt
#

hmm

simple ore
analog obsidian
#

even higher

acoustic lion
#

Oof

trim sparrow
obtuse onyx
#

sry i want to ask, do i need to pay to use voice.ai app?

analog obsidian
#

i think xeus is able to run locally but it ask for too much

gray cobalt
low shard
gray cobalt
#

im shit at using pcs bruh

acoustic lion
#

Only way I can think of making it work is by hooking 2 4090s in one PC and leaving one to power the demanding embedder

simple ore
crystal dove
#

can anyone help or...?

orchid flint
acoustic lion
#

80?

#

DAMN

low shard
acoustic lion
#

You'd be needing an entire NVIDIA A100 to run that

knotty moth
acoustic lion
#

Ah no I know that

#

my 4070 ti is doing fine work

#

But we're mainly talking about the demanding embedders

orchid flint
#

Yeah

#

AI go crazy

simple ore
acoustic lion
#

But what about those AI TOPs in the 50 series? Could they be made of use to RVC?

trim sparrow
orchid flint
low shard
acoustic lion
#

Oh god

low shard
crystal dove
#

you know how to do it?

acoustic lion
#

Actually are they called AI TOPS

#

or just TOPS

#

Honestly not sure what they are or what they do

low shard
low shard
#

it has 4gb vram

acoustic lion
#

Yeah doubt 4gb of vram can do anything unfortunately

trim sparrow
#

damn

acoustic lion
#

You can probably get a 1080 ti for cheap

#

It has plenty of vram

#

But well, not the fastest

low shard
#

you can train but at a low batch size which is usually more unstable

#

like 2 or 4

trim sparrow
#

i had it on 4

#

ima try again in a sec at 2

knotty moth
acoustic lion
#

That's true yeah

analog obsidian
#

main branch applio has checkpointing, the speed lost isnt that much

acoustic lion
#

These tensor cores making me tense up...

low shard
analog obsidian
#

im training my models in mainline now

low shard
analog obsidian
knotty moth
low shard
#

welp

#

he's cooked

trim sparrow
#

is there any other steps for tenorboard with applio besides opening the .bat?

analog obsidian
low shard
#

@trim sparrow if u want to do training, you could also try cloud, meaning you will use a remote good pc and won't run on your pc

#

but, it has limited time in free tier

trim sparrow
#

do you know how much time limited?

low shard
#

also cloud notebooks sometimes break easily since the cloud provider update packages and python version for example

trim sparrow
low shard
low shard
#

and then refresh it

trim sparrow
#

ah okay you think kaggle would be a good option?

low shard
#

yeah

#

it's owned by google btw

trim sparrow
#

do you know the website?

knotty moth
low shard
#

cloud is way suggested for phones

sly furnace
trim sparrow
#

with

simple ore
low shard
#

oh right I didn't send you lmao sorry

trim sparrow
low shard
#

Train (make) RVC Models on cloud:

  1. Prepare the Dataset
  2. Setup RVC:
    Choose a cloud way to use RVC,
  • Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
  • Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
  • Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
  1. Be sure to know about the tensorboard

Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time
If you are looking for the easiest way and for free, is using https://weights.com/ which ofc uses RVC

RVC Inference (use models) on pre-recorded audio on Cloud

You can use either:

trim sparrow
#

it tells me to put ngrok token but i dont see the place where i input it in the code as shown in the doc but it says this

simple ore
trim sparrow
#

im on kaggle

simple ore
#

replace "token" with your actual token

trim sparrow
#

im not seeing that at all is crolled through the cells and tried the find thing to just search up for !pip install pyngrok and it didnt show anything nor did !ngrok config

simple ore
versed edge
trim sparrow
simple ore
#

post a screenshot

trim sparrow
#

is this the right one to copy?

simple ore
#

that's an entirely different colab

trim sparrow
simple ore
#

but you can do it here

trim sparrow
#

ah okay thank you

loud thicket
#

I don't think this fits as an AI related question but I'm wondering what role do I need to be able to send a message in a model thread to ask a question there.

ancient karma
#

Does anyone know how to train voices remotely? My GPU isn't good enough to train a model.

versed edge
pastel oak
simple ore
lofty lichen
orchid flint
#

I have not, I likely won’t until or if it gets supported on wokoda

lofty lichen
orchid flint
#

Im almost certain that’s because you need to use the pretrains in the discord

#

Apollo doesn’t have any official pretrains

#

For refinegan anyways

lofty lichen
#

when i choose

orchid flint
#

I think so

lofty lichen
#

(The parameters of the pretrain model such as the sample rate or architecture do not match the selected model)

simple ore
trim sparrow
#

ERROR: Exception in ASGI application
raceback (most recent call last):
File "/kaggle/tmp/.venv/lib/python3.10/site-packages/uvicorn/protocols/http/h11_impl.py", line 403, in run_asgi
result = await app( # type: ignore[func-returns-value]
File "/kaggle/tmp/.venv/lib/python3.10/site-packages/uvicorn/middleware/proxy_headers.py", line 60, in call
return await self.app(scope, receive, send)
ERROR: Exception in ASGI application
File "/kaggle/tmp/.venv/lib/python3.10/site-packages/gradio_client/utils.py", line 973, in _json_schema_to_python_type
f"str, {_json_schema_to_python_type(schema['additionalProperties'], defs)}"
File "/kaggle/tmp/.venv/lib/python3.10/site-packages/gradio_client/utils.py", line 919, in json_schema_to_python_type
type
= get_type(schema)
File "/kaggle/tmp/.venv/lib/python3.10/site-packages/gradio_client/utils.py", line 880, in get_type
if "const" in schema:
TypeError: argument of type 'bool' is not iterable
An error occurred launching Gradio: When localhost is not accessible, a shareable link must be created. Please set share=True or check your proxy settings to allow access to localhost.

#

i get a error when trying to run through kaggle

low shard
trim sparrow
#

"error: No virtual environment found; run uv venv to create an environment, or pass --system to install into a non-virtual environment
"

low shard
trim sparrow
#

i added it

#

it wouldnt work without the !

#

should i js create a env? "!uv venv .venv"

low shard
#

i'm doing a quick test rn, installing requirements

trim sparrow
#

i just did the venv .venv thing and it worked but not ngrok is saying theres multiple instances from before n i have no clue how to terminate

low shard
trim sparrow
#

TypeError: argument of type 'bool' is not iterable
An error occurred launching Gradio: When localhost is not accessible, a shareable link must be created. Please set share=True or check your proxy settings to allow access to localhost.

#

this is so annoying

#

idk i give up on this

low shard
low shard
trim sparrow
#

im using the google colab one right now it seems to be working is it worth it or should i try notebook again?

low shard
#

and you don't know how many epochs to train, soo

simple ore
trim sparrow
low shard
simple ore
#

@low shard you can just import the notebook directly from github instead of taking 2-month old one

trim sparrow
#

how do i do that?

simple ore
#

yes

#

latest code has !uv pip install gradio==5.23.1

trim sparrow
#

how do i import it from github?

simple ore
#

use import notebook, then

low shard
trim sparrow
#

do you have the github link for it?

simple ore
low shard
#

@crude flame u gotta update the docs then i guess

lofty lichen
#

training with refinegan slower? or just me

simple ore
#

that one is old, but same fix

simple ore
#

new code uses fp32, so a bit slower

low shard
simple ore
#

I assume Vidal owns it, so he needs to update it

trim sparrow
#

how do i put my database inside of Applio?

#

do i put the dataset inside of the "working>Program_ML"

#

@simple ore

simple ore
#

i have not used kaggle

trim sparrow
#

thank you so much and also is there any way to see the console so i can see the progress?

#

nvm got it, thank you again you and nick!

sly furnace
#

which is the best version for ai cover? fork or mainline rvc?

austere harness
#

i need help

#

my voice model doesn't work on discord for some reason

low shard
low shard
low shard
austere harness
low shard
#

!give-media-perms 1h @austere harness

austere harness
#

@low shard here are my settings

#

theres no audio coming out

low shard
#

What browser are u using

austere harness
#

i use google chrome

low shard
#

Check if it works when u use client

simple ore
austere harness
austere harness
low shard
austere harness
#

no audio

low shard
low shard
austere harness
low shard
analog obsidian
analog obsidian
low shard
austere harness
simple ore
sly furnace
sly furnace
analog obsidian
low shard
#

Could u also send the link to the model download link

#

It's prob just a bad model

austere harness
sly furnace
austere harness
#

but theres no audio on discord

analog obsidian
sly furnace
#

mainline better?

low shard
#

Both wouldn't change in quality

sly furnace
#

😭

analog obsidian
low shard
#

Mainline and applio quality are the same

sly furnace
#

so whats the difference

analog obsidian
#

mainline is the original rvc

low shard
sly furnace
#

yea and appolio is the fork

low shard
#

And some features like tts

austere harness
analog obsidian
austere harness
analog obsidian
low shard
sly furnace
#

thank you so much sir

#

this right?

low shard
sly furnace
#

i have 1080ti but its being used at another system

low shard
sly furnace
low shard
#

Because AMD is less supported in most ai programs than Nvidia

sly furnace
low shard
#

You need to use Zluda

#

A CUDA emulator

#

It's just longer to follow but will work

sly furnace
#

i will follow it but do u think i should use my own machine or nvidia will be better?

low shard
#

In speed terms

#

Quality should be the same if you use on ur amd GPU rmvpe

sly furnace
#

you are so good!

simple ore
#

6600 is memory limited

sly furnace
simple ore
#

yeah, 4070 -30% for laptop

sly furnace
#

yeah i am a techy myself haha

#

just getting into ai stuff

austere harness
#

@low shard so is there a fix for the voice model not working or should i just get a new model?

low shard
#

try re-uploading it in wokada

#

and be sure u extracted the zip

austere harness
trim sparrow
#

@low shard which batch size should i use with kaggle?

low shard
trim sparrow
#

its 14 minutes

#

of singing/regular speaking

low shard
#

8 should do the job

trim sparrow
#

and then save should i put that on 20?

#

and for the epoch should i start at 250 and then when it stars dropping just stop the training?

austere harness
low shard
low shard
#

and u check the tensorboard how it goes

low shard
trim sparrow
#

alright bet thank you so muc for all the help

#

should i mess with any of the advanced settings or just not touch those?

low shard
trim sparrow
#

alright brotha ill lyk how it goes thanks again!

charred canopy
#

Hey guys I need to make an AI cover using Yoda and I don't know how to do it in any sense, I have used jammable and a lot of other services and it's not very good, does anyone have any ideas on what I can try?

low shard
#

all that those sites use is in reality the FOSS (Free and Open Source Software) named RVC (Retrieval-based-voice-conversion)

#

you can run it locally on your pc

jolly tangle
low shard
#

why?

sly furnace
#

how much echopos and how many minutes of voice recording i need

#

for best cover

jolly tangle
low shard
trim sparrow
#

is this normal?

analog obsidian
#

its all random

#

thats for epochs

sly furnace
jolly tangle
#

So

sly furnace
#

i want to make ai cover

analog obsidian
sly furnace
#

so need talking + singing clips right?

#

and how long i need it to be

analog obsidian
#

if you want to do a singing cover, a singing model its better

sly furnace
#

i can sing any bullshit ?

sly furnace
#

in singing

analog obsidian
#

dont expect a speech model sing godly lol

#

30 minutes is good for realism

sly furnace
analog obsidian
#

but anything less than that works

sly furnace
#

it wasnt so good

#

so i want to make another one

#

someone made it for me

analog obsidian
#

are you recording your own voice? then sing, use your whole range

sly furnace
analog obsidian
#

just singing

sly furnace
#

bullshit works ?

trim sparrow
#

@low shard mb for all the pings is this bad for the model or normal?

analog obsidian
#

anything yeah

#

rvc clones the expression of the dataset keep in mind that

sly furnace
#

the pre train ?

analog obsidian
#

if u sound tired the model will sound tired

#

like that

sly furnace
#

the energy

analog obsidian
#

yuh

analog obsidian
#

you can sing anything

low shard
#

also it just started

sly furnace
#

cant i make it mix?

low shard
#

overtraining doesn't happen instantly

sly furnace
#

low and highs

#

like i am drunk

analog obsidian
sly furnace
#

and sober

#

at same time

#

💀

analog obsidian
#

lmao anything works

#

as long its singing

trim sparrow
sly furnace
#

and if i want two languages

#

i should make different models for them right?

trim sparrow
analog obsidian
#

instead of fully using the singing samples

trim sparrow
#

welp i guess we will see how it plays out 😭

analog obsidian
#

its all random so i cant even predict that

low shard
analog obsidian
#

havent tried that

sly furnace
#

30 minutes is max?

#

anything over it doesnt matter?

analog obsidian
#

1 hour max

#

after 1 hour, theres no improvement

sly furnace
#

and how many epochs

analog obsidian
#

try 500 epochs

#

and hear all of them until you find the one that sounds more natural

jolly tangle
analog obsidian
#

graphs don't help in choosing the epoch, they're meant to tell you how well the model is doing during training

sly furnace
#

how much time it takes usually?

#

to train 500

analog obsidian
sly furnace
analog obsidian
#

maybe 5 or 6

#

idk

sly furnace
analog obsidian
#

1 day

#

batch size

#

its slow asf

sly furnace
#

batch size means each time it takes backup right?

analog obsidian
#

batch size is how many samples at the time rvc will learn

sly furnace
#

yea i meant that only

#

same thing

analog obsidian
#

too high its bad
Too low also its bad

sly furnace
#

u said 8 i will do 8

#

i will spam ping u when i have issues ty so much buddy

analog obsidian
#

i dont answer pings

sly furnace
#

so i have to send cops to ur place?

#

can i dox u

trim sparrow
sly furnace
#

i dm u ?

low shard
#

it's the loss value

trim sparrow
#

alright ts got me stressing 😭

sly furnace
#

i can see you

analog obsidian
low shard
#

loss goes down = less losing shortly explained

low shard
analog obsidian
low shard
#

both of yall are banned 🔥

sly furnace
#

but i am a cute panda

analog obsidian
#

training models aint hard
just tedious

sly furnace
#

so wanna make it better than previous ones in past

analog obsidian
#

record yourself, after dat denoise your audio, then run this in audacity

sly furnace
#

i will require it when i do in practical

analog obsidian
#

and use the simple slicing in applio, 3s and 0.3s of overlap

sly furnace
#

i will lost it till then

#

so i have to disturb u again

#

i will ping u again tomorrow

#

should go to bed

#

Good morning and good night

#

btw jokes appart u guys really helped a lot , it didnt helped anything but still u helped haha i appreciate it alot thanks a lot for your time buddies

#

❤️

analog obsidian
#

good night cat_pat

crude flame
trim sparrow
#

@low shard still normal or no

charred canopy
#

Problem is I need to do it quickly, I don't want to have to start training anything from scratch

trim sparrow
low shard
# charred canopy I have a nividia rtx 3050

Your Nvidia GPU is good enough to do inference (use models) locally (on ur pc), not the best to train (make models) even if still possible

You can:

  • Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides):
    • Applio: A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
    • Mainline: The original RVC
  • Cloud (remote good pc, easier and faster than ur PC but it's limited):

Easiest possible (automatically separates vocals & instrumentals) : weights.com & rvc-ai-cover-maker-ui
easiest cloud: Ilaria rvc zero
easiest local: Applio

#

if u just want a quick thing as easy as possible then use weights.com

charred canopy
#

Thank you so much

low shard
trim sparrow
#

should i stop it at 16:50 i started it at 16:00 or around that

rancid sparrow
#

can sm1 send me okada voice changer thru google drive its so complicated to download

toxic nacelle
#

hello i need help with the ai voicechanger

toxic nacelle
#

huh

low shard
#

what's the issue? and what guide link did u use?

toxic nacelle
toxic nacelle
low shard
#

!give-media-perms 1h @toxic nacelle

trim sparrow
#

should i stop now?

low shard
toxic nacelle
#

1 sec

trim sparrow
#

alright

toxic nacelle
toxic nacelle
low shard
#

u used a youtube tutorial?

#

because those settings are completely wrong and ur using the original wokada

toxic nacelle
#

yeah, noone want to help me around

low shard
#

uninstall also vb audio cable, it gives random issues on windows

#

-realtime

patent trellisBOT
# low shard -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

low shard
#

read the 1st link, wokada deiteris fork

toxic nacelle
#

few mins

#

may take a while

low shard
toxic nacelle
#

ok ummm i'm no an english speaker soo what should i do

#

i am mate

trim sparrow
#

@low shard should i keep it going or stop it

toxic nacelle
#

@lofty lichenyou relize i can see deleted text

low shard
low shard
#

It's better you try to read it, use a translator, and let us know for any issues

toxic nacelle
#

yeah but idk where is the download

low shard
toxic nacelle
#

alrightly

trim sparrow
#

time to end it?

#

@low shard

low shard
#

Wait still a bit

trim sparrow
#

isnt it spking down like

#

by a ton?

toxic nacelle
trim sparrow
#

im pretty sure its at epoch 289

low shard
#

This isn't a graph about quality

#

It's about the machine learning model loss in the training

#

Read how the tensorboard works in the docs

toxic nacelle
low shard
# toxic nacelle

GPU: ur amd GPU

Extra: 2.0
Chunk: since youre going to use it in a game, you will need to check the perf value at top left when you click start, then put a little higher value as chunk

#

Did you do the step 3 and download vac lite?

toxic nacelle
low shard
toxic nacelle
#

@low sharddone

#

how how do i set it up

simple ore
#

look at 3 g losses - fm, mel, total

toxic nacelle
#

@low shardok fixed but kinda slow

trim sparrow
#

is it this one?

#

do i keep it going or should i stop it

#

(im at epoch 435)

#

@simple ore

#

@low shard

coral estuary
#

Hey, I actually just started a chill little server where we share useful tools, automation tricks, and side hustle ideas.
If you're into that, I can send you the link!

elder coral
#

I have a good GPU It's Nvidia 3050 Geforce Rtx too

#

I din't get any errors

elder coral
#

after installing

#

errors

leaden crag
#

Hi! what are the differences between client and server? Other than echo/supp1-2? [Okada]

simple ore
# elder coral errors

once again your options are 1) use compiled version that does not require 'installing' step, 2) env\python -m pip install gradio==5.23.1

simple ore
#

then open command prompt and run env\python -m pip show gradio

elder coral
#

in the applio folder

#

I tried that yesterday

#

and it needs me a bigger pc

simple ore
#

i mean now and post a screenshot of the result

elder coral
#

is this an

simple ore
#

now run the command I gave you

elder coral
#

i already

simple ore
#

and upgrade it to 5.23.1

simple ore
elder coral
#

ok now its updating

#

A new release of pip is available what does that mean?

simple ore
#

now run applio

elder coral
#

ok will work?

trim sparrow
#

noobies if my voice is kinda crackly near higher pitch would it be a good idea to add a dataset of a little of similar artists that have proper pitch or would that mess it up?

elder coral
#

yes it finally worked

trim sparrow
#

im adding falsetto & some higher freq/pitched vocals to the dataset to see if it fixes the problem also took around 60% of the talking out to make sure it doesnt interfere but just there for tone 22 minutes long 650 epochs lets see how this goes i guess

simple ore
trim sparrow
pliant pier
#

hey so like, is this thing supposed to use 56% of my cpu? it wasnt doing it yesterday

#

also Im using okada's changer

urban fractal
#

Hey guys quick question with tts-with-rvc
Is there a way to speed up the TTS voice?

urban fractal
#

I'm just on windows

simple ore
#

applio uses edge tts, there's a speed parameter

urban fractal
#

tts = TTS_RVC(
model_path="cartman.pth",
index_path="cartman.index",
voice="en-US-EricNeural",
f0_method="rmvpe"
)

simple ore
#

where did you get it?

#

I assume it is the same edge tts based on the voice

urban fractal
#

Yea MS crap so there should be a way to control the speed

#

I dug into the library but didnt see where its getting the tts audio from and how it generates it

#

Oh I think I found the speech function in there

simple ore
#

rate should be non 0

#

+5 if you want 5% faster

urban fractal
#

Got this error though
path = tts(text=ai_response, pitch=0, index_rate=0.85, tts_add_rate=5, tts_volume=2, is_half=True)
TypeError: TTS_RVC.call() got an unexpected keyword argument 'tts_add_rate'

simple ore
#

it is tts_rate

urban fractal
simple ore
#

add a debug before `communicate = tts.Communicate(``

#

and print the value

#

anyway, that project uses ancient rvc code

daring sable
#

yo

#

it isnt

#

like working for

#

me like

#

its not talking

#

fix?

simple ore
daring sable
#

is

#

that y

#

ty

#

it sounds robotic

#

as shit

#

where can i fix that

#

sounds robotic

hallow thistle
daring sable
hallow thistle
# daring sable

Set chunk number to around 46 ms for less audio delay. Set "F0 Det." to regular rmvpe. Although you have set extra number correctly, but the audio still sounds robotbic, it can be the voice model itself. Try another one.

rare charm
#

Hello everyone I am starting a project, what i need is a co-host for my YouTube gaming streaming, i already created a working model, but i have some problems, it uses eleven labs api, so ot cost me a lot, i would like to you guys to give me ideas on what voice clone open source good enough to do a Brazilian kratos Voice that i already have 5 minutes of isolate voice to use on the model, also I am using open ai api, o would like to do this project entirely open source, can i have some directions in what to use for my idea?
I have:
I9
4090
64 gb ram

hallow thistle
# daring sable the perf

If you use W-Okada with a game or something, increase the chunk number until perf number turns green.

urban fractal
simple ore
#

20 as in 20% faster

#

it does not do more than 50 I think

urban fractal
#

good to know

trim sparrow
hallow thistle
trim sparrow
#

do you have a link perchance

hallow thistle
wanton valley
#

This might be the end if mainline-colab doesnt get fixed

hallow thistle
regal pivot
#

What's the best way to do a duet locally? Right now I'm using ultimate-rvc, which I'm not sure if I can do it there, so I assume I need to separate the vocals another way, but I'd much prefer locally. Specifiacally male and female vocals.

knotty moth
regal pivot
#

I tried MVSEP and it didn't really do that good of a job. I'm trying UVR5 and it just refuses to separate the male backing, messing around with different models to hopefully find a system that works.

trim sparrow
#

how do i continue training a model

#

i wanna add a new dataset to it but i dont wanna re render the old dataset that was already processed

simple ore
trim sparrow
#

wdym?

simple ore
#

I would not advise trying to use a finetuned model as a pretrain for a larger model

#

the base has to be generalized enough

#

and by training a finetune model you're cutting off some of the parts

trim sparrow
#

no im just adding more singing/rhytem to it

simple ore
#

just start over

trim sparrow
#

ts took like 5 hours😭

simple ore
#

happens

trim sparrow
#

so i cant just add the new audio to the same dataset and process it or will that basically just restart it or

simple ore
#

as I just explained

#

imagine you have a block of marble and you're carving a statue out of it

#

a pretrain roughly cut shape of a human being, then you finetune it by cutting it into a shape of a bald man with a beard

#

and now you want to turn that into a statue of a woman with big tits holding a vase

#

did I make a point?

trim sparrow
#

sorry im not 100% with the vocab for a lot of the ai stuff even if lets say i created the model from scratch and its just singing/humming etc and i just wanna add more would that still fall in the criteria of that

#

yeah just not 100% on the vocab for that stuff but i get ur point just not sure of what a pretrain or finetune is

#

but either way ill just start over im probably gonna go to bed anyway and let it sit

simple ore
#

I gave you totally not AI-based explanation

#

pretrain is the weights you're using as a base

#

finetuning is using a small dataset to finalize the voice model

trim sparrow
#

ahhh okay

#

n quick question for the dataset does it have to be in just one file for all the audio or can it be multiple?

simple ore
#

it can be many files in the same folder

#

ideally they need to be same quality recording-wise

past barn
#

can someone walk me through setting up realtime voice changer using these ai models

trim sparrow
#

alrighty thank you also do you have any clue on how to delete old models its taking up storage for kaggle😭

cobalt vine
#

can someone tell me what happened

pastel oak
#

Youll need Kaggle

trim sparrow
pastel oak
trim sparrow
#

should i stick with fps16 or fps32 @simple ore

cobalt vine
#

damn i just paid, few days ago for it

simple ore
#

so bad token?

#

or whatever it says

hallow thistle
# cobalt vine can someone tell me what happened

Make sure you censor your ngrok token before sending screenshot, otherwise someone would snatch your token for their things. What is your PC GPU? Because W-Okada can work locally without having hard time finding ngrok token for Colab/Kaggle.

thorn pollen
#

Hii is there an ai which can like fake mouth movement on a video where the persons mouth is closed

cobalt vine
#

i did refresg

simple ore
#

you can see that by a bunch of triangles shown on the charts

trim sparrow
#

thank god i just put it at 32 and got in bed😭

hallow thistle
cobalt vine
#

gtx 1660

#

realtime collab cant use ?

hallow thistle
# cobalt vine gtx 1660

With that GPU, it should work. Just won't be that fast like one in Colab/Kaggle. https://rentry.co/ForkVoiceChangerGuide#download-nvidia-on-windows

cobalt vine
#

it is but it sometimes very laggind and dc when i play fiveM

#

that why i need collab

hallow thistle
#

That's pretty much it. Most of W-Okada Colab notebooks are broken. Even using fork W-Okada Colab can get your account to be terminated, especially when you have free tier.

cobalt vine
#

but i buy premium a few days ago

#

its not free

hallow thistle
#

No excuse. If you have bought Colab Pro or compute units, make sure you keep it set to T4 GPU instead of A100 or L4, because these two GPUs eat more compute units than T4 one.

cobalt vine
#

yup i have it

hallow thistle
#

Kaggle gives an option to use two of T4 GPUs for free 30 hours a week. But if you have bought Colab Pro or pay as you go, then use it to your advantage I guess.

cobalt vine
#

So does RealtimeVC mean it can't be used anymore?

hallow thistle
hallow thistle
#

What's up?

lofty lichen
#

@steel forge warning while sleeping?

lofty lichen
#

while sleeping

knotty moth
lofty lichen
#

Reason: Be respectful to other server members and do not post NSFW content

#

lmao

hallow thistle
#

!howtoask

patent trellisBOT
# hallow thistle !howtoask

How To Troubleshoot AIHC_WaitWhat

__**GIVE CONTEXT.**__ 📝
  • Don't simply mention your issue, like "my rvc is not working".
  • Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
  • The more context, the better.
__**BE POLITE.**__ <:matsuripray:1159685390156967936>
  • Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
  • It's okay if you're frustrated, but don't take it into this server.
  • Don't DM without prior consent.
__**BE PRODUCTIVE.**__ 🤝
  • Don't ask for every little instruction. Put your own effort & test things by yourself.
  • Don't ask to ask.
  • Check if your answer is a Google search away/on our guides website.
lofty lichen
#

while sleeping

steel forge
#

Just don't send that gif again

lofty lichen
steel forge
lofty lichen
hallow thistle
#

Sending what you got warned by mod into public chat won't fix your issue.

steel forge
knotty moth
lofty lichen
#

anyone can search it on the gif at the side of the chat

#

then tell owner to remove gif access

steel forge
#

Its the fact that it was a penis

#

Yes that's the truth

lofty lichen
steel forge
#

By tenor*

knotty moth
steel forge
#

So if you can avoid sending things that might be on the line, wed appreciate it. We have kids here

steel forge
#

Ok

cobalt vine
#

@hallow thistle still the same

hallow thistle
#

Don't be a dick here. Anyone can make a mistake. The warning should make you aware of what you're doing. If you think you were unfairly treated, you can write down your appeal in AI Hub's website and let what other mods think about you. anime_nom

cobalt vine
steel forge
#

"Failed to accept"

cobalt vine
knotty moth
hallow thistle
#

Although you can still use Colab for other AI programs like Applio or UVR5. Maripray

cobalt vine
#

many of my numbers can't be used on kaggle but ok i'll try

hallow thistle
#

I've never had any problem using W-Okada on Kaggle. The error can happen when you skipped some important steps like where to start the notebook and the settings.

long oyster
#

is this setting voice good enought guys?

hallow thistle
steel forge
#

Well

#

If your current cable works

hallow thistle
steel forge
#

That's fine

steel forge
#

I'm so used to seeing nightmare settings so yours are nice to see

hallow thistle
#

If you have any problem using VB-Cable, you can try switch to VAC next time. Pop_cat

pastel oak
past barn
#

yo

#

i need help

#

my voice changer is super laggy when i run it

#

how do i fix that

#

im on AMD

low shard
low shard
past barn
sharp crescent
#

Is it posible to tailor AI Voice to your own, like if someone is making AI model for me, they can also tailor it to my voice so that the sound is better, more fuild, less choppy ?

umbral acorn
#

Hai. Can anyone show me how to train model in colab? I just came back after a year so i don’t remember anything huhuhu

past barn
#

@low shard

low shard
#

!give-media-perms 1h @past barn

low shard
#

That's prehistorical

#

Lemme guess, you used YouTube tutorials?

#

All video tutorials are outdated asf

low shard
umbral acorn
sharp crescent
silk pike
sharp crescent
#

Anyone got info since i only know of youtube ones ?

patent trellisBOT
# low shard -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

low shard
#

1st link

#

Wokada deiteris fork

#

@sharp crescent also what is your PC GPU name?

sharp crescent
#

rtx 3060ti

low shard
#

Wokada doesn't support it yet

low shard
jolly tangle
low shard
#

RefineGAN is experimental

#

It is disabled even in applio main branch

#

Since it's not that stable yet

sharp crescent
pastel oak
silk pike
golden walrus
#

cat_pawbite i guess it used system in and out

#

wah ?

#

it's in sound setting

outer fog
#

anyone know why my voice is cutting out while talking

knotty moth
#

same as for discord

outer fog
#

my voice only cuts off while using echo feature on noise, anyone know why?

low shard
#

One day could be 4 hours

#

Another could be 1

#

Another could be 3

#

It's not granted time

#

Kaggle has granted time instead

low shard
#

!give-media-perms 1h @outer fog

#

What game are u playing

unreal linden
#

Hey, I'm trying to play the app in server mode but I can't see my headphone in windows WASAPI

stone lynx
#

I want to use my gpu but it shows me there is only cpu option to choose

#

how can i use my gpu instead cpu

atomic kindle
#

how to fix mic background noises?

#

is it possible to kinda silent them?

#

because my mic is ig broken or something

#

the ai voice detects these noises as my voice and there is like a permanent moaning

#

and my voice has a kind of an echo without even activating echo

low shard
low shard
low shard
#

!give-media-perms 1h @stone lynx

stone lynx
#

my pc gpu is 6700 xt
i want to use my gpu instead of cpu

low shard
#

that's an anciet version, an old version of original wokada

#

over a year old

#

never follow youtube tuts for wokada

#

they are all old

#

also vb audio cable gives random issues on windows

low shard
#

-realtime

patent trellisBOT
# low shard -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

low shard
#

read the 1st link, wokada deiteris fork

stone lynx
#

i downloaded but i cant find the run http

low shard
#

you gotta run the .exe

#

it's a different version of the program, the deiteris fork

#

please be sure to not skip any steps of the guide

stone lynx
#

wait it says

#

It says it will open in the browser.*

#

so i cant use it as program anymore right?

low shard
# stone lynx so i cant use it as program anymore right?

it is still a program, it runs locally on your hardware

both programs always used a web user interface made in javascript and typescript

the only difference is that original wokada made it's own browser window to open the local hosted url, which iirc is removed in deiteris fork for sometimes cuasing performance issues

knotty moth
# stone lynx It says it will open in the browser.*
low shard
pearl zealot
#

How can I completely uninstall mangio rvc from my pc?

low shard
pearl zealot
#

yes :))

stone lynx
#

which one is better

#

to use

low shard
#

Fcpe is good only for being faster but shittier quality

stoic oasis
#

Hello
Is there a voice changer where I could modify my voice just partially? Something like voice model changing voice by 20% let’s say. Aim to mix, modify, not transform completely. I was checking w-okada and some other software, either they don’t have such features or I don’t know how to use them. I’m experimenting.

stone lynx
#

hello again

#

When I speak, I hear a small voice repeating the last word I said.Is there a way to fix this

low shard
low shard
stone lynx
pastel oak
gloomy mortar
#

guys i have a question

#

when using the index

#

i dont get this problem without index

#

but when i try index i get this

#

error