#✨│ai-help

1 messages · Page 240 of 1

lucid creek
#

ita just 2 h

stiff goblet
#

these are my charts , training for 3 hours

orchid flint
#

Question, is 2 and a half hours of data for a model too much? I tried making a Richtofen model and it sounds like shit. Granted I'm not German but its very rough. btw I'm using it with W-Okada so im not sure if that is a factor or not.

orchid flint
#

Ok, just stick to an hour then?

stiff goblet
orchid flint
#

I use an hour and a half sometimes and it sounds good, first time I tried with a 2 hour dataset

crude flame
#

anything after an hour you just get diminishing returns

orchid flint
#

Ok I see, makes sense

stiff goblet
orchid flint
#

yeah, it is good quality, I the dataset is ripped right from black ops 3

stiff goblet
orchid flint
#

Yeah but this character is very expressive.

simple ore
#

a lot of dips on normal g/total indicates a lot of silence in the dataset

#

fm going up very fast, so most likely this is gonna be a trash model

stiff goblet
analog obsidian
#

too much silence in the dataset bad
mute files are fine as long the dataset has the silence truncated

simple ore
vale heart
#

What AI, preferably local, will allow to convert subtitles with timestamps into AI voice?

stiff goblet
orchid flint
#

Bro, RVC hates me 💀

analog obsidian
stiff goblet
analog obsidian
#

yep thats a small dataset, so makes sense why fm is raising like that

stiff goblet
lucid creek
lucid creek
#

its in overtrain right

orchid flint
#

Eh, kind of hard to tell. You could always train for a bit longer

viral mason
#

idk what bro is using so I cannot help

lucid creek
viral mason
lucid creek
vale heart
#

guys, anyone text to speech ai? with timecodes 00:00:18,000 --> 00:00:21,000 please

viral mason
lucid creek
#

kaggle ofc

simple ore
lucid creek
turbid ridge
#

Quick question even at 250 my rvc models are all quiet

gilded flicker
#

I am running a local ai on a 3060 ti and a ryzen 3 series cpu what can I rent to render faster someone recommend me vps at a good price na servers I wanna be able to open windows on vps and run my own programs

junior gull
#

Allegedly, i cant really confirm just what i was told by someone else, the errors that were resulting from this was like a domino effect that also led to my virtual environment loading incorrectly and processing quietly defaulted to cpu based without failing or warning me. When I ran it all correctly and with cuda support it was blazing fast lol.

simple ore
#

if your dataset is too low volume, then the results of inference will be quiet

simple ore
#

there's no way in hell to train a meanigfully good dataset this fast even on the most advanced AI accelerator

vestal helm
#

I js farted

visual compass
#

Does anyone have tips on speeding up the process of building a dataset for a voice model? Specifically sifting through an entire stream or episode's worth of audio to find and isolate where the voice you're trying to clone is located? I figure there's a better way than manually going through it and comparing it to a transcript to easily locate them.

next vortex
#

I have some problem when I try to use RVC on Colab

hallow thistle
patent trellisBOT
# hallow thistle !howtoask

How To Troubleshoot AIHC_WaitWhat

__**GIVE CONTEXT.**__ 📝
  • Don't simply mention your issue, like "my rvc is not working".
  • Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
  • The more context, the better.
__**BE POLITE.**__ <:matsuripray:1159685390156967936>
  • Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
  • It's okay if you're frustrated, but don't take it into this server.
  • Don't DM without prior consent.
__**BE PRODUCTIVE.**__ 🤝
  • Don't ask for every little instruction. Put your own effort & test things by yourself.
  • Don't ask to ask.
  • Check if your answer is a Google search away/on our guides website.
hallow thistle
#

Which RVC Colab notebook?

next vortex
#

W-Okada one

#

But I cant send screenshot here

hallow thistle
#

RVC and W-Okada are two different programs. Izunahyper

#

What is your PC GPU?

glacial pollen
next vortex
glacial pollen
#

rip

hallow thistle
#

The original Colab W-Okada is broken. Using Detris' fork W-Okada on Colab with free plan can get you terminated.

glacial pollen
#

I mean... It can be run on cpu but yt_nails

#

the delay.. oh boi

next vortex
hallow thistle
next vortex
#

i dont know how can I run it without GPU

hallow thistle
glacial pollen
#

but whether you should ( due to delay ofc and stutters ) that's another story yt_nails

#

better if you follow what Namari says, less hassle for nothing beneficial really.

next vortex
#

Do u have some voice changer else

#

I mean, ofc, free one

hallow thistle
next vortex
simple ore
#

at worst bye-bye google account

bleak torrent
#

hallo

#

Is the colab/kaggle version of w-okada no longer working?

latent kettle
#

how to fix this ??

latent kettle
#

Will it cause any impact or not ? Should I keep it as it is or there is a need to fix it ?

hot dome
#

If there is some people who need someone to make them thumbnail i can help im a thumbnail youtube maker , mp

simple ore
#

does not really matter

latent kettle
#

That cause the issue

simple ore
#

same thing

latent kettle
simple ore
#

it is just the log got double entries for 18.5k (saved epoch) - 19k (when you lost power)

brittle wing
#

can someone explain why there is an paywal on voice.ai and how to remove it and its not on browser

simple ore
#

and then the dangle 'you can do it for free*" (* not really)

brittle wing
#

so it cant be removed

simple ore
#

for the money

brittle wing
#

ok

simple ore
#

if you got a decent gpu you can run the same thing locally on your PC, or on google colab/kaggle for free

knotty moth
lucid creek
#

i mange to delete all the silence from dataset this time its so good now ? { i use appilo on kaggle}

vague edge
#

when i try to delete a voice model i get this pop up and i have to reload page

knotty moth
vague edge
#

alr ill try that now

uncut rampart
#

**in the colab version of Aicovergen i get this for the Run webUI command
**
Timer: 00:02:14Traceback (most recent call last):
File "/content/HRVC/HRVC/src/webui.py", line 10, in <module>
from main import song_cover_pipeline
File "/content/HRVC/HRVC/src/main.py", line 22, in <module>
from rvc import Config, load_hubert, get_vc, rvc_infer
File "/content/HRVC/HRVC/src/rvc.py", line 5, in <module>
from fairseq import checkpoint_utils
File "/usr/local/lib/python3.11/dist-packages/fairseq/init.py", line 20, in <module>
from fairseq.distributed import utils as distributed_utils
File "/usr/local/lib/python3.11/dist-packages/fairseq/distributed/init.py", line 7, in <module>
from .fully_sharded_data_parallel import (
File "/usr/local/lib/python3.11/dist-packages/fairseq/distributed/fully_sharded_data_parallel.py", line 10, in <module>
from fairseq.dataclass.configs import DistributedTrainingConfig
File "/usr/local/lib/python3.11/dist-packages/fairseq/dataclass/init.py", line 6, in <module>
from .configs import FairseqDataclass
File "/usr/local/lib/python3.11/dist-packages/fairseq/dataclass/configs.py", line 1104, in <module>
@dataclass
^^^^^^^^^
File "/usr/lib/python3.11/dataclasses.py", line 1232, in dataclass
return wrap(cls)
^^^^^^^^^
File "/usr/lib/python3.11/dataclasses.py", line 1222, in wrap
return _process_class(cls, init, repr, eq, order, unsafe_hash,
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/usr/lib/python3.11/dataclasses.py", line 958, in _process_class
cls_fields.append(_get_field(cls, name, type, kw_only))
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/usr/lib/python3.11/dataclasses.py", line 815, in _get_field
raise ValueError(f'mutable default {type(f.default)} for field '
ValueError: mutable default <class 'fairseq.dataclass.configs.CommonConfig'> for field common is not allowed: use default_factory
Timer: 00:02:16

#

can someone help?

viscid moss
#

Are u using the original AICoverGen Colab?

#

AICoverGen it's outdated

uncut rampart
viscid moss
#

makes sense, dat colab isn't working

uncut rampart
#

what can i do now to generate ai covers?

#

@viscid moss

viscid moss
# uncut rampart <@274566299349155851>

U have 2 ways:
If u want to do everything automatically like AICoverGen, use RVCAICoverMaker.
If u have some time and u can do it manually using UVR5 + Applio

uncut rampart
uncut rampart
uncut rampart
patent trellisBOT
vague edge
#

whats the best chunk and extra setting for rtx 4060 i tried looking on website but couldnt find 4060 settings

arctic gate
#

what do you guys use for voice chagners?

warm steppe
#

guys

vague edge
#

?

warm steppe
#

how can i get what voice changer i want through weight site?

#

i have w okada

#

but i wanna have peter griffim like my voiuce chamher

#

ansd i dont find it in voice models

quasi condor
#

are u typing fast or are u nervous?

warm steppe
#

fast

#

mb

vague edge
warm steppe
#

yeah

vague edge
#

you press the three dots

warm steppe
#

but how can i make it into the w oakada

vague edge
#

and press download

warm steppe
#

oakda

#

oooohhhhh

quasi condor
vague edge
#

true

warm steppe
#

oh ty

#

wait

#

what 3 dots?

quasi condor
#

some of the models of peter griffin on weights is shit cuz they just make the model without cleaning it

warm steppe
#

oh ok

quasi condor
#

but u gotta test the model if its good or nah

lucid creek
leaden kestrel
#

Any turkish girl models?

lucid creek
#

nah

lucid creek
#

how 😉

hallow thistle
arctic gate
mystic tangle
#

how do i make the perfect dataset?

clever burrow
#

you guys use the rvc gui thing for ai covers right?

lucid creek
#

yes

kindred ledge
#

help me choose gpu, currenly i use rtx 3050 mobile for rvc, voice good. But now i want to build and change gpu to rx 6700 xt. Will the sound be different?

simple ore
#

6700 is not a big upgrade

glacial pollen
#

rvc-gui is heavily outdated and nobody should use it

lucid creek
#

its good ?

wheat tapir
glacial pollen
#

or either original rvc but no point in that

#

whereas rvc-gui specifically, yeah, outdated ( last update it had was 2 years ago so that should give you an idea of how old it is

#

tl;dr. What should you choose?

If you want advanced features and are yourself rather advanced / want nightly features n stuff, my fork
https://github.com/codename0og/codename-rvc-fork-3

If you want simplicity and something that just works, og Applio
https://github.com/IAHispano/Applio

GitHub

Codename's rvc fork version 3, based on Applio. . Contribute to codename0og/codename-rvc-fork-3 development by creating an account on GitHub.

GitHub

A simple, high-quality voice conversion tool focused on ease of use and performance. - IAHispano/Applio

golden walrus
glacial pollen
pallid patio
#

Hi! Good morning/afternoon/evening! Can someone help me create voice models? A simple explanation or, if possible, a collab that does it would be very helpful!

odd shale
#

-rvc

patent trellisBOT
kindred ledge
simple ore
fresh storm
#

in my voice changer have cpu, gpu0, gpu1, gpu2, gpu3, what i choose ?

junior gull
balmy girder
#

im having trouble with training a model

#

it keeps saying "list index is out of range"

#

how do i fix it?

simple ore
somber cobalt
knotty moth
somber cobalt
#

rtx won't work?

somber cobalt
knotty moth
somber cobalt
#

ok

knotty moth
somber cobalt
#

no i have 30 series

#

do i need igpu for it?

#

my cpu doesn't have one

knotty moth
somber cobalt
#

so its basically the same as the old voice changer?

knotty moth
#

dec 2024 is still not too old, just before 50-series launches

somber cobalt
#

no i was talking about the okada voice changer

#

i used it in the past

somber cobalt
#

i am downloading it from releases

#

not from the link in the guide

#

is that ok?

knotty moth
#

it should be the same I suppose

somber cobalt
#

in releases it has cuda part 1 and part 2

knotty moth
#

all parts are needed to extract

somber cobalt
#

ok

uneven harness
#

Im using RVC AI Cover Maker and for some reason it wont convert. ill upload the music via drop, select the model, click convert and it says error.

#

this is the precompiled version

uneven harness
#

nvm

#

fixed it

#

:3

pastel oak
#

@fallen wing whats your gpu

fallen wing
#

Nvidia 4050

#

Which do i download i may be stupid

pastel oak
#

Then download first nvidia

#

The 2nd says rtx 5000

#

You have 4000

fallen wing
hallow thistle
#

!howtoask

patent trellisBOT
# hallow thistle !howtoask

How To Troubleshoot AIHC_WaitWhat

__**GIVE CONTEXT.**__ 📝
  • Don't simply mention your issue, like "my rvc is not working".
  • Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
  • The more context, the better.
__**BE POLITE.**__ <:matsuripray:1159685390156967936>
  • Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
  • It's okay if you're frustrated, but don't take it into this server.
  • Don't DM without prior consent.
__**BE PRODUCTIVE.**__ 🤝
  • Don't ask for every little instruction. Put your own effort & test things by yourself.
  • Don't ask to ask.
  • Check if your answer is a Google search away/on our guides website.
fallen wing
#

After downloading i assume i extract it?

hallow thistle
#

If the zip finished downloading, use 7zip or WinRAR to extract it to a folder. cat_wtf

#

No need to ask every step on this one.

pastel oak
#

Ye you asked questions that get answered by scrolling for 1 second on the guide

#

If anythings still unclear ask

fallen wing
pastel oak
low shard
low shard
fallen wing
#

Something else fr

copper terrace
#

I got this and im confused

#

idk what to do

latent kettle
hallow thistle
copper terrace
copper terrace
#

and sorry for the late late replies

simple ore
#

older than dinosaurs in AI terms

copper terrace
#

xD

copper terrace
#

or a link to a guide

simple ore
copper terrace
simple ore
#

with 4GB VRAM at most you'll be able to replace voice in small files

copper terrace
#

alright

copper terrace
tender socket
#

is there a guide here on how to make an AI voice model for RVC

analog obsidian
copper terrace
#

I need to record voice then covert in rvc?

analog obsidian
#

oh

#

so u wanna do local conversion

copper terrace
#

I'm confused

#

I'm an actually ai noob lol

analog obsidian
#

you want to change your voice in realtime to use it on disc/games? use deiteris w-okada

simple ore
#

realtime in games on 1650... i'm ded

analog obsidian
copper terrace
#

Going afk for a bit*

analog obsidian
#

yea he cant do that with a 1650 laptop gpu

simple ore
#

i mean.. if you play competitive Solitaire

copper terrace
copper terrace
#

I'm try both and see

analog obsidian
patent trellisBOT
# analog obsidian -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

analog obsidian
#

first link

#

you may not be able to use it while playing games, your gpu is too weak

gloomy mortar
#

bc i dont have a really good computer, ive tried this voice changer on kaggle. the thing is, with this voice changer apon clicking any of the models i get this error "TypeError: Cannot read properties of null (reading 'modelSlots')
TypeError: Cannot read properties of null (reading 'modelSlots')
at i (https://85c5fd18-860e-42eb-8752-dc96044fe4e4.hrzn.run/index.js:2:1305771)
at Object.updateServerSettings (https://85c5fd18-860e-42eb-8752-dc96044fe4e4.hrzn.run/index.js:2:1306003)"

then while trying to intalise again i get this: "TypeError: (0 , Zz.removeDB) is not a function
TypeError: (0 , Zz.removeDB) is not a function
at https://85c5fd18-860e-42eb-8752-dc96044fe4e4.hrzn.run/index.js:2:3293292
at m (https://85c5fd18-860e-42eb-8752-dc96044fe4e4.hrzn.run/index.js:2:1536451)
at Generator.<anonymous> (https://85c5fd18-860e-42eb-8752-dc96044fe4e4.hrzn.run/index.js:2:1537797)
at Generator.next (https://85c5fd18-860e-42eb-8752-dc96044fe4e4.hrzn.run/index.js:2:1536880)
at e (https://85c5fd18-860e-42eb-8752-dc96044fe4e4.hrzn.run/index.js:2:1543652)
at s (https://85c5fd18-860e-42eb-8752-dc96044fe4e4.hrzn.run/index.js:2:1543855)" any fixes? im using hrzn

#

this is w-okawda's voice changer btw

#

and yes i do have the modded version

#

but i also wanna try this out

low shard
potent bone
#

should i use codename fork instead of applio?

peak path
#

hi
should i use .index file in my model zip file ?
i use only the final .pth and add it to my zip
Replay software can import only .pth file without error.
i want to know if i don't use .index, can effect in my final result?

regal fog
#

help, an error pops up
[Voice Changer] Web Server Launch Exception, DLL load failed while importing beatrice_internal_api: The Dynamic link library initialization program (DLL) failed.
[VCClient] wait web server...10 http://127.0.0.1:18888/
[VCClient] wait web server...20 http://127.0.0.1:18888/
[VCClient] wait web server...30 http://127.0.0.1:18888/

simple ore
potent bone
simple ore
#

it is not noob friendly after all

low shard
magic hare
#

I'm using RVC and stuff, as a voice changer for VRChat, whenever I load a model, I get this error: Error:'NoneType' object has no attribute 'host_api'

low shard
magic hare
low shard
trail dirge
#

do i have to use RVC GUI for files or can i just use wokada?

simple ore
#

or you can just use the cloud option on huggingface with illaria rvc

trail dirge
#

is there command to see the RVC GUI link thing like with - realtime?

simple ore
#

-rvc

patent trellisBOT
smoky linden
#

RVC i think

junior magnet
#

Hi, is it normal for the audio to take so long to convert?

junior magnet
#

Is the collab working?

turbid juniper
#

does anyone have this kind of problem when merging models?

smoky linden
#

anyone now how to install rvc voice changer

knotty moth
turbid juniper
#

wait I think I found it

#

thx

low shard
low shard
# smoky linden anyone now how to install rvc voice changer

what’s ur pc gpu? What do u want to do?

RVC = Retrieval-based-Voice-Conversion, the best Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.

Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)

low shard
#

what’s ur pc gpu? Which collab link? How much time did it take?

turbid juniper
low shard
#

wokada deiteris fork got more performance

#

be sure to not use original wokada

#

it is local for both

low shard
mystic tangle
#

what settings should i use in UVR vocal remover if im trying to make a dataset?

#

this is what i have currently

#

its supposed to be a talking model

copper terrace
#

how to I use gpu?

#

I thought I downloaded the GPU version but its using my CPU

mystic tangle
#

how do i trim audio in audacity efficiently? its hard to get the exact trim

low shard
mystic tangle
copper terrace
#

the first link

low shard
#

oh u got a gtx 1650?

low shard
# copper terrace

seems like wokada deiteris fork latest version, be sure ur nvidia drivers are up to date

copper terrace
low shard
copper terrace
stiff goblet
#

still learning ?

sly topaz
#

I can't send files to make ai covers

icy ember
#

Hey all, i have a problem with the voicechanger from w okada.
It runs and works perfectly, but i want to use it when i play a game eg. Red M, i have a huge delay, the voice laggs and i cant use it really.
If i tab out of the game and switch to any other programm such as discord or stuff, the laggs and delay are gone.
Can anyone help me ?

broken arrow
#

hey guys i have a 2080 ti and have tried using a voice changer to troll but quite a few number of times now people call it out very quickly. Is there any tips i can do to make it sound more natural? idk what much else to say should i provide a demo on how it sounds like?

icy ember
#

3060 Nivida (12GB)

icy ember
quasi condor
#

maybe its because ur playing red m?

#

are u playing it on high settings?

icy ember
#

I do have higher settings, but the porblem also appears if i play other games too.
I can have the game open and just switch to a diffrent window, then the voicechanger works fine, i can show you if you have time

low shard
#

Show a screenshot of ur wokada while rubbing the game

#

Be sure to close useless programs in the background

icy ember
#

On it!

trail dirge
low shard
#

I'm talking about the rvc GUI fork from t1g3r or smt like that

#

What's ur PC GPU

low shard
icy ember
#

So it helped but its still not good enough

#

sadly i need to go now, i will msg you later

trail dirge
knotty moth
patent trellisBOT
knotty moth
#

rvc gui by tiger18n is too old and doesn't have rmvpe pitch extractor as the current state of the art

trail dirge
#

ive seen here Apolio being mentioned

#

maybe thats newer misc_kekw_hehe

low shard
low shard
low shard
trail dirge
#

alright will check it out later when I get home

#

yesterday i updated to wokada fork and its waaay better than the one i previously had

low shard
# trail dirge yesterday i updated to wokada fork and its waaay better than the one i previousl...

RVC = Retrieval-based-Voice-Conversion, the best Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.

Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)

#

Rvc (like the applio fork) have completely different purposes than wokada

junior magnet
#

On my phone

#

And the time it took was like, forever

#

When i clicked to convert, it didn't convert

junior magnet
low shard
slim schooner
#

is anyone using rvc-project? im not a coder or anything and im literally just a casual person trying to clone a voice. i was using grok.ai to help me out but it seems like the one it asked me to download just runs into problems over and over again, does anyone have one where there aren't any issues at all? grok.ai has been asking me to add debugging scripts all god damn day

would appreciate the help, if any!

copy and pasted from the other channel lol

icy ember
low shard
low shard
#

this is an old original wokada version which has worse performance and quality

#

along with bad settings

#

plus windows users reported that vb audio cable might give issues

#

forget everything about the tutorial you followed and uninstall the programs you got from it

#

-realtime

patent trellisBOT
# low shard -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

low shard
#

read the 1st link, wokada deiteris fork

#

the only updated guide is the written one

low shard
slim schooner
# low shard what's your pc gpu and what do you want to do?

i got an rtx 4070ti and im trying to clone a voice locally since its "better" i guess? and then use that cloned voice for the wakada voice changer. however i just saw you mention how the one used in yt tutorials is old and has performance issues. so im assuming i should scrap the old wakada and download the newer one?

low shard
#

not for training

#

RVC = Retrieval-based-Voice-Conversion, the best Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.

Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)

#

inference = use models

inference realtime = use models in realtime

training = actually making the models

#

do you just need to use the models in realtime voice changing?

slim schooner
#

i know i know, i want to train one and then import it to wokada, i want to train one using rvc project, which was my main issue since the one i have installed has too many issues

analog obsidian
low shard
#

also local is better because you won't be limited by the free cloud time, since you got a good pc gpu

low shard
#

check applio for training models

#

and get wokada deiteris fork for realtime inference

slim schooner
#

sounds good, thanks! ill let you guys know if i run into any issues (hopefully i dont)

low shard
#

don't just use the same gradio public url link

#

because it will always change whenever you re-run it, as the notebook won't stay running 24/7

junior magnet
#

Okay

#

Do you know another collab link so i can do a ai cover?

low shard
#

the problem is you used the same url public link that was expired

#

be sure to always redo the same colab process everytime you need to use it

icy ember
icy ember
#

Delete everything ?

simple ore
#

dont use vb-audio for sure

#

use the virtual cable lite v4.70

icy ember
#

I currently am deleting everything as told 😮

simple ore
#

realtime voice changer does not need python, it comes with its own packaged in, and I think does not require cuda toolkit either

low shard
low shard
alpine eagle
#

Como hago las piedras de evolucion?

icy ember
icy ember
#

@low shard

#

Voice works. but it cuts at the end.. have not tried ingame yet

abstract anvil
#

Hey, quick question: Let's say you have quite a few models in Okada's directory and you want to organize them better. Can you move models into different numbered folders and have it reflected in the program's slots next time you open it? Or do you have to delete the folders like it says in the guide and start over?

icy ember
#

So ingame i cant rly use it, any setting i should use

low shard
low shard
# icy ember

set extra to 2.7, also try it in game and show again the program while in game at min settings

icy ember
#

Would you be able to help me live via voice ?

#

The game is on min settings

low shard
low shard
#

put game graphics to minimum

#

be sure to also play with the pitch and other models, not all models are perfect

icy ember
#

They are on minimum, sup1 is unchecked

#

Is there a way to creata ticket for a meetup on live supp via vc.?

low shard
low shard
#

tickets are meant for moderation

#

it depends on the staff if he can help via vc, which not really everyone can

icy ember
#

forget about the delay, the voice sounds robotic 95% of the time when ingame

low shard
#

not all models are good

icy ember
#

Thx for the help, for today i give up

low shard
icy ember
#

I know, and i will, but for today i got no time left.
You already helped me alot, and it got better, i will get there

sharp palm
#

continue to get "Not enough data present in the training set. Perhaps you forgot to slice the audio files in preprocess?" 22 min dataset, saved as .wav, with the path to the folder in the Preprocess. I even tried uploaded my dataset. What's up with this error?

low shard
#

that's kinda the fix

sharp palm
sharp palm
low shard
sharp palm
low shard
sharp palm
low shard
#

you need to make sure preprocess output shows X minutes were loaded, then you need to make sure extract features did extract N segments

sharp palm
simple ore
#

you dont need to split audio in audacity

sharp palm
sharp palm
simple ore
#

if you're using applio colab with UI, you click [x] dataset creation, enter the set name, then upload individual file(s)

#

then preprocess

#

if you're using applio colab without UI, you put the files into a folder on your google drive and provide the folder name

#

everythig is done properly you'll see 'Preprocess completed on x seconds of the audio' and then extract features gonna have ~x/3s number of slices

#

so for one hour of audio there should be 1200-1300 slices

sharp palm
slim schooner
#

lol was having this exact same issue

#

its training now

junior magnet
#

I think the problem is not that the link has expired.

junior magnet
# junior magnet

Look, I followed the step by step correctly, I just clicked on the link, there's no way it could have expired

#

I just recorded this video

low shard
#

So it's running in CPU

#

Click the arrow top right

#

Change runtime type

#

Set t4

#

Then re run the colab

#

GPU is wayyyy faster than CPU

junior magnet
#

Okay, I'll try

#

Ty

sharp palm
junior magnet
#

I changed it and still the audio doesn't convert

#

😭

low shard
junior magnet
#

Yes

low shard
#

When you change GPU and CPU the whole session restart so you have to do everything again

#

It doesn't seem in the background that you run the cell again

junior magnet
#

I redid everything

low shard
# junior magnet I redid everything

Are you sure you clicked save then did everything again? There should be at least some output in the background and also showing the cell running but nothing is running

junior magnet
low shard
slim schooner
junior magnet
#

Wait a minute, I'll record another video

slim schooner
simple ore
junior magnet
#

Still not converting audio

distant turtle
#

-colab

patent trellisBOT
# distant turtle -colab
📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**
• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

low shard
#

It would have been better if you showed the colab output after that it not being able to show the converted audio file, because here it shows the actual error

#

But from my guessing it might be because of the extension

#

Please try converting it, if it still doesn't work, do another video showing the colab output too at the end

sharp palm
simple ore
junior magnet
#

Aaaa so hard😭

#

I'm terrible at programming

low shard
slim schooner
#

ok training stopped after overtraining was detected, is 321 epochs good for 15 mins of audio?

low shard
#

Programming is the process of creating the instructions, code, for the computer to follow, while this is just inference (utilization of the models) of the RVC program, and AI is a bit harder to use since this one is open source and not meant to be easy to use even though forks like applio make it kinda easier

analog obsidian
slim schooner
analog obsidian
#

it's pretty easy to hear those problems

#

you'll hear the model having a static-electric type sound

slim schooner
#

i hear it, usually at the start or at the end when i say something

#

thanks man, ill see what i can do 👍

sharp palm
slim schooner
# analog obsidian

do you achieve this by supplying more data? like how many minutes of audio did you use to train this voice model? im guessing 15 isnt enough as it isnt anywhere as good as what you have shown here, even the overtrain sounds better than what i got lmaoo

analog obsidian
simple ore
analog obsidian
#

^ worse than the 42 minute model, but the quality is ok
so ideally you want your dataset to have good quality (remove noise and reverb) and diverse data

slim schooner
#

adding singing clips boosts perforance i assume rather than just talking clips correct?

analog obsidian
#

makes singing better yeah, these models weren't trained using singing tho, they were trained only using speech

#

i think it's fine to have both singing and speech in the dataset tho i have never tried it

slim schooner
#

ok and you can do that inside applio or do you need to source that outside of applio?

#

oh it looks like you deleted your messages lol but you mentioned noise can kill a model so i was wondering where you can do the cleanup and whatnot

#

say like maybe wind blowing or background music and stuff

analog obsidian
#

more of a me error

slim schooner
#

but noise still can kill a model or no?

analog obsidian
slim schooner
#

and what would i use to clean an audio file? say maybe theres like background music or something? is that the "noise reduction" option in applio?

analog obsidian
#

for just background noise (pc fans, mic noise, etc) mel denoiser aggr

slim schooner
#

aahh so the first option is only if its like people singing alongside a soundtrack?

#

and the second one would be as if people were speaking like in a livestream or something and you could hear their ac or something?

analog obsidian
#

but it cannot remove sound effects, only instrumentals

slim schooner
#

aaa got it, i could probably add like 25 mins of audio then, hopefully this helps out the quality lol

#

thanks dude, you've been hella helpful

analog obsidian
#

there's another model dedicated to remove sfx but the results are veeery bad for rvc

analog obsidian
#

rvc can't deal well with audio inconsistency

analog obsidian
slim schooner
#

makes sense

analog obsidian
#

for us it sounds fine but rvc isnt human, so

slim schooner
#

makes perfect sense actually, thanks

analog obsidian
junior magnet
junior magnet
#

The audio has finally been converted

#

Thank youu sooo muchhh

worldly magnet
#

what the best tts ai i can use my own voices for free

quiet axle
#

hi i was wondering, is it possible to run VC on a phone. Of course not locally. Lets say i start my vc on my laptop, and i host it on my local network. Can i connect into it from my phone?

low shard
low shard
low shard
#

Phones lack of a VAC

#

A VAC (Virtual Audio Cable) makes a fake audio device, used to re-route the audio of different programs

In Wokada context, it's used to get the output of wokada as the input in other programs

quiet axle
#

so theres no VAC for android?

#

or its straight up impossible

low shard
#

It's straight up impossible to use it on ur phone rn

zinc mesa
#

One message removed from a suspended account.

quiet axle
dreamy seal
#

how do i create a ai text based that uses gguf models? since i tried using chatgpt and it doesnt work

junior gull
# simple ore there's no way in hell to train a meanigfully good dataset this fast even on the...

lol yeap...it was silently failing and just going through the motions >_<
FINALLY got everythign working for real....took 5 hours....and trained model sounds beautiful!
I've used it for TTS and realtime-rvc. So happy to FINALLY have a success. I ended up getting Applio to work, after I fixed a few things it ran perfectly. RVCv2 still not running right though.
How can I post my trained voice model to the voice-models channel? I've already packed it in a zip included the index and config and a working onnx conversion too.

slim schooner
#

@analog obsidian hey not sure if you're on atm but ive come across this uvr app that im using, i know you mentioned that I should use gabox_fv4 however it's nowhere to be seen or found. perhaps i need to use a different uvr software? or if i could manually download it from somewhere and use it with the current one? same goes for mel denoiser aggr. hmu when you can

viral mango
#

Is it normal for pre-processing a 10 minute data set in Applio to be taking over 35 minutes with a good CPU/GPU?

simple ore
#

with simple slicing 50-hour set of 109 files takes just a minute

viral mango
#

Got it. Just restarted and it finished within seconds

final glade
#

Am I only one who struggles uploading model on weights.gg? For some reason it says that verification not completed and my files are corrupted but everything is fine when using in local RVC

tropic geyser
#

Hi I'm using a voice modified from male to female specifically it's called psych2go-By-Dan I'm using it on Fivem the problem is that most of the time it sounds bad and very robotic can someone help me fix it?

odd shale
#

If you got a great pc and if you're using W-Okada for voice changing, then it's likely that the issue is on your w-okada settings or the model itself.

distant turtle
#

-colab

patent trellisBOT
# distant turtle -colab
📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**
• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

tropic geyser
odd shale
#

We only provide support for OG W-Okada and deiteris' fork.

#

All other ones are paywalled garbage.

tropic geyser
#

come can i download this app? and most importantly a realistic male to female voice? @odd shale

odd shale
tropic geyser
#

oh sorry

odd shale
patent trellisBOT
# odd shale -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

odd shale
#

Read the deiteris guide.

tropic geyser
#

I tried to install it but I don't understand how to open the application

#

I can't find the folder to open the application
is there a section where I can send you the screenshots? @odd shale

low shard
#

what's your pc gpu? what tutorial link did you use? are the game settings to the minimum? can you show a screenshot of the program?

#

!howtoask

patent trellisBOT
# low shard !howtoask

How To Troubleshoot AIHC_WaitWhat

__**GIVE CONTEXT.**__ 📝
  • Don't simply mention your issue, like "my rvc is not working".
  • Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
  • The more context, the better.
__**BE POLITE.**__ <:matsuripray:1159685390156967936>
  • Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
  • It's okay if you're frustrated, but don't take it into this server.
  • Don't DM without prior consent.
__**BE PRODUCTIVE.**__ 🤝
  • Don't ask for every little instruction. Put your own effort & test things by yourself.
  • Don't ask to ask.
  • Check if your answer is a Google search away/on our guides website.
olive plover
#

Force FP32 mode: on (THIS IS OFF BY DEFAULT!) Turning this on improves stability, significantly reduces glitching/artifacting, increases VRAM usage by 200 MB.

#

I put it ( on ) will it improve the sound quality?

crude flame
#

But it’s very minor

olive plover
crude flame
olive plover
#

ON

crude flame
#

Yep

low shard
#

btw it will add a bit of delay for that little improvement

analog obsidian
#

i wouldnt call getting stable results a minimal improvement

low shard
craggy saffron
#

is there any way to... change... model names... in... okada... it... crashes... when... i.. try... to...

#

Error
unhandledrejection
no error stack

#

TypeError: Cannot read properties of null (reading 'modelSlots')

patent trellisBOT
# hallow thistle !howtoask

How To Troubleshoot AIHC_WaitWhat

__**GIVE CONTEXT.**__ 📝
  • Don't simply mention your issue, like "my rvc is not working".
  • Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
  • The more context, the better.
__**BE POLITE.**__ <:matsuripray:1159685390156967936>
  • Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
  • It's okay if you're frustrated, but don't take it into this server.
  • Don't DM without prior consent.
__**BE PRODUCTIVE.**__ 🤝
  • Don't ask for every little instruction. Put your own effort & test things by yourself.
  • Don't ask to ask.
  • Check if your answer is a Google search away/on our guides website.
hallow thistle
#

Which W-Okada version are you using?

low shard
craggy saffron
#

my gpu is 3060

#

rtx

low shard
#

fork just means modified version in IT field

#

if you make a project named "a" that just prints "a" on github, i can fork it, making a modified version that is called "ab" and prints "ab" to console

#

that's the meaning of fork

craggy saffron
#

wait im gonna go find

#

it was something nvidia cuda

low shard
# craggy saffron idk

send a screenshot of the entire program or the program folder if you really don't remember the link

craggy saffron
#

it open in web

low shard
# craggy saffron it open in web

Wokada has 2 main versions:

  • Original made by Wok
  • Deiteris fork (modified version) made by Deiteris

each version has it's own updates

the latest deiteris fork has way better performance and quality than the latest original

#

from my guessing, it might be the wokada deiteris fork, is it the version b2332?

craggy saffron
#

it perfectly works

low shard
craggy saffron
#

Is that the right one

low shard
# craggy saffron

alr it seems the latest version, the issue is in that wokada deiteris fork there's a bug at the moment where you can't rename a model after uploading it, the only way is by renaming the model file then re-uploading the slot

craggy saffron
#

just rename the original file

low shard
craggy saffron
#

which one

#

json is the accent yes

low shard
#

json isn't needed at all

#

you can delete the json

craggy saffron
#

why is json not needed

low shard
craggy saffron
#

if i get the model i use the whole model blud

hallow thistle
#

Or you can go through inside \MMVCServerSIO\model_dir\ folder, then use Notepad to edit "params.json".

low shard
hallow thistle
low shard
#

it means nothing for the program, it doesn't even utilizes it

craggy saffron
#

oh

low shard
#

but sure you can keep it and not save some storage if u really want lol

craggy saffron
#

its only 3 kb

#

so

#

theres like nothing in it

#

ok it changed the name now

#

thanks

low shard
# craggy saffron its only 3 kb

in rvc context:

  • pth files: contain the voice
  • added index files: contain the accent
  • metadata.sjon file: it's just some extra info about the model download link if you downloaded it off weights.com, it's not needed and won't impact the actual model at all
hallow thistle
#

In a voice model zip file that downloaded from Weights, there may be a json file alongside pth and index files, which is not needed for latter.

low shard
#

i mean sure do whatever, it won't really matter, tho it's usually just suggested to delete the json for saving some space

low shard
hallow thistle
#

I'm just too slow to explain.

craggy saffron
#

wait .json is not the accent

#

oh

low shard
craggy saffron
#

i thought it was the accent

#

do you need the accent

#

on 1 or 0

low shard
#

nope, that's the index

craggy saffron
#

yes do oyu need the index be on

hallow thistle
craggy saffron
#

i know voice models work without a index but is it really necessary

hallow thistle
#

The actual accent file is an index file.

craggy saffron
hallow thistle
low shard
# craggy saffron on 1 or 0

you need to play with the value,

higher value can give you a more similar accent to the trained one but can make it look like autotune

lower value will make it more like your own accent, but won't sound like autotune,

usually it's at like 0.3 or not used in realtime since it's not necessary but you can play around with it

craggy saffron
#

K

#

so if the model doesnt have a accent file i can just mash potato it with a another accent file from a different model

low shard
# craggy saffron K

u can also optionally turn force fp32 mode in advanced settings for a more stable model and slightly better quality at the cost of some delay

low shard
low shard
#

i mean u can use the index file of another model, but it would be useless

low shard
craggy saffron
#

doesnt let me

low shard
craggy saffron
#

to disable or enable

#

tahnks

#

so thats it?

hallow thistle
#

That's fast.

craggy saffron
#

yes

low shard
# craggy saffron so thats it?

yup, u can also optionally use server instead of client to have less delay with more complex steps but u won't be able to use noise/echo suppression

#

u interested into that?

hallow thistle
craggy saffron
low shard
#

which do you need? you're using client right now

craggy saffron
#

client

#

red or blue pill be faster but drain more energy be slower but have more energy

hallow thistle
low shard
# craggy saffron is this right

also i noticed that the disable jit compilation setting is off

Disable JIT compilation: off for faster loading speed of the program, on for slightly better performance (10-15 ms) for Nvidia only)

do you wanna leave it off for faster startup speed or set it on for longer start up but a bit less delay?

hallow thistle
#

The normal or original version of W-Okada launches its own separated window. The fork W-Okada should always launch up a browser, so this it is.

craggy saffron
#

exactly

low shard
craggy saffron
low shard
analog obsidian
hallow thistle
craggy saffron
#

but i know it was bad at that time

low shard
craggy saffron
#

i use vac lite for this thing

low shard
craggy saffron
#

@low shard what do you mean by slower start up

#

its the same speed

low shard
low shard
#

yeah it's not going to take like 5 mins to start up lol, pretty little difference

low shard
craggy saffron
#

K

azure patio
#

what would be the best site to train my voice model?

low shard
#

what's your pc gpu?

#

@azure patio You can check your pc gpu on Windows via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU

azure patio
#

i think NVIDIA Quadro P620

low shard
#

if so, then yeah your pc has wayy to low vram, 2gb vram are nothing nowdays

azure patio
low shard
# azure patio yeah

Train (make) RVC Models on cloud:

  1. Prepare the Dataset
  2. Setup RVC:
    Choose a cloud way to use RVC,
  • Google Colabs (max 4 hours of daily T4 16gb gpu not granted for free, not much hours for training, but easy to use, there's a paid tier):
  • Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus, either T4x2 16gb each or P100 16gb, only free):
  • Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly, Free Studios run 24/7 but require restart every 4 hours. There's a paid tier):
  1. Be sure to know about the tensorboard

Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time
If you are looking for the easiest way and for free, is using https://weights.com/ which ofc uses RVC

RVC Inference (use models) on pre-recorded audio on Cloud

You can use either:

low shard
#

there's cloud, remote good pc

low shard
azure patio
#

yeah definetly, its just my laptop that i use it sometimes

low shard
#

just like how content creation like video editing and games also need a decent pc most of the times

low shard
broken urchin
#

on w okada deiteris fork i get an average perf of 37 - 42ms so how low can i lower the Chunk setting?

low shard
broken urchin
#

just discord and wokada

low shard
broken urchin
#

ok

low shard
#

always new models racing with eachother

#

now iim not saying every single program and model but you get the point, the majority is, just like chatbots

broken urchin
#

no perms i think

low shard
#

!give-media-perms 1h @broken urchin

#

retry

azure patio
#

have not checked it that much recently, but i was wondering whether voice models for examples are progressing much, i remember my models from 2023 being not that bad compared to now

low shard
broken urchin
#

there

low shard
#

We are having improvements but mostly about performance

broken urchin
#

so how low can i lower the Chunk?

azure patio
low shard
#

There's some of our engineers experimenting

#

Right now the most up to date fork is applio with the best performance improvements

azure patio
low shard
slim schooner
#

hey guys, just wondering if i could get some help here, i originally pinged Lyery but they may have missed my message. anyways, i was running some tests on this uvr software and it seems to always produce a slight "static/hiss" noise throughout the whole clip. its not very noticeabale unless you listen very carefully but like people have said rvc aint human so theres a chance it will pick it up and ruin the whole model. Lyery mentioned on using gabox_fv4 and mel denoiser aggr for separation of vocals from instruments or background music and background noise like ac, fan, etc, respectively. however i can not find these two models. my question is, do i need to download a different uvr software or can i download those models from somewhere for me to use with the current one?

analog obsidian
slim schooner
analog obsidian
#

hmm iirc latest uvr beta can download them from the gui itself
gonna check that rn

slim schooner
#

sounds good, i may not have downloaded the beta as i was not able to find those two there, but if the beta has 'em ill download that version instead

analog obsidian
#

i personally use the colab version of uvr because the conversion can be quite slow with big datasets (anything above 30 minutes)

slim schooner
#

ahh makes sense, i decided to split them into clips instead to make it easier somewhat.

analog obsidian
analog obsidian
#

because for rvc training you have to split the whole dataset into 3s chunks

#

the process is like this:
merge all files into a big one singular file
truncate silence
simple slicing in applio using 3s and 0.3s

slim schooner
#

i could use adobe premier to combine them all in one right?

#

audacity is a bit weird and i dont like its ui

analog obsidian
#

you have to use audacity to truncate the dataset silence

#

otherwise results are going to be bad

#

rvc kinda hates silence

slim schooner
#

i could do that, i meant as just mainly the combining part, like combine all the files into one export that and import the single file into audacity?

analog obsidian
#

yup

#

installing the uvr models is easy

#

be sure to download the ckpt file alongside its yaml

#

then just follow the gui instructions

#

remember to export the file as wav 32bit

slim schooner
#

alright so i combine all files into one using adobe, export that as wav 32bit, and do the cleanup in uvr and then use the cleaned clip in applio?

analog obsidian
#
  1. combine all files into a big singular one
  2. clean the dataset
  3. convert to mono and truncate silence
  4. in applio select "simple mode slicing" and use the default settings of 3s and 0.3s
  5. f0 extraction
  6. train
slim schooner
#

got it

#

thanks, you've been hella helpful ngl 👍😁 hopefully it goes smoothly

analog obsidian
#

when cleaning the dataset don't use any type of eqing or compressors

slim schooner
#

gotcha, ill keep note of that

analog obsidian
slim schooner
#

niceee this is gonna be helpful, thanks for that 👍

azure patio
#

does anyone knows what is causing this? i am copying the path of my audio file and im assuming its not finding it, when i try to pre-process

simple ore
worldly magnet
harsh nexus
#

I want to do the voice of Mario Bros.
I want to download it for an animation.

brisk trail
#

Hello dear peoples, I have a question, So I'm working on an Alex Mercer voice model and I want it to be as high quality as possible, it's dataset length is: 00:30:58
What's the best batch and epoch for it?

brisk trail
#

Forgot to mention, I'm a newbie at training models and my goal is to become a model maker, this is my first model.

north galleon
#

i think weights is down again

#

or it’s at least having technical difficulties

#

i keep getting no healthy upstream errors

slim schooner
#

i went ahead and downloaded gabox and i get this error, not sure what went wrong here, sorry for the amount of screen shots lol, what would the fix be here? anyone know what went wrong? not a coder so whatever happened i have no idea 😭

analog obsidian
slim schooner
#

dang forgot about that, lmaooo let me do that

restive kiln
#

-realtime

patent trellisBOT
# restive kiln -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

restive kiln
#

How do I fix this? It does not show my mic

low shard
restive kiln
low shard
# restive kiln Edge

Be sure you gave it microphone permission, you may also want to try if it works on chrome or firefox

restive kiln
#

oh god.

#

I thought I allowed mic.

#

I pressed block

azure patio
slim schooner
#

also just one question, i split the clips (beforehand like i said) but some clips have music while the ones that dont have any music do have paper flipping, would it still be fine for me to combine all clips and run them through both cleaning methods? or should i clean them separately and then combine them in adobe and then run that single file through audacity for truncate silencing and then export it to applio for training? lmk what you think

empty pecan
#

hi sorry if this was asked a dozen times, but how/where can i use the Cloud RVC?

simple ore
#

like G:\training, not G:\training\character.wav

azure patio
spare marlin
#

Also guys how to make ai cover the rvc v2 thing got shut down in google collab cuz no longer free

low shard
low shard
low shard
#

did you check it?

#

because it's pointless to use cloud when you got a good pc

empty pecan
low shard
#

there's many

empty pecan
#

AMD ryzen 5 7535U with radeon graphics

#

thats whats listed on my device manager like 15 times

low shard
empty pecan
#

oh sorry...

low shard
empty pecan
#

AMD radeon(TM) graphics?

low shard
#

like a dedicated amd gpu

empty pecan
#

no idea where to check

low shard
#

or is it an old laptop

empty pecan
#

yes

low shard
#

it's better to check

empty pecan
#

i have no idea what im looking at, but it says the dedicated gpu memory is 0.8/1.0 gb

#

same gpu

low shard
#

!give-media-perms 1h @empty pecan

empty pecan
graceful ocean
low shard
# empty pecan

nvm, your pc is extremely way too weak for local ai usage

empty pecan
#

mhm!

low shard
#

Train (make) RVC Models on cloud:

  1. Prepare the Dataset
  2. Setup RVC:
    Choose a cloud way to use RVC,
  • Google Colabs (max 4 hours of daily T4 16gb gpu not granted for free, not much hours for training, but easy to use, there's a paid tier):
  • Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus, either T4x2 16gb each or P100 16gb, only free):
  • Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly, Free Studios run 24/7 but require restart every 4 hours. There's a paid tier):
  1. Be sure to know about the tensorboard

Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time
If you are looking for the easiest way and for free, is using https://weights.com/ which ofc uses RVC

RVC Inference (use models) on pre-recorded audio on Cloud

You can use either:

low shard
#

gave the list to u

empty pecan
#

thanks

#

imma try to use my brain now

upbeat carbon
#

Quick question: should i use seed vc instead of realtime voice changer?

#

Cuz idk

low shard
upbeat carbon
low shard
#

also, you want to do realtime voice changing for calls/games?

upbeat carbon
low shard
upbeat carbon
upbeat carbon
low shard
#

which do you need?

#

RVC = Retrieval-based-Voice-Conversion, it's the best quality few shots training Speech To Speech AI

upbeat carbon
#

But in there you don't need a pth and a index file, anyways can i sent a video so you can understand what i mean?

low shard
upbeat carbon
low shard
#

are you sure you want to use seed vc compared to rvc?

upbeat carbon
#

Not sure..

low shard
#

also, i asked you if you wanted to use ai for pre-recorded audios or realtime for calls, because there's different programs

#

id need to at least know this

upbeat carbon
low shard
low shard
# upbeat carbon Yup

alright,
RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.

Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)

Your pc gpu is good enough, you can do it locally, idk who told you it isn't

What you seem you need is Wokada deiteris fork for the best realtime inference, which you can get by reading https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/

if you want, you could optionally also use https://github.com/Plachtaa/seed-vc which is less used though, since yes you don't have to train models since it's 0 shot, but at the same time it will be lower quality

0 shot is kinda like a student who read 10 pages once, and few shots is like a student who read those 10 pages multiple times, ofcourse the 2nd student should perform better at the test

#

personally, i'd suggest you wokada deiteris fork, but it's your choice

upbeat carbon
#

Oh okay! Can you tell me why you recommend deiteris? I'm curious

low shard
#

a properly trained rvc model should have better quality compared to another ai that doesn't train

#

-# ofcourse, if the model is trained on garbage, the model will be garbage

upbeat carbon
low shard
upbeat carbon
low shard
upbeat carbon
#

But how do i sound accuretly to the character that i want?

hallow thistle
upbeat carbon
hallow thistle
mortal topaz
#

Is it possible to reduce artifacting somehow?

#

Like, can using an embedder help?

deep nova
#

Anybody got a good video to help me create a good rvc model for the first time

low shard
#

There's only written guides

#

What's your PC GPU

deep nova
#

idk if its good enough

worldly ibex
#

is there a model i can run locally that can work from youtube? i dont wanna seperate the voice and shit

eager shuttle
#

The biggest problem I have is installing I have the files, but I always have trouble with the torch section especially with having to command prompt it never works

potent bone
#

i literally know nothing about making voice models should i use applio or codename fork? and also is there a guide that teaches me how or what to do?

viral mason
#

If I apologize to Weights will they be nice again and verify all my models

tight ether
viral mason
#

I have no experience with codename tho so can't help there, dm me whenever you're ready

odd isle
viral mason
#

I don't think there is a difference

stiff valley
#

guys i need help there's an echo where it seems like the voice changer is picking up the changed voice, and applying the changer again. Is there a way to fix that?

low shard
low shard
#

what's your pc gpu?
show a screenshot also of the program

low shard
low shard
#

what's your pc gpu

#

what do you want

#

what do you want to install

#

be sure to not use video tutorials

low shard
viral mason
#

what he said

low shard
#

this is why u should always check ur pc before using cloud

low shard
stiff valley
viral mason
#

I dislike local because it makes my vr run slow

low shard
stiff valley
#

i think i cant

low shard
stiff valley
#

where can i send it

low shard
#

try now

stiff valley
low shard
stiff valley
#

yessirski

low shard
#

this is an old version of original wokada

#

and the settings are completely wrong lmao

low shard
#

they are outdated asf