#✨│ai-help

1 messages · Page 229 of 1

warped latch
#

Hi guys, I've tried local RVC voice changer (I think it was W-Okada) a while ago using older AMD GPU with no tensor hardware (RX 5600 XT) and it worked but wasn't great.

Does it just use CUDA / OpenCL? Or does it also use ML hardware like tensor cores?

Now that I'm looking to upgrade my system, what kind of GPU is required to do it more smoothly?
I'd like to be able to run it while gaming too. Would having extra VRAM help with that?
I was thinking of getting an RTX 5060 Ti 16GB. Would it be good at that?

simple ore
warped latch
simple ore
#

5060ti is a waste of money, do not ever get 8GB

warped latch
simple ore
#

extra vram can help.. if you play in 1080 ot 1440p there's usualy enough left to run a voice changer

#

but it is a competition between the game and ai, some tweaks are needed to run both smoothly

knotty moth
# warped latch Hi guys, I've tried local RVC voice changer (I think it was W-Okada) a while ago...
  • it is actually onnx/directML for AMD cards under Windows, but the fork version has better optimization on it https://rentry.co/forkvoicechangerguide
  • I'd suggest to get RTX 40-series such as 4060 Ti 16 GB. 50-series cards can't run the voice changer due to using the stable torch version. (too bad I lost the link to voice changer for 50-series misc_cry )
warped latch
#

Yea I could set FPS limit to like 70 or 80% of max fps to leave GPU headroom but my question is if the 5060 Ti would need like 90% of the GPU to run it smoothly or if it can do it more easily while gaming

knotty moth
#

sorry but I can't decipher what you mean the percentage?
perhaps you mean the model fusion ratio?

warped latch
knotty moth
#

not only that 50-series are also notorious for several known issues plaguing them

warped latch
#

So the 50 series are in like a beta phase? and will get full support once the stable gets an update?

warped latch
knotty moth
warped latch
knotty moth
warped latch
#

Ok so if all else equal (which it isn't) I should get an RTX card

#

I'm not very hyped for 5060 Ti but it's like the least bad card under 600 euros where I live.. I'll see what AMD releases soon but I have doubts..

knotty moth
#

for most AI applications, 4060 Ti is a solid choice, its 16 GB vram is just enough

warped latch
#

At least it should be able run voice changer smoothly even while playing a lighter game like an esport game

knotty moth
#

to stretch your budget, you can consider 4070 Ti super

#

for the same 16 GB

warped latch
knotty moth
warped latch
#

Where I live you can easily get the 5060 Ti 16 GB for 449 euros (basically the MSRP here) while 40 gen is usually same or more expensive

knotty moth
#

let's see how the market would respond to

warped latch
#

I think 4060 Ti 16GB is over 500 euros

untold marten
# knotty moth sorry but I can't decipher what you mean the percentage? perhaps you mean the mo...

think of the percentage like volume: when u are setting to 0, u can't hear anything right? and at 100%, that's loud. Apply the same principle here:

processing slider:

(your voice) (model voice)
0%...........................................................100%

currently we can like either set to 0% and that would be the stop button
and start would be 100%. would there be a way to create a slider which allows to like change the way program process the voice input in increments of 5? like 5% voice model and rest 95% your original voice? and that slider is availably even when using it like pitch slider.

another example would be the DO-RE-MI-FA-SO-LA-TI-DO-RE exercise where u start with a low note (deep voice) and raise slowly to high notes then go low at the end again.

analog obsidian
simple ore
#

@untold marten train your own voice model.. .then merge it with other voice model with some blend rates, 10%, 20%, 30%, etc

knotty moth
#

ye model merge is what I thought

analog obsidian
#

oh he meant he wants the model to sound more like him

simple ore
#

it wont sound like him, it will sound like a blend

#

a bit of this and a bit of that

analog obsidian
#

yea im aware of that, tried that once

#

i was trying to understand his message lol

simple ore
#

since it is just a merge of weights, the result wont be an actual blend between voices

#

it is a blending of data which may or may not sound good

untold marten
#

like a slider that when set to 0%, it's just your original voice. when raising, it's slowly/gradually changing the voice to the model set by you

analog obsidian
#

i remember there was a voice changer guide that also did that
supposedly it allowed the model to be able to inference your voice better

untold marten
#

lemme find a video to show better'

#

tbh idk if thats even possible

knotty moth
analog obsidian
knotty moth
analog obsidian
#

would be better to just speak like the model rather than merging

untold marten
#

ok, found a video to show what i mean. it's ok/am i allowed to post?

#

as i know so far, most links are prohibited and few like the rentry guide/aihub website are allowed

rotund spindle
#

what do i do with the models
i am not used to this new technology hehe

untold marten
#

it would be prob good if we can achieve such results by just merging the weights , since that means this feature could be done without starting from scratch

rotund spindle
#

what voice changer should i use?

untold marten
#

on short, this is what u need to specify:

  • what's your pc gpu
  • what tutorial link did you use
  • a screenshot of the program
  • whats the issue
rotund spindle
#

how do i use these models

restive kestrel
#

Hi everyone, sorry to interrupt but is it ok to post a job announcement here?

rotund spindle
#

i did it nvm

hallow thistle
untold marten
#

wanted to say the same thing but my message got being held as a hostage :))

hallow thistle
restive kestrel
tough fiber
#

--okada

#

what was the command

tame mica
#

-realtime

patent trellisBOT
# tame mica -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

hallow thistle
untold marten
#

would be fine to tag Lyery, Noobies and Makidanyee regarding what i mentioned before and allow me to post a link from a yt video showing what i mean?

untold marten
#

hope it wont be deleted tho'

#

video: https://www.youtube.com/watch?v=s1HPMV2EMGo

short description: a feature that can achieve that transition from your original voice to the model you set and vice versa like this person can achieve using his own voice. (this person doesn't use rvc and solely using his own 7 years trained voiced and does voice trolling)

timestamps:

  • 7:10
  • 7:55+
#

oh, it works

untold marten
#

thats why i said before that i'm not sure if it's even possible

#

guess it's also taking more processing power than usually does

analog obsidian
#

w-okada has a bug that everytime you change a setting, it'll use more resources

#

idk why

untold marten
#

huh

#

a setting, not a model?

analog obsidian
#

to be honest, i believe doing anything will cause that

#

restarting the thing fixes it

#

closing it and opening it again

untold marten
#

but the fork from deiteris is not having that bug, right?

analog obsidian
#

in the original w-okada is worse because every slot after 1 also has increased gpu resources

#

at least in the fork that doesnt happen

untold marten
#

weird. while i don't know how this entire thing works, but imagine that it would make sense if with each setting modified or a few key settings, a new instance of processing the model is created and the previous one isn't disposed of? or idk but hope the devs can manage to fix that (no offense towards anyone) 🙂

knotty moth
lyric fossil
#

does anyone use deiteris on collab

#

ive used less than 5 minutes and colab disconnected me

#

it says due to executing code that is disallowed

viscid moss
#

isn't patched ig

golden girder
#

where to change epochs?

valid spruce
#

Hi guys

golden girder
valid spruce
#

How do I remove those low voice effects from vocals?

golden girder
#

there might be 2 reasons

valid spruce
golden girder
#

because of how much

#

u put index

#

max

#

or

#

F0 DET..: u choosed is wrong

sly furnace
lyric fossil
#

which rvc cloud you guys using?

#

mainline colab is broken :/

mossy ocean
#

HELP!!!!

#

при открытии файла start_http.bat выскакивает ошибка

#

when opening a file start_http.bat I have files downloaded and then an error pops up and nothing opens what to do

knotty moth
# mossy ocean when opening a file start_http.bat I have files downloaded and then an error pop...

if not sure, try the fork version that shouldnt have issue like that https://rentry.co/forkvoicechangerguide

mossy ocean
soft zealot
#

for an llm with personality

mossy ocean
wild bobcat
#

why dont my pth and index files show up?

#

am i missing something

oak hornet
#

which ai trainer is better for google colab

white ice
#

I need help, there is no gpu option during the voice conversion I can use my cpu, how can I make the program use the GPU?? (in W Okada)

wanton valley
#

Yeah? what is this "better" method? hope is similar to this colab

#

-colab

patent trellisBOT
# wanton valley -colab
📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**
• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

wanton valley
craggy saffron
#

is there any w-okada sport?

#

support?

#

wait i fixed it

analog obsidian
#

Latest mainline realtime inference runs better in nvidia than both w-okada and deiteris fork

#

Clock speed of mainline

#

Clock speed of deiteris

proud hinge
#

Im having troubles with the Real time voice changer, it keeps buffering and not play the full audio just parts of it how can I fix this. I have a AMD RX 6600 and I want to use the audio in discord or any other games.

formal locust
#

hi, when i upload my ai voice model, it doesnt seem to pick up my voice

#

compared to the presets

#

when i use passthru it picks up my voice but the vol right next to the picture doesnt fluctuate at all

brittle wing
#

im trying to download for 5000 series but when i try to extract it says the same volume can not be used as the same source and destination

#

@odd shale @rustic ridge

viscid moss
#

-rt

#

-realtime

patent trellisBOT
# viscid moss -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

viscid moss
#

The first link one?

brittle wing
#

yes

#

for 5000 series

#

i have 5070ti

viscid moss
#

try using 7-zip to unzip the files u need

#

7-zip it's a program

brittle wing
#

from google?

viscid moss
#

ye

brittle wing
#

which one

#

first?

viscid moss
brittle wing
#

omg im stupid it literally says to make sure u have 7zip in the guide

#

sorry

viscid moss
#

dw

brittle wing
brittle wing
#

i extracted with winrar

#

but when i open its just a black screen

dusty rampart
formal locust
#

having some issues with using virtual audio cable atm

#

all my inputs are the vitual audio cable

#

and my virtual audio cable is set to default

brittle wing
#

do i need to change any of these

#

to help with voice lag

oak hornet
#

please say me free ai trainer

kind drum
#

I am using RVC and I couldn't see the usage of RTX in performance monitors

oak hornet
#

how to use mainline rvc in colab ?

worthy saffron
#

Hey, is anybody good at clipping/making AI voice overs. Can send references, dm me. Money ready.

craggy bough
knotty moth
#

also you don't need to make a support tiket for your question

sweet wren
#

anyone know how to fix the pytorch installation on applio for 50 series?

#

been spending hours

hallow thistle
sweet wren
#

ik i need the nightly build, but cant figure out how to get my applio to run on the nightly build

hallow thistle
sweet wren
#

cant get pytorch 2.8.0 nightly to install in the applio script

sweet wren
#

this really should be pinned. incredibly helpful thank you so much bro

pastel oak
#

Increase chunk to 128

pastel oak
# formal locust and my virtual audio cable is set to default

Change defaults in audio device settings to your normal stuff

https://rentry.co/ForkVoiceChangerGuide in step 3 virtual cable its explained

analog obsidian
pastel oak
pastel oak
#

There was an update? Pog

analog obsidian
#

yuh

#

now it forces both input and output to use the same api

#

no longer mixing mme with wasapi

#

fcpe in mainline uses like 1gb of vram

#

and rmvpe 2gb

pastel oak
#

Nicee

analog obsidian
#

and you can do fp32 inference too

analog obsidian
#

fp16 should use even less

pastel oak
#

Did this come with a prepackage release? I remember the audio driver type was changed before, but the latest release which most people end up downloading is still mixed

analog obsidian
analog obsidian
#

the new package is only available in pan baidu

#

for chinese users

#

but the changes are in both fumiama's repo and rvc-boss repo

#

the installation is easy, create a env, install requirements for the voice changer and after installing those, you can use it

#

they changed a lot of things since 2023

analog obsidian
#

boss also even added support for 50xx gpus

#

original rvc devs are aware of realtime being popular

#

for me it makes sense mainline realtime runs better now, its made by literally the people who created realtime inference in the first place

so uh yeah i wanted to mention that because i like how fast mainline is now, and the best part, it doesn't run in your browser misc_trolley

pastel oak
hallow thistle
#

Me who doesn't have a good GPU: ankha_siparissip

sour niche
#

Does anyone want to learn AI with me?

hallow thistle
brisk beacon
#

Is the Voice Changer compatible for Discord?

hallow thistle
brisk beacon
hallow thistle
hallow thistle
brisk beacon
sly furnace
outer isle
#

I put in a datsset of Blackiana and I want to put it in crepe. How to do that on Applio in Google Colab?

orchid flint
latent yacht
#

you guys know a way to get the funny autotune effect in realtime?

#

how do you do that?

#

I've seen people use that in-game

orchid flint
#

Voicemod most likely

analog obsidian
#

that is still technically "mainline" because the person who made this repo is part of the original rvc dev team

errant parrot
#

how do i use the model with huggingface

low shard
errant parrot
low shard
median lance
#

is it possible to merge two voice if they have different sampling rate? when i tried doin it, web-page wont allow me to select them

crude bolt
outer isle
#

I put in a datsset of Blackiana and I want to put it in crepe. How to do that on Applio in Google Colab?

dusty rampart
simple ore
median lance
simple ore
#

technically models can be converted, but they need some training after that

#

technically the mismatched parts can be taken from one of the model

#

would not recommend

#

but it is doable

cedar sky
#

Hi guys, does someone know great video-to-video model which is NOT changing background. Or its not possible without train custom LoRa?

median lance
simple ore
#

that's the problem with that approach

median lance
#

what would be a way?

simple ore
#

as I said, while merging copy the higher SR generator weights, blend other weights

simple ore
#

hm.. not quite

nimble copper
#

how can I connect rvc to discord?

#

any guide for it

low shard
nimble copper
low shard
#

Wokada has 2 main versions:

  • Original made by Wok
  • Deiteris fork (modified version) made by Deiteris

each version has it's own updates

#

You can uninstall all u got before since ud need Wokada

nimble copper
#

any offical download link for wokada?

#

by wok

low shard
#

Your GPU is good enough

low shard
#

-realtime

patent trellisBOT
# low shard -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

low shard
#

1st link

north basin
#

What’s with the black screen when I open the app?

low shard
north basin
#

The weights app. When I go to open the app, it shows the logo and then turns into a black screen

olive cliff
#

What's the best settings to use for Renegate?

plain zinc
#

Do you know if I can run the program with my laptop that has an RTX 2050? I'm not at home and therefore can't use the desktop, but I'll try when I get back.

gilded trail
#

Can anyone tell me how I can use voice models to modify my pre-recorded audio to have that voice? I already have the zip with the .pth and .index of the voice but i dont know how to apply it. My GPU is NVIDIA GTX 1050 Ti

odd shale
#

But you can anyway try and install it

#

Just go to the "Releases" tab

#

In fact they released the 3.2.9 version today

#

Also there you have the docs.

low shard
plain zinc
low shard
# plain zinc RVC w./ w-okada if I'm not mistaken

those are 2 different programs

RVC = Retrieval-based-Voice-Conversion, the best Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models

Wokada = uses RVC for realtime inference

plain zinc
#

wokada

#

or either of the two

low shard
low shard
#

they serve 2 different purposes

plain zinc
#

I don't think I'll use it in heavy games, it would only be Roblox and VR chat.

low shard
plain zinc
low shard
# plain zinc Do you know what configuration is recommended in the program?
#

also cap games fps, and you might want to play with the extra (lower extra = less delay but also less bit of quality)

plain zinc
#

I guess what the GTX16 would be

low shard
plain zinc
#

Later I will try with my microphone that I play with but my brother is using it

#

still, thanks

royal juniper
#

guys I found some nice AI voices in weights. are all of them copies of someones voice? it says that it's an original voice, is it possible to make an original voice with ai voice that doesn't belong to anyone?

inner trout
#

all ai voices are trained off of real.. voices, or else there's no progress made.

simple ore
#

but you cant built it out of nothing

royal juniper
glossy cipher
#

Hey everyone, If I send someone a one minute dataset, can someone show me where Im failing? My AI model sounds nothing like I thought after I trained on Apollio

simple ore
outer isle
#

I did put the Blackiana dataset in a folder and now I want to put in "crepe" on Applio in Colab. How do I do that?

#

A Ai model in "crepe" in good quality?

glossy cipher
simple ore
inner trout
#

what??

#

i was answering someone's question are you good?

glossy cipher
simple ore
#

too large, let the app spli things

glossy cipher
#

I have Audio Cutting on, Training it now

#

Whats the reccomended?

#

Usually Ill do a minute set with 9 second splits

#

10 minute set*

inner trout
#

Looking for ways to make a voice sound more natural and realistic, so far I've got some background noise / taps in the back that play randomly. Does anyone perhaps know of an equalizer setup or badmic setup I could use?

crude flame
inner trout
#

What about things that would work live?

crude flame
#

lighthost and some vsts

#

for live

inner trout
#

Huh okay

#

Wow why didn't people tell me about lighthost before, this is pretty simple and light weight

#

Thanks

knotty moth
#

yea try lighthost

knotty moth
crude flame
knotty moth
analog obsidian
#

adding it on the fork guide would be nice

#

also we need a new mainline realtime guide because now its better than both w-okada and the fork

#

there's no compiled version for that tho 👀

knotty moth
crude flame
#

any suggestions?

#

make a tab dedicated to making models sound "real"?

knotty moth
crude flame
#

it is just shad's guide

#

but its there

analog obsidian
low shard
#

Isn't mainline almost abandoned since they work on GPT-SOVITS

analog obsidian
#

is not abandoned

#

last update was in 2025

#

boss added support for 50xx

#

and the realtime inference update was in october 2024

analog obsidian
#

last actual update was in dec* 2024

low shard
hallow thistle
analog obsidian
#

the compiled version for that is only in pan baidu

broken vortex
#

what do u open to get the okada running by

#

i forgot

#

@analog obsidian Cn u belp.

hallow thistle
low shard
#

@pastel oak , @knotty moth pls talk about that in staff chats, not here

lyric fossil
jovial fox
#

hey is there anyone that can help me with the w-okada i have a high end pc but dont know good settings

low shard
#

What’s your pc gpu? What tutorial link did you use?

jovial fox
hallow thistle
low shard
#

there aren’t chunks and extra that go well with every single pc

jovial fox
low shard
#

If u don’t remember, you can send a screenshot of ur wokada

hallow thistle
#

There are two different versions of W-Okada; one is original and Detris fork.

low shard
#

!give-media-perms 1h @jovial fox

knotty moth
# jovial fox i 9 13900K and a 4080
jovial fox
hallow thistle
#

If you've watched what "tutorial videos" on YouTube saying to you, those ones are outdated, they only tell you to install the original version of W-Okada.

brittle wing
#

Warning this AI server is bri*ish scary

low shard
hallow thistle
jovial fox
low shard
#

-realtime

patent trellisBOT
# low shard -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

low shard
#

1st link

hallow thistle
low shard
#

** video tutorials are outdated**

#

that one is an old version of original wokada

#

get the wokada deiteris fork instead

jovial fox
#

okay ill uninstall then thank you

hallow thistle
low shard
jovial fox
#

yea vb audio i have okay i uninstall everything

hallow thistle
jovial fox
low shard
# jovial fox is it right that it opens a browser when i install ?
glad scaffold
#

How do I make a chat bot

thorny cargo
#

After two successful models I've made I wanted to step up the dataset and so I did. at first I ran into the issue surrounding the index not being able to be generated, but after finally fixing that I am now having this issue

#

I've had the same issue on Kohya (stable diffusion image model training software)

#

But I resolved it by quite literally removing the variable

#

I wanted to first ask here before anything

#

the line in question

#

I've seen a few others have this issue in the server but their questions were met with no help

low shard
low shard
thorny cargo
#

I am well aware, im just surprised I got the same unbound local error

#

on two completely different platforms

#

🤦‍♂️

thorny cargo
low shard
thorny cargo
#

Could it be that a 40 minute dataset is problematic for my vram

#

actually nah I doubt it

#

this error seems unrelated

low shard
#

Be sure that applio, windows & your drivers are up to date tho

thorny cargo
#

Yup

thorny cargo
#

this maneuver is gonna cost us 51 years

#

ssd btw

low shard
thorny cargo
low shard
thorny cargo
#

oh...right....good point

#

I have 7zip

#

did not use it

low shard
hallow thistle
thorny cargo
#

joe_shrug but ill look into the ssd issues

#

I have samsung magician for that afaik

#

what the heck is this

#

I didn't see this in 3.2.8

#

What even is the call on this, like how do I gauge what I set it at

low shard
thorny cargo
#

ahhh

low shard
#

there’s a description for most features in applio btw

#

as you can read below the feature title

thorny cargo
#

Question is how do I gauge what I set it to

#

between 2 and 10

#

since I already know to put something (there isn't much silence within the dataset)

#

I'll just leave it at default for now

thorny cargo
#

much appreciated

#

It's gonna take a while to train but slow is better than nothing

uncut raven
#

am wanting just a rvc to put in audio buits and get the ai converted audio whats the best way to get that

thorny cargo
#

as in, put in an audio file as an input

#

and it will do the conversion

uncut raven
#

ahhh i have the old version

thorny cargo
#

I had the version from 2023

#

and that one does it

hallow thistle
#

-gui

uncut raven
#

welll....

patent trellisBOT
uncut raven
#

maybe im dumb

thorny cargo
#

you mean

#

clueless

uncut raven
#

that too

thorny cargo
uncut raven
#

i been using the realtime for years

#

no issues

#

not completely clueless

#

as such as out of the loop

hallow thistle
#

Applio is an RVC program that can both do AI cover and train voice model in one program.

thorny cargo
uncut raven
#

i just installed wokoda

hallow thistle
#

W-Okada is realtime voice changer that uses RVC voice model for inference.

thorny cargo
hallow thistle
uncut raven
#

3060

hallow thistle
uncut raven
#

yeah

hallow thistle
#

Good. iorithumbsup

uncut raven
#

figured it out on my own

#

im a big boy

#

so much easier than the old onme

uncut raven
#

would u be a chad and te ach me real quick how to use it

low shard
#

updating solves everything 🙏

low shard
uncut raven
#

im dyslexic

thorny cargo
thorny cargo
uncut raven
#

ill kiss u if u do that again

thorny cargo
uncut raven
#

just how do i throw audio in and get ai back

low shard
thorny cargo
#

under the input section for devices etc

uncut raven
#

yah

#

lemme guess

#

file

thorny cargo
#

see if theres a "file" one

#

yes

#

...

uncut raven
#

big brain

low shard
thorny cargo
uncut raven
#

do you use crep

low shard
#

rvc is for pre-recorded audios

#

wokada is for realtime calls like discord vc

uncut raven
thorny cargo
#

I think?

#

i've done it before

#

huh'

uncut raven
#

just not as reefined

thorny cargo
hallow thistle
#

This is Detris' fork W-Okada.

low shard
#

@uncut raven do you need only pre-recorded audios? Or also realtime usage in calls?

uncut raven
#

both

#

i used the old one

#

for gaming and stuff

#

but now i want it for pre recorded

#

aswell

low shard
#

the issue is that video tutorials are old, since ai changes at sonic speed and you can’t easily update a video

#

it’s the reason why we have written guides instead boohooh

thorny cargo
#

You should see things on the img gen end, stuff from 2-3 months ago is out the window

#

lmao

uncut raven
#

ohh

thorny cargo
#

but It could be'

#

I should have batched 16 instead of 4

#

I was told batch of 4 means higher quality

#

but given the time it will take, is it worth the time increase

low shard
low shard
#

i use comfyui

thorny cargo
thorny cargo
#

its similar to the old Automatic 1111

#

just better

hallow thistle
thorny cargo
#

comfy is nasty

#

I would never

low shard
#

i like comfyui for having day 1 support for shi

#

but yea it’s harder to use

thorny cargo
uncut raven
#

how much index do you suggest using or does it depend

low shard
thorny cargo
#

I was told to use 0 most of the time

#

since it has the least restriction

#

1 is like, closest to the original source

upper furnace
#

sorry one thing how do i create beatrice templates personlized for voice changer?

low shard
low shard
#

there’s very few, they aren’t preferred for being lower quality and mostly focused on being lightweight

thorny cargo
#

its subtle so not egregious but

#

you hear it

low shard
thorny cargo
#

Yeah I have to completely refamiliarize myself with tensorboard

#

focus on g loss total and the other loss total right

low shard
#

be sure to use rmvpe, since it’s more robust to noise compared to other pitch extraction algorithms like crepe

thorny cargo
#

Yes I use that

thorny cargo
#

ranging from 200 to 500

low shard
thorny cargo
#

on a 14 minute dataset

low shard
#

there’s also a more advanced explaination in the docs about the tensorboard btw, explaining you each graphs

thorny cargo
#

got this

low shard
#

i would really suggest you to give the docs a good read since they got alot of good info

#

and also still thanks to @crude flame for maintaining the docs, you did a great job

thorny cargo
#

always stop once the graph starts to perk back up

#

atp its overtraining

#

I remember when this server had to be remade

#

been around a long time

thorny cargo
low shard
#

-# idon’t remember u tbh

thorny cargo
#

oct of 2023?

#

i think

low shard
thorny cargo
#

hmm

low shard
#

actually

thorny cargo
#

odd

low shard
#

4 october 2023

#

iirc

thorny cargo
#

why did you reply with 2024

low shard
#

got confused lol

thorny cargo
#

oh lul

#

yeah

#

it was tragic cuz a lot of the models were gone or supposedly uploaded to weights

low shard
#

yup

thorny cargo
#

why is that

#

don't we need the index for rvc

thorny cargo
#

Uh some voice models whether for characters or just in general

#

been a while so idr them off the top of my head

#

why do you ask

low shard
#

index is optional since it’s just the accent, but model makers should upload both pth and index

thorny cargo
#

yeaaa, ok so next time i encounter that

#

do I let someone know or what

low shard
#

might be sometime ago a model maker accidentally forgot to upload it but yeah u should prob ask the model maker

thorny cargo
#

oki

thorny cargo
low shard
thorny cargo
#

Got it

#

Nice

#

I assume that this is like this because I had cancelled the training and restarted it with a different batch size

low shard
#

just redo the training atp

thorny cargo
#

back to square one

#

LMAO

low shard
analog obsidian
knotty moth
#

but there are lot of customizable comfyui workflows you can search in civitai, etc

outer isle
#

Where’s the mangio-crepe in Applio in Colab? I don’t see it

thorny cargo
#

Ive used it though

#

so I can at least attest to that

#

anyway WHAT THE HELL

#

the graph is meant to go DOWN

#

then UP

#

@low shard

#

this is the real graph

#

way overtrained

#

gonna stop it now.

upper furnace
crude flame
thorny cargo
crude flame
#

did you change the batch size?

thorny cargo
#

(i stopped training, changed a parameter and started it again)

#

yes

crude flame
#

oh

#

dont do that

thorny cargo
#

just start a whole other one?

#

or wha

crude flame
#

yea

thorny cargo
#

oki

thorny cargo
#

yipee its normal

simple ore
upper furnace
#

ok

simple ore
#

let me know how it goes

upper furnace
simple ore
thorny cargo
#

can't I use a hugging face link?

#

does it have to be Weights?

thorny cargo
#

nvm

coarse pier
#

ey i wanna clone my own voice, how can i start?

low shard
coarse pier
#

is there no collab for it?

#

yikes

#

mine is a gtx 1650

low shard
# coarse pier is there no collab for it?

there is but we ask the pc gpu because we need to check if it's good enough to run it locally first, there's people with a good pc too, and it happened in the past that people with an rtx was using colab bc they didn't know any other way

low shard
#

I would suggest cloud

#

Train (make) RVC Models on cloud:

  1. Prepare the Dataset
  2. Setup RVC:
    Choose a cloud way to use RVC,
  • Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
  • Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
  • Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
  1. Be sure to know about the tensorboard

Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time
If you are looking for the easiest way and for free, is using https://weights.com/ which ofc uses RVC

RVC Inference (use models) on pre-recorded audio on Cloud

You can use either:

#

Mainline is broken as right now though

left dirge
#

Which is the current best RVC version? For AMD, I kind of can't find where to download the updates.

#

-realtime

patent trellisBOT
# left dirge -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

left dirge
#

I don't think the version from 7th of december is the latest one

#

By the way, I'm currently using DirectML from a year ago, just want to know if there's a better one now for AMD (I have a good card, the RX 6800), or if there's an update on DirectML

low shard
left dirge
#

Or if there's some other fork I should use.

low shard
#

that's the best for amd rn

left dirge
#

No clue if its that version, can you send me the link? I'll download it and just use that one

patent trellisBOT
# low shard -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

low shard
#

1st link

#

it's inside the wokada deiteris fork written guide

#

i feel like you were using an old version of original wokada off youtube lol

left dirge
#

This one then?

left dirge
low shard
left dirge
#

hMMM

#

Just a few versions behind

#

Let's see if my model won't sound like a fake robot now with this version, by the way, got any reccomendation for English sounding models?

#

I tried searching a few months ago but I didn't get anything decent

analog obsidian
#

realtime changes are only for perfomance
also rvc does not mean realtime voice changer just in case

left dirge
#

But- It says Realtime Voice Changr

low shard
# left dirge But- It says Realtime Voice Changr

RVC = Retrieval-based-Voice-Conversion, the best Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models

Wokada = uses RVC for realtime inference

analog obsidian
#

blame wok for naming his realtime gui like that

left dirge
#

Ohh, fair enough

analog obsidian
#

so you are running a realtime inference gui for rvc models

left dirge
#

Makes sense, any good rvc models then? I didn't really have performance issues before. Mainly just model performance. Youtubers have these models that sound perfect but even after trying 30 different ones they all sound robotic

low shard
low shard
# left dirge Makes sense, any good rvc models then? I didn't really have performance issues b...
elfin nebula
#

Is there an AMD version of the Tortoise tts?

low shard
brisk steeple
#

Random Question, is this the most recent version ?

#

MMVCServerSIO_win_onnxgpu-cuda_v.1.5.3.18a

low shard
#

If you followed a video tutorial, it's better you just forget it since most of those are old

#

What's your PC GPU and what you want to do?

brisk steeple
#

i have a Ge Force RTX3060

#

i just wanna know if theres another version just to know

fading lodge
#

does anyone know the best mic for wokada?

#

i got told before by a voicemdoel creator it was a maono pd400x

#

but im just making sure if thats a good one

#

or if i can get something better

low shard
#

-realtime

patent trellisBOT
# low shard -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

low shard
#

1st link

brisk steeple
#

Perfect thank you so so much

#

Do I need Vb cable ?

fading lodge
#

you do

#

you always need vb audio cable for the voicechanger to work

brisk steeple
#

Perfect

fading lodge
#

do u need any help?

#

i have expirience in helping people here i can help u with whatever else

#

actually wait

#

add me rq

brittle wing
#

hii im trying to make a female voice a male one, does anyone know how much to pitch down by?

low shard
#

Uninstall it and get vac lite at the 3rd step of the guide I sent

low shard
low shard
fading lodge
#

ill tell her that instead

low shard
#

Already did

fading lodge
#

also, @low shard can u awnser my question

fading lodge
low shard
fading lodge
#

i replied to it

low shard
# fading lodge does anyone know the best mic for wokada?

You'd just need a decent one, don't need an extremely pricey 1k mic because rvc conversion downsamples the voice input for pitch estimation anyways https://rentry.co/forkvoicechangerguide#do-i-need-an-extremely-expensive-mic-for-good-quality

fading lodge
#

but is there a reccomended mic?

#

the mic an old voicemodel person gave me was the MAONO PD400X

#

its 150$

#

its a podcast microphon

low shard
low shard
fading lodge
#

but if ur looking for a voicemodel creator i have one for you

brittle wing
#

im doing lorde to harry styles

fading lodge
#

he charges though

#

but there insane voicemodels

#

i personally use him and hes great

brittle wing
#

no thats ok i have voicemodels

fading lodge
#

alright

#

hes in the model master shop if u want him tho

#

i think its @tulip cloak

#

yeah its him

fading lodge
#

im deadass planning to buy a double 5090 for it to have no delay

#

😭

analog obsidian
#

._.

#

okay just in case
for like the 100th time
rvc inference downsamples the source audio to 16khz

low shard
analog obsidian
low shard
fading lodge
#

i just wanna know a good mic i should use to pick up my voice better

fading lodge
#

like a 4090 super has almost 0 delay

#

close to none

analog obsidian
#

and crossfade 0.05

#

but rip stability and naturalness

low shard
analog obsidian
#

not exactly 0 but like 10ms or so

fading lodge
#

i just need a good microphone to pick my voice up better for wokada to function properly

#

right now i have a blue yeti but the shitty usb-b cable is breaking

analog obsidian
#

well the thing that affects the most in realtime inference is noise and random sounds

fading lodge
#

thats the thing, the mic i got reccomended removes all backround noise and sounds really crisp

#

so i was wondering if it was a reccomended pick by otheres

#

like i said i got it reccomended by some vociemodel person in here

crude flame
fading lodge
#

i might buy the mic anyway i need a new one

#

😭

fresh cairn
#

ahh

#

the linux verson of the discord voice changer is still busted

knotty moth
knotty moth
fading lodge
#

i see

#

alr

#

thanks

visual meteor
#

i saw that RVC-GUI is made in 2023 is there any better ones now? "VC-Project/Retrieval-based-Voice-Conversion-WebUI"

violet badger
#

I can prob figure it out myself if someone just shows me where to go

shy spruce
#

5090 works 100% fine now. I have one, and trained plenty of models since. You just need to use the new pytorch 2.7 with CU 12.8 that was recently released, before the stable release things were a bit weird. all good now though

burnt saffron
#

hi was wondering if this is overtraining?

#

model is at 275 epochs

brittle wing
#

I was trying to download a model in Applio then this happened

#

It turned out like this

#

It is the wrong model

thorny cargo
thorny cargo
#

Around there

#

You should smooth it to 0.987

elfin nebula
simple ore
agile kelp
#

Need help

#

i downloaded Ducks voice changer and it only works in Beatrice i want to use RVC

fervent rover
#

Hey y’all it is be an while since last time, but I just wanna to know, is RVC Mainline Colab Finally Worked and Fixed, I Just Wanna To Know

pastel oak
compact linden
#

Hi everyone! Does anyone have the "checkpoint_v2" file for OpenVoice 2? I'd really appreciate it if you could share it or point me to a working download link. Thanks a lot in advance! 🙏

lyric fossil
#

hello

#

why cant i share screenshot here? need some help

feral bane
lyric fossil
#

yes i tried copy pasted img and doesnt work

#

there is no run button on the left cell

#

is it because i need to "edit my copy" every session?

#

this error prompted on the last call [Errno 2] No such file or directory: '/kaggle/working/MMVCServerSIO' /kaggle/working Go back and run first and second cells.

knotty moth
lyric fossil
#

i did refresh. the first cell had button while the last cell wasnt

#

i ended up making new notebook and all buttons are there

compact linden
#

Has anyone downloaded and used it before?

humble dock
#

What is the top ai for 25$ a month in my case? Should i move out to grok 3/ claude 3.7? Im using chat gpt plus and have alot of information it knows there. I can start again but it will take like some time.I chat alot with o4 and it does alot of the things i need maybe- it doesnt give me ideas about my project and doesnt understand me only follows my commands on what to do so i need to program it only by seeing the answers and he dont know how to solve some problems of mine and dont understand some of what i explain to him( i am explaining only by examples but it doesnt recognize the patterns in other things in the world. For example i need a menu for something and i have that in mind but it doesnt understand what im telling him and i am assuming he will understand the connection to things happening in movies and in real life but it doesnt)

merry kernel
#

is there any version of RVC that support intel arc card?

simple ore
low shard
low shard
low shard
merry kernel
#

i can use beatrice v2 but can't use any RVC model
i try many version but still not work
anyone know how to solve this?
im using laptop with intel arc gpu

low shard
#

If you don't remember the tutorial link, you can also send a screenshot of your program

#

!give-media-perms 1h @merry kernel

merry kernel
#

i want to use rvc model but it so buggy and lag

pastel oak
merry kernel
pastel oak
merry kernel
#

i saw this and tried

#

still not work some how :v

knotty moth
pastel oak
#

What about it doesnt work

low shard
#

Did you read the guide

knotty moth
patent trellisBOT
# knotty moth !howtoask

How To Troubleshoot AIHC_WaitWhat

__**GIVE CONTEXT.**__ 📝
  • Don't simply mention your issue, like "my rvc is not working".
  • Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
  • The more context, the better.
__**BE POLITE.**__ <:matsuripray:1159685390156967936>
  • Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
  • It's okay if you're frustrated, but don't take it into this server.
  • Don't DM without prior consent.
__**BE PRODUCTIVE.**__ 🤝
  • Don't ask for every little instruction. Put your own effort & test things by yourself.
  • Don't ask to ask.
  • Check if your answer is a Google search away/on our guides website.
merry kernel
#

i did exactly what you guy said :v

low shard
#

What doesn't work?

merry kernel
#

wait

#

let me

#

re dowload

#

and send you picture

low shard
# merry kernel re dowload

You were using original wokada, you shouldn't use that

Uninstall original wokada and vb audio cable

Install Wokada deiteris fork and vac lite from the guide

#

I guess you followed a video tutorial lol

merry kernel
#

i uninstall vb audio ✅
install vaclite ✅

#

install the old version ✅

low shard
merry kernel
#

i have a question

low shard
#

Maybe u meant uninstall lol

merry kernel
#

i mean install this

low shard
low shard
merry kernel
#

can i keep the version im using rn and install this version

low shard
#

Install the version we told you

merry kernel
low shard
# merry kernel done

Alright, take a look for the guide and let us know for any issues or questions

merry kernel
#

downloading ...

#

ok

#

im stuck at this

#

what now?

low shard
merry kernel
#

so what should i do now

low shard
# merry kernel

GPU: ur Intel arc
Input: microphone
Output: line 1
Chunk= perf number + 60, to find your own chunk

#

Then upload a model

merry kernel
#

ok wait

#

i need to unzip the model file right?

knotty moth
# merry kernel

you should first consult the guide to get started and set up everything you need, then you can tell us if there is some problem

merry kernel
#

its said this

low shard
low shard
merry kernel
#

i upload the model

#

and boom

low shard
#

Also what browser?

merry kernel
merry kernel
#

this model

low shard
merry kernel
#

yes

#

what should i do now

agile kelp
merry kernel
#

can i retry and screen shot it

low shard
merry kernel
#

how to retry btw :v

low shard
merry kernel
#

oke

pastel oak
merry kernel
#

like this?

low shard
merry kernel
#

and what now

#

last time it crash when i tried to rename it

pastel oak
#

Guide says what to do for linux virtual cable in the virtual cable step, might have to google extra stuff if you dont know Linux well

low shard
#

It's bugged

merry kernel
#

oh

#

that why

pastel oak
#

Rename before uploading

low shard
low shard
pastel oak
#

Yuuurr

merry kernel
#

bruh it so lag :v

low shard
#

Close all useless programs in background too

merry kernel
#

only discord rn

low shard
#

Are you gonna use it in games too?

merry kernel
low shard
# merry kernel only discord rn

Show a screenshot of ur wokada, be sure everything else is closed, you can even check the ^ at the bottom right of windows to check programs in background

#

Steam is an example of programs in background that you see in ^

merry kernel
low shard
# merry kernel

Can't recognize all the programs just be sure to exit those that aren't needed

low shard
merry kernel
#

i mean

#

i cant even hear what i said rn

#

it killin my cpu

#

not gpuy

low shard
merry kernel
#

i set it

low shard
#

Did you set the GPU option to your Intel arc?

merry kernel
#

what ui

#

yes

low shard
merry kernel
#

i just stop

low shard
# merry kernel

Set extra to 0.3 and chunk to 800, don't expect low delay on a not so powerful gpu

#

Also don't expect it to run with things like marvel rivals lol

merry kernel
#

erm

#

it said this

merry kernel
#

im playing valorant

low shard
#

It's bc non Nvidia GPUs aren't that good for ai

#

And for non Nvidia you gotta change back like that to make sure it utilizes the GPU in a good way for the program

merry kernel
#

oh

#

it

#

kinda work

low shard