#✨│ai-help

next geyser
#

windows 11

#

no tut link

rigid tartan
#

also i deleted the vc to find if i downloaded the wrong one

#

voice changer*

#

would be very helpful if someone guided me through everything

toxic isle
#

Hey, anyone? I downloaded VBCable, and after I installed the driver I can't hear anything (like YouTube); I only hear people on Discord, and I also can't talk

toxic isle
#

wdym

#

im new to this

#

i dont understand anything

next geyser
# toxic isle wdym

I will go over every single step needed to fully understand and use this powerful AI-voice changer. U can use it for trolling people or whatever purpose floats your boat.

💾VOICE CHANGER DOWNLOAD (GITHUB): https://github.com/w-okada/voice-changer/blob/v.2/docs_i18n/README_en.md

💾VIRTUAL AUDIO CABLE DOWNLOAD: https://vb-audio.com/Cable/in...

#

watch this

toxic isle
#

i downloaded this

#

when i was on the step

#

with the vbcable

#

i downloaded driver like him

#

it told me to restart

#

and i couldnt hear anything

next geyser
#

screenshot

#

rvc

#

and make sure ur actual mic works

toxic isle
#

what is rvc

#

it does

#

it was working

next geyser
toxic isle
#

and after i downloaded rvc it doesnt

rigid tartan
# next geyser screenshot

also happened the same with me after all the config when i tried to use in discord there was no sound

toxic isle
next geyser
toxic isle
#

no

next geyser
#

ok start it

toxic isle
#

i did

next geyser
#

and check in discord

#

with mic

toxic isle
#

it got delay asf

next geyser
#

set it as cable

toxic isle
#

how

next geyser
#

for ur gpu

#

thats some dog settings

toxic isle
#

lol

next geyser
#

🍏 Applio Guide

‎ Deiteris' W-okada Guide

‎ Vonovox Guide

which one for vc male voice to male voice rp

toxic isle
#

so it doesnt have delay

next geyser
#

bro just look in the link for a settings for gpu tab

#

i cant hold your hand through everything you lazy bum

toxic isle
#

you can

next geyser
toxic isle
#

nega you think i understand those things

next geyser
#

😂

toxic isle
#

thats some fucking high levels for me

next geyser
#

go back to playing in the sand

toxic isle
next geyser
#

it isnt that hard

next geyser
knotty moth
candid rune
#

guys when i try and put meta data file in the index thing in the voicechanger it doesnt work can someone help asap

#

i cant send images

#

but it says

#

extension of file something something

#

if someone helps ill buy them nitro

#

<@&1159293204038955078>

thick ferry
candid rune
#

@thick ferry

thick ferry
#

ur trying to put

candid rune
thick ferry
#

what are all the files in the folder

#

name them

candid rune
thick ferry
candid rune
#

use it without index its not working

#

@thick ferry

thick ferry
candid rune
#

and its used by 100s of people

#

its not broken

thick ferry
#

can u send the file

thick ferry
#

shows a index for me

candid rune
#

reinstall it?

thick ferry
#

did u extract the folder

candid rune
#

yes @thick ferry

latent kettle
#

should i use 40khz or 30khz for sample rate

thick ferry
#

it doesnt show in my file explorer

#

js my w-okada

candid rune
#

so what do i do

#

@thick ferry how do i make it show up in my w-okada

#

or just send me the index file

thick ferry
#

let me try if it works

#

1 second

candid rune
latent kettle
analog obsidian
latent kettle
# analog obsidian 32k

i downloaded the files with 20khz that means it should be 40khz but when i extracted it using UVR it went to 15khz idk whyyy😭

analog obsidian
#

like uvr de echo

#

mel roformer models doesnt do that
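The "20khz means it should be 40khz" reasoning above is the Nyquist rule: a sample rate can only represent frequencies up to half its value. A minimal Python sketch of that arithmetic follows. The 32k/40k/48k list of training rates is an assumption taken from the options mentioned in this exchange, and the function names are made up for illustration.

```python
# Assumed set of training sample rates in Hz; adjust to what your trainer offers.
SUPPORTED_RATES = (32000, 40000, 48000)


def min_sample_rate(cutoff_hz):
    """Nyquist: the sample rate must be at least twice the highest frequency."""
    return 2 * cutoff_hz


def pick_training_rate(cutoff_hz, supported=SUPPORTED_RATES):
    """Return the smallest supported rate that still covers the cutoff."""
    needed = min_sample_rate(cutoff_hz)
    for rate in sorted(supported):
        if rate >= needed:
            return rate
    return max(supported)  # nothing covers it, so fall back to the highest rate


print(pick_training_rate(20000))  # audio that reaches 20 kHz
print(pick_training_rate(15000))  # audio that is cut off at 15 kHz
```

Under these assumptions a 15 kHz cutoff maps to 32000, which lines up with the "32k" answer quoted above.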

viral mason
rigid tartan
#

when i try to download the voice changer files it says this

#

no its not my network and i tried changin browsers

#

ts is not helping

merry forge
#

how do i fix this? RuntimeError: CUDA error: no kernel image is available for execution on the device
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
Compile with TORCH_USE_CUDA_DSA to enable device-side assertions.

#

im very new to ts so sorry if its a dumb question
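"no kernel image is available for execution on the device" usually means the installed PyTorch build was not compiled for the GPU's compute capability. The sketch below shows one way to compare the two. The matching rule is deliberately simplified (same major version, built minor version not higher than the device's) and the helper names are hypothetical; only `torch.cuda.is_available`, `torch.cuda.get_device_capability` and `torch.cuda.get_arch_list` are real PyTorch calls.

```python
import re


def parse_arch(arch):
    """Turn 'sm_86' into (8, 6) and 'sm_120' into (12, 0); None for other entries."""
    match = re.fullmatch(r"sm_(\d+)(\d)", arch)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def build_supports_device(arch_list, capability):
    """Simplified rule: a binary built for sm_XY runs on a device with the
    same major version X and a minor version of at least Y."""
    major, minor = capability
    for arch in arch_list:
        parsed = parse_arch(arch)
        if parsed is None:
            continue  # PTX ('compute_XY') and suffixed entries are ignored here
        built_major, built_minor = parsed
        if built_major == major and built_minor <= minor:
            return True
    return False


def check_installed_torch():
    """Compare the local PyTorch build against GPU 0 (needs torch and a CUDA GPU)."""
    import torch

    if not torch.cuda.is_available():
        return None
    capability = torch.cuda.get_device_capability(0)
    return build_supports_device(torch.cuda.get_arch_list(), capability)
```

If `check_installed_torch()` returns False, installing a PyTorch build made for a newer CUDA version is the usual next step.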

viral mason
upper wedge
#

How many epoch does it normally take to train a model? i'm using replay now (im now at 9 epoch)

floral quiver
#

i need help

#

its about my quality with the client and it sounds super chunky any suggestions?

simple ore
merry forge
#

weights worked before without a dedicated graphics card

simple ore
#

so what program is failing?

merry forge
simple ore
merry forge
simple ore
#

from Applio's folder

#

you can even run it as X:\Applio\env\python -m pip install torch==2.7.0 torchvision==0.22.0 torchaudio==2.7.0 --upgrade --index-url https://download.pytorch.org/whl/cu128

#

just change the path to env
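"Just change the path to env" can be sketched as a small helper that rebuilds the same pip command for any Applio folder. The package versions and index URL are copied from the message above; the `D:\AI\Applio\env` path and the function names are hypothetical placeholders.

```python
import subprocess
from pathlib import PureWindowsPath


def torch_install_command(env_dir):
    """Build the pip command from the message above for a given env folder."""
    python_exe = str(PureWindowsPath(env_dir) / "python")
    return [
        python_exe, "-m", "pip", "install",
        "torch==2.7.0", "torchvision==0.22.0", "torchaudio==2.7.0",
        "--upgrade",
        "--index-url", "https://download.pytorch.org/whl/cu128",
    ]


def run_install(env_dir):
    """Run the install; only call this on the machine that has Applio."""
    return subprocess.run(torch_install_command(env_dir), check=True)


# Hypothetical location; replace it with your own Applio\env folder.
print(" ".join(torch_install_command(r"D:\AI\Applio\env")))
```

Calling the env's own Python this way means pip installs into Applio's environment rather than into a system-wide Python.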

merry forge
floral quiver
#

so can i still get help?

thorny plaza
#

Oh got dang it, now i see "CABLE input (VB audio virtual cable)" facepalms, if you still have that problem, try using that one. Could be it. 🫡

Edit: or it could have been the Discord browser setting; scroll down a bit from this message.

#

If you have the w-okada tool downloaded and set up, you can now change your voice in real time based on what you say into your mic, or from a sound file you load into the software. W-okada includes a few voices by default, but if you want more, you can use this site's MODELS section to download new voices from Hugging Face & weights.com. After you have downloaded a model, click this in w-okada (1st pic), then upload (2nd pic), then select the .pth file for the model and the .index file for the index and click upload, and that's it, you've got a new model to use 🤗 (3rd pic). You can download any picture from the internet, and if you click here (4th pic) (after uploading the model) you can give your new voice a visual icon for easier navigation in the future, when you have a couple dozen models 🫡

trail spruce
#

im new to this ai thing

#

how do u make it sound like the person u want it to be?

thorny plaza
#

w-okada tool give me sec

#

Wokada will do both real time and from a file (voice recording) (these links are to the newer Wokada fork)
"How to use
Running locally on Windows
Before you start

1 [If not installed] Download and install 7-Zip or WinRAR. (https://www.7-zip.org/)

2 [If not installed] Download and install VAC Lite by Muzychenko. (https://software.muzychenko.net/freeware/vac470lite.zip) (or use https://vb-audio.com/Voicemeeter/banana.htm)

3 Navigate to the releases section." (https://github.com/deiteris/voice-changer/releases)
And then you can use this Discord (⁠🎧│voice-models) to get voice models, or use an online tool like (https://colab.research.google.com/github/IAHispano/Applio/blob/main/assets/Applio_NoUI.ipynb#scrollTo=0pKllbPyK_BC) to make your very own voices (easy to use, no setup required), or you can install Applio locally (https://docs.aihub.gg/rvc/local/applio/), but that might not be easy for someone who is green

rigid tartan
#

The virtual cable

#

Wont show on discord

thorny plaza
#

Hmm

#

As I don't use VAC by Muzychenko, I can't really help; I don't know why it doesn't work. This is what I have and how it's set up: both are set as default devices, the microphone is the input (in Wokada) and B1 is the output, and it works for me

#

The out: is what the PC should "hear", and the mon (monitor) does not need to be enabled (it makes you hear the changed voice with a second of delay, which is very distracting)

#

You need to set the virtual cable output as the default communication device, so that the system uses it by default in all applications instead of the microphone

#

Did you set that up with browser?

cursive gyro
#

h

thorny plaza
#

Check your browser microphone setting

#

this is very likely the cause why the tool is not working for you

#

click this

vernal boneBOT
# low shard !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a legal, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
<:matsuripray:1405252622734065816> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
rare folio
#

Hi! Okada suddenly stopped working for me at all today, it initializes properly but doesn't register my voice at all

rare folio
#

<@&1159293204038955078>

cold shoal
#

Hey I'm on bandlab and I'm trying to add the beat to my track can someone help me

low shard
# rare folio Hi! Okada suddenly stopped working for me at all today, it initializes properly ...

Please Elaborate:

  • your PC GPU (Such as Nvidia RTX 3060, AMD RX 9070 XT..)
  • your operating system (Like Windows 10, Windows 11, Linux, MacOS 26..)
  • what you want to do? There isn't a program that does everything, there's a program for each thing:
    • AI Covers
    • Train RVC Models
    • E girl trolling
    • TTS
    • Roleplay in VC
    • Roleplay in Games
    • etc
  • what tutorial link are you using
  • a screenshot of the program
blazing panther
#

anyone know why audio picked up on my pc is relaying to w-okada?

lethal basin
low shard
# lethal basin nick, which one am i suppose to download if i dont have amd?

you need to elaborate:

  • your PC GPU (Such as Nvidia RTX 3060, AMD RX 9070 XT..)
  • your operating system (Like Windows 10, Windows 11, Linux, MacOS 26..)
  • what you want to do? There isn't a program that does everything, there's a program for each thing:
    • AI Covers
    • Train RVC Models
    • E girl trolling
    • TTS
    • Roleplay in VC
    • Roleplay in Games
    • etc
  • what tutorial link are you using
  • a screenshot of the program
lethal basin
#

which one do i download

#

😭

gloomy lynx
#

Is there a way to use AICoverGen locally? I feel like I've had way more luck with it than trying to separate vocals manually, but Idk if I'd risk running cloud AI via Drive, IIRC Google's not very happy with using it for that

low shard
lethal basin
low shard
lethal basin
#

and tutorial link im using is ur forum for realtime

low shard
# lethal basin roleplay

everything, not just a bit:

  • what you want to do? There isn't a program that does everything, there's a program for each thing:
    • AI Covers
    • Train RVC Models
    • E girl trolling
    • TTS
    • Roleplay in VC
    • Roleplay in Games
    • etc
  • what tutorial link are you using
  • a screenshot of the program
low shard
gloomy lynx
low shard
#

this is only for making ai covers btw

gloomy lynx
#

In theory UVR5 runs relatively quick on it, but a lot of the time the isolation just isn't really good

low shard
low shard
#

it's the easiest automatic u can run locally

lethal basin
low shard
lethal basin
gloomy lynx
#

Like AICoverGen yeah, for some reason IIRC I had a better success rate just letting it do all the work, even if I ran the whole line of instrumental -> de-reverb -> de-noise on UVR5

gloomy lynx
low shard
low shard
low shard
lethal basin
#

expect it wont run

#

😡

gloomy lynx
#

Tends not to be much for anything AI-related, but I dunno much of an alternative, not sure if Google still cracks down on using its thing for AI training

low shard
lethal basin
lethal basin
#

okay

low shard
gloomy lynx
#

Not sure why, but I remember having a really bad time trying to set up Kaggle

low shard
#

else there's lightning.ai, which is also another great alternative but still requires phone number

gloomy lynx
#

I'd be down for either, if there's guides/resources that use those anywhere

low shard
#

let me know ofc

low shard
lethal basin
# low shard lmk

for these settings, is there anything i should do to make it sound better

gloomy lynx
#

Oh wait, no wonder I couldn't find these anywhere, I was looking at outdated docs I think

low shard
#

i wonder how u even found that link

#

anyways ima take it down

gloomy lynx
lethal basin
#

nick, why dont my input and output show anything, and if i select any of the options it just crashes

low shard
#

never expected so many things would change in 10 months

low shard
valid violet
#

nick

lethal basin
valid violet
#

like fivem?

gloomy lynx
#

Just to be sure about Kaggle, it just needs a phone number, and the 30 free hours are presumably a one-time thing, right?

low shard
lethal basin
gloomy lynx
#

Don't think I'd need more than thirty, but I should have ways to deal with that, just making sure

lethal basin
#

but how do i set my mic and headset?

low shard
low shard
lethal basin
gloomy lynx
#

Oh, weekly sounds wild, that's definitely gonna be way more than enough

low shard
gloomy lynx
#

Yeah that's fine

#

I should probably get around to updating my model, current one sorta works? I guess? But IIRC it was Colab training that shut off in the middle of it

lethal basin
low shard
#

lightning.ai offers around 80 hours freely monthly, i mean depends on how u spend ur credits and which provider, u can even get extremely good gpus like the best for free but with fewer time

low shard
lethal basin
# low shard that explains

2 questions first why cant i change any of the advanance settings
secondly do u prefer server or client?

low shard
gloomy lynx
#

I might give that a go yeah, but a 2-3 day verification seems spooky

low shard
#

u got like 4 options to run applio: locally, colab, kaggle & lightning.ai

lethal basin
low shard
lethal basin
low shard
lethal basin
low shard
lethal basin
#

😡

gloomy lynx
#

Okay yeah, one inference in CoverMaker and the result is immediately a hundred times better than half an hour of screwing around in UVR5 GremlinRose

blazing panther
#

4060, windows 10, let me know some settings to use

#

using nvidia i believe, its for either e girl trolling or roleplay but its mostly just to mess around with fun voices

timid zinc
#

Hey, currently training a spin model for the first time and its very time consuming 😆

Is there any way to pause the training process and continue later?

simple ore
#

before it starts more steps for next epoch

timid zinc
#

And is it normal that it takes like 20-30min on average for each epoch ?
20min dataset with 12 batch size and a 2060ti SUPER

simple ore
#

unless you ran out of vram

timid zinc
simple ore
#

yes

#

show task manager/performance/gpu

timid zinc
simple ore
#

yeah, you probably spilled it into shared memory

#

and that kills the performance

#

can lower batch size / use checkpointing

timid zinc
#

ohhh, is the batch size that affects vram usage ? or how do i calibrate the configs

simple ore
#

larger batch size, more memory it uses

timid zinc
#

ahhh i see the setting

#

people said the quality is better with batch size 8-12, so i figured lets go with 12

simple ore
#

you can probably manage with 8, for 12 you'll need to use checkpointing

timid zinc
#

whats the drawback of using checkpointing ?

simple ore
#

lil slower, but faster than using shared memory

timid zinc
#

understood tyvm ❤️
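One way to watch for the "spilled into shared memory" situation from a script, instead of Task Manager, is to read dedicated VRAM usage from `nvidia-smi`. This is a rough sketch: the 95% threshold is an arbitrary assumption, the helper names are hypothetical, and a nearly full reading only hints that a spill may be happening.

```python
import subprocess

# nvidia-smi query that prints used and total dedicated memory in MiB per GPU.
QUERY = [
    "nvidia-smi",
    "--query-gpu=memory.used,memory.total",
    "--format=csv,noheader,nounits",
]


def parse_memory(output):
    """Parse lines such as '7900, 8192' into (used_mib, total_mib) tuples."""
    readings = []
    for line in output.strip().splitlines():
        used, total = (int(part) for part in line.split(","))
        readings.append((used, total))
    return readings


def near_limit(used, total, threshold=0.95):
    """True when dedicated VRAM is at or above the (assumed) threshold."""
    return used / total >= threshold


def read_gpu_memory():
    """Run nvidia-smi and return one (used, total) tuple per GPU."""
    result = subprocess.run(QUERY, capture_output=True, text=True, check=True)
    return parse_memory(result.stdout)
```

If `near_limit` keeps returning True during training, lowering the batch size or enabling checkpointing, as suggested above, is the thing to try.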

low shard
low shard
gloomy lynx
#

I mean, it works better than trying to do it manually, that's for sure, I'll probably try Kaggle/Lightning whenever I feel like doing something related to training

#

As far as the latter, not the right place to talk about venty shit, so I'll pass on answering

low shard
low shard
gloomy lynx
#

I mean, an AI server probably isn't the right place to go extensively into that sorta stuff, I'd imagine

low shard
low shard
#

this somehow reminds me that some people use chatbots for mental health and therapist

#

which isn't great as they are just predicting text and could just say some bad shi, like it happened already yt_nails

#

it is ironic tho, because the first chatbot, Eliza, was made as a therapist chatbot

gloomy lynx
#

Last time I tried, for some reason it got stuck constantly responding in a 2-point format

timid zinc
#

if you want to vent to a real stranger DM me
its 3am and im bored enough :P

low shard
gloomy lynx
#

Already had both happen, probably not to that much of an extreme, though

low shard
low shard
gloomy lynx
#

Don't have anyone I'd trust or feel comfortable with for that sorta thing ngl

gloomy lynx
low shard
#

ofc do NOT do it for those topics tho

gloomy lynx
#

Haven't had luck with using that

low shard
#

but hey, everyone's different

gloomy lynx
#

Not gonna feel any worse than I have for ages now, but that's getting into venty shit and, again, this isn't the right time or place SelenShrug

low shard
#

i don't like that its GUI isn't open source tho, only the cli,
personally

#

oh right, u could also try GGUF model files, like Q4_K_M, which are efficient file formats

gloomy lynx
#

Already tried both LM Studio and GGUF models too, local textgen just doesn't really work with this setup

low shard
#

but yeah u can deffo run some nice for its size models locally, i did it even on my phone (ofc the ones on my phone is way worse than my rtx 4060 ti 16gb, but u get what i mean)

low shard
gloomy lynx
#

I couldn't remember

low shard
#

i literally ran qwen3:0.6b_Q4_K_M yesterday on my phone cpu via termux and gpt-oss 20b on my rtx 4060 ti 16gb (yes i test random shit for fun lol), maybe u tried too big models

#

anyways, goodluck with ur life and i hope u get better
feel free to ask any ai help here, like for llms or if applio gives issues cat_roomba_exceptionally_fast

sleek burrow
#

hi

granite spruce
#

How do I train my own model?

blazing panther
timid zinc
#

How do I know my model is finished training?
Im at epoch 200 now and the loss/g/total lowest point was at epoch 130 so far.
Does this mean i can stop now and after 130 its been overtraining?
The value doesnt really go up or down by a lot anymore

Using spin embedder, if thats relevant
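The "lowest point was at epoch 130" check can be done mechanically from logged loss values. The sketch below assumes you have copied the loss/g/total values into a dictionary by hand; the numbers are made up for illustration, and the function names are hypothetical. The lowest loss is only a rough signal, so it may still be worth comparing saved checkpoints by ear.

```python
def best_epoch(losses):
    """Return (epoch, value) for the lowest loss; losses maps epoch -> loss."""
    epoch = min(losses, key=losses.get)
    return epoch, losses[epoch]


def epochs_without_improvement(losses):
    """How many epochs have passed since the lowest loss was recorded."""
    last_epoch = max(losses)
    best, _ = best_epoch(losses)
    return last_epoch - best


# Made-up loss/g/total readings, not taken from a real run.
example = {100: 27.4, 130: 25.1, 160: 25.6, 200: 25.9}
print(best_epoch(example))
print(epochs_without_improvement(example))
```

A long stretch without improvement is one hint that further epochs are not helping, which is the situation described in the question.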

minor isle
#

how to fix okada repeating peoples voice from headset

#

change it to gpu usage in the menu

#

its under chunks

lone vale
#

C:\Users\saudi\Downloads\ai voice changer\MMVCServerSIO>MMVCServerSIO.exe -p 18888 --https false --content_vec_500 pretrain/checkpoint_best_legacy_500.pt --content_vec_500_onnx pretrain/content_vec_500.onnx --content_vec_500_onnx_on true --hubert_base pretrain/hubert_base.pt --hubert_base_jp pretrain/rinna_hubert_base_jp.pt --hubert_soft pretrain/hubert/hubert-soft-0d54a1f4.pt --nsf_hifigan pretrain/nsf_hifigan/model --crepe_onnx_full pretrain/crepe_onnx_full.onnx --crepe_onnx_tiny pretrain/crepe_onnx_tiny.onnx --rmvpe pretrain/rmvpe.pt --model_dir model_dir --samples samples.json
Booting PHASE :main
PYTHON:3.10.11 (tags/v3.10.11:7d4cc5a, Apr 5 2023, 00:38:17) [MSC v.1929 64 bit (AMD64)]
Activating the Voice Changer.
[Voice Changer] download sample catalog. samples_0004_t.json
[Voice Changer] download sample catalog. samples_0004_o.json
[Voice Changer] download sample catalog. samples_0004_d.json
[Voice Changer] model_dir is already exists. skip download samples.
Internal_Port:18888
protocol: HTTP
-- ---- --
Please open the following URL in your browser.
http://<IP>:<PORT>/
In many cases, it will launch when you access any of the following URLs.

[VCClient] Access http://127.0.0.1:18888/
[VCClient] wait web server...0 http://127.0.0.1:18888/

#

It's just stuck like that

#

ive been waiting for 20 minutes

#

It used to work but I had deleted it a few months back
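When the console stays on "wait web server...0 http://127.0.0.1:18888/", a quick diagnostic is to check whether anything is actually listening on that port. The sketch below only tests the TCP connection and does not fix the server; the host and port come from the log above, and the helper name is hypothetical.

```python
import socket


def is_port_open(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    # Port 18888 is the one passed with -p in the command above.
    print(is_port_open("127.0.0.1", 18888))
```

If this prints False while the console is still waiting, the web server never started listening, which points at the server process rather than the browser.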

floral quiver
#

need help

#

How do i make my ai sound smooth?

queen osprey
#

how to fix the voice changer not working like its not talking

candid rune
#

where is the latest w okada version

rich brook
#

i have a problem with the google colab w-okada voice changer

#

after starting the first step it says "Restart session. The following packages were previously imported in this runtime"

#

but when I click reset the GPU can't reconnect

#

any solution?

glossy ingot
flint vapor
flint vapor
vernal boneBOT
# flint vapor -wokada
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested WebUI with the best general support for many platforms. GUIDE

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE

⚔️ Wokada Deiteris Fork vs Vonovox

For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

queen osprey
flint vapor
queen osprey
flint vapor
queen osprey
flint vapor
queen osprey
flint vapor
# queen osprey v1.5.3.18a

Ok, you have the outdated mainline Okada. I will give you the new version and a guide; you have to download the version for Nvidia, not the 5000 series one

#

-wokada


flint vapor
#

Follow this 😄

#

Uninstall the current version you have now

queen osprey
#

but for some reason when i first used it it worked

#

i didnt change anything big

flint vapor
flint vapor
flint vapor
queen osprey
#

i was wanted to sound like ichigp

flint vapor
queen osprey
flint vapor
queen osprey
#

so after i unzip

#

do i put the model and call it a day or do i need to install that cable again

flint vapor
queen osprey
#

this one the one i used

flint vapor
#

Install this

glossy ingot
#

anyway to make the voice changer sound smoother and shi?

queen osprey
flint vapor
flint vapor
queen osprey
flint vapor
# queen osprey can i just keep it

Open Device Manager. Expand the "Sound..." device category by clicking on the "+" sign. Right-click the Virtual Audio Cable device and choose Uninstall.

flint vapor
lone vale
#

i dont know what im doing wrong

glossy ingot
#

what its name is

#

just says voice changer native

queen osprey
#

@flint vapor it finished downloading now how do i open it and can you show me how to set it up

flint vapor
#

Have you run the .exe after unzipping it?

#

Hi, didn't you also try other models of Killua?

flint vapor
#

on right up

queen osprey
#

also is there a english dub ichig

flint vapor
flint vapor
high spoke
#

yes

low shard
# high spoke yes

E girl trolling is catfishing, which is illegal. You have been warned both verbally and via sapphire and continuing to ask will lead to further actions possibly like a ban

viral mason
elfin nebula
#

When voice training an ai model, would it be better to use a word list with all the english phonetic sounds?

autumn orchid
#

Hey, I was testing out the Albert / Flamingo (V4) (RVC V2) (400 Epochs) model, and what I heard from the MP3s he sent sounds completely different from what I have. On my end, the voice changer sounds like the "ai/model" has rocks in its mouth. No offense to the person who made it, but either way, it's an issue on my end, and I was wondering if an admin could help me out.

simple ore
young halo
#

i'm using kaggle for training but i can't do anything because of this ModuleNotFoundError: No module named 'gradio'

simple ore
simple ore
#

it would depends on which notebook you're using

young halo
viral mason
#

they're asking if you're using like applio or something else

young halo
#

i tried re running and it didn't work

simple ore
#

in kaggle create a new notebook, then file/import notebook,

#

as I see it starts just fine

still holly
#

can anyone tell me some good settings?

young halo
#

it's weird, my training was going just fine

simple ore
#

did you run install cell? 🙂

#

I just tried importing the notebook code as explained and it was fine

young halo
#

Unless i didn't understand properly 😭

#

I installed the Notebook

#

And then re-ran the one i was using

#

But i don't know if that's what i had to do

simple ore
#

go to kaggle, +Create, Notebook

#

File/Import Notebook -> screenshot I gave you

young halo
#

Done

simple ore
#

after that you can rename your notebook so you know what to use next time

young halo
#

So I'll do my work here?

simple ore
#

you should not see this screen

#

if you follow my instructions

#

DO NOT DO +CREATE/IMPORT

#

because it immediately runs something

young halo
#

Idk what I'm doing wrong 🙁

#

I never had this issue before

#

Thanks for the help though

primal anchor
#

who knows why custom voice models arent working on okada voice changer?

simple ore
#

it opens a blank notebook

#

use File/Import Notebook

sand heart
#

How to download voice changer on my mac air m3 apple silicon chip

young halo
sand heart
#

How to download voice changer on my mac air m3 apple silicon chip

#

??

young halo
glossy ingot
#

sum help me download w-okada

young halo
#

It's working, thank you @simple ore !

young halo
#

This froze, the tensorboard and applio show the training, but here it's frozen

low shard
# glossy ingot sum help me download w-okada

Please Elaborate:

  • your PC GPU (Such as Nvidia RTX 3060, AMD RX 9070 XT..)
  • your operating system (Like Windows 10, Windows 11, Linux, MacOS 26..)
  • what you want to do? There isn't a program that does everything, there's a program for each thing:
    • AI Covers
    • Train RVC Models
    • E Girl trolling
    • TTS
    • Roleplay in VC
    • Roleplay in Games
    • etc
  • what tutorial link are you using
  • a screenshot of the program
low shard
# sand heart How to download voice changer on my mac air m3 apple silicon chip

voice changer is too vague

Please Elaborate:

  • your PC GPU (Such as Nvidia RTX 3060, AMD RX 9070 XT..)
  • your operating system (Like Windows 10, Windows 11, Linux, MacOS 26..)
  • what you want to do? There isn't a program that does everything, there's a program for each thing:
    • AI Covers
    • Train RVC Models
    • TTS
    • E girl trolling
    • Roleplay in VC
    • Roleplay in Games
    • etc
  • what tutorial link are you using
  • a screenshot of the program
timid fox
#

I keep changing my mic and output in okada voice changer but none are working

dusk jewel
#

my mic dont work when i use the Voice Changer, why?

patent gazelle
#

keeps crashing

prisma grove
#

hey, I wanna train some models but I don't know which pretrain to use

#

I know

#

but I want to know which pretrain specifically I should use

#

I want to do 2 character voices: one has a dataset of about 1 hour of very clear but repetitive Japanese singing audio, and the 2nd one is a character with an entirely computer-generated voice, so it sounds very muffled and noisy; that dataset has about 30 minutes of mostly Japanese speech and some singing

#

should I go with TITAN? KLM? Snowie?

#

I know, I've read the guidelines

prisma grove
#

original?

viral mason
#

Mhm

prisma grove
#

as in, the default one?

viral mason
#

yup

prisma grove
#

okay, thanks

viral mason
#

no problem!

vernal boneBOT
# low shard !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a legal, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
<:matsuripray:1405252622734065816> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
low shard
viral mason
#

NickBot9000 misc_trolley

low shard
viral mason
#

Ik I was just poking fun lol

low shard
daring sail
#

What should I use to install different apps that require different versions of Python and CUDA so that they don't conflict with each other?

timid fox
low shard
#

delete everything

#

i don't think ur gpu meets the bare minimum for ai tasks nails

#

u still want to try locally? but i feel like ur gpu wont be recognized

timid fox
low shard
#

what are u trying to do? roleplay in vc? e girl trolling? or roleplay in games?

timid fox
#

shits and giggles with my friends?

#

it would be funny

low shard
timid fox
#

vc

low shard
# timid fox vc

less demanding on ur gpu for sure, but ehh not sure if it would be recognized, the bare minimum is a gtx 900

#

u still want to try locally? or want to use cloud?

timid fox
low shard
# timid fox what does trying locally and cloud mean?

using something:

  • locally: runs on your hardware, like running the software on your gpu
  • cloud: remote good pc, running the software on a service that allows u to use way better gpus, with limited time and subscriptions, and as it's a realtime voice changer, ud need a good connection too for this
#

cloud would have better performance in ur case ofc, if ur gpu can even get recognized as it's super ancient

timid fox
#

i live in a third world country with 20/mbps max internet lol

low shard
#

ofc u can't expect to run things with poor hardware and poor connection

#

u still want to try locally?

#

and then check cloud if it doesn't work?

timid fox
#

this seems too much work bro my normal voice is okay 🥀

low shard
#

only expensive products made by companies are 1 click

timid fox
low shard
#

but AI isn't 1 click at all

#

this is an open source program driven by the community

timid fox
#

so how do u use cloud?

low shard
# timid fox cloud would be faster, idw reinstall everything from scratch

About Cloud, there are different services:

low shard
timid fox
#

maybe one day...

#

thanks for the help though

low shard
trail sleet
#

How do I lower my input buffer in wokada

#

And output buffer

#

Sometimes my output is at 124s sometimes input is 124s

unreal kettle
#

what chunk values have u guys found that works good without being too delayed? using Deiteris w-okada fork for rtx 50 series (5070ti) paired with 7800x3d

viral mason
#

If that's not good for you lower the chunk a bit until the delay is good for you ^^
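
For a rough idea of how chunk size maps to delay, here is a minimal sketch. It assumes the chunk and crossfade are expressed in milliseconds and that total delay is roughly chunk + crossfade + inference time. The function names and the example numbers are hypothetical, not taken from any specific program.

```python
def estimate_delay_ms(chunk_ms: float, crossfade_ms: float, inference_ms: float) -> float:
    """Rough end-to-end delay (assumed model): one chunk must be buffered
    before it can be processed, then inference and the crossfade overlap add on top."""
    return chunk_ms + crossfade_ms + inference_ms


def can_keep_up(chunk_ms: float, inference_ms: float) -> bool:
    """Realtime conversion only works if each chunk is processed
    faster than the next one is recorded."""
    return inference_ms < chunk_ms


if __name__ == "__main__":
    # Hypothetical values: lowering the chunk lowers the delay,
    # but only while the GPU still keeps up.
    for chunk in (400, 200, 100, 50):
        print(chunk, estimate_delay_ms(chunk, 50, 80), can_keep_up(chunk, 80))
```

So lowering the chunk reduces delay only until inference time catches up with it; below that point the audio starts to break up instead.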

unreal kettle
viral mason
unreal kettle
#

im using rtx voice as well for noise reduction since my mic picks up background noise a lot

unreal kettle
viral mason
unreal kettle
mortal kite
#

hel

#

p

rocky smelt
#

Why is it that when I use an AI voice, sometimes it records and translates my voice well, but other times it pronounces poorly and can't speak clearly?

mortal kite
#

Mine doesnt work

#

I get errors

#

But i can't send 9mgs

#

Immages

fallow lava
#

Since Okada is no longer getting updated. Which would you recommend that's on the lower end when it comes to specs requirement?

#

I tried Vonovox yet the results are inconsistent

dusk jewel
#

!howtoask

vernal boneBOT
# dusk jewel !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a legal, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
dusk jewel
latent kettle
#

Is it a good idea to set batch size of 4 for 24 minutes of dataset

latent kettle
#

Or does it mean between 4 to 8 is good?

simple ore
#

4, 5,6,7,8

#

give it a try with 4 and with 6 and with 8

#

see which one comes out better

#

there may be some differences in results

latent kettle
simple ore
#

it depends entirely on the dataset

#

well, I did test once with 2,4,6,8,10,12,16,24 batches

#

2 was bad

#

but there was not much noticeable difference between 4 and 6

latent kettle
simple ore
#

well, it is not exactly that

latent kettle
#

Why it is written there 😶

simple ore
#

because a proper explanation would take pages and pages

latent kettle
latent kettle
simple ore
#

it is just the total loss.. scroll down for examples.

#

👉 How many chunks of audio the model looks at before it adjusts itself.

Differences in Batch Sizes

Small batch size (like 2, 4, 6)

The model only looks at a few audio chunks at a time.

Pros:

Works even on weaker GPUs (less memory used).

Sometimes can capture more subtle details of a voice.

Cons:

Training is slower because the model updates too often.

Can be a bit “noisier” — results may vary more between training steps.

Medium batch size (like 8, 12, 16)

A balance: the model sees a fair number of chunks per step.

Pros:

Training is smoother and faster.

Usually good quality and stability.

Cons:

Needs more GPU memory than very small batches.

Large batch size (32, 64, 128 …)

The model sees lots of chunks before updating itself.

Pros:

Training is very stable and efficient (less “noise”).

Often used when you have a big, powerful GPU.

Cons:

Needs lots of GPU memory.

Sometimes can “average out” too much, losing some finer details of a specific voice.

A Simple Analogy

Imagine you’re learning to sing a song:

Small batch size: After every 2–3 notes, your teacher stops you and corrects you. It’s detailed, but it takes a long time and can feel jumpy.

Medium batch size: Your teacher lets you sing a full line, then corrects you. It’s a good balance.

Large batch size: You sing the whole verse before getting feedback. It’s smoother and efficient, but little mistakes might get overlooked.

For RVC training, most people use batch sizes between 8 and 16 if their GPU allows it. If the GPU is weak, go smaller (2–4). If it’s strong, you can experiment with larger ones, but medium is usually best.
#

not a perfect explanation but there you have it

latent kettle
simple ore
#

you have an upper limit (VRAM) on how big a batch you can use; that can be cheated a bit by using BFloat16 precision (if your GPU allows it, 3000 series+) or by using checkpointing

#

unless you're using 10+ hour dataset it is not an issue, you should not go over like 16 for <1hr set anyway
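
To make the batch size trade-off concrete, here is a small sketch that counts how many weight updates happen per epoch for a given dataset length. The 3-second segment length is an assumption for illustration only (the real slicing depends on the preprocessing), and the function name is hypothetical.

```python
import math


def steps_per_epoch(dataset_minutes: float, segment_seconds: float, batch_size: int) -> int:
    """Number of model updates in one pass over the dataset.

    Assumes the dataset is cut into fixed-length segments (an illustrative
    simplification) and that each update consumes `batch_size` segments.
    """
    segments = math.floor(dataset_minutes * 60 / segment_seconds)
    return math.ceil(segments / batch_size)


if __name__ == "__main__":
    # 24 minutes of audio, assumed 3-second segments -> 480 segments.
    for batch_size in (4, 8, 16):
        print(batch_size, steps_per_epoch(24, 3.0, batch_size))
```

With those assumed numbers, batch size 4 gives 120 updates per epoch, 8 gives 60, and 16 gives 30: smaller batches mean more frequent (and noisier) updates on the same data.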

low shard
low shard
low shard
low shard
# dusk jewel yes

E girl trolling is catfishing, which is illegal. You have been warned both verbally and via sapphire and continuing to ask will lead to further actions possibly like a ban

low shard
low shard
fallow lava
#

I messed up, RVC Okada is now outputting gargled delay sounds

#

I only tinkered with Chunk & Extra. Now it's just a mess even if I try to place it back

glossy ingot
#

ya'll think this good

viral mason
#

yikes

#

that's a version over a year old

weary minnow
#

im using deiteris fork with vac lite. i put my input as my regular mic and my output as line 1, then input as line 1 on discord and output as my headphones, but i only hear my regular voice in the mic test

grizzled heart
#

is there any tts website that i could upload my voice model and tell the ai to generate it for me?

rose arrow
rose arrow
low shard
#

Please Elaborate:

  • your PC GPU (Such as Nvidia RTX 3060, AMD RX 9070 XT..)
  • your operating system (Like Windows 10, Windows 11, Linux, MacOS 26..)
  • what you want to do? There isn't a program that does everything, there's a program for each thing:
    • AI Covers
    • Train RVC Models
    • TTS
    • E girl trolling
    • Roleplay in VC
    • Roleplay in Games
    • etc
  • what tutorial link are you using
  • a screenshot of the program
low shard
# glossy ingot ya'll think this good

nope, the settings are completely wrong, and that's an extremely old program
delete everything
Please Elaborate:

  • your PC GPU (Such as Nvidia RTX 3060, AMD RX 9070 XT..)
  • your operating system (Like Windows 10, Windows 11, Linux, MacOS 26..)
  • what you want to do? There isn't a program that does everything, there's a program for each thing:
    • AI Covers
    • Train RVC Models
    • TTS
    • E girl trolling
    • Roleplay in VC
    • Roleplay in Games
    • etc
  • what tutorial link are you using
  • a screenshot of the program
low shard
low shard
#

it depends alot, there isn't just a single version
Please Elaborate:

  • your PC GPU (Such as Nvidia RTX 3060, AMD RX 9070 XT..)
  • your operating system (Like Windows 10, Windows 11, Linux, MacOS 26..)
  • what you want to do? There isn't a program that does everything, there's a program for each thing:
    • AI Covers
    • Train RVC Models
    • TTS
    • E girl trolling
    • Roleplay in VC
    • Roleplay in Games
    • etc
  • what tutorial link are you using
  • a screenshot of the program
low shard
# rose arrow GPU: nvidia rtx 4050 OS: Windows 11 Detailed Description: Everytime i open Deite...

Hello, I am having an issue with my audio automatically lowering whenever someone speaks, then going back up when everyone is quiet. I have done all the steps in the reply to this thread here…

#

u could also send a screenshot

low shard
glossy ingot
#

vc

#

mostly rp

#

ig

low shard
glossy ingot
#

oh

low shard
#

don't just say only roleplay, elaborate everything asked in that message, there isn't just 1 program and 1 version

glossy ingot
#

win 11, NVIDIA GeForce GTX 1660 SUPER, Roleplay Vc, No tut,

#

good?

low shard
glossy ingot
#

girl rp

#

4 family

#

since we got no females

#

lol

#

e girl trolling is against rules anyway

#

these were only good 1s i could find

hallow thistle
glossy ingot
#

roblox maybe gta

#

but roblox rn

hallow thistle
glossy ingot
#

its dc vc but brookhaven

#

with my friends

cold cave
#

AttributeError: 'Namespace' object has no attribute 'normalization_mode'

brittle minnow
#

yo

#

my voicechanger don t work

#

can somebody help me

viral mason
lime siren
#

how do I fix the voicechanger hearing other people in the discord call and speaking for them?

#

do I gotta use a noise gate

glossy ingot
lime siren
glossy ingot
#

microphone or headset tho

lime siren
#

wdym?

glossy ingot
#

these lol

#

sometimes headsets do that

lime siren
glossy ingot
#

might be ur headset loudness like too high

#

or use noise gate

#

🤷 tho im not smart so prop ask helper

clear crater
low shard
vernal boneBOT
# low shard -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested WebUI with the best general support for many platforms. GUIDE

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only on Nvidia GPUs on Windows, and without cloud options. GUIDE

⚔️ Wokada Deiteris Fork vs Vonovox

For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

low shard
#

either vonovox or wokada deiteris fork

low shard
glossy ingot
#

friend sent it

clear crater
#

ahh okay im just asking bc thats the one i used to use and it worked but the version im trying to use now isnt working for me

#

Currently trying to use RVC

cold cave
clear crater
#

but when trying to hear the converted voice I only hear myself not the voice model

low shard
low shard
#

RVC = Retrieval-based-Voice-Conversion, the best few-shot speech-to-speech AI models (on v2). It runs inference (uses models) on pre-recorded audio (AI covers) and trains (makes) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated, so it's strongly discouraged for realtime. There are also updated forks with extra features, like Applio.

Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)

Vonovox = Another Realtime Voice Changer based on RVC, with similar quality and performance to wokada deiteris fork but other perks

low shard
#

pls elaborate

vernal boneBOT
# low shard !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a legal, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
clear crater
low shard
# clear crater So if im wanting realtime which would be best to go for?

there's different programs and versions, you have to elaborate:

  • your PC GPU (Such as Nvidia RTX 3060, AMD RX 9070 XT..)
  • your operating system (Like Windows 10, Windows 11, Linux, MacOS 26..)
  • what you want to do? There isn't a program that does everything, there's a program for each thing:
    • AI Covers
    • Train RVC Models
    • TTS
    • E girl trolling
    • Roleplay in VC
    • Roleplay in Games
    • etc
  • what tutorial link are you using
  • a screenshot of the program
clear crater
low shard
#

also be aware windows 10 support is ending in like 2 months

#

-realtime

vernal boneBOT
# low shard -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested WebUI with the best general support for many platforms. GUIDE

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only on Nvidia GPUs on Windows, and without cloud options. GUIDE

⚔️ Wokada Deiteris Fork vs Vonovox

For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

low shard
#

you can either use vonovox or wokada deiteris fork

#

it's better you read the pros&cons of both

lime siren
#

how do I fix the voicechanger hearing other people in the discord call and speaking for them? should I use noisegate?

clear crater
#

Okay perfect thank you :D and thats okay I plan to change to windows 11 soon anyway due to the support for windows 10 ending anyway

low shard
# lime siren how do I fix the voicechanger hearing other people in the discord call and speak...

Please Elaborate:

  • your PC GPU (Such as Nvidia RTX 3060, AMD RX 9070 XT..)
  • your operating system (Like Windows 10, Windows 11, Linux, MacOS 26..)
  • what you want to do? There isn't a program that does everything, there's a program for each thing:
    • AI Covers
    • Train RVC Models
    • TTS
    • E girl trolling
    • Roleplay in VC
    • Roleplay in Games
    • etc
  • what tutorial link are you using
  • a screenshot of the program
low shard
#

tbh windows 11 isn't as bad as everyone says, been using it since a year

lime siren
low shard
#

RVC = Retrieval-based-Voice-Conversion, the best few-shot speech-to-speech AI models (on v2). It runs inference (uses models) on pre-recorded audio (AI covers) and trains (makes) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated, so it's strongly discouraged for realtime. There are also updated forks with extra features, like Applio.

Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)

Vonovox = Another Realtime Voice Changer based on RVC, with similar quality and performance to wokada deiteris fork but other perks

#

wokada deiteris fork isn't for ai covers at all

#

you can delete everything

lime siren
low shard
#

it's way better you use either ai cover maker or applio for ai covers

#

aicovermaker even automatically separates the song and mixes it back with converted vocals, it's the easiest way locally

#

there's a difference between voice changer and realtime voice changer

#

and also this way others voices won't be converted, since you'll automatically upload the audio :D

lime siren
#

I leave my computer and apparently it repeats what everyone else is saying

low shard
#

forgot to give it

#

you can send a full screenshot of the entire program without cropping

lime siren
#

and open it

low shard
clear crater
low shard
#

i dont mention my desktop bc that's ofc more powerful than my old laptop

#

i think part of it is that not everyone can upgrade if they got an ancient pc

clear crater
lime siren
#

it should only work when im speaking

viral mason
#

btw you have no mic settings set up

lime siren
viral mason
lime siren
#

well just the inputs

#

not all my settings

viral mason
#

still not normal

#

it shouldn't do that

low shard
# lime siren what I dont know is if the issue is in the voice changer or my mic

use sup2
input: microphone
output: line 1
monitor: headphones optionally to hear urself

on wokada deiteris fork, you can **optionally** use more advanced settings for benefits:

  • Advanced Settings -> Force FP32 mode: on (THIS IS OFF BY DEFAULT! Turning this on improves stability. Increases VRAM usage by 200 MB)
  • Advanced Settings -> Disable JIT compilation: off for faster loading of the program, on for slightly better performance (10-15 ms, Nvidia only)
  • Advanced Settings -> Crossfade Length: Controls how smoothly the AI stitches the processed parts ("chunks") of your voice back together. 0.1 for fastest voice, 0.15 for improved quality but increases delay by ~50 ms
  • Reduce the delay on Windows via the Wasapi / Asio Guide
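
To illustrate what a crossfade does when stitching chunks, here is a minimal sketch of a linear crossfade. It is a generic illustration, not the fork's actual implementation; it assumes the crossfade setting is a duration in seconds (which would match 0.15 vs 0.1 costing about 50 ms), and the function names and the 48000 Hz sample rate are hypothetical.

```python
def overlap_samples(crossfade_seconds: float, sample_rate: int) -> int:
    """How many samples two neighbouring chunks overlap for a given crossfade length."""
    return int(round(crossfade_seconds * sample_rate))


def linear_crossfade(prev_tail: list, next_head: list) -> list:
    """Blend the end of the previous chunk into the start of the next one.

    The weight of the new chunk rises linearly across the overlap, so the
    output fades from the old audio to the new audio without a click.
    """
    if len(prev_tail) != len(next_head):
        raise ValueError("overlap regions must have the same length")
    n = len(prev_tail)
    blended = []
    for i in range(n):
        w = (i + 1) / (n + 1)  # weight of the new chunk, strictly between 0 and 1
        blended.append(prev_tail[i] * (1 - w) + next_head[i] * w)
    return blended


if __name__ == "__main__":
    print(overlap_samples(0.1, 48000), overlap_samples(0.15, 48000))
    print(linear_crossfade([1.0, 1.0, 1.0], [0.0, 0.0, 0.0]))
```

A longer overlap gives the blend more samples to work with, which sounds smoother but means more audio has to be held back before it can be played.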
#

You could also use vonovox btw, it's a windows nvidia realtime voice changer based on RVC which still gets updates, but ur choice

lime siren
#

I already use all of these settings and i dont know why its picking up on that

low shard
lime siren
#

I've never used echo but yeah I use in sens and sup2

low shard
#

try echo too

#

else it might just be ur headphones

lime siren
chilly kestrel
#

my realtime voice changer wont work

#

like when i try and hear my voice nothing comes out

#

and when i record as well there is no audio from my mic

low shard
# chilly kestrel my realtime voice changer wont work

This is a General AI Server, AI has many fields, so we can't know your issue with little info

Please Elaborate:

  • your PC GPU (Such as Nvidia RTX 3060, AMD RX 9070 XT..)
  • your operating system (Like Windows 10, Windows 11, Linux, MacOS 26..)
  • what you want to do? There isn't a program that does everything, there's a program for each thing:
    • AI Covers
    • Train RVC Models
    • E girl trolling
    • TTS
    • Roleplay in VC
    • Roleplay in Games
    • etc
  • what tutorial link are you using
  • a screenshot of the program
chilly kestrel
#

and i cant upload a screenshot

low shard
chilly kestrel
#

i actually didnt know

#

sorry about that

fallow lava
low shard
#

on wokada deiteris fork, you can **optionally** use more advanced settings for benefits:

  • Advanced Settings -> Force FP32 mode: on (THIS IS OFF BY DEFAULT! Turning this on improves stability. Increases VRAM usage by 200 MB)
  • Advanced Settings -> Disable JIT compilation: off for faster loading of the program, on for slightly better performance (10-15 ms, Nvidia only)
  • Advanced Settings -> Crossfade Length: Controls how smoothly the AI stitches the processed parts ("chunks") of your voice back together. 0.1 for fastest voice, 0.15 for improved quality but increases delay by ~50 ms
  • Reduce the delay on Windows via the Wasapi / Asio Guide
tired timber
#

Hi, I just want to set up RVC for character voices.
Can someone guide me on what I need to download first?

low shard
# tired timber Hi, I just want to set up RVC for character voices. Can someone guide me on wh...

RVC doesn't mean realtime voice changer

Please Elaborate:

  • your PC GPU (Such as Nvidia RTX 3060, AMD RX 9070 XT..)
  • your operating system (Like Windows 10, Windows 11, Linux, MacOS 26..)
  • what you want to do? There isn't a program that does everything, there's a program for each thing:
    • AI Covers
    • Train RVC Models
    • TTS
    • Roleplay in VC
    • Roleplay in Games
    • etc
  • what tutorial link are you using (if any)
  • a screenshot of the program (if any)
tired timber
low shard
#

Because realtime isn't meant for that, it's meant for discord VCs or games

tired timber
#

want to record gameplay in OBS, but I don’t want to use my real voice.
I just want to use RVC in real-time to make a character voice while recording.

deft pewter
#

is there any other better fork of wokada

#

because honestly i just launched wokada latest version today and suddenly im talking but i dont hear any output even tho its set to the right input and monitor

simple ore
blissful flint
#

Please tell me how to use a sample attached to a specific voice?

vernal boneBOT
# low shard -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested WebUI with the best general support for many platforms. GUIDE

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only on Nvidia GPUs on Windows, and without cloud options. GUIDE

⚔️ Wokada Deiteris Fork vs Vonovox

For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

low shard
#

You can use either vonovox or wokada deiteris fork

tired timber
low shard
deft pewter
#

it just stopped itself

#

also it picks up my voice but never converts it

simple niche
#

Hello, how do I not have delay?

opaque tinsel
#

hello, when i start rvc my ping goes crazy like 2.5K and stay like this even if i close rvc. Only restart of my PC helps. Im not sure is it using local or cloud. Can someone help please?
RTX 3070 TI laptop
Win 11

valid spruce
#

Help

#

This happened after I continued training from a model

simple ore
low shard
low shard
# simple niche Hello, how do I not have delay?

This is a General AI Server, AI has many fields, so we can't know your issue with little info

Please Elaborate:

  • your PC GPU (Such as Nvidia RTX 3060, AMD RX 9070 XT..)
  • your operating system (Like Windows 10, Windows 11, Linux, MacOS 26..)
  • what you want to do? There isn't a program that does everything, there's a program for each thing:
    • AI Covers
    • Train RVC Models
    • TTS
    • E girl trolling
    • Roleplay in VC
    • Roleplay in Games
    • etc
  • what tutorial link are you using
  • a screenshot of the program
low shard
opaque tinsel
# low shard rvc doesn't mean realtime voice changer Please Elaborate: - what you want to do...

Hello i dont have access to my PC rn so i can't give you screenshot of it, but here's link(https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/releases), I've got it from all working rvc collection on this server(it's literally first in working local rvcs). Didn't really used one special guide but did it by myself probably I missed something or etc. but before i used another one rvc but deleted it because crazy ping and here its again

knotty moth
# valid spruce

make sure the sample rate setting matches the one used in preprocess and also matches the pretrain used
if you try to resume training with Applio on dataset preprocessed/trained with older RVC, unfortunately it might not work
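
As a quick sanity check before resuming training, the sample rate rule above can be sketched like this. The function name and the example rates are hypothetical; this does not read any real RVC or Applio config file, it only compares three numbers you look up yourself.

```python
def find_sample_rate_mismatches(training_sr: int, preprocess_sr: int, pretrain_sr: int) -> list:
    """Return a list of human-readable problems; an empty list means all three rates agree."""
    problems = []
    if preprocess_sr != training_sr:
        problems.append(
            f"dataset was preprocessed at {preprocess_sr} Hz but training is set to {training_sr} Hz"
        )
    if pretrain_sr != training_sr:
        problems.append(
            f"pretrain is {pretrain_sr} Hz but training is set to {training_sr} Hz"
        )
    return problems


if __name__ == "__main__":
    # Hypothetical example: training set to 48k on data and pretrain made for 40k.
    for problem in find_sample_rate_mismatches(48000, 40000, 40000):
        print(problem)
```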

low shard
# opaque tinsel Hello i dont have access to my PC rn so i can't give you screenshot of it, but h...

that's original/mainline RVC

RVC = Retrieval-based-Voice-Conversion, the best few-shot speech-to-speech AI models (on v2). It runs inference (uses models) on pre-recorded audio (AI covers) and trains (makes) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated, so it's strongly discouraged for realtime. There are also updated forks with extra features, like Applio.

Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)

Vonovox = Another Realtime Voice Changer based on RVC, with similar quality and performance to wokada deiteris fork but other perks

I'm not sure what you mean by ping, Are you trying to do ai covers, training rvc models, e girl trolling, roleplay in vc or roleplay in games?

opaque tinsel
low shard
opaque tinsel
#

Idk how to explain

low shard
opaque tinsel
#

Do i need to create a new one thread or?

low shard
rose arrow
#

I also tried the original w-okada before which runs in its own client instead of the browser and it didnt have this problem so im not sure what the issue is

foggy ice
#

Yo i need help

low shard
# foggy ice Yo i need help

Please Elaborate:

  • your PC GPU (Such as Nvidia RTX 3060, AMD RX 9070 XT..)
  • your operating system (Like Windows 10, Windows 11, Linux, MacOS 26..)
  • what you want to do? There isn't a program that does everything, there's a program for each thing:
    • AI Covers
    • Train RVC Models
    • E girl trolling
    • TTS
    • Roleplay in VC
    • Roleplay in Games
    • etc
  • what tutorial link are you using
  • a screenshot of the program
foggy ice
#

I am trying use program for these girl voices but idk what to use i heard about w-okada but idk

#

I have nvidia 2060

#

Ryx

#

Rtx

#

I am. Windows 11

foggy ice
#

No

#

Like roleplaying as girl in game

#

I just need voice for it and program to use it in

#

Because i don’t know really what to use it kinda confusing

#

?

low shard
# foggy ice Because i don’t know really what to use it kinda confusing

mm, sorry but there are different programs for different things, a voice changer is different than a realtime voice changer, it's best you explain so i know which program to help you with

are you trying to roleplay as peter griffin in a game, do ai covers, or trying to troll/mess with others as a girl lol?

foggy ice
#

U know roblox?

low shard
foggy ice
#

Why u keep saying trolling?

low shard
foggy ice
#

There is map for role playing i am trying use it to play with my friends

#

As anime girl or smth yk

foggy ice
tired timber
#

@low shard Can I also use this model for making songs, or is there a separate tool for that?

tired timber
viral mason
hallow thistle
tired timber
#

im asking

viscid moss
#

Please Elaborate:

  • your PC GPU (Such as Nvidia RTX 3060, AMD RX 9070 XT..)
  • your operating system (Like Windows 10, Windows 11, Linux, MacOS 26..)
  • what you want to do? There isn't a program that does everything, there's a program for each thing:
    • AI Covers
    • Train RVC Models
    • E girl trolling
    • TTS
    • Roleplay in VC
    • Roleplay in Games
    • etc
  • what tutorial link are you using
  • a screenshot of the program
rose arrow
#

GPU: nvidia rtx 4050
OS: Windows 11
Detailed Description: Everytime i open Deiteris fork, my system audio becomes bad. Every sound is lower and a bit distorted; spotify, youtube, media player, etc. It happens when I set my input device into any microphones, audio becomes normal again when it's set to "none".
Tutorial: https://docs.aihub.gg/realtime-voice-changer/local/deiteris-w-okada-fork/

knotty moth
#

so are you trying to use it for catfishing/trolling?

fallow lava
deft pewter
#

bro

#

man

#

wokada is broken as hell for me

#

i even allowed mic and all its legit picking up on my voice

#

but its not even working even tho it is picking up on my voice

#

i tried switching to multiple other voice models (i even checked settings, its using my mic, and i changed to other separate mics i have, still the same) and the output is the same as my speakers

#

im just using this to sound like arthur morgan from rdr2 lol

#

i've been js makin ppls days in rdr2 online

#

until this issue happened

#

can i also have help

rocky gate
#

What are your thoughts on Vonovox? I’m not sure what to get. Currently running on a 2 year old Okada download. I use an Nvidia 4060

#

On the GitHub Vonovox is shown to have more perks/better performance but I’m not sure how true that is—

viral mason
#

Tbh I recommend Vonovox although I don't use it as of now but I've tested it, I only don't use it because of not having enough slots for models as of the current update

#

Besides that it's all positives

rocky gate
#

Ah… awesome thanks. How many slots are there on estimate?

viral mason
#

From what I remember 8

#

Or 6

#

I have kind of a spotty memory

rocky gate
#

What! That’s bull… I guess I’ll go with Okada until they get that situated LOL

#

Thanks for the help, seriously!

viral mason
#

Deiteris has over 100 slots so I recommend that for now if you want a bunch of models!

rocky gate
#

Now that’s impressive, definitely what I’m looking for.

viral mason
#

Soon Vonovox will update to have a slot system that continues to increase dynamically but not sure when

#

Dr did say that'd be added tho eventually

deft pewter
#

but the thing is i love the new ui skull_sob

pastel oak
rose arrow
pastel oak
rose arrow
#

with WASAPI right?

pastel oak
#

Yes

pine mist
#

what is better than rvcgui

rose arrow
#

yea, it doesnt connect

pastel oak
# rose arrow yea, it doesnt connect

Then there could be an issue with your audio devices but can't pinpoint it atm. Client is MME by default and seems to cause issues and wasapi doesn't work at all even though it's a newer version
Have you tried the vonovox voice changer on the guide?

minor isle
#

how to fix voice echoing on discord

#

like other ppl's voices are coming through okada

faint loom
#

Hello 🙂 where is the right place to ask for okada help?

faint loom
#

Thank you! I cant get the vcclient to run properly with my 7900XTX. cuda 2078 beta works only with cpu (5800x3d) and stutters (it recognizes it as cpu-1). 214 alpha seems to recognize my gpu but is even slower (takes 10-20secs to voice mod) and dml does not work at all.

low shard
low shard
# faint loom Thank you! I cant get the vcclient to run properly with my 7900XTX. cuda 2078 be...

Please Elaborate:

  • your PC GPU (Such as Nvidia RTX 3060, AMD RX 9070 XT..)
  • your operating system (Like Windows 10, Windows 11, Linux, MacOS 26..)
  • what you want to do? There isn't a program that does everything, there's a program for each thing:
    • AI Covers
    • Train RVC Models
    • E girl trolling
    • TTS
    • Roleplay in VC
    • Roleplay in Games
    • etc
  • what tutorial link are you using
  • a screenshot of the program
low shard
# pine mist what is better than rvcgui

rvc gui is super outdated, it depends on what program ur even talking about

Please Elaborate:

  • your PC GPU (Such as Nvidia RTX 3060, AMD RX 9070 XT..)
  • your operating system (Like Windows 10, Windows 11, Linux, MacOS 26..)
  • what you want to do? There isn't a program that does everything, there's a program for each thing:
    • AI Covers
    • Train RVC Models
    • E girl trolling
    • TTS
    • Roleplay in VC
    • Roleplay in Games
    • etc
  • what tutorial link are you using
  • a screenshot of the program
rose arrow
#

GPU: RTX 4050
OS: Windows 11
Description: Im using Vonovox and it just says this:

low shard
rose arrow
#

yes and it said Setup Complete

low shard
#

Nvidia drivers came out yesterday

rose arrow
#

oh yea i just checked, i have windows and nvidia driver updates

low shard
#

:D

charred drum
#

what program can i use to separate a choir?

rose arrow
low shard
rose arrow
#

last question, do the settings save when you close the client? I dont see any "save settings" buttons like the other RVCs

low shard
#

and yes the settings should automatically save

rose arrow
pastel oak
#

old ass program
whats your gpu name and what are you trying to do with the voice changer, like trolling egirl catfishing etc

median drum
#

what's the best AI website to do text to speech with a imported file that sounds the best without paying

oak edge
#

@median drum

#

you can ask here

median drum
#

thanks

median drum
# oak edge

need help on how to make voice clone like this for free

simple ore
#

that looks like incompatible torch and torchaudio libraries installed

#

2.2.0 cuda12.1 is so old

#

requirements are

pine mist
low shard
# pine mist Rx6600 Windows 11 AI Covers idk

Your AMD GPU is good enough to do inference (use models) locally (on ur pc), not sure about training

You can:

  • Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides):
    • Applio (AMD Windows) : A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
  • Cloud (remote good pc, easier and faster than ur PC but it's limited):

Easiest possible (automatically separates vocals & instrumentals) : weights.com & rvc-ai-cover-maker-ui colab/kaggle
easiest cloud: Ilaria rvc zero
easiest local: Applio

simple ore
#

probably because python there is 3.12

#

faiss-gpu is

#

it is very old

#

you can change requirements and install faiss-cpu instead

#

on windows it installs cpu, on linux it installs gpu

#

faiss is for index

#

if you want to use that, then you need to downgrade python env to 3.10
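
A rough sketch of the choice described above, assuming the claims in this thread hold (faiss-gpu only makes sense on an older Python env, and Windows ends up with the CPU build anyway). `pick_faiss_package` is a hypothetical helper, not part of Applio or faiss, and the 3.10 cutoff is taken from the message above rather than checked against PyPI.

```python
import platform
import sys


def pick_faiss_package(py_version=None, system=None):
    """Return the faiss package name suggested in this thread (hypothetical helper)."""
    py_version = tuple(py_version or sys.version_info[:2])
    system = system or platform.system()
    # Per the thread: on Windows the CPU build is what gets installed.
    if system == "Windows":
        return "faiss-cpu"
    # Per the thread: faiss-gpu is old, so only consider it on Python 3.10 or lower.
    if py_version <= (3, 10):
        return "faiss-gpu"
    # Newer Python: swap the requirement to faiss-cpu instead of downgrading.
    return "faiss-cpu"


print(pick_faiss_package())
```

So on a Python 3.12 env the requirement would become faiss-cpu, and faiss-gpu only comes back if the env is rebuilt on 3.10.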

#

generally no

viral mason
#

it's done this 3 damn times in a row, I can't train anything like this


NameError Traceback (most recent call last)
Cell In[1], line 18
3 rot_47 = lambda encoded_text: "".join(
4 [
5 (
(...)
14 ]
15 )
17 new_name = rot_47("kmjbmvh_hg")
---> 18 branch_name = requests.get(rot_47(codecs.decode("pbbxa://oqbpcj.kwu/QIPqaxivw/Ixxtqw/zmtmiama/tibmab", "rot_13")), allow_redirects=True).url.split("/")[-1]
19 findme = rot_47(codecs.decode("pbbxa://oqbpcj.kwu/Dqlitvb/qurwg-mtnqvlmz.oqb", "rot_13"))
20 uioawhd = rot_47(codecs.decode("pbbxa://oqbpcj.kwu/QIPqaxivw/Ixxtqw.oqb", "rot_13"))

NameError: name 'requests' is not defined

viral mason
#

I haven't done anything differently but now it's bugged

simple ore
#

the code from above looks old

viral mason
#

I used it yesterday just fine

simple ore
#

create new notebook, import, then search

#

make sure you do create new notebook first, do not use import as 1st step

viral mason
simple ore
#

did you read what I said

#

click create, notebook, file, import notebook, then search 'applio' in github

#

maybe run each line manually and see what fails

#

you got them in order
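
A `NameError: name 'requests' is not defined` like the one above usually means the cell that imports the module never ran in this session. A minimal sketch of checking for that before running the failing cell, assuming you pass it the notebook's `globals()`; `missing_modules` and `import_missing` are hypothetical helpers, and the stdlib modules below are stand-ins for `requests` and `codecs`.

```python
import importlib


def missing_modules(names, namespace):
    """Return the module names that are not yet bound in `namespace`."""
    return [name for name in names if name not in namespace]


def import_missing(names, namespace):
    """Import each missing module and bind it under its own name."""
    for name in missing_modules(names, namespace):
        namespace[name] = importlib.import_module(name)


# Stand-in names so the sketch runs anywhere; in the notebook this would be
# something like ["requests", "codecs"] with namespace=globals().
needed = ["codecs", "json"]
ns = {}
print(missing_modules(needed, ns))  # both names are reported as missing
import_missing(needed, ns)
print(missing_modules(needed, ns))  # nothing left to import
```

If a name is still reported after this, the package itself is not installed in the runtime, which is a different problem from running cells out of order.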

viral mason
# simple ore

got it, someone should let them know to update the applio link soon bc that's kinda annoying to do

prisma grove
#

is 1 epoch per 2 minutes a normal speed for training

#

on a 2080 Ti

#

the docs don't say anything about training speed I checked

simple ore
#

and dataset size

prisma grove
#

already moved to applio colab

#

idk where to put the dataset zip in my drive

#

and if I should put ".zip" at the end of the path (I'm using the noUI version)

#

no clue what to do with any of these options either

simple ore
#

does it say .zip?

burnt hull
#

this is an ai server we are real men who prefer real woman

#

if you will troll, you must leave or be destroyed

#

he is a troll

#

ban him we dont need trolls ruining our servers

viral mason
#

we need more ppl like you Promtgod

marsh stratus
#

I have an amd gpu and didnt want to dual boot linux to do Ai/ML stuff. I managed to get rocm working on wsl but it was slower than using my cpu.

simple ore
marsh stratus
#

There is? I looked but couldnt find anything

simple ore
#

there are nightly builds

marsh stratus
#

interesting, im failing to import torch now due to some OS error, saying some dll couldnt be loaded. Might just be an unstable build?

#

OSError: [WinError 126] The specified module could not be found. Error loading "C:\Users\[User]\AppData\Roaming\Python\Python312\site-packages\torch\lib\shm.dll" or one of its dependencies. to be very specific.

Thanks for the info, ill keep an eye on the builds and see if one works with my setup

long chasm
#

why does the voice changer repeat everything it listens to, even the video

rocky gate
#

Hiya! How is the latency for y’all using WOkada Deiteris Fork? Is it different for AMD / Nvidia users?

red crescent
#

guys for the voice ai when i upload the rvc it doesn't work

rocky gate
quasi condor
#

those are outdated

rocky gate
#

Yippie indeed!

quasi condor
#

maybe u put the latency like 500 or higher thats why the latency was doo doo

rocky gate
#

I just did whatever I could to keep it from glitching. It was an MSI Delta 15. I was just wondering how the latency is for my newest build.

quasi condor
rocky gate
#

I honestly have no clue, that was two(ish) years ago. I know I had to convert the files within WOkada after uploading the voice.

quasi condor
#

or rmvpe

#

tho make sure u use the latest on okada on ur new pc

#

Deiteris w-okada

rocky gate
#

Yup, I have it installed along with my voices, just haven’t gotten the chance to try it yet. Just worried about the latency, I hope it’s close to when I’m speaking because before it would come through at least 2 seconds after…

rocky gate
#

Oh okay, that sucks! Oh well, better than nothing

simple ore
frozen tendon
#

how to make voice changer chunk sec

marsh stratus
#

ill try again after putting that file in system32 but that seems random to me

simple ore
#

it usually comes with VC++ redist
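
WinError 126 says "or one of its dependencies", so the first thing worth separating is whether the DLL named in the error is actually on disk. A minimal sketch, assuming you paste in the path from your own error message; `diagnose_dll` is a hypothetical helper and the wording of its results is only a rule of thumb.

```python
import tempfile
from pathlib import Path


def diagnose_dll(dll_path):
    """Say whether the DLL itself is missing or only something it depends on."""
    if Path(dll_path).is_file():
        # File exists, so the loader most likely failed on a dependency
        # (for example a missing runtime library).
        return "present: a dependency is likely missing"
    return "missing: the install itself is likely incomplete"


# Demo with a temporary file standing in for the real DLL path.
with tempfile.NamedTemporaryFile(suffix=".dll") as fake_dll:
    print(diagnose_dll(fake_dll.name))
print(diagnose_dll("no/such/folder/shm.dll"))
```

If the file is present, copying it somewhere else will not help; installing the missing runtime is the more likely fix.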

low shard
# red crescent guys for the voice ai when i upload the rvc it doens't work

This is a General AI Server, AI has many fields, so we can't know your issue with little info

Please Elaborate:

  • your PC GPU (Such as Nvidia RTX 3060, AMD RX 9070 XT..)
  • your operating system (Like Windows 10, Windows 11, Linux, MacOS 26..)
  • what you want to do? There isn't a program that does everything, there's a program for each thing:
    • AI Covers
    • Train RVC Models
    • TTS
    • E girl trolling
    • Roleplay in VC
    • Roleplay in Games
    • etc
  • what tutorial link are you using
  • a screenshot of the program
marsh stratus
simple ore
#

dont follow online answers, this is brand new shit

#

experimental shit

#

if you're trying to run training in applio there's another fix needed

#

some things are not implemented yet

marsh stratus
#

its not applio, its image inpainting

simple ore
#

show the full error message

quasi condor
#

wdym by that

marsh stratus
simple ore
#

whatever, looks like it is blowing up on attempt to init a distributed process group for multi-gpu

#

that has not been implemented in those wheels yet

marsh stratus
#

could this be because i also have integrated graphics?

simple ore
#

no

marsh stratus
#

anyway, thanks for the help

#

ill keep an eye on these wheels and use them when they are more developed

simple ore
#

probably can edit the file

#

Python312\site-packages\transformers\generation\utils.py

#

and set the value to False

#

synced_gpus = (is_deepspeed_zero3_enabled() or is_fsdp_managed_module(self)) and dist.get_world_size() > 1 -> synced_gpus = False

red crescent
cerulean fiber
#

heya, is ~3 minutes a normal time for an RVC epoch to take with a 45 minute dataset?
On a 8gb 4060

rocky gate
#

Having trouble getting WOkada to play through Voicemod…

drifting folio
rocky gate
#

Just upgraded

#

Bit of a noob with this since it’s been a couple years… not sure what I should have set for the input and output—

drifting folio
rocky gate
#

This is currently how it looks, and the input in voicemod is CABLE Output (VB-Audio Virtual Cable)

rocky gate
flint vapor
#

*i suggest you

fast crest
#

why are most of my f0s N/A? i cant set them

quasi condor
#

512 has a lot of latency

#

and dat boi to fcpe

#

cuz onnx is for amd

rocky gate
#

You all are so helpful, thank you so much!

queen fjord
#

Which AI cover are you using now?

#

Could you please tell me?

vast bough
#

Hi there, I am a live streamer on Twitch. I am roleplaying as a mafia person and want a real-time voice as a mob boss.

Is there a real-time AI voice on Voicemod to replicate a mobster?

If there is, I will get the pro version today.

Could some1 plz help with this request 🙏

broken urchin