simple ore Jun 1, 2025, 2:45 PM

#

god.. who's using teamspeak in 2025

young dirge Jun 1, 2025, 2:52 PM

#

simple ore god.. who's using teamspeak in 2025

hahaha. I use it for bannerlord where we got somewhat of a RP medieval clan. But thanks man you really did help me alot It now works on every platform!

copper phoenix Jun 1, 2025, 3:51 PM

#

Hello, can anyone tell me which Colab is currently used to make AI Covers and Models?

odd isle Jun 1, 2025, 4:21 PM

#

is it normal for the output headphones to sound a little better than cable output?

real depot Jun 1, 2025, 6:05 PM

#

Hello everyone!
I’m looking for help to create a custom Arabic RVC voice model for a gift.
I want to make a voice similar to Gulf Arabic artists like Ouzii / Luigii style, for a personal use (not commercial).

If anyone is available to help me build the model, I would really appreciate it 🩷
I’m ready to pay for the service if needed.
Thank you so much 🙏🏼

charred marten Jun 1, 2025, 6:52 PM

#

Which voice changers are free?

viral mason Jun 1, 2025, 6:56 PM

#

is Mel-Roformer-Denoise-Aufr33 better or Mel-Roformer-Denoise-Aufr33-Aggr

#

specifiacally speaking when using this
https://huggingface.co/spaces/TheStinger/UVR5_UI

UVR5 UI - a Hugging Face Space by TheStinger

viral mason Jun 1, 2025, 7:05 PM

#

charred marten Which voice changers are free?

this one is and I can help u set it up in dms
https://huggingface.co/Shadicti/deiteris-Fork/resolve/main/voice-changer-windows-nvidia-b2332.zip?download=true

charred marten Jun 1, 2025, 7:07 PM

#

viral mason this one is and I can help u set it up in dms https://huggingface.co/Shadicti/de...

Yes pls

acoustic coral Jun 1, 2025, 7:18 PM

#

bro i can not find the rvc download anywhere

viral mason Jun 1, 2025, 7:26 PM

#

acoustic coral bro i can not find the rvc download anywhere

what are you looking for specifically?

acoustic coral Jun 1, 2025, 7:28 PM

#

i used to have it it was just called rvc and i rember getting it off git hub, you would import the zip and put in a pre recorded audio and then it would be ai voice so

viral mason Jun 1, 2025, 7:28 PM

#

are you trying to train a model?

jaunty bloom Jun 1, 2025, 8:57 PM

#

I wanted to train a voice model, but the link i usually use for the rvc fork is down. Does anyone have another link?

delicate whale Jun 1, 2025, 9:36 PM

#

Is their a cracked version of Voice Mod?

charred marten Jun 1, 2025, 9:51 PM

#

Is there a way to fix static or is it just a mic issue?

mighty sinew Jun 1, 2025, 10:32 PM

#

using applio and recieving this error An error occurred during audio conversion: index -1 is out of bounds for axis 0 with size 0 how to fix?

shell gulch Jun 1, 2025, 10:36 PM

#

i forgot what is the best ai cover maker rn?

prisma kettle Jun 1, 2025, 10:57 PM

#

What happened to the separate channels for the RVC help

prisma kettle Jun 1, 2025, 10:58 PM

#

shell gulch i forgot what is the best ai cover maker rn?

I use weights

prisma kettle Jun 1, 2025, 11:18 PM

#

I'm not getting any output through the virtual cable doing realtime, anyone know why?

#

I can hear it when I set my monitor to headphones but nothing comes through on discord or in sound control panel

brittle wing Jun 1, 2025, 11:35 PM

#

hi, does anyone know, how to check my model in training process (weights.com)? because it says that it already created, but i cant use it and even see in the list. I just wanna know what time left

charred marten Jun 2, 2025, 12:15 AM

#

Is there a way to get rid of like the weird voice cracks?

analog obsidian Jun 2, 2025, 12:20 AM

#

charred marten Is there a way to get rid of like the weird voice cracks?

no but there's a way to decrease them

#

well different ways actually

#

one is training a better model

#

then the other way is enabling fp32 mode and pray if your current model is going to get any better (if the model was trained in fp16 i don't think enabling fp32 will help that much since the model is already fried inside)

charred marten Jun 2, 2025, 12:24 AM

#

analog obsidian one is training a better model

how do i do that?

analog obsidian Jun 2, 2025, 12:25 AM

#

charred marten how do i do that?

train the model using a considerable big dataset, around 40 mins ~ 1 hour

#

i noticed voice cracks happen because the f0 estimator being mid (they're not that great sadly) but also because the model is trying to do a sound it doesn't know how to reproduce

charred marten Jun 2, 2025, 12:26 AM

#

analog obsidian train the model using a considerable big dataset, around 40 mins ~ 1 hour

Yeah I mean how do I do that in itself?

analog obsidian Jun 2, 2025, 12:27 AM

#

charred marten Yeah I mean how do I do that in itself?

https://docs.aihub.gg/essentials/how-to-make-voice-models/

How to Make Voice Models

In the context of RVC, the dataset is an audio file containing the voice the model will replicate. It can be either speaking or singing.

fleet epoch Jun 2, 2025, 2:52 AM

#

i am using rvc ai cover maker and when im trying to make a cover it at export audio and output information it says error

acoustic coral Jun 2, 2025, 3:50 AM

#

viral mason are you trying to train a model?

no

viral mason Jun 2, 2025, 3:53 AM

#

acoustic coral no

what are u doing then?

acoustic coral Jun 2, 2025, 3:55 AM

#

okay so it was an app. called RVC. i had it on my old computer. And you couldint train voices but you would import zips upload an mp3 click how many. "ceces" i thinnk it was called then it would export it in the voice

viral mason Jun 2, 2025, 3:56 AM

#

acoustic coral okay so it was an app. called RVC. i had it on my old computer. And you couldint...

@viscid moss I think u need to translate this for me

acoustic coral Jun 2, 2025, 3:56 AM

#

omfg

#

dude

#

let me do smth rq

#

🥀

#

im gonna make a drawing of what i remeber the app looking like

viscid moss Jun 2, 2025, 4:19 AM

#

viral mason <@274566299349155851> I think u need to translate this for me

hhmmm idk

junior gull Jun 2, 2025, 5:25 AM

#

Heya, just wondering how do we get ranked up so we can share our RVC and TTS models? Also been doing ton of learning, testing, and ai training and would love to soon share some of the work I have. Making a reliable compact modular system that can spontaneously regenerate your custom AI on almost any platform with training/memory enabled, very lightweight and 100% portable.
Having a blast learning about all this and making stuff and would love the chance to share too!

wicked crow Jun 2, 2025, 6:15 AM

#

@junior gull maybe check https://discord.com/channels/1159260121998827560/1305527335646269440 ?

potent bone Jun 2, 2025, 6:24 AM

#

odd isle is it normal for the output headphones to sound a little better than cable outpu...

yeah i think so

viral ruin Jun 2, 2025, 6:37 AM

#

getting this on kaggle when starting applio:

#

PyngrokNgrokError: The ngrok process errored on start: authentication failed: The authtoken you specified does not look like a proper ngrok tunnel authtoken.\nYour authtoken: token\nInstructions to install your authtoken are on your ngrok dashboard:\nhttps://dashboard.ngrok.com/get-started/your-authtoken\r\n\r\nERR_NGROK_105\r\n.

knotty moth Jun 2, 2025, 7:09 AM

#

junior gull Heya, just wondering how do we get ranked up so we can share our RVC and TTS mod...

https://discord.com/channels/1159260121998827560/1305527335646269440 there's no level requirement to submit there, and the model quality matters

fallen crater Jun 2, 2025, 7:56 AM

#

where can i download onnx models most of them are pth models there

junior gull Jun 2, 2025, 12:40 PM

#

knotty moth https://discord.com/channels/1159260121998827560/1305527335646269440 there's no ...

thanks, that channel was collapsed before didn't' see it.

digital bear Jun 2, 2025, 1:51 PM

#

hey i'm currently using the w okada fork but i can't seem to find a way to prevent the voice changer from picking up my laptop's fans and making unwanted noises

simple ore Jun 2, 2025, 1:52 PM

#

digital bear hey i'm currently using the w okada fork but i can't seem to find a way to preve...

what GPU?

hard parrot Jun 2, 2025, 2:18 PM

#

why rvc gui not opening
is it a error?

Running with the system Python.
Nie moPress any key to continue . . .

maiden latch Jun 2, 2025, 2:19 PM

#

where is a more up-to-date tutorial on how to use colab for w-okoda's realtime voice changer thing?
i dont understand anything and the tutorial im following isint working

digital bear Jun 2, 2025, 2:28 PM

#

simple ore what GPU?

i switched to another mic and it's better, i have the gtx 1660 ti

#

laptop

pastel oak Jun 2, 2025, 2:39 PM

#

maiden latch where is a more up-to-date tutorial on how to use colab for w-okoda's realtime v...

Colabs are generally broken, youre best of using kaggle

#

But whats your gpu first of all

maiden latch Jun 2, 2025, 2:39 PM

#

it's like an amd 580, it's really not the greatest so thats why i wanted to do it online since the app doesnt work too great on my pc

pastel oak Jun 2, 2025, 2:40 PM

#

maiden latch it's like an amd 580, it's really not the greatest so thats why i wanted to do i...

Fork wokada would work decently, id say give it a try first

#

https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/

Deiteris' W Okada Fork

Last update: May 5, 2025

maiden latch Jun 2, 2025, 2:40 PM

#

was about to ask where to find that thank you

pastel oak Jun 2, 2025, 2:40 PM

#

Else, for the online hosted, follow this
https://docs.aihub.gg/rvc-voice-changer/cloud/w-okada-kaggle/

W Okada Kaggle

Last update: May 5, 2025

maiden latch Jun 2, 2025, 2:41 PM

#

tysm

hard parrot Jun 2, 2025, 2:43 PM

#

why rvc gui not opening
is it a error?

Running with the system Python.
Nie moPress any key to continue . . .

maiden latch Jun 2, 2025, 2:50 PM

#

pastel oak Else, for the online hosted, follow this https://docs.aihub.gg/rvc-voice-changer...

i've cloned the public kaggle thing but i cant find "Notebook Options"

simple ore Jun 2, 2025, 2:53 PM

#

maiden latch i've cloned the public kaggle thing but i cant find "Notebook Options"

I guess the accelerator part here

maiden latch Jun 2, 2025, 2:54 PM

#

k

simple ore Jun 2, 2025, 2:56 PM

#

they may have shifted some options recently

round bear Jun 2, 2025, 3:10 PM

#

is "Realtime Voice Changer Client" still good?

hard parrot Jun 2, 2025, 3:38 PM

#

i just installed applio rvc and this shows up
Please run 'run-install.bat' first to set up the environment.
Press any key to continue . . .

#

what i need to do

crude flame Jun 2, 2025, 3:45 PM

#

hard parrot i just installed applio rvc and this shows up Please run 'run-install.bat' firs...

would you be suprised if i said you need to run "run-install.bat" first

simple ore Jun 2, 2025, 3:47 PM

#

No, it means they ran run-applio.bat it as admin

#

@hard parrot

digital bear Jun 2, 2025, 3:59 PM

#

by the way is there a way to make it sound better my models sound nowhere near as good as the samples

maiden latch Jun 2, 2025, 4:01 PM

#

maiden latch i've cloned the public kaggle thing but i cant find "Notebook Options"

okay so i've done all of this
is there a way to save this so that i dont have to do the setup the next time i want to try and run this?

analog obsidian Jun 2, 2025, 4:01 PM

#

digital bear by the way is there a way to make it sound better my models sound nowhere near a...

there is a slight quality degradation while using a model in realtime
u could try max extra value, crossfade set to 0.1s, and enabling fp32
if the model still sounds bad then its a model issue and the only way to fix is to get another model or train one yourself

mental pine Jun 2, 2025, 4:27 PM

#

my twiter is blocked how do i login

#

to my ai hub

#

anyone help me

fleet cedar Jun 2, 2025, 6:53 PM

#

can anyone help me why is this lagging

pastel oak Jun 2, 2025, 7:04 PM

#

fleet cedar can anyone help me why is this lagging

Looks like your GPU is not strong enough for the chunk you chose

#

Chunk (the 512.0 ms) has to be higher than perf

#

Whats your gpu

fleet cedar Jun 2, 2025, 7:05 PM

#

pastel oak Looks like your GPU is not strong enough for the chunk you chose

im rtx 4060

fleet cedar Jun 2, 2025, 7:05 PM

#

pastel oak Whats your gpu

pastel oak Jun 2, 2025, 7:05 PM

#

fleet cedar im rtx 4060

2.7 💀

#

do chunk 200 and extra 2.7

fleet cedar Jun 2, 2025, 7:05 PM

#

idk bro

pastel oak Jun 2, 2025, 7:05 PM

#

f0 det rmvpe

fleet cedar Jun 2, 2025, 7:06 PM

#

pastel oak f0 det rmvpe

alr trying

fleet cedar Jun 2, 2025, 7:06 PM

#

pastel oak f0 det rmvpe

tsym gng

#

goated

#

it works

round orbit Jun 2, 2025, 7:33 PM

#

can anybody help me with the error of vol0000?

#

line 1 is working, but sound is not being played. I reinstalled the program and components ~5 times, but no changes.

jaunty anvil Jun 2, 2025, 8:04 PM

#

so for videos like the president stuff, is elevenlabs the best to do that stuff?

fleet cedar Jun 2, 2025, 8:27 PM

#

guys when i speak how do i make it loud for others

#

monitor thing doesnt work

languid cliff Jun 2, 2025, 8:41 PM

#

fleet cedar guys when i speak how do i make it loud for others

Make sure your input is at 100%, pretty sure its 10% at default

jaunty anvil Jun 2, 2025, 8:49 PM

#

is elevenlabs the best thing for the tts videos or is there a better alternative?

simple ore Jun 2, 2025, 9:48 PM

#

jaunty anvil is elevenlabs the best thing for the tts videos or is there a better alternative...

chatterbox is very good and free and may be better than 11labs for english

jaunty anvil Jun 2, 2025, 9:49 PM

#

simple ore chatterbox is very good and free and may be better than 11labs for english

awesome, thanks

#

...and i'm not a coder, shit.

simple ore Jun 2, 2025, 9:52 PM

#

it installs with one command line

jaunty anvil Jun 2, 2025, 9:52 PM

#

where do i put that command?

simple ore Jun 2, 2025, 9:52 PM

#

as long as you have python installed, ideally 3.10 or 3.11

jaunty anvil Jun 2, 2025, 9:52 PM

#

i uh don't have python at all lmao

#

i'll get on that

simple ore Jun 2, 2025, 9:52 PM

#

dont use 3.12 or 3.13

jaunty anvil Jun 2, 2025, 9:58 PM

#

got this

simple ore Jun 2, 2025, 10:01 PM

#

from command line

#

not from python

jaunty anvil Jun 2, 2025, 10:01 PM

#

oh

simple ore Jun 2, 2025, 10:02 PM

#

#

somewhat more proper way of doing it without using global repository

jaunty anvil Jun 2, 2025, 10:03 PM

#

this correct?

simple ore Jun 2, 2025, 10:03 PM

#

it probably installs cpu version of the torch

#

if you run nvidia gpu you may need to change it to cuda version

jaunty anvil Jun 2, 2025, 10:04 PM

#

i'm a bit new to all this stuff

simple ore Jun 2, 2025, 10:05 PM

#

anyway, it can run on CPU as well, that's fine to just test it

jaunty anvil Jun 2, 2025, 10:05 PM

#

but i am doing the correct command in the right place now

simple ore Jun 2, 2025, 10:05 PM

#

you are installing it in to the global repository

#

usually not a good idea to do that with multiple project as there are often conflicting versions of the libraries

jaunty anvil Jun 2, 2025, 10:06 PM

#

any idea how to uninstall it?

simple ore Jun 2, 2025, 10:06 PM

#

dont worry too much, you can install it properly later

jaunty anvil Jun 2, 2025, 10:06 PM

#

alright

#

i mean i just restarted the installation like three times already lmao

#

alright it was giving me a lot of errors

#

idk what to do

simple ore Jun 2, 2025, 10:11 PM

#

worked fine for me

#

#

I bet that's that wheel creation thing

jaunty anvil Jun 2, 2025, 10:12 PM

#

wheel creation?

simple ore Jun 2, 2025, 10:12 PM

#

jaunty anvil Jun 2, 2025, 10:12 PM

#

i just don't want the files cramped in my c drive

#

i already installed and cancelled it like 3 times to troubleshoot

simple ore Jun 2, 2025, 10:12 PM

#

well, by default the global repository is on C

jaunty anvil Jun 2, 2025, 10:12 PM

#

yeah but i couldn't find it

#

no folder saying global repository

simple ore Jun 2, 2025, 10:13 PM

#

under c:\users\user\appdata\local\programs\python\python310\libs\site-packages

#

that's the global repository

#

to make a local you need to use #✨│ai-help message

jaunty anvil Jun 2, 2025, 10:15 PM

#

python isn't in programs

#

simple ore Jun 2, 2025, 10:17 PM

#

jaunty anvil Jun 2, 2025, 10:17 PM

#

simple ore Jun 2, 2025, 10:18 PM

#

unless you've installed python to somewhere else

jaunty anvil Jun 2, 2025, 10:18 PM

#

OHH i got it from the microsoft store

simple ore Jun 2, 2025, 10:19 PM

#

https://www.python.org/ftp/python/3.10.11/python-3.10.11-amd64.exe

#

make sure you check [x] add python to path

jaunty anvil Jun 2, 2025, 10:19 PM

#

i'm just not cut out for this shit 😭

simple ore Jun 2, 2025, 10:20 PM

#

most of AI project is not for beginners

jaunty anvil Jun 2, 2025, 10:20 PM

#

so uh

#

i guess i should just do elevenlabs?

simple ore Jun 2, 2025, 10:20 PM

#

but you can ask chatgpt to explain things

jaunty anvil Jun 2, 2025, 10:20 PM

#

is that the best for beginners that want to export models here and make tts videos and shit?

simple ore Jun 2, 2025, 10:21 PM

#

if you install chatterbox is it really simple to use

jaunty anvil Jun 2, 2025, 10:21 PM

#

alright i'll keep working

#

https://youtu.be/trgPAtcVNfQ
following this video but the commands don't exactly work

coral haven Jun 2, 2025, 10:31 PM

#

my vcc thing isnt working. help

fleet cedar Jun 2, 2025, 10:43 PM

#

how do i fix this being loud at the start when i speak then it gets normal

desert linden Jun 2, 2025, 10:49 PM

#

how do i fix this error:

#

The distutils package is deprecated and slated for removal in Python 3.12. Use setuptools or check PEP 632 for potential alternatives
from distutils.util import strtobool

simple ore Jun 2, 2025, 10:56 PM

#

not an error

tender sand Jun 2, 2025, 10:57 PM

#

my rvc wont work, my friend is just saying its playing a hello every few seconds and my audio wont go through at all

#

nvm got it to work

desert linden Jun 2, 2025, 11:21 PM

#

simple ore not an error

Timer: 00:00:48/content/voice-changer/server/HVoice.py:3: DeprecationWarning: The distutils package is deprecated and slated for removal in Python 3.12. Use setuptools or check PEP 632 for potential alternatives
from distutils.util import strtobool
Traceback (most recent call last):
File "/content/voice-changer/server/HVoice.py", line 10, in <module>
from downloader.SampleDownloader import downloadInitialSamples
File "/content/voice-changer/server/downloader/SampleDownloader.py", line 12, in <module>
from voice_changer.RVC.RVCModelSlotGenerator import RVCModelSlotGenerator
File "/content/voice-changer/server/voice_changer/RVC/RVCModelSlotGenerator.py", line 4, in <module>
import torch
ModuleNotFoundError: No module named 'torch'
WARNING:pyngrok.process.ngrok:t=2025-06-02T23:20:37+0000 lvl=warn msg="Stopping forwarder" name=http-41369-e68eec6b-0315-4a59-870c-6d6c66810395 acceptErr="failed to accept connection: Listener closed"
--------- SERVER STOPPED! ---------

#

this is the full error

simple ore Jun 2, 2025, 11:22 PM

#

seems like you tried to install requirements and it failed. You may want to grab a prebuilt package.

#

but again this looks like colab?

desert linden Jun 2, 2025, 11:22 PM

#

yeah im using colab

simple ore Jun 2, 2025, 11:23 PM

#

with realtime rvc?

#

it is dead

desert linden Jun 2, 2025, 11:23 PM

#

whattttt

simple ore Jun 2, 2025, 11:23 PM

#

unless someone fixes the install part

desert linden Jun 2, 2025, 11:23 PM

#

omg ur kidding

#

when did it stop working

#

i used it litch months ago

simple ore Jun 2, 2025, 11:23 PM

#

show me the url

desert linden Jun 2, 2025, 11:24 PM

#

https://colab.research.google.com/github/hinabl/voice-changer-colab/blob/master/Hina_Modified_Realtime_Voice_Changer_on_Colab.ipynb#scrollTo=lLWQuUd7WW9U

Google Colab

simple ore Jun 2, 2025, 11:25 PM

#

yeah, outdated af, i aint gonna touch it

desert linden Jun 2, 2025, 11:25 PM

#

fml

#

i never had issues w it before

#

do u happen to know an updated version?

#

that supports mac?

simple ore Jun 2, 2025, 11:26 PM

#

well, maybe not oudates, but something is wrong with requrements install perhaps

desert linden Jun 2, 2025, 11:26 PM

#

how do i fix it

simple ore Jun 2, 2025, 11:27 PM

#

#

yeah, f that

#

colab is using python 3.11, this package has only 3.10 wheel

desert linden Jun 2, 2025, 11:29 PM

#

oh

#

so no way to fix it?

simple ore Jun 2, 2025, 11:30 PM

#

The last fix for this colab was done 5 month ago

#

desert linden Jun 2, 2025, 11:30 PM

#

rip lol

simple ore Jun 2, 2025, 11:31 PM

#

in that time I had to fix applio colab like 5 times already

desert linden Jun 2, 2025, 11:31 PM

#

ugh

violet rose Jun 3, 2025, 12:05 AM

#

what is the best plz ?

#

fleet cedar Jun 3, 2025, 12:20 AM

#

whats wrong with this rvc not sending audio

#

to the vc

safe echo Jun 3, 2025, 12:55 AM

#

hi does anyone know how to stop this happening? i cant delete slots with the edit button as all my slots are coming up as "blank" from previous voices

#

is there another version? xd

#

Is there a way to use without internet connection/chrome tab?

#

like a built in program?

analog obsidian Jun 3, 2025, 1:11 AM

#

safe echo Is there a way to use without internet connection/chrome tab?

no, but i recommend you to use vonovox
its faster than wokada and doesnt run in your browser

safe echo Jun 3, 2025, 1:11 AM

#

oh, do you have a link/guide for that please?

analog obsidian Jun 3, 2025, 1:12 AM

#

safe echo oh, do you have a link/guide for that please?

https://github.com/dr87/Vonovox

GitHub

GitHub - dr87/Vonovox: Realtime AI Voice Converter for NVIDIA GPUs

Realtime AI Voice Converter for NVIDIA GPUs. Contribute to dr87/Vonovox development by creating an account on GitHub.

#

tutorial is there, just scroll down until you find it

safe echo Jun 3, 2025, 1:12 AM

#

thank you very much

analog obsidian Jun 3, 2025, 1:12 AM

#

nvidia only

safe echo Jun 3, 2025, 1:13 AM

#

gotcha, im on RTX 3070 TI.

analog obsidian Jun 3, 2025, 1:13 AM

#

misc_lets_fucking_go

safe echo Jun 3, 2025, 1:13 AM

#

oh, do i have to compile it myself?

crude flame Jun 3, 2025, 1:14 AM

#

just download it and run setup.bat

analog obsidian Jun 3, 2025, 1:14 AM

#

safe echo oh, do i have to compile it myself?

no dont worry, setup.bat will just download the required stuff to run it

crude flame Jun 3, 2025, 1:14 AM

#

then once thats done run start.bat

safe echo Jun 3, 2025, 1:14 AM

#

im only finding the source code downloads on the "downloads" tab

analog obsidian Jun 3, 2025, 1:14 AM

#

download the repo as zip

#

click code > download zip

crude flame Jun 3, 2025, 1:15 AM

#

safe echo oh, do you have a link/guide for that please?

there is a guide for it https://docs.aihub.gg/rvc-voice-changer/local/vonovox/ its kinda bare atm but it explains stuff

safe echo Jun 3, 2025, 1:16 AM

#

analog obsidian click code > download zip

legendary, thank you thats the one

#

excited to play around with it 👍

analog obsidian Jun 3, 2025, 1:18 AM

#

good luck! vonovox is great

#

dev is active in the rvc community

crude flame Jun 3, 2025, 1:19 AM

#

analog obsidian good luck! vonovox is great

kinda wild how like 2 months ago we were sus of vonovox

#

things change quick

analog obsidian Jun 3, 2025, 1:20 AM

#

yeah lol

safe echo Jun 3, 2025, 1:25 AM

#

would you say the performance of vono is better than rvc?

crude flame Jun 3, 2025, 1:25 AM

#

rvc does not mean realtime voice changer

#

vonovox is faster than w-okada

safe echo Jun 3, 2025, 1:26 AM

#

oh, what should i use to refer to the web based version?

crude flame Jun 3, 2025, 1:26 AM

#

w-okada

safe echo Jun 3, 2025, 1:26 AM

#

gotcha, that would make sense.

crude flame Jun 3, 2025, 1:26 AM

#

if you talking about the realtime version

safe echo Jun 3, 2025, 1:26 AM

#

i am yes.

crude flame Jun 3, 2025, 1:27 AM

#

then yea, w-okada

safe echo Jun 3, 2025, 1:27 AM

#

Thank you, been using okada for a while, but i feel like my card can be utilised better.

#

unsure how to word that but yeah

analog obsidian Jun 3, 2025, 1:28 AM

#

wokada is poorly optimized yup, the fork dev tried his best to improve it

#

vonovox is a completely new software, so things are better

safe echo Jun 3, 2025, 1:29 AM

#

awesome Cool_Doge

analog obsidian Jun 3, 2025, 1:29 AM

#

in terms of perfomance

#

cat_dance

safe echo Jun 3, 2025, 1:29 AM

#

long install time via the bat xD

#

getting there

#

do i use launcher once that says complete?

#

or start

simple ore Jun 3, 2025, 1:33 AM

#

I think there's now some way to conver the whole pytorch model thing to just a cuda kernel

#

for super fast performance

safe echo Jun 3, 2025, 1:34 AM

#

oh, vonovox shows my folders empty where my pth files are, thats odd. (oh, vono uses pth not safetensor)

analog obsidian Jun 3, 2025, 1:56 AM

#

safe echo oh, vonovox shows my folders empty where my pth files are, thats odd. (oh, vono ...

dev said he's going to add safetensors soon

safe echo Jun 3, 2025, 1:56 AM

#

where can i stay updated with that please? as all my main voices are on safetensor

analog obsidian Jun 3, 2025, 1:56 AM

#

safe echo where can i stay updated with that please? as all my main voices are on safetens...

join vonovox discord server (it's in the github repo)

safe echo Jun 3, 2025, 1:58 AM

#

you've been so helpful, thank you lyery!

safe echo Jun 3, 2025, 2:28 AM

#

is there any good places to find pth files?

#

usually use weights, but im unsure if its just safetensor there

simple ore Jun 3, 2025, 2:30 AM

#

voice-models.com

crude flame Jun 3, 2025, 2:31 AM

#

safe echo is there any good places to find pth files?

https://discord.com/channels/1159260121998827560/1175430844685484042

safe echo Jun 3, 2025, 2:31 AM

#

thank you

knotty moth Jun 3, 2025, 2:44 AM

#

safe echo is there any good places to find pth files?

valid vine Jun 3, 2025, 2:59 AM

#

I'm about to stab the hugging face website 23 times on the aides of march

#

I want to download deepseek prover V2 671B but there's 163 files and nothing I do works

#

I've tried git, aria, Jdownloader, and the python CLI

simple ore Jun 3, 2025, 3:11 AM

#

git lfs?

valid vine Jun 3, 2025, 3:12 AM

#

idk if it doesn't exist or what

#

I'm on windows btw

#

oh wait I found it

#

how do I download it using that?

simple ore Jun 3, 2025, 3:14 AM

#

the usual git clone of the repo?

valid vine Jun 3, 2025, 3:16 AM

#

where does one find that? it's not the link at the top of the website or the one the files are coming from apparently

#

simple ore Jun 3, 2025, 3:17 AM

#

wihout /resolve/main ?

valid vine Jun 3, 2025, 3:17 AM

#

my god finally

#

tysm

#

I've spent like an hour + on this

spice nacelle Jun 3, 2025, 3:23 AM

#

does anyone know of any text to speech models that run locally or on web and are not as expensive as eleven labs since I create audio's of over 1hr every other day

red kayak Jun 3, 2025, 5:51 AM

#

@candid basin

#

Is very likely that it's your datasets issue

#

Care to show me a short sample of the audio u are using

candid basin Jun 3, 2025, 5:52 AM

#

red kayak Is very likely that it's your datasets issue

i still have not used my dataset.i am aksing if possible to lett my i can finetunn it on my dataset

candid basin Jun 3, 2025, 5:52 AM

#

red kayak Is very likely that it's your datasets issue

i just tried the hugind space veriosn

red kayak Jun 3, 2025, 5:53 AM

#

Well training is fine tuning so yes you can definitely fine tune

red kayak Jun 3, 2025, 5:53 AM

#

candid basin i just tried the hugind space veriosn

Which one

candid basin Jun 3, 2025, 5:54 AM

#

red kayak Which one

https://huggingface.co/spaces/ResembleAI/Chatterbox

red kayak Jun 3, 2025, 5:54 AM

#

Yeah that's a zero shot tts

candid basin Jun 3, 2025, 5:55 AM

#

red kayak Yeah that's a zero shot tts

is threre any git of python example code on how i can finetune one on my dataset ?
thnks fo ryour help

or any guide

red kayak Jun 3, 2025, 5:57 AM

#

candid basin is threre any git of python example code on how i can finetune one on my dataset...

Ah you simply want to fine tune a chatter box model. In that case go to their git hub and look through their read me. Perhaps they'll have some info there regarding model training which I think they do. I thought you were asking about RVC initially

candid basin Jun 3, 2025, 5:59 AM

#

red kayak Ah you simply want to fine tune a chatter box model. In that case go to their gi...

:] unfortunately they do not :p .I thought to ask her ein order to skip some time searching :]
thank a lot for the help!!

red kayak Jun 3, 2025, 5:59 AM

#

candid basin :] unfortunately they do not :p .I thought to ask her ein order to skip som...

Aww sucks; well good luck finding some sources then. Perhaps a quick Google query might do the trick

viral ruin Jun 3, 2025, 6:07 AM

#

anyone has a RVC colab link that works??

#

for training

jaunty shale Jun 3, 2025, 10:16 AM

#

I tried it and it's insane how you can make the vocals sound so much better

#

thank you!

hot violet Jun 3, 2025, 10:17 AM

#

jaunty shale thank you!

your welcome bro

junior gull Jun 3, 2025, 12:37 PM

#

Is this W okadas? It looks a little different.

broken urchin Jun 3, 2025, 12:37 PM

#

junior gull Is this W okadas? It looks a little different.

yes its the W Okada Deiteris Fork

junior gull Jun 3, 2025, 12:40 PM

#

broken urchin yes its the W Okada Deiteris Fork

Awesome, i just found that and downloaded it… Definitely have to check out the improvements!

broken urchin Jun 3, 2025, 12:40 PM

#

junior gull Awesome, i just found that and downloaded it… Definitely have to check out the ...

alright bet i can even help you if you need some help ❤️

junior gull Jun 3, 2025, 12:48 PM

#

broken urchin alright bet i can even help you if you need some help ❤️

I probably won’t get to check it out until tonight, but I may hit you up on that if it gives me any trouble setting it up. Thank you!

broken urchin Jun 3, 2025, 12:48 PM

#

junior gull I probably won’t get to check it out until tonight, but I may hit you up on that...

alright bet

frigid arrow Jun 3, 2025, 1:41 PM

#

who can make me a index and pth

graceful jettyBOT Jun 3, 2025, 1:41 PM

#

🎉 | Jawh leveled up!
ℹ | Level up messages can be disabled for the guild with owo level disabletext

frigid arrow Jun 3, 2025, 1:41 PM

#

i try everything . shi isnt working

knotty moth Jun 3, 2025, 1:54 PM

#

frigid arrow who can make me a index and pth

literally no one

golden walrus Jun 3, 2025, 3:39 PM

#

Guys. Can i ask if there is any voice changer that can use spin embedder?

#

cat_pawbite

crude flame Jun 3, 2025, 3:42 PM

#

golden walrus Guys. Can i ask if there is any voice changer that can use spin embedder?

vonovox

graceful jettyBOT Jun 3, 2025, 3:42 PM

#

🎉 | Razer leveled up!
blank | Extra rewards were added for missing levels

crude flame Jun 3, 2025, 3:42 PM

#

graceful jetty 🎉 **| Razer** leveled up! <:blank:427371936482328596> **|** Extra rewards were ...

ewwwwwww

analog obsidian Jun 3, 2025, 3:43 PM

#

graceful jetty 🎉 **| Razer** leveled up! <:blank:427371936482328596> **|** Extra rewards were ...

💀

golden walrus Jun 3, 2025, 3:47 PM

#

crude flame vonovox

cat_blush btw, do you know if it's okay to pair spin with KLM 4? Or the experimental one that SSS made in pretrain lah.

crude flame Jun 3, 2025, 3:48 PM

#

golden walrus <:cat_blush:1159361904301580300> btw, do you know if it's okay to pair spin with...

if it was made with 7_12 spin then yea

golden walrus Jun 3, 2025, 3:48 PM

#

cat_blush

#

Thank you so much

cosmic epoch Jun 3, 2025, 4:09 PM

#

can someone give me a link to a colab where i can train models (one that isn't applio)?

simple ore Jun 3, 2025, 4:16 PM

#

good luck with that

median monolith Jun 3, 2025, 6:16 PM

#

What limitations (if any) does the Weights voice model training feature have regarding audio quality compared to a local or cloud training like "Mainline Collab" or Applio?
Like, whats the max ammount of khs that the Weights training (USING A PREMIUM TRAINING) uses/supports of an audio for example.
Or if it adds any sort of compression to the dataset/final model audio no matter what format and properties it has.
Im only certain of the fact that you cannot upload very heavy audios for the training, wich means you will mostly not be able to use a max quality wav dataset for example.

viral mason Jun 3, 2025, 6:23 PM

#

do not train on weights.gg yt_nails

arctic trail Jun 3, 2025, 6:31 PM

#

how do i even download is there a tutorial

viral mason Jun 3, 2025, 6:32 PM

#

arctic trail how do i even download is there a tutorial

what are you trying to download?

arctic trail Jun 3, 2025, 6:32 PM

#

viral mason what are you trying to download?

voice changer

viral mason Jun 3, 2025, 6:32 PM

#

bet I can help u, dm me ^^

viral mason Jun 3, 2025, 6:46 PM

#

median monolith What limitations (if any) does the Weights voice model training feature have reg...

take this example, both used the same dataset and one sounds better than the other (I didn't use premium because ai should be FREE)

#

idk any of the complicated stuff I can only provide this kinda info

#

I'm no nerd..

median monolith Jun 3, 2025, 6:55 PM

#

viral mason take this example, both used the same dataset and one sounds better than the oth...

yeah, that premium thing is... eh... but atleast you can get free premium trainings for each 5 day streak successfully made. thats something I suppose ¯_(ツ)_/¯

median monolith Jun 3, 2025, 6:55 PM

#

viral mason I'm no nerd..

me neither i guess 😐

median monolith Jun 3, 2025, 7:22 PM

#

viral mason take this example, both used the same dataset and one sounds better than the oth...

Jesus, now that im really analazing, the Weights one is just depressing comparing to tbh

viral mason Jun 3, 2025, 7:25 PM

#

median monolith Jesus, now that im really analazing, the Weights one is just depressing comparin...

indeed

storm merlin Jun 3, 2025, 7:47 PM

#

how can i turn someones normal voice into like an ai singing voice

viral mason Jun 3, 2025, 7:55 PM

#

storm merlin how can i turn someones normal voice into like an ai singing voice

I can show u how

storm merlin Jun 3, 2025, 7:56 PM

#

viral mason I can show u how

okkk thank youuuu

viral mason Jun 3, 2025, 8:02 PM

#

storm merlin okkk thank youuuu

check dms

daring heath Jun 3, 2025, 9:16 PM

#

whats the target lufs for rvc dataset

#

-18?

valid vine Jun 3, 2025, 9:19 PM

#

valid vine my god finally

how long is lfs supposed to be finished for? it's been like this for over 30 minutes now

#

omfg it doesn't even work???

simple ore Jun 3, 2025, 9:21 PM

#

(689 * 1024) / 82.. 3+ hours

#

tf do you expect lol

valid vine Jun 3, 2025, 9:22 PM

#

yeah no when I say "like this" I mean "100% (163/163)"

#

this has been running for abt 4 hours

#

that 689 isn't the total going to be downloaded, that's the amount already downloaded

#

and it's been going up every once in a while those 4 hours

#

and it's been stuck at 689 for like 30 minutes

#

well more like 40/45 now

valid vine Jun 3, 2025, 9:33 PM

#

simple ore tf do you expect lol

maybe for the tool you suggested to actually work

#

which apparently learning how bad huggingface is was too much to ask

simple ore Jun 3, 2025, 9:34 PM

#

lfs is how the models are stored there, there's no other way of downloading them, other than clicking off each file manually

valid vine Jun 3, 2025, 9:35 PM

#

so the way they're stored just doesn't work then

distant turtle Jun 3, 2025, 9:37 PM

#

-colab

patent trellisBOT Jun 3, 2025, 9:37 PM

#

distant turtle -colab

📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**

Google Colab

• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

simple ore Jun 3, 2025, 9:37 PM

#

valid vine so the way they're stored just doesn't work then

you are trying to download almost 700GB from a web site for free

#

be grateful it does not give you 5MB/s download speed

#

ohh.. 700GB download is taking more than 4 hours... the horror

valid vine Jun 3, 2025, 9:41 PM

#

dude

#

I'm not "downloading 700 gb for free" if over half of the 4.3gb dls are 1kb

#

I'm not complaining it's taking long

#

I'm complaining it doesn't work

simple ore Jun 3, 2025, 9:43 PM

#

chill the f down and wait until it is done

valid vine Jun 3, 2025, 9:45 PM

#

oh my bad I though percent was out of a hundred

#

so could you enlighten me on what percent means then?

simple ore Jun 3, 2025, 9:49 PM

#

open one of those 1kb files in notepad

#

then do git lfs pull

valid vine Jun 3, 2025, 9:51 PM

#

I'm doing stuff with my pool one minute

simple ore Jun 3, 2025, 9:51 PM

#

lfs downloads the resource pointers 1st, those are 1kb files

#

then it actually pulls the content

sand quartz Jun 3, 2025, 9:53 PM

#

Does the server have a channel for Lora's for using stable diffusion

simple ore Jun 3, 2025, 9:53 PM

#

valid vine I'm doing stuff with my pool one minute

you could've ran out of space or something

valid vine Jun 3, 2025, 9:54 PM

#

simple ore you could've ran out of space or something

I did last night, that's why it didn't finish till today, I had to delete the stuff, expand that partition, and restart

#

so the 1kb files have this in it

#

but none of them have changed

#

and it's been sitting at 100% for over an hour now

simple ore Jun 3, 2025, 9:57 PM

#

ctrl-c and git lfs pull again

#

that's the resource pointer

valid vine Jun 3, 2025, 9:59 PM

#

it's blank

#

not sure how long it's supposed to take before I notice smth

simple ore Jun 3, 2025, 10:00 PM

#

i mean the text in that 1kb file, it is supposed to be replaced by the actual 4GB content

valid vine Jun 3, 2025, 10:00 PM

#

yeah I think that's only happened with 4 so far

#

like the one numbered 004 not 4 different ones

simple ore Jun 3, 2025, 10:01 PM

#

it should be downloading them after the pull

valid vine Jun 3, 2025, 10:01 PM

#

well they should all be downloaded

#

there's a 641gb "objects" folder in .git

#

I assume it's pulling smth from there

simple ore Jun 3, 2025, 10:02 PM

#

you can check the status with git lfs ls-files I think

valid vine Jun 3, 2025, 10:03 PM

#

I assume * is done and - is in progress

valid vine Jun 4, 2025, 12:26 AM

#

it was all a fucking waste anyway

#

you need an Nvidia GPU and I have an AMD

#

it doesn't tell you ANYWHERE

#

this one section is the only way it'd be possible to find out

simple ore Jun 4, 2025, 1:08 AM

#

Attempts to run deepseek, finds out

valid vine Jun 4, 2025, 1:17 AM

#

now I'm trying to run the 7B parameter version of math (not prover) and using the exact thing it tells me to in the way it tells me to and it's giving an error based on their code

#

Traceback (most recent call last):
  File "D:\AIs\Deepseek-Prover\runDeepseek.py", line 6, in <module>
    model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16, device_map="auto")
            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "d:\AIs\Deepseek-Prover\venv\Lib\site-packages\transformers\models\auto\auto_factory.py", line 571, in from_pretrained
    return model_class.from_pretrained(
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "d:\AIs\Deepseek-Prover\venv\Lib\site-packages\transformers\modeling_utils.py", line 309, in _wrapper
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File "d:\AIs\Deepseek-Prover\venv\Lib\site-packages\transformers\modeling_utils.py", line 4508, in from_pretrained
    model = cls(config, *model_args, **model_kwargs)
            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "d:\AIs\Deepseek-Prover\venv\Lib\site-packages\transformers\models\llama\modeling_llama.py", line 618, in __init__    
    self.model = LlamaModel(config)
                 ^^^^^^^^^^^^^^^^^^
  File "d:\AIs\Deepseek-Prover\venv\Lib\site-packages\transformers\models\llama\modeling_llama.py", line 379, in __init__
    self.post_init()
  File "d:\AIs\Deepseek-Prover\venv\Lib\site-packages\transformers\modeling_utils.py", line 1969, in post_init
    if v not in ALL_PARALLEL_STYLES:
       ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
TypeError: argument of type 'NoneType' is not iterable```

#

this is getting rediculus

simple ore Jun 4, 2025, 1:19 AM

#

v has no value

valid vine Jun 4, 2025, 1:19 AM

#

yeah I don't know why

simple ore Jun 4, 2025, 1:19 AM

#

trace it back

valid vine Jun 4, 2025, 1:19 AM

#

that's 5 files deep

#

and half the functions just tell me there's no definition in vscode

#

simple ore Jun 4, 2025, 1:21 AM

#

usual thing when you IDE did not load definitions

#

if v not in ALL_PARALLEL_STYLES: this is a part of a pretrained model load

valid vine Jun 4, 2025, 1:23 AM

#

okay so it's hardcoded to be none

simple ore Jun 4, 2025, 1:23 AM

#

that goes thru attention implementation object

#

no it is not

valid vine Jun 4, 2025, 1:23 AM

#

"if self._tp_plan is not None and is_torch_greater_or_equal("2.3"):
for _, v in self._tp_plan.items():
if v not in ALL_PARALLEL_STYLES:"

#

" _tp_plan = None"

#

and everything I've done is from the example given

simple ore Jun 4, 2025, 1:23 AM

#

no

valid vine Jun 4, 2025, 1:24 AM

#

my bad I thought "here give some examples" was an example

simple ore Jun 4, 2025, 1:24 AM

#

config = self._autoset_attn_implementation(config, torch_dtype=dtype, check_device_map=False)

valid vine Jun 4, 2025, 1:24 AM

#

and that's what I did

simple ore Jun 4, 2025, 1:24 AM

#

self._tp_plan = self.config.base_model_tp_plan.copy() if self.config.base_model_tp_plan is not None else {}

#

if it is not empty, the loop happens

#

and it throws and exception if it is unsupported style

#

for you it throws one because the value is None

valid vine Jun 4, 2025, 1:26 AM

#

so I have to go on my own and figure out what a tp plan is and set it for the model to work and they don't say that anywhere

simple ore Jun 4, 2025, 1:27 AM

#

it should work without much fiddling with the code

#

what's your GPU?

valid vine Jun 4, 2025, 1:27 AM

#

AMD rx 6750

simple ore Jun 4, 2025, 1:27 AM

#

using zluda?

valid vine Jun 4, 2025, 1:27 AM

#

if this was an issue about not having a GPU I'd understand bc it doesn't support any type of cuda or whatever the AMD equivilant was called

valid vine Jun 4, 2025, 1:27 AM

#

simple ore using zluda?

not sure what that is

simple ore Jun 4, 2025, 1:28 AM

#

it is the magic that lets you run CUDA stuff on AMD GPUs

valid vine Jun 4, 2025, 1:28 AM

#

does it still work if my GPU doesn't support ROCm?

simple ore Jun 4, 2025, 1:28 AM

#

https://github.com/resemble-ai/chatterbox/issues/52#issuecomment-2925328247

#

lil guide

#

this is for windows

valid vine Jun 4, 2025, 1:30 AM

#

yeah my actually strong PC is on windows

simple ore Jun 4, 2025, 1:30 AM

#

for 6750 read te instructions on https://github.com/likelovewant/ROCmLibs-for-gfx1103-AMD780M-APU page

valid vine Jun 4, 2025, 1:31 AM

#

wait but isn't this all for nothing if it's still giving an error I can't fix?

#

based on what the models are supposed to actually do this probably won't help with the thing I wanted anyway so this all is just kind of a waste of time

opal cobalt Jun 4, 2025, 1:35 AM

#

@simple ore u seem knowledgeable mind if i shoot u a random question new to this discord but wanted ur opinion on something

hallow thistle Jun 4, 2025, 3:26 AM

#

opal cobalt <@155030383648440320> u seem knowledgeable mind if i shoot u a random question n...

!howtoask

patent trellisBOT Jun 4, 2025, 3:26 AM

#

hallow thistle !howtoask

How To Troubleshoot

__**GIVE CONTEXT.**__ 📝

Don't simply mention your issue, like "my rvc is not working".
Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
The more context, the better.

__**BE POLITE.**__ <:matsuripray:1159685390156967936>

Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
It's okay if you're frustrated, but don't take it into this server.
Don't DM without prior consent.

__**BE PRODUCTIVE.**__ 🤝

Don't ask for every little instruction. Put your own effort & test things by yourself.
Don't ask to ask.
Check if your answer is a Google search away/on our guides website.

inland hill Jun 4, 2025, 3:28 AM

#

My friend cannot change their input settings. Every time they do they get this error message. They just got the software ( fresh install ), version 1.5.3.16a

hallow thistle Jun 4, 2025, 3:29 AM

#

inland hill My friend cannot change their input settings. Every time they do they get this e...

What is your friend's PC GPU?

#

There's a better W-Okada than this version.

inland hill Jun 4, 2025, 3:31 AM

#

hallow thistle What is your friend's PC GPU?

NVIDIA GeForce RTX 3050 Ti Laptop(4GB)

inland hill Jun 4, 2025, 3:32 AM

#

hallow thistle There's a better W-Okada than this version.

i did not know, i just got the same one they had before/version that i use

#

hoping to avoid compatibility issues whoopsies

hallow thistle Jun 4, 2025, 3:32 AM

#

inland hill i did not know, i just got the same one they had before/version that i use

Use this W-Okada instead. https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#download-nvidia-on-windows

Deiteris' W Okada Fork

Last update: May 5, 2025

inland hill Jun 4, 2025, 3:33 AM

#

should we just delete the entire folder the old one was in?

hallow thistle Jun 4, 2025, 3:33 AM

#

anime_nom

#

Of course, sure. Delete the old one.

inland hill Jun 4, 2025, 3:34 AM

#

i just want to make triple sure because i tend to mess up

#

#

we're downloading htis one right?

hallow thistle Jun 4, 2025, 3:38 AM

#

inland hill

https://cdn.discordapp.com/attachments/1159290139609137264/1371371778181431328/image.png?ex=68408ebe&is=683f3d3e&hm=e9a32f36877c8d37dd6ff4a2fd3594a2da1c0b05b0f651da95cbb5a5d7e9a125&

inland hill Jun 4, 2025, 3:40 AM

#

thank you!

inland hill Jun 4, 2025, 4:08 AM

#

It's working wonderfully, thanks a lot!

viral ruin Jun 4, 2025, 6:20 AM

#

anyone got a working colab link for RVC2 (no applio) ?

gleaming wasp Jun 4, 2025, 8:22 AM

#

What's the best option for using the voice changer with AMD? Is this it or is there a better one? https://www.kaggle.com/code/suneku/voice-changer-public

dry jewel Jun 4, 2025, 8:34 AM

#

sorry to come back so late but i checked and the link doesn't work

#

error 404

dry jewel Jun 4, 2025, 9:31 AM

#

i'm trying to do this with 'ngrok'

pastel oak Jun 4, 2025, 9:37 AM

#

Chunk is mainly for latency but if its too low for your gpu to handle it will lose some quality too

pastel oak Jun 4, 2025, 9:38 AM

#

dry jewel sorry to come back so late but i checked and the link doesn't work

Colabs not working atm, follow Kaggle

dry jewel Jun 4, 2025, 9:42 AM

#

pastel oak Colabs not working atm, follow Kaggle

how tho

#

https://www.kaggle.com/code/suneku/voice-changer-public

#

this guide there doesn't even tell me what site to go to

pastel oak Jun 4, 2025, 9:42 AM

#

There is a guide on the link nick sent right next to the link you opened

pastel oak Jun 4, 2025, 9:42 AM

#

gleaming wasp What's the best option for using the voice changer with AMD? Is this it or is th...

Full name of your amd gpu?

dry jewel Jun 4, 2025, 9:43 AM

#

i have to use phone numbah?

pastel oak Jun 4, 2025, 9:43 AM

#

dry jewel i have to use phone numbah?

Yes

dry jewel Jun 4, 2025, 9:43 AM

#

bruh

knotty moth Jun 4, 2025, 9:48 AM

#

gleaming wasp What's the best option for using the voice changer with AMD? Is this it or is th...

check your gpu name in task manager or any other applications like GPU-Z

#

RX 5000 series/newer are recommended

#

if it's only "AMD radeon graphics", more likely it's integrated gpu which is less capable

dry jewel Jun 4, 2025, 11:25 AM

#

guys

#

what's this setting's purpose

#

odd shale Jun 4, 2025, 12:17 PM

#

dry jewel

F0 Det is the pitch algorithm you're already using.

dry jewel Jun 4, 2025, 12:17 PM

#

odd shale F0 Det is the pitch algorithm you're already using.

yes but there are options to choose from

#

odd shale Jun 4, 2025, 12:18 PM

#

dry jewel yes but there are options to choose from

Just keep rmvpe selected.

#

Don't use any of these crepe options.

dry jewel Jun 4, 2025, 12:18 PM

#

that's the one used for these models

#

but i realized that "_onnx", which was default set, sounds more clear

odd shale Jun 4, 2025, 12:20 PM

#

dry jewel but i realized that "_onnx", which was default set, sounds more clear

Keep that one selected then if it works fine for you.

pastel oak Jun 4, 2025, 12:31 PM

#

No

#

Higher chunk higher delay but also more time to compute the voice, but at some point increasing doesnt improve voice

#

Extra 2.7s , advanced settings: increasing crossfade length helps with clearer voice, turning on fp32 for nvidia gpus too

cosmic epoch Jun 4, 2025, 1:47 PM

#

can someone give me a link to a colab where i can train models?

viral ruin Jun 4, 2025, 2:16 PM

#

got this error on mac M1 when i wanna convert. Any ideas how to fix? Tried google already but didn't fix it: AttributeError: 'NoneType' object has no attribute 'tobytes'

#

the audio-path is definitely not wrong

simple ore Jun 4, 2025, 2:35 PM

#

viral ruin got this error on mac M1 when i wanna convert. Any ideas how to fix? Tried googl...

you need to provide a bigger error log

viral ruin Jun 4, 2025, 2:35 PM

#

Traceback (most recent call last):
File "/Users/jlapping/.pyenv/versions/3.10.11/lib/python3.10/site-packages/gradio/routes.py", line 488, in run_predict
output = await app.get_blocks().process_api(
File "/Users/jlapping/.pyenv/versions/3.10.11/lib/python3.10/site-packages/gradio/blocks.py", line 1434, in process_api
data = self.postprocess_data(fn_index, result["prediction"], state)
File "/Users/jlapping/.pyenv/versions/3.10.11/lib/python3.10/site-packages/gradio/blocks.py", line 1335, in postprocess_data
prediction_value = block.postprocess(prediction_value)
File "/Users/jlapping/.pyenv/versions/3.10.11/lib/python3.10/site-packages/gradio/components/audio.py", line 349, in postprocess
file_path = self.audio_to_temp_file(
File "/Users/jlapping/.pyenv/versions/3.10.11/lib/python3.10/site-packages/gradio/components/base.py", line 325, in audio_to_temp_file
temp_dir = Path(self.DEFAULT_TEMP_DIR) / self.hash_bytes(data.tobytes())
AttributeError: 'NoneType' object has no attribute 'tobytes'
2025-06-04 15:10:38 | INFO | httpx | HTTP Request: POST http://localhost:7865/api/predict "HTTP/1.1 500 Internal Server Error"
2025-06-04 15:10:38 | INFO | httpx | HTTP Request: POST http://localhost:7865/reset "HTTP/1.1 200 OK"

simple ore Jun 4, 2025, 2:38 PM

#

viral ruin Traceback (most recent call last): File "/Users/jlapping/.pyenv/versions/3.10....

you can try saving the file you want to convert into assets/audios and then just click refresh on UI and pick the file from the drop-down list

tough fiber Jun 4, 2025, 2:39 PM

#

guys anyone can tell me which training and makings rvc models, last time i used RVC1006Nvidia

simple ore Jun 4, 2025, 2:42 PM

#

tough fiber guys anyone can tell me which training and makings rvc models, last time i used...

RVC1006Nvidia still works, otherwise you have Applio or more advanced forks

tough fiber Jun 4, 2025, 2:43 PM

#

simple ore RVC1006Nvidia still works, otherwise you have Applio or more advanced forks

if u say still rvc1006nvidia is good ill keep that

#

thanks for info btw <3

simple ore Jun 4, 2025, 2:46 PM

#

tough fiber thanks for info btw <3

if you're happy with it and know how to use it properly, why not

tough fiber Jun 4, 2025, 2:48 PM

#

simple ore if you're happy with it and know how to use it properly, why not

i did really good models with this thanks. oh i also my i ask. can we make good models with laughing or screaming, i mean i did really nice realistic models for realtime voice changer but. they cannot laugh or scream etc. that voices is not word thats why maybe.

#

i wondering if we have good dataset for that voices is it possible to make good models?

simple ore Jun 4, 2025, 2:51 PM

#

laugher generally fails, screaming requires a dataset with a large dynamic range, generally rvc inference is pretty flat

pastel oak Jun 4, 2025, 2:53 PM

#

Optional

latent kettle Jun 4, 2025, 4:17 PM

#

In w-okada ?

#

In and out sliders

low shard Jun 4, 2025, 7:04 PM

#

dry jewel sorry to come back so late but i checked and the link doesn't work

the new link is https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/

Deiteris' W Okada Fork

Last update: May 5, 2025

covert portal Jun 4, 2025, 7:31 PM

#

What do you need to download to train locally?

low shard Jun 4, 2025, 7:38 PM

#

covert portal What do you need to download to train locally?

what's your pc gpu? what do you want to do?

covert portal Jun 4, 2025, 7:39 PM

#

nvidia

#

train rvc voice from dataset

low shard Jun 4, 2025, 7:40 PM

#

covert portal nvidia

which?

#

nvidia made a lot of gpus

covert portal Jun 4, 2025, 7:40 PM

#

2070 super

low shard Jun 4, 2025, 7:40 PM

#

covert portal 2070 super

good

#

As you got a good PC, you can use RVC locally, you can choose between:

Applio: A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
Mainline: The original RVC

#

I gave you an explaination to the differences and links to the docs

covert portal Jun 4, 2025, 7:45 PM

#

ok thanks you!

low shard Jun 4, 2025, 8:05 PM

#

covert portal ok thanks you!

yw and lmk

cosmic epoch Jun 4, 2025, 8:12 PM

#

can someone give me a link to a colab where i can train models (one that isn't applio)?

viral ruin Jun 4, 2025, 9:45 PM

#

Let me know when you found something.. I need the same

crude flame Jun 4, 2025, 9:45 PM

#

gl on finding a working on

warm swift Jun 4, 2025, 9:53 PM

#

hi

#

someone can help me funding fuerza regida RVC?

viral mason Jun 4, 2025, 9:54 PM

#

ew money

warm swift Jun 4, 2025, 9:54 PM

#

?

viral mason Jun 4, 2025, 9:54 PM

#

don't pay for ai

warm swift Jun 4, 2025, 9:55 PM

#

i need it for a proyect

viral mason Jun 4, 2025, 9:56 PM

#

https://tenor.com/view/monster-versus-vs-alien-strange-mob-gif-12413747667150461911

Tenor

simple ore Jun 4, 2025, 10:08 PM

#

finding or funding, that's two very different requests

warm swift Jun 4, 2025, 10:08 PM

#

o sorry

#

can someone help me?

simple ore Jun 4, 2025, 10:11 PM

#

a or b?

warm swift Jun 4, 2025, 10:11 PM

#

A

simple ore Jun 4, 2025, 10:11 PM

#

warm swift can someone help me?

warm swift Jun 4, 2025, 10:13 PM

#

thank u

viral mason Jun 4, 2025, 10:14 PM

#

spelling mistakes go hard

junior gull Jun 4, 2025, 10:49 PM

#

live mods dont review the model submissions do they?
just wondering...i got a very silly and confusing reply for mine.
"sounds distored" "retrain and getr rid of distortion"
.... submited with the details and description and listed online with the same.... DISTORTION is signature to this char's voice pattern, since she always talks that way, and my goal was to faithfully capture that in great detail, and I did. I clearly communicated that on her hugginface listing too, and on my model submission details. Its not any different than the dozens of robotic voices listed here already. Same idea.
But then I got that wierd reply like a real person hadn't even bothered to read anything.

crude flame Jun 4, 2025, 11:02 PM

#

junior gull live mods dont review the model submissions do they? just wondering...i got a ve...

your model was rejected because the voice you submitted had effects and that makes it harder for us to QC it

#

thats why its a rule that you cant submit a robotic voice or a voice with effects

junior gull Jun 4, 2025, 11:10 PM

#

crude flame your model was rejected because the voice you submitted had effects and that mak...

but yet there are like over a dozen modles i can immediately see that have such effects and MUCH heavier too all in the model section?
So that doesnt' really make sense? And hers is WAY lighter than that....its a knowon characteristic of this character. It's not added. Its literally in EVERY refrence file to her voice becuase....its her voice. There is no version of her iwthout it....and I wouldn't want one if there was. Against thtat doesnt' make any sense???

crude flame Jun 4, 2025, 11:11 PM

#

junior gull but yet there are like over a dozen modles i can immediately see that have such...

they can post those models because they passed the model maker test

simple ore Jun 4, 2025, 11:11 PM

#

Show your skills first with a normal model, then do whatever

crude flame Jun 4, 2025, 11:11 PM

#

^

junior gull Jun 4, 2025, 11:12 PM

#

why wasn't that said in the reply then? ~_~ instead of telling me to build my model withotu her main defining feature she is known for lol. could have saved ALOT of confusion.
Alirght then.

crude flame Jun 4, 2025, 11:12 PM

#

junior gull why wasn't that said in the reply then? ~_~ instead of telling me to build my mo...

"Submit a model trained on a normal voice"

#

it was

junior gull Jun 4, 2025, 11:14 PM

#

maybe a languate barrier? that wasn't how it was phrased...it was still complaining about her voice. It never said I had to do a normal BEOFRE hers could be considered. That's a key differnce, and changes the meaning completely. But I understand now thanks to @simple ore clarification.

#

Thank you

crude flame Jun 4, 2025, 11:15 PM

#

😐

junior gull Jun 4, 2025, 11:20 PM

#

might be a bit though...the only other one i was already working on was another male voice with a similar effect....based on Zachary Quinto's Invincible character 🙃 Found these both really appealing as voices to use for AI related projects.

#

https://tenor.com/view/sylar-zachary-quinto-heroes-smile-no-gif-16862933

sullen lion Jun 4, 2025, 11:31 PM

#

i'm trying to learn illustrious character lora training

#

the tut i used for pony claims that the settings should be fine for illustrious (and were in fact written with it in mind) but i find my resulting models are always less style influenced

#

any images i gen with them come out looking very booru

#

https://civitai.com/articles/9005/a-detailed-beginners-guide-to-lora-training-on-civitais-trainer
ive been using this tut which is supposed to be for the civit trainer but the settings can be pretty much replicated on kohya

#

whats the best place for me to iterate on my settings to try and reenforce style? ive seen plenty of cartoon loras that actually maintain their source style on models like wai-nsfw and im trying to achieve that

#

~~also worth noting im p sure i trained at 512~~ nvm i checked i AM on 1024 res, will also say my datasets are unfortunately limited in size, usually around 15-25 total images

warm swift Jun 5, 2025, 12:20 AM

#

mi spelling mistakes arre hard?

knotty moth Jun 5, 2025, 1:29 AM

#

sullen lion the tut i used for pony claims that the settings should be fine for illustrious ...

flux is another one you can try though it's rather demanding to do locally

simple ore Jun 5, 2025, 1:54 AM

#

knotty moth flux is another one you can try though it's rather demanding to do locally

you can run quantized flux really well

#

great quality for regular artsy-fartsy and realistic pics

silent stratus Jun 5, 2025, 2:10 AM

#

does this look bad to yall

#

4 batch size with a 4 min dataset and that was at like 120 epochs

silent stratus Jun 5, 2025, 5:14 AM

#

nvm i was just paranoid its fine

royal grove Jun 5, 2025, 5:19 AM

#

Any good recent okada tutorials?

#

these good?

silent stratus Jun 5, 2025, 5:29 AM

#

royal grove Any good recent okada tutorials?

https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/

Deiteris' W Okada Fork

Last update: May 5, 2025

idle bramble Jun 5, 2025, 5:31 AM

#

what is the diff between sio and rest protocols for the voice changer? i know what rest is, but does this provide any latency help over sio? if it does, then why isnt it the default? (asking bc some guides say to swap sio to rest)

royal grove Jun 5, 2025, 6:06 AM

#

anyone know why my shit is just blank white

sand bison Jun 5, 2025, 6:09 AM

#

Does anyone have the new Google Collab for creating AI models (the RVC v2 disconnected)?

knotty moth Jun 5, 2025, 6:38 AM

#

idle bramble what is the diff between sio and rest protocols for the voice changer? i know wh...

Protocol: rest (Use SIO if you want less delay but if you encounter any issues with SIO switch back to rest. Rest has slightly more delay than SIO)

royal grove Jun 5, 2025, 7:22 AM

#

Any support on how to make them sound less ai>

deep tulip Jun 5, 2025, 8:42 AM

#

Hey! Where can I find people to train a voice model? I have a dataset, I would be grateful for help

rancid fiber Jun 5, 2025, 8:43 AM

#

sand bison Does anyone have the new Google Collab for creating AI models (the RVC v2 discon...

I was hoping someone could point me in the direction too, I have tried a few "clone" models of it but they always fail to work

rancid fiber Jun 5, 2025, 9:15 AM

#

what would you suggest if you don't have a GPU but still want to train now that Disconnected is gone?

low shard Jun 5, 2025, 9:48 AM

#

rancid fiber what would you suggest if you don't have a GPU but still want to train now that ...

Train (make) RVC Models on cloud:

Prepare the Dataset
Setup RVC:
Choose a cloud way to use RVC,

Google Colabs (max 4 hours of daily T4 16gb gpu not granted for free, not much hours for training, but easy to use, there's a paid tier):
- Applio (UI)
- Mainline (U, broken right now)
Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus, either T4x2 16gb each or P100 16gb, only free):
- Applio (UI)
- Mainline (UI, broken right now)
Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly, Free Studios run 24/7 but require restart every 4 hours. There's a paid tier):
- Applio (UI)
- Mainline (UI, No guide as of right now)

Be sure to know about the tensorboard

Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time
If you are looking for the easiest way and for free, is using https://weights.com/ which ofc uses RVC

RVC Inference (use models) on pre-recorded audio on Cloud

You can use either:

Weights.com: Easiest Possible Ever Automatic
Ilaria RVC Zero: Fastest free on cloud
Applio UI Colab: RVC Fork with some extra features like TTS
RVC AI Cover Maker UI: Automatically Separates the Vocals and Instrumentals, converts the voice and mixes them back

low shard Jun 5, 2025, 9:48 AM

#

deep tulip Hey! Where can I find people to train a voice model? I have a dataset, I would b...

You can't request for models anymore, tho you can find docs on how to make them yourself

low shard Jun 5, 2025, 9:48 AM

#

royal grove Any support on how to make them sound less ai>

elaborate

low shard Jun 5, 2025, 9:49 AM

#

sand bison Does anyone have the new Google Collab for creating AI models (the RVC v2 discon...

what's your pc gpu first

low shard Jun 5, 2025, 9:49 AM

#

royal grove anyone know why my shit is just blank white

!howtoask

patent trellisBOT Jun 5, 2025, 9:49 AM

#

low shard !howtoask

How To Troubleshoot

__**GIVE CONTEXT.**__ 📝

Don't simply mention your issue, like "my rvc is not working".
Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
The more context, the better.

__**BE POLITE.**__ <:matsuripray:1159685390156967936>

Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
It's okay if you're frustrated, but don't take it into this server.
Don't DM without prior consent.

__**BE PRODUCTIVE.**__ 🤝

Don't ask for every little instruction. Put your own effort & test things by yourself.
Don't ask to ask.
Check if your answer is a Google search away/on our guides website.

royal grove Jun 5, 2025, 9:49 AM

#

low shard elaborate

How do I make female voices sound more like human and not ai like the monotone background of the ai

low shard Jun 5, 2025, 9:51 AM

#

royal grove How do I make female voices sound more like human and not ai like the monotone b...

elaborate:

your pc gpu
what tutorial link are you using
a screenshot of the program you're using with the settings
if you want to use them in realtime or pre-recorded audios

royal grove Jun 5, 2025, 9:51 AM

#

RTX 3080

#

cant send pics

#

or files

low shard Jun 5, 2025, 9:52 AM

#

royal grove cant send pics

!give-media-perms 1h @royal grove

low shard Jun 5, 2025, 9:52 AM

#

royal grove RTX 3080

nice, you want to use them in realtime of pre-recorded audios?

#

also what tutorial link are you following?

royal grove Jun 5, 2025, 9:52 AM

#

Real time

#

#

I didn follow any tutorial the only one i used was to actually download wokada

low shard Jun 5, 2025, 9:53 AM

#

royal grove

lemme guess, you found the link via a youtube tutorial?

royal grove Jun 5, 2025, 9:54 AM

#

yeah

#

https://www.youtube.com/watch?v=CbrZj8ZYyr4

YouTube

HitPaw

How to Change your Voice for FREE | W-okada Voice Changer VS HitPaw...

🌟Best FREE Voice Changer :👉 https://bit.ly/3U3MVYi
👉W-okada Voice Changer download link: https://huggingface.co/wok000/vcclient000/tree/main
👉VB-Audio download link: https://vb-audio.com/Cable/
👉Get RVC Voice Models Here : https://voice-models.com/
🚀 Up To 60% OFF on Father's Day :https://bit.ly/43XM7cJ

#freevoicechanger #real...

▶ Play video

#

this one

low shard Jun 5, 2025, 9:54 AM

#

video tutorials get outdated easily, and in fact this is an old version of original wokada lmao

#

thats old asf

royal grove Jun 5, 2025, 9:54 AM

#

oh really

#

damn

low shard Jun 5, 2025, 9:54 AM

#

royal grove oh really

yup u just wasted time

royal grove Jun 5, 2025, 9:54 AM

#

shit

low shard Jun 5, 2025, 9:54 AM

#

plus vb audio cable gives issues and can randomly stop working as users told us on windows

#

forget everything you get by video tutorials

royal grove Jun 5, 2025, 9:54 AM

#

alright

#

what do i do

low shard Jun 5, 2025, 9:55 AM

#

royal grove what do i do

read https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/

Deiteris' W Okada Fork

Last update: May 5, 2025

#

wokada deiteris fork is way better

royal grove Jun 5, 2025, 9:55 AM

#

should i delete the vb audio cable

low shard Jun 5, 2025, 9:55 AM

#

royal grove should i delete the vb audio cable

you should delete everything you just installed like if you never saw that video

royal grove Jun 5, 2025, 9:56 AM

#

how big of a difference is it

#

like massively better

low shard Jun 5, 2025, 9:56 AM

#

royal grove how big of a difference is it

you're using an almost year old version lmfao

#

its like trying to run windows xp in 2025, shittiest performance you could get and you're missing out on options to get better quality

royal grove Jun 5, 2025, 9:57 AM

#

thanks bro

#

ill download this one

#

can i keep the voices when i delete everything?

low shard Jun 5, 2025, 9:57 AM

#

royal grove

that's just a link to the github repo, you need to read the full guide

#

if you dont read it you will just fuck things up lol, dont just click the first thing you see

fair thistle Jun 5, 2025, 10:09 AM

#

is it supposed to open up in your browser

royal grove Jun 5, 2025, 10:12 AM

#

#

which one do i download?

#

they are all AMD

#

@low shard

#

#

is this not required to download

#

because after downloading the nvidia it opened a version of the voice changer on my browser

#

do i use that one?

#

nvm i figured it out

#

thanks

low shard Jun 5, 2025, 10:50 AM

#

fair thistle is it supposed to open up in your browser

which tutorial link are u following

low shard Jun 5, 2025, 10:51 AM

#

royal grove

you shouldn't look at the github

#

@royal grove dont go in the github, read the guide i sent you

lethal grail Jun 5, 2025, 12:51 PM

#

where do i put my downloaded models

rancid fiber Jun 5, 2025, 1:13 PM

#

low shard # Train (make) RVC Models on cloud: 1. [Prepare the Dataset](<https://docs.ai-hu...

Thank you for this, I have tried the Applio (UI) via Colabs but the tutorial is referring to an older version so alof of it isnt the same and then it just fails to generate an index or model. RVC V2 Disconnected was so much easier, and it worked for me. SHame it was closed

simple ore Jun 5, 2025, 1:20 PM

#

rancid fiber Thank you for this, I have tried the Applio (UI) via Colabs but the tutorial is ...

if you get a message about the index failing to generate, you probably messed up the prep part, did not slice dataset or pointed to a wrong folder. The preprocess log should output xx minutes processed and extract feature log should say xxx/xxx segments processed. Not 0/0.

opal cobalt Jun 5, 2025, 1:23 PM

#

Just wanted to know anyones thoughts on Renting a Cloud GPU for a few hours specifically for building a model that im unable to build locally but will be able to run locally once built. (havent seen this topic get mentioned what i see is those rent for environment due to specs)
Use Case Qwen2.5 32B -Instruct fp8 Quant GOAL
Extra Context my Specs:
CPU: AMD Ryzen 7 7800X3D (8 cores / 16 threads)
GPU: NVIDIA GeForce RTX 4090
Motherboard: MSI MAG B650 TOMAHAWK WIFI
RAM: 64GB G.Skill Flare X5 Series DDR5

simple ore Jun 5, 2025, 1:54 PM

#

oops, sorry

#

Q8_0 wont do

#

opal cobalt Jun 5, 2025, 1:55 PM

#

thanks im trying to do through tensorRT LLM and go through the building process through trtllm-build but it requires x3 x4 so my thought process is this.
Quantization Optimization Strategy
Testing Protocol (Shoot for the Best, Work Downwards):
Priority 1: FP8 Validation
✅ RTX 4090 Support: Test FP8 tensor operations compatibility
✅ Performance Benchmark: Measure speed vs memory vs quality
✅ Stability Test: Ensure consistent outputs and no crashes
Priority 2: Enhanced AWQ Evaluation
✅ Calibration Quality: Test 1024 vs 512 calibration samples
✅ Block Size Impact: Compare 64 vs 128 block sizes
✅ Mixed Precision: FP8 KV cache + AWQ weights performance
Priority 3: Baseline Confirmation
✅ Standard AWQ: Ensure proven configuration works as expected
✅ Fallback Readiness: Validate backup option performs acceptably

#

goal use cloud gpu specs like A100 80GB to build then it will fit within my 24GB VRAM if that makes sense

simple ore Jun 5, 2025, 1:57 PM

#

Q5_0 will fit

#

8192 context is a bit small, but it may be fine with it offloaded into RAM

#

Q5_0 model size is 21GB

opal cobalt Jun 5, 2025, 2:00 PM

#

my apologies for lack of context i aim for 32k context length with this i used Q4_K_M previously but i need improvements because that uses llama.cpp which is like 75%-80% compared to tensorRT
Decision Matrix:
Metric
FP8
Enhanced AWQ
Standard AWQ
RTX 4090 Support
Test Required
✅ Proven
✅ Proven
Expected Speed
🥇 Best
🥈 Better
🥉 Good
Memory Efficiency
🥇 Best
🥈 Better
🥉 Good
Quality
🥇 Best
🥈 Better
🥉 Good
Risk Level
🔶 Medium
🟢 Low
🟢 Minimal

Selection Criteria:
RTX 4090 compatibility (must work flawlessly)
Performance improvement over current Q4_K_M (minimum 6x speedup)
Memory efficiency (must fit in 24GB with overhead)
Output quality (must maintain coherent responses)
Stability (no crashes or artifacts during extended use)

#

Executive Summary
Goal: Build an ultra-optimized Qwen2.5-32B-Instruct model using W4A8_AWQ (but aiming for FP8) quantization via cloud GPU, then deploy locally on RTX 4090 for maximum performance with 32K context support.
Problem: RTX 4090 (24GB) cannot compile TensorRT engines for 32B models due to memory constraints during build process, despite having sufficient memory for runtime.
Solution: Use cloud GPU (A100 80GB) for one-time engine compilation, then deploy locally.

#

Primary Goal (Best-Case Scenario):
Build Qwen2.5-32B with FP8 quantization achieving:
✅ Speed: 70-110 tokens/sec on RTX 4090 (10-20% faster than W4A8_AWQ)
✅ Memory: ~12-16GB runtime usage (more efficient than AWQ)
✅ Context: Full 32K token support
✅ Quality: 98%+ of FP16 performance (floating-point precision advantage)
Secondary Goal (High-Performance Fallback):
Enhanced W4A8_AWQ quantization achieving:
✅ Speed: 65-95 tokens/sec on RTX 4090
✅ Memory: ~14-17GB runtime usage
✅ Context: Full 32K token support
✅ Quality: 96%+ of FP16 performance
Tertiary Goal (Proven Baseline):
Standard W4A8_AWQ quantization achieving:
✅ Speed: 60-90 tokens/sec on RTX 4090
✅ Memory: ~15-18GB runtime usage
✅ Context: Full 32K token support
✅ Quality: 95%+ of FP16 performance
Sorry if is too much context just trying to share relevant details after ideal build is complete i plan to use with anythingllm and setup draft model,embedding model,vector db etc these options will just beat the slow Q4_K_M 32k context speed i was unsatisfied with

full moss Jun 5, 2025, 2:27 PM

#

how to do text to speech?

#

someone please help?

crystal girder Jun 5, 2025, 2:30 PM

#

guys why my app crash everytime i tried to use voice ai

latent kettle Jun 5, 2025, 2:32 PM

#

full moss how to do text to speech?

Tell me your PC specs

full moss Jun 5, 2025, 2:32 PM

#

latent kettle Tell me your PC specs

wdym

latent kettle Jun 5, 2025, 2:32 PM

#

Your cpu gpu and ram

full moss Jun 5, 2025, 2:33 PM

#

i7 4079 geforce 32

latent kettle Jun 5, 2025, 2:33 PM

#

full moss i7 4079 geforce 32

What ?

full moss Jun 5, 2025, 2:33 PM

#

17 cpu

#

4070 gpu

#

and 32 ram

latent kettle Jun 5, 2025, 2:33 PM

#

Ohh I see

#

Your are good to go

full moss Jun 5, 2025, 2:34 PM

#

yes but how do i use the text to speech

latent kettle Jun 5, 2025, 2:34 PM

#

You can use kokoro tts, f5 tts

full moss Jun 5, 2025, 2:34 PM

#

where

latent kettle Jun 5, 2025, 2:34 PM

#

On your system, install it

full moss Jun 5, 2025, 2:34 PM

#

the what

#

what do i install

latent kettle Jun 5, 2025, 2:35 PM

#

Lemme send you guide

#

https://docs.aihub.gg/tts/tts-tools/#tts-tools

TTS Tools

Last update: Dec 12, 2024

#

@full moss

full moss Jun 5, 2025, 2:39 PM

#

yes

#

but i cant create the voice

#

freemium but its paid to create a voice

latent kettle Jun 5, 2025, 2:40 PM

#

full moss freemium but its paid to create a voice

You mean you want to fine tune?

full moss Jun 5, 2025, 2:40 PM

#

no i wanna create an text to speech model

#

i got the singing mp3 already

latent kettle Jun 5, 2025, 2:41 PM

#

full moss no i wanna create an text to speech model

That's what I said, that's called fine tuning a model, if you want to create a pre trained from scratch, that's like impossible on our normal systems.

latent kettle Jun 5, 2025, 2:43 PM

#

full moss i got the singing mp3 already

I suggest you to use applio tts. There you can use your voice model. Or you can use any tts and then convert that audio into your desired character's voice

simple ore Jun 5, 2025, 2:47 PM

#

I would not suggest that, edge tts in applio is purely for demo purposes. There are better tts available. Edge is just a screen reader for websites after all.

scenic arch Jun 5, 2025, 2:49 PM

#

is there any benefit to use nvidia broadcasts echo and noise removal as opposed to using okada's builtin echo and sup1&2?

simple ore Jun 5, 2025, 2:49 PM

#

scenic arch is there any benefit to use nvidia broadcasts echo and noise removal as opposed ...

broadcast is much much better

scenic arch Jun 5, 2025, 2:50 PM

#

simple ore broadcast is much much better

alright, is it advisable to have them both on? or just broadcast with dietris fork?

simple ore Jun 5, 2025, 2:50 PM

#

mic -> broadcast app -> voice changer

#

both gonna use rtx cores on gpu, have not tried it personally.. should be fine on a newer gpu

#

@scenic arch https://x.com/mrprowestie/status/1252867224466939908

Westie (@MrProWestie)

Okay this is amazing. NVIDIA RTX Voice filter blocking out direct fan noise and a hammer banging on the desk... what is this wizardry?! 👀

@NVIDIAGeForceUK

#

version from 5 years ago, should be even better now

scenic arch Jun 5, 2025, 3:01 PM

#

whats the best tts for rvc also

low shard Jun 5, 2025, 3:29 PM

#

scenic arch whats the best tts for rvc also

There are different Text To Speech (TTS) AIs:

GPT So Vits: RVC isn't as good as GPT So Vits for tts, but gpt so vits (few shot tts, which means needs just a lil training for models) can't use rvc models (and viceversa), and its only limited to: english, chinese, Cantonese, japanese & korean, if you wanna check gpt so vits instead, read https://docs.ai-hub.wtf/tts/gpt-sovits/

Freemium 11labs: Easy way to do TTS is https://elevenlabs.io/, you can't use RVC model on this but its a mostly premium easy way for good quality TTS

FishSpeech: FishSpeech is a 0 shot (no explicit training needed) TTS, if you got a good pc you can use it locally else use their site

You can check TTS in our tts index

With RVC Models:

RVC is natively for Speech To Speech, but forks such as Applio have built in tts (using Microsoft Edge TTS to make a generated tts audio, which i suggest you to choose a tts model that is the same gender and language of the rvc model you wanna use, and then convert it with rvc)

If you wanna do tts locally with RVC Voice Models (if you got a good pc):

You can get Applio in our docs

If you don't got a good pc you can do tts with RVC Voice Models on cloud:

Ilaria RVC Zero (Running on A100 GPU, free fasted rvc on cloud) and the guide
Use Applio UI Colab (with google colab T4 free daily limit gpu)
You could try another tts from our tts index and use the output as an input in rvc

tranquil schooner Jun 5, 2025, 4:05 PM

#

Question about W-Okada voice changer. If I still have an older version, is it not supported anymore or work properly? Because I've been having trouble with my gaming laptop I just got a year ago after graduation that worked just fine until recently last month went I was minding my business on Minecraft and it closed my laptop and never turned back on so Geek Squad looked at it and said its a faulty battery issue they "fixed" but after it returned home to me three days ago and only got 5 minutes of playtime for updating, clearing storage space and some games and it turned back off on me after opening Google so my father had to return it and Geek Squad said they had a feeling he'd be back like they were expecting it and now they're saying it can't be fixed and I'm forced to buy a new laptop after my father used up the protection plan. Anybody know?

tranquil schooner Jun 5, 2025, 4:56 PM

#

I would like an answer before the laptop arrives in two days...

simple ore Jun 5, 2025, 4:58 PM

#

Ah, the Geek Scam

#

You either learn how to diagnose and fix your PC or you pay thru the nose for placebo fixes.

tranquil schooner Jun 5, 2025, 5:03 PM

#

So the voice changer is not responsible? And I knew Geek Squad was a scam from a different pc expert actually opening up the devices to look inside and fix things and replace the motherboard but my father would not listen

brisk plover Jun 5, 2025, 5:19 PM

#

guys?

#

can you help me out?

#

how can I use okada

#

on discord

tranquil schooner Jun 5, 2025, 5:45 PM

#

brisk plover how can I use okada

So idk if that's a good idea for now until I figure out if my problem with my laptop dying and not turning back on is caused by okada or not...But just saying in case it is

simple ore Jun 5, 2025, 5:47 PM

#

tranquil schooner So the voice changer is not responsible? And I knew Geek Squad was a scam from a...

very unlikely to damage anything permanently

viral mason Jun 5, 2025, 5:47 PM

#

brisk plover how can I use okada

I know how

brisk plover Jun 5, 2025, 5:47 PM

#

can u help me out?

viral mason Jun 5, 2025, 5:47 PM

#

Yeaz in dms

brisk plover Jun 5, 2025, 5:47 PM

#

i already have cable device

viral mason Jun 5, 2025, 5:47 PM

#

Just gimme a minute since I'm not home

tranquil schooner Jun 5, 2025, 5:50 PM

#

simple ore very unlikely to damage anything permanently

Okay but I did read this and I wasn't sure if he meant the file not working for some users or the whole computer itself

tranquil schooner Jun 5, 2025, 5:50 PM

#

low shard forget everything you get by video tutorials

#

Wrong one but yeah-

latent talon Jun 5, 2025, 5:56 PM

#

Okay weird things are happening and idk why.

Specs:

Cpu: 13th Gen Intel Core i7-13700K
Ram: 64 GB
Gpu : RTX 4070

So I've tried using the latest W-Okada, and the one from a year ago. The newest tends to break then not work at all, while the old one gets worse over time, basically cuts in and out and fails to make any sound at all.

I'm using it for streaming, and changing my voice in game, so I expect delay, No matter what it comes out to about 4 second delay, then gets choppier and choppier. Any ideas on what to do? I can try and gather more info later today like logs and so on.

Thanks in advance, I'm not the best when it comes to tech so even obvious fixes are welcome

pastel oak Jun 5, 2025, 6:32 PM

#

latent talon Okay weird things are happening and idk why. Specs: Cpu: 13th Gen Intel Core...

Send screenshot of your interface of the latest

odd valve Jun 5, 2025, 6:49 PM

#

whats the best rvc for an amd gpu

#

or is okada better?

low shard Jun 5, 2025, 6:59 PM

#

odd valve whats the best rvc for an amd gpu

what's ur pc gpu exactly? what do u want to do?

#

RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.

Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)

odd valve Jun 5, 2025, 7:00 PM

#

low shard what's ur pc gpu exactly? what do u want to do?

rx 6700 xt

#

want to try out realtime

#

so should i go with the fork?

low shard Jun 5, 2025, 7:01 PM

#

odd valve want to try out realtime

then you need wokada deiteris fork yes, read https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/

Deiteris' W Okada Fork

Last update: May 5, 2025

odd valve Jun 5, 2025, 7:01 PM

#

low shard then you need wokada deiteris fork yes, read https://docs.aihub.gg/rvc-voice-cha...

alright thanks alot

low shard Jun 5, 2025, 7:01 PM

#

odd valve alright thanks alot

yw and lmk!

plush stream Jun 5, 2025, 7:25 PM

#

guys please can i ask for some free ai websites except weights gg i need them

odd valve Jun 5, 2025, 7:45 PM

#

low shard yw and lmk!

what does this mean 😭

#

so for my gpu do i use the nividia link?

simple ore Jun 5, 2025, 7:46 PM

#

odd valve what does this mean 😭

it means for <=4000 serie use one version, for 5000 series use the specifically one internded just fro 5000.

odd valve Jun 5, 2025, 7:47 PM

#

simple ore it means for <=4000 serie use one version, for 5000 series use the specifically ...

so what would i use for 6000

simple ore Jun 5, 2025, 7:48 PM

#

6000 what

#

AMD?

#

or Nvidia RTX 6000?

odd valve Jun 5, 2025, 7:50 PM

#

simple ore AMD?

amd rx6700 xt

simple ore Jun 5, 2025, 7:52 PM

#

you download the AMD version of the voice changer from the corresponding link

true heron Jun 5, 2025, 8:00 PM

#

How do I get started?

latent talon Jun 5, 2025, 9:00 PM

#

pastel oak Send screenshot of your interface of the latest

Sorry had to deal with some VA stuff dming it now

#

Welp Idk what I have managed to do, but now the old one is borked

swift thunder Jun 6, 2025, 12:00 AM

#

32 bit float or 64? to train models

winter dew Jun 6, 2025, 12:54 AM

#

how much does mic quality matter? I feel like I can’t get w okada to sound that realistic

#

im using a hyperx solo cast so like about average

simple ore Jun 6, 2025, 12:55 AM

#

swift thunder 32 bit float or 64? to train models

simple ore Jun 6, 2025, 12:56 AM

#

winter dew how much does mic quality matter? I feel like I can’t get w okada to sound that ...

Does not need all that good of a mic, rvc model extracts pitch and phonemes.

winter dew Jun 6, 2025, 12:56 AM

#

simple ore Does not need all that good of a mic, rvc model extracts pitch and phonemes.

hmmm

#

it still just sounds a bit unnatural ngl I don’t get why

#

you’re right I read that on the fork site I did the crackle fixes as well but it just sounds weird

swift thunder Jun 6, 2025, 12:57 AM

#

simple ore

Oh, I was told Float was better.

timid bronze Jun 6, 2025, 12:58 AM

#

Can someone please point me the right direction on how to get consistent characters in text to images. I am already including as much detail as possible in the prompts. I have tried so many different AI tools already.

simple ore Jun 6, 2025, 12:58 AM

#

swift thunder Oh, I was told Float was better.

well, it should convert it to +/- 1.0 values during load

simple ore Jun 6, 2025, 12:59 AM

#

timid bronze Can someone please point me the right direction on how to get consistent charact...

make a lora, 2) use image prompt with an image of the character face, for example

swift thunder Jun 6, 2025, 12:59 AM

#

simple ore well, it should convert it to +/- 1.0 values during load

I see

#

At what values do I see the graph better, my friend?

timid bronze Jun 6, 2025, 1:00 AM

#

simple ore 1) make a lora, 2) use image prompt with an image of the character face, for exa...

Sorry but I am kinda new to AI. What is lora?

simple ore Jun 6, 2025, 1:03 AM

#

timid bronze Sorry but I am kinda new to AI. What is lora?

It is basically a set of specific guidelines how to draw something. You train it on a small set of images, I think 20 is enough, then it can draw the same character using a keyword consistently

simple ore Jun 6, 2025, 1:04 AM

#

swift thunder At what values do I see the graph better, my friend?

huh?

swift thunder Jun 6, 2025, 1:05 AM

#

simple ore huh?

the tensorboard, to see the lowest peaks, will 0.7 be fine?

timid bronze Jun 6, 2025, 1:05 AM

#

simple ore It is basically a set of specific guidelines how to draw something. You train it...

Oh nice I will do some research and try to learn how. Thank you!

simple ore Jun 6, 2025, 1:05 AM

#

if you're using avg_50 charts, they are smooth enough to use 0.5

#

with more epoch the grap itself smoothes out

#

since tensorboard does not really shows every logged value

swift thunder Jun 6, 2025, 1:06 AM

#

simple ore if you're using avg_50 charts, they are smooth enough to use 0.5

Okay, thanks, I used it at 0.7 or 0.9

simple ore Jun 6, 2025, 1:06 AM

#

for old loss graph 0.987 was necessary because they were so random

swift thunder Jun 6, 2025, 1:06 AM

#

if I use avg_50

#

cat_blush

lost ember Jun 6, 2025, 1:25 AM

#

do yall have any voice changer that i can use the models with?

long obsidian Jun 6, 2025, 5:23 AM

#

can someone help me when im speaking sometimes the voices make a robotic sound with which setting can i avoid that - using okada

pastel oak Jun 6, 2025, 7:01 AM

#

long obsidian can someone help me when im speaking sometimes the voices make a robotic sound w...

Whats your gpu and send screenshot of wokada interface

pastel oak Jun 6, 2025, 7:01 AM

#

lost ember do yall have any voice changer that i can use the models with?

Whats your gpu

long obsidian Jun 6, 2025, 7:01 AM

#

pastel oak Whats your gpu

nvidia geforce rtx 5060 ti(0) - i hear some clunky noices and robotic pitches and voice cracks

#

i cant fix it tried changing the chunk but it doesnt work

pastel oak Jun 6, 2025, 7:02 AM

#

long obsidian nvidia geforce rtx 5060 ti(0) - i hear some clunky noices and robotic pitches an...

https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/

Deiteris' W Okada Fork

Last update: May 5, 2025

#

Do you have the 2nd nvidia link specifically for rtx 5000

long obsidian Jun 6, 2025, 7:03 AM

#

let me check

pastel oak Jun 6, 2025, 7:03 AM

#

Its a separate version

long obsidian Jun 6, 2025, 7:03 AM

#

deiteris fork?

pastel oak Jun 6, 2025, 7:04 AM

#

Yes

long obsidian Jun 6, 2025, 7:04 AM

#

i dont have it

pastel oak Jun 6, 2025, 7:04 AM

#

Get it

#

Its better than original atm

#

And original cant run rtx 5000 gpus iirc

long obsidian Jun 6, 2025, 7:07 AM

#

so i need to install voice-changer-windows-amd64-cuda.zip.002
?

#

i read that i need to download both 001 and 002 and then unzip them

long obsidian Jun 6, 2025, 7:29 AM

#

RuntimeError: CUDA error: no kernel image is available for execution on the device
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1
Compile with TORCH_USE_CUDA_DSA to enable device-side assertions.

@pastel oak it said this

#

what am i supposed to do

#

it works when im using my cpu but doesnt let me use gpu

pastel oak Jun 6, 2025, 7:51 AM

#

long obsidian so i need to install voice-changer-windows-amd64-cuda.zip.002 ?

Just use the precompiled download from the guide

#

What youre downloading is not for rtx 5000

long obsidian Jun 6, 2025, 7:51 AM

#

ye i followed the steps

#

oh i see

#

i thought the gpu is better for voice changer

pastel oak Jun 6, 2025, 7:52 AM

#

long obsidian i thought the gpu is better for voice changer

Yes it is

#

I dont think you understand what im trying to say just download the 2nd nvidia link

#

https://github.com/IllIlIlIllIl/voice-changer/releases/tag/b2335

GitHub

Release b2335 · IllIlIlIllIl/voice-changer

+CUDA12.8 Pytorch updated
(Pytorch nightly version)
RTX 5080 test done
Windows / NVIDIA
For RTX 5000 users
thanks to deiteris (https://github.com/deiteris/voice-changer)

long obsidian Jun 6, 2025, 8:47 AM

#

@pastel oak with my gpu do u have idea for what chunk extra and f0 to run it on (last question sry for being annoying)

pastel oak Jun 6, 2025, 8:53 AM

#

No

pastel oak Jun 6, 2025, 8:53 AM

#

long obsidian <@965636299509882980> with my gpu do u have idea for what chunk extra and f0 to ...

Start with chunk 200 and extra 2.7

#

If the "perf" on the graph the one in green color is a low number like 30, then decrease chunk to around 100 if you want less delay

long obsidian Jun 6, 2025, 8:58 AM

#

pastel oak Start with chunk 200 and extra 2.7

thanks a lot - also should i use the formant (im using it on non supported langueage)

pastel oak Jun 6, 2025, 9:00 AM

#

long obsidian thanks a lot - also should i use the formant (im using it on non supported langu...

The one has nothign to do with the other, formant pitch shifting is like pitch but more "inbetween", generally not needed unless you want funny chipmunk voices

long obsidian Jun 6, 2025, 9:01 AM

#

oh okay thanks a lot

#

also how do u delete models bec when i go to edit i dont see it

knotty moth Jun 6, 2025, 9:06 AM

#

long obsidian also how do u delete models bec when i go to edit i dont see it

do it manually within the model_dir folder, then restart the voice changer

long obsidian Jun 6, 2025, 9:06 AM

#

thanksssss

hazy dune Jun 6, 2025, 10:40 AM

#

help, I installed the voice changer and virtual microphone correctly in the voice changer, I also installed the virtual cable on the microphone correctly in the discord, I start, I say, it doesn't work, but when I turn on the loud video, the voice changer perceives it as a voice, and changes the voice to the video, in general, instead of the microphone sound, the voice changer changes the sound of headphones

#

(im using amd version, im had amd graphics card)

hazy dune Jun 6, 2025, 10:59 AM

#

and now, it captures both the sound and the microphone, what should I do?

pastel oak Jun 6, 2025, 11:25 AM

#

hazy dune help, I installed the voice changer and virtual microphone correctly in the voic...

Screenhot of your interface

#

Cutting off is you getting too quiet at the end of sentences, keep in. Sens. Threshold further to the left if you moved it to the right

#

Do you mean crackles with distoetion

#

No the in. Sens. Under F0

#

And check crossfade length in advanced settings and bring it to 0.10 if its lower than that

knotty moth Jun 6, 2025, 11:45 AM

#

hazy dune help, I installed the voice changer and virtual microphone correctly in the voic...

check audio settings in voice changer and make sure the input is the real mic for regular cases

pastel oak Jun 6, 2025, 12:00 PM

#

Tooltip and guide has brief explanations but roughly explained it constructs the voice more clearly the higher it is but adds delay

obtuse pagoda Jun 6, 2025, 12:22 PM

#

Whenever i try to use the deiteris fork on kaggle, it manages to process and get the server ready, but when i click on the link it says "this site can't be reached" is this just a network problem or something?

hazy dune Jun 6, 2025, 12:52 PM

#

pastel oak Screenhot of your interface

one cecond

hazy dune Jun 6, 2025, 12:53 PM

#

pastel oak Screenhot of your interface

im cant send right here

#

there is no access

hazy dune Jun 6, 2025, 12:55 PM

#

knotty moth check audio settings in voice changer and make sure the input is the real mic fo...

I specified everything correctly, I can take screenshots, if I did something wrong, please write

knotty moth Jun 6, 2025, 1:15 PM

#

hazy dune I specified everything correctly, I can take screenshots, if I did something wro...

show the screenshot here

#

!give-media-perms 30m @hazy dune

snow sphinx Jun 6, 2025, 1:57 PM

#

Hey all - just joined here so apologies in advance for any repetitive questions.
I'm pretty new to AI, so is there any material / videos that anyone could recommend,
I'm specifically struggling with getting ChatGPT to recall and give me an accurate time.
Anyone had similar issues or any advice on how to resolve?
I'm also interested to understand if / how I can link up my various platforms to create an autonomous set of AI agents who need minimal human supervision or direction?
Thanks in advance

simple ore Jun 6, 2025, 2:19 PM

#

snow sphinx Hey all - just joined here so apologies in advance for any repetitive questions....

why would you expect a model that generates tokens from probabilies give you current time?

#

at best it can repeat something you said from its context

#

or there can be some patches added to the processing, like storing things you told the model to remember in a special context or running queries without using the llm such as "what time is it now?"

knotty moth Jun 6, 2025, 2:28 PM

#

snow sphinx Hey all - just joined here so apologies in advance for any repetitive questions....

Please do not cross post in multiple channels as it could be considered spamming

knotty moth Jun 6, 2025, 2:51 PM

#

if the voice changer is being used, the settings are greyed out

#

https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#crackle-fix

Deiteris' W Okada Fork

Last update: May 5, 2025

worn walrus Jun 6, 2025, 3:37 PM

#

hey everyone - i just had a question

#

so everyone probably knows how people are using temp student emails to gain free access to veo 3

#

but i had some concerns - what if google finds out? this may be a dumb question but do u guys think they will charge the cards for the full 15 months

#

?

simple ore Jun 6, 2025, 3:43 PM

#

You are trying to do the most compute demanding tasks one can think of.. for free

worn walrus Jun 6, 2025, 3:49 PM

#

this guy...

#

guys.. its a FREE plan

knotty moth Jun 6, 2025, 3:50 PM

#

worn walrus guys.. its a FREE plan

free with some asterisk ffs*

worn walrus Jun 6, 2025, 3:51 PM

#

wdym tho

#

it is free

#

for students

#

in college

simple ore Jun 6, 2025, 3:51 PM

#

are you a student?

#

you are trying to steal a service that is being provided conditionally by google with certain expectations

#

also be aware that google is logging anything and would report you if you attempt to do anything below the board

worn walrus Jun 6, 2025, 3:57 PM

#

sorry i dont ever recall saying i was going to use it - i was merley asking a question. so pls mind ur own business

lime patrol Jun 6, 2025, 4:10 PM

#

what to do if the ai itself says TRAIL

low shard Jun 6, 2025, 5:00 PM

#

lime patrol what to do if the ai itself says TRAIL

elaborate

#

what's ur pc gpu? what do u want to do? what tut link are u following?

vital merlin Jun 6, 2025, 5:02 PM

#

how can i sound like a waifu

#

perfectless

#

i wanna sound like a mommy

#

yt_nails

lime patrol Jun 6, 2025, 5:03 PM

#

low shard what's ur pc gpu? what do u want to do? what tut link are u following?

4060/I want to use the voice/from the streamer

vital merlin Jun 6, 2025, 5:05 PM

#

i use a ryzen 5 5500

#

and a 3060 TI

low shard Jun 6, 2025, 5:05 PM

#

lime patrol 4060/I want to use the voice/from the streamer

want to use it in realtime? or in pre-recorded audios? also, share the link of the tutorial you used

#

I hope you didnt use a youtube tutorial, since video tutorials are outdated asf

lime patrol Jun 6, 2025, 5:06 PM

#

low shard want to use it in realtime? or in pre-recorded audios? also, share the link of t...

realtime

lime patrol Jun 6, 2025, 5:06 PM

#

low shard I hope you didnt use a youtube tutorial, since video tutorials are outdated asf

sorry😅

low shard Jun 6, 2025, 5:06 PM

#

lime patrol sorry😅

you can delete everything you got off youtube

#

they use an over year old software

#

-realtime

patent trellisBOT Jun 6, 2025, 5:06 PM

#

low shard -realtime

💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

low shard Jun 6, 2025, 5:06 PM

#

read the 1st link

lime patrol Jun 6, 2025, 5:20 PM

#

low shard read the 1st link

official or deiteris fork?

low shard Jun 6, 2025, 7:16 PM

#

lime patrol official or deiteris fork?

deiteris fork

crude flame Jun 6, 2025, 7:18 PM

#

low shard deiteris fork

oh btw if a user has a nvidia gpu you might want to also recommend vonovox

#

its better than deiteris' fork

low shard Jun 6, 2025, 7:18 PM

#

crude flame its better than deiteris' fork

why's that?

crude flame Jun 6, 2025, 7:18 PM

#

better performance

#

same quality

low shard Jun 6, 2025, 7:19 PM

#

crude flame better performance

were there tests done? may i see them?

crude flame Jun 6, 2025, 7:20 PM

#

low shard were there tests done? may i see them?

not extensive tests but tests

i am able to get 35 ms of delay with vonovox with no game open and with overwatch on ultra graphics on 1440p i can get like 50ms delay

#

lyery has also messed with it i think

viral mason Jun 6, 2025, 8:48 PM

#

vital merlin i wanna sound like a mommy

https://tenor.com/view/gubby-roblox-forsaken-doors-gubby-forsaken-gif-193523705627044412

Tenor

sand bison Jun 6, 2025, 9:11 PM

#

low shard # Train (make) RVC Models on cloud: 1. [Prepare the Dataset](<https://docs.ai-hu...

But what would be the direct link to the colab since I'm looking and I just can't find it?

sand bison Jun 6, 2025, 9:28 PM

#

low shard what's your pc gpu first

An RX 580 and an i7 2700k are no good for making models locally.

paper bloom Jun 6, 2025, 11:03 PM

#

is there any way to change the accent on owakada? like even when i speak my first language it sound like i got an accent. is it using like an asian accent ?

simple ore Jun 6, 2025, 11:07 PM

#

paper bloom is there any way to change the accent on owakada? like even when i speak my firs...

with 0 index value it uses your own pronunciation

#

if you increase the index use it would use a blend between the voice model and you, at 1 it is the voice model's... but there's performance drawback from using the index

sullen lion Jun 6, 2025, 11:23 PM

#

illustrious lora training parameter question

#

why the fuck is this

#

when i

#

kohya if you couldnt tell

simple ore Jun 6, 2025, 11:32 PM

#

what's the source for 1st screenshot?

simple ore Jun 6, 2025, 11:37 PM

#

sullen lion why the fuck is this

#

in case you have not seen github ticket

viral cape Jun 6, 2025, 11:41 PM

#

where i download the voice mod?

random latch Jun 6, 2025, 11:42 PM

#

what kind of files does the beatrice vst take? I'm trying to put a model in but it shows .toml files

#

Also can someone convert an rvc or pth file to toml

sullen lion Jun 6, 2025, 11:50 PM

#

simple ore

not a valid branch

#

and i cant tell if it was pushed or nixed

winter dew Jun 6, 2025, 11:59 PM

#

simple ore if you increase the index use it would use a blend between the voice model and y...

so do people normally use index or not?

simple ore Jun 7, 2025, 12:04 AM

#

winter dew so do people normally use index or not?

for realtime usually not

#

but if your performance is fine, go for it

simple ore Jun 7, 2025, 12:05 AM

#

sullen lion not a valid branch

strange

#

https://github.com/bmaltais/kohya_ss/issues/3147

#

https://github.com/bmaltais/kohya_ss/issues/3220

winter dew Jun 7, 2025, 12:08 AM

#

simple ore for realtime usually not

i see

#

do you find you could tell if someones using it?

#

im trying to get it to sound realistic but it just doesnt lol

analog obsidian Jun 7, 2025, 12:08 AM

#

index doesnt make things realistic, its just where the accent of the model is stored, so it kinda makes the model sound more truthful to the original dataset

#

for a more realistic result train a big dataset cat_dance

winter dew Jun 7, 2025, 12:10 AM

#

analog obsidian index doesnt make things realistic, its just where the accent of the model is st...

ohhh

#

makes sense

#

do you recommend any natural sounding female models?

#

searched through the channel for it but maybe you have the secret weapon

random latch Jun 7, 2025, 12:16 AM

#

can someone help me

simple ore Jun 7, 2025, 12:19 AM

#

this question.... 50 times a day

random latch Jun 7, 2025, 12:23 AM

#

im new

crude flame Jun 7, 2025, 12:23 AM

#

winter dew do you recommend any natural sounding female models?

this one https://discord.com/channels/1159260121998827560/1341216399372062823

winter dew Jun 7, 2025, 12:25 AM

#

simple ore this question.... 50 times a day

i mean it’s hard to find im ngl

#

so many egirl models and no resources on natural ones

#

makes sense why people ask considering this is why most people go to voice changers

analog obsidian Jun 7, 2025, 12:26 AM

#

winter dew so many egirl models and no resources on natural ones

just train yourself one

#

its easy

winter dew Jun 7, 2025, 12:26 AM

#

yea but why?

#

rvc seems to have been out for a while so id be surprised if no one’s made a good one

analog obsidian Jun 7, 2025, 12:27 AM

#

i have natural models, it just takes time

winter dew Jun 7, 2025, 12:27 AM

#

like I don’t really mind going through making one but I feel like someone else would have made something a lot better than I could’ve

winter dew Jun 7, 2025, 12:27 AM

#

analog obsidian i have natural models, it just takes time

how long?

#

if it’s not a lot of manual work that’s fine for me tbh

sullen lion Jun 7, 2025, 12:28 AM

#

simple ore strange

ok ill try to get on the dev branch and try again

analog obsidian Jun 7, 2025, 12:28 AM

#

winter dew how long?

depends
for natural results usually a week, for mid results maybe 1 day or less

winter dew Jun 7, 2025, 12:29 AM

#

analog obsidian depends for natural results usually a week, for mid results maybe 1 day or less

is it a lot of just afk hours making it?

#

bc I mean I could try making one that takes longer

analog obsidian Jun 7, 2025, 12:30 AM

#

winter dew is it a lot of just afk hours making it?

nope, u gotta clean the dataset which takes a lot of time

lime crag Jun 7, 2025, 12:31 AM

#

Do people still use applio for model making? I really haven't heard anyone mention it for some time.i also don't think it's been updated for a few months

simple ore Jun 7, 2025, 12:31 AM

#

What else is there lol?

#

Sure you can go back to the mainline

lime crag Jun 7, 2025, 12:32 AM

#

I guess applio is still the best option

winter dew Jun 7, 2025, 12:32 AM

#

analog obsidian nope, u gotta clean the dataset which takes a lot of time

ohhh

#

now I get why there’s not many good models lmao

#

do you know any that are actually decent or do most just use what they make

analog obsidian Jun 7, 2025, 12:33 AM

#

i use mine

#

its quite easy to make models not sound robotic

winter dew Jun 7, 2025, 12:39 AM

#

analog obsidian its quite easy to make models not sound robotic

I see

#

how much time would you say per day id have to put to make it sound good then?

#

if it’s like 20 min a day id consider it lol

analog obsidian Jun 7, 2025, 12:40 AM

#

analog obsidian its quite easy to make models not sound robotic

well this was a 5 hour set that took me around hmm 1 week or so to clean? the audio quality is extremely bad so i had to remove tons of stuff

#

i think i was cleaning 40 minute per day

winter dew Jun 7, 2025, 12:41 AM

#

oh damn

random latch Jun 7, 2025, 12:41 AM

#

How does one put an rvc model in beatrice vst?

winter dew Jun 7, 2025, 12:41 AM

#

I think another challenge is actually finding data sets lmao

#

idk where id even start with that tbh

#

wait so the actual length depends on the data set you give it?

#

like a 3 hour data set would be like 3-4 days

analog obsidian Jun 7, 2025, 12:42 AM

#

winter dew like a 3 hour data set would be like 3-4 days

my set without cleaning was around 8 hours

winter dew Jun 7, 2025, 12:43 AM

#

analog obsidian my set without cleaning was around 8 hours

that’s a bit confusing ngl but im sure I’ll figure it out once I research how it works

#

do you think a lesser data set like 4 hours is enough?

#

im assuming the data sets in the voice models channel were small ones

analog obsidian Jun 7, 2025, 12:43 AM

#

winter dew do you think a lesser data set like 4 hours is enough?

the 2 hour model i trained of that person sounded pretty mid

#

not robotic but mid

analog obsidian Jun 7, 2025, 12:44 AM

#

winter dew im assuming the data sets in the voice models channel were small ones

you guessed it, small datasets give robotic results

winter dew Jun 7, 2025, 12:44 AM

#

analog obsidian the 2 hour model i trained of that person sounded pretty mid

so 2 hours was the original data set like your 8 hour one?

analog obsidian Jun 7, 2025, 12:45 AM

#

originally i had a 3 hour stream, that after cleaning, truncating etc, got me around 2 hours and 30 minutes??? can't remember

#

ah yes

#

but that person talks a lot in their streams so ye

winter dew Jun 7, 2025, 12:46 AM

#

I see

#

so id probably need like at least 6 hours of data

#

or somewhere around there

analog obsidian Jun 7, 2025, 12:46 AM

#

hold on i still have the 2 hour model

analog obsidian Jun 7, 2025, 12:47 AM

#

analog obsidian its quite easy to make models not sound robotic

so u can compare it to this

#

so this is the 2 hour model

#

does kinda sound like him??? but not as accurate as the 5 hour one

crude flame Jun 7, 2025, 12:47 AM

#

winter dew I think another challenge is actually finding data sets lmao

if you want mommy egirl model literally just search f4m on yt and you will find so much

analog obsidian Jun 7, 2025, 12:48 AM

#

analog obsidian does kinda sound like him??? but not as accurate as the 5 hour one

but still decent since its not robotic at all

winter dew Jun 7, 2025, 12:48 AM

#

crude flame if you want mommy egirl model literally just search f4m on yt and you will find ...

LMAOOOO

#

I didn’t even know something like this existed

analog obsidian Jun 7, 2025, 12:48 AM

#

misc_trolley

winter dew Jun 7, 2025, 12:49 AM

#

analog obsidian does kinda sound like him??? but not as accurate as the 5 hour one

hmm

crude flame Jun 7, 2025, 12:49 AM

#

winter dew LMAOOOO

lol

winter dew Jun 7, 2025, 12:49 AM

#

okay

#

razer the problem is though these are like 17 mins each

analog obsidian Jun 7, 2025, 12:49 AM

#

i would say 2 hours is enough for most models to not sound robotic

winter dew Jun 7, 2025, 12:49 AM

#

are you saying take multiple videos

crude flame Jun 7, 2025, 12:49 AM

#

winter dew are you saying take multiple videos

yup

analog obsidian Jun 7, 2025, 12:49 AM

#

take multiple streams

#

maybe 3 or 4

winter dew Jun 7, 2025, 12:49 AM

#

wait yea there’s like 7 hour dumps

analog obsidian Jun 7, 2025, 12:50 AM

#

yeah u can just take that 7 hour alone

crude flame Jun 7, 2025, 12:50 AM

#

but these mommy voice are mad monotone so they wont sound as good as a expressive voice

analog obsidian Jun 7, 2025, 12:50 AM

#

true

winter dew Jun 7, 2025, 12:50 AM

#

analog obsidian yeah u can just take that 7 hour alone

can there be background noise?

#

these videos have like breathing and rain and shit

winter dew Jun 7, 2025, 12:50 AM

#

crude flame but these mommy voice are mad monotone so they wont sound as good as a expressiv...

yeah that’s the problem ngl

analog obsidian Jun 7, 2025, 12:50 AM

#

winter dew can there be background noise?

depends

winter dew Jun 7, 2025, 12:50 AM

#

the monotone gives it away imo

analog obsidian Jun 7, 2025, 12:50 AM

#

if u wanna train using cvec, no

#

but spin it's better at handling noise

winter dew Jun 7, 2025, 12:51 AM

#

ill keep that in mind

#

yea ngl I need something that is more expressive

#

legit just a regular woman’s voice is what I need

analog obsidian Jun 7, 2025, 12:52 AM

#

my workflow is using a noise gate to remove noise, then manually silencing every bad part

#

cat_yes

winter dew Jun 7, 2025, 12:52 AM

#

LMAO okay

analog obsidian Jun 7, 2025, 12:52 AM

#

but ye keep in mind very monotone datasets can't do much

analog obsidian Jun 7, 2025, 12:52 AM

#

analog obsidian its quite easy to make models not sound robotic

this one is very monotone i can say

winter dew Jun 7, 2025, 12:53 AM

#

yea it sounded like it

#

do you have any recommendations on getting larger data sets of women voices?

crude flame Jun 7, 2025, 12:53 AM

#

winter dew do you have any recommendations on getting larger data sets of women voices?

vtuber

analog obsidian Jun 7, 2025, 12:53 AM

#

i only train male voices :D

winter dew Jun 7, 2025, 12:53 AM

#

maybe like certain voice actors idk really what’s out there

winter dew Jun 7, 2025, 12:53 AM

#

crude flame vtuber

ooo I see

crude flame Jun 7, 2025, 12:54 AM

#

you can also look up speed painting with commentary