#✨│ai-help
oh I dunno how to use it locally, NeuralNoob may know more
i followed the website for the AMD section perfectly but whenever i launch run-applio-amd.bat i just get
`HIP Library Path: C:\WINDOWS\SYSTEM32\amdhip64_7.dll
Failed to launch on port 6969, trying again on port 6968...
Failed to launch on port 6968, trying again on port 6967...
Failed to launch on port 6967, trying again on port 6966...
Failed to launch on port 6966, trying again on port 6965...
Failed to launch on port 6965, trying again on port 6964...
Failed to launch on port 6964, trying again on port 6963...
Failed to launch on port 6963, trying again on port 6962...
Failed to launch on port 6962, trying again on port 6961...
Failed to launch on port 6961, trying again on port 6960...
Failed to launch on port 6960, trying again on port 6959...
Press any key to continue . . `
nvm fixed it changed it to the same port Wokada uses in app.py
DEFAULT_SERVER_NAME = "127.0.0.1"
DEFAULT_PORT = 18888
MAX_PORT_ATTEMPTS = 10
in app.py
all i did was change the port
from 6969
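For anyone hitting the same "Failed to launch on port" loop: a rough sketch for checking which port is actually free before editing app.py. It counts down from the start port the way the log above does. `first_free_port` is a made-up helper for illustration, not something from Applio.

```python
import socket
from typing import Optional


def first_free_port(host: str, start_port: int, max_attempts: int) -> Optional[int]:
    """Return the first port we can bind, counting down from start_port.

    Hypothetical helper: it mirrors the descending retries seen in the log,
    it is not Applio's own code.
    """
    for offset in range(max_attempts):
        port = start_port - offset
        with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
            try:
                sock.bind((host, port))
            except OSError:
                # Port is taken (or reserved/blocked by the OS); try the next one down.
                continue
            return port
    return None


if __name__ == "__main__":
    # 6969 and 10 attempts match the log above; a result of None means every
    # port in that range is unavailable, so a different DEFAULT_PORT is needed.
    print(first_free_port("127.0.0.1", 6969, 10))
```

If this prints None for the 6969 range but a number for something like 18888, that matches the fix above of changing the port.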
Hey I uh
Hi Kinger
downloaded Replay
and well
frankly I'm pretty excited
got the voices downloaded and all
now I just gotta wait for Replay to download these files it needs
It's been on this D-f048k-TITAN-Medium.pth thing for a while
do u want my Digital circus models that I don't have public in this server? they're not the best but they're all decent
that's normal right?
Agh erm... nah... I hate TADC
sorry
probably, it may be a large file
ah oki!
Yep we're good it's almost done
I said it
and I'LL FUCKING SAY IT AGAIN!
I-
awwwww :((
that would've been a good one
yea!
could you elaborate what program you're having issues with
what gpu do u have(Nvidia or AMD) and what are you using it for?
like playing as Darth Vader or Goku, something cool right?
no, but I can give u a better thing
vonovox
does the same thing but it's the current best for realtime rvc voice changing stuff
way better quality
etc
wdym? 👀
lol
have you read the rules?
anything at all related to catfishing isn't allowed
mmmmh
<@&1159293140440723499>
ok?
just get rid of preds the normal way, find them and eliminate them, remove their breathing tube
preds are not a stable source of income
I don't understand it
that's
mhm
you should report them to the police, not take advantage
just don't bring it here annnnd use literally anything but egirl models
who
who is jidion
never heard of him
yikes we got a problem
Error creating song: CUDA error: no kernel image is available for execution on the device Search for `cudaErrorNoKernelImageForDevice' in https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__TYPES.html for more information. Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions.
says this when I try to start the conversion
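This error usually means the installed PyTorch build has no compiled kernels for the GPU's compute capability. A rough sketch for checking that is below. `has_kernels_for` is a made-up helper, and the exact-match test is a simplification, since it ignores PTX and same-major binary compatibility. `torch.cuda.get_device_capability` and `torch.cuda.get_arch_list` are real PyTorch calls.

```python
from typing import Iterable, Tuple


def has_kernels_for(capability: Tuple[int, int], arch_list: Iterable[str]) -> bool:
    """Check whether a build's arch list names this exact compute capability.

    Simplified on purpose: only looks for an exact sm_XY entry.
    """
    major, minor = capability
    return f"sm_{major}{minor}" in set(arch_list)


if __name__ == "__main__":
    try:
        import torch
    except ImportError:
        print("PyTorch is not installed in this environment")
    else:
        if torch.cuda.is_available():
            cap = torch.cuda.get_device_capability(0)  # e.g. (6, 1)
            archs = torch.cuda.get_arch_list()         # e.g. ['sm_75', 'sm_80', ...]
            print("GPU capability:", cap)
            print("build targets:", archs)
            print("exact kernels present:", has_kernels_for(cap, archs))
        else:
            print("CUDA is not available to this PyTorch build")
```

If the GPU's capability isn't in the list, the fix is normally a PyTorch build (or an app bundling one) that still targets that card, not a driver reinstall.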
what are you trying to use
Replay
this is my setup
use applio
wha-
ohhh i see damn thats a lot of stuff
replay is no longer supported
which are the parts that cost money
was literally about to ping you
and could u send a link for where to download
nothing, I pirated bought fl studio, everything else is free
i see i see i might know a guy
what gpu do you have
lemme look...
NVIDIA GeForce GTX 1050
Yea I have a gaming laptop
biggest regret ever
my fourth pc has that, it will be sloooooow
😭
happy holidays and good fucking night
oh

I finished training in Applio and i got the index file along with D_333333.pth and G_333333.pth what do i do with these?
it didnt generate any pth file with the same name as the index file, in the output it had the index file along with G_333333 and D_333333
that's not good
what folder should I find it in once it trains? is it ApplioExported?
<@&1159293204038955078>
I'm using Applio interface with public URL https://colab.research.google.com/github/iahispano/applio/blob/main/assets/Applio.ipynb and once i train the model, i dont see a .pth file with the same name as my index. The index is there, but its there with D_3333333.pth and G_3333333.pth in the logs folder. am i looking in the right place?
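As far as I know, the `D_*.pth` and `G_*.pth` files are the discriminator/generator training checkpoints, not the model you load for conversion. Here's a small sketch for sorting out what's in a logs folder. `sort_pth_files` and the folder path are made up for illustration, and where Applio puts the final model can differ by version.

```python
from pathlib import Path
from typing import Dict, List


def sort_pth_files(folder: str) -> Dict[str, List[str]]:
    """Split .pth files into D_/G_ training checkpoints and everything else."""
    checkpoints: List[str] = []
    candidates: List[str] = []
    for path in sorted(Path(folder).rglob("*.pth")):
        if path.name.startswith(("D_", "G_")):
            checkpoints.append(path.name)
        else:
            # Anything not prefixed D_/G_ is a candidate for the usable model.
            candidates.append(path.name)
    return {"training_checkpoints": checkpoints, "candidate_models": candidates}


if __name__ == "__main__":
    # "logs" is an assumed folder name; point it at your own training output.
    print(sort_pth_files("logs"))
```

If `candidate_models` comes back empty, the final model was probably never exported, which would match what's described above.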
im learning fl studio now XD
@viral mason what is line 1? i only have the VB-Cable input and output
line 1 comes from vac lite
https://vac.muzychenko.net/en/download.htm here or another place?
Gang, is anyone running okada on AMD gpus? I'm trying everything and it's not working 
have you tried wokada tg fork?
I even tried a fork of tg fork of okada fork
forkception
that link isn't familiar at all
mine or his?
this is a scam and malware and it's used by scammers.
i suggest NOT using that
WHAT 
jeepers
im joking but that fork is useless prob
I'm trying tg now
try this one, and not whatever that is https://github.com/tg-develop/voice-changer/releases/download/b2397/voice-changer-windows-amd64-dml.zip
this is wokada tg fork for AMD
not that thing u just had
I will! 
:3
is dat windows version?
mhm
am on linux 
oh
idk how to help anymore
<@&1159293204038955078> give this kind soul the wokada tg fork download for linux amd
i dunno what one it is
guys what are gpt sovits in #1175430844685484042
tts models
How do i make it sound less roboty
im not sure, maybe try with more epochs? :)
beep boop boop beep
What's epochs
they're talking about in the voice changer I assume, epochs cannot be changed
how many "cycles" the model is trained on to learn the voice better, fewer = usually worse quality, more = usually better quality (up to the point where it overtrains)
ohh ok
How do i train it then
u cannot train an existing model
you can only create new ones with audio you collect or public datasets
so true
model, start running on linux!
I dunno
test the model
the best models can be found here https://discord.com/channels/1159260121998827560/1175430844685484042
just look around
model, start production of quantum computers that cost 40 cents and can run deepseek 4 pro at over 300 tokens per second
also make me a hot choco 
ts is stuck on downloading gang

linux?
It's downloaded, just doesn't run anyway
Hatsune miku? :D
yes 
Getting this on tg's fork on RDNA2 6800XT, can't copy text somehow
<@&1159293204038955078>
I was able to fix it with
```Bash
sudo pacman -S patchelf binutils
target="./_internal/onnxruntime/capi/onnxruntime_pybind11_state.so"
cp -a "$target" "$target.bak"
readelf -lW "$target" | grep GNU_STACK
patchelf --clear-execstack "$target"
readelf -lW "$target" | grep GNU_STACK
```
but it runs only on CPU and not GPU 
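The two `readelf ... | grep GNU_STACK` lines are there to compare the flags before and after patching (`RWE` means executable stack, `RW` means cleared). A sketch of the same check in Python, assuming `readelf` from binutils is installed. Both helper names are made up.

```python
import subprocess


def gnu_stack_is_executable(readelf_output: str) -> bool:
    """Return True if the GNU_STACK program header carries the E (execute) flag."""
    for line in readelf_output.splitlines():
        fields = line.split()
        if fields and fields[0] == "GNU_STACK":
            # Columns: Type Offset VirtAddr PhysAddr FileSiz MemSiz Flg Align.
            # Everything except the type and the flags is printed as 0x... hex.
            flags = [f for f in fields[1:] if not f.startswith("0x")]
            return any("E" in f for f in flags)
    # This sketch simply refuses to guess when the header is absent.
    raise ValueError("no GNU_STACK program header in the readelf output")


def library_needs_patch(path: str) -> bool:
    """Run `readelf -lW` on a shared object and inspect its GNU_STACK flags."""
    result = subprocess.run(
        ["readelf", "-lW", path], check=True, capture_output=True, text=True
    )
    return gnu_stack_is_executable(result.stdout)
```

`library_needs_patch("./_internal/onnxruntime/capi/onnxruntime_pybind11_state.so")` should return True before the `patchelf --clear-execstack` step and False after it.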
sorry i dont use linux and gemini wont help me :(
Gemini is the dumbest AI I've ever used
I remember asking it some technical question about spotify and it was like uhh cant help you with that
it helped me tons with my discord bot
I was like WDYM YOU CANT HELP ME and then it proceeded to answer (got the answer wrong)
fuck all those, the e girl tag has been deleted a long time ago (why have it? no reason compared to like the technical tags we have), the commissions in fact caused a lot of issues especially with scams, female models themselves aren't a problem, the issue is people misusing
all of those tutorials aren't affiliated to us and the youtubers are banned
do you have a slow internet connection usually?
no, I got a gigabit channel
that's really weird, could you try downloading the fix in that link?
it's not downloading issue, it's not running
on the first run, the program downloads internal pretrained models, it might be that they didn't successfully download, it happened just earlier to another user where the program ran but the models didn't download correctly so it didn't work, but they were on windows
btw are you sure your amd drivers and os are up to date?
yeah, am on a rolling release, fresh as a bunny
oh my god, fixed this with docker
dont mind it
should be using line 1 there and if the hyperx bla bla is your normal mic then that's fine
I havent setup output thingy, was just trying to make this whole thing work on linux first
This is how I managed to make tg fork of okada work on Linux with AMD GPU with Docker
OS: CachyOS
GPU: AMD Radeon RX 6800 XT / gfx1030
Repo: tg-develop/voice-changer
1. Problem summary:
The prebuilt MMVCServerSIO bundle may say:
Voice changer version: AMD-ROCm
but still only show CPU conversion.
In our case, the bundled ONNX Runtime was ROCm-capable:
`ROCMExecutionProvider`, `MIGraphXExecutionProvider`, `CPUExecutionProvider`
but bundled PyTorch was CUDA-only:
`libtorch_cpu.so`, `libtorch_cuda.so`
and not ROCm/HIP:
missing `libtorch_hip.so`, missing `libc10_hip.so`
So PyTorch reported:
`torch hip: None`, `cuda available: False`, `device count: 0`
That means the UI only exposes CPU.
The fix is to run the source version of tg-develop/voice-changer with proper ROCm PyTorch.
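If you want to check your own bundle for the same symptom before going the Docker route, something like this sketch could do it. The library names come from the summary above. `bundled_torch_backend` and the bundle path are hypothetical.

```python
from pathlib import Path


def bundled_torch_backend(bundle_dir: str) -> str:
    """Guess which GPU backend a bundled PyTorch was built for, by file name."""
    root = Path(bundle_dir)
    names = {p.name for p in root.rglob("libtorch_*.so")}
    names |= {p.name for p in root.rglob("libc10_*.so")}
    if "libtorch_hip.so" in names or "libc10_hip.so" in names:
        return "rocm"
    if "libtorch_cuda.so" in names:
        return "cuda"
    return "cpu-only"


if __name__ == "__main__":
    # Assumed location of the extracted prebuilt bundle; adjust to your path.
    print(bundled_torch_backend("./MMVCServerSIO/_internal"))
```

A result of "cuda" on an AMD box is the situation described above: the app starts, but PyTorch can only offer CPU.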
2. Host requirements
Install Docker, your user should be in video, render, docker groups (sudo usermod -aG video,render,docker "$USER" if not)
```Bash
sudo systemctl enable --now docker
sudo usermod -aG docker "$USER"
# log out and back in (or run: newgrp docker) so the group change applies
docker run --rm hello-world
```
**3. Pull ROCm ONNX Runtime base image**
We use AMD’s ROCm 6.4 + ONNX Runtime 1.21 image because tg-develop/voice-changer ROCm requirements are built around onnxruntime-rocm 1.21.0. ONNX Runtime documents ROCm and MIGraphX execution providers for AMD GPU acceleration.
```docker pull rocm/onnxruntime:rocm6.4.1_ub24.04_ort1.21_torch2.6.0```
Test that the container sees the GPU:
```Bash
docker run --rm -it \
--device=/dev/kfd \
--device=/dev/dri \
--group-add video \
--group-add render \
--ipc=host \
--security-opt seccomp=unconfined \
-e HSA_OVERRIDE_GFX_VERSION=10.3.0 \
rocm/onnxruntime:rocm6.4.1_ub24.04_ort1.21_torch2.6.0 \
bash -lc 'rocminfo | grep -E "Name:|gfx" | head -30'```
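On the `HSA_OVERRIDE_GFX_VERSION=10.3.0` value: as I understand the naming, it is the gfx target from `rocminfo` split into major.minor.stepping, so gfx1030 becomes 10.3.0. A sketch of that mapping is below. The rule is my reading of the convention, not an official API, and `gfx_to_override` is a made-up name. It only handles all-digit names and rejects ones like gfx90a.

```python
import re


def gfx_to_override(gfx_name: str) -> str:
    """Turn an all-digit gfx target (e.g. gfx1030) into major.minor.stepping."""
    match = re.fullmatch(r"gfx(\d+)(\d)(\d)", gfx_name)
    if match is None:
        # Names with a hex stepping (like gfx90a) are out of scope for this sketch.
        raise ValueError(f"unsupported gfx name: {gfx_name!r}")
    major, minor, stepping = match.groups()
    return f"{int(major)}.{minor}.{stepping}"


if __name__ == "__main__":
    print(gfx_to_override("gfx1030"))
```

For a card that already reports gfx1030 the override should not change anything. It mostly matters for neighbouring chips that have to present themselves as a supported target.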
I have been intimidated
4. Clone tg-develop/voice-changer
git clone https://github.com/tg-develop/voice-changer.git
5. Create Dockerfile
Inside of your voice-changer folder
nano Dockerfile.rocm64
and paste this (2 separate posts because discord wont let me post in one, you need to copy from both)
```Dockerfile
FROM rocm/onnxruntime:rocm6.4.1_ub24.04_ort1.21_torch2.6.0
ENV DEBIAN_FRONTEND=noninteractive
ENV ROCM_PATH=/opt/rocm
ENV HIP_PATH=/opt/rocm
ENV HSA_OVERRIDE_GFX_VERSION=10.3.0
ENV HIP_VISIBLE_DEVICES=0
ENV ROCR_VISIBLE_DEVICES=0
RUN apt update && apt install -y \
git \
wget \
curl \
gcc \
g++ \
make \
unzip \
ffmpeg \
libportaudio2 \
portaudio19-dev \
libasound2t64 \
libasound2-dev \
libsndfile1 \
alsa-utils \
python3-pip \
&& rm -rf /var/lib/apt/lists/*
WORKDIR /tmp/build
COPY server/requirements-common.txt /tmp/requirements-common.txt
COPY server/requirements-rocm.txt /tmp/requirements-rocm.txt
RUN python3 -m pip install --break-system-packages --upgrade pip wheel setuptools
# Common voice-changer deps.
RUN python3 -m pip install --break-system-packages -r /tmp/requirements-common.txt
# Install ROCm requirements, but skip torch/torchaudio/onnxruntime lines.
# We install ROCm torch manually later from the PyTorch ROCm index.
RUN awk '\
/^[[:space:]]*($|#)/ { next } \
/--index-url|--extra-index-url|--find-links/ { next } \
/onnxruntime/ { next } \
/^torch==/ { next } \
/^torchaudio==/ { next } \
/^torchvision==/ { next } \
{ print }' /tmp/requirements-rocm.txt > /tmp/requirements-rocm-filtered.txt && \
cat /tmp/requirements-rocm-filtered.txt && \
python3 -m pip install --break-system-packages -r /tmp/requirements-rocm-filtered.txt
# Some deps may accidentally pull CUDA torch. Remove it.
RUN python3 -m pip uninstall -y torch torchaudio torchvision triton pytorch-triton pytorch-triton-rocm || true
# Install ROCm PyTorch.
# If 2.9.1+rocm6.4 is unavailable later, use the newest version pip says is available.
RUN python3 -m pip install --break-system-packages \
--index-url https://download.pytorch.org/whl/rocm6.4 \
'torch==2.9.1+rocm6.4' \
'torchaudio==2.9.1+rocm6.4'
# Sanity check.
RUN python3 - <<'PY'
import torch
import onnxruntime as ort
print("torch:", torch.__version__)
print("torch hip:", torch.version.hip)
print("torch cuda available:", torch.cuda.is_available())
print("torch device count:", torch.cuda.device_count())
print("ort:", ort.__version__)
print("ort providers:", ort.get_available_providers())
PY
CMD ["bash"]```
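In case the `awk` step in the Dockerfile looks cryptic: it drops blank/comment lines, index-url style options, anything mentioning onnxruntime, and the pinned torch/torchaudio/torchvision lines, then keeps the rest. Below is a rough Python equivalent using the same patterns. The sample package names in the usage are made up, not the real contents of requirements-rocm.txt.

```python
import re
from typing import List

# Same patterns as the awk rules, in the same order.
SKIP_PATTERNS = [
    re.compile(r"^\s*($|#)"),                                   # blank or comment
    re.compile(r"--index-url|--extra-index-url|--find-links"),  # pip index options
    re.compile(r"onnxruntime"),                                 # provided by the base image
    re.compile(r"^torch=="),
    re.compile(r"^torchaudio=="),
    re.compile(r"^torchvision=="),
]


def filter_requirements(text: str) -> List[str]:
    """Return the requirement lines that survive the filter."""
    return [
        line
        for line in text.splitlines()
        if not any(pattern.search(line) for pattern in SKIP_PATTERNS)
    ]


if __name__ == "__main__":
    sample = "torch==2.6.0\n# comment\n\nnumpy==1.26.4\nonnxruntime-rocm==1.21.0\n"
    print(filter_requirements(sample))
```

Note the torch patterns are anchored at the start of the line and end in `==`, so other packages whose names merely start with "torch" are kept.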
6. Build image
from your voice-changer folder:
docker build -f Dockerfile.rocm64 -t tg-vc-rocm64 .
7. Run container
Create persistent data folders, for me it was
```Bash
mkdir -p /mnt/data/random/VC/tg2/rvc-data/sound_dir
```
Run:
```Bash
docker run --rm -it \
--device=/dev/kfd \
--device=/dev/dri \
--device=/dev/snd \
--group-add video \
--group-add render \
--group-add audio \
--ipc=host \
--network=host \
--security-opt seccomp=unconfined \
-e HSA_OVERRIDE_GFX_VERSION=10.3.0 \
-e HIP_VISIBLE_DEVICES=0 \
-e ROCR_VISIBLE_DEVICES=0 \
-v /mnt/data/random/VC/tg2/voice-changer:/workspace/voice-changer \
-v /mnt/data/random/VC/tg2/rvc-data:/data \
tg-vc-rocm64```
8. Verify ROCm Torch inside container
```Bash
python3 - <<'PY'
import torch
import onnxruntime as ort
print("torch:", torch.__version__)
print("torch hip:", torch.version.hip)
print("cuda available:", torch.cuda.is_available())
print("device count:", torch.cuda.device_count())
if torch.cuda.is_available():
    print("device:", torch.cuda.get_device_name(0))
print("ort:", ort.__version__)
print("providers:", ort.get_available_providers())
PY
```
Should look like
`torch: 2.9.1+rocm6.4
torch hip: 6.4.x <--------------- you're good if it recognizes torch hip
cuda available: True
device count: 1
device: AMD Radeon RX 6800 XT
ort: 1.21.0
providers: ['MIGraphXExecutionProvider', 'ROCMExecutionProvider', 'CPUExecutionProvider']`
9. Start VC server
```Bash
python3 main.py --log-level debug
```
if you get any errors on your system, just feed these instructions into any AI assistant
pants? pissed.
and don't forget to install pulseaudio inside of the container
pulseaudio-utils libpulse0
I just trained a voice model with this https://colab.research.google.com/github/JackismyShephard/ultimate-rvc/blob/main/notebooks/ultimate_rvc_colab.ipynb where can I put it over an audio file?
i have RVC GUI locally but i think its too old
install what
what gpu do u have (Nvidia or AMD) and what are u going to use it for?
why are u using it on roblox? that platform already has enough weird ppl on it
<@&1159293140440723499> if u read the rules that isn't allowed sorry
Are there any ways to train ai voice models? I been looking for about a week and all of them are outdated
try Kaggle, it has Applio on it and works fine for me
I even recorded how to use it
You guys train the voices in collab/kaggle which is like cloud pc what is the maximum gpu collab has to train voices best?
I train in specifically kaggle using applio
Whats the cpu or gpu specs for kaggle do u use for training
Could you send me that video
Idk if this may be the right channel to post this, but I want to upload 2 voice models and would like the model maker role.
I read the info on the website but there's no model-maker-role channel no more (or at least I can't see it?)
What's next?
how do I post some of my latest ai images in the ai-images channel? do I need a certain role for that?
Oh thank you so much
you'll need image perms if u cannot post images at the moment, just talk
no problem, if u have questions on models or anything I'm available for a bit
Thanks!
just talk?
sure
yea you need a specific level to send images or gifs, not sure what level that is tho
I dunno tbh
Actually now that I think of it... what tool would you recommend that can separate the backing vocals and the main vocals from a song?
Lalala.ai site works the best, but I'm looking for a free alternative with the same quality as them
I use mvsep and I use this specific model
Holy gold, thank you! Will definitely try it later
no problem!
u can add me if you like so u can message me if u have questions or anything
Sure thing lemme do it
Most online cloud services generally use Intel Xeon CPUs, including Kaggle and Google Colab. Kaggle offers dual NVIDIA T4 or a single P100. Kaggle's GPU quota is a fixed 30 hours per week.
You're banned.
hey how yall brainstorm saas ideas ?
What is SaaS?
I7 13700
3090 gigabyte oc
32gb 3600mhz ram
Msi mag mortar b760m ddr4 motherboard
Is this okay my pc i got compared to the cloud kaggle/collab since what you mentioned sounds like those nvidia a100s datacenter gpus with like 40gb or 80gb vram
Bro.
?
So what does it mean then, i havent tried training yet so , and also i just found out training text ai for a 3090 they said it cant handle much and is slow local ai text response compared to the latest models
So thats why im asking
The ones i asked about that does ai with a 3090 said they use dual gpu with 3090 just so its alot faster with just text ai
To be honest, your PC specs and GPU are actually faster than typical Kaggle specs, even though most GeForce RTX GPUs have lower VRAM than those top-tier RTX Pro and data center GPUs. T4 is an older entry-level one.
Really? The ai group im in is like 4 gpu 3090 for ai and they want more, and i feel like shouldnt expect much with just one
Let's move on.
<@&1159293140440723499> don't forget about this thing
Im about to train with the ai you guys have ,probably soon , hopefully things go well aaaa
on kaggle?
Locally with my pc
Applio RVC.
hey is it possible to convert a v1 pth model to v2 ???
ahhh but we cant merge v1 and v2 right?
Hello
Question, how much tb do i need for storage for training the ai software here
Linux Voice Effects
you run it through docker?
@low shard also can i run the softwares on an external ssd and its training data to external hdd
this is weird, we usually don't have linux users but i don't remember them needing such a workaround to make it run, maybe its related to your specific linux distro or gpu drivers, I could try adding this in the docs
what's your pc gpu and os?
you sure you don't want to try training locally first?
they're executed
what ai software? there are multiple ones, elaborate using the help template
!help-template
To receive assistance, you must provide your system details. Copy and paste the block below into your reply and fill it out.
⚠️ NO INFO = NO HELP
- Goal (e.g., TTS, AI Covers, Roleplay):
- Specific Issue:
- Full GPU Name:
- Operating System:
- Tutorial Link used:
• Check Docs: Many fixes are in the AI Hub Docs.
• Be Specific: Say "RTX 3060 12GB", not just "NVIDIA".
• English Only: Keep all discussions in English.
• No assistance for NSFW/Porn or ANY Illegal Activities.
• Read the [Full Guidelines](#1402790586028789830 message).
The ones you have in this server, ill try all of them
this is a general ai discord server, we talk about any ai programs lol
This one
Will a 2tb ssd be enough for the software's training data, or should I put the training data on another hdd
elaborate your specs using this:
- Goal (e.g., TTS, AI Covers, Roleplay):
- Specific Issue:
- Full GPU Name:
- Operating System:
- Tutorial Link used:
I already answered that though
Full gpu: gigabyte 24gb oc
Operating: windows 11
Does anyone know how to train voice models? I honestly wanna make one myself
theres the guide
which model of the gpu? also the goal is missing
ah thanks mate 👍
hi @simple ore sorry if you don't appreciate pings but is changing the LR for pretrains recommended? or is sticking to 1e-4 better? i was thinking cause im doing a weird hacky way of embeddings that might need 5e-5 to prevent mode from collapse
wtf hi ilaria!!! its me frail/helena if you remember heh
So how do you guys make the AI vocals apply to the backing vocals in a song as well?
I'm using Replay and it doesn't really do that.
sorry i had a problem and my memory is wiped almost completely (not a joke) but helena tickles me something
windows 11 omen laptop 7.8 GB of NVIDIA GeForce RTX 3050 Ti Laptop GPU
ouh it's okay! np! i was one of the helpers in the server far before this server was acquired / bought out / whatever, back when it was kalo tea snoop menh running things around here
jeez thats 2023 stuff
that tells us nothing
what do u mean
whats the gpu
yeah wayyy back then
sadly the server is mostly dead
yep i miss those days they were really fun i learnt alot and got into all this
^ would you happen to know anything abt this btw ilaria
7.8 Gb and its NVIDIA GeForce RTX 3050 Ti Laptop GPU
ask me anything in dms
download applio
dmed
i did and it does not work for me, i click on run-applio.bat and it looks like this
did you download the precompiled version?
use applio
What settings would I put on Applio for that?
no special settings just take the backing vocals and convert
I'll give it a shot.
what's your pc gpu and os?
rtx 3050 ti has 4gb vram, not 8gb, 4gb vram might not be good for local rvc training
err I cannot check it rn, I'm currently having it cleaned up a bit. I'll check it by tomorrow
it might be best you tell us later because that can help a lot directing you to the right guide and telling you if its good enough
or i could just give you the Applio guide but you risk wasting time if it's not good enough
I'll just check it myself if it can run good or not
your choice: https://docs.aihub.gg/rvc/local/applio/, https://docs.aihub.gg/essentials/how-to-make-voice-models/
Last update: July 17, 2025
yes, i run the install and it says this
but then run-applio.bat does nothing
Ah wait, I have to extract the backing vocals separately, is it?
And then convert that and the main vocals as separate files?
delete everything, download the latest precompiled version, unzip it and just run run-applio.bat, show what it does
I once tried the single 53-stem BS Roformer model within an MSST notebook on Google Colab with an NVIDIA T4 GPU; the GPU suddenly ran out of memory and the process crashed. My theory is that MSST tried to hold all 53 audio stems in memory at once. The model itself is meant to be run elsewhere. Funny observation. 
there were some experiments, but nothing particularly good came out of them... maybe half of current one for very small datasets
What software are you using? And what for?
yesh, other option would be following og okada guide and going thru venv + replacing torch packages
I was surprised this worked worse on linux, you would assume rocm and ai stuff works better
I think someone more knowledgeable needs to come up with a proper guide for this
Nick is one of staff members who manages AI Hub's docs. 
What is your PC GPU? Did you follow any tutorial or guide before? And what would you use the voice changer for?
wait this looks interesting, might check that out later, that didn't work either?
Hey, I have an RTX 2060, what's the best real-time option for it?
I was using w-okada, but I feel it's outdated.
Maybe I'm just being dumb, but I'm testing it out and it's lagging a lot
have you tested the current beta https://huggingface.co/dr87/vonovox/resolve/c8034f5f6d50648a8109bb4f847182362e2b779b/Vonovox_beta_17_11.zip
ngl
the voicemodels.com server is ass and has no helpers
we are much more organized
i remember working with a site like that
it was weights before weights
were the staff also dumb?
i mean i dont care.
they paid me
aw man
I'm on this version, and what I'm feeling is like the Vonovox isn't picking up my voice, because when I speak right up to the microphone, it picks it up, but the volume is at maximum.
I'm using the same template I use in W-Okada, the template doesn't look bad, I really feel like I have some wrong setting.
Believe it or not, I copied your settings. lol
you did?
the lower gpus tend to struggle somewhat yea, have you tried wokada tg fork?
the 2060 is very old too
@gusty socket could you try my experimental fork without any type of workarounds just following the AI Hub Docs guide normally?: https://github.com/Nick088Official/voice-changer/releases/tag/b2401
what is this?
It should fix the Linux ROCm issue, there was a typo in the requirements-rocm.txt and needed to remove legacy flag from libraries when building
yeah I just forked it to try to fix that, nothing else changes and I can't really test it since I'm not on AMD GPU nor Linux
is this for the AMD linux version?
making sure since I could send this to a friend as well who's been having issues
Yeah this should fix his issue on Linux with AMD GPU, that uses ROCm
I'm not sure if it works and if he had the same issue as minn, but let me know
def not the same issue but it might help it run as he got errors before with it not working
sure, I'm done with work anyway
does it work only on Ubuntu?
hello. how can i use ai voice changer pls? mine is outdated
hey
im using the b2332 nvidia-cuda version of the wokada voicechanger is that an old version?
It's not working for you on CachyOS? It uses Ubuntu 22.04 to build the release, what issue do you get?
!help-template
that seems to be wokada deiteris fork last update which is old and not suggested, elaborate more
!help-template
gpu: msi rx 570 8gb
os: win 10
link used: i used it back in mid 2025
is there a better version?
im using site version of voice changer
this is a general ai discord server, you need to elaborate more, if you're trying to do AI Covers, E Girl Trolling / E boy Trolling / Catfishing or Roleplay
and also the screenshot of the program or program files if you don't have the link, there are many programs, I can't know whatever you're using
you need to first elaborate what I asked, read up the template
okay let me show you real quick
Just was going through the commits, I'm installing it now
its called MMVCServerSIO
tbh idk those answers im rly bad at stuff like this
Gives an error
open the program and show a screenshot to check the version
is there no tutorial how i can install a newer version?
those are needed, what can't you answer? there isn't just a single better voice changer, this is a general ai discord server with multiple programs, elaborate everything
that seems to be an old version of the wokada deiteris fork https://github.com/deiteris/voice-changer/releases/tag/b2309, which is outdated
now there are multiple replacements, but you need to elaborate your goal, are you trying to do Realtime TTS, AI Covers, E Girl Trolling / E boy Trolling / Catfishing or Roleplay?
yes i want to real time TTS. i have got voice models from #1175430844685484042
or like trolling yk
what trolling?
aha, so catfishing 
no not like that i already got my lesson when i did that
like Darth Vader or Goku?
Goal (e.g., TTS, AI Covers, Roleplay): roleplay
- Specific Issue: i wanna know if my voice changer is outdated and if there is a better version, and what makes the better version better?
- Full GPU Name: NVIDIA GeForce RTX 4060
- Operating System: Windows i think
- Tutorial Link used: idk that
like this?
yeah, movie characters and etc.
thank you for being normal
Nick, this one is good do not ban them for their previous answer
what kinda roleplay? like playing as a movie character or something?
then you're using the wrong tools from the start, most models in #1175430844685484042 are RVC, which is STS (speech-to-speech) not TTS (text-to-speech)
You could check https://docs.aihub.gg/tts/realtime-tts/ for realtime tts
Last update: July 28, 2025
will check it up rq
no no i dont need tts
like i need to mimic their voice in real time
idk what its called
It was RVC i believe
ehm like i want to use anime voices from jjk megumi espicially
ive been using the wokada one but so far i couldnt find any models and want to know if mine is outdated
do u also know how to get a newer version of the wokada one
and do u know why those voice models sound so good like u cant even tell its an ai and when i use it it sounds so robotic
what gpu do you have? nvidia or AMD?
nvidia
Specific Issue: i wanna know if my voice changer is outdated and if there is a better version, and what makes the better version better?
Full GPU Name: NVIDIA GeForce RTX 4060
Operating System: Windows i think
Tutorial Link used: idk that
try latest forks from the wiki
idk i saw it in this channel before
where can i find it? back then when i got mine there was tutorial somewhere in this discord
is there also a tutorial for it?
Kinda, but it's very simple to install both, for the second link just extract it and run setup64 and then install drivers, for the first one extract and run Start
Ignore that, that was about a discord server that copied our server
This server is completely safe now
Did you download both files btw
oh and which one should i use it says there are 3 versions of the voice changer
I sent you the files you need
did you download the source code instead of the release and tried to build it manually ? You seem to be running the source code vc_install instead of the MMVCServerSIO built file
You need to follow exactly https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/, but instead of the original github links you have to just download the tar gz aa and ab from: https://github.com/Nick088Official/voice-changer/releases/tag/b2401
not yet, is there no tutorial where i can do it step by step
No, I already said what to do, I would record how to use it but I'm not home rn at my pc
Just follow these instructions and if you're confused I can elaborate more
your gpu is old, but you could try https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/
Last update: April 15, 2026
check the actual guide, it might be easier than using manually the files provided by local worm: https://docs.aihub.gg/realtime-voice-changer/local/vonovox/
Last update: March 30, 2026
do i have to uninstall the wokada i got now?
I would recommend that yes, just to save space
you got an old version, it's not needed anymore, it's better to just use vonovox
What Nick said lol
does this one run better?
i mean you could just give the guide link and tell them to use the beta download link if you wanna give them the beta, some people might need the guide to understand better
vonovox is more updated
do both work the same? like both use rvc models?
Some people struggle with the guide not being adjusted to the beta version and ask about stuff that isn't included or was removed/changed
both use RVC models
Yes they both work the same, Vonovox just has better quality and is much more optimized
They all use rvc models
well because the beta isn't considered a stable release, and the guide can't be updated for every single beta since they come out faster than stable releases, the guide wants to be as stable as possible rather than have some people run into some rare issue
That's fair but the beta release is the most recommended version of Vonovox at least from my view
Much better than the current full release 1.6.9
ty very much another question what makes this one better than the version i got? are the voices better?
or just like the speed is better?
performance and feature-wise
There's a lot that makes it better, it's much more optimized so it shouldn't be laggy in games or on discord, the quality is noticeably higher because of the upscaler that enhances quality, it's very fast and sounds good overall
There are only 64 slots though, which is a downside compared to the 100+ model slots that other okada programs have
That's the only negative thing about it, other than that it's amazing
wdym by slots?
Like how many models you can have on the app
is this how i start it? and it also says post-installation do i have to do it with my gpu
nice ty, can i also make like breathing sounds more real with it, or like can ppl hear when i use my keyboard? cause mine makes weird sounds when it hears the keyboard
It automatically uses your gpu, to start it run the file called "Start"
That's dependent on the model still and it does pick up stuff mostly depending on your microphone but also there are features to suppress it picking up such noises
Wdym
like do i click on ''github repository'' to download that
ahh the second one is a virtual cable?
Yep!
do i need to download it if i already have one?
It'll allow you to use it in games, discord, etc
i think i use the lite one
Nah you only need one
alrighty
@gusty socket please try using the prebuilt release of my fork and let me know, don't build it yourself from the source code
were you perhaps using vc_install instead of the built release for the upstream repo too?
hi i lost my rights idk what happened can i have a star back or just the right to post gifs or whatever or maybe not thats cool too
that's because you left and rejoined, gave you the verified role back but please use the modmail by dming @hushed perch next time
ty
Hello, I need help with a workflow in ComfyUI for LoRA
hey i downloaded it what now?
Extract the zip files and then run the file called Start
yes, I git cloned it, lemme try
i love your pfp
how do i extract it
haha ty
right click, unzip
which one?
Start
ImportError: /mnt/data/random/VC/vc-nick088/MMVCServerSIO/_internal/onnxruntime/capi/onnxruntime_pybind11_state.so: cannot enable executable stack as shared object requires: Invalid argument
anyone know how to use the okada voice changer on mac
start and then extract?
No no, extract the zip file then run start
but like how do i extract it
Seems like the same execstack security policy issue, I will modify the github workflows to try to patch that again
for now you could run this in your terminal:
cd /mnt/data/random/VC/vc-nick088/MMVCServerSIO
find . -type f -name "*.so*" -exec patchelf --clear-execstack {} \; 2>/dev/null
This temp fix will make it run, and let us see whether my fork at least fixed the GPU detection in that build
Or you could wait for my next build in like 20 minutes that should fix both GPU detection and the security policy
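For anyone who would rather script the temp fix above than paste the one-liner, here is a rough Python sketch of the same idea: collect every file with `.so` in its name under the install folder and call `patchelf --clear-execstack` on each. It assumes `patchelf` is installed and on PATH. The function names and the `dry_run` switch are made up for illustration and are not part of the fork.

```python
import subprocess
from pathlib import Path


def find_shared_objects(root):
    """Regular files under root whose name matches *.so* (like find -type f -name "*.so*")."""
    return sorted(
        p for p in Path(root).rglob("*.so*")
        if p.is_file() and not p.is_symlink()
    )


def clear_execstack(root, dry_run=True):
    """Build one patchelf command per shared object; run them only if dry_run is False."""
    commands = [
        ["patchelf", "--clear-execstack", str(p)]
        for p in find_shared_objects(root)
    ]
    if not dry_run:
        for cmd in commands:
            # Failures are ignored on purpose, mirroring the 2>/dev/null in the one-liner.
            subprocess.run(cmd, check=False, stderr=subprocess.DEVNULL)
    return commands
```

Calling `clear_execstack("/mnt/data/random/VC/vc-nick088/MMVCServerSIO")` with the default `dry_run=True` only returns the commands it would run, so you can look at the list before letting it touch any files.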
it runs, but no GPU is detected
could you send logs?
ehm is it normal that it has over 40k elements?
I'm not sure
I downloaded it a while ago
first try this to show me if you actually have the files or there's a pyinstaller bug that somehow removed the rocm libraries:
ls -l _internal/torch/lib/ | grep hip
then try to run it with the flag you talked about earlier:
cd /mnt/data/random/VC/vc-nick088/MMVCServerSIO
HSA_OVERRIDE_GFX_VERSION=10.3.0 ./MMVCServerSIO
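The two checks above (look for hip libraries, then launch with the override) could also be sketched in Python. This is only an illustration: the helper names are hypothetical, and the `10.3.0` default is just the value from the command above, not a recommendation for every card.

```python
import os
from pathlib import Path


def list_hip_libraries(lib_dir):
    """File names in lib_dir containing 'hip', like: ls _internal/torch/lib/ | grep hip."""
    lib_dir = Path(lib_dir)
    if not lib_dir.is_dir():
        return []
    return sorted(p.name for p in lib_dir.iterdir() if "hip" in p.name)


def env_with_gfx_override(version="10.3.0"):
    """Copy of the current environment with HSA_OVERRIDE_GFX_VERSION set."""
    env = dict(os.environ)
    env["HSA_OVERRIDE_GFX_VERSION"] = version
    return env
```

An empty list from `list_hip_libraries("_internal/torch/lib")` means the same thing as grep printing nothing. To launch with the flag, something like `subprocess.run(["./MMVCServerSIO"], env=env_with_gfx_override())` from inside the install folder should be equivalent to the shell line.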
no files, grep returns nothing
mmm grapes
weird, could you try that run command with the flag?
so i extracted it but where do i find the extracted file
it still looks like this
check your downloads folder
cant seem to find it what happens if i extract again
try this
instead of clicking and dragging with left click use right click instead
wont it download it again?
like i extracted it but idk where it is but ill try this one out
I did, same thing
does the folder have the same name after extracting it?
yep it should
is it this one
this is it!
perfect so how do i hear myself?
omg ur so awesome
one more question what's the difference between output and input?
do both affect how loud i am?
this one controls overall loudness and this one u can change for each model
and what is output?
I'm not really sure, but the audio settings should be
Input: normal headset/headphones mic
Output: line 1
yes i mean like u can slide right and left on the output thing, does it change how loud ppl hear me or is input for that?
oh yea it should
Input volume is applied directly on the mic input, before passing to RVC. If the mic level is low, it's worth increasing input volume for better conversion. Other than that, output volume should be used, because that's what directly controls the volume of converted voice without affecting feature extraction.
oh so input is for how my voice gets into the rvc and output controls how loud the other person can hear me?
technically, yeah. Obviously vonovox also handles the volume, so input volume indirectly will affect output volume too, but it's a side-effect, not the primary purpose
For example, If you set input volume very low, then output will be mumbly because feature extraction from low signal will fail. But if you keep input volume normal and set output volume low, the voice will be quiet, but clear
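A toy sketch of the order described above, in case it helps picture it. The converter here is a stand-in, not how RVC feature extraction actually works: it simply loses anything quieter than an assumed floor. All names and numbers are made up for illustration.

```python
def run_voice_chain(mic_samples, input_gain, output_gain, convert):
    """Apply input gain, run the converter, then apply output gain."""
    boosted = [s * input_gain for s in mic_samples]   # what the converter analyses
    converted = convert(boosted)                      # stand-in for the RVC step
    return [s * output_gain for s in converted]       # what listeners hear


def toy_converter(samples, floor=0.05):
    """Toy stand-in: anything quieter than `floor` is lost (treated as silence)."""
    return [s if abs(s) >= floor else 0.0 for s in samples]


mic = [0.2, -0.1, 0.3]
# Input too low: the converter gets nothing usable, and output gain cannot bring it back.
starved = run_voice_chain(mic, input_gain=0.1, output_gain=10.0, convert=toy_converter)
# Normal input, low output: the converter sees the full signal, listeners just hear it quieter.
quiet = run_voice_chain(mic, input_gain=1.0, output_gain=0.5, convert=toy_converter)
```

In this sketch `starved` comes out as all zeros while `quiet` is the original shape at half the level, which is the "mumbly vs quiet but clear" difference in miniature.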
what are u looking for?
thank you guys very much ❤️
u 2 thank youu ❤️
you're welcome!
@coarse laurel what are u looking for
the issue is still the missing files being excluded by pyinstaller
https://github.com/Nick088Official/voice-changer/releases/tag/b2404 try this executable, try it both normally and with the flag, it shouldn't hit the policy issue anymore or fail to detect the GPU
If it doesn't work again, give logs and I might give it another shot but I'm not really a Linux ROCm expert
had to use a flag to get it working, but no GPU still
what flag? are you talking about the policy issue still?
I might try with another pytorch version with rocm6.4, i just checked and pytorch 2.7.1 doesn't have rocm6.4, i checked your message but in their pypi there's no 2.6.0 with rocm6.4, so hopefully a last try with pytorch 2.8.0+rocm6.4 doesn't break anything
Will try to make it soon, else I will just add the docker method to the docs and explain the issue
oh yeah, that's what I've been doing with docker, had to downgrade rocm as well
Any good Pre-trainer for female singers?
I remembered KLM was a thing, has the DEV stopped updating it?
Has the DEV for KLM made any statements?
use Legacy core 1.6 as of now, klm is ok but idk what the best newest one of his pretrains is
Anyone knows how to use gemini without a google account?
ok im unsure this is the right room. im trying to get Mangio-RVC-Fork-main to pick up audio from another pc, how would i do this from pc to pc? suggestions welcome, ping or reply if you have done a multi audio setup
objective: pc A to pc B, and have pc B edit the audio live
i am sort of stuck. it picks up normal audio in the teleport (obs to obs) but it does not seem like mangio (i think that's it) picks it up or edits it, and i think it should. ping me if you know / ty, im brain broken.
whats your goal
okay for what purpose
that's the purpose and the goal: pc a has the mic, pc b is the stream box
no.
what are you going to do? catfish or something?
i just need to know why or possibly how to set up the voice thing to pick up something other than "desktop"
ooo no i just need some lag
so if i crash my main pc it wont crash my stream pc or my record / lab stuff
example before update
so youre going to use it to change your voice in real time for streaming
its a yes or no question: can it pick up from a to b ?
yes
this conversation is very confusing
the point is audio separation and lag
nah i got my answer so i know its my fault now just unsure how its sending from a to b and why B can not see A's audio 2 it only picks up desktop so it starts doing a bzzzzzz and its terrible
honestly that graph is scaring me
don't they just want to use a voice changer or smth?
im a professional hacker / redteam / purple team
yes
5 bucks youre trans
then all they need is vonovox on nvidia or TG fork on AMD
they could be a femboy too
i placed my bet
no i just like to break things and im testing something thats about the end of it
i'm still trying to fix it, but is the same issue with https://docs.aihub.gg/realtime-voice-changer/local/deiteris-w-okada-fork/ ? This is what tg develop fork is based on
Last update: April 15, 2026
I think so, I've tried both before trying docker
@proven hill ty for info. while for w/e reason you are semi hostile, knowing it is possible lets me know that it's my issue, not audio to audio
im hostile because this information is usually asked for by people who catfish people
i suppose thats fair: i get the "trust me bro" its my account daily
im just trying to clean a audio scream / hiss and add lag for dc fail over
speaking of catfishing, does anyone have a model for a dommy mommy voice? For uuh reasons
im using 4 - 6 computers at the same time for stuff so i have to isolate everything
no
ask again and you won't be here very long
❤️

I need to call my fren a good boy once he gets me back to celestial in marvel rivals
me
im in my female loki phase
can someone give me the best voice mod and the settings to go with it to sound like a girl please?
do you need it in real time?
yeah
why?
why girl voices specifically
to create content, as i've been doxxed for my voice before and i don't want it to happen again
why girl voices tho, there's so much cooler stuff like Spongebob or Darth Vader or Goku
i'm known as a girl in all the servers i've already been doxxed in, and yes i'm actually female, so if i ever interact there again, the majority might think that the doxxing was false
@viral mason
I dunno man, just look through here https://discord.com/channels/1159260121998827560/1175430844685484042
i tried, but they all sound really bad
what are u looking for
and whenever i speak a full sentence it starts breaking
im not allowed to speak
also there's this kind of breathing sound in the background constantly
but trust theres place to find
my models sound great
@proven hill help please?
you smell like a catfisher
yea I don't trust this
I'll let someone else deal with this
if u have Nvidia gpu download vonovox, if u have AMD get wokada tg fork, that's all I'll say
what are you searching for anyways
just a traditional girl voice
theres plenty of options
could you please help me with them?
no because i suspect youre a catfisher
heyy everyonee quick question about mic setups. i’m a trans girl and after 8 months of voice training not really working out for me i’ve switched to using a voice changer to help me with my dysphoria and i actually found a model that sounds realistic and feels like me. i’m just trying to make the audio setup sound as natural as possible over discord etc. does anyone know if adding a tiny bit of background noise or static through voicemeeter helps mask any weird artifacts and make it sound all more natural and convincing? any tips for a natural sounding setup would be amazing 💕 thankssss
Hi! what gpu do you have (Nvidia or AMD)?
I'll gladly help with as much info as u can give ^^
The message template:help-template could not be displayed. More details can be found in the error log.
Please report this to a server admin.
@low shard
say wallahi bro
hiii thank u so much for offering to helpppp. i actually have an nvidia gpuu. what other info do u need from me? i’m a little new to the deeper audio routing stuff so any advice you have would be appreciateddd
rip, last try: https://github.com/Nick088Official/voice-changer/releases/tag/b2418
if this doesn't work in any way, I'm just going to use your Docker fix in the docs. I tried getting help from Gemini 3.1 Pro Preview and claude opus 4.6 thinking but it seems to be a rare bug, i can't really test it myself and it's niche so i don't want to waste either my time or yours. docker might just have a lil bit more overhead but at least it's guaranteed to work
if you could say exactly what gpu that might help me decide which voice changer you can use like RTX 3060 or RTX 1060 ect
-# GTX 1060*
what is a GTX
you didn't make a typo? the RTX series came after the GTX 16 series
don't you have a gtx 1660 super yourself
haven't had that for a year
forgot about it tbh
what voice changer, there is more than one
what gpu do you have(Nvidia or AMD) and what are you using the voice changer for?
and what are you wanting to use it for?
mh
that's not allowed here
<@&1159293140440723499>
kk, let me try it
i have been summoned
summoned 2 seconds earlier
unsummon yourself rq, they won't expect it
no i was there but im a slow typer and i need to use the gif as my intro
it depends on if im online or not
jokes aside all that matters is that the case has been taken care of
is the archive corrupted?
hopefully not, im downloading it, it doesn't work for u?
can't open with Arc either
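One way to tell a bad download from an extractor problem is to test the archive directly. A minimal sketch, assuming the release file is a `.zip` (the chat doesn't say which format it is, so treat that as a guess) and using only Python's standard `zipfile` module. The function name is made up.

```python
import zipfile


def check_zip(path):
    """Return (ok, detail). ok is False if path is not a zip or a member fails its CRC check."""
    if not zipfile.is_zipfile(path):
        return False, "not a zip file (download may be truncated or a different format)"
    with zipfile.ZipFile(path) as archive:
        bad_member = archive.testzip()  # name of the first corrupt member, or None
    if bad_member is not None:
        return False, f"corrupt member: {bad_member}"
    return True, "all members passed the CRC check"
```

If this reports the file as fine but the GUI extractor still refuses it, the problem is more likely on the extractor side than in the upload.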

I'm starting to think that this has been an issue since the original wokada

Because there are barely any linux amd gpu users using the program, the majority of users are on windows
Jensen Huang transferring 21 million dollars to original okada developer to not include ROCm libraries
You could try opening an issue about it but I wouldn't expect much about it, tg-develop and deiteris are both using windows iirc
they are included, the issue is PyInstaller that discards them
Jensen Huang hijacking pytorch repo to replace
if amd then pyinstaller_amd
to
#if amd then pyinstaller_amd
btw sorry for wasting our time about this, I think it's just better to give up and use the docker way atp,
I will link to your message and give credit
Btw which AI Did you use to help you out for the docker way?
no worries, happy to help, you can ping or dm me if you want to tinker more
gpt plus
I will add you to keep that in mind, but for now I will give up, if you ever find out a fix please let me know, I might delete my fork in like a week or smt if I totally give up or retry later
You might also try the original wokada and let me know if that works for you
I can make a better instruction that also includes portaudio inside of docker as well
or just send it to you, you can format it however you need
I added already a formatted section: https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/#amd-gpu-on-linux-only-showing-cpu-docker-fix
You can let me know if there's anything wrong and could dm me or PR it
Last update: April 15, 2026
what do epochs mean im a new gen i keep seeing it on voice models and dont know what it means
how do i fix lag in audio while gaming only
it's just how many times the ai saw the audio used to train that model before it sounded good to the person making it
more or less doesn't equal better or worse model
what are u using? and what gpu do u have? (Nvidia or AMD)
epochs are a unit of measuring the training cycles of the AI model
basically the amount of times the model went over its dataset and learned from it
they don't tell you how good the model is, it's just info from the model maker about how they trained it
More ≠ better
Less ≠ better
There's no way to determine how good the RVC model is until you try it out or listen to the audio samples if there are any
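If the counting helps: one epoch is one full pass over the training clips, and each pass is split into batches (steps). A small sketch with made-up numbers. Real trainers may count steps a bit differently, so this is the general idea rather than any specific tool's formula.

```python
import math


def total_steps(epochs, dataset_clips, batch_size):
    """Steps = epochs x batches per epoch, where one epoch is one full pass over the dataset."""
    batches_per_epoch = math.ceil(dataset_clips / batch_size)
    return epochs * batches_per_epoch


# Hypothetical example: 320 clips, batch size 8, trained for 100 epochs.
example = total_steps(epochs=100, dataset_clips=320, batch_size=8)
```

Nothing in that number says whether the result sounds good, it only says how long the model was shown the same data.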
!help-template
To receive assistance, you must provide your system details. Copy and paste the block below into your reply and fill it out.
⚠️ NO INFO = NO HELP
- Goal (e.g., TTS, AI Covers, Roleplay):
- Specific Issue:
- Full GPU Name:
- Operating System:
- Tutorial Link used:
• Check Docs: Many fixes are in the AI Hub Docs.
• Be Specific: Say "RTX 3060 12GB", not just "NVIDIA".
• English Only: Keep all discussions in English.
• No assistance for NSFW/Porn or ANY Illegal Activities.
• Read the [Full Guidelines](#1402790586028789830 message).
thank you my question was already answered though
appreciate you
Help, on my end I can't choose any models at all for inference in original RVC. It just doesn't show a list of models. I've installed pretrained models btw
- GPU: RTX 3060 12gb
- Operating System: Linux Mint
- Tutorial Link used: https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/blob/main/docs/en/README.en.md
are you wanting to make covers or something?
I actually have a goal for realtime voice changer
I can't help much with Linux but I can point you in the right direction
Gotcha
try Vonovox, I am unsure if it works with Linux but why not try
sure, I think original RVC just dead or smthing
rvc doesn't stand for realtime voice changer btw:3
if i have a mac w a m4 chip can i use this 😅
o h
wha
the voice changer
no what's a m4 chip
uh
ye, basically just download those two links, install vac lite (second one) by running setup64 then install driver, after that for vonovox run start
make sure to extract them
wait, is it only for windows?
um like the chip that my macbook uses
unsure tbh, but if it does only work with windows Wokada TG fork has many versions
unfortunate
<@&1159293204038955078> when u can this kind fellow needs help with the Nvidia linux wokada tg build (idk what download version it is)
while u wait tho the quickest way to look is in the guide for it
-rt
Guides for Programs that use RVC Models in Realtime for Calls/Games
A Realtime Voice Changer with similar performance to Wokada Tg-Develop Fork, with extra features, but it supports only Nvidia GPUs on Windows 10/11 unlike other options that have wider support. and without cloud options
A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project.
A Realtime Voice Changer with similar performance to Vonovox & Wokada Tg-Develop Fork, with extra features.
Deiteris' fork (modified version) of wokada that doesn't get updates anymore.
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
:o
that's
did I send the wrong thing
that isn't tg fork that is deiteris it looks like
much older than tg
hm?
I just followed this guide: /home/lf/Documents/VOICEMODELS/GeneralGrievouscvlv1.5V1_235e_9635s
🥀
tho I've never used anything but the windows version of anything so maybe for whatever reason it's different
prob not tho
yes, I just decided to try the voice changer again
I used only on windows previously
before ai hub was reborn
bruuuh
2026-05-08 05:34:20,880 INFO [OnnxLoader] Converting model to FP16...
2026-05-08 05:34:24,854 WARNING [symbolic_shape_infer] Unable to determine if n_samples <= ConvTranspose_308_o0__d2, treat as equal
2026-05-08 05:34:25,183 INFO [OnnxLoader] Done!
2026-05-08 05:34:25.690178271 [W:onnxruntime:, transformer_memcpy.cc:74 ApplyImpl] 2 Memcpy nodes are added to the graph main_graph for CUDAExecutionProvider. It might have negative impact on performance (including unable to run CUDA graph). Set session_options.log_severity_level=1 to see the detail logs before this message.
2026-05-08 05:34:25,736 INFO [PipelineGenerator] Loading index...
2026-05-08 05:34:25,737 INFO [PipelineGenerator] Try loading "model_dir/0/GeneralGrievouscvlv1.5V1.index"...
no nvidia?
I think you should prob switch back when u can
I won't
I hate windows because of it's artifacts, glitches, blue screens (now black screens) and so on
my windows pc on my 5070ti works completely fine without any bugs or errors
damn
sometimes parts of windows weren't drawn and win10 just fell into idk kernel panic? Looped last few ms of audio from yt and went into black screen spinning my fans fast
I just better not touch win except for pc clubs
that's inevitable
weirdd
I use the evil windows 11 and it seems fine
YEAH fuck both of this versions
when I built a pc with a bro we decided to install win11. Well, it was a pain in the ass because after downloading the first updates it just started lagging
we lost about a fucking hour, ruining feeling of high end pc being right there

sometimes I wish I never learned anything digital
goal = operate open notebook on my pc, issue = docker desktop runs into errors, gpu = rtx 5080 ventus, os = win 10, link = https://www.open-notebook.ai/get-started.html
holy shit it's working
please help
@viral mason yoo it's working just requires to reload page a few times xd. Built in protections I guess ruin everything
cant open open notebook
now I guess I have to figure out how to make a virtual cable, but it's probably easy to do
idk what magic they did but few years ago I couldn't use models with delay less than 128. Now I can
What u supposed to do to fix choppy audio / voice cracks with the voice changer
Just sounds kinda unnatural
what gpu do u have (Nvidia or AMD) and what are u using the voice changer for?
Nvidia
Using it for fun
like Spongebob or Goku or somethin
Yea
ok good good, you should switch to Vonovox
I have the links right here :D
Ok
Ty bro
Does this work with all the voice file things that ppl post in this server?
ye works the same as the thing u already had but wayy better
btw the second link is a virtual audio cable like VB cable but it causes less issues on windows
Ah
So i should replace the one im using rn with that one?
This the thing that i was using before btw https://youtu.be/dZ_2HELnWJU?si=qCBjcHX1AykTUrJL
o yea that's super old
yea delete the folder for the old vc and cable completely
alr night!
Thanks, I'll give legacy core 1.6 a try.
I have AMD, more specific I have RX 9070 XTX.
I would love to train locally, which one would you recommend for AMD?
Locally you can use Applio, tho I have zero idea how to install it locally the guide should be very helpful for that!
-rvc
Just look into the applio docs
@low shard I saw there's a new build, but it seems to be also corrupted, can't extract with cat or arc
Meow
can someone help me when using the voice changer egirl some words cut out
Are you looking to do ai covers, training models, e girl trolling / Catfishing or roleplay?
!help-template
!help-template
Thank you
!help-template
Are you trying to do e girl trolling? What's your PC GPU and OS?
It's better we use DMS since this channel is full,
hello im trying to create ai content for tiktok/youtube mainly short form and i dont know how to do it correctly. im trying to make a video similar to your life as a twich streamer for example but i try multiple promts and the answer is not what im asking anyone having a tutorial to help me do it correctly ?
btw i dont want to pay for any service or anything just to know
which one do i download for an amd gpu
!help-template
You need to elaborate more, you might be using old versions of a program, this isn't just a voice server, this is a general ai discord server
Problem solved
windows & linux, radeon rx 9060 xt oc 16GB: i use thoth, LM Studio, ollama, alpaca (on linux) but not one program is using my GPU, they all stay on the CPU
thoth (qwen 3, 14b) has already been thinking for 5 minutes about what to say after i said hello
LM Studio has options to use Vulkan, etc
on linux it should not even be an issue as you should be using rocm torch
yo, i installed the latest okada but whenever i put .pth and .index it does not sound like the model i'm trying to use
it sounds like peter griffin instead of butcher
you probably want to activate another model in the UI
I walked away from my PC right now, but in win for lm studio there are indeed options to choose how much my GPU should be used... It is set at 100%
But still, not even a whiff
so right now im trying to use travis scott
input device is my mic, output is VAC
maybe its the audio driver issue?
no
audio driver does not change the voice
wokada has model slots
you add a model and it takes a slot
yeah
you can stop the changer, select a different model and start it again
i know, right, i just installed the newer version
and has only 1 slot selected
the voice model is RVCv2
maybe its the cause?
let me try real quick, i'll let you know what happens next
when you're trying to read War and Peace and instead you're reading Pinocchio, it does not mean your glasses are bad, it means someone swapped the covers of the books
be logical
can someone help me with RVC voice models? All I can find here is weird fictional characters
one question
how can i use TTS
like peter griffin tts
or make the model sing a song
Tts or singing can be done easier on applio which is separate from the voice changer
-rvc
Why would you need a real person voice?
Most fictional characters have normal voices like my Poppy model
I've never seen a Linux computer before, that's interesting
i am autistic, so most of the time i don't get if people are serious or joking? :/ You really haven't messed about with linux?
I'm autistic as well, and I'm being serious I've only used windows growing up and it's cool seeing a different operating system
i really like linux (i have a dual boot win11/Bazzite), but only use win for the few games that don't run on linux...
linux helped me reviving a lot of old laptops
however, i am totally not tech intelligent, like i don't understand not even a tenth of what i should to use linux
That's fair, I really only use Windows for playing games via steam and as well as it being the most commonly used is for projects like rvc or other stuff
rvc?
It stands for retrieval-based voice conversion
Basically ai voice cloning as well as voice changer software
What with?
So long story short I wanted a voice changer and Claude code brought me here for rvc and Applio and w-okada and stuff
Now I did try rvc, it wasn’t anything spectacular, might be that the ui and all were done by Claude so there wasn't much to change to adjust the voice
It then suggested I delete it and then downloaded a fork, which it then said to delete and get Applio and w-okada
Alrighty so applio is separate from okada, as applio mainly trains models and does ai covers but can be used for real-time but okada/Vonovox does real-time as its main thing
What distro is that ?
What's the main thing you're lookin for?
bazzite
Oh cool
I want real time voice changing for vrc and Roblox
And very important question what is your GPU (Nvidia or AMD) also are u using it for sounding like a cartoon character or something like Darth Vader or Optimus prime ect
It first did suggest w-okada to me but it was in Japanese so I scrapped that
talking about voices, do you guys know a really good site/app to put a whole pdf book into an audiobook?
thanks a lot
Nvidia I want to sound like Sunday he sounds like a choir boy who is 28
You could always just tts a pdf no ?
i tried the built-in app from Edge, in windows, but it reads everything, like the page numbers, the repeated title at the top of the pages,...
that is possible, but i want to know if it possible to do whole books
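On the page numbers and repeated titles getting read aloud: one option is to clean the text before handing it to any TTS tool. A rough sketch, assuming you already have the book as a list of per-page text strings (extracting text from the PDF is left out, since that depends on the library you pick). The function name and the 0.6 ratio are arbitrary choices for illustration.

```python
import re
from collections import Counter


def clean_pages(pages, repeat_ratio=0.6):
    """Drop bare page numbers and lines repeated on most pages (running headers/footers)."""
    line_counts = Counter()
    for page in pages:
        # Count each distinct non-empty line once per page.
        line_counts.update({line.strip() for line in page.splitlines() if line.strip()})
    threshold = max(2, int(len(pages) * repeat_ratio))
    cleaned = []
    for page in pages:
        kept = []
        for line in page.splitlines():
            text = line.strip()
            if not text:
                continue
            if re.fullmatch(r"(page\s+)?\d+", text, flags=re.IGNORECASE):
                continue  # bare page number such as "12" or "Page 12"
            if line_counts[text] >= threshold:
                continue  # running title repeated across pages
            kept.append(text)
        cleaned.append(" ".join(kept))
    return "\n\n".join(cleaned)
```

The result is one long string for the whole book, so it can go to the TTS step as a single job instead of many small files. A heading that legitimately repeats on most pages would also be dropped, so skim the output before converting.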
Heard there is a better ai for singing the hmm what was its name ace xl something ?
True true btw might I ask what brings you to this server ?
Never heard of it, I only trust things I know are commonly used here, so unless it can use rvc models it's useless to me
Gotcha mb
Do you know exactly what Nvidia gpu you're using? Like RTX 3070 for example
It'll better help me pick out which of the two voice changers would work better
Rtx 5070ti ryzen 9700x and 64 GB 6000mhz cl30 ram
W
One second while I get the links
K gotcha
i want to learn more about AI, and experiment with it. Like i finally succeeded in running a local LLM (however, i don't think they are as good (yet?) as the online ones)
True I mean the online ai are whole corporates with billions in hardware it’s honestly astonishing for me that we can even run some llms in our own hardware
Here's the two downloads you'll need, I recommend installing the second one first as it's required so ppl can hear u in game with the voice changer
https://huggingface.co/dr87/vonovox/resolve/c8034f5f6d50648a8109bb4f847182362e2b779b/Vonovox_beta_17_11.zip
I gotta piss brb
Nature call lmao
Could you like umm tell me what is what and do I need to remove anything also have you used it ?
but for my studies, i really find it more helpful to create audio files, but instead of creating many small files, it would be nice to be able to create one large
I'd delete any older okada software tbh so it doesn't interfere
First link is Vonovox which is the voice changer
Second link is a virtual audio cable called vac lite which connects the voice changer to games or discord ect
I think notebook LLM does that I’m not sure tho because I don’t study :p
I think I have rvc and Applio
And virtual cable and vb cable
And voice mod
Ah gotcha I already have the second link done and dusted, now this vonovox thing, is it a fork of w-okada
You here to do any projects ?
U can keep those last two and applio
not yet
Voicemod can be helpful with adding voice effects to models that may need them to sound better
Tho I use fl studio for that
It's basically okada but at its best
I was thinking of doing an automated YouTube short uploader and creating anything but now that YouTube has changed its policies and algo I am switching to ph
My my thanks imma set it up and let ya know
You use it for sound board ?
For voice effects on models live in vrchat, as well as for sound design
i have an idea about a (story based) game that i want to create, and i would need AI to interact within certain parameters
Only use voicemod for the soundboard lol
Uh you mean like how the models speak and stuff when they join the world tbh I just got my pc a week ago and new to pc vrc
Gotcha
So I don't need the fl thing rn
Btw you do assets in vrc ?
My my I yap a lot
Not at all sadly, just into sound design like making creature noises for games ect or make a custom effect on a guy's voice to make it sound like he's speaking over the radio
That also goes hand in hand sorta with making voice models
can i add you
Of course
Nah not unless you want the complicated setup I use
hihi lowkey i wanna create a rvc myself, but idk where to start, beginner here, i'm used to asking people but now i wanna make my own 
imma first make things work then go to the big leagues
like training your own model ?
Yeah
mind if i ask what you are trying to train it for
for tts or vtv 
vtv is what i'm aiming for
umm whats vtv ?
voice to voice
i mean whose voice mb i should have framed it better
You got it!
Ye like the voice changer?
If you wanna learn how to train a model I got u

