#Applio AMD


obtuse gazelle
#

Hello! I'm trying to install and run Applio with a Radeon RX 7800 XT. I followed the official documentation for AMD, but when I launch Applio, I get this error: UserWarning: CUDA initialization: CUDA unknown error - this may be due to an incorrectly set up environment, e.g. changing env variable CUDA_VISIBLE_DEVICES after program start. Setting the available devices to be zero. (Triggered internally at ..\c10\cuda\CUDAFunctions.cpp:108.)

magic shadow
obtuse gazelle
#

which one should I use ?

magic shadow
#

well, you probably messed up something in the applio guide

#

anyway, that one does not work with the latest adrenalin

#

grab the source code and unzip it into C:\Applio

#

if C:\Users\username\Miniconda3 exists, delete it

#

run env\python to verify version 3.11 was installed

#

download these files into C:\Applio

#

open command prompt cmd.exe in C:\Applio

#

run env\python -m pip install torch-2.7.0a0+rocm_git3f903c3-cp311-cp311-win_amd64.whl

#

and env\python -m pip install torchaudio-2.7.0a0+52638ef-cp311-cp311-win_amd64.whl

#

open rvc\train\train.py and add a line torch.backends.cudnn.enabled = False like this

#

benchmark should be set to False as well
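
For reference, the two settings named above are ordinary attributes on `torch.backends.cudnn`. This is only a sketch of what the edit amounts to: the helper name `disable_cudnn` is made up for illustration, and where exactly the lines belong in `rvc\train\train.py` is not shown in this thread. A stand-in object is used so the snippet runs without PyTorch.

```python
from types import SimpleNamespace


def disable_cudnn(torch_module):
    """Turn off cuDNN and its autotuner on the given torch-like module."""
    torch_module.backends.cudnn.enabled = False
    torch_module.backends.cudnn.benchmark = False
    return torch_module


# Stand-in object so the sketch runs without PyTorch installed;
# in the real training script you would pass the imported `torch` module.
fake_torch = SimpleNamespace(
    backends=SimpleNamespace(cudnn=SimpleNamespace(enabled=True, benchmark=True))
)
disable_cudnn(fake_torch)
print(fake_torch.backends.cudnn.enabled, fake_torch.backends.cudnn.benchmark)
```

With the real module this is equivalent to writing `torch.backends.cudnn.enabled = False` and `torch.backends.cudnn.benchmark = False` directly.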

#

after that, use run-applio.bat

obtuse gazelle
magic shadow
obtuse gazelle
#

oh wow

#

It worked I think !!

#

My gpu is shown

magic shadow
#

lmk if anything explodes

obtuse gazelle
#

xd I'll, thanks a lot !

obtuse gazelle
#

@magic shadow I started training and the gpu uses 100% idk if its normal

magic shadow
#

yes, it is using gpu

obtuse gazelle
#

ok thanks 🥲

magic shadow
#

dont use default high batch size

obtuse gazelle
magic shadow
#

😕

obtuse gazelle
#

XD

magic shadow
#

4-8

obtuse gazelle
#

I looked on the discord and ppl said more shouldnt be an issue

#

so idk

#

should i restart at 8?

magic shadow
#

17 min at ~20 slices/min = 340 slices... ideally you want no fewer than 50 steps/epoch
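
The arithmetic behind that advice can be sketched as follows. The ~20 slices per minute figure and the 50 steps/epoch guideline come from the message above; the assumption that a final partial batch is dropped (integer division) is mine and may not match what Applio does.

```python
def steps_per_epoch(num_slices, batch_size):
    """Optimizer steps per epoch, assuming a final partial batch is dropped."""
    return num_slices // batch_size


minutes = 17
slices_per_minute = 20  # rough figure quoted in the thread
num_slices = minutes * slices_per_minute  # 340

for batch_size in (4, 6, 8, 12):
    steps = steps_per_epoch(num_slices, batch_size)
    verdict = "ok" if steps >= 50 else "below the ~50 steps/epoch guideline"
    print(f"batch size {batch_size:>2}: {steps} steps/epoch ({verdict})")
```

For this dataset, batch sizes 4 and 6 give 85 and 56 steps per epoch, while 8 gives 42, which falls under the guideline.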

obtuse gazelle
#

mh ok so what should I do ?

#

sorry m new

magic shadow
#

try 4, or 6

#

see which results in a better sounding model

#

there's no rule set in stone what you should use

#

every dataset is different

obtuse gazelle
#

ok I'll try 4, thanks !

obtuse gazelle
magic shadow
#

uncheck both

#

dont bother with old loss charts, use new avg_50

#

show all the charts

obtuse gazelle
magic shadow
#

not like this

#

collapse grads, expand loss_avg_50

#

like this

#

your kl chart is unusually high

obtuse gazelle
obtuse gazelle
magic shadow
#

click this under each chart

obtuse gazelle
magic shadow
#

anyway, I'd not expect anything good from this.. give a model from ~4.5k steps a test

#

it seems that you've resumed it with a different batch size

obtuse gazelle
#

bcs I did not

magic shadow
#

or stopped after a save + some steps, then resumed

#

this step back on the chart

obtuse gazelle
#

oh yes

#

I did

#

Should I remake it ?

magic shadow
#

what kind of voice is it in the dataset?

#

kl is supposed to go down under 1 pretty quickly

obtuse gazelle
magic shadow
#

may wanna figure out what's wrong with the dataset

obtuse gazelle
#

ok I'll check that tomorrow

#

Thanks for your help !

mint bough
lost storm
#

Hi

obtuse gazelle
#

Can I have an example of what the graphs should look like?

obtuse gazelle
#

maybe I'll try with another dataset

magic shadow
#

and converging at some value

#

fm chart going down or staying more or less flat

#

but usually it goes up... as long as it does not go more than 1/10k steps it is usually fine

#

example

#

as for norm g, as long as it does not shoot over 1k it is fine

obtuse gazelle
#

okay thanks !

#

So if it's not like that, it's bcs of the dataset?

magic shadow
#

or batch size

obtuse gazelle
#

smoothing is at 0 btw

magic shadow
#

something went wrong

obtuse gazelle
#

seems like it

#

I don't really understand what I'm doing wrong

#

that confuses me

magic shadow
#

it could be the beta torch wheel

#

it is still experimental stuff

obtuse gazelle
#

oh :/

magic shadow
#

unfortunately the working method with Zluda requires a driver rollback to 25.4.1 or 25.3.1

obtuse gazelle
#

I can try that

magic shadow
#

AMD cleanup utility to remove the current driver

#

then just follow the guide

obtuse gazelle
#

ok so I download applio again ?

magic shadow
#

hip sdk 6.1 or 5.7

#

no, you can keep current

#

you'll uninstall torch and other stuff if you follow it

obtuse gazelle
#

okok thanks

#

I'll try

obtuse gazelle
#

everything seems to work

#

I'll train a new one

obtuse gazelle
magic shadow
#

make sure the driver did not automatically update to the latest

magic shadow
#

🤷‍♂️

mint bough
#

latest version of applio doesnt have that issue unless you messed up the code

mint bough
obtuse gazelle
#

I'll download it again and retry

obtuse gazelle
mint bough
#

that issue looks like the one in the old mangio RVC

obtuse gazelle
# mint bough is it all NaN loss values in the terminal window?

test | epoch=2 | step=206 | time=13:29:10 | training_speed=0:03:25 | lowest_value=34.37 (epoch 1 and step 7) | Number of epochs remaining for overtraining: g/total: 50 d/total: 100 | smoothed_loss_gen=34.370 | smoothed_loss_disc=nan
Saved model 'C:\Applio-3.2.9\logs\test\test_2e_206s_best_epoch.pth' (epoch 2 and step 206)
New best epoch 3 with smoothed loss_g 34.370 and loss_d nan
test | epoch=3 | step=309 | time=13:32:34 | training_speed=0:03:23 | lowest_value=34.37 (epoch 1 and step 7) | Number of epochs remaining for overtraining: g/total: 50 d/total: 100 | smoothed_loss_gen=34.370 | smoothed_loss_disc=nan
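
A quick way to answer the "is it all NaN?" question for a longer log is to scan it for the `smoothed_loss_gen=` / `smoothed_loss_disc=` fields seen in the output above. This is a minimal sketch; the function name `find_nan_losses` is hypothetical, and the field names are taken only from the lines pasted here.

```python
import math
import re

# Matches e.g. "smoothed_loss_disc=nan" or "smoothed_loss_gen=34.370".
LOSS_PATTERN = re.compile(r"smoothed_loss_(gen|disc)=([^\s|]+)")


def find_nan_losses(log_text):
    """Return (loss_name, raw_value) pairs whose value parses as NaN."""
    bad = []
    for name, raw in LOSS_PATTERN.findall(log_text):
        try:
            value = float(raw)
        except ValueError:
            continue  # not a number at all; ignore it
        if math.isnan(value):
            bad.append((name, raw))
    return bad


sample = (
    "test | epoch=2 | step=206 | smoothed_loss_gen=34.370 | smoothed_loss_disc=nan\n"
    "test | epoch=3 | step=309 | smoothed_loss_gen=34.370 | smoothed_loss_disc=nan\n"
)
print(find_nan_losses(sample))
```

In the pasted log only the discriminator loss is NaN; the generator loss is a finite 34.370 on both lines.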

mint bough
obtuse gazelle
#

:(

#

I'm cursed

obtuse gazelle
magic shadow
#

I know it works fine

obtuse gazelle
#

I have 6.1.2

obtuse gazelle
magic shadow
obtuse gazelle
#

So I should go back to latest drivers ?

magic shadow
#

yes

#

amd cleanup, reboot, install 25.6.1

#

reboot a few times

obtuse gazelle
#

okay thanks !

obtuse gazelle
# magic shadow

in the patch file :
C:\Applio-3.2.9>rmdir /S /q zluda
The system cannot find the file specified.

#

idk if it's fine

#

maybe rm -rf zluda ?

magic shadow
#

if there's no zluda folder it is fine

#

as long as it does the rest it is fine

obtuse gazelle
#

okok perfect

obtuse gazelle
#

it seems to work

obtuse gazelle
#

thank you a lot !!

obtuse gazelle
#

Should I use another Embedder Model because I want to train a French model ?

magic shadow
#
only as an experiment to see whether it helps or not
obtuse gazelle
magic shadow
#

there's new experimental spin that may improve some things

#

but you need a spin-trained pretrain as well

obtuse gazelle
magic shadow
#

then use default one 🙂

obtuse gazelle
#

so I guessed it was because of that

obtuse gazelle
magic shadow
#

did you use a pretrain?

obtuse gazelle
obtuse gazelle
magic shadow
# obtuse gazelle I don't think

if you did uncheck the [x] Pretrained setting for some reason, that may explain your results. Or if you're trying to run inference using a custom embedder.

obtuse gazelle
#

only custom pretrained was unchecked

#

and no I didn't use a custom embedder

obtuse gazelle
#

hi ! sorry I was not at home so I didn’t try anything but what should I do ? Train again w the same settings ?

magic shadow
#

can you run tensorboard and show what the charts look like for the model you trained?

magic shadow
obtuse gazelle
magic shadow
#

@obtuse gazelle lmk when you're online, I got a test to run

obtuse gazelle
#

sorry I was on vacation

obtuse gazelle
magic shadow
obtuse gazelle
#

and yes 7800xt

magic shadow
obtuse gazelle
#

what should I do now?

magic shadow
#

copy the output

obtuse gazelle
#

this ?
C:\Applio-3.2.9>zluda\zluda.exe -- env\python.exe bench.py Using cuda torch.float32 Compilation is in progress. Please wait... linear : 0.0587s conv1d 192x192x1 : 0.2302s Compilation is in progress. Please wait... conv1d 192x768x3 : 0.3445s conv1d 768x768x1 : 1.1695s Compilation is in progress. Please wait... Compilation is in progress. Please wait... up_0 : 0.4844s up_1 : 0.6375s up_2 : 0.4737s up_3 : 0.4587s dn_0 : 0.2608s dn_1 : 0.2592s dn_2 : 0.2634s dn_3 : 0.2252s res1a : 0.4191s res1b : 17.1694s res1c : 17.3982s res2a : 0.9267s res2b : 17.1960s res2c : 17.7140s res3a : 1.4904s res3b : 17.5465s res3c : 17.7366s conv_post : 0.3962s torch.float16 linear : 0.0472s conv1d 192x192x1 : 0.1530s conv1d 192x768x3 : 0.2621s conv1d 768x768x1 : 0.2638s Compilation is in progress. Please wait... Compilation is in progress. Please wait... Compilation is in progress. Please wait... up_0 : 0.4987s up_1 : 0.5005s up_2 : 0.4999s up_3 : 0.4968s dn_0 : 0.2553s dn_1 : 0.2583s dn_2 : 0.2593s dn_3 : 0.2271s res1a : 0.3422s res1b : 17.2472s res1c : 17.3245s res2a : 0.3494s res2b : 17.6717s res2c : 17.2763s res3a : 0.3481s res3b : 17.7725s res3c : 17.9646s conv_post : 0.1600s torch.bfloat16

#

linear : 0.0598s conv1d 192x192x1 : 0.1472s conv1d 192x768x3 : 0.2555s conv1d 768x768x1 : 0.2573s up_0 : 0.4465s up_1 : 0.4557s up_2 : 0.4575s up_3 : 0.4576s dn_0 : 0.2578s dn_1 : 0.2560s dn_2 : 0.2563s dn_3 : 0.2247s res1a : 0.2942s res1b : 17.7791s res1c : 17.8637s res2a : 0.3042s res2b : 17.7352s res2c : 16.6792s res3a : 0.3150s res3b : 16.5893s res3c : 17.7460s conv_post : 0.1592s

magic shadow
#

ah, run one more time so compilation times are not included

obtuse gazelle
#

okok

magic shadow
#

but anyway, I see the same problem

#

instead of 0.1

obtuse gazelle
#

torch.bfloat16
linear : 0.0232s
conv1d 192x192x1 : 0.1586s
conv1d 192x768x3 : 0.2567s
conv1d 768x768x1 : 0.2564s
up_0 : 0.4270s
up_1 : 0.4556s
up_2 : 0.4543s
up_3 : 0.4557s
dn_0 : 0.2546s
dn_1 : 0.2567s
dn_2 : 0.2561s
dn_3 : 0.2279s
res1a : 0.3000s
res1b : 17.4343s
res1c : 17.4720s
res2a : 0.3019s
res2b : 17.6095s
res2c : 17.9372s
res3a : 0.3191s
res3b : 17.7246s
res3c : 17.8517s
conv_post : 0.1583s

#

yeah same issue :/

obtuse gazelle
magic shadow
#

download it

#

unzip

#

then move all the folders into C:\program files\amd\rocm\6.2

#

overwriting when asked

obtuse gazelle
#

ok done

magic shadow
#

then in rvc\lib\zluda.py add 1==2 and
if 1==2 and torch.cuda.is_available()

#

err

#

hold on

obtuse gazelle
#

oh

magic shadow
obtuse gazelle
#

okay

magic shadow
#

run the bench

obtuse gazelle
magic shadow
#

no, just add 1==2 and after if

obtuse gazelle
#

okay

magic shadow
#

to disable this check and run with cudnn

obtuse gazelle
#

it's compiling again

#

Compilation is in progress. Please wait...
Traceback (most recent call last):
  File "C:\Applio-3.2.9\bench.py", line 75, in <module>
    t = benchmark_op(layer.to(dtype), x.to(dtype))
  File "C:\Applio-3.2.9\bench.py", line 23, in benchmark_op
    _ = op(x)
  File "C:\Applio-3.2.9\env\lib\site-packages\torch\nn\modules\module.py", line 1751, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "C:\Applio-3.2.9\env\lib\site-packages\torch\nn\modules\module.py", line 1762, in _call_impl
    return forward_call(*args, **kwargs)
  File "C:\Applio-3.2.9\env\lib\site-packages\torch\nn\modules\conv.py", line 974, in forward
    return F.conv_transpose1d(
RuntimeError: GET was unable to find an engine to execute this computation

magic shadow
#

goddamit

obtuse gazelle
#

🤣

magic shadow
#

I've seen this before

obtuse gazelle
magic shadow
#

I mean I could not fix it

#

well

obtuse gazelle
#

oh

magic shadow
#

there's this option though

obtuse gazelle
#

so what should I do

magic shadow
#

download the HIP SDK develop

#

it contains 6.5 folder

#

that goes to program files/amd/rocm

#

next to 6.2

#

change environment variables Path entry to point to 6.5/bin

#

unzip to C:\

#

it makes Applio-main folder, run run-install.bat

#

while it is installing download torch and torchaudio wheels from the link above into Applio's folder

magic shadow
#

new applio-main

obtuse gazelle
#

okok

#

while it's installing?

magic shadow
#

while run-install.bat is running and doing its stuff

#

so you dont sit and stare at it for 5 minutes

obtuse gazelle
#

okok

magic shadow
#

so run-install is done and you've downloaded the .whl files?

obtuse gazelle
#

yes

magic shadow
#

and unzipped hip sdk develop into rocm folder

obtuse gazelle
#

and changed the path

magic shadow
#

okay, open cmd in Applio-main

#

env\python -m pip install torch-, press tab to auto-complete, enter

#

env\python -m pip install torchaudio-, press tab to auto-complete, enter

#

env\python bench.py

obtuse gazelle
#

ok so I move the bench.py in the new applio folder

magic shadow
#

yes, copy it over

#

if it runs and does not blow up and numbers are not in 5s+ it should be good
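
To apply the "not in 5s+" rule without reading every number by eye, the bench output can be filtered with a few lines of Python. This sketch assumes `bench.py` prints one `name : seconds` entry per line, as in the results pasted in this thread; the helper name `slow_ops` is hypothetical.

```python
import re

# One bench entry per line, e.g. "res1b : 17.4343s" or "conv1d 192x192x1 : 0.1586s".
ENTRY = re.compile(r"\s*(.+?)\s*:\s*(\d+(?:\.\d+)?)s\s*")


def slow_ops(bench_text, threshold_s=5.0):
    """Return {op_name: seconds} for every op slower than threshold_s."""
    slow = {}
    for line in bench_text.splitlines():
        match = ENTRY.fullmatch(line)
        if match and float(match.group(2)) > threshold_s:
            slow[match.group(1)] = float(match.group(2))
    return slow


sample = """torch.bfloat16
linear : 0.0232s
conv1d 192x192x1 : 0.1586s
res1a : 0.3000s
res1b : 17.4343s
res1c : 17.4720s
conv_post : 0.1583s
"""
print(slow_ops(sample))
```

Lines that are not timing entries (the dtype header, compilation notices, MIOpen warnings) simply do not match and are skipped.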

obtuse gazelle
#

xd

#

C:\Applio-main>env\python bench.py
Traceback (most recent call last):
  File "C:\Applio-main\bench.py", line 1, in <module>
    import torch
  File "C:\Applio-main\env\Lib\site-packages\torch\__init__.py", line 274, in <module>
    _load_dll_libraries()
  File "C:\Applio-main\env\Lib\site-packages\torch\__init__.py", line 257, in _load_dll_libraries
    raise err
OSError: [WinError 127] The specified procedure could not be found. Error loading "C:\Applio-main\env\Lib\site-packages\torch\lib\torch_hip.dll" or one of its dependencies.

magic shadow
#

run set

#

show screenshot

#

it seems the path did not take

obtuse gazelle
#

it did

magic shadow
#

these red flagged files should be in 6.5/bin

obtuse gazelle
#

that's all I got

magic shadow
#

go to bin

#

they should be in Program Files/AMD/ROCm/6.5/bin

obtuse gazelle
#

they are mb

magic shadow
#

could you run where hipconfig

obtuse gazelle
#

in the bin?

magic shadow
#

no, from a cmd window

obtuse gazelle
#

okok

magic shadow
#

a new cmd window in applio-main

obtuse gazelle
#

C:\Program Files\AMD\ROCm\6.5\bin\hipconfig.bat
C:\Program Files\AMD\ROCm\6.5\bin\hipconfig.exe

magic shadow
#

okay, now env\python bench.py

#

(assuming you did open it in applio-main)

obtuse gazelle
#

xd no I did it in a new cmd

#

C:\Applio-main>env\python bench.py
Traceback (most recent call last):
  File "C:\Applio-main\bench.py", line 1, in <module>
    import torch
  File "C:\Applio-main\env\Lib\site-packages\torch\__init__.py", line 274, in <module>
    _load_dll_libraries()
  File "C:\Applio-main\env\Lib\site-packages\torch\__init__.py", line 257, in _load_dll_libraries
    raise err
OSError: [WinError 127] The specified procedure could not be found. Error loading "C:\Applio-main\env\Lib\site-packages\torch\lib\torch_hip.dll" or one of its dependencies.

#

same error

magic shadow
#

well, open a new cmd in applio-main

obtuse gazelle
#

okok

#

same

magic shadow
#

oh boy

#

big guns now

obtuse gazelle
#

🥺

magic shadow
#

download anywhere, unzip, run dependencies gui

obtuse gazelle
#

done

magic shadow
#

file, open, C:\Applio-main\env\Lib\site-packages\torch\lib\torch_hip.dll

#

screenshot

obtuse gazelle
#

all are incorrect in 6.5

magic shadow
#

no, the top left corner

obtuse gazelle
#

oh

magic shadow
#

like this

obtuse gazelle
magic shadow
#

at the bottom, click the left column header

#

are there any like this?

obtuse gazelle
#

no

#

I have 0 question marks

magic shadow
#

weirder and weirder

obtuse gazelle
#

this is crazy 🤣

magic shadow
#

try expanding those files at the top

#

click >

#

see if anything red shows under them

#

you can also try opening C:\Applio-main\env\Lib\site-packages\torch\__init__.py in notepad, find 0x0000 and change it to 0x0001

#

prev_error_mode = kernel32.SetErrorMode(0x0001)

#

oops, change it to 0x0000

#

save, then run the bench again

#

it should pop up a window to say what it could not find

obtuse gazelle
magic shadow
#

i did not say expand all files, just 1st level

obtuse gazelle
#

oh nothing on 1st level

#

no red

magic shadow
#

okay, lets try the init edit

#

change prev_error_mode = kernel32.SetErrorMode(0x0001) to prev_error_mode = kernel32.SetErrorMode(0x0000)

obtuse gazelle
magic shadow
#

okay

#

could you try setting HIP_PATH env variable

#

and point it to program files/amd/rocm/6.5

#

(not bin)

#

open new cmd after that and run the bench again
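
The two environment settings discussed here (`HIP_PATH` pointing at the ROCm version folder, and that folder's `bin` on `Path`) can be sanity-checked with a short script. This is a sketch only: the function name `check_hip_env` and the exact checks are my own, not something from Applio or the HIP SDK.

```python
import os
from pathlib import PureWindowsPath


def check_hip_env(env):
    """Return a list of problems found in a Windows-style environment mapping."""
    env = {key.upper(): value for key, value in env.items()}  # Windows names are case-insensitive
    problems = []
    hip_path = env.get("HIP_PATH", "")
    if not hip_path:
        problems.append("HIP_PATH is not set")
    elif PureWindowsPath(hip_path).name.lower() == "bin":
        problems.append("HIP_PATH should point at the ROCm version folder, not bin")
    else:
        expected_bin = PureWindowsPath(hip_path) / "bin"
        entries = [PureWindowsPath(p) for p in env.get("PATH", "").split(";") if p]
        if expected_bin not in entries:
            problems.append(f"{expected_bin} is not on PATH")
    return problems


# Check the environment of the current process (open a new cmd window first,
# since already-open windows keep the old variables).
print(check_hip_env(dict(os.environ)))
```

An empty list means both settings look consistent; it does not prove the DLLs in that folder are the right ones.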

obtuse gazelle
#

its working

#

finally

magic shadow
#

how are the numbers?

obtuse gazelle
#

xd

#

res1a : 0.2208s MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver <GemmFwdRest>, workspace required: 1228800, provided ptr: 0000000000000000 size: 0 MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver <GemmFwdRest>, workspace required: 1228800, provided ptr: 0000000000000000 size: 0 res1b : 9.5653s MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver <GemmFwdRest>, workspace required: 1228800, provided ptr: 0000000000000000 size: 0 MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver <GemmFwdRest>, workspace required: 1228800, provided ptr: 0000000000000000 size: 0 res1c : 9.5043s res2a : 0.5534s MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver <GemmFwdRest>, workspace required: 2867200, provided ptr: 0000000000000000 size: 0 MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver <GemmFwdRest>, workspace required: 2867200, provided ptr: 0000000000000000 size: 0 res2b : 21.4636s MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver <GemmFwdRest>, workspace required: 2867200, provided ptr: 0000000000000000 size: 0 MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver <GemmFwdRest>, workspace required: 2867200, provided ptr: 0000000000000000 size: 0

magic shadow
#

could you do env\python bench.py > 7800xt.txt

#

and upload the file

obtuse gazelle
magic shadow
#

omfg

#

it is even worse

obtuse gazelle
#

xddd

magic shadow
#

go back to zluda lol

#

change the env variables back to 6.2

#

and delete 6.5, that's 15GB of junk

obtuse gazelle
#

😃

#

done

obtuse gazelle
magic shadow
#

not yet

#

they need to fix this MIOpen thing

obtuse gazelle
#

oh 🙁

#

i'll wait

magic shadow
#

other option is to try the nightly zluda build

#

but again it is another experimental thing

obtuse gazelle
#

xd

#

we can try

#

I like trying

magic shadow
#

this unzips into rocm/6.2 folder

#

it should overwrite some stuff

#

you know what

#

lets not do that

obtuse gazelle
#

oh

magic shadow
#

i wanna do something better with my friday afternoon

obtuse gazelle
#

xdd

#

np !

magic shadow
#

you'll need to re-do the 6.2.4 install, but use this file instead

#

edit patch zluda and put this file instead of the regular zluda rocm6

#

if you dont want to ruin your current install, unzip the compiled version of applio somewhere else again

obtuse gazelle
#

but answer when u can dw !! I don't want to ruin ur afternoon

magic shadow
#

patch zluda 62.bat has curl -s -L https://github.com/lshqqytiger/ZLUDA/releases/download/rel.5e717459179dc272b7d7d23391f0fad66c7459cf/ZLUDA-windows-rocm6-amd64.zip > zluda.zip

#

replace it with the nightly

obtuse gazelle
#

oh so i change the link w the nightly in the .bat

#

Should i test w bench.bat?

magic shadow
#

did you uninstall/reinstall torch cu118?

obtuse gazelle
#

I did

magic shadow
#

then patched it using updated nightly zluda?

obtuse gazelle
#

yes

magic shadow
#

and hip extensions unzipped to the rocm/6.2 folder with overwriting some of the stuff there?

obtuse gazelle
#

yes

magic shadow
#

okay, you can try running the bench

#

one with 1==2 and one without

#

to see if that makes a difference

obtuse gazelle
#

zluda was extracted in main and not in a folder

#

C:\Applio-3.2.9>zluda\zluda.exe -- env\python.exe bench.py
The system cannot find the path specified.

#

so it says that

magic shadow
#

ah.. no, windows defender does not like nightly

#

or firefox

obtuse gazelle
#

defender I think

magic shadow
#

yes, for some reason it does not have zluda folder in it

#

if you ran the patch already, you need to re-do torch again

#

since an important dll was moved

obtuse gazelle
#

okok

magic shadow
#

okay, so make sure the content of nightly is manually unzipped into zluda folder

#

in patch zluda delete lines

curl -s -L https://github.com/lshqqytiger/ZLUDA/releases/download/rel.5e717459179dc272b7d7d23391f0fad66c7459cf/ZLUDA-windows-rocm6-amd64.zip > zluda.zip
tar -xf zluda.zip
del zluda.zip
obtuse gazelle
#

well well well

#

torch.float16
linear : 0.0290s
conv1d 192x192x1 : 0.2268s
conv1d 192x768x3 : 0.2659s
conv1d 768x768x1 : 0.2734s
up_0 : 0.5102s
up_1 : 0.5176s
up_2 : 0.5192s
up_3 : 0.5293s
dn_0 : 0.2644s
dn_1 : 0.2683s
dn_2 : 0.2643s
dn_3 : 0.2320s
res1a : 0.3522s
res1b : 17.8821s
res1c : 17.8233s
res2a : 0.3527s
res2b : 18.0610s
res2c : 18.2808s
res3a : 0.3554s
res3b : 18.1599s
res3c : 18.2609s
conv_post : 0.1569s

magic shadow
#

this is with 1==2?

obtuse gazelle
#

without

magic shadow
#

that's cudnn disabled

#

so as expected it uses a slow method

obtuse gazelle
#

okok

#

🙁

magic shadow
#

so?

obtuse gazelle
#

torch.float16
linear : 0.0693s
conv1d 192x192x1 : 0.1666s
conv1d 192x768x3 : 0.2663s
conv1d 768x768x1 : 0.2680s
up_0 : 0.5272s
up_1 : 0.5275s
up_2 : 0.5016s
up_3 : 0.5246s
dn_0 : 0.2676s
dn_1 : 0.2680s
dn_2 : 0.2690s
dn_3 : 0.2374s
res1a : 0.3558s
res1b : 17.8415s
res1c : 18.1317s
res2a : 0.3602s
res2b : 18.1202s
res2c : 18.1842s
res3a : 0.3532s
res3b : 18.2191s
res3c : 18.3551s
conv_post : 0.1692s

magic shadow
#

ffs

obtuse gazelle
#

😕

#

at least we tried

obtuse gazelle
#

@magic shadow nothing new?

magic shadow
#

nope, also they have some issue with nightly wheels

obtuse gazelle
#

wow

#

I'll wait

magic shadow
#

latest wheels are
torch-2.9.0a0+rocm7.0.0rc20250826-cp311-cp311-win_amd64.whl
torchaudio-2.8.0a0+rocm7.0.0rc20250826-cp311-cp311-win_amd64.whl

#

from a week ago

magic shadow
obtuse gazelle
#

oh nice

#

what do I need to do ?

magic shadow
#

env\python -m pip install torch torchaudio --index-url https://rocm.nightlies.amd.com/v2/gfx110X-dgpu --upgrade

#

from applio folder

obtuse gazelle
#

from a fresh applio ?

magic shadow
#

yeah, grab v3.4.0 while you're at it

#

@obtuse gazelle

obtuse gazelle
magic shadow
#

works for me

#

if you get 'requirement already satisfied', try env\python -m pip uninstall torch torchaudio torchvision first
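
The uninstall-then-install order described above can be written down as two pip invocations. The sketch below only builds and prints the command lines; the helper name `build_reinstall_commands` is hypothetical, and the `-y` flag (skip pip's confirmation prompt) is my addition rather than something from the thread.

```python
import sys


def build_reinstall_commands(index_url, python_exe=sys.executable):
    """Return [uninstall_cmd, install_cmd] as argument lists for the given interpreter."""
    uninstall = [python_exe, "-m", "pip", "uninstall", "-y",
                 "torch", "torchaudio", "torchvision"]
    install = [python_exe, "-m", "pip", "install", "torch", "torchaudio",
               "--index-url", index_url, "--upgrade"]
    return [uninstall, install]


commands = build_reinstall_commands(
    "https://rocm.nightlies.amd.com/v2/gfx110X-dgpu",
    python_exe=r"env\python",  # Applio's bundled interpreter, run from the Applio folder
)
for command in commands:
    print(" ".join(command))
```

To actually run them, each list can be passed to `subprocess.run(command, check=True)` from a prompt opened in the Applio folder.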

obtuse gazelle
magic shadow
#

but i would not expect much.. they have not updated my ticket with any resolution

#

Applio should work without (or with very minimal changes), they are still fixing other things

obtuse gazelle
# magic shadow you can give it a try

Failed to load amdhip64.dll: amdhip64.dll: Can't open: The specified module could not be found. (0x7E)
[ERROR] amdgpu-arch failed with return code 1 [stderr]

magic shadow
#

bigger error

obtuse gazelle
#

in the 3.4.0 file, there is no more amd installation?

magic shadow
#

??

#

you download the compiled v3.4.0, unzip, uninstall torch torchaudio torchvision

#

install torch from nightly

#

that should be it

#

after that you can run the bench

obtuse gazelle
#

y

#

same error when I try to run applio

magic shadow
#

show the full error

#

have you used dependency checker before?

obtuse gazelle
obtuse gazelle
magic shadow
#

you can check this file

#

but most likely it will be the .dll I gave you

#

just save it either in the same folder as shm.dll or into windows/system32

obtuse gazelle
#

ok it worked

magic shadow
#

also

#

check windows/system32 folder

#

there should be a file named amdhip64_number.dll

#

6 or 7

#

make a copy of it without _number

#

that should fix
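
The copy step above can also be scripted. This is a rough sketch under a few assumptions of mine: the helper name `copy_unversioned_hip_dll` is made up, it picks the highest-numbered `amdhip64_<n>.dll` if several exist, and it refuses to overwrite an existing `amdhip64.dll`. The demo runs against a temporary folder; pointing it at `C:\Windows\System32` would need an elevated prompt.

```python
import re
import shutil
import tempfile
from pathlib import Path

VERSIONED = re.compile(r"amdhip64_(\d+)\.dll", re.IGNORECASE)


def copy_unversioned_hip_dll(folder):
    """Copy the highest-numbered amdhip64_<n>.dll in folder to amdhip64.dll.

    Returns the destination path, or None if nothing was copied.
    """
    folder = Path(folder)
    target = folder / "amdhip64.dll"
    if target.exists():
        return None  # never overwrite an existing file
    candidates = []
    for entry in folder.iterdir():
        match = VERSIONED.fullmatch(entry.name)
        if match:
            candidates.append((int(match.group(1)), entry))
    if not candidates:
        return None
    _, newest = max(candidates, key=lambda item: item[0])
    shutil.copy2(newest, target)
    return target


# Demo in a throwaway folder with a dummy file standing in for the real DLL.
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "amdhip64_6.dll").write_bytes(b"demo")
    print(copy_unversioned_hip_dll(tmp))
```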

#

as I see from the bench screenshot they have not fixed the res b/c yet

#

9 seconds is 10x slower than expected

#

but anyway, give training a try

#

may need to fix a few things manually

obtuse gazelle
magic shadow
#

c:\windows\system32

obtuse gazelle
#

so I copy it in system 32?

magic shadow
#

if training explodes, delete
```
dist.init_process_group(
    backend="gloo" if sys.platform == "win32" or device.type != "cuda" else "nccl",
    init_method="env://",
    world_size=n_gpus if device.type == "cuda" else 1,
    rank=rank if device.type == "cuda" else 0,
)
```

magic shadow
#

and the other possible fix needed is removing @torch.jit.script from rvc\lib\algorithm\commons.py

#

(they should fix this soon enough)
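
If editing the file by hand feels error-prone, removing the decorator is a simple text filter. The sketch below works on a string and drops only lines that consist of `@torch.jit.script`; the function `strip_jit_script` and the sample function `example_op` are hypothetical and not taken from `commons.py`.

```python
def strip_jit_script(source):
    """Return source with lines consisting solely of @torch.jit.script removed."""
    kept = [
        line
        for line in source.splitlines(keepends=True)
        if line.strip() != "@torch.jit.script"
    ]
    return "".join(kept)


sample = (
    "import torch\n"
    "\n"
    "@torch.jit.script\n"
    "def example_op(a, b):\n"  # hypothetical function, for illustration only
    "    return torch.tanh(a) * torch.sigmoid(b)\n"
)
print(strip_jit_script(sample))
```

Keep a backup of the original file before writing the result back, so the change is easy to undo once the upstream fix lands.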

obtuse gazelle
#

it worked

obtuse gazelle
obtuse gazelle
#

@magic shadow When I start training :
AttributeError: module 'torch.distributed' has no attribute 'init_process_group'
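
That error means this torch build exposes `torch.distributed` without `init_process_group`. Instead of deleting the call, one alternative is to guard it, sketched below with stand-in objects so it runs without PyTorch. The helper name is hypothetical, and whether the rest of Applio's training loop copes without a process group is an assumption this thread does not confirm.

```python
from types import SimpleNamespace


def init_distributed_if_supported(dist, **kwargs):
    """Call dist.init_process_group(**kwargs) only if the build provides it.

    Returns True when the call was made, False when it was skipped.
    """
    if not hasattr(dist, "init_process_group"):
        return False
    dist.init_process_group(**kwargs)
    return True


# Stand-ins for torch.distributed: one build with the function, one without.
calls = []
full_build = SimpleNamespace(init_process_group=lambda **kw: calls.append(kw))
stripped_build = SimpleNamespace()

settings = dict(backend="gloo", init_method="env://", world_size=1, rank=0)
with_dist = init_distributed_if_supported(full_build, **settings)
without_dist = init_distributed_if_supported(stripped_build, **settings)
print(with_dist, without_dist)
```

In the real script the first argument would be the imported `torch.distributed` module.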

magic shadow
obtuse gazelle
#

ok done

#

I think it works, I'll try tomorrow

magic shadow
#

devs have reproduced the slowness with res b/c tests, so hopefully going to make some workaround

#

the dist.init_process_group should've been fixed partially(?) with latest

magic shadow
#

could you try downloading the latest applio, then installing the latest nightly wheels again and try inference?

#

or training with bf16 precision

#

@obtuse gazelle

obtuse gazelle
#

where do I find the latest nightly wheels ?

magic shadow
#

use the same method as above

obtuse gazelle
#

res1b : 9.5445s MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver <GemmFwdRest>, workspace required: 1228800, provided ptr: 0000000000000000 size: 0 MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver <GemmFwdRest>, workspace required: 1228800, provided ptr: 0000000000000000 size: 0 res1c : 9.4794s res2a : 0.6199s MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver <GemmFwdRest>, workspace required: 2867200, provided ptr: 0000000000000000 size: 0 MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver <GemmFwdRest>, workspace required: 2867200, provided ptr: 0000000000000000 size: 0 res2b : 21.3799s MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver <GemmFwdRest>, workspace required: 2867200, provided ptr: 0000000000000000 size: 0 MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver <GemmFwdRest>, workspace required: 2867200, provided ptr: 0000000000000000 size: 0 res2c : 21.3607s

magic shadow
#

that has not been fixed yet, I meant other things

#

bfloat16 precision training

obtuse gazelle
#

@magic shadow still no fix ?

magic shadow
obtuse gazelle
#

should I try?

magic shadow
#

you can give it a try and report back

obtuse gazelle
#

ok thanks, I'll try tomorrow

magic shadow
#

pip install --index-url https://rocm.nightlies.amd.com/v2/gfx120X-all/ --pre torch==2.9.1+rocm7.11.0a20251223

#

you can test it in a separate virtual environment

magic shadow
obtuse gazelle
#

mb

#

I’m not at home, I’ll try saturday !