#tensorflow gpu support on arch
are you installing tensorflow globally or in a venv? @little galleon
It's supported alright
venv
do you have a requirements.txt?
within the venv just do pip install -r requirements.txt
that import statement failed because it was not installed globally
I don't like global installs (too messy)
i ran it within the venv
rerun the python file
if tensorflow was listed in that file it will work straight away
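A quick way to confirm the file is actually being run by the venv's interpreter, and that tensorflow is visible from it. This is only a minimal sketch using the standard library; the function names are made up for illustration.

```python
import importlib.util
import sys


def in_venv() -> bool:
    # Inside a venv, sys.prefix points at the venv while
    # sys.base_prefix still points at the base interpreter.
    return sys.prefix != sys.base_prefix


def is_importable(module_name: str) -> bool:
    # find_spec returns None when a top-level module cannot be found.
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    print("interpreter:", sys.executable)
    print("inside a venv:", in_venv())
    print("tensorflow importable:", is_importable("tensorflow"))
```

If the interpreter path printed is not the one inside the venv, the editor is probably still pointed at the global Python.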
is that DataSpell?
oh, you might need to restart that one
tensorflow == 2.12.0
opencv-python == 4.7.0.72
pillow == 9.5.0
ipynb == 0.5.1
import_ipynb == 0.1.4
matplotlib == 3.7.1
pandas == 2.0.2
scikit-learn == 1.3
seaborn == 0.12.2
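To see whether what is installed in the venv matches pins like the ones above, something along these lines could help. It is a rough sketch: it only understands plain `name == version` lines, and `parse_pins` / `installed_version` are hypothetical helper names.

```python
from __future__ import annotations

from importlib import metadata


def parse_pins(text: str) -> dict[str, str]:
    """Collect `name == version` pins, ignoring comments and other lines."""
    pins = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()
        if "==" not in line:
            continue
        name, version = line.split("==", 1)
        pins[name.strip()] = version.strip()
    return pins


def installed_version(name: str) -> str | None:
    """Version of the installed distribution, or None if it is missing."""
    try:
        return metadata.version(name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    sample = "tensorflow == 2.12.0\npandas == 2.0.2\n"
    for name, wanted in parse_pins(sample).items():
        print(f"{name}: pinned {wanted}, installed {installed_version(name)}")
```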
DS and Pycharm work just fine, at least for me
like just the text editor or like the whole machine
the editor
still doesnt work
is this a repo or a project of yours?
mind sharing the repo?
ofc
Yeah I want to see if they left instructions, sometimes they are not straightforward
its my code lmfaoo
mb sorry shoulda clarified
well shit look at you, I heard about this project in another discord
I'm in a few AI discords where projects are shared
damn thats funny
bet
btw i put req.txt in this folder called setup
so j like cd setup then pip install
also if u have the invite link to the discord where they were talking abt this would u mind sharing
im curious to see what they were talking abt lol
it was in HuggingFace I think
I have to double check, I've been nerding out with AI stuff the last few months
ill give it a look lol
what file did you call btw
its in /models/regressions/models
LFW.ipynb
though you can run any of them lol
lmfaoo yeah im not the best at organizing
I think I found the problem, hold on, let me confirm it is the issue
can you do pacman -Q | grep cuda please
if it returns empty you're missing the cuda-toolkit
cuda 12.3.1-2
python-pycuda 2022.2.2-4
python-tensorflow-opt-cuda 2.15.0-6
tensorflow-opt-cuda 2.15.0-6
it's not recognizing your GPU, that's the issue
I have another thing I wanna try real quick, I followed your instructions for the venv
I normally put that inside the project folder
like my machine or the code
the code
add this under the import for tf, it will print 0: print("Num GPUs Available: ", len(tf.config.experimental.list_physical_devices('GPU')))
I can git pull
i think i forgot to push that
whoops
but yeah when i run that code i get 0
Num GPUs Available: 0
2024-01-12 19:25:44.841520: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:996] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero. See more at https://github.com/torvalds/linux/blob/v6.0/Documentation/ABI/testing/sysfs-bus-pci#L344-L355
2024-01-12 19:25:44.860821: W tensorflow/core/common_runtime/gpu/gpu_device.cc:1956] Cannot dlopen some GPU libraries. Please make sure the missing libraries mentioned above are installed properly if you would like to use GPU. Follow the guide at https://www.tensorflow.org/install/gpu for how to download and setup the required libraries for your platform.
Skipping registering GPU devices...
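The "Cannot dlopen some GPU libraries" warning means TensorFlow could not load at least one CUDA shared library. A rough way to see which ones the dynamic loader can find from the current environment is sketched below. The library list is an assumption about the usual set TensorFlow wants, and `ctypes.util.find_library` only approximates what TensorFlow's own loading does, so treat the result as a hint.

```python
import ctypes.util

# Assumed list of CUDA libraries a GPU build of TensorFlow typically loads.
CUDA_LIBS = ["cudart", "cublas", "cufft", "curand", "cusolver", "cusparse", "cudnn"]


def missing_libs(names):
    """Return the names the system library search cannot resolve."""
    return [name for name in names if ctypes.util.find_library(name) is None]


def gpu_count():
    """Number of GPUs TensorFlow sees, or None if TensorFlow is not installed."""
    try:
        import tensorflow as tf
    except ImportError:
        return None
    return len(tf.config.list_physical_devices("GPU"))


if __name__ == "__main__":
    print("libraries not found:", missing_libs(CUDA_LIBS))
    print("GPUs seen by TensorFlow:", gpu_count())
```

`tf.config.list_physical_devices` is the non-experimental name for the same call used in the one-liner above.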
@little galleon after a lot of prodding around, I can't get it to work within a venv
there's not a lot of docs on Arch but it seems like global installs work just fine, of course that's never recommended
also pip install tensorflow[and-cuda], this doesn't exist
yeah i think its only for ubuntu
not sure what tensorflow are thinking but that doesn't exist and they recommend that too
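One way to check whether the tensorflow that is actually installed declares an `and-cuda` extra is to read its package metadata. My understanding is that the extra only appeared in releases newer than the 2.12.0 pinned in the requirements (2.14, if I recall correctly), which would explain pip not finding it, but that is an assumption worth verifying. `declared_extras` is a hypothetical helper name.

```python
from importlib import metadata


def declared_extras(dist_name):
    """Extras declared by an installed distribution, or None if it is not installed."""
    try:
        meta = metadata.metadata(dist_name)
    except metadata.PackageNotFoundError:
        return None
    # Each extra shows up as a separate Provides-Extra metadata field.
    return meta.get_all("Provides-Extra") or []


if __name__ == "__main__":
    print("tensorflow version extras:", declared_extras("tensorflow"))
```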
huh thats odd
so j global install ig
idk what in the venv is causing it to mess up
do i have to like pacman install cuda inside the venv
I'll keep going at it, someone said that conda is better for it but I never use conda outside of Win env
no just normally, you do have to add some lines to the $PATH
it's in the CUDA quick install guide
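To check whether those lines took effect in the shell that launches Python, the environment can be inspected directly. This sketch assumes CUDA lives under /opt/cuda, which is where I believe the Arch package puts it; adjust the path if yours differs. `has_entry` is a made-up helper.

```python
import os

# Assumed install prefix for the CUDA toolkit; not confirmed in this thread.
CUDA_DIR = "/opt/cuda"


def has_entry(path_value: str, directory: str) -> bool:
    """True if `directory` is one of the entries in a PATH-style string."""
    wanted = os.path.normpath(directory)
    entries = [p for p in path_value.split(os.pathsep) if p]
    return any(os.path.normpath(p) == wanted for p in entries)


if __name__ == "__main__":
    path = os.environ.get("PATH", "")
    ld_path = os.environ.get("LD_LIBRARY_PATH", "")
    print("bin on PATH:", has_entry(path, os.path.join(CUDA_DIR, "bin")))
    print("lib64 on LD_LIBRARY_PATH:", has_entry(ld_path, os.path.join(CUDA_DIR, "lib64")))
```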
I'll spin up a VM with arch and solo test it, I forgot what else works since I do have one project pulling tensorflow globally but I don't remember if it's just a CPU thing setup
I'll keep you posted, shouldn't take long
ok tysm
ok i got the code to recognize the gpu
the issue was cuda and tf were not compatible
so i upgraded to tf v2.15 and it works
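For anyone hitting the same mismatch: TensorFlow can report which CUDA and cuDNN versions its wheel was built against, which can then be compared with what `pacman -Q` shows. A small sketch, with `tf_build_info` as a hypothetical wrapper around `tf.sysconfig.get_build_info()`:

```python
def tf_build_info():
    """CUDA-related build info of the installed TensorFlow, or None if it is missing."""
    try:
        import tensorflow as tf
    except ImportError:
        return None
    info = tf.sysconfig.get_build_info()
    # CPU-only builds may not carry the version keys, hence .get().
    keys = ("is_cuda_build", "cuda_version", "cudnn_version")
    return {key: info.get(key) for key in keys}


if __name__ == "__main__":
    print(tf_build_info())
```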
but now im getting another error
JIT compilation failed.
[[{{node sequential/batch_normalization/batchnorm/Rsqrt}}]] [Op:__inference_train_function_1290]
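One common cause of "JIT compilation failed" (an assumption here, nothing in this thread confirms it) is XLA not finding the CUDA libdevice files. The sketch below looks for them under an assumed /opt/cuda prefix and points XLA at that directory via `XLA_FLAGS`, which has to happen before tensorflow is imported. Both helper names are made up.

```python
import os
from pathlib import Path

# Assumed CUDA install prefix; change it to match the machine.
CUDA_DIR = "/opt/cuda"


def find_libdevice(cuda_dir):
    """List libdevice bitcode files under <cuda_dir>/nvvm/libdevice."""
    return sorted(Path(cuda_dir).glob("nvvm/libdevice/libdevice*.bc"))


def point_xla_at(cuda_dir):
    """Append --xla_gpu_cuda_data_dir to XLA_FLAGS unless it is already set."""
    flag = f"--xla_gpu_cuda_data_dir={cuda_dir}"
    existing = os.environ.get("XLA_FLAGS", "")
    if flag not in existing:
        os.environ["XLA_FLAGS"] = f"{existing} {flag}".strip()
    return os.environ["XLA_FLAGS"]


if __name__ == "__main__":
    print("libdevice files:", find_libdevice(CUDA_DIR))
    print("XLA_FLAGS:", point_xla_at(CUDA_DIR))
    # import tensorflow only after this point so the flag is picked up
```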
what?
I upgraded and I couldn't get that to change
Let me go back to my env, did you push those changes?
nah i didnt want to change the req file
just remove all the versions from it
and delete your venv
then reinstall all the packages
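Removing the versions by hand works, but it can also be scripted. A minimal sketch, assuming the file only contains simple `name <operator> version` lines; `strip_pins` is a hypothetical helper and comment lines pass through unchanged.

```python
import re

# Version operators that may follow a package name in a requirements line.
_OPERATOR = re.compile(r"\s*(?:===|==|~=|!=|<=|>=|<|>)")


def strip_pins(text: str) -> str:
    """Return the requirements text with version specifiers removed."""
    names = []
    for line in text.splitlines():
        name = _OPERATOR.split(line, maxsplit=1)[0].strip()
        if name:
            names.append(name)
    return "".join(name + "\n" for name in names)


if __name__ == "__main__":
    print(strip_pins("tensorflow == 2.12.0\nscikit-learn == 1.3\n"), end="")
```

After that, deleting the venv folder, creating a fresh one with `python -m venv`, and running pip install -r on the stripped file lets pip pick the versions.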
lol I didn't want to do all that
I've been tinkering so much my brain is all over the place
I do too
I just like Arch more
imma try getting it to work there
.s rs
<command to print output> |& curl --data-binary @- https://paste.rs
I need that sorry lol
I couldn't remember the paste export thing, I get other issues compared to yours
that's what I get on a new venv and new install, that was not there before
you've run this model before? how CPU intensive is it?
unless tensorflow fix that, if something was not intensive enough it would not go to GPU but stay on CPU
i mean if i need to run intensive models at some point
@native zodiac i messed around with it some more but nothing seems to work
i think the cuda cudnn and tf versions are just incompatible
Probably, I couldn’t get it to work either
have u used tf and arch w gpu support before though
globally not through venv
though I'm not sure if it was GPU or not, tf is CPU first
checked it, it was CPU only, hmm
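To tell whether a run actually lands on the GPU rather than guessing, TensorFlow can be asked where an op executed. As far as I know, when a GPU is visible and an op has a GPU kernel, TensorFlow prefers the GPU by default, so a CPU result usually means the GPU was never registered. A sketch with a hypothetical `matmul_device` helper:

```python
def matmul_device():
    """Device string of a small matmul result, or None if TensorFlow is missing."""
    try:
        import tensorflow as tf
    except ImportError:
        return None
    a = tf.random.uniform((256, 256))
    b = tf.random.uniform((256, 256))
    # Eager tensors record the device that produced them.
    return tf.matmul(a, b).device


if __name__ == "__main__":
    print("matmul ran on:", matmul_device())
```

The returned string ends in something like GPU:0 or CPU:0. Calling `tf.debugging.set_log_device_placement(True)` before building the model logs the same information for every op.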