Part 2 of my struggles on trying to get a model running on an AMD GPU. This time, I tried using a GGML model. The problem is, the GPU isn't being utilized. n-gpu-layers is set to 40. I'm on an AMD RX 6800 XT with ROCm 5.4.2 enabled. llama-cpp-python shouldn't be the problem since BLAS is showing up as 1 already. However, when I try to set up bitsandbytes, it doesn't say successful. Check images 4 and 5 to see the command line output when doing python -m bitsandbytes.
As for how I built bitsandbytes-rocm (maybe this is where it went wrong?), I first did ROCM_HOME=/opt/rocm-5.4.2, then I built bitsandbytes-rocm from https://github.com/agrocylo/bitsandbytes-rocm. After that, I ran python setup.py install, then did python -m bitsandbytes (which again, shows an error).