#How to install GPTQ-for-Llama with venv on Linux?
Arch is a pain to get anything working on, but if you have Docker, that might be a good idea; if not, Podman. Pull:
nvidia/cuda:11.8.0-devel-ubuntu22.04
nvidia/cuda:11.8.0-runtime-ubi8
nvidia/cuda:11.8.0-base-ubi8
Then it's just a simple docker compose up --build with the proper models in the right place and a .env that matches. Depending on your GPU you'll need the right arch_type: 30 series cards are arch version 8.6 (40 series are 8.9).
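If you're not sure which arch version your card needs, here's a rough sketch of a lookup by GeForce series. The function name `arch_for_gpu` and the table are just my own illustration (not part of GPTQ-for-LLaMa or any Docker setup), and it only covers consumer GeForce cards, so treat it as a starting point rather than a complete list.

```python
import re

# Compute capability by GeForce series number (consumer cards only).
# Hypothetical helper table for illustration; extend it for other cards.
SERIES_TO_ARCH = {
    "10": "6.1",  # Pascal, GTX 10xx
    "16": "7.5",  # Turing, GTX 16xx
    "20": "7.5",  # Turing, RTX 20xx
    "30": "8.6",  # Ampere, RTX 30xx
    "40": "8.9",  # Ada Lovelace, RTX 40xx
}


def arch_for_gpu(name: str) -> str:
    """Return the arch version for a GeForce model name such as 'RTX 3090'."""
    # The first standalone four-digit number is the model; its first two
    # digits are the series (e.g. '3090' -> '30').
    match = re.search(r"\b(\d{2})\d{2}\b", name)
    if match is None:
        raise ValueError(f"no four-digit model number found in {name!r}")
    series = match.group(1)
    if series not in SERIES_TO_ARCH:
        raise ValueError(f"unknown GeForce series {series!r} in {name!r}")
    return SERIES_TO_ARCH[series]


if __name__ == "__main__":
    for gpu in ("GTX 1080 Ti", "RTX 3090", "RTX 4070 Ti"):
        print(gpu, "->", arch_for_gpu(gpu))
```

If PyTorch is already installed and can see the GPU, `torch.cuda.get_device_capability()` reports the same thing as a `(major, minor)` tuple, which is more reliable than guessing from the model name.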