#Mistral MOE loading?
Update to the latest ooba and use llama.cpp (GGUF) or AutoGPTQ instead.
Yeah, GGUF is so slow though. I was hoping to get GPTQ working for some of the 3-bit models, since they are less than 24 GB. I'm guessing they take more than that to hold in memory?
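For the memory question, a rough back-of-the-envelope estimate may help. The sketch below is only an approximation. It assumes about 46.7B total parameters for Mixtral 8x7B and the layer shape from the public model config (32 layers, 8 KV heads, head dimension 128). The function names are made up for illustration. Real loaders also need room for quantization scales, activations, and the CUDA context, so actual usage will be higher than these numbers.

```python
GIB = 2**30  # bytes per GiB


def estimate_weight_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights alone, in GiB."""
    return n_params * bits_per_weight / 8 / GIB


def estimate_kv_cache_gib(n_layers: int, n_kv_heads: int, head_dim: int,
                          context_len: int, bytes_per_elem: int = 2) -> float:
    """Approximate fp16 KV cache size in GiB (one K and one V tensor per layer)."""
    return 2 * n_layers * n_kv_heads * head_dim * context_len * bytes_per_elem / GIB


if __name__ == "__main__":
    # Assumed figures for Mixtral 8x7B; adjust for the model you actually load.
    n_params = 46.7e9
    weights = estimate_weight_gib(n_params, bits_per_weight=3)
    kv = estimate_kv_cache_gib(n_layers=32, n_kv_heads=8, head_dim=128,
                               context_len=4096)
    print(f"weights: {weights:.1f} GiB, KV cache at 4k context: {kv:.2f} GiB")
```

Under these assumptions the raw 3-bit weights come in below 24 GB, so whether the model fits depends mostly on the overhead the estimate leaves out and on the context length you run with.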
Do I understand correctly that you no longer have to install anything manually for Mixtral to work in the text UI? Do I just need to update the interface via update_windows.bat? Would that be enough?