Mistral MOE loading? | Text Generation WebUI | Page 1

patent wind Dec 15, 2023, 1:45 PM

Does anyone know how to get this model to load, or what this error means? I think the model should fit on my 3090, since it's only 20.1 GB.

near hatch Dec 15, 2023, 6:30 PM

Update to the latest ooba and use llama.cpp (GGUF) or AutoGPTQ instead.

patent wind Dec 23, 2023, 2:52 PM

Yeah, GGUF is so slow though. I was hoping to get GPTQ working for some of the 3-bit models, since they are under 24 GB. I'm guessing they take more than that to hold in memory?
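
For a rough sanity check on that guess, here is a minimal sketch that adds the quantized weight size to an fp16 KV cache and a fixed overhead. The numbers are assumptions, not measurements: about 46.7B total parameters and 32 layers, 8 KV heads, head dimension 128 for a Mixtral-8x7B-style model, plus a placeholder 1 GB for the CUDA context and activations. The function names are made up for illustration and actual usage depends on the loader.

```python
# Rough VRAM estimate: quantized weights + fp16 KV cache + fixed overhead.
# Every default below is an assumption for a Mixtral-8x7B-style model.

def weights_gb(n_params: float, bits_per_weight: float) -> float:
    """Size of the quantized weights in GB (1 GB = 1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


def kv_cache_gb(context_len: int, n_layers: int = 32, n_kv_heads: int = 8,
                head_dim: int = 128, bytes_per_value: int = 2) -> float:
    """Size of the KV cache in GB; the factor 2 covers keys and values."""
    per_token = 2 * n_layers * n_kv_heads * head_dim * bytes_per_value
    return per_token * context_len / 1e9


def estimate_vram_gb(n_params: float, bits_per_weight: float,
                     context_len: int, overhead_gb: float = 1.0) -> float:
    """Total estimate; overhead_gb is a placeholder guess, not a measured value."""
    return (weights_gb(n_params, bits_per_weight)
            + kv_cache_gb(context_len)
            + overhead_gb)


if __name__ == "__main__":
    # Assumed: 46.7e9 parameters, 3.5 bits per weight, 4096-token context.
    total = estimate_vram_gb(46.7e9, 3.5, 4096)
    print(f"estimated VRAM: {total:.1f} GB")
```

The point is that the file size is only a lower bound: the cache grows linearly with context length, and on a card that also drives the display some of the 24 GB is already taken.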

frigid chasm Dec 23, 2023, 3:37 PM

near hatch update to latest ooba and use llama.cpp (gguf) or autogptq instead

Do I understand correctly that you no longer have to install anything manually for Mixtral to work in the text UI? Do I just need to update the interface via update_windows.bat? Would that be enough?

compact wave Dec 23, 2023, 4:53 PM

You could try https://huggingface.co/LoneStriker/dolphin-2.5-mixtral-8x7b-3.5bpw-h6-exl2-2

near hatch Dec 23, 2023, 7:13 PM

frigid chasm I understand correctly that now you don’t have to install anything manually for ...

yes
