#Quantization method

8 messages · Page 1 of 1 (latest)

arctic niche
#

Hello, I am trying to quantize the model, I see several libraries. Do you have any advice on which library is the best? Or they all are fine and I can choose any library

finite grail
#

This does not quantize a model. It allows you to use a model that is already quantized and you specify the quanization format.

arctic niche
#

thank you so much, now i get it

#

2024-06-27T10:50:05.563358317Z ValueError: Quantization method specified in the model config (bitsandbytes) does not match the quantization method specified in the quantization argument (gptq).

#

now I have this error

finite grail
#

You can't select GPTQ quantization when the model is quantized with bitsandbytes.

#

vllm does not support bitsandbytes quantization method.