Hi! I'm trying to fine-tune gemma3 4b and then quantize it to Q8_0, but I get the errors you see in the screenshots. For the package versions, I'm using the official unsloth gemma3 notebook as a starting point. It still doesn't work even after updating unsloth and unsloth_zoo to the latest versions. Can someone help?
#Gemma3 gguf conversion doesn't work
we're updating unsloth to fully support the new architecture of the llama.cpp conversion class.
until then you can try to do the conversion on the console using the script that comes with llama.cpp, convert_hf_to_gguf.py
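A minimal sketch of what that console call could look like, built from Python so it can be pasted into a notebook cell. The directory and file names here (`llama.cpp`, `gemma3-merged`, `gemma3-q8_0.gguf`) are hypothetical placeholders, and it assumes the model was already saved as a full merged Hugging Face checkpoint rather than as LoRA adapters only. `--outfile` and `--outtype` are options of the converter script, but it changes often, so check `python convert_hf_to_gguf.py --help` in your checkout.

```python
import shlex
import sys
from pathlib import Path


def build_convert_command(llama_cpp_dir, merged_model_dir, outfile, outtype="q8_0"):
    """Return the argv list for llama.cpp's convert_hf_to_gguf.py.

    All paths are placeholders supplied by the caller; nothing is executed here.
    """
    script = Path(llama_cpp_dir) / "convert_hf_to_gguf.py"
    return [
        sys.executable,            # use the same Python that runs the notebook
        str(script),
        str(merged_model_dir),     # directory holding the merged HF checkpoint
        "--outfile", str(outfile),
        "--outtype", outtype,      # e.g. q8_0, f16, bf16
    ]


# Hypothetical paths, adjust to your own setup.
cmd = build_convert_command("llama.cpp", "gemma3-merged", "gemma3-q8_0.gguf")
print(shlex.join(cmd))

# To actually run the conversion (requires a llama.cpp checkout and its
# Python requirements installed):
# import subprocess
# subprocess.run(cmd, check=True)
```

Printing the command first makes it easy to copy it into a terminal and read the full error output there if the conversion fails.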
Will try, thanks!
Still having problems, and I don't know if I'm using it correctly
Interesting, so I got it to work today with:
if True: # Change to True to save to GGUF
    model.save_pretrained_gguf(
        "fhi-2025-may-0.2-float16",
        quantization_type = "Q8_0", # For now only Q8_0, BF16, F16 supported
    )
After doing:
if True: # Change to True to save finetune!
    model.save_pretrained_merged("fhi-2025-may-0.2-float16", tokenizer)
/content# pip freeze |grep unsloth
unsloth==2025.4.7
unsloth_zoo==2025.4.4
Yeah, sorry, my “after” step should have been first: save_pretrained_merged, then save_pretrained_gguf