Suddenly seeing this error in the classic Llama3.1_(8B)-Alpaca.ipynb notebook. I'm on a L4 instance and uploaded a json dataset for finetuning. This process worked in the past week but suddenly isn't today. I've run the recommended code and tried to re-quantize but no dice. Even fixed the code to use CMake instead and still no dice. Suggestions?
This might take 3 minutes...
Traceback (most recent call last):
File "/content/llama.cpp/convert_hf_to_gguf.py", line 32, in <module>
from mistral_common.tokens.tokenizers.base import TokenizerVersion
ModuleNotFoundError: No module named 'mistral_common'
RuntimeError Traceback (most recent call last)
/tmp/ipython-input-2419798222.py in <cell line: 0>()
10
11 # Save to q4_k_m GGUF
---> 12 if True: model.save_pretrained_gguf("model", tokenizer, quantization_method = "q4_k_m")
13 if False: model.push_to_hub_gguf("hf/model", tokenizer, quantization_method = "q4_k_m", token = "")
14
1 frames
/usr/local/lib/python3.11/dist-packages/unsloth/save.py in save_to_gguf(model_type, model_dtype, is_sentencepiece, model_directory, quantization_method, first_conversion, _run_installer)
1218 )
1219 else:
-> 1220 raise RuntimeError(
1221 f"Unsloth: Quantization failed for {final_location}\n"
1222 "You might have to compile llama.cpp yourself, then run this again.\n"\
RuntimeError: Unsloth: Quantization failed for /content/model/unsloth.BF16.gguf
You might have to compile llama.cpp yourself, then run this again.
You do not need to close this Python program. Run the following commands in a new terminal:
You must run this in the same folder as you're saving your model.
git clone --recursive https://github.com/ggerganov/llama.cpp
cd llama.cpp && make clean && make all -j
Once that's done, redo the quantization.
jesus ok