#Gpt-oss -20b- f16

24 messages · Page 1 of 1 (latest)

sand ledge
#

Can we do gguf conversion with this model?
Can anyone help me out to end to end finetuning with this model?

lime elm
#

bfloat16 . you should be able to yes

#

so gpt-oss-20b-bf16

#

you need to use save_pretrained_merge first to merge the model

eternal light
sand ledge
#

@lime elm oh great thank you and then for gguf conversion Which comment should I use?

sand ledge
#

Yes @valid goblet I have read this but I couldn't able to do gguf conversion and can't able to inference or validate the merged model. Can you help me out guys?

lime elm
#

@sand ledge you need to compile llama.cpp

sand ledge
#

@lime elm Is there any updated llama.cpp for gpt oss?

lime elm
#

wait gimme a sec

oak inlet
# lime elm wait gimme a sec

It seems that this is not just the new model, I fientuned phi-3.5-mini and have the same problem, can u fix it pls

lime elm
#

saving to gguf is non functional right now. nothign to do with the new model. you have to do it manually

#

@sand ledge are you on colab or a local machine?

oak inlet
sand ledge
sand ledge
sand ledge
oak inlet
lime elm
#

sorry guys give me 15 mins. I'll tell you exactly the steps to take.

lime elm
#

so if you're on a local machine and not colab

git clone https://github.com/ggml-org/llama.cpp
cmake --build llama.cpp/build --config Release -j --clean-first --target llama-cli llama-gguf-split llama-quantize
cd llama.cpp
python3 convert_hf_to_gguf.py path_to_your_model/ --outfile out_model_name.gguf
build/bin/llama-quantize out_model_name.gguf  out_model_name-Q8_0.gguf Q8_0

then you can run it with

build/bin/llama-cli --model out_model_name-Q8_0.gguf -p "The meaning to life and the universe is"
#

if you're on colab @oak inlet i will ping you in a thread now

sand ledge