Hi, I tried save_pretrained_merged and it said Model is not a PeftModel (no Lora adapters detected). Skipping Merge. Please use save_pretrained() or push_to_hub() instead!
Using save_pretrained seems to work, but the notebook clearly states that "This ONLY saves the LoRA adapters, and not the full model. To save to 16bit or GGUF, scroll down!" for save_pretrained.
Not very sure which one to rely on. Would appreciate clarification. Also, if save_pretrained is the intended method, what precision is the model saved with?