forum

1495 threads · Page 15 of 30

How does max_seq_length affect a models current context window? 2 messages
Question about conversational datasets 3 messages
Pretraining then merging Gemma 3 doesn't work 5 messages
Open
mistral:7.0-Instruct 4 messages
Gemma3 `save_pretrained_merged()` doesn't work. 98 messages
Open
load_lora for Gemma3ForCausalLM 4 messages
Solved
Custom Trainer 9 messages
Solved
Question about dynamic quants 2 messages
Request for unsloth version of canopylabs/3b-hi-ft-research_release 12 messages
Open
Help in understanding train_on_responses_only 3 messages
Open
Fine tuning Qwen 3 - need tips to improve final loss 49 messages
Solved
Tokenizing issue after installing unsloth 11 messages
Open
undefined symbol: _ZN5torch3 3 messages
Open
Qwen3-8B-Q4_K_M GGUF Looping forever 31 messages
Open
RuntimeError: PassManager::run failed 2 messages
Open
Difference between FastLanguageModel and FastModel. 3 messages
Solved
Help in finetuning Qwen2.5 for SQL Generation 2 messages
Open
Saving Gemma 3 LoRA finetune failed 7 messages
Open
Model acting differently in colab than ollama. 2 messages
Segmentation fault (Core dumped) 2 messages
Running inference on saved model behaves weirdly 11 messages
Unsloth GRPO trainer evaluates entire eval set every eval_step 2 messages
Help on Choosing a Model for a Prompt-Processing. 7 messages
Open
Merging 4-bit Checkpoint into Phi-4 Base Model – Model Inference Inconsistent 20 messages
Open
Gemma3 Finetune Save_pretrained_merged error 31 messages
Vision finetune gemma3 raised "TypeError: 'int' object is not iterable." 479 messages
how can i use prompt-completion dataset and completion_only_loss paramater in SFTTrainer? 4 messages
AttributeError: 'LLMEngine' object has no attribute 'model_executor' - fix in this thread 32 messages
Open
Does this snippet only accepts GGUF models? 4 messages
QLORA's specific features 9 messages
Open
How to use custom optimiser ? 14 messages
Gemma3 gguf conversion doesn't work 11 messages
Open
Error creating PEFT model with Qwen3MoE 48 messages
Open
modernbert issue 2 messages
Confused about reasoning + non-reasoning data mix in Qwen3 (14B) notebook 10 messages
Open
BLEU compute metrics causes trainer to crash 4 messages
Open
Help, unsloth compiled cash 2 messages
GRPO custom parameters 15 messages
Phi-4-reasoning not separating thought in llama.cpp 4 messages
NotImplementedError: No operator found for `memory_efficient_attention_backward` 39 messages
Solved
Need some help to understand the results I got from custom dataset fientuning 66 messages
unsloth/Qwen3-32B-unsloth-bnb-4bit 4 messages
Questions regarding multi-turn conversation finetuning 13 messages
Open
Thank you Unsloth bros 2 messages
Cannot load local checkpoints 61 messages
Qwen3 doesn't support vLLM 16 messages
Open
So i tried to tune qwen using a grpo and eventually i got some problem. 55 messages
Open
Unsloth: Failed to patch SmolVLMForConditionalGeneration forward function. 2 messages
Fine-tuning Gemma3 6 messages
Solved
Support for GLM? 13 messages
Open