forum
Is there any way to split one model across two gpus whilst training? or plans to do so?
TypeError: is_bf16_supported() got an unexpected keyword argument 'including_emulation'
Solved
Deploying using VLLM
Problem in running on google colab T4
Hi all,
unsloth/Meta-Llama-3.1-405B-bnb-4bit is broken
Is there any colab notebook for inferencing unsloth model using vllm
Solved
Hallucination issue
Open
undefinedError: 'dict object' has no attribute 'value'
Unable to run unsloth finetuned model with ollama
Open
invalid model reference
Running into LLVM ERROR: Cannot select: intrinsic %llvm.nvvm.shfl.sync.bfly.i32
Open
Reporting a bug
Open
Why Top_P set to 1 is more deterministic than Top_P set to 0.9?
Tokenizer
Open
The fastest LLM inference on the server
Open
Can I retrain my model?
Open
AttributeError: torch._dynamo.config.vocab_size does not exist
Open
The SFTTrainer parameter `max_steps` can't be `None`
gemma-2-2b gives cache size errors
Open
Loading model on 2 GPUs for inference
Open
Huggingface Inference Endpoints - Chat Template error
Open
Error with Model Loading in Ollama
Unknown RoPE scaling type dynamic (internlm2.5)
Reward training for chat
Meta Tensor error while training
Finetune on completions only and export to Ollama?
Open
TypeError: LlamaRotaryEmbedding.__init__() got an unexpected keyword argument 'config'
Not an error, but Unsloth cannot patch MLP layers with our manual autograd engine since either LoRA
What does `max_seq_length` do? Is it related to the context window?
VRAM Explosion During Loop Training - Help Needed!
Solved
Strange output of Meta-Llama-3.1-8B-bnb-4bit after fine tuning
Unsloth: `unsloth/Meta-Llama-3.1-8B-bnb-4bit` is not a base model or a PEFT model.
llama3.1 - Continuous Pretraining - Rope scaling error when loading model.
Can Mike/Daniel share how you guys split/mistral-fied the Phi-3 mini model?
Try free version on multi-GPU machine .
error during finetune for classification
hyperparameter choice
Fine-tuning for summarization
standardize_sharegpt not working
train on specific gpu
LLM unable to retain knowledge through training.
Open
Error creating GGUF quants for Gemma 2
Remote Code Error when Loading Model
Issue with 16bit training
Script for bnb conversion
Chat Data Finetuning!
Can't save due to collab storage?
New PHI-3 mini (june) support request
Batch Generation in Unsloth