accuracy while training | Unsloth AI | Page 1

per_device_train_batch_size=4,
gradient_accumulation_steps=4,

        eval_strategy="steps",
        eval_steps=60,

        save_strategy="steps",
        save_steps=60,

        warmup_steps=10,
        # max_steps = 150,
        num_train_epochs=3,  # Set this instead of max_steps for full training runs
        learning_rate=4e-5,
        logging_steps=1,
        optim="adamw_8bit",
        weight_decay=0.01,
        lr_scheduler_type="cosine",
        seed=3407,
        output_dir=output_dir,
        report_to="none",  # For Weights and Biases

        # You MUST put the below items for vision finetuning:
        remove_unused_columns=False,
        dataset_text_field="",
        dataset_kwargs={"skip_prepare_dataset": True},
        max_length=2048,

        metric_for_best_model="eval_loss",
        greater_is_better=False,

finetune_vision_layers=True, # False if not finetuning vision layers
finetune_language_layers=True, # False if not finetuning language layers
finetune_attention_modules=False, # False if not finetuning attention layers
finetune_mlp_modules=False, # False if not finetuning MLP layers

    # This is the key change to precisely match the paper
    target_modules=["q_proj", "v_proj"],

    r=8,  # The larger, the higher the accuracy, but might overfit
    lora_alpha=16,  # Recommended alpha == r at least
    lora_dropout=0,
    bias="none",
    random_state=3407,
    use_rslora=False,  # We support rank stabilized LoRA
    loftq_config=None,  # And LoftQ

With these settings i dont get good results at all, the VLM should learn to decide if an image has a good cropped box overlayed ontop of the image. So its binary classification, 2000 image dataset. help me pick better variables or what to change@warm roost

#accuracy while training