File "/home/user/Documents/amy/.venv/lib/python3.11/site-packages/unsloth_zoo/vision_utils.py", line 826, in __call__
batch = self.processor(**proc_kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/user/Documents/amy/.venv/lib/python3.11/site-packages/transformers/models/qwen3_vl/processing_qwen3_vl.py", line 191, in __call__
num_image_tokens = image_grid_thw[index].prod() // merge_length
~~~~~~~~~~~~~~^^^^^^^
IndexError: index 2 is out of bounds for dimension 0 with size 2
getting this when trying to train Qwen3 VL
my data is like this (i printed dataset[0]):
{'messages': [{'role': 'user', 'content': [{'type': 'text', 'text': 'Tell me what this is', 'image': None}, {'type': 'image', 'text': None, 'image': <PIL.Image.Image image mode=RGB size=512x512 at 0x73B9AE433710>}]}, {'role': 'assistant', 'content': [{'type': 'text', 'text': "<think>\nOk, the user is asking me to say what the image they sent is. Looking at the image, I see the MULTIMODAL_TRAIN_TEST symbol (a lilac hexagon with a checkerboard circle in the center) on a black (#000000) background. So, I'll answer that.\n</think>\nHii! That's the MULTIMODAL_TRAIN_TEST symbol on a pitch black background.", 'image': None}]}]}
some examples also dont have any images