Do you know why this error could be showing up? I am doing GRPO with qwen 2.5. I checked I already have the latest versions of vllm and unsloth:
RuntimeError: Inplace update to inference tensor outside InferenceMode is not allowed.You can make a clone to get a normal tensor before doing inplace update.See https://github.com/pytorch/rfcs/pull/17 for more details.
It seems the crash occurred at unsloth_zoo/gradient_checkpointing.py
Pytorch's twitter handle has recommended to clone the inference tensor
The most annoying part of this error is that it is not even occuring every evaluation, just randomly, and it's driving me crazy