Hello, I’m looking for some help setting up local fine-tuning (SFT and GRPO). I keep getting segmentation faults and I can’t figure out the cause.
I’m fine-tuning on Ubuntu 24.04 with CUDA Toolkit 12.8. I’ve tried several setups with different CUDA wheels for torch (cu124, etc.) and different torch installs, but then I get dependency errors with vllm, xformers, etc.
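In case it helps with diagnosing this, here is a minimal sketch of a script that dumps the installed versions of the packages named above without importing them (so it should not trigger the crash itself). The package list and the helper names `installed_versions` and `report` are just my assumptions for illustration; the torch version string usually carries the CUDA wheel tag (for example a `+cu124` suffix), which is the part I suspect is mismatched.

```python
import platform
from importlib import metadata

# Packages mentioned in this post; extend the list as needed (assumption).
PACKAGES = ["torch", "vllm", "xformers"]


def installed_versions(packages):
    """Return {package: version string or None} using package metadata only."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            # Package is not installed in this environment.
            versions[name] = None
    return versions


def report(packages=PACKAGES):
    """Build a short plain-text summary suitable for pasting into a thread."""
    lines = [f"python {platform.python_version()} on {platform.platform()}"]
    for name, version in installed_versions(packages).items():
        lines.append(f"{name}: {version or 'not installed'}")
    return "\n".join(lines)


if __name__ == "__main__":
    print(report())
```

Running the training script with `python -X faulthandler` should also print a Python traceback when the segfault happens, which might narrow down which library is crashing.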
Has anybody managed to get this working smoothly?
Cheers