#Are Qwen 2.5 models supported for GRPO long context training?

4 messages · Page 1 of 1 (latest)

plain rose
sand robin
plain rose
#

Thanks a lot Mike! Thanks thank a lot! I have a local GPU so not worrying about the GPU. How to turn on bf16? And what's the difference between the notebook your link pointing to and the notebook titled the Long Context GRPO?

#

My understanding is that the difference lies in Long Context GRPO notebook uses 4bit quantization and qLora so it can support long context? Other than these two, what are other differences?