Hey guys, I know the free version can only run on single GPU, but if I'm trying to run it on GPU 1 or 2 instead of 0, how would I do that? Looks like I'm missing a setup step since I always get a RunTime error on gpu>0
os.environ['CUDA_VISIBLE_DEVICES'] = '2'
gpu = 1
inputs = tokenizer(d_prompt.format(text, ''), return_tensors="pt").to("cuda:%d" % gpu)
RuntimeError: CUDA error: invalid device ordinal
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1
Compile with TORCH_USE_CUDA_DSA to enable device-side assertions.