I have two GPUs in my machine: the primary is a 6GB card and the secondary is an 8GB card. The secondary is in use by another service, so I want InvokeAI to use only the primary 6GB card. However, it returns the error below whenever I select "cuda:1", and selecting "cuda" forces the 8GB card to be used, so both workloads share that card and everything slows down drastically.
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cuda:1! (when checking argument for argument weight in method wrapper_CUDA_cudnn_convolution)
It does seem to allocate VRAM on the primary card, even though it refuses to actually run. As soon as I try to invoke, 60% of the primary card's VRAM is reserved and is not released until I shut down the console.
I'd also like to note that only "cuda" and "cuda:1" can be selected. Setting "cuda:0" puts the whole project into a "bootloop" lol
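A possible workaround, as a rough sketch rather than a confirmed fix: hide the 8GB card from the process entirely, so that plain "cuda" can only resolve to the 6GB card. `CUDA_VISIBLE_DEVICES` and `CUDA_DEVICE_ORDER` are standard CUDA environment variables. The helper name `restrict_to_gpu`, the GPU index 0, and the launch command are assumptions for illustration. With `CUDA_DEVICE_ORDER=PCI_BUS_ID` the index should match what `nvidia-smi` lists, whereas CUDA's default ordering puts the fastest card first, which may be why "cuda" lands on the 8GB card here.

```python
import os


def restrict_to_gpu(index, env=None):
    """Return a copy of the environment that exposes a single GPU to CUDA.

    `index` is the GPU number as listed by nvidia-smi (an assumption:
    check which index the 6GB card actually has on your machine).
    """
    new_env = dict(os.environ if env is None else env)
    # Number devices by PCI bus position so indices match nvidia-smi,
    # instead of CUDA's default fastest-first ordering.
    new_env["CUDA_DEVICE_ORDER"] = "PCI_BUS_ID"
    # Only this device is visible; inside the process it appears as "cuda:0".
    new_env["CUDA_VISIBLE_DEVICES"] = str(index)
    return new_env


if __name__ == "__main__":
    env = restrict_to_gpu(0)  # hypothetical index of the 6GB card
    print(env["CUDA_DEVICE_ORDER"], env["CUDA_VISIBLE_DEVICES"])
    # To launch with this environment (command is a placeholder):
    #   import subprocess
    #   subprocess.run(["<your InvokeAI launch command>"], env=env)
```

The variables have to be set before the process initializes CUDA, which is why the sketch builds the environment for a child process instead of changing it after startup. With only one device visible, the device setting would stay at "cuda" rather than "cuda:1".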