#Failed to initialize NVML: Unknown Error

4 messages · Page 1 of 1 (latest)

fading rune
#

(compress) root@1908bfec7b85:/workspace# nvidia-smi

Failed to initialize NVML: Unknown Error

Every hour or so on my runpod instance, I get the above nvidia error.

I'm not changing anything with the machine -- I have to restart it to fix it. Any ideas?

Thanks

full oarBOT
#

To help others find answers, you can mark your question as solved via Right click solution message -> Apps -> ✅ Mark Solution

vast depot
cedar zenith
#

looks like something need to be changed for the host server, might want to create a support ticket with pod id and attach the link nerdylive just post, hope they can fix it.