#nvme I/O error and HA crash/lockup

7 messages · Page 1 of 1 (latest)

hasty isle
#

Getting the following I/O errors on my PI with nvme hat. Drive is maybe only 1 or 2 years old (WD black 500g). The errors seem random and don't notice any issues caused until randomly HA will become unresponsive. I'll turn on the display for it and see those I/O errors scrolling by. It locks up and have to hard shutdown with pi button to get it off and reboot (sometimes have to do that 1 or two times as when it reboots will start with errors again)

Any ideas? Not sure if it's hardware related or HA. Seems to have more issues when doing an update and needing to restart than anything else. I've formatted and restored the install a couple of times in the past thinking it was issue with install but always seems to come back.

minor wren
#

Sounds like a hardware/power related issue

misty scroll
#

NVME's can certainly go bad...

#

But with a hat you are pulling hard on the supply, I agree, check that first.

hasty isle
#

How would i go about checking it? I got the recommended power brick to run the pi hat. Btw for that picture i posted, it was taken Sept 9th so it's been little over 2 weeks with no new errors. Also have it on an UPS so no power outage unless it locks up and have to shut down

misty scroll
#

TBH, not sure. If it were me, I'd have some backup hardware on order. Either spare parts or just order a SFF PC (I would go used) and have a back-up plan in place. You may have outgrown your PI days anyway. Most people do after a couple of years.

#

If you plug the nvme into a linux box or something you can test it, but it will wipe it.