My HA server started becoming unresponsive almost daily, or sometimes multiple times per day, with only a hard reset resolving the issue. Looking through the host logs, I see issues that suggest to me it may be either a failing NIC or a broadcom driver issue and I figured I'd ask y'all's opinion before I go buy an external NIC or similar.
In my host logs, I'm receiving errors like tg3_stop_block timed out, Link is Down, NETDEV WATCHDOG ... transmit queue 0 timed out ... resetting, and issues showing where it fails to query external DNS servers due to an i/o timeout.
I can post a full log if needed. Mostly just sanity checking myself that I may need a new NIC.