#One cluster is offline
1 messages · Page 1 of 1 (latest)
Do you know why node2 was taken over? If you fix whatever issue was happening on node2, then you can do a "storage failover giveback -ofnode cluster01-02" to give the control of node2 back.
Once the giveback is complete, you can do a "net int revert *" to send all the LIFs back to their home node.
I have no idea why it was taken over, is there any way to diagnose it?
Connect via SSH to the SP of node2 and check in what condition it is.
system power status
system console
Check if the system is in the Loader.
If the cluster has support, simply call and let them handle it.
But according to the screenshot it looks like the version is quite old...
It shows that node2 has RPC: Port mapper failure - RPC: Timed out
I've contacted support and they say that there is possibilty of physical failure of controller of this node.
"event log show" is probably the best place to start.
this is the usual error. you need to check the console/SP/BMC of the second node as OG1 wrote
you probably just need to boot the second node and everything will fix itself after a few minutes 😄
If I try to reboot it shows the rpc port mapper failure
sadly
It is unable to list entries on node-2 because of port mapper failure
It truly may be physically broken
What is the status when you physically connect to console port of bad system OR SP and access console?
SP-NODE> system console
Is the node sitting at LOADER-B> ?
please look in the SP/BMC not in the cluster, as people suggested multiple times
yes it is sitting on loader b
Try
boot_ontap
that's what I see after boot_ontap, do You reckon any of this errors?
I will try replacing it
ONTAP 8.2.3P5 Cluster-Mode 
yeah I know 😅
yeah so that is fixed and now I'm waiting for new ram delivery