#Best way to alert on HA pair going down?
@maiden jewel Harvest currently does not collect this information. Please open an issue at the following link: https://github.com/NetApp/harvest/issues
We're not sure how to collect this information from ONTAP. Maybe someone in the #1062049169520476220 channel knows
@fathom linden It appears that this might be the HA pair down EMS? https://docs.netapp.com/us-en/ontap-ems-9131/ic-hainterconnectdown-events.html
As a customer who has been doing this for years, my recommendation would be to run AIQUM alongside nabox/harvest
AIQUM has nearly all the alerting capabilities you should ever require. There have been times where we've created our own rule set for our own prom/alert manager stack, such as RX/TX light levels for SFPs, where we've requested enhancements to AIQUM
We typically alert from AIQUM for hardware/software specifics, and then harvest becomes our performance alerts
thanks for sharing your experience @cosmic drum I'm curious if you've tried using Harvest's EMS collector? Maybe you haven't since AIQUM meets your needs? (which is totally fine. great even). The Harvest team is interested in any feedback you have if you used the EMS collector. Especially if you hit any gaps
They are trying to move away from AIQUM for .. reasons
Thanks Chris, I'm going to start down this road
This one could fire for reasons unrelated to both nodes actually being down, though
@maiden jewel I have taken down 1 node on a 2-node cluster. We may be able to detect HA down if Takeover Possible is false or - for all nodes?
Yeah that was one of my first routes I was going to investigate lol
Yes same here!
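For anyone following along, here is a rough sketch of the check suggested above (Takeover Possible is false or - for every node). It is only an illustration: the row layout, the `takeover_possible` key, and the `ha_down` helper are hypothetical, and it assumes the values have already been pulled out of ONTAP somehow, which Harvest does not do at this point in the thread.

```python
from typing import Iterable, Mapping

# Assumed spellings of "takeover is not possible". "-" is the value
# described above for the cluster with one node taken down.
NOT_POSSIBLE = {"false", "-"}


def ha_down(rows: Iterable[Mapping[str, str]]) -> bool:
    """Return True when no node reports that takeover is possible."""
    values = [row["takeover_possible"].strip().lower() for row in rows]
    # An empty result is treated as "unknown" rather than "down".
    return bool(values) and all(v in NOT_POSSIBLE for v in values)


# Hypothetical sample data, not real ONTAP output.
sample = [
    {"node": "node-01", "takeover_possible": "false"},
    {"node": "node-02", "takeover_possible": "-"},
]
print(ha_down(sample))
```

Whether "false or - for all" really maps to a degraded HA pair in every case is still the open question here, so treat this as a starting point for testing rather than a finished alert rule.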
@jovial estuary what have you found so far? My week has gotten away from me so I haven't done much yet
@maiden jewel I have not looked into it since then. Could you please open an issue at https://github.com/NetApp/harvest/issues
doesn't look like Johnathan has gotten an answer in the ontap channel yet #1139025538145599518 message
I forgot I posted there lol
I have not, but I'm happy to have a look and trial
@maiden jewel I have updated the issue with an approach
https://github.com/NetApp/harvest/issues/2315#issuecomment-1829942815
Please let us know your feedback.