Hi All, I’ve got a problem with harvest data collection. The pollers themselves are starting and the first period looks like on the Grafana dashboard everything is green regarding the pollers and the collectors as well. After some time, the status of the Zapi collectors is going to standby and not pushing the data to the influxDB anymore. Sometimes the pollers behave also weirdly, usually starting, but there is a scenario where a couple of the pollers are not starting and I need to do the service restart until all the pollers are not running. I've attached some snippets from the poller log regarding the errors. We have different kind of errors and our log file is totally flooded with those. Last time I restarted the services and the Zapi collectors were working fine for a couple of hours and after that, they just stopped again.
Environment details:
Host systems OS - Red Hat Enterprise Linux release 8.7 (Ootpa)
Harvest version - harvest-22.08.0-1.x86_64
ONTAP - 9.9.1P13
InfluxDB 2.6.1
Grafana 9.2.10
Changes regarding the environment within 2 weeks
OS update from 8.5 to 8.7
Harvest upgrade from harvest-22.02.0-4.x86_64 to harvest-22.08.0-1.x86_64
switched the login method to user/pw authentication from a self-signed certificate, the user has read-only permission on the ONTAP.
I think the harvest configuration files are good as the services are starting and operating for a while, but please correct me if wrong.
Your support will be appreciated.