#No Metric after ONTAP upgrade

1 messages · Page 1 of 1 (latest)

thorn yacht
#

Hi all,
Does anyone has experience that the NABox does not collect any metrics for 24h after an ONTAP upgrade? Interesting is that we have multiple NABoxes in place but only one is showing this behavior.

snow rampart
hallow thunder
#

What I've seen happen is that nabox loses connectivity to the cluster. In current releases, it retries. In previous releases, it wouldn't even do that.
I always try to remember and check nabox following a cluster upgrade. If it's not capturing data, restart nabox.

thorn yacht
snow rampart
dim folio
#

I know this has been addressed but is there some specific reason why this couldn't be a tunable parameter and set to a default and overridden if set in the main config file?

snow rampart
#

@dim folio Are you suggesting that we need to adjust the value of this parameter, the retry interval in case of API was rejected, to something other than one hour? We have various retry intervals for different set of errors.

rancid musk
#

@snow rampart NAbox uses the poller binary directly, if it dies, there is no mechanism to restart it currently. This might be a problem during upgrades for example. I’ll see if I can improve this. I’m assuming retry in harvest is implemented by harvest binary ?

snow rampart
#

@rancid musk Yes, the current discussion regarding retry is related to retrying the collector in the event of errors. It is a separate use case from the scenario where the poller binary terminates unexpectedly.

rancid musk
#

Cool, so in that scenario, the poller doesn't quit ?