A couple of weeks ago we noticed that our nabox instance all stopped actively reporting for all the clusters it monitored. Not sure where the best place to go is to start looking at logs but we currently have 0 actively reporting clusters when we should have 5. We restart nabox and it all briefly works again for an hour or so. Not sure if this is a bug or if it would help to provide some version numbers. Running nabox 3.1.2.
We did recently before add another cluster in running 9.12 in to monitoring, wondering if this has pushed it over the edge in terms of collecting metrics? or the new cluster is causing the rest to crash completely. Just an observation. Attached an image which shows an alert we also get strangely, though it does collect metrics albeit in briefly for an hour.
Tia