@elder kayak @covert kelp My NABOX has been acting weird lately, skipping pols and such. I increased the RAM to 16GB's, then it lost all data reporting "Templating: [Cluster]\n Error updating options: bad gateway". This is a little above me. Any crumbs you can throw my way? I'd prefer not to recreate and lost my data.
I looked at nabox-xxx logs. In nabox-harvest2, I see "API request rejected - => Counter collection is disabled, and reqest rejected => Instance name does not exist ...
#NABOX lost array config, shows Templating: bad gateway
1 messages · Page 1 of 1 (latest)
hi @nova pelican that sounds no good! Can you email your logs to ng-harvest-files@netapp.com and we'll take a look
The two logs?
Sorry, the nabox-api and nabox-harvest2?
yep or you can tgz them together like so https://netapp.github.io/harvest/nightly/help/log-collection/#nabox
Ok, it will take me a little time because this is a classified system, and I have to review and have someone else review then transfer... (fun)
understood
@covert kelp Chris, just uploaded, Win does not like ":" so used "-" in file name
hi @nova pelican still digging through your logs, but one thing that jumped out is there are three clusters with TLS problems. Are you using basic_auth or certificate_auth? Either way, can you double check with curl that you can talk to these clusters? I'll send their names in email. Something like curl --user $user:$pass --insecure 'https://$cluster/api/cluster?fields=version' replacing $user, $pass, and $cluster with your values. This is the error tls: failed to verify certificate: x509: certificate is not valid for any names, but wanted to match ...
@covert kelp "curl --user admin:PASSWD --insecure 'https://aces-mgmt/api/cluster?fields=version" replied with "
{
"version": {
"full": "NetApp Release 9.10.1P12: Thu Apr 13 00:00:59 UTC 2023",
"generati0on": 9,
"major": 10,
"minor": 1
|,
"_links: {
"self": {
"href": "/api/cluster"
}
}
}
@covert kelp Chris, I think we have a DNS problem. It takes about 15 seconds to resolve a DNS address. This probably has nothing to do with NABox. I'll let you know later. Thank you
thanks! as the saying goes, IT'S ALWAYS DNS, even when we think it isn't DNS - it's DNS 😄
Working on an email reply about the TLS issues
or storage... 😉
It looks like prometheus is acting up. No capacity issues ?
@elder kayak Yes, big capacity issues. I added 50G to lvdata, and all restarted. Except, I replaced four heads over the weekend from 9000's to 9500's. Is there a way to link the old 9000's with the new 9500's? I presume it may be node serial?
You replaced four heads over the weekend? 😁