# How can I find the cause of snapshot growth? Is there a way to find out which qtree is causing it?

fresh belfry
#

As above?

icy eagle
#

Hi @fresh belfry, let me check. While I look, does the Volume dashboard > Per Volume Statistics help at all? I think it shows growth by volume and SVM but does not include qtrees. Does the Qtree dashboard have what you need?

fresh belfry
#

Unfortunately not, mate. I need to find the busy qtree that's causing the snapshot growth.

icy eagle
#

Yeah, I'm not seeing much; qtree-list-iter doesn't have anything snapshot related. We found some 7-mode commands that show snapshot copies associated with qtrees, but we haven't found anything for cmode yet. Still looking. @restive belfry, do you have any ideas?

#

Maybe the Qtree dashboard helps? It shows the top qtrees by disk used, which isn't quite the same as growth but might point you in the right direction.

restive belfry
#

That might work.

#

Or is there a per qtree dashboard?

#

If you have a perf archive we could pull out the qtree stats to see which one wrote the most.

fresh belfry
#

Then I would need to know the qtree to blame

restive belfry
#

No, just a volume.

icy eagle
#

you can filter that dashboard by volume

fresh belfry
#

So basically pin the blame on the most-used qtree with the most ops?

icy eagle
#

perhaps something like rate(quota_disk_used{labels here}[5m])

#

would this help @fresh belfry?

fresh belfry
#

That would be perfect

icy eagle
#

Clone the "Top $TopResources Qtrees by Disk Used" panel and change the query to rate(quota_disk_used{datacenter=~"$Datacenter", cluster=~"$Cluster", svm=~"$SVM", volume=~"$Volume"}[5m]). Then change Standard options / Unit to kibibytes/sec.
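
For anyone who wants to try the same expression outside Grafana, here is a minimal Python sketch. It assumes a Prometheus server that scrapes the Harvest metrics; the base URL and the label values in the example are hypothetical placeholders, and `/api/v1/query` is Prometheus' standard instant-query endpoint.

```python
import json
import urllib.parse
import urllib.request


def build_growth_query(filters, window="5m"):
    """Return a PromQL rate() expression over quota_disk_used.

    filters maps a label name to a regex, mirroring the dashboard
    variables ($Datacenter, $Cluster, $SVM, $Volume).
    """
    matchers = ", ".join(f'{label}=~"{regex}"' for label, regex in filters.items())
    return f"rate(quota_disk_used{{{matchers}}}[{window}])"


def run_instant_query(base_url, query):
    """Send the expression to Prometheus and return the result list."""
    url = f"{base_url}/api/v1/query?" + urllib.parse.urlencode({"query": query})
    with urllib.request.urlopen(url, timeout=10) as resp:
        return json.load(resp)["data"]["result"]


# Hypothetical label values; substitute your own cluster and volume.
example_query = build_growth_query({"cluster": "cluster1", "volume": "vol1"})
print(example_query)
# To run it against a real server (URL is an assumption):
# results = run_instant_query("http://localhost:9090", example_query)
```

One caveat: rate() is designed for counters. If quota_disk_used behaves as a gauge that can also shrink, deriv() over the same range selector may be worth comparing.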

#

I'll open a PR with that change

icy eagle
fresh belfry
#

When you said try did you mean try that query?

hearty pumice
icy eagle
# fresh belfry When you said try did you mean try that query?

I shared the PR so you could import the dashboard if you wanted to try it out. The query is slightly different from what I pasted above because I tied the "qtrees by disk used growth" panel to the topN variable, so you can focus on only the top qtrees instead of all of them. That's why the query in the PR looks like rate(quota_disk_used{datacenter=~"$Datacenter", cluster=~"$Cluster", svm=~"$SVM", volume=~"$Volume", qtree=~"$TopQtreeDiskUsedGrowth"}[5m]) instead of rate(quota_disk_used{datacenter=~"$Datacenter", cluster=~"$Cluster", svm=~"$SVM", volume=~"$Volume"}[5m]).
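
To illustrate the idea behind limiting the panel to the top N qtrees by growth, here is a rough Python sketch. It is not how the PR or Grafana defines the variable; the qtree names and sample values are made up, and the growth figure is a simple first-to-last slope rather than Prometheus' exact rate() calculation.

```python
def per_second_growth(samples):
    """Average growth per second for one qtree.

    samples is a list of (unix_seconds, disk_used) pairs, oldest first.
    """
    (t0, v0), (t1, v1) = samples[0], samples[-1]
    return (v1 - v0) / (t1 - t0)


def top_growing_qtrees(series, n):
    """Return the names of the n qtrees with the highest growth."""
    growth = {qtree: per_second_growth(s) for qtree, s in series.items()}
    return sorted(growth, key=growth.get, reverse=True)[:n]


# Hypothetical samples taken 300 seconds apart.
series = {
    "q_home": [(0, 1000), (300, 1600)],
    "q_logs": [(0, 500), (300, 3500)],
    "q_tmp": [(0, 200), (300, 200)],
}
print(top_growing_qtrees(series, 2))  # ['q_logs', 'q_home']
```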

If you want to import from the PR, use the link Rahul shared above. Copy that JSON, then in Grafana click Dashboard > Manage > Import and paste it into the "Import via panel json" text area.
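
If you would rather script the import than click through the UI, a minimal Python sketch follows. It assumes Grafana's HTTP API endpoint `POST /api/dashboards/db` and an API token with permission to create dashboards; the URL and token are hypothetical. Dashboards that carry datasource placeholders may still need the UI import so that Grafana can prompt for the datasource.

```python
import json
import urllib.request


def build_import_payload(dashboard_json_text, overwrite=False):
    """Wrap dashboard JSON in the body expected by /api/dashboards/db."""
    dashboard = json.loads(dashboard_json_text)
    dashboard["id"] = None  # a null id asks Grafana to create a new dashboard
    return {"dashboard": dashboard, "overwrite": overwrite}


def post_dashboard(base_url, token, payload):
    """POST the payload to Grafana; base_url and token are assumptions."""
    req = urllib.request.Request(
        f"{base_url}/api/dashboards/db",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )
    with urllib.request.urlopen(req, timeout=10) as resp:
        return json.load(resp)


# Stand-in for the JSON copied from the PR.
payload = build_import_payload('{"id": 42, "title": "Qtree"}')
print(payload["dashboard"]["title"])
```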

fresh belfry
#

Thank you. What's the default Grafana user? 😂

icy eagle