Hi,
I am trying to get the total ops for a cluster
and here are two metrics:
sum(volume_total_ops{datacenter=~"$Datacenter",cluster=~"$Cluster"}) by (cluster)
sum(qos_ops{datacenter=~"$Datacenter",cluster=~"$Cluster"}) by (cluster)
I assume that these two value would be similar?
And volume_total_ops might be a little higher
than qos_ops since it include some system ops.
Is this explanation correct?
But from user site,
sum(volume_total_ops{datacenter=~"$Datacenter",cluster=~"$Cluster"}) by (cluster) has the value of 19K
sum(qos_ops{datacenter=~"$Datacenter",cluster=~"$Cluster"}) by (cluster) has the value of 29K
There is a bug gap between this two value
Is it because qos_ops should exclude all workload with {policy_group="User-Best-Effort"}?