We recently ran into this scenario:
- a node (AFF-A400) has only one production volume
- that volume is accessed via NFSv3 for reads and writes, mostly reads
- that volume is origin of 3 other flexcache volumes (load sharing)
- Node NFS read latency in Harvest and in OnTAP was about 30-100ms
- However, Volume NFS read latency was about 2-3ms in Harvest and about 30-100ms in OnTAP (volume NFS read latency was not reported correctly)
- High Node latency was due to CPU d-blade
- There was significant amount of remote retrieve traffic causing CPU bottleneck (at STRIPE - again only one volume on the node, CPU affinity)
Question to Harvest team - #5 is troublesome as Volume NFS read Latency was showing 2-3ms (within acceptable range), when in actuality it was 10x+ higher.
Harvest was reporting correct number of NFS read Ops, but significantly lower NFS read Latency !
What could cause Harvest to mis-calculate this latency?
Thank you.