Here are some updates after been quiet for a while:
By checking the nodes utilizations, the A400 appeared to have higher numbers than FAS8200, (83% on avg and 100% on 95th%) versus (67% on avg and 88% on 95th%), which seems the cause of why the A400 here cannot provide more IOPS than the FAS8200. So, the util to me is a main factor overall to determine how high the IOPS that a node can go. It could be dyanmic as the utilization went up and down. So, we cannot give out a number as the maximum IOPS any platform can provide, if I can make a statement like this here?
Why the util of the A400 with 20 CPU cores went up higher than the FAS8200 with 16 CPU cores( and also A400 has more memories than FAS8200)? I guess this was because the A400 was much more over loaded, more running activities than the FAS8200 although the fomer one has more powerful capacity(type of disks and CPU's #), depspite they are running similar types of workloads(VMware NFS Datastores and NFS volumes).
We configured total of 166TB SSD as only one aggregate onto the A400. Now, the node has been saturated (reached 100%) when only 66% of the disks capacity has been and can be used at most. Apparently, when we set it up at beginning, we configured too much space the node can handle now , but too late to recongnize the issue since It is not easy to reduce the size of the aggregate already in use.
With those being said, then my question is, how can I determine what is the appropriate amount of disks space should I configure on a particiluar platform even before I start to put data on it?