Example:
1/27/2025 02:21:08 s01-01 DEBUG disk.IO.status: deviceName="0a.02.16", ETime="45", cdb="0x8f:000000014f352800:00000400", victimRetryCount="0", retryCount="0", timeoutRetryCount="0", pathRetryCount="0", adapterStatus="0x0", targetStatus="0x2", sSenseKey="SCSI:recovered error", sSenseCode="", iSenseKey="0x1", iASC="0xb", iASCQ="0x96", pathsTried="1", basicTimeout="10", returnCode="5", disk_information="Disk 0a.02.16 Shelf 2 Drawer 2 Slot 4 Bay 16 [NETAPP X318_HARHE08TA07 NA01] S/N [XXXXXXXX] UID [5000CCA2:5480459C:00000000:00000000:00000000:00000000:00000000:00000000:00000000:00000000]"
1/27/2025 02:21:08 s01-01 INFORMATIONAL disk.ioRecoveredError.retry: Recovered error on disk 0a.02.16: op 0x8f:000000038e352800:00000400 sector 10 SCSI:recovered error - Disk used internal retry algorithm to obtain data (1 b 96 96) (45) Disk 0a.02.16 Shelf 2 Drawer 2 Slot 4 Bay 16 [NETAPP X318_HARHE08TA07 NA01] S/N [XXXXXXXX] UID [5000CCA2:5480459C:00000000:00000000:00000000:00000000:00000000:00000000:00000000:00000000]
We get this on a few disks, seemingly at random. The disks are not failed or anything, and if we show the event log without "priv set diag", the information is not shown. I can see it has been happening for a while. The question is whether we need to preventively fail the affected disks, or wait until ONTAP does it for us?
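To check whether the errors really are random or concentrated on particular disks, something like the following could tally them per disk. This is only a rough sketch: it assumes the event log has been saved as plain text with one event per line, in the same format as the example above, and the function name `count_recovered_errors` is made up for illustration.

```python
import re
from collections import Counter

# Matches the disk name in lines such as
# "... disk.ioRecoveredError.retry: Recovered error on disk 0a.02.16: op ..."
# (format assumed from the example log entry in this post).
RECOVERED = re.compile(
    r"disk\.ioRecoveredError\.retry: Recovered error on disk (\S+?):"
)


def count_recovered_errors(lines):
    """Return a Counter mapping disk name to number of recovered-error events."""
    counts = Counter()
    for line in lines:
        match = RECOVERED.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts
```

Feeding it the lines of the saved log and looking at `counts.most_common()` should show whether a handful of disks account for most of the events.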