#root aggr flash pool failed 7-mode

1 messages · Page 1 of 1 (latest)

stiff veldt
#

Hello,

I have demo env which is 7-mode 8.2.5. configuration is active passive. Active node has flash pool (4 ssd) on root aggr. Yesterday ssd disks failed because of power outage(sas disk has no problem). Now root aggr failed giving error 'plex failed' .

I can not disable or remove ssd disk from aggr because aggr status failed.

I know it is looks like mission impossible 🙂 but maybe someone has an idea that i can try.

north trellis
#

There is no way to remove the SSDs from the aggr. it needs to be destroyed.

as far as the failure, i'm assuming it's a multi disk failure that cause it to go offline?

stiff veldt
#

I knew but maybe there is chance to do because it just flash pool disk not the data disk.

north trellis
#

got ya. Have you opened a case? or is this system out of support?

#

a power outage generally doesn't cause issues, unless there's something else going on internally to the system.

stiff veldt
#

Yes system out of support. I checkdd the problem is ssd firmware, there is problem with firmware i need update , i saw the bulletin. My last question is can i update disk firmware on another system (failed disk). If i can update the disk , the fail state resume or not ?

north trellis
#

Ohhhh. The ssd Uptime bug?

stiff veldt
#

Yes that one 😦

north trellis
stiff veldt
#

Yes i can open, that is the bulletin i saw . I will do that but in other controller (because original cont can not boot)then i will connect the disk as a shelf and update the disk but still wondering can i update failed the disk ?

north trellis
#

I don't believe so. could try an unfail it?
but the best chance is a support case per the kb/cb

stiff veldt
#

I think the same, i will try to do tomorrow and let you know. Thanks for the kind responses.

stiff veldt
#

Bad news... I could not update failed drives as expected. I tried to tun wafl_check but it is not supported

north trellis
#

dang.

#

anyway to get the controller re-instated to support?

#

otherwise.. can you rebuild and snapmirror back?

stiff veldt
#

I did not understand the reinstand , if i rebuild all the data will be gone

#

Because aggr failed i can not wafliron because it failed, only waflcheck will work but it removed from special menu

north trellis
#

reinstate support. if a controller is still supportable, (hasn't gone End of Life). you can get a quote to re-add netapp support.

stiff veldt
#

It is already end of life 😦

#

There is no alternative for wafl check i do not understand why

fierce root
#

If a command is unavailable it means it isn’t valid for the version/environment.