Sun StorEdge 3510 Arrays May Mark Disks as "bad" After Reporting Disk Errors

| Category : | Availability, Data Loss |
| Release Phase : | Resolved |
| Product : | Sun StorageTek 3510 FC Array |
| Bug Id : | 6357118 |
| Date of Workaround Release : | 28-Apr-2006 |
| Date of Resolved Release : | 28-Mar-2008 |

One or more disk drive(s) may become disabled and the logical drive may transition to a "Fatal Fail" status. (see below for details)
1. Impact
One or more disk drive(s) may become disabled and the logical drive may transition to a "Fatal Fail" status. It is possible that cached data may not be written to the logical drive. If this occurs, pending write cache contents may be lost when the array is reset or power cycled.
If the array is running 4.15F firmware, "Cache purged" messages will be logged. For previous firmware versions, cache contents may be lost without notification.
2. Contributing Factors
This issue can occur on the following platform:
SPARC Platform
- Sun StorEdge 3510 FC array, for all current releases of controller firmware
3. Symptoms
If the described issue occurs, one or more disks may be disabled, perhaps in quick succession, especially under heavy I/O load. If the array is running firmware 4.15F, there may be "0B/47" SCSI parity error messages in the event log. For previous firmware versions, there are no specific error messages that identify this issue.
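As an illustration of checking for these symptoms, the following minimal sketch scans event-log text that has already been saved to a file or list of lines. Only the "0B/47" and "Cache purged" strings come from this Sun Alert. The sample log lines and the names `PATTERNS` and `scan_event_log` are hypothetical, and the real event-log layout may differ.

```python
import re
from collections import Counter

# The two search strings are taken from this Sun Alert. Everything else
# about the log layout is an assumption made for illustration.
PATTERNS = {
    "parity_error": re.compile(r"0B/47"),
    "cache_purged": re.compile(r"cache purged", re.IGNORECASE),
}


def scan_event_log(lines):
    """Count the lines that match each pattern of interest."""
    counts = Counter()
    for line in lines:
        for name, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[name] += 1
    return counts


if __name__ == "__main__":
    # Hypothetical sample lines, not real array output.
    sample = [
        "event A: SCSI parity error (0B/47) reported for a disk",
        "event B: Cache purged",
        "event C: unrelated message",
    ]
    for name, count in scan_event_log(sample).items():
        print(f"{name}: {count}")
```

A non-zero `parity_error` count on 4.15F firmware is only a hint that this issue may be present. On earlier firmware, an empty result says nothing, because no specific messages are logged.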
4. Workaround
For array firmware 4.15F:
On Sun StorEdge 3510 FC arrays with firmware 4.15F, an array reset may clear this issue. After a proper array shutdown and reset, the transient error condition that disturbed the disk drive loop may no longer be present. In that case, if the disks themselves are good and the error was transient in nature, the disks can again participate in array operations. The documented procedure can then be followed to force the logical drive to become available.
Note: Appropriate care should be taken to verify data consistency if the "Cache purged" message was logged.
For additional details on recovering a logical drive from a "Fatal Fail" state, see the "Sun StorEdge 3000 Family Installation, Operation, and Service Manual" and reference section 8.5 "Recovering From Fatal Drive Failure".
***IMPORTANT NOTE***
For array firmware prior to 4.15F:
The "Cache purged" warning message, logged upon array shutdown and reset, is only available in firmware 4.15F. Therefore, for firmware versions prior to 4.15F, data consistency must be checked for any logical drive that has been recovered from a "Fatal Fail" state. If the cache was set to "write back" mode, pending write cache data may have been lost without any warning message.
Note: Array users should regularly monitor their arrays for messages in the "persistent event log" and take action to replace any faulty components.
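The guidance above for the two firmware cases can be summarized as a small decision helper. This is a sketch under stated assumptions, not a Sun-provided tool: the function names are hypothetical, the version parsing assumes strings shaped like "4.15F" (major, minor, optional letter), and firmware later than 4.15F is treated the same as 4.15F, which this Sun Alert does not address.

```python
import re


def parse_firmware(version):
    """Split a version such as '4.15F' into (major, minor, letter).

    The 'major.minor[letter]' shape is an assumption for illustration.
    """
    match = re.fullmatch(r"(\d+)\.(\d+)([A-Z]?)", version.strip().upper())
    if match is None:
        raise ValueError(f"unrecognized firmware version: {version!r}")
    major, minor, letter = match.groups()
    return int(major), int(minor), letter


def needs_consistency_check(firmware, recovered_from_fatal_fail, cache_purged_logged):
    """Return True when this alert calls for a data consistency check."""
    if parse_firmware(firmware) < parse_firmware("4.15F"):
        # Prior to 4.15F no "Cache purged" message exists, so every logical
        # drive recovered from "Fatal Fail" must be checked.
        return recovered_from_fatal_fail
    # 4.15F (and, by assumption, later): rely on the logged message.
    return cache_purged_logged


if __name__ == "__main__":
    print(needs_consistency_check("4.13C", True, False))
    print(needs_consistency_check("4.15F", True, False))
```

Treating the absence of a "Cache purged" message as sufficient on 4.15F follows the wording of the note above. A more conservative site policy could check data consistency after every recovery from "Fatal Fail".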
5. Resolution
There are no further updates planned for this Sun Alert document. If
you need additional assistance regarding this issue, please contact Sun
Services.
This Sun Alert notification is being provided to you on an "AS IS"
basis. This Sun Alert notification may contain information provided by
third parties. The issues described in this Sun Alert notification may
or may not impact your system(s). Sun makes no representations,
warranties, or guarantees as to the information contained herein. ANY
AND ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING WITHOUT LIMITATION
WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE, OR
NON-INFRINGEMENT, ARE HEREBY DISCLAIMED. BY ACCESSING THIS DOCUMENT YOU
ACKNOWLEDGE THAT SUN SHALL IN NO EVENT BE LIABLE FOR ANY DIRECT,
INDIRECT, INCIDENTAL, PUNITIVE, OR CONSEQUENTIAL DAMAGES THAT ARISE OUT
OF YOUR USE OR FAILURE TO USE THE INFORMATION CONTAINED HEREIN. This
Sun Alert notification contains Sun proprietary and confidential
information. It is being provided to you pursuant to the provisions of
your agreement to purchase services from Sun, or, if you do not have
such an agreement, the Sun.com Terms of Use. This Sun Alert
notification may only be used for the purposes contemplated by these
agreements.
Copyright 2000-2008 Sun Microsystems, Inc., 4150 Network Circle, Santa Clara, CA 95054 U.S.A. All rights reserved.
Modification History
28-Mar-2008: Resolved