DRAM Parity Errors or SDRAM ECC Errors on Sun StorEdge 3310, Sun StorEdge 3510 or 3511 FC Arrays May Cause File System Integrity Issue
Category: Data Loss
Release Phase: Resolved
Product: Sun StorageTek 3310 SCSI Array, Sun StorageTek 3510 FC Array, Sun StorageTek 3511 SATA Array
Bug Id: 5022022
Date of Workaround Release: 03-AUG-2004
Date of Resolved Release: 30-AUG-2005
Impact
In rare cases, multi-bit DRAM parity errors or multi-bit SDRAM ECC errors on Sun StorEdge 3310, Sun StorEdge 3510 or 3511 FC arrays may cause loss of file system integrity.
Note: The probability of this issue occurring is very low.
Contributing Factors
This issue can occur on the following platforms:
- Sun StorEdge 3310 array without patch 113722-11
- Sun StorEdge 3510 array without patch 113723-10
- Sun StorEdge 3511 FC array without patch 113724-04
This issue can occur when the controller firmware fails to distinguish between single-bit ECC errors and multi-bit ECC errors. A single-bit ECC error is recoverable, while a multi-bit ECC error is not. Because the controller appears to continue working normally even after a multi-bit error, file system integrity can be lost.
Note: This issue can occur only when SE3310, SE3510 or SE3511 arrays encounter multi-bit DRAM parity errors or multi-bit SDRAM ECC errors. Single-bit parity or ECC errors do not trigger this issue.
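The difference between a recoverable single-bit error and an unrecoverable multi-bit error can be illustrated with a generic textbook SECDED (single error correct, double error detect) code. The sketch below is only an illustration of the principle: it uses an extended Hamming (8,4) code, which is an assumption made for brevity, and it does not represent the ECC scheme or firmware logic of the array controllers. The function names encode and check are hypothetical.

```python
def encode(nibble):
    """Encode 4 data bits as an 8-bit extended Hamming (SECDED) codeword.

    Index 0 holds the overall parity bit; indexes 1..7 are the usual
    Hamming(7,4) positions, with parity bits at positions 1, 2 and 4.
    """
    d = [(nibble >> i) & 1 for i in range(4)]
    code = [0] * 8
    code[3], code[5], code[6], code[7] = d
    code[1] = code[3] ^ code[5] ^ code[7]   # covers positions 1, 3, 5, 7
    code[2] = code[3] ^ code[6] ^ code[7]   # covers positions 2, 3, 6, 7
    code[4] = code[5] ^ code[6] ^ code[7]   # covers positions 4, 5, 6, 7
    code[0] = sum(code[1:]) % 2             # makes the total parity even
    return code


def check(code):
    """Classify a codeword as 'ok', 'single' or 'multi'."""
    syndrome = 0
    for pos in range(1, 8):
        if code[pos]:
            syndrome ^= pos
    overall = sum(code) % 2                 # 0 when overall parity holds
    if syndrome == 0 and overall == 0:
        return "ok"
    if overall == 1:
        # Overall parity is broken: one flipped bit, located by the syndrome
        # (a syndrome of 0 means the overall parity bit itself flipped).
        return "single"
    # Overall parity holds but the syndrome is nonzero: two bits flipped.
    # The error is detected but cannot be corrected.
    return "multi"
```

A handler that treats the "multi" result the same way as "single" would keep running on corrupted data, which is the kind of failure described above.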
Symptoms
Should the described issue occur, DRAM parity error messages similar to the following are logged in the SCCLI event log:
[0104] #4287: StorEdge Array SN#xxxxxxx Controller ALERT: DRAM parity error detected
SDRAM error messages similar to the following are logged in the "/var/adm/messages" file:
Mar 19 18:30:23 SUNWscsdMonitor[628]: [ID 298706 daemon.error]
[SUNWscsd 0x10B1D0D: Critical] <rctrl6003> Controller Event, SDRAM Error.
Likely controller error. If error persists, replace defective controller.
(Primary, Fri Mar 19 18:23:36 2004) {SN#00369b}
.
.
Mar 20 04:21:20 SUNWscsdMonitor[628]: [ID 739865 daemon.error]
[SUNWscsd 0x10B1D0D: Critical] <rctrl6003> Controller Event, SDRAM Error.
Likely controller error. If error persists, replace defective controller.
(Primary, Sat Mar 20 04:14:25 2004) {SN#00369b}
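As a rough aid for spotting these symptoms, the following sketch scans log lines for the two message fragments shown above. The patterns are copied from the sample messages only, so real messages may be worded differently, and the function name find_memory_errors is hypothetical. A match shows only that a memory error was logged; it does not by itself show whether the error was single-bit or multi-bit.

```python
import re

# Fragments taken from the sample messages above; treat them as assumptions.
PATTERNS = [
    re.compile(r"DRAM parity error detected"),
    re.compile(r"Controller Event, SDRAM Error"),
]


def find_memory_errors(lines):
    """Return (line_number, text) pairs for lines matching a known fragment."""
    hits = []
    for number, line in enumerate(lines, start=1):
        if any(pattern.search(line) for pattern in PATTERNS):
            hits.append((number, line.rstrip("\n")))
    return hits
```

The function accepts any iterable of lines, for example an open file object for "/var/adm/messages" or a saved copy of the SCCLI event log.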
Workaround
There is no workaround for this issue. Please see the Resolution section below.
Resolution
This issue is addressed on the following platforms:
- Sun StorEdge 3310 with patch 113722-11 or later
- Sun StorEdge 3510 with patch 113723-10 or later
- Sun StorEdge 3511 (FC) with patch 113724-04 or later
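The "or later" comparison in the list above can be sketched as follows. This is a minimal illustration that assumes the installed patch ID is already known as a string of the form "113722-11"; how that ID is obtained from an array is not covered here. The names MINIMUM_PATCH, parse_patch and is_resolved are hypothetical.

```python
# Minimum patch revisions taken from the Resolution list above.
MINIMUM_PATCH = {
    "3310": (113722, 11),
    "3510": (113723, 10),
    "3511": (113724, 4),
}


def parse_patch(patch_id):
    """Split a patch ID such as '113722-11' into (base, revision) integers."""
    base, revision = patch_id.strip().split("-")
    return int(base), int(revision)


def is_resolved(model, installed_patch):
    """Return True if the installed patch meets the minimum for the model."""
    required_base, required_revision = MINIMUM_PATCH[model]
    base, revision = parse_patch(installed_patch)
    # Only a revision of the same patch base can satisfy the requirement.
    return base == required_base and revision >= required_revision
```

For example, is_resolved("3310", "113722-09") returns False, while is_resolved("3510", "113723-12") returns True.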
Modification History
Date: 14-JAN-2004
- Add point patch for SSE 3510 to Relief/Workaround section
Date: 30-AUG-2005
- Update Contributing Factors and Resolution sections
Attachments
This solution has no attachment