For Sun StorEdge 3310, Running SSCS and sccli(1M) In-Band at the Same Time May Cause SCSI Errors
Category: Availability
Release Phase: Resolved
Product: Sun StorageTek 3310 SCSI Array
Bug Id: 5024472
Date of Workaround Release: 10-MAY-2004
Date of Resolved Release: 13-SEP-2005
Impact
On Sun StorEdge 3310, running Sun StorEdge Configuration Service (SSCS) in-band and sccli(1M) in-band at the same time may cause SCSI errors on the attached server. These errors can disrupt I/O, and the server may require a reboot.
Note: SSCS is a graphical means of managing the SE3XXX family of products. sccli(1M) is the Command Line Interface (CLI) means of managing the same products.
Contributing Factors
This issue can occur on the following platform:
SPARC Platform
- Sun StorEdge 3310 SCSI Array without firmware 4.13b (as delivered in patch 113722-11)
with the following:
- Sun StorEdge Configuration Service (SSCS) and sccli(1M)
Note: SSCS, sccli(1M) and the glm(7D) driver are not Solaris version dependent.
This issue occurs when the following two conditions are met:
1. At least the following two daemons are running on the monitoring server. This can be verified with the following command:
$ ps -ef | grep ss
root 370 1 0 08:08:21 ? 0:05 /usr/sbin/ssserver
root 374 1 0 08:08:21 ? 0:36 /usr/sbin/ssmon
2. In-band operation has been selected for the GUI (SSCS). This can be verified with the following command:
% grep "PRIAGENT_OVER_INBAND=" /var/opt/SUNWsscs/ssagent/ssagent.cfg
PRIAGENT_OVER_INBAND=0
Note: 0 = in-band selected (default); 1 = out-of-band selected
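The two checks above can be combined in a small script. The following is a minimal sketch, assuming a Bourne-compatible shell; the function names sscs_mode and sscs_daemons_running are hypothetical and are not part of SSCS or sccli(1M). The daemon paths and the configuration key are the ones shown above.

```shell
# Hypothetical helpers; not shipped with SSCS or sccli(1M).

# Prints "in-band", "out-of-band", or "unknown" for the ssagent.cfg file in $1.
sscs_mode() {
    value=`sed -n 's/^PRIAGENT_OVER_INBAND=//p' "$1" 2>/dev/null | head -n 1`
    case "$value" in
        0) echo "in-band" ;;      # 0 = in-band selected (default)
        1) echo "out-of-band" ;;  # 1 = out-of-band selected
        *) echo "unknown" ;;      # key missing, file unreadable, or other value
    esac
}

# Reads "ps -ef" output on standard input.
# Succeeds only if both ssserver and ssmon are listed.
sscs_daemons_running() {
    ps_out=`cat`
    echo "$ps_out" | grep '/usr/sbin/ssserver' >/dev/null &&
    echo "$ps_out" | grep '/usr/sbin/ssmon' >/dev/null
}
```

For example, `ps -ef | sscs_daemons_running && sscs_mode /var/opt/SUNWsscs/ssagent/ssagent.cfg` prints the selected mode only when both daemons are listed. The issue described here applies when the result is in-band and sccli(1M) is also being run in-band.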
Symptoms
Should the described issue occur, messages similar to the following will be logged to the "/var/adm/messages" file:
Oct 20 13:40:00 v4u-v240a glm: [ID 655122 kern.warning] WARNING:
ID[SUNWpd.check_intcode.6006]
Oct 20 13:40:00 v4u-v240a scsi: [ID 107833 kern.warning] WARNING:
/pci@1c,600000/scsi@2,1 (glm1):
Oct 20 13:40:00 v4u-v240a Resetting scsi bus, data overrun: got too
much data from target from (4,0)
Oct 20 13:40:00 v4u-v240a genunix: [ID 408822 kern.info] NOTICE:
glm1: fault detected in device; service still available
Oct 20 13:40:00 v4u-v240a genunix: [ID 611667 kern.info] NOTICE:
glm1:Resetting scsi bus, data overrun: got too much data from
target from (4,0)
Oct 20 13:40:00 v4u-v240a scsi: [ID 107833 kern.warning] WARNING:
/pci@1c,600000/scsi@2,1 (glm1):
Oct 20 13:40:00 v4u-v240a Target 4 reducing sync. transfer rate
Oct 20 13:40:00 v4u-v240a glm: [ID 923092 kern.warning] WARNING:
ID[SUNWpd.glm.sync_wide_backoff.6014]
Oct 20 13:40:00 v4u-v240a scsi: [ID 107833 kern.warning] WARNING:
/pci@1c,600000/scsi@2,1 (glm1):
Oct 20 13:40:00 v4u-v240a got SCSI bus reset
Oct 20 13:40:00 v4u-v240a genunix: [ID 408822 kern.info] NOTICE:
glm1: fault detected in device; service still available
Oct 20 13:40:00 v4u-v240a genunix: [ID 611667 kern.info] NOTICE:
glm1: got SCSI bus reset
Oct 20 13:40:20 v4u-v240a glm: [ID 655122 kern.warning] WARNING:
ID[SUNWpd.check_intcode.6006]
Oct 20 13:40:20 v4u-v240a scsi: [ID 107833 kern.warning] WARNING:
/pci@1c,600000/scsi@2,1 (glm1):
Oct 20 13:40:20 v4u-v240a Resetting scsi bus, data overrun: got too
much data from target from (4,0)
Oct 20 13:40:20 v4u-v240a genunix: [ID 408822 kern.info] NOTICE:
glm1: fault detected in device; service still available
Oct 20 13:40:20 v4u-v240a genunix: [ID 611667 kern.info] NOTICE:
glm1: Resetting scsi bus, data overrun: got too much data from
target from (4,0)
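To check a log for this symptom, one option is to count the lines that contain the data-overrun reset text. This is a minimal sketch: the function name count_overrun_resets is hypothetical, and the search string is taken from the sample messages above, so logs with different wording would not be matched.

```shell
# Hypothetical helper: prints the number of lines in the log file $1 that
# contain the data-overrun bus reset text from the sample messages.
# Note: grep -c prints 0 and returns a non-zero status when nothing matches.
count_overrun_resets() {
    grep -c 'Resetting scsi bus, data overrun' "$1"
}
```

For example, `count_overrun_resets /var/adm/messages` prints the number of matching lines in the system log. A non-zero count alone does not prove this issue is the cause; it should be read together with the two conditions listed under Contributing Factors.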
Workaround
To work around this issue, run either SSCS or sccli(1M) out-of-band. The "Sun StorEdge 3000 Family Configuration Service 1.5 User's Guide" (part number 817-3337-12), page 103, section "To Use Out-of-Band Management," describes how to configure out-of-band management.
Resolution
This issue is addressed on the following platform:
SPARC Platform
- Sun StorEdge 3310 SCSI Array with firmware 4.13b (as delivered in patch 113722-11 or later)
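The "113722-11 or later" condition can be expressed as a simple revision comparison. The sketch below is illustrative only: the function name patch_rev_at_least is hypothetical, it only compares a patch ID string supplied by the caller, and it does not query the array for its running firmware.

```shell
# Hypothetical helper: succeeds if $1 is patch 113722 at revision >= $2.
# Returns 2 if $1 is not of the form 113722-NN.
patch_rev_at_least() {
    case "$1" in
        113722-[0-9][0-9]) ;;
        *) return 2 ;;
    esac
    # Drop the patch base and at most one leading zero so that "08" is
    # compared as the decimal number 8.
    rev=`echo "$1" | sed 's/^113722-0\{0,1\}//'`
    [ "$rev" -ge "$2" ]
}
```

For example, `patch_rev_at_least 113722-12 11` succeeds, while `patch_rev_at_least 113722-09 11` fails.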
Modification History
Date: 13-SEP-2005
13-Sep-2005:
- Update Contributing Factors and Resolution sections
Attachments
This solution has no attachment