[OmniOS-discuss] multipath problem when replacing a failed SAS drive
Kevin Swab
Kevin.Swab at colostate.edu
Wed Oct 30 17:37:46 UTC 2013
Hello,
I'm running OmniOS r151006p on the following system:
- Supermicro X8DT6 board, Xeon E5606 CPU, 48GB ram
- Supermicro SC847 chassis, 36 drive bays, SAS expanders, LSI 9211-8i
controller
- 34 x Toshiba 3T SAS drives MG03SCA300 in one pool w/ 16 mirrored sets
+ 2 hot spares
'mpathadm list lu' showed all drives as having two paths to the controller.
Yesterday, one of the drives failed and was replaced. The new drive is
only showing one path in mpathadm, and errors have started showing up
periodically in /var/adm/messages:
# mpathadm list lu /dev/rdsk/c1t5000039478CA7150d0
mpath-support: libmpscsi_vhci.so
/dev/rdsk/c1t5000039478CA7150d0s2
Total Path Count: 1
Operational Path Count: 1
Oct 30 09:30:22 hagler scsi: [ID 243001 kern.warning] WARNING:
/pci at 0,0/pci8086,3410 at 9/pci1000,3020 at 0 (mpt_sas0):
Oct 30 09:30:22 hagler mptsas_handle_event_sync: IOCStatus=0x8000,
IOCLogInfo=0x31120101
Oct 30 09:30:22 hagler scsi: [ID 243001 kern.warning] WARNING:
/pci at 0,0/pci8086,3410 at 9/pci1000,3020 at 0 (mpt_sas0):
Oct 30 09:30:22 hagler mptsas_handle_event: IOCStatus=0x8000,
IOCLogInfo=0x31120101
Oct 30 09:30:22 hagler scsi: [ID 365881 kern.info]
/pci at 0,0/pci8086,3410 at 9/pci1000,3020 at 0 (mpt_sas0):
Oct 30 09:30:22 hagler Log info 0x31120101 received for target 89.
Oct 30 09:30:22 hagler scsi_status=0x0, ioc_status=0x804b, scsi_state=0xc
Oct 30 09:30:22 hagler scsi: [ID 243001 kern.warning] WARNING:
/pci at 0,0/pci8086,3410 at 9/pci1000,3020 at 0 (mpt_sas0):
Oct 30 09:30:22 hagler mptsas_handle_event_sync: IOCStatus=0x8000,
IOCLogInfo=0x31120101
Oct 30 09:30:22 hagler scsi: [ID 243001 kern.warning] WARNING:
/pci at 0,0/pci8086,3410 at 9/pci1000,3020 at 0 (mpt_sas0):
Oct 30 09:30:22 hagler mptsas_handle_event: IOCStatus=0x8000,
IOCLogInfo=0x31120101
Oct 30 09:30:22 hagler scsi: [ID 365881 kern.info]
/pci at 0,0/pci8086,3410 at 9/pci1000,3020 at 0 (mpt_sas0):
Oct 30 09:30:22 hagler Log info 0x31120101 received for target 89.
Oct 30 09:30:22 hagler scsi_status=0x0, ioc_status=0x804b, scsi_state=0xc
Oct 30 09:30:22 hagler scsi: [ID 243001 kern.warning] WARNING:
/pci at 0,0/pci8086,3410 at 9/pci1000,3020 at 0 (mpt_sas0):
Oct 30 09:30:22 hagler mptsas_handle_event_sync: IOCStatus=0x8000,
IOCLogInfo=0x31120101
Oct 30 09:30:22 hagler scsi: [ID 365881 kern.info]
/pci at 0,0/pci8086,3410 at 9/pci1000,3020 at 0 (mpt_sas0):
Oct 30 09:30:22 hagler Log info 0x31120101 received for target 89.
Oct 30 09:30:22 hagler scsi_status=0x0, ioc_status=0x804b, scsi_state=0xc
Oct 30 09:30:22 hagler scsi: [ID 243001 kern.warning] WARNING:
/pci at 0,0/pci8086,3410 at 9/pci1000,3020 at 0 (mpt_sas0):
Oct 30 09:30:22 hagler mptsas_handle_event: IOCStatus=0x8000,
IOCLogInfo=0x31120101
Oct 30 09:30:22 hagler scsi: [ID 243001 kern.warning] WARNING:
/pci at 0,0/pci8086,3410 at 9/pci1000,3020 at 0 (mpt_sas0):
Oct 30 09:30:22 hagler mptsas_handle_event_sync: IOCStatus=0x8000,
IOCLogInfo=0x31120101
Oct 30 09:30:22 hagler scsi: [ID 365881 kern.info]
/pci at 0,0/pci8086,3410 at 9/pci1000,3020 at 0 (mpt_sas0):
Oct 30 09:30:22 hagler Log info 0x31120101 received for target 89.
Oct 30 09:30:22 hagler scsi_status=0x0, ioc_status=0x804b, scsi_state=0xc
Oct 30 09:30:22 hagler scsi: [ID 243001 kern.warning] WARNING:
/pci at 0,0/pci8086,3410 at 9/pci1000,3020 at 0 (mpt_sas0):
Oct 30 09:30:22 hagler mptsas_handle_event: IOCStatus=0x8000,
IOCLogInfo=0x31120101
The error messages refer to target 89, which I can confirm corresponds
to the missing path for my replacement drive using "lsiutil":
# lsiutil -p 1 16
LSI Logic MPT Configuration Utility, Version 1.63, June 4, 2009
1 MPT Port found
Port Name Chip Vendor/Type/Rev MPT Rev Firmware Rev IOC
1. mpt_sas0 LSI Logic SAS2008 03 200 0d000100 0
SAS2008's links are 6.0 G, 6.0 G, 6.0 G, 6.0 G, 6.0 G, 6.0 G, 6.0 G, 6.0 G
B___T SASAddress PhyNum Handle Parent Type
[ ... cut ... ]
0 89 5000039478ca7152 17 0059 0032 SAS Target
0 90 5000039478ca7153 17 005a 000a SAS Target
[ ... cut ... ]
When I ask "lsiutil" to rescan the bus, I see the following error when
it gets to target 89:
# lsiutil -p 1 8
LSI Logic MPT Configuration Utility, Version 1.63, June 4, 2009
1 MPT Port found
Port Name Chip Vendor/Type/Rev MPT Rev Firmware Rev IOC
1. mpt_sas0 LSI Logic SAS2008 03 200 0d000100 0
SAS2008's links are 6.0 G, 6.0 G, 6.0 G, 6.0 G, 6.0 G, 6.0 G, 6.0 G, 6.0 G
B___T___L Type Vendor Product Rev
[ ... cut ... ]
ScsiIo to Bus 0 Target 89 failed, IOCStatus = 004b (IOC Terminated)
0 90 0 Disk TOSHIBA MG03SCA300 0108 5000039478ca7153
17
[ ... cut ... ]
This problem has happened to me once before on a similar system. At
that time, I tried reseating the drive, and tried several different
replacement drives, all had the same issue. I even tried rebooting the
system and that didn't help.
Does anyone know how I can clear this issue up? I'd be happy to provide
any additional information that might be helpful,
TIA,
Kevin
--
-------------------------------------------------------------------
Kevin Swab UNIX Systems Administrator
ACNS Colorado State University
Phone: (970)491-6572 Email: Kevin.Swab at ColoState.EDU
GPG Fingerprint: 7026 3F66 A970 67BD 6F17 8EB8 8A7D 142F 2392 791C
More information about the OmniOS-discuss
mailing list