[OmniOS-discuss] reboot hangs on 'rebooting...'
Tom Robinson
tom.robinson at motec.com.au
Tue Oct 8 03:11:14 UTC 2013
On 08/10/13 13:24, Narayan Desai wrote:
> I actually hadn't posted a full description of our systems. Here is prtconf output.
>
> The system is a:
> System Configuration: Supermicro X8DAH
> BIOS Configuration: American Megatrends Inc. 2.1 12/30/2011
> a pile of LSI 92xx controllers (mix of internal and external)
> Mellanox ConnectX 2 nic
> Intel gigabit (igb)
>
> Someone suggested that this might be some sort of ACPI problem at one point. Does that seem
> reasonable?
> -nld
>
I had some 'luck' when I disabled the following in the BIOS:
Advanced->ACPI Settings->ACPI Sleep State = Suspend Disabled
By 'luck' I mean it rebooted without hanging once. Subsequent reboots kept hanging on 'rebooting...'.
I then made the following BIOS config changes:
Advanced->CPU Power Management Configuration-> C1E Support = Disabled
CPU C3 Report = Disabled
CPU C6 Report = Disabled
CPU C7 Report = Disabled
But that didn't stop the reboot hang.
Would it have anything to do with NUMA?
t.
>
>
> On Mon, Oct 7, 2013 at 9:13 PM, Garrett D'Amore <garrett.damore at dey-sys.com
> <mailto:garrett.damore at dey-sys.com>> wrote:
>
>
> On Oct 7, 2013, at 6:08 PM, Narayan Desai <narayan.desai at gmail.com
> <mailto:narayan.desai at gmail.com>> wrote:
>
>> So, specifically wrt the mellanox cards, we have four identical systems with mellanox cards
>> running with the hermon driver. One is running OI 151a3, and reboots properly. The ones
>> running a recent omnios don't with fastboot. Just one data point.
>
>
> Oh, that's interesting. Would be interesting to see what's different. I doubt its the hermon
> driver. More suspicious of the Intel ethernet now.
>
> - Garrett
>
>> -nld
>>
>>
>> On Mon, Oct 7, 2013 at 7:10 PM, Garrett D'Amore <garrett.damore at dey-sys.com
>> <mailto:garrett.damore at dey-sys.com>> wrote:
>>
>>
>> On Oct 7, 2013, at 4:29 PM, Tom Robinson <tom.robinson at motec.com.au
>> <mailto:tom.robinson at motec.com.au>> wrote:
>>
>>> Hi Garret,
>>>
>>> Thanks for your message.
>>>
>>> The host configuration is as follows:
>>>
>>> Supermicro X9DRi-F
>>> 256GB RAM (16x Hynix 16GB ECC Reg. DDR3 1600MHz)
>>> 2 x Intel Xeon E5-2620
>>> 2 x Intel SSD 320 80GB (rpool)
>>> 4 x STEC Enterprise S842 200GB (ARC)
>>> 1 x STEC ZeusRAM 8GB 3.5" SAS SSD (ZIL)
>>> 1 x LSI SAS 9207-8i (internal drives)
>>> 1 x Intel Ethernet Server Adapter X520-DA2, Dual Port 10Gbps SFP+ Direct Attach Copper,
>>> PCI-e 2.0 5GT/s 1
>>> 2 x Mellanox ConnectX®-2 VPI
>>> 2 x LSI SAS 9207-8e (JBODS)
>>
>> I'd be suspicious of the Mellanox cards. Are these the hermon driver? It looks like
>> there is an attempt to do the right thing for those drivers, but… I don't know if I
>> believe it all works properly. The other cards should be fine.
>>
>> - Garrett
>>
>>>
>>> That is connected via external SAS to two JBODS containing 28 x 1TB disks each for
>>> mirrored zfs vdevs.
>>>
>>> The output from prtconf -vp is attached as it's very long (2042 lines).
>>>
>>> Kind regards,
>>> Tom
>>>
>>> On 08/10/13 02:06, Garrett D'Amore wrote:
>>>> If you're seeing hangs like this, I would appreciate knowing the hardware
>>>> configuration. Prtconf -vp might be helpful. Presumably this is the result of one or
>>>> more devices not doing the right thing for quiesce().
>>>>
>>>> - Garrett
>>>>
>>>> On Oct 7, 2013, at 4:34 AM, Narayan Desai <narayan.desai at gmail.com
>>>> <mailto:narayan.desai at gmail.com>> wrote:
>>>>
>>>>> This is caused by the system attempting to use fastboot on default reboot. We've
>>>>> disabled that and things seem to work properly; the following commands do the trick.
>>>>> (the first changes reboot not to use fastboot, the second causes the system to do a
>>>>> full reboot upon panic)
>>>>> -nld
>>>>>
>>>>> # svccfg -s "system/boot-config:default" setprop config/fastreboot_default=false
>>>>> # svcadm refresh svc:/system/boot-config:default
>>>>> # svccfg -s "system/boot-config:default" setprop config/fastreboot_onpanic=false
>>>>> # svcadm refresh svc:/system/boot-config:default
>>>>>
>>>>>
>>>>>
>>>>> On Sun, Oct 6, 2013 at 11:26 PM, Tom Robinson <tom.robinson at motec.com.au
>>>>> <mailto:tom.robinson at motec.com.au>> wrote:
>>>>>
>>>>> Hi,
>>>>>
>>>>> I'm running OmniOS r151006 on a SuperMicro X9 motherboard.
>>>>>
>>>>> when I do:
>>>>>
>>>>> reboot -- -r
>>>>>
>>>>> The system hangs on the 'rebooting...' message.
>>>>>
>>>>> I've disabled Suspend and C-states in the BIOS. I had one successful 'reboot' but
>>>>> now it's hanging
>>>>> again.
>>>>>
>>>>> Anyone have any clues on how to fix this?
>>>>>
>>>>> Kind regards,
>>>>>
>>>>> --
>>>>>
>>>>> Tom Robinson
>>>>> IT Manager/System Administrator
>>>>>
>>>>> MoTeC Pty Ltd
>>>>>
>>>>> 121 Merrindale Drive
>>>>> Croydon South
>>>>> 3136 Victoria
>>>>> Australia
>>>>>
>>>>> T: +61 3 9761 5050 <tel:%2B61%203%209761%205050>
>>>>> F: +61 3 9761 5051 <tel:%2B61%203%209761%205051>
>>>>> E: tom.robinson at motec.com.au <mailto:tom.robinson at motec.com.au>
>>>>>
>>>>>
>>>>>
>>>>> _______________________________________________
>>>>> OmniOS-discuss mailing list
>>>>> OmniOS-discuss at lists.omniti.com <mailto:OmniOS-discuss at lists.omniti.com>
>>>>> http://lists.omniti.com/mailman/listinfo/omnios-discuss
>>>>>
>>>>>
>>>>> _______________________________________________
>>>>> OmniOS-discuss mailing list
>>>>> OmniOS-discuss at lists.omniti.com <mailto:OmniOS-discuss at lists.omniti.com>
>>>>> http://lists.omniti.com/mailman/listinfo/omnios-discuss
>>>>
>>>
>>> <prtconf-vp.out>
>>
>>
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://omniosce.org/ml-archive/attachments/20131008/88bfdeda/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 254 bytes
Desc: OpenPGP digital signature
URL: <https://omniosce.org/ml-archive/attachments/20131008/88bfdeda/attachment-0001.bin>
More information about the OmniOS-discuss
mailing list