[OmniOS-discuss] reboot hangs on 'rebooting...'

Tom Robinson tom.robinson at motec.com.au
Tue Oct 8 03:11:14 UTC 2013


On 08/10/13 13:24, Narayan Desai wrote:
> I actually hadn't posted a full description of our systems. Here is prtconf output.
>
> The system is a:
> System Configuration: Supermicro X8DAH
> BIOS Configuration: American Megatrends Inc. 2.1        12/30/2011
> a pile of LSI 92xx controllers (mix of internal and external)
> Mellanox ConnectX 2 nic
> Intel gigabit (igb)
>
> Someone suggested that this might be some sort of ACPI problem at one point. Does that seem
> reasonable?
>  -nld
>

I had some 'luck' when I disabled the following in the BIOS:
Advanced->ACPI Settings->ACPI Sleep State = Suspend Disabled

By 'luck' I mean it rebooted once without hanging. Subsequent reboots went back to hanging on 'rebooting...'.

I then made the following BIOS config changes:

Advanced->CPU Power Management Configuration-> C1E Support = Disabled
CPU C3 Report = Disabled
CPU C6 Report = Disabled
CPU C7 Report = Disabled

But that didn't stop the reboot hang.
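
For reference, here is a minimal sketch for checking the fastboot settings mentioned further down in this thread. It assumes the two SMF property names quoted there (config/fastreboot_default and config/fastreboot_onpanic) and that svcprop(1) is available; the function name check_fastreboot is only illustrative. It just reads the values and changes nothing.

```shell
#!/bin/sh
# Sketch only: print the current fast-reboot properties of the boot-config service.
# The property names are taken from the svccfg commands quoted later in this thread.
FMRI="svc:/system/boot-config:default"

check_fastreboot() {
    # On a host without SMF tools there is nothing to query, so report and stop.
    if ! command -v svcprop >/dev/null 2>&1; then
        echo "svcprop not found; not an SMF system"
        return 0
    fi
    for prop in config/fastreboot_default config/fastreboot_onpanic; do
        printf '%s = ' "$prop"
        svcprop -p "$prop" "$FMRI"
    done
}

check_fastreboot

# Assumption: on illumos x86, 'reboot -p' asks for a one-off reboot through the
# firmware (no fast reboot) without touching these properties. Check reboot(1M)
# on your release before relying on it.
```

If both properties already read false and the hang persists, fastboot is probably not the only factor.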

Would it have anything to do with NUMA?


t.

>
>
> On Mon, Oct 7, 2013 at 9:13 PM, Garrett D'Amore <garrett.damore at dey-sys.com> wrote:
>
>
>     On Oct 7, 2013, at 6:08 PM, Narayan Desai <narayan.desai at gmail.com> wrote:
>
>>     So, specifically wrt the Mellanox cards, we have four identical systems with Mellanox cards
>>     running with the hermon driver. One is running OI 151a3, and reboots properly. The ones
>>     running a recent OmniOS don't reboot properly with fastboot. Just one data point.
>
>
>     Oh, that's interesting.  It would be useful to see what's different.  I doubt it's the hermon
>     driver.  I'm more suspicious of the Intel ethernet now.
>
>     - Garrett
>
>>      -nld
>>
>>
>>     On Mon, Oct 7, 2013 at 7:10 PM, Garrett D'Amore <garrett.damore at dey-sys.com> wrote:
>>
>>
>>         On Oct 7, 2013, at 4:29 PM, Tom Robinson <tom.robinson at motec.com.au> wrote:
>>
>>>         Hi Garrett,
>>>
>>>         Thanks for your message.
>>>
>>>         The host configuration is as follows:
>>>
>>>         Supermicro X9DRi-F
>>>         256GB RAM (16x Hynix 16GB ECC Reg. DDR3 1600MHz)
>>>         2 x Intel Xeon E5-2620
>>>         2 x Intel SSD 320 80GB (rpool)
>>>         4 x STEC Enterprise S842 200GB (ARC)
>>>         1 x STEC ZeusRAM 8GB 3.5" SAS SSD (ZIL)
>>>         1 x LSI SAS 9207-8i (internal drives)
>>>         1 x Intel Ethernet Server Adapter X520-DA2, Dual Port 10Gbps SFP+ Direct Attach Copper,
>>>         PCI-e 2.0 5GT/s 1
>>>         2 x Mellanox ConnectX®-2 VPI
>>>         2 x LSI SAS 9207-8e (JBODS)
>>
>>         I'd be suspicious of the Mellanox cards.  Are these using the hermon driver?  It looks
>>         like there is an attempt to do the right thing for those drivers, but… I don't know if I
>>         believe it all works properly.  The other cards should be fine.
>>
>>         - Garrett
>>
>>>
>>>         The host is connected via external SAS to two JBODs containing 28 x 1TB disks each for
>>>         mirrored ZFS vdevs.
>>>
>>>         The output from prtconf -vp is attached as it's very long (2042 lines).
>>>
>>>         Kind regards,
>>>         Tom
>>>
>>>         On 08/10/13 02:06, Garrett D'Amore wrote:
>>>>         If you're seeing hangs like this, I would appreciate knowing the hardware
>>>>         configuration.  The output of prtconf -vp might be helpful.  Presumably this is the
>>>>         result of one or more devices not doing the right thing for quiesce().
>>>>
>>>>         - Garrett
>>>>
>>>>         On Oct 7, 2013, at 4:34 AM, Narayan Desai <narayan.desai at gmail.com> wrote:
>>>>
>>>>>         This is caused by the system attempting to use fastboot on default reboot. We've
>>>>>         disabled that and things seem to work properly; the following commands do the trick.
>>>>>         (The first svccfg/svcadm pair changes reboot not to use fastboot; the second pair
>>>>>         causes the system to do a full reboot upon panic.)
>>>>>          -nld
>>>>>
>>>>>         # svccfg -s "system/boot-config:default" setprop config/fastreboot_default=false
>>>>>         # svcadm refresh svc:/system/boot-config:default
>>>>>         # svccfg -s "system/boot-config:default" setprop config/fastreboot_onpanic=false
>>>>>         # svcadm refresh svc:/system/boot-config:default
>>>>>
>>>>>
>>>>>
>>>>>         On Sun, Oct 6, 2013 at 11:26 PM, Tom Robinson <tom.robinson at motec.com.au> wrote:
>>>>>
>>>>>             Hi,
>>>>>
>>>>>             I'm running OmniOS r151006 on a SuperMicro X9 motherboard.
>>>>>
>>>>>             When I do:
>>>>>
>>>>>             reboot -- -r
>>>>>
>>>>>             The system hangs on the 'rebooting...' message.
>>>>>
>>>>>             I've disabled Suspend and C-states in the BIOS. I had one successful 'reboot' but
>>>>>             now it's hanging again.
>>>>>
>>>>>             Anyone have any clues on how to fix this?
>>>>>
>>>>>             Kind regards,
>>>>>
>>>>>             --
>>>>>
>>>>>             Tom Robinson
>>>>>             IT Manager/System Administrator
>>>>>
>>>>>             MoTeC Pty Ltd
>>>>>
>>>>>             121 Merrindale Drive
>>>>>             Croydon South
>>>>>             3136 Victoria
>>>>>             Australia
>>>>>
>>>>>             T: +61 3 9761 5050
>>>>>             F: +61 3 9761 5051
>>>>>             E: tom.robinson at motec.com.au
>>>>>
>>>>>
>>>>>
>>>>>             _______________________________________________
>>>>>             OmniOS-discuss mailing list
>>>>>             OmniOS-discuss at lists.omniti.com
>>>>>             http://lists.omniti.com/mailman/listinfo/omnios-discuss
>>>>>
>>>>>
>>>>
>>>
>>>         <prtconf-vp.out>
>>
>>
>
>
