[OmniOS-discuss] OmniOS crash - help needed.
Christian Flaig
christian.flaig at gmail.com
Sun Sep 22 21:18:07 UTC 2013
Hello all,
New server here, running OmniOS stable (OmniOS v11 r151006). The server has a Xeon E5-1620, a Supermicro X9SRL-F, 64 GB RAM, three IBM M1015 and an Intel X540 dual 10 Gbit NIC. Lots of HDDs of course, all in different ZFS pools, migrated from another server (basically splitting up an all-in-one). I used OpenIndiana as a fileserver before, but I like the OmniOS approach.
Now the server has crashed for the second time while under load (it serves as a VM datastore over NFS for an ESXi server; one zone runs a MySQL database for NewzNab/NzeDB, and one zone runs SabNZBD).
All VMs access data on the fileserver through SMB (the easiest approach), including NewzNab/NzeDB (lots of small files).
Below is what I could extract from the crash dump. To me it looks like the CIFS server has an issue. Could someone help me find the cause (and a fix!)? I'm not sure about the new hardware, but I copied data between pools for 24 hours without any issues (ZFS send/receive). I only started using SMB today for all client VMs to access files.
Thanks a lot for your help!
Chris
TIME UUID SUNW-MSG-ID
Sep 23 2013 00:52:48.433021000 9b8f4f65-46fd-cf8d-b7e2-8dc10879d615 SUNOS-8000-KL
TIME CLASS ENA
Sep 23 00:52:48.3316 ireport.os.sunos.panic.dump_available 0x0000000000000000
Sep 23 00:52:01.3988 ireport.os.sunos.panic.dump_pending_on_device 0x0000000000000000
nvlist version: 0
version = 0x0
class = list.suspect
uuid = 9b8f4f65-46fd-cf8d-b7e2-8dc10879d615
code = SUNOS-8000-KL
diag-time = 1379890368 346508
de = fmd:///module/software-diagnosis
fault-list-sz = 0x1
fault-list = (array of embedded nvlists)
(start fault-list[0])
nvlist version: 0
version = 0x0
class = defect.sunos.kernel.panic
certainty = 0x64
asru = sw:///:path=/var/crash/unknown/.9b8f4f65-46fd-cf8d-b7e2-8dc10879d615
resource = sw:///:path=/var/crash/unknown/.9b8f4f65-46fd-cf8d-b7e2-8dc10879d615
savecore-succcess = 1
dump-dir = /var/crash/unknown
dump-files = vmdump.0
os-instance-uuid = 9b8f4f65-46fd-cf8d-b7e2-8dc10879d615
panicstr = BAD TRAP: type=e (#pf Page fault) rp=ffffff007a20f450 addr=348 occurred in module "smbsrv" due to a NULL pointer dereference
panicstack = unix:die+df () | unix:trap+db3 () | unix:cmntrap+e6 () | smbsrv:smb_fsop_lookup+118 () | smbsrv:smb_common_rename+d9 () | smbsrv:smb_trans2_rename+136 () | smbsrv:smb_set_rename_info+b8 () | smbsrv:smb_set_fileinfo+ed () | smbsrv:smb_set_by_fid+b0 () | smbsrv:smb_com_trans2_set_file_information+58 () | smbsrv:smb_trans2_dispatch+313 () | smbsrv:smb_com_transaction2+1a7 () | smbsrv:smb_dispatch_request+662 () | smbsrv:smb_session_worker+a0 () | genunix:taskq_d_thread+b7 () | unix:thread_start+8 () |
crashtime = 1379887292
panic-time = Mon Sep 23 00:01:32 2013 CEST
(end fault-list[0])
fault-status = 0x1
severity = Major
__ttl = 0x1
__tod = 0x523f74c0 0x19cf6048
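
As a reading aid for the report above, here is a minimal Python sketch that splits the panicstack value into (module, function, offset) frames and picks the first frame outside the generic unix trap handlers. The helper names (parse_panicstack, first_frame_outside) are made up for illustration and are not part of any OmniOS tooling; the string is copied from the panicstack line above. It only shows where the trap was taken, not why the pointer was NULL.

```python
import re

# panicstack value copied verbatim from the fault report above.
PANICSTACK = (
    "unix:die+df () | unix:trap+db3 () | unix:cmntrap+e6 () | "
    "smbsrv:smb_fsop_lookup+118 () | smbsrv:smb_common_rename+d9 () | "
    "smbsrv:smb_trans2_rename+136 () | smbsrv:smb_set_rename_info+b8 () | "
    "smbsrv:smb_set_fileinfo+ed () | smbsrv:smb_set_by_fid+b0 () | "
    "smbsrv:smb_com_trans2_set_file_information+58 () | "
    "smbsrv:smb_trans2_dispatch+313 () | smbsrv:smb_com_transaction2+1a7 () | "
    "smbsrv:smb_dispatch_request+662 () | smbsrv:smb_session_worker+a0 () | "
    "genunix:taskq_d_thread+b7 () | unix:thread_start+8 () |"
)

# Each frame looks like "module:function+hexoffset ()".
FRAME_RE = re.compile(
    r"(?P<module>[^:\s]+):(?P<function>[^+\s]+)\+(?P<offset>[0-9a-f]+)"
)


def parse_panicstack(stack):
    """Return a list of (module, function, offset) tuples, innermost first."""
    return [
        (m.group("module"), m.group("function"), int(m.group("offset"), 16))
        for m in FRAME_RE.finditer(stack)
    ]


def first_frame_outside(frames, skip_modules=("unix",)):
    """Return the first frame whose module is not in skip_modules, or None."""
    for frame in frames:
        if frame[0] not in skip_modules:
            return frame
    return None


frames = parse_panicstack(PANICSTACK)
faulting = first_frame_outside(frames)

if __name__ == "__main__":
    for module, function, offset in frames:
        print(f"{module:8s} {function}+0x{offset:x}")
    print("first frame outside unix:", faulting)
```

Read this way, the frames below the unix trap handlers trace an SMB trans2 set-file-information request going through the rename path into smb_fsop_lookup, which matches the panicstr naming the smbsrv module.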