[OmniOS-discuss] kernel panic "kernel heap corruption detected" when creating zero eager disks

wuffers moo at wuffers.net
Tue Mar 24 21:17:33 UTC 2015


I recently created a pair of 25TB LUs for use in my VMware environment to
test out Veeam (and using that space for my repo - yes, yes, backups should
not reside in the same storage, but they will be exported to tape).

So while trying to create a 16TB drive in the vSphere fat client, I got the
value out of range error (
http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=2054952).
OKed the error, and task seemed to run anyways, but at some point my whole
SAN crashed during the creation of the drive.

As this was during business hours, I did not have time to wait on the dump,
but I was able to reproduce it later trying to create a 10TB drive (again
from the fat vSphere client, not web client) and capture the dump (which
takes 40 minutes.. grr).

Just an quick note on the environment: the VMware hosts are connected to
the head unit via IB and SRP. The largest LUs I had previously created for
VMware were 5TB in size, and largest drive created was 2TB.

fmdump info:

TIME                           UUID
SUNW-MSG-ID
Mar 20 2015 19:35:26.819716000 31ced65f-dca2-ee58-c882-a6daa6b94208
SUNOS-8000-KL

nvlist version: 0
        version = 0x0
        class = list.suspect
        uuid = 31ced65f-dca2-ee58-c882-a6daa6b94208
        code = SUNOS-8000-KL
        diag-time = 1426894526 787544
        de = fmd:///module/software-diagnosis
        fault-list-sz = 0x1
        fault-list = (array of embedded nvlists)
        (start fault-list[0])
        nvlist version: 0
                version = 0x0
                class = defect.sunos.kernel.panic
                certainty = 0x64
                asru =
sw:///:path=/var/crash/unknown/.31ced65f-dca2-ee58-c882-a6daa6b94208
                resource =
sw:///:path=/var/crash/unknown/.31ced65f-dca2-ee58-c882-a6daa6b94208
                savecore-succcess = 1
                dump-dir = /var/crash/unknown
                dump-files = vmdump.0
                os-instance-uuid = 31ced65f-dca2-ee58-c882-a6daa6b94208
                panicstr = kernel heap corruption detected
                panicstack = fffffffffba49114 () |
genunix:kmem_slab_free+c1 () | genunix:kmem_magazine_destroy+6e () |
genunix:kmem_cache_magazine_purge+f0 () |
genunix:kmem_cache_magazine_resize+40 () | genunix:taskq_thread+2d0 () |
unix:thread_start+8 () |
                crashtime = 1426891707
                panic-time = Fri Mar 20 18:48:27 2015 EDT
        (end fault-list[0])

        fault-status = 0x1
        severity = Major
        __ttl = 0x1
        __tod = 0x550caebe 0x30dbdfa0

Crash file:
https://drive.google.com/open?id=0B7mCJnZUzJPKOXl1S3IwYXh4NTg&authuser=0

I couldn't find any interesting comparative posts/reports. Would some kind
soul care to look at the dump and see what is happening here?

(And is this the right spot for a kernel panic report, or is it better to
go to the illumos list?)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://omniosce.org/ml-archive/attachments/20150324/0842bce9/attachment.html>


More information about the OmniOS-discuss mailing list