6 Feb 2012 09:57
kernel crash when using libnuma
Trevor Kramer <tkramer <at> hampshire.edu>
2012-02-06 08:57:52 GMT
I have a program that can allocate memory either with libnuma's
numa_alloc_onnode() or with malloc. In malloc mode everything works
fine, but in libnuma mode I get consistent kernel panics with the
following traces. This only occurs when multiple threads are running.
Has anyone seen this before, or does anyone have recommendations on
how to debug this further?
crash> bt
PID: 62333 TASK: ffff883ff5698b40 CPU: 17 COMMAND: "test"
#0 [ffff883ff58378f0] machine_kexec at ffffffff810310cb
#1 [ffff883ff5837950] crash_kexec at ffffffff810b6392
#2 [ffff883ff5837a20] oops_end at ffffffff814de670
#3 [ffff883ff5837a50] die at ffffffff8100f2eb
#4 [ffff883ff5837a80] do_trap at ffffffff814ddf64
#5 [ffff883ff5837ae0] do_invalid_op at ffffffff8100ceb5
#6 [ffff883ff5837b80] invalid_op at ffffffff8100bf5b
[exception RIP: split_huge_page+2021]
RIP: ffffffff8116c605 RSP: ffff883ff5837c38 RFLAGS: 00010297
RAX: 0000000000000001 RBX: ffff880ff704bc38 RCX: 000000000000fe9e
RDX: 0000000000000000 RSI: 0000000000000046 RDI: 0000000000000246
RBP: ffff883ff5837d08 R8: 0000000000000000 R9: 0000000000000004
R10: 0000000000000001 R11: ffff880ff6fb7906 R12: ffff880ff84b7aa8
R13: fffffffffffffff2 R14: ffffea006c34c000 R15: ffffea006c34c000
ORIG_RAX: ffffffffffffffff CS: 0010 SS: 0018
#7 [ffff883ff5837c30] split_huge_page at ffffffff8116c5aa
#8 [ffff883ff5837d10] __split_huge_page_pmd at ffffffff8116c6d1
#9 [ffff883ff5837d40] unmap_vmas at ffffffff8113559e
#10 [ffff883ff5837e80] unmap_region at ffffffff8113cce1
#11 [ffff883ff5837ef0] do_munmap at ffffffff8113d3a6
#12 [ffff883ff5837f50] sys_munmap at ffffffff8113d4e6
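
Since the faulting frames above are split_huge_page and __split_huge_page_pmd, one thing that may be worth recording alongside the trace is whether transparent huge pages are enabled on the machine. The following is only a rough sketch of such a check, not something the original program does. It assumes the mainline sysfs path /sys/kernel/mm/transparent_hugepage/enabled (distribution kernels may expose the setting elsewhere), and the helper names active_thp_mode and read_thp_mode are made up for illustration.

```python
from pathlib import Path

# Assumption: mainline sysfs location of the THP switch. Some distribution
# kernels use a different path, so adjust this if the file does not exist.
THP_ENABLED = Path("/sys/kernel/mm/transparent_hugepage/enabled")


def active_thp_mode(text):
    """Return the bracketed choice from a line like 'always [madvise] never'."""
    for word in text.split():
        if word.startswith("[") and word.endswith("]"):
            return word[1:-1]
    return None  # no bracketed entry found


def read_thp_mode(path=THP_ENABLED):
    """Read the sysfs file; return None if it is missing or unreadable."""
    try:
        return active_thp_mode(path.read_text())
    except OSError:
        return None


if __name__ == "__main__":
    print("THP mode:", read_thp_mode())
```

If this reports that THP is active, rerunning the libnuma test with THP disabled would at least show whether the panic depends on huge pages being mapped in the region that munmap tears down.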