[Ocfs2-users] Sles10 Sp2 kernel crash
Charlie Sharkey
charlie.sharkey at bustech.com
Wed Sep 29 06:47:00 PDT 2010
I got the following crash on a Sles10 SP2 system, info below.
Is this a known problem ? It looks similar to bug# 912
http://oss.oracle.com/bugzilla/show_bug.cgi?id=912
version info
-----------------
OCFS2 Node Manager 1.4.1-1-SLES Wed Jul 23 18:33:42 UTC 2008 (build
f922955d99ef972235bd0c1fc236c5ddbb368611)
OCFS2 DLM 1.4.1-1-SLES Wed Jul 23 18:33:42 UTC 2008 (build
f922955d99ef972235bd0c1fc236c5ddbb368611)
OCFS2 DLMFS 1.4.1-1-SLES Wed Jul 23 18:33:42 UTC 2008 (build
f922955d99ef972235bd0c1fc236c5ddbb368611)
crash info
-------------
KERNEL: ./vmlinux-2.6.16.60-0.42.10
DUMPFILE: ../n2_vmcore_20100925
CPUS: 8
DATE: Sat Sep 25 12:48:00 2010
UPTIME: 10 days, 04:08:44
LOAD AVERAGE: 9.39, 9.11, 8.67
TASKS: 484
NODENAME: n2
RELEASE: 2.6.16.60-0.42.10-smp
VERSION: #1 SMP Tue Apr 27 05:11:27 UTC 2010
MACHINE: x86_64 (2926 Mhz)
MEMORY: 2.9 GB
PANIC: ""
PID: 6557
COMMAND: "dlm_thread"
TASK: ffff81012ac89860 [THREAD_INFO: ffff81010532e000]
CPU: 4
STATE: TASK_RUNNING (PANIC)
crash> bt
PID: 6557 TASK: ffff81012ac89860 CPU: 4 COMMAND: "dlm_thread"
#0 [ffff81010532fa50] machine_kexec at ffffffff8011c0b6
#1 [ffff81010532fb20] crash_kexec at ffffffff80154022
#2 [ffff81010532fbe0] __die at ffffffff802ec658
#3 [ffff81010532fc20] die at ffffffff8010c7e6
#4 [ffff81010532fc50] do_invalid_op at ffffffff8010cd97
#5 [ffff81010532fd10] error_exit at ffffffff8010bced
[exception RIP: dlm_drop_lockres_ref+480]
RIP: ffffffff88511d2a RSP: ffff81010532fdc8 RFLAGS: 00010286
RAX: ffff81006181cc08 RBX: 0000000000000000 RCX: 000000000001109c
RDX: 000000000000001f RSI: 0000000000000296 RDI: ffffffff8035ba1c
RBP: ffff81006181cbc0 R8: ffffffff8045a260 R9: 000000000000001f
R10: 0000000000000000 R11: 0000000000000000 R12: ffff810129b05c00
R13: 000000000000001f R14: ffff81004ada2320 R15: 000000000000026d
ORIG_RAX: ffffffffffffffff CS: 0010 SS: 0018
#6 [ffff81010532fdc0] dlm_drop_lockres_ref at ffffffff88511d2a
#7 [ffff81010532fe40] dlm_run_purge_list at ffffffff8852035c
#8 [ffff81010532fe90] dlm_thread at ffffffff88520718
#9 [ffff81010532ff10] kthread at ffffffff801480cd
#10 [ffff81010532ff50] kernel_thread at ffffffff8010bea6
crash>
text extracted from the core file:
-----------------------------------------
<3>(6345,7):dlm_deref_lockres_handler:2302 ERROR:
27870DB34A7241CC8EBDD43647ABE1FB:M0000000000000078b4305e00000000: node 0
trying to drop ref but it is already dropped!
<3>(6557,4):dlm_drop_lockres_ref:2234 ERROR: while dropping ref on
130ADCC7DE934141AF05DA025CCD14A4:O0000000000000079a3bfbc00000000
(master=0) got -22.
<1>Kernel BUG at fs/ocfs2/dlm/dlmmaster.c:2236
<4>Modules linked in: af_packet ocfs2 ocfs2_dlmfs ocfs2_dlm
ocfs2_nodemanager configfs btipbsa4 ipmi_devintf ipmi_si ipmi_msghandler
bonding ipv6 bticomp_aha363 dock smi button battery btismc ac st loop
dm_round_robin dm_multipath dm_mod usbhid usb_storage ide_core i2c_i801
igb e1000 hw_random i2c_core uhci_hcd ehci_hcd usbcore ext3 jbd qla2xxx
firmware_class qla2xxx_conf intermodule edd fan thermal processor sg
megaraid_sas ata_piix libata sd_mod scsi_mod
<4>Pid: 6557, comm: dlm_thread Tainted: P U 2.6.16.60-0.42.10-smp #1
<4>RIP: 0010:[<ffffffff88511d2a>]
<ffffffff88511d2a>{:ocfs2_dlm:dlm_drop_lockres_ref+480}
<4>Process dlm_thread (pid: 6557, threadinfo ffff81010532e000, task
ffff81012ac89860)
<4>Call Trace: <ffffffff8852035c>{:ocfs2_dlm:dlm_run_purge_list+771}
<4> <ffffffff88520718>{:ocfs2_dlm:dlm_thread+131}
<ffffffff8014820e>{autoremove_wake_function+0}
<4> <ffffffff88520695>{:ocfs2_dlm:dlm_thread+0}
<ffffffff80147e05>{keventd_create_kthread+0}
<1>RIP <ffffffff88511d2a>{:ocfs2_dlm:dlm_drop_lockres_ref+480} RSP
<ffff81010532fdc8>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://oss.oracle.com/pipermail/ocfs2-users/attachments/20100929/8edd13e9/attachment.html
More information about the Ocfs2-users
mailing list