[Ocfs-users] Node crashed after remove a path

Roger Trang Roger.Trang at 3pardata.com
Thu May 18 18:19:58 CDT 2006


Hi,
 
I have a 2-node cluster on 2 Dell PowerEdge 2650.
When remove a device path, and both nodes crashed.
Any help would be appreciated.
Thanks!
Roger---
 
Configuration:
Oracle: 10.2.0.1.0 x86
Oracle home: on OCFS2 shared with multipath
Oracle datafiles: OCFS2 shared with multipath

cat redhat-release
Red Hat Enterprise Linux ES release 4 (Nahant Update 2)
 
 uname -a
Linux sqa-pe2650-40 2.6.9-22.ELsmp #1 SMP Mon Sep 19 18:32:14 EDT 2005 i686 
i686 i386 GNU/Linux
 
rpm -qa | grep ocfs
ocfs2console-1.2.0-1
ocfs2-tools-1.2.0-1
ocfs2-2.6.9-22.ELsmp-1.2.0-1
 
rpm -qa | grep -i device
device-mapper-1.01.04-1.0.RHEL4
device-mapper-multipath-0.4.5-6.0.RHEL4
 
Console messages:
(5104,1):dlm_send_remote_convert_request:393 ERROR: status = -107
(5104,1):dlm_wait_for_node_death:285 4E4133205E3C4AD980D6BBBE4AE4014B: waiting 
5000ms for notification of death of node 0
(6360,0):dlm_send_remote_convert_request:393 ERROR: status = -107
(6360,0):dlm_wait_for_node_death:285 EDB955CBD81B44C78CD9258B99F91E4C: waiting 
5000ms for notification of death of node 0
(5104,1):dlm_send_remote_convert_request:393 ERROR: status = -107
(5104,1):dlm_wait_for_node_death:285 4E4133205E3C4AD980D6BBBE4AE4014B: waiting 
5000ms for notification of death of node 0
(6360,0):dlm_send_remote_convert_request:393 ERROR: status = -107
(6360,0):dlm_wait_for_node_death:285 EDB955CBD81B44C78CD9258B99F91E4C: waiting 
5000ms for notification of death of node 0
(6,0):o2quo_make_decision:143 ERROR: fencing this node because it is connected 
to a half-quorum of 1 out of 2 nodes which doesn't include the lowest active 
node 0
(6,0):o2hb_stop_all_regions:1727 ERROR: stopping heartbeat on all active 
regions.
Kernel panic - not syncing: ocfs2 is very sorry to be fencing this system by 
panicing
 
------------[ cut here ]------------
kernel BUG at kernel/panic.c:74!
invalid operand: 0000 [#1]
SMP
Modules linked in: nfs lockd ocfs2(U) debugfs(U) md5 ipv6 parport_pc lp parport 
netconsole netdump autofs4 i2c_dev i2c_core ocfs2_dlmfs(U) ocfs2_dlm(U) 
ocfs2_nodemanager(U) configfs(U) sunrpc button battery ac ohci_hcd shpchp tg3 
floppy sg aic7xxx dm_round_robin dm_multipath dm_mirror dm_modEIP is at 
panic+0x47/0x147
Process events/0 (pid: 6, threadinfo=f7c7c000 task=c1e25130)f8bf5831  
default_wake_function+0x0/0xc kernel_thread_helper+0x5/0xb
a8 ff
Pid: 6, comm:             events/0
EIP: 0060:[<c0121b26>] CPU: 0
EIP is at panic+0x47/0x147
 EFLAGS: 00010286    Not tainted  (2.6.9-22.ELsmp)
EAX: 0000005a EBX: f8c00a68 ECX: f7c7cf58 EDX: c02e29cb
ESI: f8c00a6c EDI: 00000206 EBP: c1e26000 DS: 007b ES: 007b
CR0: 8005003b CR2: 0036c570 CR3: 37f87f00 CR4: 000006f0
 [<f8bee6fb>] o2quo_disk_timeout+0x0/0x2 [ocfs2_nodemanager]
 [<c013044f>] worker_thread+0x168/0x1d5
 [<f8bee6fd>] o2quo_make_decision+0x0/0x247 [ocfs2_nodemanager]
 [<c011e52f>] default_wake_function+0x0/0xc
 [<c011e52f>] default_wake_function+0x0/0xc
 [<c01302e7>] worker_thread+0x0/0x1d5
 [<c01339d9>] kthread+0x73/0x9b
 [<c0133966>] kthread+0x0/0x9b
 [<c01041f1>] kernel_thread_helper+0x5/0xb

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://oss.oracle.com/pipermail/ocfs-users/attachments/20060518/416112e0/attachment.html


More information about the Ocfs-users mailing list