<div>i'm looking to move forward on this, so sorry if I seem a little anxious, I'm excited to get it going correctly. </div>
<div>&nbsp;</div>
<div>Are there patches that I can get somewhere to apply?&nbsp; Or a path I can explore to begin getting this kernel up to speed with the correct ocfs2-dlm?</div>
<div>Or possibly are the correct ocfs2-dlm files in the kernel source tree for later kernels that I could upgrade to?</div>
<div>&nbsp;</div>
<div>Thanks for the replies so far!</div>
<div><br>&nbsp;</div>
<div><span class="gmail_quote">On 10/12/06, <b class="gmail_sendername">Sunil Mushran</b> &lt;<a href="mailto:Sunil.Mushran@oracle.com">Sunil.Mushran@oracle.com</a>&gt; wrote:</span>
<blockquote class="gmail_quote" style="PADDING-LEFT: 1ex; MARGIN: 0px 0px 0px 0.8ex; BORDER-LEFT: #ccc 1px solid">The ocfs2 shipping with that kernel is missing few dlm patches.<br>I'll put together some patches. There is a bugzilla logged on this.
<br><br>Bleeding Edge wrote:<br>&gt;<br>&gt; I've gotten the error below several times on different builds on<br>&gt; different hardware:<br>&gt;<br>&gt; The setup is a bit different from the norm, it's a xenofied 2.6.16<br>
&gt; kernel runing Debian Etch with block device backends for the ocfs2<br>&gt; storage. (yes I know it's an adventerous setup).&nbsp;&nbsp;I'm using the ocfs2<br>&gt; from the kernel, and the ocfs2-tools from Debian, ( 1.2.1-1).<br>
&gt;<br>&gt; 99% of the time it's great, fences well, does it's job.&nbsp;&nbsp;Only one node<br>&gt; is actually being &quot;used&quot; but both are on and mounted.&nbsp;&nbsp;I have seen<br>&gt; these errors when just leaving it overnight, and after a while it
<br>&gt; bombs, system load doesn't seem to be a factor.&nbsp;&nbsp;When I logged in this<br>&gt; morning, on the node that had very little load, I found this in the<br>&gt; console:<br>&gt;<br>&gt; (2016,0):dlm_proxy_ast_handler:321 ERROR: got ast for unknown lockres!
<br>&gt; cookie=144115188078155225, name=M0000000000000006676050a149f878,<br>&gt; namelen=31<br>&gt;<br>&gt; When I attempted to shutdown the 2nd node, it exploded with the<br>&gt; following error, and locked up both nodes, and I'm looking for
<br>&gt; clarification, or even just a starting point:<br>&gt;<br>&gt; Thanks<br>&gt;<br>&gt;<br>&gt;<br>&gt;<br>&gt; kernel BUG at &lt;bad filename&gt;:58347!<br>&gt; invalid opcode: 0000 [#1]<br>&gt; SMP<br>&gt; Modules linked in: ocfs2 ipv6 ocfs2_dlmfs ocfs2_dlm ocfs2_nodemanager
<br>&gt; configfs dm_snapshot dm_mirror dm_mod ext3 jbd<br>&gt; CPU:&nbsp;&nbsp;&nbsp;&nbsp;0<br>&gt; EIP:&nbsp;&nbsp;&nbsp;&nbsp;0061:[&lt;d113cec5&gt;]&nbsp;&nbsp;&nbsp;&nbsp;Not tainted VLI<br>&gt; EFLAGS: 00010202&nbsp;&nbsp; (2.6.16-xen-domU #1)<br>&gt; EIP is at __dlm_lockres_reserve_ast+0x35/0x40 [ocfs2_dlm]
<br>&gt; eax: 00000028&nbsp;&nbsp; ebx: cd9bfa80&nbsp;&nbsp; ecx: f578d000&nbsp;&nbsp; edx: 00000000<br>&gt; esi: 00000002&nbsp;&nbsp; edi: cb695200&nbsp;&nbsp; ebp: cd9bfa80&nbsp;&nbsp; esp: cb079d3c<br>&gt; ds: 007b&nbsp;&nbsp; es: 007b&nbsp;&nbsp; ss: 0069<br>&gt; Process umount (pid: 8046, threadinfo=cb078000 task=cf7ac030)
<br>&gt; Stack: &lt;0&gt;cd9bfa80 cd9bfa8c d1140c57 cd9bfa80 cc412ea0 0000001f<br>&gt; 00000002 00000028<br>&gt;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;00000000 c03b8203 00000400 c011e200 cb695230 cd9bfa8c d114c6cc<br>&gt; cd9bfac8<br>&gt;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;02000000 c4fe7000 c82d2e00 c4fe7000 00000000 cc412ea0 0000001f
<br>&gt; 00000000<br>&gt; Call Trace:<br>&gt;&nbsp;&nbsp;[&lt;d1140c57&gt;] dlm_migrate_lockres+0x677/0x15f0 [ocfs2_dlm]<br>&gt;&nbsp;&nbsp;[&lt;c011e200&gt;] vprintk+0x290/0x330<br>&gt;&nbsp;&nbsp;[&lt;c02f45d6&gt;] schedule+0x536/0x860<br>&gt;&nbsp;&nbsp;[&lt;d11347a5&gt;] dlm_purge_lockres+0x75/0x230 [ocfs2_dlm]
<br>&gt;&nbsp;&nbsp;[&lt;d1131868&gt;] dlm_unregister_domain+0x108/0x740 [ocfs2_dlm]<br>&gt;&nbsp;&nbsp;[&lt;c02f49cf&gt;] wait_for_completion+0xaf/0x110<br>&gt;&nbsp;&nbsp;[&lt;c0116b70&gt;] default_wake_function+0x0/0x20<br>&gt;&nbsp;&nbsp;[&lt;d126ae8d&gt;] ocfs2_remove_lockres_tracking+0xd/0x40 [ocfs2]
<br>&gt;&nbsp;&nbsp;[&lt;c0133552&gt;] kthread_stop_sem+0x82/0xb0<br>&gt;&nbsp;&nbsp;[&lt;d127038d&gt;] ocfs2_dlm_shutdown+0xed/0x360 [ocfs2]<br>&gt;&nbsp;&nbsp;[&lt;d129f295&gt;] ocfs2_unregister_net_handlers+0x25/0xc0 [ocfs2]<br>&gt;&nbsp;&nbsp;[&lt;d129a521&gt;] ocfs2_dismount_volume+0x181/0x4c0 [ocfs2]
<br>&gt;&nbsp;&nbsp;[&lt;d129aa81&gt;] ocfs2_put_super+0x31/0xe0 [ocfs2]<br>&gt;&nbsp;&nbsp;[&lt;c016c3f2&gt;] generic_shutdown_super+0x92/0x150<br>&gt;&nbsp;&nbsp;[&lt;c016c4d9&gt;] kill_block_super+0x29/0x50<br>&gt;&nbsp;&nbsp;[&lt;c016c5ea&gt;] deactivate_super+0x7a/0xa0
<br>&gt;&nbsp;&nbsp;[&lt;c018453b&gt;] sys_umount+0x4b/0x2d0<br>&gt;&nbsp;&nbsp;[&lt;c0105171&gt;] syscall_call+0x7/0xb<br>&gt; Code: 43 48 84 c0 7f 21 0f b7 43 5a a8 20 75 0b a8 20 75 19 f0 ff 43<br>&gt; 44 59 5b c3 89 1c 24 e8 15 6a ff ff 0f b7 43 5a eb e7 0f 0b eb db &lt;0f&gt;
<br>&gt; 0b eb e3 8d b4 26 00 00 00 00 53 8b 5c 24 0c 8d 43 48 e8 73<br>&gt;&nbsp;&nbsp;Badness in do_exit at kernel/exit.c:802<br>&gt;&nbsp;&nbsp;[&lt;c012134d&gt;] do_exit+0x89d/0x8b0<br>&gt;&nbsp;&nbsp;[&lt;c011007b&gt;] prepare_for_smp+0x4b/0x160<br>
&gt;&nbsp;&nbsp;[&lt;c0105c9a&gt;] die+0x23a/0x240<br>&gt;&nbsp;&nbsp;[&lt;c0106590&gt;] do_invalid_op+0x0/0xc0<br>&gt;&nbsp;&nbsp;[&lt;c010663f&gt;] do_invalid_op+0xaf/0xc0<br>&gt;&nbsp;&nbsp;[&lt;d113cec5&gt;] __dlm_lockres_reserve_ast+0x35/0x40 [ocfs2_dlm]<br>
&gt;&nbsp;&nbsp;[&lt;d113d7a5&gt;] dlm_init_mle+0x85/0x180 [ocfs2_dlm]<br>&gt;&nbsp;&nbsp;[&lt;c0105303&gt;] error_code+0x2b/0x30<br>&gt;&nbsp;&nbsp;[&lt;d113cec5&gt;] __dlm_lockres_reserve_ast+0x35/0x40 [ocfs2_dlm]<br>&gt;&nbsp;&nbsp;[&lt;d1140c57&gt;] dlm_migrate_lockres+0x677/0x15f0 [ocfs2_dlm]
<br>&gt;&nbsp;&nbsp;[&lt;c011e200&gt;] vprintk+0x290/0x330<br>&gt;&nbsp;&nbsp;[&lt;c02f45d6&gt;] schedule+0x536/0x860<br>&gt;&nbsp;&nbsp;[&lt;d11347a5&gt;] dlm_purge_lockres+0x75/0x230 [ocfs2_dlm]<br>&gt;&nbsp;&nbsp;[&lt;d1131868&gt;] dlm_unregister_domain+0x108/0x740 [ocfs2_dlm]
<br>&gt;&nbsp;&nbsp;[&lt;c02f49cf&gt;] wait_for_completion+0xaf/0x110<br>&gt;&nbsp;&nbsp;[&lt;c0116b70&gt;] default_wake_function+0x0/0x20<br>&gt;&nbsp;&nbsp;[&lt;d126ae8d&gt;] ocfs2_remove_lockres_tracking+0xd/0x40 [ocfs2]<br>&gt;&nbsp;&nbsp;[&lt;c0133552&gt;] kthread_stop_sem+0x82/0xb0
<br>&gt;&nbsp;&nbsp;[&lt;d127038d&gt;] ocfs2_dlm_shutdown+0xed/0x360 [ocfs2]<br>&gt;&nbsp;&nbsp;[&lt;d129f295&gt;] ocfs2_unregister_net_handlers+0x25/0xc0 [ocfs2]<br>&gt;&nbsp;&nbsp;[&lt;d129a521&gt;] ocfs2_dismount_volume+0x181/0x4c0 [ocfs2]<br>&gt;&nbsp;&nbsp;[&lt;d129aa81&gt;] ocfs2_put_super+0x31/0xe0 [ocfs2]
<br>&gt;&nbsp;&nbsp;[&lt;c016c3f2&gt;] generic_shutdown_super+0x92/0x150<br>&gt;&nbsp;&nbsp;[&lt;c016c4d9&gt;] kill_block_super+0x29/0x50<br>&gt;&nbsp;&nbsp;[&lt;c016c5ea&gt;] deactivate_super+0x7a/0xa0<br>&gt;&nbsp;&nbsp;[&lt;c018453b&gt;] sys_umount+0x4b/0x2d0
<br>&gt;&nbsp;&nbsp;[&lt;c0105171&gt;] syscall_call+0x7/0xb<br>&gt; (6766,0):o2net_idle_timer:1284 connection to node vserver1-3 (num 2)<br>&gt; at <a href="http://10.10.69.113:7777">10.10.69.113:7777</a> &lt;<a href="http://10.10.69.113:7777">
http://10.10.69.113:7777</a>&gt; has been idle for 10<br>&gt; seconds, shutting it down.<br>&gt; (6766,0):o2net_idle_timer:1297 here are some times that might help<br>&gt; debug the situation: (tmr 1160659302.852081 now 1160659312.846775
 dr<br>&gt; 1160659307.850562 adv 1160659302.852111:1160659302.852111 func<br>&gt; (b9bad2f8:506) 1160659302.852082:1160659302.852088)<br>&gt; BUG: soft lockup detected on CPU#0!<br>&gt;<br>&gt; Pid: 6766, comm:&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; dlm_thread
<br>&gt; EIP: 0061:[&lt;c02f5e57&gt;] CPU: 0<br>&gt; EIP is at _spin_lock+0x7/0x10<br>&gt;&nbsp;&nbsp;EFLAGS: 00000286&nbsp;&nbsp;&nbsp;&nbsp;Not tainted&nbsp;&nbsp;(2.6.16-xen-domU #1)<br>&gt; EAX: cd9bfac8 EBX: cd9bfac8 ECX: cb69520c EDX: cb695200<br>&gt; ESI: cd9bfaa4 EDI: 00000000 EBP: cb695214 DS: 007b ES: 007b
<br>&gt; CR0: 8005003b CR2: b7eec83c CR3: 00e76000 CR4: 00000660<br>&gt;&nbsp;&nbsp;[&lt;d1134e0c&gt;] dlm_thread+0x26c/0x11f0 [ocfs2_dlm]<br>&gt;&nbsp;&nbsp;[&lt;c0133720&gt;] kthread+0xc0/0x110<br>&gt;&nbsp;&nbsp;[&lt;c0133960&gt;] autoremove_wake_function+0x0/0x60
<br>&gt;&nbsp;&nbsp;[&lt;c0133734&gt;] kthread+0xd4/0x110<br>&gt;&nbsp;&nbsp;[&lt;d1134ba0&gt;] dlm_thread+0x0/0x11f0 [ocfs2_dlm]<br>&gt;&nbsp;&nbsp;[&lt;c0133660&gt;] kthread+0x0/0x110<br>&gt;&nbsp;&nbsp;[&lt;c0102bd5&gt;] kernel_thread_helper+0x5/0x10<br>&gt;
<br>&gt; ------------------------------------------------------------------------<br>&gt;<br>&gt; _______________________________________________<br>&gt; Ocfs2-users mailing list<br>&gt; <a href="mailto:Ocfs2-users@oss.oracle.com">
Ocfs2-users@oss.oracle.com</a><br>&gt; <a href="http://oss.oracle.com/mailman/listinfo/ocfs2-users">http://oss.oracle.com/mailman/listinfo/ocfs2-users</a><br>&gt;<br></blockquote></div><br>