[Ocfs2-devel] ocfs2 bug reports, any advices? thanks

Srinivas Eeda srinivas.eeda at oracle.com
Tue Feb 26 20:07:20 PST 2013


This looks similar to what the following patch is trying to address.

http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commitdiff;h=3278bb748d2437eb1464765f36429e5d6aa91c38

On 02/26/2013 07:43 PM, Guozhonghua wrote:
>
> Hi,
>
> I setup two nodes, 192.168.20.20, and 192.168.20.21,
>
> The os is Ubuntu1204 with Kernel version 3.0:
>
> root at Server21:~# uname -a
>
> Linux Server21 3.2.0-23-generic #36-Ubuntu SMP Tue Apr 10 20:39:51 UTC 
> 2012 x86_64 x86_64 x86_64 GNU/Linux
>
> Server20 reboot for the disconnection with iSCSI SAN, so Server20 
> recovery resource locks for Server21.
>
> Server20:
>
> Feb 27 09:29:31 Server20 kernel: [424826.197532] o2net: No longer 
> connected to node Server21 (num 2) at 192.168.20.21:7100
>
> Feb 27 09:29:31 Server20 kernel: [424826.197633] o2cb: o2dlm has 
> evicted node 2 from domain C5FDF4DB054B49B587DF8D4848443259
>
> Feb 27 09:29:35 Server20 kernel: [424830.079130] o2dlm: Begin recovery 
> on domain C5FDF4DB054B49B587DF8D4848443259 for node 2
>
> Feb 27 09:29:35 Server20 kernel: [424830.079156] o2dlm: Node 1 (me) is 
> the Recovery Master for the dead node 2 in domain 
> C5FDF4DB054B49B587DF8D4848443259
>
> Feb 27 09:29:35 Server20 kernel: [424830.079262] o2dlm: End recovery 
> on domain C5FDF4DB054B49B587DF8D4848443259
>
> But the Server21 can't remount the same domain disk on the storage 
> again, as syslog below:
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751256] "echo 0 > 
> /proc/sys/kernel/hung_task_timeout_secs" disables this message.
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751262] mount.ocfs2     D 
> ffffffff81806240     0 12194  12193 0x00000000
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751268]  ffff8807e581b908 
> 0000000000000086 ffff8807e581b8c8 ffffffffa04c056b
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751276]  ffff8807e581bfd8 
> ffff8807e581bfd8 ffff8807e581bfd8 0000000000013780
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751281]  ffff880405cbc4d0 
> ffff8807e50996f0 ffff8807e581b908 7fffffffffffffff
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751288] Call Trace:
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751303]  [<ffffffffa04c056b>] 
> ? dlm_kick_thread+0x7b/0x90 [ocfs2_dlm]
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751311]  [<ffffffff8165a55f>] 
> schedule+0x3f/0x60
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751315]  [<ffffffff8165aba5>] 
> schedule_timeout+0x2a5/0x320
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751319]  [<ffffffff8165a39f>] 
> wait_for_common+0xdf/0x180
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751327]  [<ffffffff8105f990>] 
> ? try_to_wake_up+0x200/0x200
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751331]  [<ffffffff8165a51d>] 
> wait_for_completion+0x1d/0x20
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751357]  [<ffffffffa05d7eb3>] 
> __ocfs2_cluster_lock.isra.34+0x1f3/0x810 [ocfs2]
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751364]  [<ffffffff813162a1>] 
> ? vsnprintf+0x461/0x600
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751369]  [<ffffffffa017c3bf>] 
> ? o2cb_cluster_connect+0x1af/0x2e0 [ocfs2_stack_o2cb]
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751374]  [<ffffffff813164e4>] 
> ? snprintf+0x34/0x40
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751395]  [<ffffffffa05d8d7b>] 
> ocfs2_super_lock+0xab/0x320 [ocfs2]
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751422]  [<ffffffffa0635a5b>] 
> ocfs2_fill_super+0x154b/0x2540 [ocfs2]
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751426]  [<ffffffff81316059>] 
> ? vsnprintf+0x219/0x600
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751433]  [<ffffffff8117aa46>] 
> mount_bdev+0x1c6/0x210
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751460]  [<ffffffffa0634510>] 
> ? ocfs2_initialize_super.isra.208+0x1440/0x1440 [ocfs2]
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751487]  [<ffffffffa0624615>] 
> ocfs2_mount+0x15/0x20 [ocfs2]
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751491]  [<ffffffff8117b5d3>] 
> mount_fs+0x43/0x1b0
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751497]  [<ffffffff81195e1a>] 
> vfs_kern_mount+0x6a/0xc0
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751502]  [<ffffffff81197324>] 
> do_kern_mount+0x54/0x110
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751506]  [<ffffffff81198e74>] 
> do_mount+0x1a4/0x260
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751511]  [<ffffffff81199350>] 
> sys_mount+0x90/0xe0
>
> Feb 27 09:50:59 Server21 kernel: [ 1199.751516]  [<ffffffff81664a82>] 
> system_call_fastpath+0x16/0x1b
>
> Feb 27 09:51:01 Server21 CRON[14164]: (root) CMD (   
> /opt/bin/tomcat_check.sh)
>
> Feb 27 09:51:01 Server21 CRON[14165]: (root) CMD (   
> /opt/bin/libvirtd_check.sh)
>
> Feb 27 09:51:01 Server21 CRON[14166]: (root) CMD ( 
> /opt/bin/ocfs2_iscsi_conf_chg_timer.sh)
>
> Feb 27 09:52:01 Server21 CRON[14788]: (root) CMD (   
> /opt/bin/tomcat_check.sh)
>
> Feb 27 09:52:01 Server21 CRON[14789]: (root) CMD (   
> /opt/bin/libvirtd_check.sh)
>
> Feb 27 09:52:01 Server21 CRON[14790]: (root) CMD ( 
> /opt/bin/ocfs2_iscsi_conf_chg_timer.sh)
>
> Feb 27 09:52:01 Server21 CRON[14791]: (root) CMD (   
> /opt/bin/ha_check_resource.sh)
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.442926] INFO: task 
> mount.ocfs2:12194 blocked for more than 120 seconds.
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.442933] "echo 0 > 
> /proc/sys/kernel/hung_task_timeout_secs" disables this message.
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.442939] mount.ocfs2     D 
> ffffffff81806240     0 12194  12193 0x00000000
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.442945]  ffff8807e581b908 
> 0000000000000086 ffff8807e581b8c8 ffffffffa04c056b
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.442952]  ffff8807e581bfd8 
> ffff8807e581bfd8 ffff8807e581bfd8 0000000000013780
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.442958]  ffff880405cbc4d0 
> ffff8807e50996f0 ffff8807e581b908 7fffffffffffffff
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.442964] Call Trace:
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.442980]  [<ffffffffa04c056b>] 
> ? dlm_kick_thread+0x7b/0x90 [ocfs2_dlm]
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.442988]  [<ffffffff8165a55f>] 
> schedule+0x3f/0x60
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.442992]  [<ffffffff8165aba5>] 
> schedule_timeout+0x2a5/0x320
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.442996]  [<ffffffff8165a39f>] 
> wait_for_common+0xdf/0x180
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.443004]  [<ffffffff8105f990>] 
> ? try_to_wake_up+0x200/0x200
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.443007]  [<ffffffff8165a51d>] 
> wait_for_completion+0x1d/0x20
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.443034]  [<ffffffffa05d7eb3>] 
> __ocfs2_cluster_lock.isra.34+0x1f3/0x810 [ocfs2]
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.443041]  [<ffffffff813162a1>] 
> ? vsnprintf+0x461/0x600
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.443046]  [<ffffffffa017c3bf>] 
> ? o2cb_cluster_connect+0x1af/0x2e0 [ocfs2_stack_o2cb]
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.443051]  [<ffffffff813164e4>] 
> ? snprintf+0x34/0x40
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.443072]  [<ffffffffa05d8d7b>] 
> ocfs2_super_lock+0xab/0x320 [ocfs2]
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.443099]  [<ffffffffa0635a5b>] 
> ocfs2_fill_super+0x154b/0x2540 [ocfs2]
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.443103]  [<ffffffff81316059>] 
> ? vsnprintf+0x219/0x600
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.443110]  [<ffffffff8117aa46>] 
> mount_bdev+0x1c6/0x210
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.443137]  [<ffffffffa0634510>] 
> ? ocfs2_initialize_super.isra.208+0x1440/0x1440 [ocfs2]
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.443163]  [<ffffffffa0624615>] 
> ocfs2_mount+0x15/0x20 [ocfs2]
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.443168]  [<ffffffff8117b5d3>] 
> mount_fs+0x43/0x1b0
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.443174]  [<ffffffff81195e1a>] 
> vfs_kern_mount+0x6a/0xc0
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.443179]  [<ffffffff81197324>] 
> do_kern_mount+0x54/0x110
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.443183]  [<ffffffff81198e74>] 
> do_mount+0x1a4/0x260
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.443187]  [<ffffffff81199350>] 
> sys_mount+0x90/0xe0
>
> Feb 27 09:52:59 Server21 kernel: [ 1319.443193]  [<ffffffff81664a82>] 
> system_call_fastpath+0x16/0x1b
>
> Feb 27 09:53:01 Server21 CRON[15276]: (root) CMD (   
> /opt/bin/tomcat_check.sh)
>
> Feb 27 09:53:01 Server21 CRON[15277]: (root) CMD (   
> /opt/bin/libvirtd_check.sh)
>
> Feb 27 09:53:01 Server21 CRON[15278]: (root) CMD ( 
> /opt/bin/ocfs2_iscsi_conf_chg_timer.sh)
>
> Feb 27 09:53:16 Server21 kernel: [ 1335.561166] qla2xxx 
> [0000:06:00.1]-5009:2: LIP occurred (f7f7).
>
> Feb 27 09:53:21 Server21 kernel: [ 1340.535613] qla2xxx 
> [0000:06:00.1]-500c:2: LIP reset occurred (f7ef).
>
> Feb 27 09:54:01 Server21 CRON[15723]: (root) CMD (   
> /opt/bin/tomcat_check.sh)
>
> Feb 27 09:54:01 Server21 CRON[15725]: (root) CMD (   
> /opt/bin/ha_check_resource.sh)
>
> Feb 27 09:54:01 Server21 CRON[15724]: (root) CMD ( 
> /opt/bin/ocfs2_iscsi_conf_chg_timer.sh)
>
> Feb 27 09:54:01 Server21 CRON[15726]: (root) CMD (   
> /opt/bin/libvirtd_check.sh)
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134659] INFO: task 
> mount.ocfs2:12194 blocked for more than 120 seconds.
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134665] "echo 0 > 
> /proc/sys/kernel/hung_task_timeout_secs" disables this message.
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134673] mount.ocfs2     D 
> ffffffff81806240     0 12194  12193 0x00000000
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134679]  ffff8807e581b908 
> 0000000000000086 ffff8807e581b8c8 ffffffffa04c056b
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134686]  ffff8807e581bfd8 
> ffff8807e581bfd8 ffff8807e581bfd8 0000000000013780
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134692]  ffff880405cbc4d0 
> ffff8807e50996f0 ffff8807e581b908 7fffffffffffffff
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134698] Call Trace:
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134714]  [<ffffffffa04c056b>] 
> ? dlm_kick_thread+0x7b/0x90 [ocfs2_dlm]
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134722]  [<ffffffff8165a55f>] 
> schedule+0x3f/0x60
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134726]  [<ffffffff8165aba5>] 
> schedule_timeout+0x2a5/0x320
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134730]  [<ffffffff8165a39f>] 
> wait_for_common+0xdf/0x180
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134737]  [<ffffffff8105f990>] 
> ? try_to_wake_up+0x200/0x200
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134741]  [<ffffffff8165a51d>] 
> wait_for_completion+0x1d/0x20
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134768]  [<ffffffffa05d7eb3>] 
> __ocfs2_cluster_lock.isra.34+0x1f3/0x810 [ocfs2]
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134775]  [<ffffffff813162a1>] 
> ? vsnprintf+0x461/0x600
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134781]  [<ffffffffa017c3bf>] 
> ? o2cb_cluster_connect+0x1af/0x2e0 [ocfs2_stack_o2cb]
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134785]  [<ffffffff813164e4>] 
> ? snprintf+0x34/0x40
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134806]  [<ffffffffa05d8d7b>] 
> ocfs2_super_lock+0xab/0x320 [ocfs2]
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134833]  [<ffffffffa0635a5b>] 
> ocfs2_fill_super+0x154b/0x2540 [ocfs2]
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134837]  [<ffffffff81316059>] 
> ? vsnprintf+0x219/0x600
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134844]  [<ffffffff8117aa46>] 
> mount_bdev+0x1c6/0x210
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134871]  [<ffffffffa0634510>] 
> ? ocfs2_initialize_super.isra.208+0x1440/0x1440 [ocfs2]
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134898]  [<ffffffffa0624615>] 
> ocfs2_mount+0x15/0x20 [ocfs2]
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134902]  [<ffffffff8117b5d3>] 
> mount_fs+0x43/0x1b0
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134909]  [<ffffffff81195e1a>] 
> vfs_kern_mount+0x6a/0xc0
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134913]  [<ffffffff81197324>] 
> do_kern_mount+0x54/0x110
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134918]  [<ffffffff81198e74>] 
> do_mount+0x1a4/0x260
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134922]  [<ffffffff81199350>] 
> sys_mount+0x90/0xe0
>
> Feb 27 09:54:59 Server21 kernel: [ 1439.134927]  [<ffffffff81664a82>] 
> system_call_fastpath+0x16/0x1b
>
> -------------------------------------------------------------------------------------------------------------------------------------
> ??????????????????????????,?????????????
> ?????????????????????(??????????????????
> ???)?????????????????,??????????????????
> ??!
> This e-mail and its attachments contain confidential information from 
> H3C, which is
> intended only for the person or entity whose address is listed above. 
> Any use of the
> information contained herein in any way (including, but not limited 
> to, total or partial
> disclosure, reproduction, or dissemination) by persons other than the 
> intended
> recipient(s) is prohibited. If you receive this e-mail in error, 
> please notify the sender
> by phone or email immediately and delete it!
>
>
> _______________________________________________
> Ocfs2-devel mailing list
> Ocfs2-devel at oss.oracle.com
> https://oss.oracle.com/mailman/listinfo/ocfs2-devel

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://oss.oracle.com/pipermail/ocfs2-devel/attachments/20130226/be7e9ee4/attachment-0001.html 


More information about the Ocfs2-devel mailing list