[Ocfs2-users] Unstable Cluster

Tony Rios tony at tonyrios.com
Fri Dec 9 00:42:25 PST 2011


I managed to get ahold of the kernel panic message because it's happening on any new machines I try to introduce to the cluster:

[   66.276054] OCFS2 1.5.0
[   66.337531] o2dlm: Nodes in domain A3AA504BE42E4D3D8A15248D8FCD49BB: 3 5 
[   66.380092] ocfs2: Mounting device (8,16) on (node 5, slot 2) with ordered data mode.
[   66.401719] (ocfs2rec,1382,0):ocfs2_replay_journal:1601 Recovering node 1 from slot 1 on device (8,16)
[  118.890439] o2net: connected to node pedge33 (num 1) at 10.88.0.33:7777
[  118.911765] o2net: connected to node pedge38 (num 4) at 10.88.0.38:7777
[  240.440024] INFO: task kworker/u:3:46 blocked for more than 120 seconds.
[  240.460495] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[  240.481125] kworker/u:3     D 0000000000000001     0    46      2 0x00000000
[  240.501761]  ffff88020338dbd0 0000000000000046 ffff88020338dfd8 ffff88020338c000
[  240.522547]  0000000000013d00 ffff88020333c858 ffff88020338dfd8 0000000000013d00
[  240.543180]  ffff8802034c96e0 ffff88020333c4a0 ffff880205105b80 0000000000000001
[  240.563579] Call Trace:
[  240.583607]  [<ffffffffa03c01dd>] ocfs2_wait_for_recovery+0x7d/0xd0 [ocfs2]
[  240.604038]  [<ffffffff81087fb0>] ? autoremove_wake_function+0x0/0x40
[  240.624265]  [<ffffffffa03a8088>] ocfs2_inode_lock_full_nested+0x268/0x6a0 [ocfs2]
[  240.645091]  [<ffffffffa03b6026>] ? ocfs2_node_map_set_bit+0x46/0x60 [ocfs2]
[  240.666163]  [<ffffffffa03bc018>] ocfs2_queue_orphans+0x68/0x260 [ocfs2]
[  240.687431]  [<ffffffff81038c79>] ? default_spin_lock_flags+0x9/0x10
[  240.708689]  [<ffffffffa03bd3d4>] ocfs2_recover_orphans+0x54/0x230 [ocfs2]
[  240.729977]  [<ffffffffa03bbe1c>] ? __ocfs2_wait_on_mount+0xcc/0x140 [ocfs2]
[  240.751404]  [<ffffffff81087fb0>] ? autoremove_wake_function+0x0/0x40
[  240.772669]  [<ffffffffa03c0417>] ocfs2_complete_recovery+0x1e7/0x690 [ocfs2]
[  240.793894]  [<ffffffffa03c0230>] ? ocfs2_complete_recovery+0x0/0x690 [ocfs2]
[  240.814874]  [<ffffffff8108284d>] process_one_work+0x11d/0x420
[  240.835518]  [<ffffffff810832e9>] worker_thread+0x169/0x360
[  240.855910]  [<ffffffff81083180>] ? worker_thread+0x0/0x360
[  240.876296]  [<ffffffff81087866>] kthread+0x96/0xa0
[  240.896305]  [<ffffffff8100ce24>] kernel_thread_helper+0x4/0x10
[  240.915969]  [<ffffffff810877d0>] ? kthread+0x0/0xa0
[  240.935412]  [<ffffffff8100ce20>] ? kernel_thread_helper+0x0/0x10
[  240.954898] INFO: task ureadahead:1384 blocked for more than 120 seconds.
[  240.974688] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[  240.994902] ureadahead      D 0000000000000001     0  1384      1 0x00000000
[  241.015064]  ffff880206991d08 0000000000000086 ffff880206991fd8 ffff880206990000
[  241.035271]  0000000000013d00 ffff8802051ac858 ffff880206991fd8 0000000000013d00
[  241.055316]  ffff8802068216e0 ffff8802051ac4a0 ffff880206991ce8 0000000000000001
[  241.075007] Call Trace:
[  241.094061]  [<ffffffffa03c01dd>] ocfs2_wait_for_recovery+0x7d/0xd0 [ocfs2]
[  241.113431]  [<ffffffff81087fb0>] ? autoremove_wake_function+0x0/0x40
[  241.132382]  [<ffffffffa03a8088>] ocfs2_inode_lock_full_nested+0x268/0x6a0 [ocfs2]
[  241.151399]  [<ffffffffa03b8812>] ocfs2_inode_revalidate+0x72/0x2c0 [ocfs2]
[  241.170682]  [<ffffffffa03b0839>] ocfs2_getattr+0x59/0x1d0 [ocfs2]
[  241.189648]  [<ffffffff81169521>] vfs_getattr+0x51/0x120
[  241.208176]  [<ffffffff81169648>] vfs_fstatat+0x58/0x70
[  241.226132]  [<ffffffff8116969b>] vfs_stat+0x1b/0x20
[  241.243766]  [<ffffffff811698da>] sys_newstat+0x1a/0x40
[  241.261254]  [<ffffffff815c3955>] ? page_fault+0x25/0x30
[  241.278749]  [<ffffffff8100c002>] system_call_fastpath+0x16/0x1b




More information about the Ocfs2-users mailing list