<html><head><style type="text/css"><!-- DIV {margin:0px;} --></style></head><body><div style="font-family:times new roman, new york, times, serif;font-size:12pt"><div>Hi all, looking at the logs more carefully, I found that at the moment a node goes down the kernel logs this information about o2net:<br><br><font size="2">Jul 10 16:52:02 be1 kernel: BUG: soft lockup - CPU#0 stuck for 10s! [o2net:6814]
<br>Jul 10 16:52:02 be1 kernel: CPU 0:
<br>Jul 10 16:52:02 be1 kernel: Modules linked in: ocfs2(U) ocfs2_dlmfs(U) ocfs2_dlm
<br>parport shpchp ide_cd cdrom i2c_i801 i5000_edac i2c_core serio_raw edac_mc bnx2
<br>Jul 10 16:52:02 be1 kernel: Pid: 6814, comm: o2net Tainted: G 2.6.18-92.el5
<br>Jul 10 16:52:02 be1 kernel: RIP: 0010:[<ffffffff80064b57>] [<ffffffff80064b57>]
<br>Jul 10 16:52:02 be1 kernel: RSP: 0018:ffff81043f281d28 EFLAGS: 00000246
<br>Jul 10 16:52:02 be1 kernel: RAX: ffff810316b02828 RBX: ffff810440656018 RCX: 000
<br>Jul 10 16:52:02 be1 kernel: RDX: 0000000000000001 RSI: 0000000000000286 RDI: fff
<br>Jul 10 16:52:02 be1 kernel: RBP: ffff810367456c20 R08: ffff810316b02838 R09: fff
<br>Jul 10 16:52:02 be1 kernel: R10: ffff810316b02858 R11: 000000000000fa55 R12: fff
<br>Jul 10 16:52:02 be1 kernel: R13: 0000000000000044 R14: 000000000000001f R15: 000
<br>Jul 10 16:52:02 be1 kernel: FS: 0000000000000000(0000) GS:ffffffff8039e000(0000
<br>Jul 10 16:52:02 be1 kernel: CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
<br>Jul 10 16:52:02 be1 kernel: CR2: 000000001c1b6ec8 CR3: 0000000449592000 CR4: 000
<br>Jul 10 16:52:02 be1 kernel:
<br>Jul 10 16:52:02 be1 kernel: Call Trace:
<br>Jul 10 16:52:02 be1 kernel: [<ffffffff884e7b0b>] :ocfs2_dlm:dlm_assert_master_h
<br>Jul 10 16:52:02 be1 kernel: [<ffffffff884ab15e>] :ocfs2_nodemanager:o2net_proce
<br>Jul 10 16:52:02 be1 kernel: [<ffffffff884ace20>] :ocfs2_nodemanager:o2net_rx_un
<br>Jul 10 16:52:02 be1 kernel: [<ffffffff884ac5d2>] :ocfs2_nodemanager:o2net_rx_un
<br>Jul 10 16:52:02 be1 kernel: [<ffffffff8004cea9>] run_workqueue+0x94/0xe4
<br>Jul 10 16:52:02 be1 kernel: [<ffffffff800497be>] worker_thread+0x0/0x122
<br>Jul 10 16:52:02 be1 kernel: [<ffffffff8009dbca>] keventd_create_kthread+0x0/0xc
<br>Jul 10 16:52:02 be1 kernel: [<ffffffff800498ae>] worker_thread+0xf0/0x122
<br>Jul 10 16:52:02 be1 kernel: [<ffffffff8008ac03>] default_wake_function+0x0/0xe
<br>Jul 10 16:52:02 be1 kernel: [<ffffffff8009dbca>] keventd_create_kthread+0x0/0xc
<br>Jul 10 16:52:02 be1 kernel: [<ffffffff8009dbca>] keventd_create_kthread+0x0/0xc
<br>Jul 10 16:52:02 be1 kernel: [<ffffffff8003253d>] kthread+0xfe/0x132
<br>Jul 10 16:52:02 be1 kernel: [<ffffffff8005dfb1>] child_rip+0xa/0x11
<br>Jul 10 16:52:03 be1 kernel: [<ffffffff8009dbca>] keventd_create_kthread+0x0/0xc
<br>Jul 10 16:52:03 be1 kernel: [<ffffffff8002881b>] sync_page+0x0/0x42
<br>Jul 10 16:52:03 be1 kernel: [<ffffffff8003243f>] kthread+0x0/0x132
<br>Jul 10 16:52:03 be1 kernel: [<ffffffff8005dfa7>] child_rip+0x0/0x11</font>
<br><br>---------------------------------------------------------------------------------<br><br>Can somebody help me understand what this means?<br><br>Thanks<br></div><div style="font-family: times new roman,new york,times,serif; font-size: 12pt;"><br><div style="font-family: times new roman,new york,times,serif; font-size: 12pt;">----- Original message -----<br>From: Gabriele Di Giambelardini <gabriele_d_g@yahoo.it><br>To: V Srinivas <vaungasrinu@gmail.com><br>Cc: ocfs2-users@oss.oracle.com<br>Sent: Monday, 30 June 2008, 15:56:35<br>Subject: Re: [Ocfs2-users] Fence abnormal and with not apparent reason<br><br><div style="font-family: times new roman,new york,times,serif; font-size: 12pt;"><div>Hi, this is the output on all 5 servers:<br><br>Module "configfs": Loaded
<br>Filesystem "configfs": Mounted
<br>Module "ocfs2_nodemanager": Loaded
<br>Module "ocfs2_dlm": Loaded
<br>Module "ocfs2_dlmfs": Loaded
<br>Filesystem "ocfs2_dlmfs": Mounted
<br>Checking O2CB cluster ocfs2: Online
<br> Heartbeat dead threshold: 61
<br> Network idle timeout: 60000
<br> Network keepalive delay: 2000
<br> Network reconnect delay: 2000
<br>Checking O2CB heartbeat: Active
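<br><br>A rough sketch in Python of how the timeout values above translate into seconds. It assumes the relation given in the OCFS2 documentation, where the disk heartbeat runs every 2 seconds and a node is declared dead after (threshold - 1) iterations; the function names are only illustrative and are not part of any OCFS2 tool.<br>
<pre>
```python
# Values copied from the "service o2cb status" output above.
HEARTBEAT_DEAD_THRESHOLD = 61    # disk heartbeat iterations
NETWORK_IDLE_TIMEOUT_MS = 60000  # milliseconds


def disk_heartbeat_timeout_s(threshold, interval_s=2):
    """Seconds without a disk heartbeat before a node is declared dead.

    Assumption: heartbeat interval of 2 seconds and the relation
    timeout = (threshold - 1) * interval, as in the OCFS2 documentation.
    """
    return (threshold - 1) * interval_s


def network_idle_timeout_s(timeout_ms):
    """Convert the o2net idle timeout from milliseconds to seconds."""
    return timeout_ms / 1000.0


if __name__ == "__main__":
    print(disk_heartbeat_timeout_s(HEARTBEAT_DEAD_THRESHOLD))  # 120
    print(network_idle_timeout_s(NETWORK_IDLE_TIMEOUT_MS))     # 60.0
```
</pre>
Under these assumptions, the 60000 ms network idle timeout corresponds to 60.0 seconds, and the disk heartbeat timeout would be 120 seconds.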
<br><br>Thanks<br><br><br></div><div style="font-family: times new roman,new york,times,serif; font-size: 12pt;"><br><div style="font-family: times new roman,new york,times,serif; font-size: 12pt;">----- Original message -----<br>From: V Srinivas <vaungasrinu@gmail.com><br>To: Gabriele Di Giambelardini <gabriele_d_g@yahoo.it><br>Sent: Monday, 30 June 2008, 13:07:31<br>Subject: Re: [Ocfs2-users] Fence abnormal and with not apparent reason<br><br>Please send me the output of "service o2cb status" for those servers.<br><br><br><div><span class="gmail_quote">On 30/06/2008, <b class="gmail_sendername">Gabriele Di Giambelardini</b> <<a rel="nofollow" ymailto="mailto:gabriele_d_g@yahoo.it" target="_blank" href="mailto:gabriele_d_g@yahoo.it">gabriele_d_g@yahoo.it</a>> wrote:</span><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;"><div><div style="font-family: times new
roman,new york,times,serif; font-size: 12pt;"><div>Hi all, I have a serious and puzzling problem.<br>Here is the situation:<br>I
have 5 Linux servers and 1 IBM SAN. Every server runs ocfs2, and
I can see them all in ocfs2-console. The servers are connected through a
dedicated network.<br>The problem is that sometimes I get this message on one of the servers:<br><span style="font-family: monospace;"><br></span>kernel: o2net: connection to node <a rel="nofollow" target="_blank" href="http://test.test.it">test.test.it</a> (num 1) at <a rel="nofollow" target="_blank" href="http://10.10.10.1:7777">10.10.10.1:7777</a> has been idle for 60.0 seconds, shutting it down.<br><br>So
the server gets fenced, but when it comes back up it cannot start ocfs2
or mount the partition. To recover, I have to fence all the servers, after which
everything works well again.<br>I have noticed that if I am not quick to fence all the servers, other nodes also report "shutting it down".<br><br><br>Can somebody
help me? This is really important for me.<br><br>My servers:<br><br>- Red Hat Enterprise Linux Server release 5<br>2.6.18-8.el5 #1 SMP Fri Jan 26 14:15:14 EST 2007 x86_64 x86_64 x86_64 GNU/Linux<br><br>- ocfs2-2.6.18-8.el5-1.2.8-2.el5<br>
 ocfs2-tools-1.2.7-1.el5<br> ocfs2console-1.2.7-1.el5<br> ocfs2-tools-debuginfo-1.2.6-1.el5<br> ocfs2-2.6.18-92.1.1.el5-1.2.9-1.el5<br><br>- OCFS2 1.2.8 Tue Jan 22 11:58:16 PST 2008 (build 9c7ae8bb50ef6d8791df2912775adcc5)<br><br>Thanks in advance for any suggestions.<br></div></div></div><br>_______________________________________________<br>
Ocfs2-users mailing list<br><a rel="nofollow" ymailto="mailto:Ocfs2-users@oss.oracle.com" target="_blank" href="mailto:Ocfs2-users@oss.oracle.com">Ocfs2-users@oss.oracle.com</a><br><a rel="nofollow" target="_blank" href="http://oss.oracle.com/mailman/listinfo/ocfs2-users">http://oss.oracle.com/mailman/listinfo/ocfs2-users</a><br></blockquote></div><br></div></div></div><br>
</div></div></div><br>
</body></html>