[Ocfs2-users] kernel panic on redhat 5-7 x64
Le Duy Phuong Nam
namldp at thienphat.biz
Tue Jul 3 19:22:18 PDT 2012
Hi all,
I am using OCFS2-1.4.7 for 2 servers which is running Red hat enterprise 5.7
kernel 2.6.18-274.el5.
OCFS2 I use for drdb for replicating master-master. My 2 servers was
installed HA-Proxy.
Yesterday, server web1 was down with the log kernel panic. And today, web2
was down too. After that, I trace the log file on these server and found
that the reason from ocfs2.
The log like this:
Jul 3 10:58:37 web1 kernel: d-con r0: PingAck did not arrive in time.
Jul 3 10:58:37 web1 kernel: d-con r0: peer( Primary -> Unknown ) conn(
Connected -> NetworkFailure ) pdsk( UpToDate -> DUnknown )
Jul 3 10:58:37 web1 kernel: d-con r0: asender terminated
Jul 3 10:58:37 web1 kernel: d-con r0: Terminating asender thread
Jul 3 10:58:37 web1 kernel: d-con r0: error receiving Data, e: -5 l: 4096!
Jul 3 10:58:37 web1 kernel: block drbd0: new current UUID
A69EE0FA8CB9B85D:C9BABEF0844508EB:2F0151CEDDA9713A:2F0051CEDDA9713B
Jul 3 10:58:37 web1 kernel: d-con r0: Connection closed
Jul 3 10:58:37 web1 kernel: d-con r0: conn( NetworkFailure -> Unconnected )
Jul 3 10:58:37 web1 kernel: d-con r0: receiver terminated
Jul 3 10:58:37 web1 kernel: d-con r0: Restarting receiver thread
Jul 3 10:58:37 web1 kernel: d-con r0: receiver (re)started
Jul 3 10:58:37 web1 kernel: d-con r0: conn( Unconnected -> WFConnection )
Jul 3 10:58:53 web1 kernel: d-con r0: Handshake successful: Agreed network
protocol version 100
Jul 3 10:58:53 web1 kernel: d-con r0: Peer authenticated using 20 bytes
HMAC
Jul 3 10:58:53 web1 kernel: d-con r0: conn( WFConnection -> WFReportParams
)
Jul 3 10:58:53 web1 kernel: d-con r0: Starting asender thread (from
drbd_r_r0 [1164])
Jul 3 10:58:53 web1 kernel: block drbd0: drbd_sync_handshake:
Jul 3 10:58:53 web1 kernel: block drbd0: self
A69EE0FA8CB9B85D:C9BABEF0844508EB:2F0151CEDDA9713A:2F0051CEDDA9713B bits:466
flags:0
Jul 3 10:58:53 web1 kernel: block drbd0: peer
3ED53D15A1945AAF:C9BABEF0844508EB:2F0151CEDDA9713B:2F0051CEDDA9713B bits:49
flags:0
Jul 3 10:58:53 web1 kernel: block drbd0: uuid_compare()=100 by rule 90
Jul 3 10:58:53 web1 kernel: block drbd0: helper command: /sbin/drbdadm
initial-split-brain minor-0
Jul 3 10:58:53 web1 kernel: block drbd0: helper command: /sbin/drbdadm
initial-split-brain minor-0 exit code 0 (0x0)
Jul 3 10:58:53 web1 kernel: block drbd0: Split-Brain detected but
unresolved, dropping connection!
Jul 3 10:58:53 web1 kernel: block drbd0: helper command: /sbin/drbdadm
split-brain minor-0
Jul 3 10:58:53 web1 kernel: block drbd0: helper command: /sbin/drbdadm
split-brain minor-0 exit code 0 (0x0)
Jul 3 10:58:53 web1 kernel: d-con r0: conn( WFReportParams -> Disconnecting
)
Jul 3 10:58:53 web1 kernel: d-con r0: error receiving ReportState, e: -5 l:
0!
Jul 3 10:58:53 web1 kernel: d-con r0: asender terminated
Jul 3 10:58:53 web1 kernel: d-con r0: Terminating asender thread
Jul 3 10:58:53 web1 kernel: d-con r0: Connection closed
Jul 3 10:58:53 web1 kernel: d-con r0: conn( Disconnecting -> StandAlone )
Jul 3 10:58:53 web1 kernel: d-con r0: receiver terminated
Jul 3 10:58:53 web1 kernel: d-con r0: Terminating receiver thread
Jul 3 10:58:54 web1 kernel: (httpd,11395,3):ocfs2_truncate_file:425 ERROR:
bug expression: le64_to_cpu(fe->i_size) != i_size_read(inode)
Jul 3 10:58:54 web1 kernel: (httpd,11395,3):ocfs2_truncate_file:425 ERROR:
Inode 389752, inode i_size = 28059 != di i_size = 17004, i_flags = 0x1
Jul 3 10:58:54 web1 kernel: ----------- [cut here ] --------- [please bite
here ] ---------
Jul 3 10:58:54 web1 kernel: Kernel BUG at
...rpmbuild/xiaowei/BUILD/ocfs2-1.4.7/fs/ocfs2/file.c:425
Is there anyone meet the same situation? Please help me
Thanks and Regards,
Namldp
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://oss.oracle.com/pipermail/ocfs2-users/attachments/20120704/b965b6a3/attachment-0001.html
More information about the Ocfs2-users
mailing list