[Ocfs2-users] O2CB heartbeat not active on 2nd node
McKinley, Reid
RMckinley at tiaa-cref.org
Wed Jun 3 09:51:04 PDT 2009
We are having trouble getting the 2nd node in our 2 node RAC
configuration to have an active O2CB heartbeat. We have our OCR and
voting disks on an OCFS2 mount point, so we cannot bring up Clusterware
on this node.
I'm at a loss as to what the issue is. It was running fine for a few
weeks, then we had a reboot and we cannot get the heartbeat active and
we cannot mount any OCFS2 filesystems on the 2nd node.
Any ideas are greatly appreciated.
Dmesg errors are at the bottom.
Here are the rpm and status details:
[root at nyclx1 ~]# rpm -qa | grep ocfs2
ocfs2-tools-1.4.1-1.el5
ocfs2console-1.4.1-1.el5
ocfs2-2.6.18-92.el5-1.4.1-1.el5
ocfs2-2.6.18-92.el5debug-1.2.8-2.el5
ocfs2-2.6.18-92.el5-debuginfo-1.4.1-1.el5
ocfs2-tools-debuginfo-1.4.1-1.el5
[root at nyclx1 ~]# /etc/init.d/o2cb status
Driver for "configfs": Loaded
Filesystem "configfs": Mounted
Driver for "ocfs2_dlmfs": Loaded
Filesystem "ocfs2_dlmfs": Mounted
Checking O2CB cluster tiaa: Online
Heartbeat dead threshold = 31
Network idle timeout: 30000
Network keepalive delay: 2000
Network reconnect delay: 2000
Checking O2CB heartbeat: Active
[root at nyclx2 ~]# rpm -qa | grep ocfs2
ocfs2-tools-1.4.1-1.el5
ocfs2-2.6.18-92.el5-1.4.1-1.el5
ocfs2-tools-debuginfo-1.4.1-1.el5
ocfs2console-1.4.1-1.el5
ocfs2-2.6.18-92.el5debug-1.2.8-2.el5
ocfs2-2.6.18-92.el5-debuginfo-1.4.1-1.el5
[root at nyclx2 ~]# /etc/init.d/o2cb status
Driver for "configfs": Loaded
Filesystem "configfs": Mounted
Driver for "ocfs2_dlmfs": Loaded
Filesystem "ocfs2_dlmfs": Mounted
Checking O2CB cluster tiaa: Online
Heartbeat dead threshold = 31
Network idle timeout: 30000
Network keepalive delay: 2000
Network reconnect delay: 2000
Checking O2CB heartbeat: Not active
OCFS2 1.4.1 Wed Jul 23 12:05:34 PDT 2008 (build
3fc82af4b5669945497b322b6aabd031)
(11296,1):o2net_connect_expired:1637 ERROR: no connection established
with node 0 after 30.0 seconds, giving up and returning errors.
(14212,1):dlm_request_join:1033 ERROR: status = -107
(14212,1):dlm_try_to_join_domain:1207 ERROR: status = -107
(14212,1):dlm_join_domain:1485 ERROR: status = -107
(14212,1):dlm_register_domain:1732 ERROR: status = -107
(14212,1):ocfs2_dlm_init:2662 ERROR: status = -107
(14212,1):ocfs2_mount_volume:1251 ERROR: status = -107
ocfs2: Unmounting device (253,2) on (node 1)
(11296,1):o2net_connect_expired:1637 ERROR: no connection established
with node 0 after 30.0 seconds, giving up and returning errors.
(14350,1):dlm_request_join:1033 ERROR: status = -107
(14350,1):dlm_try_to_join_domain:1207 ERROR: status = -107
(14350,1):dlm_join_domain:1485 ERROR: status = -107
(14350,1):dlm_register_domain:1732 ERROR: status = -107
(14350,1):ocfs2_dlm_init:2662 ERROR: status = -107
(14350,1):ocfs2_mount_volume:1251 ERROR: status = -107
ocfs2: Unmounting device (253,3) on (node 1)
(11296,1):o2net_connect_expired:1637 ERROR: no connection established
with node 0 after 30.0 seconds, giving up and returning errors.
(4347,1):dlm_request_join:1033 ERROR: status = -107
(4347,1):dlm_try_to_join_domain:1207 ERROR: status = -107
(4347,1):dlm_join_domain:1485 ERROR: status = -107
(4347,1):dlm_register_domain:1732 ERROR: status = -107
(4347,1):ocfs2_dlm_init:2662 ERROR: status = -107
(4347,1):ocfs2_mount_volume:1251 ERROR: status = -107
ocfs2: Unmounting device (253,3) on (node 1)
(11296,1):o2net_connect_expired:1637 ERROR: no connection established
with node 0 after 30.0 seconds, giving up and returning errors.
(4948,1):dlm_request_join:1033 ERROR: status = -107
(4948,1):dlm_try_to_join_domain:1207 ERROR: status = -107
(4948,1):dlm_join_domain:1485 ERROR: status = -107
(4948,1):dlm_register_domain:1732 ERROR: status = -107
(4948,1):ocfs2_dlm_init:2662 ERROR: status = -107
(4948,1):ocfs2_mount_volume:1251 ERROR: status = -107
ocfs2: Unmounting device (253,3) on (node 1)
OCFS2 Node Manager 1.4.1 Wed Jul 23 12:05:37 PDT 2008 (build
0f78045c75c0174e50e4cf0934bf9eae)
OCFS2 DLM 1.4.1 Wed Jul 23 12:05:37 PDT 2008 (build
4ce8fae327880c466761f40fb7619490)
OCFS2 DLMFS 1.4.1 Wed Jul 23 12:05:37 PDT 2008 (build
4ce8fae327880c466761f40fb7619490)
OCFS2 User DLM kernel interface loaded
[root at nyclx2 ~]#
Reid McKinley
********************************************************************************************
This message, including any attachments, contains confidential information intended
for a specific individual and purpose, and is protected by law. If you are not the intended
recipient, please contact the sender immediately by reply e-mail and destroy all copies.
You are hereby notified that any disclosure, copying, or distribution of this message, or
the taking of any action based on it, is strictly prohibited.
TIAA-CREF
********************************************************************************************
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://oss.oracle.com/pipermail/ocfs2-users/attachments/20090603/ff17f145/attachment-0001.html
More information about the Ocfs2-users
mailing list