<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2//EN">
<HTML>
<HEAD>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=iso-8859-1">
<META NAME="Generator" CONTENT="MS Exchange Server version 5.5.2658.34">
<TITLE>RE: [Ocfs2-users] Getting error when mounting shared OCFS2 device.</TITLE>
</HEAD>
<BODY>
<P><FONT SIZE=2>Removed the VIPs from /etc/hosts and restarted the cluster, but nothing changed. I still cannot mount /dev/md0 on both nodes.</FONT>
<BR><FONT SIZE=2>Do I need to reboot the servers after changing /etc/hosts? Any other suggestions?</FONT>
<BR><FONT SIZE=2>Thanks,</FONT>
</P>
<P><FONT SIZE=2>Sachin Vaidya </FONT>
<BR><FONT SIZE=2>Infrastructure Management Senior Analyst </FONT>
<BR><FONT SIZE=2>Affiliated Computer Services </FONT>
</P>
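<P><FONT SIZE=2>A minimal Python sketch of one way to check the connection that the mount depends on. It assumes the node table from the cluster.conf quoted below and the rule, explained further down in this thread, that the higher-numbered node opens the connection to the lower-numbered one. The names NODES, peers_to_dial and can_connect are made up for this illustration, and the 10-second timeout simply mirrors the figure in the dmesg message. It only tests that a TCP connection can be opened; it does not speak the o2net protocol.</FONT></P>
<PRE>
```python
import socket

# Node table copied from the cluster.conf quoted in this thread.
NODES = {
    0: ("172.18.11.12", 7777),  # acspittdw001
    1: ("172.18.11.13", 7777),  # acspittdw002
}


def peers_to_dial(local_number, nodes):
    """Return the node numbers that the local node connects to.

    Follows the rule quoted in the thread: the higher-numbered node
    initiates the connection to each lower-numbered node.
    """
    # range(local_number) holds exactly the numbers below local_number.
    return [n for n in sorted(nodes) if n in range(local_number)]


def can_connect(host, port, timeout=10.0):
    """Try a plain TCP connect; True on success, False on any socket error."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def report(local_number, nodes):
    """Print one line per peer that the local node would have to reach."""
    for peer in peers_to_dial(local_number, nodes):
        host, port = nodes[peer]
        state = "reachable" if can_connect(host, port) else "NOT reachable"
        print(f"node {peer} at {host}:{port} is {state}")
```
</PRE>
<P><FONT SIZE=2>Calling report(1, NODES) on acspittdw002, with the cluster stack running on acspittdw001, should show whether 172.18.11.12:7777 accepts TCP connections from that host.</FONT></P>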
<BR>
<P><FONT SIZE=2> -----Original Message-----</FONT>
<BR><FONT SIZE=2>From: Sunil Mushran [<A HREF="mailto:Sunil.Mushran@oracle.com">mailto:Sunil.Mushran@oracle.com</A>] </FONT>
<BR><FONT SIZE=2>Sent: Thursday, March 30, 2006 5:34 PM</FONT>
<BR><FONT SIZE=2>To: Vaidya, Sachin</FONT>
<BR><FONT SIZE=2>Cc: 'ocfs2-users@oss.oracle.com'</FONT>
<BR><FONT SIZE=2>Subject: Re: [Ocfs2-users] Getting error when mounting shared OCFS2 device.</FONT>
</P>
<P><FONT SIZE=2>Remove vip and mount on both. See if that helps.</FONT>
</P>
<P><FONT SIZE=2>Vaidya, Sachin wrote:</FONT>
<BR><FONT SIZE=2>></FONT>
<BR><FONT SIZE=2>> Hi,</FONT>
<BR><FONT SIZE=2>> Tried both public and private IP addresses but still not able to mount </FONT>
<BR><FONT SIZE=2>> device on both nodes.</FONT>
<BR><FONT SIZE=2>> Here are my configuration details.</FONT>
<BR><FONT SIZE=2>> hosts file : same on both nodes.</FONT>
<BR><FONT SIZE=2>></FONT>
<BR><FONT SIZE=2>> 127.0.0.1 localhost.localdomain localhost</FONT>
<BR><FONT SIZE=2>> 172.18.11.12 acspittdw001 acspittdw001.servicemetrics.net</FONT>
<BR><FONT SIZE=2>> 172.18.22.1 priv-acspittdw001</FONT>
<BR><FONT SIZE=2>> 172.18.11.24 vip-acspittdw001</FONT>
<BR><FONT SIZE=2>> 172.18.11.13 acspittdw002 acspittdw002.servicemetrics.net</FONT>
<BR><FONT SIZE=2>> 172.18.22.2 priv-acspittdw002</FONT>
<BR><FONT SIZE=2>> 172.18.11.25 vip-acspittdw002</FONT>
<BR><FONT SIZE=2>></FONT>
<BR><FONT SIZE=2>> The cluster.conf is the same on both nodes:</FONT>
<BR><FONT SIZE=2>> node:</FONT>
<BR><FONT SIZE=2>> ip_port = 7777</FONT>
<BR><FONT SIZE=2>> ip_address = 172.18.11.12</FONT>
<BR><FONT SIZE=2>> number = 0</FONT>
<BR><FONT SIZE=2>> name = acspittdw001</FONT>
<BR><FONT SIZE=2>> cluster = ocfs2</FONT>
<BR><FONT SIZE=2>></FONT>
<BR><FONT SIZE=2>> node:</FONT>
<BR><FONT SIZE=2>> ip_port = 7777</FONT>
<BR><FONT SIZE=2>> ip_address = 172.18.11.13</FONT>
<BR><FONT SIZE=2>> number = 1</FONT>
<BR><FONT SIZE=2>> name = acspittdw002</FONT>
<BR><FONT SIZE=2>> cluster = ocfs2</FONT>
<BR><FONT SIZE=2>></FONT>
<BR><FONT SIZE=2>> cluster:</FONT>
<BR><FONT SIZE=2>> node_count = 2</FONT>
<BR><FONT SIZE=2>> name = ocfs2</FONT>
<BR><FONT SIZE=2>></FONT>
<BR><FONT SIZE=2>> Both nodes can ping each other on their public and private IPs.</FONT>
<BR><FONT SIZE=2>> The mount command produces the following error on node 2 when the device is </FONT>
<BR><FONT SIZE=2>> already mounted on node 1.</FONT>
<BR><FONT SIZE=2>></FONT>
<BR><FONT SIZE=2>> [root@acspittdw002 ~]# mount -t ocfs2 /dev/md0 /crs1</FONT>
<BR><FONT SIZE=2>> mount.ocfs2: Transport endpoint is not connected while mounting </FONT>
<BR><FONT SIZE=2>> /dev/md0 on /crs1</FONT>
<BR><FONT SIZE=2>> [root@acspittdw002 ~]#</FONT>
<BR><FONT SIZE=2>></FONT>
<BR><FONT SIZE=2>> dmesg shows the following messages:</FONT>
<BR><FONT SIZE=2>></FONT>
<BR><FONT SIZE=2>> SELinux: initialized (dev debugfs, type debugfs), uses genfs_contexts</FONT>
<BR><FONT SIZE=2>> (5027,2):ocfs2_initialize_super:1354 max_slots for this device: 8</FONT>
<BR><FONT SIZE=2>> (5027,2):ocfs2_fill_local_node_info:1031 I am node 1</FONT>
<BR><FONT SIZE=2>> (4986,2):o2net_connect_expired:1446 ERROR: no connection established </FONT>
<BR><FONT SIZE=2>> with node 0 after 10 seconds, giving up and returning errors.</FONT>
<BR><FONT SIZE=2>></FONT>
<BR><FONT SIZE=2>> (5027,2):dlm_request_join:771 ERROR: status = -107</FONT>
<BR><FONT SIZE=2>> (5027,2):dlm_try_to_join_domain:919 ERROR: status = -107</FONT>
<BR><FONT SIZE=2>> (5027,2):dlm_join_domain:1164 ERROR: status = -107</FONT>
<BR><FONT SIZE=2>> (5027,2):dlm_register_domain:1354 ERROR: status = -107</FONT>
<BR><FONT SIZE=2>> (5027,2):ocfs2_dlm_init:1996 ERROR: status = -107</FONT>
<BR><FONT SIZE=2>> (5027,2):ocfs2_mount_volume:1063 ERROR: status = -107</FONT>
<BR><FONT SIZE=2>> ocfs2: Unmounting device (9,0) on (node 1)</FONT>
<BR><FONT SIZE=2>> [root@acspittdw002 ~]#</FONT>
<BR><FONT SIZE=2>></FONT>
<BR><FONT SIZE=2>> Any idea why this is happening ?</FONT>
<BR><FONT SIZE=2>> I can provide more details if needed.</FONT>
<BR><FONT SIZE=2>> Any help will be greatly appreciated.</FONT>
<BR><FONT SIZE=2>> Thanks in advance.</FONT>
<BR><FONT SIZE=2>> - Sachin Vaidya.</FONT>
<BR><FONT SIZE=2>></FONT>
<BR><FONT SIZE=2>></FONT>
<BR><FONT SIZE=2>></FONT>
<BR><FONT SIZE=2>> -----Original Message-----</FONT>
<BR><FONT SIZE=2>> From: Sunil Mushran</FONT>
<BR><FONT SIZE=2>> To: Vaidya, Sachin</FONT>
<BR><FONT SIZE=2>> Cc: 'ocfs2-users@oss.oracle.com'</FONT>
<BR><FONT SIZE=2>> Sent: 3/29/2006 7:16 PM</FONT>
<BR><FONT SIZE=2>> Subject: Re: [Ocfs2-users] Getting error when mounting </FONT>
<BR><FONT SIZE=2>> shared OCFS2 device.</FONT>
<BR><FONT SIZE=2>></FONT>
<BR><FONT SIZE=2>> Connection failure. Check dmesg.</FONT>
<BR><FONT SIZE=2>></FONT>
<BR><FONT SIZE=2>> Mount triggers the heartbeat thread which triggers the o2net</FONT>
<BR><FONT SIZE=2>> to make a connection to all heartbeating nodes. If this connection</FONT>
<BR><FONT SIZE=2>> fails,</FONT>
<BR><FONT SIZE=2>> the mount fails. (The larger node number initiates the connection</FONT>
<BR><FONT SIZE=2>> to the lower node number.)</FONT>
<BR><FONT SIZE=2>></FONT>
<BR><FONT SIZE=2>> An obvious error would be an incorrect ip_address specified in cluster.conf.</FONT>
<BR><FONT SIZE=2>> Error messages in /var/log/messages on both nodes will</FONT>
<BR><FONT SIZE=2>> provide more clues.</FONT>
<BR><FONT SIZE=2>></FONT>
<BR><FONT SIZE=2>> Vaidya, Sachin wrote:</FONT>
<BR><FONT SIZE=2>> ></FONT>
<BR><FONT SIZE=2>> > Hi,</FONT>
<BR><FONT SIZE=2>> ></FONT>
<BR><FONT SIZE=2>> > I am using RHEL4 2.6.9-34.ELsmp with OCFS2 1.2.</FONT>
<BR><FONT SIZE=2>> ></FONT>
<BR><FONT SIZE=2>> > The h/w for this 2 node cluster is connected correctly.</FONT>
<BR><FONT SIZE=2>> ></FONT>
<BR><FONT SIZE=2>> > After loading ocfs2 on both nodes, the shared device could only be</FONT>
<BR><FONT SIZE=2>> > mounted on one node. When I try to mount the same shared device on the second</FONT>
<BR><FONT SIZE=2>> > node, I get the following error.</FONT>
<BR><FONT SIZE=2>> ></FONT>
<BR><FONT SIZE=2>> > mount.ocfs2: Transport endpoint is not connected while mounting</FONT>
<BR><FONT SIZE=2>> > /dev/md0 on /crs1</FONT>
<BR><FONT SIZE=2>> ></FONT>
<BR><FONT SIZE=2>> > Any idea, why this is happening ?</FONT>
<BR><FONT SIZE=2>> ></FONT>
<BR><FONT SIZE=2>> > Any help will be highly appreciated.</FONT>
<BR><FONT SIZE=2>> ></FONT>
<BR><FONT SIZE=2>> > Thanks,</FONT>
<BR><FONT SIZE=2>> ></FONT>
<BR><FONT SIZE=2>> > Sachin Vaidya</FONT>
<BR><FONT SIZE=2>> ></FONT>
<BR><FONT SIZE=2>> > </FONT>
<BR><FONT SIZE=2>> ></FONT>
<BR><FONT SIZE=2>> ></FONT>
<BR><FONT SIZE=2>> ------------------------------------------------------------------------</FONT>
<BR><FONT SIZE=2>> ></FONT>
<BR><FONT SIZE=2>> > _______________________________________________</FONT>
<BR><FONT SIZE=2>> > Ocfs2-users mailing list</FONT>
<BR><FONT SIZE=2>> > Ocfs2-users@oss.oracle.com</FONT>
<BR><FONT SIZE=2>> > <A HREF="http://oss.oracle.com/mailman/listinfo/ocfs2-users" TARGET="_blank">http://oss.oracle.com/mailman/listinfo/ocfs2-users</A></FONT>
<BR><FONT SIZE=2>> > </FONT>
<BR><FONT SIZE=2>></FONT>
</P>
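<P><FONT SIZE=2>The "status = -107" lines in the dmesg output above are negated errno values. A small Python sketch, assuming a Linux host (where errno 107 is ENOTCONN), shows one way to decode them. The helper name describe_kernel_status is hypothetical and is not part of any OCFS2 tool.</FONT></P>
<PRE>
```python
import errno
import os


def describe_kernel_status(status):
    """Map a negative kernel status such as -107 to (errno name, message)."""
    code = abs(status)  # kernel code reports errors as negated errno values
    name = errno.errorcode.get(code, "unknown")
    return name, os.strerror(code)
```
</PRE>
<P><FONT SIZE=2>On Linux, describe_kernel_status(-107) returns the name ENOTCONN with the message "Transport endpoint is not connected", which is the same text that mount.ocfs2 printed.</FONT></P>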
</BODY>
</HTML>