[Ocfs2-users] Getting error when mounting shared OCFS2 device.

Sunil Mushran Sunil.Mushran at oracle.com
Thu Mar 30 20:14:59 CST 2006


/etc/hosts is not the problem.
Run this on both nodes:
/sbin/ifconfig
Do you see the vip bound on the same interface as the
address used in cluster.conf?
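
If it helps, here is a rough Python sketch of a related check. It parses
cluster.conf-style text and tests whether each node's ip_address can be
bound on the local machine, which normally succeeds only when the address
is configured on some local interface. The function names are made up for
illustration, and the bind test is an assumption that can be fooled (for
example when net.ipv4.ip_nonlocal_bind is enabled). It also does not say
which interface holds the address, so /sbin/ifconfig remains the authority.

```python
import socket


def parse_cluster_conf(text):
    """Parse cluster.conf-style text into a list of (stanza_kind, settings) pairs."""
    stanzas = []
    settings = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.endswith(":") and "=" not in line:
            # A stanza header such as "node:" or "cluster:".
            settings = {}
            stanzas.append((line[:-1], settings))
        elif "=" in line and settings is not None:
            # A "key = value" line belonging to the current stanza.
            key, value = line.split("=", 1)
            settings[key.strip()] = value.strip()
    return stanzas


def is_local_address(ip):
    """Return True if a TCP socket can be bound to ip on this host.

    Assumption: binding fails for addresses not configured locally.
    """
    sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    try:
        sock.bind((ip, 0))  # port 0 lets the kernel pick any free port
        return True
    except OSError:
        return False
    finally:
        sock.close()


def local_node_names(text):
    """Names of the node stanzas whose ip_address is bound on this host."""
    return [
        settings["name"]
        for kind, settings in parse_cluster_conf(text)
        if kind == "node" and is_local_address(settings["ip_address"])
    ]
```

Called with the contents of cluster.conf (usually /etc/ocfs2/cluster.conf),
local_node_names() would be expected to return only the local node's own
name on each node. An empty list would suggest that the configured address
is not bound on that machine.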

Also, what does dmesg show on both nodes? The node with the lower
node number will log the ip that is trying to connect to it.
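
As a rough aid for reading those messages, the Python sketch below does
three things: it decodes a negative status such as the -107 in the quoted
dmesg output into its errno name, it lists which nodes a given node has to
connect to (the higher node number connects to the lower one), and it tries
a plain TCP connect to each of them. The helper names are hypothetical, the
node values are copied from the cluster.conf quoted in this thread, and a
successful TCP connect only shows that something is listening on that
ip:port. It does not prove that o2net will accept the node.

```python
import errno
import os
import socket

# Values copied from the cluster.conf quoted in this thread.
NODES = [
    {"name": "acspittdw001", "number": 0, "ip_address": "172.18.11.12", "ip_port": 7777},
    {"name": "acspittdw002", "number": 1, "ip_address": "172.18.11.13", "ip_port": 7777},
]


def describe_status(status):
    """Map a negative kernel status (e.g. -107) to (errno name, message)."""
    code = abs(status)
    return errno.errorcode.get(code, "UNKNOWN"), os.strerror(code)


def nodes_to_dial(nodes, my_number):
    """Nodes that my_number connects to: all nodes with a lower node number."""
    lower = [n for n in nodes if n["number"] < my_number]
    return sorted(lower, key=lambda n: n["number"])


def can_connect(ip, port, timeout=10.0):
    """Return True if a TCP connection to ip:port succeeds within timeout."""
    try:
        with socket.create_connection((ip, port), timeout=timeout):
            return True
    except OSError:
        return False


def report(nodes, my_number):
    """Print a reachability line for every node that my_number must dial."""
    for peer in nodes_to_dial(nodes, my_number):
        ok = can_connect(peer["ip_address"], peer["ip_port"])
        state = "reachable" if ok else "NOT reachable"
        print("%s %s:%d %s" % (peer["name"], peer["ip_address"], peer["ip_port"], state))
```

On Linux, describe_status(-107) gives ENOTCONN, which matches the
"Transport endpoint is not connected" text printed by mount.ocfs2. Running
report(NODES, my_number=1) on node 1 while the volume is mounted on node 0
would show whether port 7777 on node 0 is reachable at all, for example
whether a firewall is in the way.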

Vaidya, Sachin wrote:
>
> Removed the VIPs from /etc/hosts and restarted the cluster, but nothing
> changed. I still cannot mount /dev/md0 on both nodes.
> Do I need to reboot the servers after changing /etc/hosts? Any other
> suggestions?
> Thanks,
>
> Sachin Vaidya
> Infrastructure Management Senior Analyst
> Affiliated Computer Services
>
>
>  -----Original Message-----
> From:   Sunil Mushran [mailto:Sunil.Mushran at oracle.com]
> Sent:   Thursday, March 30, 2006 5:34 PM
> To:     Vaidya, Sachin
> Cc:     'ocfs2-users at oss.oracle.com'
> Subject:        Re: [Ocfs2-users] Getting error when
> mounting shared OCFS2 device.
>
> Remove vip and mount on both. See if that helps.
>
> Vaidya, Sachin wrote:
> >
> > Hi,
> > I tried both the public and private IP addresses but still cannot mount
> > the device on both nodes.
> > Here are my configuration details.
> > hosts file : same on both nodes.
> >
> > 127.0.0.1       localhost.localdomain   localhost
> > 172.18.11.12    acspittdw001    acspittdw001.servicemetrics.net
> > 172.18.22.1     priv-acspittdw001
> > 172.18.11.24    vip-acspittdw001
> > 172.18.11.13    acspittdw002    acspittdw002.servicemetrics.net
> > 172.18.22.2     priv-acspittdw002
> > 172.18.11.25    vip-acspittdw002
> >
> > The cluster.conf is the same on both nodes:
> > node:
> >         ip_port = 7777
> >         ip_address = 172.18.11.12
> >         number = 0
> >         name = acspittdw001
> >         cluster = ocfs2
> >
> > node:
> >         ip_port = 7777
> >         ip_address = 172.18.11.13
> >         number = 1
> >         name = acspittdw002
> >         cluster = ocfs2
> >
> > cluster:
> >         node_count = 2
> >         name = ocfs2
> >
> > Both nodes can ping each other on the public and private IPs.
> > The mount command produces the following error on node 2 when the
> > device is already mounted on node 1.
> >
> > [root at acspittdw002 ~]#  mount -t ocfs2 /dev/md0 /crs1
> > mount.ocfs2: Transport endpoint is not connected while mounting
> > /dev/md0 on /crs1
> > [root at acspittdw002 ~]#
> >
> > dmesg shows the following messages:
> >
> > SELinux: initialized (dev debugfs, type debugfs), uses genfs_contexts
> > (5027,2):ocfs2_initialize_super:1354 max_slots for this device: 8
> > (5027,2):ocfs2_fill_local_node_info:1031 I am node 1
> > (4986,2):o2net_connect_expired:1446 ERROR: no connection established
> > with node 0 after 10 seconds, giving up and returning errors.
> >
> > (5027,2):dlm_request_join:771 ERROR: status = -107
> > (5027,2):dlm_try_to_join_domain:919 ERROR: status = -107
> > (5027,2):dlm_join_domain:1164 ERROR: status = -107
> > (5027,2):dlm_register_domain:1354 ERROR: status = -107
> > (5027,2):ocfs2_dlm_init:1996 ERROR: status = -107
> > (5027,2):ocfs2_mount_volume:1063 ERROR: status = -107
> > ocfs2: Unmounting device (9,0) on (node 1)
> > [root at acspittdw002 ~]#
> >
> > Any idea why this is happening?
> > I can provide more details if needed.
> > Any help will be greatly appreciated.
> > Thanks in advance.
> > - Sachin Vaidya.
> >
> >
> >
> > -----Original Message-----
> > From: Sunil Mushran
> > To: Vaidya, Sachin
> > Cc: 'ocfs2-users at oss.oracle.com'
> > Sent: 3/29/2006 7:16 PM
> > Subject: Re: [Ocfs2-users] Getting error when mounting
> > shared OCFS2 device.
> >
> > Connection failure. Check dmesg.
> >
> > Mount triggers the heartbeat thread, which triggers o2net
> > to make a connection to all heartbeating nodes. If this connection
> > fails, the mount fails. (The node with the larger node number
> > initiates the connection to the node with the lower node number.)
> >
> > An obvious cause would be an incorrect IP address specified in
> > cluster.conf. Error messages in /var/log/messages on both nodes will
> > provide more clues.
> >
> > Vaidya, Sachin wrote:
> > >
> > > Hi,
> > >
> > > I am using RHEL4 2.6.9-34.ELsmp with OCFS2 1.2.
> > >
> > > The h/w for this 2 node cluster is connected correctly.
> > >
> > > After loading ocfs2 on both nodes, the shared device could only be
> > > mounted on one node. When I try to mount the same shared device on
> > > the second node, I get the following error.
> > >
> > > mount.ocfs2: Transport endpoint is not connected while mounting
> > > /dev/md0 on /crs1
> > >
> > > Any idea why this is happening?
> > >
> > > Any help will be highly appreciated.
> > >
> > > Thanks,
> > >
> > > Sachin Vaidya
> > >
> > >
> > >
> > >
> > > ------------------------------------------------------------------------
> > >
> > > _______________________________________________
> > > Ocfs2-users mailing list
> > > Ocfs2-users at oss.oracle.com
> > > http://oss.oracle.com/mailman/listinfo/ocfs2-users
> > > 
> >
>


