[Ocfs2-users] Getting Error when mounting shared OCFS2 device.

Vaidya, Sachin Sachin.Vaidya at acs-inc.com
Thu Mar 30 17:27:43 CST 2006


Hi,
I tried both the public and private IP addresses, but I am still not able to
mount the device on both nodes.
Here are my configuration details.
hosts file (same on both nodes):

127.0.0.1       localhost.localdomain   localhost
172.18.11.12    acspittdw001    acspittdw001.servicemetrics.net
172.18.22.1     priv-acspittdw001
172.18.11.24    vip-acspittdw001
172.18.11.13    acspittdw002    acspittdw002.servicemetrics.net
172.18.22.2     priv-acspittdw002
172.18.11.25    vip-acspittdw002

The cluster.conf is the same on both nodes:
node:
        ip_port = 7777
        ip_address = 172.18.11.12
        number = 0
        name = acspittdw001
        cluster = ocfs2

node:
        ip_port = 7777
        ip_address = 172.18.11.13
        number = 1
        name = acspittdw002
        cluster = ocfs2

cluster:
        node_count = 2
        name = ocfs2
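
For illustration, a layout like the one above can be sanity-checked mechanically. The following is a minimal Python sketch, not part of the OCFS2 tools. The names parse_cluster_conf, check_conf and SAMPLE are hypothetical, and the parser assumes only the stanza layout shown in this message (a header such as "node:" or "cluster:" followed by indented "key = value" lines). It checks that node_count matches the number of node stanzas and that node numbers, names and addresses are unique.

```python
# Sample text copied from the cluster.conf shown in this message.
SAMPLE = """\
node:
        ip_port = 7777
        ip_address = 172.18.11.12
        number = 0
        name = acspittdw001
        cluster = ocfs2

node:
        ip_port = 7777
        ip_address = 172.18.11.13
        number = 1
        name = acspittdw002
        cluster = ocfs2

cluster:
        node_count = 2
        name = ocfs2
"""


def parse_cluster_conf(text):
    """Split the text into (stanza_kind, {key: value}) pairs."""
    stanzas = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.endswith(":") and "=" not in line:
            # A stanza header such as "node:" or "cluster:".
            current = (line[:-1], {})
            stanzas.append(current)
        elif "=" in line and current is not None:
            key, _, value = line.partition("=")
            current[1][key.strip()] = value.strip()
    return stanzas


def check_conf(text):
    """Return a list of obvious inconsistencies; an empty list means none found."""
    stanzas = parse_cluster_conf(text)
    nodes = [body for kind, body in stanzas if kind == "node"]
    clusters = [body for kind, body in stanzas if kind == "cluster"]
    problems = []
    for cluster in clusters:
        name = cluster.get("name")
        declared = int(cluster.get("node_count", "0"))
        found = sum(1 for n in nodes if n.get("cluster") == name)
        if declared != found:
            problems.append(
                f"cluster {name}: node_count={declared}, found {found} node stanzas"
            )
    for key in ("number", "ip_address", "name"):
        values = [n.get(key) for n in nodes]
        for value in sorted(set(values), key=str):
            if values.count(value) > 1:
                problems.append(f"duplicate node {key}: {value}")
    return problems
```

A check like this only catches internal inconsistencies. It cannot tell whether an address is actually reachable from the other node.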

Both nodes can ping each other on the public and private IPs.
The mount command produces the following error on node 2 when the device is
already mounted on node 1.

[root@acspittdw002 ~]#  mount -t ocfs2 /dev/md0 /crs1
mount.ocfs2: Transport endpoint is not connected while mounting /dev/md0 on /crs1
[root@acspittdw002 ~]#

dmesg shows the following messages:

SELinux: initialized (dev debugfs, type debugfs), uses genfs_contexts
(5027,2):ocfs2_initialize_super:1354 max_slots for this device: 8
(5027,2):ocfs2_fill_local_node_info:1031 I am node 1
(4986,2):o2net_connect_expired:1446 ERROR: no connection established with node 0 after 10 seconds, giving up and returning errors.
(5027,2):dlm_request_join:771 ERROR: status = -107
(5027,2):dlm_try_to_join_domain:919 ERROR: status = -107
(5027,2):dlm_join_domain:1164 ERROR: status = -107
(5027,2):dlm_register_domain:1354 ERROR: status = -107
(5027,2):ocfs2_dlm_init:1996 ERROR: status = -107
(5027,2):ocfs2_mount_volume:1063 ERROR: status = -107
ocfs2: Unmounting device (9,0) on (node 1)
[root@acspittdw002 ~]#
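
The "status = -107" in these messages is a negated Linux errno value. On Linux, errno 107 is ENOTCONN, whose text is the same "Transport endpoint is not connected" that mount.ocfs2 prints. A small Python sketch for decoding such a value follows. describe_status is a hypothetical helper name, and errno numbering is platform specific, so the lookup is only meaningful when run on the Linux host itself.

```python
import errno
import os


def describe_status(status):
    """Map a negated errno such as -107 to (symbolic name, message text)."""
    code = abs(status)
    # errno.errorcode maps numeric codes to names for the current platform.
    name = errno.errorcode.get(code, "UNKNOWN")
    return name, os.strerror(code)
```

Calling describe_status(-107) on a Linux machine should return the ENOTCONN name together with the message text seen in the mount error.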

Any idea why this is happening?
I can provide more details if needed.
Any help will be greatly appreciated.
Thanks in advance.
- Sachin Vaidya.



-----Original Message-----
From: Sunil Mushran
To: Vaidya, Sachin
Cc: 'ocfs2-users at oss.oracle.com'
Sent: 3/29/2006 7:16 PM
Subject: Re: [Ocfs2-users] Getting Error when mounting shared OCFS2 device.

Connection failure. Check dmesg.

Mount triggers the heartbeat thread, which triggers o2net
to make a connection to all heartbeating nodes. If this connection
fails, the mount fails. (The larger node number initiates the connection
to the lower node number.)
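
Since the higher-numbered node has to open the connection, one way to narrow this down is to test from that node whether a plain TCP connection to the lower-numbered node's ip_address and ip_port succeeds. Below is a minimal Python sketch, assuming o2net listens on the ip_port from cluster.conf and that a wrong address or packet filtering would show up as a refused or timed-out connection. can_connect is a hypothetical helper, not part of the OCFS2 tools.

```python
import socket


def can_connect(host, port, timeout=10.0):
    """Return True if a TCP connection to (host, port) opens within timeout seconds."""
    try:
        # create_connection resolves the host and raises OSError on failure
        # (refused, unreachable, or timed out).
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

With the addresses from this thread, the call on the second node would be can_connect("172.18.11.12", 7777), and it only means something while the volume is mounted on the first node. A False result would point at the address, the port, or filtering between the nodes rather than at the filesystem itself.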

An obvious error would be an incorrect IP address specified in cluster.conf.
Error messages in /var/log/messages on both nodes will
provide more clues.

Vaidya, Sachin wrote:
>
> Hi,
>
> I am using RHEL4 2.6.9-34.ELsmp with OCFS2 1.2.
>
> The h/w for this 2 node cluster is connected correctly.
>
> After loading ocfs2 on both nodes, the shared device could only be 
> mounted on one node. When I try to mount same shared device on second 
> node then I get following error.
>
> mount.ocfs2: Transport endpoint is not connected while mounting 
> /dev/md0 on /crs1
>
> Any idea why this is happening?
>
> Any help will be highly appreciated.
>
> Thanks,
>
> Sachin Vaidya
>