[Ocfs2-users] Problems mounting shared filesystem

Sunil Mushran sunil.mushran at oracle.com
Mon Mar 22 13:52:31 PDT 2010


The network connect is failing. Could be because of a firewall,
or bad ip address, some switch issue.

Mount the volume on node 2. Then enable tracing and
tail messages file.
# debugfs.ocfs2 -l TCP allow
# tail -f /var/log/messages

Then from node 4, ping node 2 using netcat.
# nc -z 192.168.1.2 7777

If it succeeds, then you should see:
Connection to 192.168.1.2 7777 port [tcp/cbt] succeeded!

Additionally, you will see a message on node 2 "attempt to connect
from node...".

If not, then look at your network setup.

Remember to disable tracing on node 2.
#debugfs.ocfs2 -l TCP off

Sunil

Chris Clonch wrote:
> We are testing clustering and I am having issues getting all of my 
> nodes to mount.  I have 4 nodes.  I am using iSCSI to share 1 target 
> with 2 luns.  All 4 nodes can are accessing the target; I can run 
> fdisk -l against the block devices.  Initially I had all 4 nodes 
> mounting the share but brought the cluster down to add an additional 
> NIC.  Presently nodes 2 and 3 can mount the shares, 1 and 4 can not.  
> Previously I had node 1 mounted and nodes 2, 3 and 4 could not.
>
> Any help is appreciated!
>
> Nodes 2 & 3:
>
> # service o2cb status
> Driver for "configfs": Loaded
> Filesystem "configfs": Mounted
> Driver for "ocfs2_dlmfs": Loaded
> Filesystem "ocfs2_dlmfs": Mounted
> Checking O2CB cluster ocfs2: Online
> Heartbeat dead threshold = 31
>   Network idle timeout: 30000
>   Network keepalive delay: 2000
>   Network reconnect delay: 2000
> Checking O2CB heartbeat: Active
>
>
> Nodes 1 & 4:
>
> # service o2cb status
> Driver for "configfs": Loaded
> Filesystem "configfs": Mounted
> Driver for "ocfs2_dlmfs": Loaded
> Filesystem "ocfs2_dlmfs": Mounted
> Checking O2CB cluster ocfs2: Online
> Heartbeat dead threshold = 31
>   Network idle timeout: 30000
>   Network keepalive delay: 2000
>   Network reconnect delay: 2000
> Checking O2CB heartbeat: Not active
>
>
> All nodes:
>
> # mounted.ocfs2 -d
> Device                FS     UUID                                  Label
> /dev/sda1             ocfs2  fea0a398-a696-414f-bd9f-d7aa84bd6b77  ocu01
> /dev/sdb1             ocfs2  26e82fa7-ec91-4a81-a965-571ed4223ab0  
> oracluster
>
> # mounted.ocfs2 -f
> Device                FS     Nodes
> /dev/sda1             ocfs2  ocnode2, ocnode3
> /dev/sdb1             ocfs2  ocnode2, ocnode3
>
>
> dmesg snippet from node 4:
>
> o2net: connected to node ocnode2 (num 2) at 192.168.1.2:7777 
> <http://192.168.1.2:7777>
> (4145,0):o2net_connect_expired:1664 ERROR: no connection established 
> with node 3 after 30.0 seconds, giving up and returning errors.
> (4176,0):dlm_request_join:1036 ERROR: status = -107
> (4176,0):dlm_try_to_join_domain:1210 ERROR: status = -107
> (4176,0):dlm_join_domain:1488 ERROR: status = -107
> (4176,0):dlm_register_domain:1754 ERROR: status = -107
> (4176,0):ocfs2_dlm_init:2723 ERROR: status = -107
> (4176,0):ocfs2_mount_volume:1437 ERROR: status = -107
> ocfs2: Unmounting device (8,17) on (node 4)
> o2net: no longer connected to node ocnode2 (num 2) at 192.168.1.2:7777 
> <http://192.168.1.2:7777>




More information about the Ocfs2-users mailing list