[Ocfs2-users] SuSe Hangs when /etc/init.d/o2cb online

Miguel Santos N. msantos at celnet.com.mx
Tue Apr 1 16:48:45 PDT 2008


I tried what you suggested, but it hangs, and strace does not even write 
its result to the output file.

However, I ran it without the -o switch, and here is the output:

==========
execve("/sbin/o2cb_ctl", ["o2cb_ctl", "-H", "-n", "ocfs2", "-t", "cluster", "-a", "online=yes"], [/* 61 vars */]) = 0
uname({sys="Linux", node="drwagner", ...}) = 0
brk(0)                                  = 0x6cf000
brk(0x6cff70)                           = 0x6cff70
arch_prctl(ARCH_SET_FS, 0x6cf8b0)       = 0
brk(0x6f0f70)                           = 0x6f0f70
brk(0x6f1000)                           = 0x6f1000
open("/sys/o2cb/interface_revision", O_RDONLY) = 3
read(3, "5\n", 15)                      = 2
read(3, "", 13)                         = 0
close(3)                                = 0
stat("/sys/kernel/config", {st_mode=S_IFDIR|0755, st_size=0, ...}) = 0
statfs("/sys/kernel/config", {f_type=0x62656570, f_bsize=4096, f_blocks=0, f_bfree=0, f_bavail=0, f_files=0, f_ffree=0, f_fsid={0, 0}, f_namelen=255, f_frsize=4096}) = 0
stat("/etc/ocfs2/cluster.conf", {st_mode=S_IFREG|0644, st_size=331, ...}) = 0
open("/etc/ocfs2/cluster.conf", O_RDONLY) = 3
read(3, "node:\n\tip_port = 7777\n\tip_addres"..., 4000) = 331
read(3, "", 4000)                       = 0
close(3)                                = 0
mkdir("/sys/kernel/config/cluster/ocfs2", 0755
=========
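
One possible reason the output file looked empty: with -ff, strace does not
write to the name given to -o itself but to one file per traced process,
named <prefix>.<pid>. A minimal sketch for finding those files, assuming the
/tmp/out prefix from the strace command quoted further down; the helper name
list_strace_outputs is made up for illustration, and a hard hang may still
leave the files empty or unwritten:

```shell
# Hypothetical helper: list the per-process files written by
# "strace -ff -o PREFIX". With -ff, strace names its output PREFIX.PID,
# so a file called exactly PREFIX may never exist.
list_strace_outputs() {
    prefix=$1
    for f in "$prefix".*; do
        # If the glob matched nothing, the loop sees the literal pattern; skip it.
        [ -e "$f" ] || continue
        printf '%s\n' "$f"
    done
}

# Example: list_strace_outputs /tmp/out
```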

As you said, the last call never completes and the computer hangs. I 
then tried running the same operation by hand:

mkdir -p -m 755 /sys/kernel/config/cluster/ocfs2

And guess what? It hangs too =P

So I guess the problem is not OCFS2 itself but something in kernel space.

What do you think?
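
To look at the mkdir without blocking the shell that starts it, something
like the following might help. It is only a sketch: probe_mkdir and its
10-second default are made up for illustration, it assumes a procps-style
ps, and it is of no use if the whole machine freezes rather than just the
one process.

```shell
# Hypothetical helper: run mkdir in the background and report whether it
# returns within a time limit, so the calling shell is not blocked by it.
probe_mkdir() {
    dir=$1
    limit=${2:-10}                  # seconds to wait (assumed default)
    mkdir -p -m 755 "$dir" &
    pid=$!
    waited=0
    while :; do
        # Process state as reported by ps; empty once the process is gone.
        state=$(ps -o stat= -p "$pid" 2>/dev/null | tr -d ' ')
        case $state in
            ''|Z*) break ;;         # exited (possibly not yet reaped)
        esac
        if [ "$waited" -ge "$limit" ]; then
            # "D" here means uninterruptible sleep inside the kernel.
            echo "stuck: pid $pid state $state"
            return 1
        fi
        sleep 1
        waited=$((waited + 1))
    done
    # Collect the exit status; print "ok" only if mkdir succeeded.
    wait "$pid" && echo "ok"
}

# Example: probe_mkdir /sys/kernel/config/cluster/ocfs2 10
```

If it reports a process stuck in state D, the kernel log (dmesg) would be
the next place to look.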

Sunil Mushran wrote:
> So that looks ok.
>
> Next step would be to do some strac-ing.
>
> Do:
> /etc/init.d/o2cb load
>
> then do:
> strace -ff -o /tmp/out o2cb_ctl -H -n ocfs2 -t cluster -a online=yes
>
> The last command is what "onlines" the cluster.
>
> /tmp/out should have some information.
>
> Miguel Santos N. wrote:
>> Thanks..
>>
>> I am attaching the file /etc/ocfs2/cluster.conf; it is the same file 
>> on all the servers.
>>
>>
>>
>> Sunil Mushran wrote:
>>> Check /etc/ocfs2/cluster.conf on the openSUSE cluster. Starting the
>>> cluster includes reading cluster.conf and populating /sys/kernel/config
>>> with the info. Make that file available as-is... as in, not a cut-paste.
>>>
>>> Miguel Santos wrote:
>>>> Hello,
>>>>
>>>> I have a DELL MD3000i and a couple of servers that I want to connect
>>>> to the array.
>>>>
>>>> I have set up the main server with CentOS 5.
>>>> - 2.6.18-53.el5 x86_64
>>>> - ocfs2-tools-1.2.7-1.el5
>>>> - ocfs2console-1.2.7-1.el5
>>>> - ocfs2-2.6.18-53.el5-1.2.8-2.el5 # Kernel Module
>>>>
>>>> And two openSUSE 10.3 servers
>>>> - 2.6.22.5-31-default x86_64
>>>> - ocfs2-tools-1.2.6-18
>>>> - ocfs2console-1.2.6-18
>>>> The module already ships with the kernel (thanks to Novell)
>>>>
>>>> I had no problems mounting the FS on the CentOS 5 server.
>>>> I can read/write in the FS without problems.
>>>>
>>>> ============
>>>> [root at arenamexico ~]# mounted.ocfs2 -d
>>>> Device                FS     UUID                                  Label
>>>> /dev/sdb1             ocfs2  cffd81fa-7395-433e-912a-d9c9d7235df3  Coliseo
>>>> [root at arenamexico ~]# mounted.ocfs2 -f
>>>> Device                FS     Nodes
>>>> /dev/sdb1             ocfs2  arenamexico
>>>> [root at arenamexico ~]# mount|grep ocfs2
>>>> ocfs2_dlmfs on /dlm type ocfs2_dlmfs (rw)
>>>> /dev/sdb1 on /mnt/projects type ocfs2 (rw,_netdev,noatime,heartbeat=local)
>>>> ============
>>>>
>>>> But when I try to start the o2cb service on the openSUSE servers, the
>>>> servers just hang.
>>>>
>>>> ============
>>>> drwagner:~ # /etc/init.d/o2cb status
>>>> Module "configfs": Loaded
>>>> Filesystem "configfs": Mounted
>>>> Module "ocfs2_nodemanager": Loaded
>>>> Module "ocfs2_dlm": Loaded
>>>> Module "ocfs2_dlmfs": Loaded
>>>> Filesystem "ocfs2_dlmfs": Mounted
>>>>
>>>> drwagner:~ # /etc/init.d/o2cb online ocfs2
>>>> Loading module "configfs": OK
>>>> Mounting configfs filesystem at /sys/kernel/config: OK
>>>> Loading module "ocfs2_nodemanager": OK
>>>> Loading module "ocfs2_dlm": OK
>>>> Loading module "ocfs2_dlmfs": OK
>>>> Mounting ocfs2_dlmfs filesystem at /dlm: OK
>>>> Starting O2CB cluster ocfs2:
>>>>
>>>> [And it hangs]
>>>> =============
>>>>
>>>> Does anyone have a clue what is happening?
>>>>
>>>> Thanks.
>>>>
>>>>
>>>
>>
>> ------------------------------------------------------------------------
>>
>> node:
>>     ip_port = 7777
>>     ip_address = 192.168.130.1
>>     number = 0
>>     name = arenamexico
>>     cluster = ocfs2
>>
>> node:
>>     ip_port = 7777
>>     ip_address = 192.168.130.2
>>     number = 1
>>     name = suse1
>>     cluster = ocfs2
>>
>> node:
>>     ip_port = 7777
>>     ip_address = 192.168.130.3
>>     number = 2
>>     name = drwagner
>>     cluster = ocfs2
>>
>> cluster:
>>     node_count = 3
>>     name = ocfs2
>>
>>   
>
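
The attached cluster.conf declares node_count = 3 and lists three node
stanzas. A quick consistency check along those lines could look like the
sketch below; check_node_count is a made-up helper name, and it assumes the
layout shown above, with each stanza header ("node:") starting in the first
column of the real file.

```shell
# Hypothetical helper: compare the declared node_count with the number of
# "node:" stanzas found in a cluster.conf-style file.
check_node_count() {
    awk '
        /^node:/                   { stanzas++ }
        /^[ \t]*node_count[ \t]*=/ { split($0, kv, "="); declared = kv[2] + 0 }
        END {
            printf "declared=%d stanzas=%d\n", declared, stanzas
            # Exit status 0 when the two numbers agree, 1 otherwise.
            exit (declared == stanzas ? 0 : 1)
        }
    ' "$1"
}

# Example: check_node_count /etc/ocfs2/cluster.conf
```

The exit status makes it easy to run the same check on every node and
compare the results.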



