[Ocfs2-users] SuSe Hangs when /etc/init.d/o2cb online
Miguel Santos N.
msantos at celnet.com.mx
Tue Apr 1 16:48:45 PDT 2008
I tried to do what you told me but it hangs and it doesn't even log to
the output file the result of strace.
However I ran it without -o switch and here is it:
==========
execve("/sbin/o2cb_ctl", ["o2cb_ctl", "-H", "-n", "ocfs2", "-t",
"cluster", "-a", "online=yes"], [/* 61 vars */]) = 0
uname({sys="Linux", node="drwagner", ...}) = 0
brk(0) = 0x6cf000
brk(0x6cff70) = 0x6cff70
arch_prctl(ARCH_SET_FS, 0x6cf8b0) = 0
brk(0x6f0f70) = 0x6f0f70
brk(0x6f1000) = 0x6f1000
open("/sys/o2cb/interface_revision", O_RDONLY) = 3
read(3, "5\n", 15) = 2
read(3, "", 13) = 0
close(3) = 0
stat("/sys/kernel/config", {st_mode=S_IFDIR|0755, st_size=0, ...}) = 0
statfs("/sys/kernel/config", {f_type=0x62656570, f_bsize=4096,
f_blocks=0, f_bfree=0, f_bavail=0, f_files=0, f_ffree=0, f_fsid={0, 0},
f_namelen=255, f_frsize=4096}) = 0
stat("/etc/ocfs2/cluster.conf", {st_mode=S_IFREG|0644, st_size=331,
...}) = 0
open("/etc/ocfs2/cluster.conf", O_RDONLY) = 3
read(3, "node:\n\tip_port = 7777\n\tip_addres"..., 4000) = 331
read(3, "", 4000) = 0
close(3) = 0
mkdir("/sys/kernel/config/cluster/ocfs2", 0755
=========
As you told me, the last command is not executed at all and the computer
hangs. I tried to execute:
mkdir -p -m 755 /sys/kernel/config/cluster/ocfs2
And guess what? it hangs too =P
So I guess OCFS2 is not the problem but something with the kernel space.
What do you think?
Sunil Mushran escribió:
> So that looks ok.
>
> Next step would be to do some strac-ing.
>
> Do:
> /etc/init.d/o2cb load
>
> then do:
> strace -ff -o /tmp/out o2cb_ctl -H -n ocfs2 -t cluster -a online=yes
>
> The last command is what "onlines" the cluster.
>
> /tmp/out should have some information.
>
> Miguel Santos N. wrote:
>> Thanks..
>>
>> I am attaching the file /etc/ocfs/cluster.conf, it is the same file
>> on all the servers.
>>
>>
>>
>> Sunil Mushran escribió:
>>> Check /etc/ocfs2/cluster.conf on the opensuse cluster. Starting cluster
>>> includes reading cluster.conf and populating /sys/kernel/config with
>>> the info. Make that file available as-is... as in, not a cut-paste.
>>>
>>> Miguel Santos wrote:
>>>> Hello,
>>>>
>>>> I have a DELL MD3000i, and a couple of servers that I want to
>>>> connect to
>>>> the array.
>>>>
>>>> I have setup the main server with CentOS 5.
>>>> - 2.6.18-53.el5 x86_64
>>>> - ocfs2-tools-1.2.7-1.el5
>>>> - ocfs2console-1.2.7-1.el5
>>>> - ocfs2-2.6.18-53.el5-1.2.8-2.el5 # Kernel Module
>>>>
>>>> And two OpenSuSe 10.3 Servers
>>>> - 2.6.22.5-31-default x86_64
>>>> - ocfs2-tools-1.2.6-18
>>>> - ocfs2console-1.2.6-18
>>>> The module has been already shipped with the kernel (thanks to novell)
>>>>
>>>> I had no problems mounting the FS on the CentOS 5 server.
>>>> I can read/write in the FS without problems.
>>>>
>>>> ============
>>>> [root at arenamexico ~]# mounted.ocfs2 -d
>>>> Device FS UUID
>>>> Label
>>>> /dev/sdb1 ocfs2 cffd81fa-7395-433e-912a-d9c9d7235df3
>>>> Coliseo
>>>> [root at arenamexico ~]# mounted.ocfs2 -f
>>>> Device FS Nodes
>>>> /dev/sdb1 ocfs2 arenamexico
>>>> [root at arenamexico ~]# mount|grep ocfs2
>>>> ocfs2_dlmfs on /dlm type ocfs2_dlmfs (rw)
>>>> /dev/sdb1 on /mnt/projects type ocfs2
>>>> (rw,_netdev,noatime,heartbeat=local)
>>>> ============
>>>>
>>>> But when I try to start the o2cb service on the Opensuse servers, the
>>>> servers just hang.
>>>>
>>>> ============
>>>> drwagner:~ # /etc/init.d/o2cb status
>>>> Module "configfs": Loaded
>>>> Filesystem "configfs": Mounted
>>>> Module "ocfs2_nodemanager": Loaded
>>>> Module "ocfs2_dlm": Loaded
>>>> Module "ocfs2_dlmfs": Loaded
>>>> Filesystem "ocfs2_dlmfs": Mounted
>>>>
>>>> drwagner:~ # /etc/init.d/o2cb online ocfs2
>>>> Loading module "configfs": OK
>>>> Mounting configfs filesystem at /sys/kernel/config: OK
>>>> Loading module "ocfs2_nodemanager": OK
>>>> Loading module "ocfs2_dlm": OK
>>>> Loading module "ocfs2_dlmfs": OK
>>>> Mounting ocfs2_dlmfs filesystem at /dlm: OK
>>>> Starting O2CB cluster ocfs2:
>>>>
>>>> [And it hangs]
>>>> =============
>>>>
>>>> Does anyone has a clue of what is happening?
>>>>
>>>> Thanks.
>>>>
>>>>
>>>> _______________________________________________
>>>> Ocfs2-users mailing list
>>>> Ocfs2-users at oss.oracle.com
>>>> http://oss.oracle.com/mailman/listinfo/ocfs2-users
>>>>
>>>
>>
>> ------------------------------------------------------------------------
>>
>> node:
>> ip_port = 7777
>> ip_address = 192.168.130.1
>> number = 0
>> name = arenamexico
>> cluster = ocfs2
>>
>> node:
>> ip_port = 7777
>> ip_address = 192.168.130.2
>> number = 1
>> name = suse1
>> cluster = ocfs2
>>
>> node:
>> ip_port = 7777
>> ip_address = 192.168.130.3
>> number = 2
>> name = drwagner
>> cluster = ocfs2
>>
>> cluster:
>> node_count = 3
>> name = ocfs2
>>
>>
>
More information about the Ocfs2-users
mailing list