[Ocfs2-users] SuSe Hangs when /etc/init.d/o2cb online

Yuri Kirsanov flash at preferance.ru
Tue Apr 1 17:12:18 PDT 2008


I would suggest to upgrade kernel to more recent version or to compile 
default clean kernel by yourself.
2.6.18.53 is pretty old already, as we have 2.6.24.xx. I use 2.6.23.13 and 
have no problems. If your computer just hangs - it's kernel or hardware 
problem, in any other case you would see kernel Oops page.
By the way, you have different versions of ocfs2 tools and kernel module.

----- Original Message ----- 
From: "Sunil Mushran" <Sunil.Mushran at oracle.com>
To: "Miguel Santos N." <msantos at celnet.com.mx>
Cc: <ocfs2-users at oss.oracle.com>; "Joel Becker" <Joel.Becker at oracle.com>
Sent: Wednesday, April 02, 2008 4:05 AM
Subject: Re: [Ocfs2-users] SuSe Hangs when /etc/init.d/o2cb online


Do a mount and you should see configfs mounted:
configfs on /sys/kernel/config type configfs (rw)

If so, the hang is puzzling. Email opensuse help list if anyone
else has encountered the same.

Joel, Any ideas?


Miguel Santos N. wrote:
> I tried to do what you told me but it hangs and it doesn't even log to
> the output file the result of strace.
>
> However I ran it without -o switch and here is it:
>
> ==========
> execve("/sbin/o2cb_ctl", ["o2cb_ctl", "-H", "-n", "ocfs2", "-t",
> "cluster", "-a", "online=yes"], [/* 61 vars */]) = 0
> uname({sys="Linux", node="drwagner", ...}) = 0
> brk(0)                                  = 0x6cf000
> brk(0x6cff70)                           = 0x6cff70
> arch_prctl(ARCH_SET_FS, 0x6cf8b0)       = 0
> brk(0x6f0f70)                           = 0x6f0f70
> brk(0x6f1000)                           = 0x6f1000
> open("/sys/o2cb/interface_revision", O_RDONLY) = 3
> read(3, "5\n", 15)                      = 2
> read(3, "", 13)                         = 0
> close(3)                                = 0
> stat("/sys/kernel/config", {st_mode=S_IFDIR|0755, st_size=0, ...}) = 0
> statfs("/sys/kernel/config", {f_type=0x62656570, f_bsize=4096,
> f_blocks=0, f_bfree=0, f_bavail=0, f_files=0, f_ffree=0, f_fsid={0,
> 0}, f_namelen=255, f_frsize=4096}) = 0
> stat("/etc/ocfs2/cluster.conf", {st_mode=S_IFREG|0644, st_size=331,
> ...}) = 0
> open("/etc/ocfs2/cluster.conf", O_RDONLY) = 3
> read(3, "node:\n\tip_port = 7777\n\tip_addres"..., 4000) = 331
> read(3, "", 4000)                       = 0
> close(3)                                = 0
> mkdir("/sys/kernel/config/cluster/ocfs2", 0755
> =========
>
> As you told me, the last command is not executed at all and the
> computer hangs. I tried to execute:
>
> mkdir -p -m 755 /sys/kernel/config/cluster/ocfs2
>
> And guess what? it hangs too =P
>
> So I guess OCFS2 is not the problem but something with the kernel space.
>
> What do you think?
>
> Sunil Mushran escribi?:
>> So that looks ok.
>>
>> Next step would be to do some strac-ing.
>>
>> Do:
>> /etc/init.d/o2cb load
>>
>> then do:
>> strace -ff -o /tmp/out o2cb_ctl -H -n ocfs2 -t cluster -a online=yes
>>
>> The last command is what "onlines" the cluster.
>>
>> /tmp/out should have some information.
>>
>> Miguel Santos N. wrote:
>>> Thanks..
>>>
>>> I am attaching the file /etc/ocfs/cluster.conf, it is the same file
>>> on all the servers.
>>>
>>>
>>>
>>> Sunil Mushran escribi?:
>>>> Check /etc/ocfs2/cluster.conf on the opensuse cluster. Starting
>>>> cluster
>>>> includes reading cluster.conf and populating /sys/kernel/config with
>>>> the info. Make that file available as-is... as in, not a cut-paste.
>>>>
>>>> Miguel Santos wrote:
>>>>> Hello,
>>>>>
>>>>> I have a DELL MD3000i, and a couple of servers that I want to
>>>>> connect to
>>>>> the array.
>>>>>
>>>>> I have setup the main server with CentOS 5.
>>>>> - 2.6.18-53.el5 x86_64
>>>>> - ocfs2-tools-1.2.7-1.el5
>>>>> - ocfs2console-1.2.7-1.el5
>>>>> - ocfs2-2.6.18-53.el5-1.2.8-2.el5 # Kernel Module
>>>>>
>>>>> And two OpenSuSe 10.3 Servers
>>>>> - 2.6.22.5-31-default x86_64
>>>>> - ocfs2-tools-1.2.6-18
>>>>> - ocfs2console-1.2.6-18
>>>>> The module has been already shipped with the kernel (thanks to
>>>>> novell)
>>>>>
>>>>> I had no problems mounting the FS on the CentOS 5 server.
>>>>> I can read/write in the FS without problems.
>>>>>
>>>>> ============
>>>>> [root at arenamexico ~]# mounted.ocfs2 -d
>>>>> Device                FS     UUID
>>>>> Label
>>>>> /dev/sdb1             ocfs2  cffd81fa-7395-433e-912a-d9c9d7235df3
>>>>> Coliseo
>>>>> [root at arenamexico ~]# mounted.ocfs2 -f
>>>>> Device                FS     Nodes
>>>>> /dev/sdb1             ocfs2  arenamexico
>>>>> [root at arenamexico ~]# mount|grep ocfs2
>>>>> ocfs2_dlmfs on /dlm type ocfs2_dlmfs (rw)
>>>>> /dev/sdb1 on /mnt/projects type ocfs2
>>>>> (rw,_netdev,noatime,heartbeat=local)
>>>>> ============
>>>>>
>>>>> But when I try to start the o2cb service on the Opensuse servers, the
>>>>> servers just hang.
>>>>>
>>>>> ============
>>>>> drwagner:~ # /etc/init.d/o2cb status
>>>>> Module "configfs": Loaded
>>>>> Filesystem "configfs": Mounted
>>>>> Module "ocfs2_nodemanager": Loaded
>>>>> Module "ocfs2_dlm": Loaded
>>>>> Module "ocfs2_dlmfs": Loaded
>>>>> Filesystem "ocfs2_dlmfs": Mounted
>>>>>
>>>>> drwagner:~ # /etc/init.d/o2cb online ocfs2
>>>>> Loading module "configfs": OK
>>>>> Mounting configfs filesystem at /sys/kernel/config: OK
>>>>> Loading module "ocfs2_nodemanager": OK
>>>>> Loading module "ocfs2_dlm": OK
>>>>> Loading module "ocfs2_dlmfs": OK
>>>>> Mounting ocfs2_dlmfs filesystem at /dlm: OK
>>>>> Starting O2CB cluster ocfs2:
>>>>>
>>>>> [And it hangs]
>>>>> =============
>>>>>
>>>>> Does anyone has a clue of what is happening?
>>>>>
>>>>> Thanks.
>>>>>
>>>>>
>>>>> _______________________________________________
>>>>> Ocfs2-users mailing list
>>>>> Ocfs2-users at oss.oracle.com
>>>>> http://oss.oracle.com/mailman/listinfo/ocfs2-users
>>>>>
>>>>
>>>
>>> ------------------------------------------------------------------------ 
>>>
>>>
>>> node:
>>>     ip_port = 7777
>>>     ip_address = 192.168.130.1
>>>     number = 0
>>>     name = arenamexico
>>>     cluster = ocfs2
>>>
>>> node:
>>>     ip_port = 7777
>>>     ip_address = 192.168.130.2
>>>     number = 1
>>>     name = suse1
>>>     cluster = ocfs2
>>>
>>> node:
>>>     ip_port = 7777
>>>     ip_address = 192.168.130.3
>>>     number = 2
>>>     name = drwagner
>>>     cluster = ocfs2
>>>
>>> cluster:
>>>     node_count = 3
>>>     name = ocfs2
>>>
>>>
>>
>


_______________________________________________
Ocfs2-users mailing list
Ocfs2-users at oss.oracle.com
http://oss.oracle.com/mailman/listinfo/ocfs2-users




More information about the Ocfs2-users mailing list