[Ocfs2-users] crash during big file transfers

Sunil Mushran Sunil.Mushran at oracle.com
Mon Jun 23 11:51:14 PDT 2008


OCFS2 1.2.9 is only meant for the SLES9/10 and RHEL4/5 kernels. It will not
build on other kernels. Instead, use the fs that is bundled with the kernel.

Not only will it work without requiring any src changes, it also
has newer features.
http://oss.oracle.com/projects/ocfs2/dist/documentation/ocfs2-new-features.html
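
To check whether a given kernel already ships the bundled fs, something like
the sketch below should do. It assumes you have that kernel's config file at
hand; the helper name ocfs2_in_config and the example path further down are
illustrative, not part of any ocfs2 tool.

```shell
# Report whether a kernel config file enables the in-tree OCFS2 driver.
# Usage: ocfs2_in_config /path/to/kernel/config
# (ocfs2_in_config is a hypothetical helper name used for illustration.)
ocfs2_in_config() {
    config_file=$1
    # CONFIG_OCFS2_FS=y means built into the kernel, =m means built as a module.
    # grep prints the matching line and returns non-zero when there is none.
    grep -E '^CONFIG_OCFS2_FS=(y|m)$' "$config_file"
}
```

For example, "ocfs2_in_config /boot/config" (the config path varies by
distribution) prints CONFIG_OCFS2_FS=m when ocfs2 is built as a module.
Running "modinfo ocfs2" then shows details of the module that modprobe
would load.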

While we try to push bug fixes for older kernels to the stable team,
a full list of relevant patches is maintained here.
http://www.kernel.org/pub/linux/kernel/people/mfasheh/ocfs2/backports/

Carlos Xavier wrote:
> Thank you very much, Sunil, for the tip on the timeouts.
>
> Before updating them, I decided to try changing the I/O scheduler from 
> CFQ to DEADLINE, and the results were amazing. After the change I could run 
> many parallel file copies with no crash.
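
For reference, the scheduler switch described above can be made at runtime
through sysfs. A minimal sketch, assuming the usual
/sys/block/<dev>/queue/scheduler file; the function names and the sda device
in the usage line are placeholders:

```shell
# Print the active I/O scheduler from a sysfs scheduler file.
# The kernel lists all available schedulers and marks the active one
# in [brackets], e.g. "noop anticipatory deadline [cfq]".
active_scheduler() {
    sed -n 's/.*\[\([^]]*\)].*/\1/p' "$1"
}

# Select a scheduler by writing its name to the same file.
# On a real system this needs root.
set_scheduler() {
    echo "$2" > "$1"
}
```

Usage would be along the lines of
"set_scheduler /sys/block/sda/queue/scheduler deadline", run as root, followed
by active_scheduler on the same file to confirm. The setting does not survive
a reboot; booting with elevator=deadline is one way to make it the default.
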
>
> As I'm planning to make OCFS2 packages available for the Slackware 
> distribution, I'm asking for help compiling the kernel module, so that we 
> can have an updated one for kernels 2.6.21.5 and 2.6.24.5, distributed with 
> Slackware 12.0 and 12.1.
>
> Thanks,
> Carlos Xavier.
>
> ----- Original Message ----- 
> From: "Sunil Mushran" <Sunil.Mushran at oracle.com>
> To: "Carlos Xavier" <cbastos at connection.com.br>
> Cc: "ocfs2" <ocfs2-users at oss.oracle.com>
> Sent: Monday, June 23, 2008 12:10 PM
> Subject: Re: [Ocfs2-users] crash during big file transfers
>
>
>   
>> The fs in 2.6.21 still uses the old very short cluster timeouts.
>>
>> In mainline, the defaults were updated in 2.6.25-ish.
>>
>> The faq has the details on setting them manually.
>> http://oss.oracle.com/projects/ocfs2/dist/documentation/ocfs2_faq.html#TIMEOUT
>>
>> Carlos Xavier wrote:
>>     
>>> Dear Sirs,
>>>  I have been running a system on the OCFS2 filesystem since 12/2007.
>>> Lately we started to experience node crashes, and the reported cause was 
>>> OCFS2, but nothing was written to the system log.
>>> So we decided to move the system to another pair of machines. I did a 
>>> fresh install of Slackware 12.0 and installed the OCFS2 1.2.7 tools.
>>>  When I started to copy the files from the old system to the new one, 
>>> after some time of file transfer I got this error:
>>>  root at diana:~# Message from syslogd at diana at Sat Jun 21 03:39:00 2008 ...
>>> diana kernel: Kernel panic - not syncing: ocfs2 is very sorry to be 
>>> fencing this system by panicing
>>>  After this, the only way to get the computer back is to power it off 
>>> and on again.
>>>  I took a look at the FAQ and saw this issue reported in the Heartbeat 
>>> section, Question 07, but I noticed it was related to an old kernel 
>>> version. Since we are running kernel version 2.6.21.5, I would like to 
>>> know if this solution still applies.
>>>  Some items in the FAQ are not up to date. How can I find out which module 
>>> version is in the kernel I'm running?
>>>  What is the relation between the kernel version and the kernel module?
>>>  Is there a way to generate a patch to apply to the kernel I'm running and 
>>> compile it?
>>>  I would like to try the latest version of the kernel module. I 
>>> downloaded it and tried to compile it, but it was not successful.
>>> This is how I configured it:
>>> apolo:# 
>>> ./configure --prefix=/usr --sysconfdir=/etc --build=i486-slackware-linux  
>>> --host=i486-slackware-linux --target=i486-slackware-linux --with-kernel-source=/usr/src/linux
>>>  When I tried to compile it I got the following error:
>>> apolo# make
>>> make -C fs
>>> make[1]: Entering directory `/usr/local/src/ocfs2-1.2.9/fs'
>>> make -C ocfs2 stamp-md5
>>> make[2]: Entering directory `/usr/local/src/ocfs2-1.2.9/fs/ocfs2'
>>> make[2]: `stamp-md5' is up to date.
>>> make[2]: Leaving directory `/usr/local/src/ocfs2-1.2.9/fs/ocfs2'
>>> make -C ocfs2/cluster stamp-md5
>>> make[2]: Entering directory `/usr/local/src/ocfs2-1.2.9/fs/ocfs2/cluster'
>>> make[2]: `stamp-md5' is up to date.
>>> make[2]: Leaving directory `/usr/local/src/ocfs2-1.2.9/fs/ocfs2/cluster'
>>> make -C ocfs2/dlm stamp-md5
>>> make[2]: Entering directory `/usr/local/src/ocfs2-1.2.9/fs/ocfs2/dlm'
>>> make[2]: `stamp-md5' is up to date.
>>> make[2]: Leaving directory `/usr/local/src/ocfs2-1.2.9/fs/ocfs2/dlm'
>>> make -C /lib/modules/2.6.21.5-smp/build  M=/usr/local/src/ocfs2-1.2.9/fs 
>>> modules
>>> make[2]: Entering directory `/usr/src/linux-2.6.21.5'
>>>  WARNING: Symbol version dump /usr/src/linux-2.6.21.5/Module.symvers
>>>            is missing; modules will have no dependencies and modversions.
>>>  CC [M]  /usr/local/src/ocfs2-1.2.9/fs/ocfs2/alloc.o
>>> In file included from /usr/local/src/ocfs2-1.2.9/fs/ocfs2/alloc.c:39:
>>> /usr/local/src/ocfs2-1.2.9/fs/ocfs2/inode.h:117: warning: 'kmem_cache_t' 
>>> is deprecated
>>> In file included from /usr/local/src/ocfs2-1.2.9/fs/ocfs2/alloc.c:40:
>>> /usr/local/src/ocfs2-1.2.9/fs/ocfs2/journal.h:136: warning: 
>>> 'kmem_cache_t' is deprecated
>>> /usr/local/src/ocfs2-1.2.9/fs/ocfs2/alloc.c: In function 
>>> 'ocfs2_schedule_truncate_log_flush':
>>> /usr/local/src/ocfs2-1.2.9/fs/ocfs2/alloc.c:1230: warning: passing 
>>> argument 1 of 'cancel_delayed_work' from incompatible pointer type
>>> /usr/local/src/ocfs2-1.2.9/fs/ocfs2/alloc.c:1233: warning: passing 
>>> argument 2 of 'queue_delayed_work' from incompatible pointer type
>>> /usr/local/src/ocfs2-1.2.9/fs/ocfs2/alloc.c: In function 
>>> 'ocfs2_truncate_log_shutdown':
>>> /usr/local/src/ocfs2-1.2.9/fs/ocfs2/alloc.c:1412: warning: passing 
>>> argument 1 of 'cancel_delayed_work' from incompatible pointer type
>>> /usr/local/src/ocfs2-1.2.9/fs/ocfs2/alloc.c:1444:69: error: macro 
>>> "INIT_WORK" passed 3 arguments, but takes just 2
>>> /usr/local/src/ocfs2-1.2.9/fs/ocfs2/alloc.c: In function 
>>> 'ocfs2_truncate_log_init':
>>> /usr/local/src/ocfs2-1.2.9/fs/ocfs2/alloc.c:1444: error: 'INIT_WORK' 
>>> undeclared (first use in this function)
>>> /usr/local/src/ocfs2-1.2.9/fs/ocfs2/alloc.c:1444: error: (Each undeclared 
>>> identifier is reported only once
>>> /usr/local/src/ocfs2-1.2.9/fs/ocfs2/alloc.c:1444: error: for each 
>>> function it appears in.)
>>> make[4]: *** [/usr/local/src/ocfs2-1.2.9/fs/ocfs2/alloc.o] Error 1
>>> make[3]: *** [/usr/local/src/ocfs2-1.2.9/fs/ocfs2] Error 2
>>> make[2]: *** [_module_/usr/local/src/ocfs2-1.2.9/fs] Error 2
>>> make[2]: Leaving directory `/usr/src/linux-2.6.21.5'
>>> make[1]: *** [build-modules] Error 2
>>> make[1]: Leaving directory `/usr/local/src/ocfs2-1.2.9/fs'
>>> make: *** [fs] Error 2
>>>  Thanks in advance for your attention. I hope you can help me get past 
>>> this issue.
>>>  Regards,
>>> Carlos Xavier.
>>> ------------------------------------------------------------------------
>>>
>>> _______________________________________________
>>> Ocfs2-users mailing list
>>> Ocfs2-users at oss.oracle.com
>>> http://oss.oracle.com/mailman/listinfo/ocfs2-users
>>>       



