[Ocfs2-users] ocfs2 - Kernel panic on many write/read from both

Srinivas Eeda srinivas.eeda at oracle.com
Tue Dec 20 10:58:40 PST 2011


Marek Królikowski wrote:
> Sorry i don`t copy everything:
> TEST-MAIL1# echo "ls //orphan_dir:0000"|debugfs.ocfs2 /dev/dm-0|wc
> debugfs.ocfs2 1.6.4
> 5239722 26198604 246266859
^^^^^ those numbers (5239722, 6074335) are the problem. What they are 
telling is the orphan directory is filled with flood of files. This is 
because of the change of unlink behavior introduced by patch 
"ea455f8ab68338ba69f5d3362b342c115bea8e13".

If you are interested in details, ... in normal unlink case an entry for 
the deleting file is created in orphan directory as an intermediate step 
and the entry is cleared towards the end of the unlink process. But 
because of that patch, entry doesn't get cleared and sticks around.

OCFS2 has a function called orphan scan which is executed as part of a 
thread which gets a ex lock on orphan scan lock and it then scans to 
clear all entries but it can't because the open lock is still around. 
Since this can takes longer because of the huge number of entries 
getting created, *new deletes will get delayed* as they need the ex lock.

So what can be done? for now if you are not using quotas feature you 
should get a new kernel by backing out the following patches

5fd131893793567c361ae64cbeb28a2a753bbe35
f7b1aa69be138ad9d7d3f31fa56f4c9407f56b6a
ea455f8ab68338ba69f5d3362b342c115bea8e13

or periodically umount the file system on all nodes and remount whenever 
the problem becomes severe.

Thanks,
--Srini

> TEST-MAIL1# echo "ls //orphan_dir:0001"|debugfs.ocfs2 /dev/dm-0|wc
> debugfs.ocfs2 1.6.4
> 6074335 30371669 285493670
>  
> TEST-MAIL2 ~ # echo "ls //orphan_dir:0000"|debugfs.ocfs2 /dev/dm-0|wc
> debugfs.ocfs2 1.6.4
> 5239722 26198604 246266859
> TEST-MAIL2 ~ # echo "ls //orphan_dir:0001"|debugfs.ocfs2 /dev/dm-0|wc
> debugfs.ocfs2 1.6.4
> 6074335 30371669 285493670
>  
> Thanks for Your help.
>  
>  
> *From:* Marek Królikowski <mailto:admin at wset.edu.pl>
> *Sent:* Tuesday, December 20, 2011 6:39 PM
> *To:* ocfs2-users at oss.oracle.com <mailto:ocfs2-users at oss.oracle.com>
> *Subject:* Re: [Ocfs2-users] ocfs2 - Kernel panic on many write/read 
> from both
>  
> > I think you are running into a known issue. Are there lot of orphan
> > files in orphan directory? I am not sure if the problem is still there,
> > if not please run the same test and once you see the same symptoms,
> > please run the following and provide me the output
> >
> > echo "ls //orphan_dir:0000"|debugfs.ocfs2 <device>|wc
> > echo "ls //orphan_dir:0001"|debugfs.ocfs2 <device>|wc
> Hello
> Thank You for answer - strange i don`t get email with Your answer.
> This is what You want:
> TEST-MAIL1# echo "ls //orphan_dir:0000"|debugfs.ocfs2 /dev/dm-0|wc
> debugfs.ocfs2 1.6.4
> 5239722 26198604 246266859
>  
>  
> TEST-MAIL2 ~ # echo "ls //orphan_dir:0000"|debugfs.ocfs2 /dev/dm-0|wc
> debugfs.ocfs2 1.6.4
> 5239722 26198604 246266859
>  
>  
> This is my testing cluster so if u need do more tests please tell me i 
> do for You.
> Thanks
> ------------------------------------------------------------------------
>
> _______________________________________________
> Ocfs2-users mailing list
> Ocfs2-users at oss.oracle.com
> http://oss.oracle.com/mailman/listinfo/ocfs2-users




More information about the Ocfs2-users mailing list