[Ocfs2-users] Weird lock

Sunil Mushran Sunil.Mushran at oracle.com
Fri Apr 4 10:40:42 PDT 2008


Don't send the raw locking_state info. Instead send the human readable
as output-ed by debugfs.ocfs2.

$ debugfs.ocfs2 -R "fs_locks" /dev/sdX >/tmp/out

Nuno Fernandes wrote:
> Hi,
>
> We are having a problem with apache+perl being hang.
>
>  1626 ?        Ss     0:00 sendmail: rejecting connections on daemon MTA: load 
> average: 152
>  1634 ?        Ss     0:00 sendmail: Queue runner at 01:00:00 
> for /var/spool/clientmqueue
>  1741 ?        Ss     0:00 /usr/sbin/httpd
>  1744 ?        S      0:00  
> \_ /usr/local/sbin/cronolog /site/logssite/access_log.%Y%m%d
> 21377 ?        S      0:00  \_ /usr/sbin/httpd
> 23942 ?        D      0:00  |   
> \_ /usr/bin/perl -w /storage/webhosting/site/public_html/alojados/site/MT/come2x.cgi
> 21518 ?        S      0:00  \_ /usr/sbin/httpd
> 23987 ?        D      0:00  |   
> \_ /usr/bin/perl -w /storage/webhosting/site/public_html/alojados/site/MT/come2x.cgi
> 21552 ?        S      0:00  \_ /usr/sbin/httpd
> 23873 ?        D      0:00  |   
> \_ /usr/bin/perl -w /storage/webhosting/site/public_html/alojados/site/MT/come2x.cgi
> 21563 ?        S      0:00  \_ /usr/sbin/httpd
> 23948 ?        D      0:00  |   
> \_ /usr/bin/perl -w /storage/webhosting/site/public_html/alojados/site/MT/come2x.cgi
> 21590 ?        S      0:00  \_ /usr/sbin/httpd
> 23866 ?        R     39:21  |   
> \_ /usr/bin/perl -w /storage/webhosting/site/public_html/alojados/site/MT/come2x.cgi
> 21596 ?        S      0:00  \_ /usr/sbin/httpd
> 23929 ?        D      0:00  |   
> \_ /usr/bin/perl -w /storage/webhosting/site/public_html/alojados/site/MT/come2x.cgi
>
> Process 23866  keeps on running and all the others freeze. Strace also blocks.
> Attached i'm sending locking_state data. Dmesg:
>
> -----
> OCFS2 Node Manager 1.2.5 Tue Apr 10 12:29:33 EDT 2007 (build 
> 9e5f332181e8ebfad464946bcc4888af)
> OCFS2 DLM 1.2.5 Tue Apr 10 12:29:33 EDT 2007 (build 
> e2556a71429f31033b275dff4b5594aa)
> OCFS2 DLMFS 1.2.5 Tue Apr 10 12:29:33 EDT 2007 (build 
> e2556a71429f31033b275dff4b5594aa)
> OCFS2 User DLM kernel interface loaded
> o2net: accepted connection from node ws3 (num 19) at 172.16.42.3:7777
> o2net: connected to node ws1 (num 0) at 172.16.42.1:7777
> o2net: connected to node ws2 (num 1) at 172.16.42.2:7777
> OCFS2 1.2.5 Tue Apr 10 12:29:28 EDT 2007 (build 
> 0f745576f5282c9408787369d99ba880)
> ocfs2_dlm: Nodes in domain ("C1B50B9082BC4B74A13FF6F34D35B68B"): 0 1 12 19
> kjournald starting.  Commit interval 5 seconds
> ocfs2: Mounting device (3,3) on (node 12, slot 2)
> -----
>
> These lockouts keep on happening from time to time (about 3 times a week). 
> Today it happended 2 times already.
>
> Thanks for any info
> Nuno Fernandes
>   
> ------------------------------------------------------------------------
>
> _______________________________________________
> Ocfs2-users mailing list
> Ocfs2-users at oss.oracle.com
> http://oss.oracle.com/mailman/listinfo/ocfs2-users




More information about the Ocfs2-users mailing list