To add further information:<br><br>1) Note A:<br># cat /sys/kernel/debug/o2dlm/6D419D86AE8A4DB1940788EDDA27027B/dlm_state<br>Domain: 6D419D86AE8A4DB1940788EDDA27027B Key: 0xc955c1d5<br>Thread Pid: 3869 Node: 1 State: JOINED<br>
Number of Joins: 1 Joining Node: 255<br>Domain Map: 1 2<br>Live Map: 1 2<br>Lock Resources: 70731 (442210)<br>MLEs: 0 (1048380)<br> Blocking: 0 (647669)<br> Mastery: 0 (400711)<br> Migration: 0 (0)<br>Lists: Dirty=Empty Purge=Empty PendingASTs=Empty PendingBASTs=Empty<br>
Purge Count: 0 Refs: 70732<br>Dead Node: 255<br>Recovery Pid: 3870 Master: 255 State: INACTIVE<br>Recovery Map:<br>Recovery Node State:<br><br>Node B:<br># cat /sys/kernel/debug/o2dlm/6D419D86AE8A4DB1940788EDDA27027B/dlm_state<br>
Domain: 6D419D86AE8A4DB1940788EDDA27027B Key: 0xc955c1d5<br>Thread Pid: 3757 Node: 2 State: JOINED<br>Number of Joins: 1 Joining Node: 255<br>Domain Map: 1 2<br>Live Map: 1 2<br>Lock Resources: 48113 (50521)<br>MLEs: 0 (85510)<br>
Blocking: 0 (35121)<br> Mastery: 0 (50389)<br> Migration: 0 (0)<br>Lists: Dirty=Empty Purge=Empty PendingASTs=Empty PendingBASTs=Empty<br>Purge Count: 0 Refs: 48114<br>Dead Node: 255<br>Recovery Pid: 3758 Master: 255 State: INACTIVE<br>
Recovery Map:<br>Recovery Node State:<br><br>There are no busy locks apparently, as shown by<br><br># debugfs.ocfs2 -R "fs_locks -B" /dev/sda1<br>#<br><br>I am unable to kill any of these processes, even with kill -9.<br>
<br># cat /etc/ocfs2/cluster.conf<br>cluster:<br> node_count = 2<br> name = ocfs2ftpcluster<br><br>node:<br> ip_port = 7777<br> ip_address = 192.168.0.1<br> number = 1<br> name = prtftp01<br>
cluster = ocfs2ftpcluster<br><br>node:<br> ip_port = 7777<br> ip_address = 192.168.0.2<br> number = 2<br> name = prtftp02<br> cluster = ocfs2ftpcluster<br><br>If you'd like the output of :<br>
<br># debugfs.ocfs2 -R "fs_locks" /dev/sda1 | wc -l<br>768681<br><br>I can give it, but it's a lot output.<br><br>--Jason<br><br><div class="gmail_quote">On Fri, Apr 2, 2010 at 11:38 AM, Jason Price <span dir="ltr"><<a href="mailto:japrice@gmail.com">japrice@gmail.com</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">I'm setting up an HA ftp server (amongst other services).<br><br>When
two connections happen simultaneously, and (more specifically) the same
user from two IP's attempt to access the same file (one for reading, and
one for writing), the processes both hang. And all subsequent attempts
to either read or write the file fail.<br>
<br>The two processes that seem to have caused the lock:<br>user 24139
1657 Thu Apr 1 18:25:01 2010 proftpd: cbs - ::ffff:xxx.yyy.0.253: RETR
prim_wo_img_dom.obs<br>user 24142 1657 Thu Apr 1 18:25:01 2010
proftpd: cbs - ::ffff:xxx.yyy.103.208: STOR prim_wo_img_dom.obs<br>
<br>(there are 49 other process trying to do the same things, but these
are the first ones.)<br><br>I'm more than happy to provide any
information needed on this issue:<br><br>OSL<br>CentOS release 5.4
(Final)<br><br>
uname -a:<br>Linux prtftp01<omitted> 2.6.18-164.11.1.el5 #1 SMP
Wed Jan 20 07:32:21 EST 2010 x86_64 x86_64 x86_64 GNU/Linux<br><br>ocfs2
version 1.4.4<br><br>At the moment, only one host is actively serving
FTP at any time. I can fail the services back and forth as needed.<br><font color="#888888"><font color="#888888">
<br>--Jason</font>
</font></blockquote></div><br>