<html>
<head>
<meta content="text/html; charset=ISO-8859-1"
http-equiv="Content-Type">
</head>
<body bgcolor="#FFFFFF" text="#000000">
<div class="moz-cite-prefix">On 2012-08-22 18:23, srinivas eeda
wrote:<br>
</div>
<blockquote cite="mid:5035076B.8000101@oracle.com" type="cite">
<meta content="text/html; charset=ISO-8859-1"
http-equiv="Content-Type">
crash looks similar to what patch <a moz-do-not-send="true"
class="moz-txt-link-freetext"
href="https://oss.oracle.com/pipermail/ocfs2-devel/2012-January/008469.html">https://oss.oracle.com/pipermail/ocfs2-devel/2012-January/008469.html</a>
trying to address. The fix is not yet accepted because as
explained in the patch description we need to fix the master node
to skip sending BAST after receiving unlock message.<br>
<br>
regarding ERROR: status = -17 what storage do you use? could be
due to stale data.<br>
</blockquote>
Size of storage is 400G<br>
OCFS2 works over aoe <br>
<br>
<br>
<blockquote cite="mid:5035076B.8000101@oracle.com" type="cite"> <br>
On 8/22/2012 2:25 AM, Pawel wrote:
<blockquote cite="mid:5034A59D.7060205@mp.pl" type="cite">
<meta content="text/html; charset=ISO-8859-1"
http-equiv="Content-Type">
<div class="moz-cite-prefix">It was done multiple times,<br>
even more: system was recreated by mkfs.<br>
Still the same behavior...<br>
<br>
<br>
Pawel<br>
<br>
On 2012-08-22 04:21, Sunil Mushran wrote:<br>
</div>
<blockquote
cite="mid:CAEeiSHU6rPsrtfZLemJzUPSeNgq5q=0=SCgiQnCmMBPN0Bpnbg@mail.gmail.com"
type="cite">You may want to run a full fsck on the fs.
<div><br>
</div>
<div>fsck.ocfs2 -fy /dev/xxxx<br>
<br>
<div class="gmail_quote">On Tue, Aug 21, 2012 at 12:49 AM,
Pawel <span dir="ltr"><<a moz-do-not-send="true"
href="mailto:pzlist@mp.pl" target="_blank">pzlist@mp.pl</a>></span>
wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0
.8ex;border-left:1px #ccc solid;padding-left:1ex">Hi,<br>
After upgrading ocfs2 my cluster is instable.<br>
<br>
At least ones per week I can see:<br>
kernel panic: Null pointer dereference at 00048<br>
o2dlm_blocking_ast_wrapper + 0x8/0x20 [ocfs2_stack_o2cb]<br>
stack:<br>
dlm_do_local_bast [ocfs2_dlm]<br>
dlm_lookup_lockers [ocfs2_dlm]<br>
dlm_proxy_ast_handler<br>
add_timer<br>
..<br>
<br>
After that sometimes deadlock happens on another nodes.
Entire cluster<br>
restart solve the issue.<br>
I see in log:<br>
(dlm_thread,7227,3):dlm_send_proxy_ast_msg:484 ERROR:<br>
ECB9442E19A94EAC896641BFADD55E4B: res
M0000000000000001f411c900000000,<br>
error -107 send AST to node 4<br>
(dlm_thread,7227,3):dlm_flush_asts:605 ERROR: status =
-107<br>
o2net: No connection established with node 4 after 10.0
seconds, giving up.<br>
o2net: No connection established with node 4 after 10.0
seconds, giving up.<br>
o2net: No connection established with node 4 after 10.0
seconds, giving up.<br>
(dlm_thread,7227,4):dlm_send_proxy_ast_msg:484 ERROR:<br>
ECB9442E19A94EAC896641BFADD55E4B: res
M0000000000000001f411c900000000,<br>
error -107 send AST to node 4<br>
(dlm_thread,7227,4):dlm_flush_asts:605 ERROR: status =
-107<br>
o2cb: o2dlm has evicted node 4 from domain
ECB9442E19A94EAC896641BFADD55E4B<br>
o2cb: o2dlm has evicted node 4 from domain
ECB9442E19A94EAC896641BFADD55E4B<br>
o2dlm: Begin recovery on domain
ECB9442E19A94EAC896641BFADD55E4B for node 4<br>
o2dlm: Node 5 (he) is the Recovery Master for the dead
node 4 in domain<br>
ECB9442E19A94EAC896641BFADD55E4B<br>
o2dlm: End recovery on domain
ECB9442E19A94EAC896641BFADD55E4B<br>
<br>
<br>
Additionaly ~4 times per day I see:<br>
<br>
ocfs2_check_dir_for_entry:2119 ERROR: status = -17<br>
ocfs2_mknod:459 ERROR: status = -17<br>
ocfs2_create:629 ERROR: status = -17<br>
<br>
<br>
I currently use kernel 3.4.2<br>
my filesystem has been created with:<br>
-N 8-b 4096 -C 32768 --fs-features<br>
backup-super,strict-journal-super,sparse,extended-slotmap,inline-data,metaecc,xattr,indexed-dirs,refcount,discontig-bg,unwritten,usrquota,grpquota<br>
<br>
Could you tell me what could make my system instable?
Which feature ?<br>
<br>
Thanks for any help<br>
<br>
Pawel<br>
<br>
<br>
_______________________________________________<br>
Ocfs2-users mailing list<br>
<a moz-do-not-send="true"
href="mailto:Ocfs2-users@oss.oracle.com">Ocfs2-users@oss.oracle.com</a><br>
<a moz-do-not-send="true"
href="https://oss.oracle.com/mailman/listinfo/ocfs2-users"
target="_blank">https://oss.oracle.com/mailman/listinfo/ocfs2-users</a><br>
</blockquote>
</div>
<br>
</div>
</blockquote>
<br>
<br>
<fieldset class="mimeAttachmentHeader"></fieldset>
<br>
<pre wrap="">_______________________________________________
Ocfs2-users mailing list
<a moz-do-not-send="true" class="moz-txt-link-abbreviated" href="mailto:Ocfs2-users@oss.oracle.com">Ocfs2-users@oss.oracle.com</a>
<a moz-do-not-send="true" class="moz-txt-link-freetext" href="https://oss.oracle.com/mailman/listinfo/ocfs2-users">https://oss.oracle.com/mailman/listinfo/ocfs2-users</a></pre>
</blockquote>
<br>
<fieldset class="mimeAttachmentHeader"></fieldset>
<br>
<pre wrap="">_______________________________________________
Ocfs2-users mailing list
<a class="moz-txt-link-abbreviated" href="mailto:Ocfs2-users@oss.oracle.com">Ocfs2-users@oss.oracle.com</a>
<a class="moz-txt-link-freetext" href="https://oss.oracle.com/mailman/listinfo/ocfs2-users">https://oss.oracle.com/mailman/listinfo/ocfs2-users</a></pre>
</blockquote>
<br>
</body>
</html>