<html>
<head>
<meta content="text/html; charset=UTF-8" http-equiv="Content-Type">
</head>
<body bgcolor="#FFFFFF" text="#000000">
Hello:<br>
<br>
On 05/09/2016 09:20 PM, 서정우 wrote:<br>
<blockquote
cite="mid:CAAcuexBFPZ=dTgORi_nHXTgRsEHk2ZTrTHLpzN+eFTneOtiiSw@mail.gmail.com"
type="cite">
<div dir="ltr"><span style="font-size:14px">Hi all.</span>
<div style="font-size:14px"><br>
</div>
<div style="font-size:14px">I set up OCFS2 on DRBD in
dual-primary mode.</div>
<div style="font-size:14px">Each node has 12 disks in RAID 10
with an mdadm chunk size of 4096k.</div>
<div style="font-size:14px">The cluster size of the filesystem is
1048576 bytes.</div>
<div style="font-size:14px"><br>
</div>
<div style="font-size:14px">The main purpose is to reflink files
on DRBD.</div>
<div style="font-size:14px"><br>
</div>
<div style="font-size:14px">I reflinked files from a 1TB file and
exported them through LIO iSCSI.</div>
<div style="font-size:14px"><br>
</div>
<div style="font-size:14px">After a few days of testing, I got a
kernel error.</div>
<div style="font-size:14px"><br>
</div>
<div style="font-size:14px"><br clear="all">
<div><br>
</div>
<div>
<div dir="ltr">
<div>May 4 19:29:38 master kernel: [1283940.130689]
(reflink,30902,0):ocfs2_check_dir_for_entry:2048 ERROR:
status = -17</div>
<div>May 4 19:29:38 master kernel: [1283940.131122]
(reflink,30902,0):ocfs2_mv_orphaned_inode_to_new:2917
ERROR: status = -17</div>
<div>May 4 19:29:38 master kernel: [1283940.131533]
(reflink,30902,0):ocfs2_reflink:4317 ERROR: status = -17</div>
<div>
<div>May 4 21:15:29 master kernel: [1290290.387752]
INFO: task reflink:5954 blocked for more than 120
seconds.</div>
<div>May 4 21:15:29 master kernel: [1290290.388093]
Not tainted 4.4.7-040407-generic #201604121331</div>
<div>May 4 21:15:29 master kernel: [1290290.388417]
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs"
disables this message.</div>
<div>May 4 21:15:29 master kernel: [1290290.388784]
reflink D ffff880037e83cf8 0 5954 25468
0x00000000</div>
<div>May 4 21:15:29 master kernel: [1290290.388788]
ffff880037e83cf8 ffff8800b80e6000 ffff8802156ae040
ffff88000195d280</div>
<div>May 4 21:15:29 master kernel: [1290290.388790]
ffff880037e84000 ffff8801af84f1dc ffff88000195d280
00000000ffffffff</div>
<div>May 4 21:15:29 master kernel: [1290290.388792]
ffff8801af84f1e0 ffff880037e83d10 ffffffff817fdf35
ffff8801af84f1d8</div>
<div>May 4 21:15:29 master kernel: [1290290.388793]
Call Trace:</div>
<div>May 4 21:15:29 master kernel: [1290290.388798]
[<ffffffff817fdf35>] schedule+0x35/0x80</div>
<div>May 4 21:15:29 master kernel: [1290290.388800]
[<ffffffff817fe1de>]
schedule_preempt_disabled+0xe/0x10</div>
<div>May 4 21:15:29 master kernel: [1290290.388802]
[<ffffffff817ffe19>]
__mutex_lock_slowpath+0xb9/0x130</div>
<div>May 4 21:15:29 master kernel: [1290290.388803]
[<ffffffff817ffeaf>] mutex_lock+0x1f/0x30</div>
<div>May 4 21:15:29 master kernel: [1290290.388832]
[<ffffffffc0a02668>]
ocfs2_reflink_ioctl+0x218/0x360 [ocfs2]</div>
<div>May 4 21:15:29 master kernel: [1290290.388848]
[<ffffffffc09e634e>] ocfs2_ioctl+0x26e/0x660
[ocfs2]</div>
<div>May 4 21:15:29 master kernel: [1290290.388851]
[<ffffffff8121aa68>] do_vfs_ioctl+0x298/0x480</div>
<div>May 4 21:15:29 master kernel: [1290290.388853]
[<ffffffff812167a4>] ? putname+0x54/0x60</div>
<div>May 4 21:15:29 master kernel: [1290290.388854]
[<ffffffff812066bf>] ? do_sys_open+0x1af/0x230</div>
<div>May 4 21:15:29 master kernel: [1290290.388856]
[<ffffffff8121acc9>] SyS_ioctl+0x79/0x90</div>
<div>May 4 21:15:29 master kernel: [1290290.388858]
[<ffffffff81802036>]
entry_SYSCALL_64_fastpath+0x16/0x75</div>
<div>May 4 21:15:29 master kernel: [1290290.388860]
INFO: task reflink:6466 blocked for more than 120
seconds.</div>
<div>May 4 21:15:29 master kernel: [1290290.389236]
Not tainted 4.4.7-040407-generic #201604121331</div>
<div>May 4 21:15:29 master kernel: [1290290.389611]
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs"
disables this message.</div>
<div>May 4 21:15:29 master kernel: [1290290.389998]
reflink D ffff880038f87cf8 0 6466 32643
0x00000000</div>
<div>May 4 21:15:29 master kernel: [1290290.390000]
ffff880038f87cf8 ffff8800b80e6000 ffff880215542940
ffff880002508dc0</div>
<div>May 4 21:15:29 master kernel: [1290290.390002]
ffff880038f88000 ffff8801af84f1dc ffff880002508dc0
00000000ffffffff</div>
<div>May 4 21:15:29 master kernel: [1290290.390004]
ffff8801af84f1e0 ffff880038f87d10 ffffffff817fdf35
ffff8801af84f1d8</div>
<div>May 4 21:15:29 master kernel: [1290290.390005]
Call Trace:</div>
<div>May 4 21:15:29 master kernel: [1290290.390008]
[<ffffffff817fdf35>] schedule+0x35/0x80</div>
<div>May 4 21:15:29 master kernel: [1290290.390009]
[<ffffffff817fe1de>]
schedule_preempt_disabled+0xe/0x10</div>
<div>May 4 21:15:29 master kernel: [1290290.390010]
[<ffffffff817ffe19>]
__mutex_lock_slowpath+0xb9/0x130</div>
<div>May 4 21:15:29 master kernel: [1290290.390012]
[<ffffffff817ffeaf>] mutex_lock+0x1f/0x30</div>
<div>May 4 21:15:29 master kernel: [1290290.390031]
[<ffffffffc0a02668>]
ocfs2_reflink_ioctl+0x218/0x360 [ocfs2]</div>
<div>May 4 21:15:29 master kernel: [1290290.390045]
[<ffffffffc09e634e>] ocfs2_ioctl+0x26e/0x660
[ocfs2]</div>
<div>May 4 21:15:29 master kernel: [1290290.390048]
[<ffffffff8121aa68>] do_vfs_ioctl+0x298/0x480</div>
<div>May 4 21:15:29 master kernel: [1290290.390049]
[<ffffffff812167a4>] ? putname+0x54/0x60</div>
<div>May 4 21:15:29 master kernel: [1290290.390051]
[<ffffffff812066bf>] ? do_sys_open+0x1af/0x230</div>
<div>May 4 21:15:29 master kernel: [1290290.390052]
[<ffffffff8121acc9>] SyS_ioctl+0x79/0x90</div>
<div>May 4 21:15:29 master kernel: [1290290.390054]
[<ffffffff81802036>]
entry_SYSCALL_64_fastpath+0x16/0x75</div>
</div>
<div><br>
</div>
<div><br>
</div>
<div>I saw the same report with kernel 4.3, but there was no
answer.</div>
<div>Any ideas?</div>
</div>
</div>
</div>
<div><br>
</div>
</div>
</blockquote>
A similar issue was first reported by Junxiao:<br>
[1]
<a class="moz-txt-link-freetext" href="https://oss.oracle.com/pipermail/ocfs2-devel/2015-December/011408.html">https://oss.oracle.com/pipermail/ocfs2-devel/2015-December/011408.html</a><br>
<br>
I also reproduced that issue, but it is very tricky to fix in a
way the maintainers can accept. A patch was proposed in this thread,
but I cannot find the patches in the archive:<br>
[2]
<a class="moz-txt-link-freetext" href="https://oss.oracle.com/pipermail/ocfs2-devel/2016-January/011503.html">https://oss.oracle.com/pipermail/ocfs2-devel/2016-January/011503.html</a><br>
<br>
IIRC, in my reproduction it just hung there, without the "-17" error
(which means "File exists") that shows up in your log:<br>
(reflink,30902,0):ocfs2_reflink:4317 ERROR: status = -17<br>
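<br>
To double-check that reading of the status code, here is a minimal
Python sketch that maps a negative kernel status to its errno name.
The helper name decode_kernel_status is made up for illustration, and
the mapping assumes Linux errno numbering, where 17 is EEXIST:<br>
<pre wrap="">
```python
import errno
import os


def decode_kernel_status(status):
    """Map a negative kernel status (e.g. -17) to (errno name, message).

    Hypothetical helper for illustration only; it relies on the errno
    table of the host running the script, assumed here to be Linux.
    """
    code = abs(status)
    name = errno.errorcode.get(code, "UNKNOWN")
    return name, os.strerror(code)


# The status logged by ocfs2_reflink in the report above.
name, message = decode_kernel_status(-17)
print(name, "-", message)
```
</pre>
On a Linux host this should report EEXIST, which fits the failure
coming from ocfs2_check_dir_for_entry in the log.<br>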
<br>
Eric <br>
<br>
</body>
</html>