<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv=Content-Type content="text/html; charset=us-ascii">
<meta name=Generator content="Microsoft Word 12 (filtered medium)">
<style>
<!--
/* Font Definitions */
@font-face
        {font-family:Wingdings;
        panose-1:5 0 0 0 0 0 0 0 0 0;}
@font-face
        {font-family:"Cambria Math";
        panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
        {font-family:Calibri;
        panose-1:2 15 5 2 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
        {margin:0in;
        margin-bottom:.0001pt;
        font-size:11.0pt;
        font-family:"Calibri","sans-serif";}
a:link, span.MsoHyperlink
        {mso-style-priority:99;
        color:blue;
        text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
        {mso-style-priority:99;
        color:purple;
        text-decoration:underline;}
p.MsoListParagraph, li.MsoListParagraph, div.MsoListParagraph
        {mso-style-priority:34;
        margin-top:0in;
        margin-right:0in;
        margin-bottom:0in;
        margin-left:.5in;
        margin-bottom:.0001pt;
        font-size:11.0pt;
        font-family:"Calibri","sans-serif";}
span.EmailStyle17
        {mso-style-type:personal-compose;
        font-family:"Calibri","sans-serif";
        color:windowtext;}
.MsoChpDefault
        {mso-style-type:export-only;}
@page Section1
        {size:8.5in 11.0in;
        margin:1.0in 1.0in 1.0in 1.0in;}
div.Section1
        {page:Section1;}
/* List Definitions */
@list l0
        {mso-list-id:503477716;
        mso-list-type:hybrid;
        mso-list-template-ids:-831113944 1817844718 67698713 67698715 67698703 67698713 67698715 67698703 67698713 67698715;}
@list l0:level1
        {mso-level-text:%1;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        margin-left:63.75pt;
        text-indent:-21.0pt;}
@list l1
        {mso-list-id:1697776430;
        mso-list-type:hybrid;
        mso-list-template-ids:-148206960 -438119442 67698691 67698693 67698689 67698691 67698693 67698689 67698691 67698693;}
@list l1:level1
        {mso-level-start-at:0;
        mso-level-number-format:bullet;
        mso-level-text:-;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-.25in;
        font-family:"Calibri","sans-serif";
        mso-fareast-font-family:Calibri;
        mso-bidi-font-family:"Times New Roman";}
ol
        {margin-bottom:0in;}
ul
        {margin-bottom:0in;}
-->
</style>
<!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
</head>
<body lang=EN-US link=blue vlink=purple>
<div class=Section1>
<p class=MsoNormal>Hi All.<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>Currently running ocfs2 in a dual node setup over dual
primary DRBD with a gigabit backend for a webserver environment.<o:p></o:p></p>
<p class=MsoNormal>Read performance is as expected, write performance is absolutely
terrible (ie: 22 file modifications per second).<o:p></o:p></p>
<p class=MsoNormal>The gigabit crossover achieves its full capacity easily, but
has an avg 77ms latency.<o:p></o:p></p>
<p class=MsoNormal>So Im looking to change to infiniband with some hardware
from ebay and hopefully thatll solve the slow problem. Do you think it will
solve my bad write performance issues?<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>My next problem is where if too many applications are queued
to write to the partition ocfs goes and restarts the system (obviously cause it
hasn’t communicated with the other node in quite a while, currently
configured for 60 seconds).<o:p></o:p></p>
<p class=MsoNormal>And cause im only running two node, the other one goes and
kills itself too. (Im in the process of setting up a third node via iscsi, but haven’t
got there yet)<o:p></o:p></p>
<p class=MsoNormal>When the two come back up, and drbd is finished syncing I go
to manually re-mount one of the servers.<o:p></o:p></p>
<p class=MsoNormal>But when I do it restarts itself again, and again, and again
etc.<o:p></o:p></p>
<p class=MsoNormal>All I see in messages/dmesg is something like this, then the
server goes and resets itself.<o:p></o:p></p>
<p class=MsoNormal style='text-indent:.5in'> (3756,3):ocfs2_find_slot:502
slot 1 is already allocated to this node!<o:p></o:p></p>
<p class=MsoNormal style='text-indent:.5in'> (3756,3):ocfs2_check_volume:1753
File system was not unmounted cleanly, recovering volume.<o:p></o:p></p>
<p class=MsoNormal>The slotmap has both nodes in it, even tho they aren’t
mounted.<o:p></o:p></p>
<p class=MsoNormal style='text-indent:.5in'># echo "slotmap" |
debugfs.ocfs2 -n /dev/drbd0<o:p></o:p></p>
<p class=MsoNormal> Slot#
Node#<o:p></o:p></p>
<p class=MsoNormal> 0
1<o:p></o:p></p>
<p class=MsoListParagraph style='margin-left:63.75pt;text-indent:-21.0pt;
mso-list:l0 level1 lfo2'><![if !supportLists]><span style='mso-list:Ignore'>1<span
style='font:7.0pt "Times New Roman"'>
</span></span><![endif]>0<o:p></o:p></p>
<p class=MsoNormal>Currently im fsck’ing the partition, which replayed the
journals of both nodes (contrary to the error message you see above).<o:p></o:p></p>
<p class=MsoNormal>Then after a couple of failures (each time resetting one of
the servers) I end up trying to mount with localflocks.<o:p></o:p></p>
<p class=MsoNormal>It seems that half the time localflocks works, it mounts the
partition. I can then unmount and remount normally, and happy sailing.<o:p></o:p></p>
<p class=MsoNormal>But the other half the time the system resets itself again.<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>Im not sure how im supposed to remount the partition properly
in this scenario, can someone help me?<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>Btw:<o:p></o:p></p>
<p class=MsoNormal>Linux- 2.6.28<o:p></o:p></p>
<p class=MsoNormal>drbd 8.2.7<o:p></o:p></p>
<p class=MsoNormal>elevator=deadline<o:p></o:p></p>
<p class=MsoNormal>I hope I included enough relevant information.<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>Andrew.<o:p></o:p></p>
</div>
</body>
</html>