[Ocfs-users] Hard system restart when DRBD connection fails while in use

Henri Cook ocfs at theplayboymansion.net
Sun Sep 7 03:47:16 PDT 2008


Answering my own questions...

OCFS2 is definitely the cause, number 77 in the 1.2 FAQ allows me to
cause a reboot, OR a kernel panic when one of the nodes in my two-node
cluster abruptly quits (without unmounting the in-use drive) - what's
the reasoning behind this kind of fencing? It cripples the entire pair
for >=30 seconds while the one that's presumably still working reboots -
I don't see the benefit?

Henri


Henri Cook wrote:
> Please, this is quite urgent for me - sorry to be a pain
>
> It appears that OCFS2 is a very likely suspect in causing these reboots,
> they only occur when the shared drbd device is mounted on both nodes
> (which is the default behaviour) - if I unmount on Node B before
> rebooting it then the reboot does not occur. There are no error messages
> from OCFS2 to speak of, where can i see/configure these heartbeat options?
>
> I'm trying to get a faux-serial console attached as i've read there's a
> historic issue where it doesn't even write to log files but only to screen
>
> Thanks,
>
> Henri
>
> Henri Cook wrote:
>   
>> Hi all,
>>
>> I have two nodes (A+B) running a DRBD file system (using OCFS2) on /shared.
>>
>> If I start say, an FTP file transfer to my drbd /shared directory on node A, then reboot node B which is the other machine in a Primary-Primary DRBD configuration while the transfer is in progress - node A stops at a similar time that DRBD notices the connection with Node B has been lost (hence crippling both machines for the time it takes to reboot). If the drive is inactive (i.e. nothing is being written to it) then this does not occur.
>>
>> My question then is, could OCFS2 tools be the source of these reboots, is there any such default action configured? If so, how would I go about investigating/altering it?  There are no log entries about the reboot to speak of.
>>
>> OS is Ubuntu Hardy (Server) 8.04 and ocfs2-tools 1.3.9-0ubuntu1
>>
>> Thanks in advance,
>>
>> Henri
>>
>>
>>   
>>     
>
>   
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://oss.oracle.com/pipermail/ocfs-users/attachments/20080907/bb74373e/attachment.html 


More information about the Ocfs-users mailing list