[Ocfs2-users] Catatonic nodes under SLES10
David Miller
syslog at d.sparks.net
Mon Apr 9 14:32:33 PDT 2007
Alexei_Roudnev wrote:
> Did you checked
>
> /proc/sys/kernel/panic /proc/sys/kernel/panic_on_oops
>
> system variables?
>
No. Maybe I'm missing something here.
Are you saying that a panic/freeze/reboot is the expected/desirable
behavior? That nothing more graceful could be done, like to just
dismount the ocfs2 file systems, or force them to a read-only mount or
something like that? We have to reload the kernel?
Thanks,
--- David
> ----- Original Message -----
> From: "David Miller" <syslog at d.sparks.net>
> To: <ocfs2-users at oss.oracle.com>
> Sent: Monday, April 02, 2007 9:01 AM
> Subject: [Ocfs2-users] Catatonic nodes under SLES10
>
[snip]
> Both servers will be connected to a dual-host external RAID system.
> I've setup ocfs2 on a couple of test systems and everything appears to
> work fine.
>
> Until, that is, one of the systems loses network connectivity.
>
> When the systems can't talk to each other anymore, but the disk
> heartbeat is still alive, the high numbered node goes catatonic. Under
> SLES 9 it fenced itself off with a kernel panic; under 10 it simply
> stops responding to network or console. A power cycling is required to
> bring it back up.
>
> The desired behavior would be for the higher numbered node to lose
> access to the ocfs2 file system(s). I don't really care whether it
> would simply timeout ala stale NFS mounts, or immediately error like
> access to non-existent files.
>
>
More information about the Ocfs2-users
mailing list