[Ocfs-users] copy error + control file corruption in ocfs 1.1 0

Bryce philip.copeland at oracle.com
Tue Mar 16 16:35:28 CST 2004


On Fri, 2004-03-12 at 08:22, Robert Blok wrote:
<TRIMMED>

> The second problem is still there:
> - Make the first control file corrupt.
> The database has crashed now.
> 
> - Copy the second controlfile over the first controlfile:
> cp --o_direct ../a2/control02.ctl ./control01.ctl
> 
> - Restart the database:
> [oracle at prac01 a1]$ srvctl start database -d test
> PRKP-1005 : Failed to start up cluster database test
> ORA-00227: corrupt block detected in controlfile: (block 315, # blocks 1)
> ORA-00202: controlfile: '/oradata/test/a1/control01.ctl'
> 
> Any ideas?


For this second part of the problem, the only thing that springs to mind
is that something else is poking the control file with writes while the
cp is happening.


an "lsof ./control01.ctl" should show up what other processes have the
file open
in the 'FD' field look for processes that have a 'w' or 'u' attibute as
these will be the ones that have write access to ./control01.ctl but
please note that a gulity process may not be open/writing at the time
you issue the lsof

eg
[root at emerald root]# lsof  /var/log/messages
COMMAND  PID USER   FD   TYPE DEVICE   SIZE   NODE NAME
syslogd 1997 root    1w   REG    3,7 241697 131655 /var/log/messages
(file descriptor 1 with write access)

[root at emerald root]# lsof  /var/log/maillog
COMMAND  PID USER   FD   TYPE DEVICE  SIZE   NODE NAME
syslogd 1997 root    3w   REG    3,7 11024 131657 /var/log/maillog
(file descriptor 3 with write access)

Phil
=--=



More information about the Ocfs-users mailing list