Test activation of standby database fails in Oracle 9iR2

From: Finn Jorgensen <finn.oracledba_at_gmail.com>
Date: Wed, 26 Dec 2007 11:15:58 -0500
Message-ID: <74f79c6b0712260815x244ebd6ao1ea746308a81e8aa@mail.gmail.com>

Hello all.

The client I work for has asked me to put together a script to activate their standby databases in case of a datacenter disaster/powerfailure. This is due to the high number of standby's that need activation all at the same time and the short window in which they need to be activated. Part of the task is that the script should be able to run in a "TEST" mode in which the standby's are activated, but the primaries are not shutdown/inaccessible (client just wants to know IT can manage to bring all standby's up in the allotted window, but without disrupting actual production).

All standby's are set up indentically using the LGWR process and standby logfiles and RTA for 10g.

I pretty much have everything together and the actual failover (where the primaries are not available) is working fine and the "TEST" mode works fine too for Oracle 10g, because the "alter database recover standby database finish FORCE" command gracefully kills the RFS processes. However, in 9i the FORCE keyword doesn't exist yet and according to my research my only option is to query the v$managed_standby view and find the "RFS" processes and kill them at the OS level. After that bring the standby online.

As such, for 9i databases the order of events for a test are : Set fal_server='' and fal_client='' on standby, set log_archive_dest_state_2=DEFER on primary, kill -9 RFS processes on standby, alter database recover standby database finish, alter database commit to switchover to primary

However, when I do that I get the following errors in the alert log :

Fri Dec 21 13:51:54 2007
alter database recover managed standby database finish Fri Dec 21 13:51:54 2007
Terminal Recovery: request posted
Fri Dec 21 13:52:15 2007
Warning: log 4 of thread 1 is being archived or modified MRP0: Background Media Recovery terminated with error 261 Fri Dec 21 13:52:15 2007
Errors in file /oracle/admin/DGTEST9/bdump/dgtest9_mrp0_20366.trc: ORA-00261: log 4 of thread 1 is being archived or modified ORA-00312: online log 4 thread 1: '/DGTEST01/DGTEST9/standby01.log' Recovery interrupted.
MRP0: Background Media Recovery process shutdown Fri Dec 21 13:52:16 2007
Terminal Recovery: completion detected
Completed: alter database recover managed standby database Fri Dec 21 13:52:16 2007
alter database commit to switchover to primary ALTER DATABASE COMMIT TO SWITCHOVER TO PRIMARY Database not recovered through End-Of-REDO Database not recovered through End-Of-REDO Switchover: Media recovery required - standby not in limbo ORA-16139 signalled during: alter database commit to switchover to primary...
I then try activating the standby using "skip standby logfile" (which would mean data loss), but still I get an error.

Fri Dec 21 13:52:16 2007

alter database activate standby database skip standby logfile ALTER DATABASE ACTIVATE [PHYSICAL] STANDBY DATABASE Fri Dec 21 13:52:31 2007
Warning: log 4 of thread 1 is being archived or modified Activate standby database received error 261 If I then (by hand) turn the MRP back on and then run through the sequence by hand it works fine :

Fri Dec 21 13:56:04 2007
alter database recover managed standby database disconnect from session Attempt to start background Managed Standby Recovery process MRP0 started with pid=13
MRP0: Background Managed Standby Recovery process started Media Recovery Waiting for thread 1 seq# 228 Fri Dec 21 13:56:10 2007
Completed: alter database recover managed standby database di Fri Dec 21 13:57:19 2007
alter database recover managed standby database finish Terminal Recovery: request posted
Fri Dec 21 13:57:24 2007
TERMINAL RECOVERY changing datafile format version from 8.0.0.0.0 to 9.0.0.0.0
Switching logfile format version from 8.0.0.0.0 to 9.0.0.0.0

Terminal Recovery: applying standby redo logs.
Terminal Recovery: thread 1 seq# 228 redo required
Terminal Recovery:  /DGTEST01/DGTEST9/standby01.log

Identified end-of-REDO for thread 1 sequence 228 Incomplete recovery applied all redo ever generated. Recovery completed through change 8295382144444 MRP0: Media Recovery Complete
Switching logfile format version from 9.0.0.0.0 to 8.0.0.0.0 Terminal Recovery: successful completion Begin: Wait for standby logfiles to be archived Fri Dec 21 13:57:25 2007
ARC0: Evaluating archive log 4 thread 1 sequence 228 Fri Dec 21 13:57:25 2007
ARC1: Evaluating archive log 4 thread 1 sequence 228 ARC1: Unable to archive log 4 thread 1 sequence 228

Log actively being archived by another process Fri Dec 21 13:57:25 2007
ARC0: Beginning to archive log 4 thread 1 sequence 228 Creating archive destination LOG_ARCHIVE_DEST_1: '/oracle/admin/DGTEST9/arch/DGTEST9_1_228.arc' ARC0: Completed archiving log 4 thread 1 sequence 228

Fri Dec 21 13:57:40 2007
End: All standby logfiles have been archived Resetting standby activation ID 799518267 (0x2fa7ae3b) MRP0: Background Media Recovery process shutdown Fri Dec 21 13:57:41 2007
Terminal Recovery: completion detected
Completed: alter database recover managed standby database fi Fri Dec 21 13:57:57 2007
alter database commit to switchover to primary Fri Dec 21 13:57:57 2007
ALTER DATABASE COMMIT TO SWITCHOVER TO PRIMARY NORESETLOGS after complete recovery through change 8295382144444 Resetting resetlogs activation ID 0 (0x0) Online log 1 of thread 1 was previously cleared Online log 2 of thread 1 was previously cleared Changing control file format version from 8.0.0.0.0 to 9.0.0.0.0 RESETLOGS changing datafile format version from 9.0.0.0.0 to 8.0.0.0.0 Switchover: Complete - Database shutdown required Completed: alter database commit to switchover to primary

As such, it seems to be a timing issue with the RFS processes somehow. I built in a sleep for 10 seconds after killing the RFS processes and tried with 30 seconds as well, but still the same error. As stated above, if the primary is unavailable (shutdown abort) I do not have this error.

Does anybody have any experience with this type of problem? Any ideas as to what my problem may be? My testing has been conducted in Oracle 9.2.0.6 on Solaris 9, but I would think it's reproducable in any version of 9iR2.

Thanks,

Finn

--
http://www.freelists.org/webpage/oracle-l

Received on Wed Dec 26 2007 - 10:15:58 CST