OK, I tried another test from Prod -> DR.

Tiny database.

I can tnsping the LIVE_DR and LIVE_PROD service names from both servers, and the listeners are up on both servers.


log_archive_dest_2 on the primary is set to: service=live_dr optional reopen=60 lgwr async noaffirm



I switch logs on the primary (which takes ages, 2-3 minutes) and this is what I get:


Thu Feb 4 11:30:49 2010
LGWR: Completed archiving log 1 thread 1 sequence 14
LGWR received timedout error from KSR
No action necessary. ARCH will attempt to transmit log file
LGWR: Error 2 closing archivelog file 'live_dr'
Thu Feb 4 11:33:49 2010
Errors in file /u1/app/oracle/product/9.2.0.8/admin/LIVE/bdump/live_lgwr_9764.trc:
ORA-00002: Message 2 not found; product=RDBMS; facility=ORA
LGWR: Error 2 disconnecting from destination LOG_ARCHIVE_DEST_2 standby host 'live_dr'
Thread 1 advanced to log sequence 15
Current log# 2 seq# 15 mem# 0: /u4/oradata/LIVE/redo02.log
Thu Feb 4 11:33:53 2010
ARC1: Evaluating archive log 1 thread 1 sequence 14
ARC1: Beginning to archive log 1 thread 1 sequence 14
Creating archive destination LOG_ARCHIVE_DEST_1: '/u4/oradata/LIVE/arch/1_14.dbf'
ARC1: Completed archiving log 1 thread 1 sequence 14



live_lgwr_9764.trc:

*** SESSION ID:(4.1) 2010-02-04 11:29:22.051
*** 2010-02-04 11:29:22.051 43650 kcrr.c
Initializing NetServer for dest=live_dr
Initializing PGA storage for Netserver communication
Allocating a brand new NetServer
Allocated NetServer 0
Starting NetServer
NetServer 0 has been started.Subscribing to KSR Channel 0
success!
Indicating recv buffer for KSR Channel 0
success
Waiting for Netserver 0 to initialize itself
*** 2010-02-04 11:29:25.106 43976 kcrr.c
Netserver 0 has been initialized
LGWR performing a channel reset to ignore previous responses
LGWR connecting as publisher to KSR Channel 0
LGWR-NS 0 initialized for destination=live_dr
*** 2010-02-04 11:29:25.106 44421 kcrr.c
Making upiahm request to NetServer 0
Waiting for NetServer to respond to upiahm
*** 2010-02-04 11:29:28.537 44607 kcrr.c
upiahm connect done status is 0
Receiving message from NetServer 0
Destination LOG_ARCHIVE_DEST_2 is in CLUSTER CONSISTENT mode
Destination LOG_ARCHIVE_DEST_2 is in MAXIMUM PERFORMANCE mode
Receiving message from NetServer 0
*** 2010-02-04 11:29:28.579 45903 kcrr.c
Making upinbls request to NetServer 0
NetServer pid:9786
*** 2010-02-04 11:30:49.340 45733 kcrr.c
Making upinblc request to NetServer 0
NetServer pid:9786
*** 2010-02-04 11:30:59.333
LGWR found NetServer 0 alive.. waiting for msg
LGWR found NetServer 0 alive.. waiting for msg
LGWR found NetServer 0 alive.. waiting for msg
LGWR found NetServer 0 alive.. waiting for msg
LGWR found NetServer 0 alive.. waiting for msg
LGWR found NetServer 0 alive.. waiting for msg
LGWR found NetServer 0 alive.. waiting for msg
*** 2010-02-04 11:32:09.287
LGWR found NetServer 0 alive.. waiting for msg
*** 2010-02-04 11:32:19.298
LGWR found NetServer 0 alive.. waiting for msg
*** 2010-02-04 11:32:29.299
LGWR found NetServer 0 alive.. waiting for msg
LGWR found NetServer 0 alive.. waiting for msg
LGWR found NetServer 0 alive.. waiting for msg
LGWR found NetServer 0 alive.. waiting for msg
*** 2010-02-04 11:33:29.242
LGWR found NetServer 0 alive.. waiting for msg
LGWR found NetServer 0 alive.. waiting for msg
Current time 710076829 exceeds start time 710076649 plus 180 seconds: return timedout error
LGWR received timedout error from KSR
No action necessary. ARCH will attempt to transmit log file
ERROR:kcrrnupinblc 2 on received message 4
LGWR produced a total of 25 RFSWRITE messages
ERROR:kcrrnsupirfs.1 LGWR had received a timeout previously. return timeout again
Error 2 closing standby archive log file at host 'live_dr'
Archive destination LOG_ARCHIVE_DEST_2 made inactive: File close error
*** 2010-02-04 11:33:49.003 51426 kcrr.c
LGWR: Error 2 closing archivelog file 'live_dr'
*** 2010-02-04 11:33:49.003
kcrrfail: dest:2 err:2 force:0
ORA-00002: Message 2 not found; product=RDBMS; facility=ORA
LGWR produced a total of 25 RFSWRITE messages
ERROR:kcrrnsupirfs.1 LGWR had received a timeout previously. return timeout again
Error 2 detaching RFS from standby instance at host 'live_dr'
*** 2010-02-04 11:33:49.018 45506 kcrr.c
Making upidhs request to NetServer 0 (hst 0x103edbe58)
NetServer pid:9786
*** 2010-02-04 11:33:53.018 45650 kcrr.c
Cleaninup up LNS0 [pid 9786] after network detach
*** 2010-02-04 11:33:53.018 43513 kcrr.c
LNS0 [pid 9786] receiving termination signal..
.... killed succesfully
ERROR:kcrrnsupidhs LGWR had received a timeout previously. return timeout again
*** 2010-02-04 11:33:53.019 51426 kcrr.c
LGWR: Error 2 disconnecting from destination LOG_ARCHIVE_DEST_2 standby host 'live_dr'
Ignoring kcrrlcmp() detach error 2
*** 2010-02-04 11:33:53.019
kcrrfail: dest:2 err:2 force:0
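
As a quick sanity check on the timing, here is a rough Python sketch using only the numbers pasted above. The variable names are mine, and I am assuming the alert log timestamps and the two counters in the "Current time ... exceeds start time ..." trace line are both in seconds. If that holds, the slow log switch looks like LGWR sitting out the full 180-second timeout on the standby destination rather than anything wrong with the switch itself.

```python
from datetime import datetime

# Timestamps copied from the alert log above (second precision only).
close_started = datetime(2010, 2, 4, 11, 30, 49)     # "LGWR: Completed archiving log 1 thread 1 sequence 14"
close_failed = datetime(2010, 2, 4, 11, 33, 49)      # "LGWR: Error 2 closing archivelog file 'live_dr'"

# Counters copied from the trace line
# "Current time 710076829 exceeds start time 710076649 plus 180 seconds".
ksr_start = 710076649
ksr_now = 710076829

# Gap between LGWR finishing the log and reporting the close error.
wall_clock_gap = int((close_failed - close_started).total_seconds())

# Gap between the two counters reported by the trace.
ksr_gap = ksr_now - ksr_start

print("alert log gap (s):", wall_clock_gap)
print("trace counter gap (s):", ksr_gap)
```

Both gaps come out the same, which would account for the 2-3 minute log switch.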