Return-Path: <root@fatcity.cts.com>
Received: from ensim.rackshack.net (root@localhost)
 by orafaq.net (8.11.6/8.11.6) with ESMTP id gBMKZd513814
 for <oracle-l@orafaq.net>; Sun, 22 Dec 2002 14:35:39 -0600
X-ClientAddr: 209.68.248.164
Received: from newsfeed.cts.com (newsfeed.cts.com [209.68.248.164])
 by ensim.rackshack.net (8.11.6/8.11.6) with ESMTP id gBMKZdc13809
 for <oracle-l@orafaq.net>; Sun, 22 Dec 2002 14:35:39 -0600
Received: from fatcity.UUCP (uucp@localhost)
 by newsfeed.cts.com (8.9.3/8.9.3) with UUCP id JAA79993;
 Sun, 22 Dec 2002 09:13:49 -0800 (PST)
Received: by fatcity.com (26-Feb-2001/v1.0g-b72/bab) via UUCP id 00520EF1; Sun, 22 Dec 2002 08:38:36 -0800
Message-ID: <F001.00520EF1.20021222083836@fatcity.com>
Date: Sun, 22 Dec 2002 08:38:36 -0800
To: Multiple recipients of list ORACLE-L <ORACLE-L@fatcity.com>
X-Comment: Oracle RDBMS Community Forum
X-Sender: "chao_ping" <chao_ping@vip.163.com>
Sender: root@fatcity.com
Reply-To: ORACLE-L@fatcity.com
Errors-To: ML-ERRORS@fatcity.com
From: "chao_ping" <chao_ping@vip.163.com>
Subject: Re: RE: help me find out why rac instance died
Organization: Fat City Network Services, San Diego, California
X-ListServer: v1.0g, build 72; ListGuru (c) 1996-2001 Bruce A. Bergman
Precedence: bulk
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit

Thanks for your suggestion. I checked that article, but I am still unable to solve the problem.
At the same time the next day, another instance in the cluster died for the same reason: ORA-29740, again with reason 2.
The cluster had been running quite stably for the past month (the patchset was installed only about 30 days ago).
When I checked the Linux /var/log/messages, I found that syslogd restarted on both nodes at exactly the time the RAC instance died, on both days. Could the two be related? Linux itself did not reboot; I checked the uptime value.
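To make that comparison repeatable, here is a minimal sketch that pulls syslogd restart entries out of syslog-style lines and pairs them with the time an instance died. The function names, the host name, and the sample log lines are hypothetical, and the wording of the restart message is an assumption; only the instance-death timestamp is taken from the trace excerpt in this message.

```python
import re
from datetime import datetime, timedelta

# Assumed line shape: "Dec 21 04:02:03 host syslogd 1.4.1: restart."
# Adjust the pattern to whatever your /var/log/messages actually contains.
SYSLOG_RESTART = re.compile(
    r"^(?P<stamp>[A-Z][a-z]{2}\s+\d{1,2} \d{2}:\d{2}:\d{2}) \S+ syslogd\b.*restart"
)

def syslogd_restarts(lines, year):
    """Return timestamps of syslogd restart entries (syslog lines omit the year)."""
    found = []
    for line in lines:
        m = SYSLOG_RESTART.match(line)
        if m:
            stamp = f"{year} {m.group('stamp')}"
            found.append(datetime.strptime(stamp, "%Y %b %d %H:%M:%S"))
    return found

def near(events, restarts, window=timedelta(minutes=5)):
    """Map each event time to the restarts that fall within the window around it."""
    return {e: [r for r in restarts if abs(r - e) <= window] for e in events}

# Hypothetical sample lines, NOT copied from the real log.
sample = [
    "Dec 21 04:02:03 node2 syslogd 1.4.1: restart.",
    "Dec 21 09:15:00 node2 sshd[123]: session opened",
]
restarts = syslogd_restarts(sample, 2002)
death = datetime(2002, 12, 21, 4, 1, 54)  # first "not beating" record in the trace
for event, hits in near([death], restarts).items():
    print(event, "->", hits)
```

A syslogd restart at that hour may simply be the nightly log rotation, so it is probably worth checking which other jobs cron starts on both nodes around 04:00.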
From the trace file, I found that the dead instance had stopped updating its heartbeat.
On the first day, from the surviving instance rac1:

*** 2002-12-21 04:01:54.227
kjxgrnbrisalive: (1, 2) not beating, HB: 479418910, 479418910
*** 2002-12-21 04:01:54.239
kjxgrnbrdead: Detected death of 1, initiating reconfig
kjxgrrcfgchk: Initiating reconfig, reason 2
*** 2002-12-21 04:01:59.256
kjxgmrcfg: Reconfiguration started, reason 2
kjxgmcs: Setting state to 6 0.
*** 2002-12-21 04:01:59.258
Name Service frozen
kjxgmcs: Setting state to 6 1.

On the second day, from the trace file of the surviving instance rac2:

*** 2002-12-22 04:01:56.457
kjxgrnbrisalive: (0, 1) not beating, HB: 479438832, 479438832
*** 2002-12-22 04:01:56.457
kjxgrnbrdead: Detected death of 0, initiating reconfig
kjxgrrcfgchk: Initiating reconfig, reason 2
*** 2002-12-22 04:02:01.486
kjxgmrcfg: Reconfiguration started, reason 2
kjxgmcs: Setting state to 9 0.
*** 2002-12-22 04:02:01.495
Name Service frozen

Does anyone here have experience with RAC systems? What should I check to find out why the RAC instance failed to update the controlfile? I have already enabled this event:
event="29740 trace name errorstack level 3"
in one instance.
Should I set the undocumented parameter
_imr_active=false in the system?
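
For reviewing the LMON trace files, a minimal sketch that extracts the "not beating" records from excerpts like the two above, so the times on different days can be compared. The function name is hypothetical, and the code only records the numbers in parentheses and the HB values; it makes no claim about what they mean. The sample text is copied from the trace lines in this message.

```python
import re
from datetime import datetime

STAMP = re.compile(r"^\*\*\* (\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d{3})")
NOT_BEATING = re.compile(
    r"^kjxgrnbrisalive: \((\d+), (\d+)\) not beating, HB: (\d+), (\d+)"
)

def stalled_heartbeats(trace_text):
    """Return (timestamp, ids, hb_values) for each 'not beating' record."""
    current = None  # most recent "***" timestamp seen so far
    records = []
    for line in trace_text.splitlines():
        line = line.strip()
        m = STAMP.match(line)
        if m:
            current = datetime.strptime(m.group(1), "%Y-%m-%d %H:%M:%S.%f")
            continue
        m = NOT_BEATING.match(line)
        if m:
            a, b, hb1, hb2 = map(int, m.groups())
            records.append((current, (a, b), (hb1, hb2)))
    return records

TRACE = """\
*** 2002-12-21 04:01:54.227
kjxgrnbrisalive: (1, 2) not beating, HB: 479418910, 479418910
*** 2002-12-22 04:01:56.457
kjxgrnbrisalive: (0, 1) not beating, HB: 479438832, 479438832
"""

records = stalled_heartbeats(TRACE)
gap = records[1][0] - records[0][0]
print(gap)  # 1 day, 0:00:02.230000
```

The two detections are almost exactly 24 hours apart, which suggests looking for something scheduled daily at that time.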

> Hi Chao:
> 
> Instance 2 in your cluster (rac2) died during
> the fast reconfiguration (check the reason in the alert log
> file, which says reason 2). You generally do a reconfig
> (or fast reconfig) when you add/remove instances from the
> cluster setup, which is not (I hope) your case.
> 
> There are some kernel events to trace the reconfigurations,
> and an underscore parameter (I think it is _imr_active)
> to disable the 29740 eviction, which is usually not recommended.
> 
> For investigation, review the checkpoint and LMON trace files
> and check the OS log files.
> 
> 
> 
> 
> Best Regards,
> K Gopalakrishnan
> 
> 
> 
-- 
Please see the official ORACLE-L FAQ: http://www.orafaq.net
-- 
Author: chao_ping
  INET: chao_ping@vip.163.com


