Re: 11gR2 RAC Network Failure - Timeout

From: Riyaj Shamsudeen <riyaj.shamsudeen_at_gmail.com>
Date: Mon, 6 Feb 2012 13:08:51 -0600
Message-ID: <CAA2DszyrjC=48np_mK491eSdMKb-YXnRYVq73uJSdnOtaFQejA_at_mail.gmail.com>



Purav,
You may want to use the -p flag to see those attributes of a resource. CHECK_INTERVAL is set to 1 second; that is the interval between two consecutive checks of the VIP resource. The VIP resource also has a CHECK_TIMEOUT attribute set to 30 seconds, so a VIP failure might be detected a few seconds after the 30-second mark. It is possible that clusterware is also trying to tear down the dependent resource, causing 10 more seconds of delay.

For the network resource, SCRIPT_TIMEOUT is set to 60 seconds: if the thread executing the script for a resource does not complete within 60 seconds, that resource is considered not responding.
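
As a back-of-the-envelope check of the VIP numbers above, here is a minimal shell sketch. It assumes the worst case is one full CHECK_INTERVAL before the next check starts, plus the whole CHECK_TIMEOUT before that check is declared failed. That model is a simplification, not documented clusterware behaviour, and the function name is made up for illustration.

```shell
#!/bin/sh
# Hypothetical helper: rough upper bound (in seconds) on failure detection,
# assuming one full check interval elapses before a check starts and that
# check then runs until its timeout expires.
detection_upper_bound() {
    check_interval=$1
    check_timeout=$2
    echo $((check_interval + check_timeout))
}

# Values taken from the VIP resource profile in this post; prints 31.
detection_upper_bound 1 30
```

Any delay beyond that figure would have to come from something else, such as the teardown of dependent resources mentioned above.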

Of course, as per the documentation, you shouldn't modify these attributes.

$ crsctl stat res ora.some.vip -p
..
TYPE=ora.cluster_vip_net2.type
ACL=owner:root:rwx,pgrp:root:r-x,other::r--,group:oinstall:r-x,user:grid:r-x ...
CHECK_INTERVAL=1
CHECK_TIMEOUT=30
DESCRIPTION=Oracle VIP resource
ENABLED=1
FAILOVER_DELAY=0
FAILURE_INTERVAL=0
FAILURE_THRESHOLD=0
...
OFFLINE_CHECK_INTERVAL=0
PLACEMENT=favored
PROFILE_CHANGE_TEMPLATE=
RESTART_ATTEMPTS=0
SCRIPT_TIMEOUT=60
SERVER_POOLS=*
START_DEPENDENCIES=hard(ora.net2.network) pullup(ora.net2.network)
START_TIMEOUT=0
STATE_CHANGE_TEMPLATE=
STOP_DEPENDENCIES=hard(ora.net2.network)
STOP_TIMEOUT=0

$ crsctl stat res ora.net1.network -p
NAME=ora.net1.network
TYPE=ora.network.type
..
CHECK_INTERVAL=1
...
OFFLINE_CHECK_INTERVAL=60
...
RESTART_ATTEMPTS=5
SCRIPT_TIMEOUT=60
...
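
To look at only the timing-related attributes instead of scanning the whole profile, the -p output can be filtered. This is a small sketch that assumes the NAME=VALUE layout shown above; the function name timing_attrs and the choice of attributes are only for illustration.

```shell
#!/bin/sh
# Hypothetical filter: keep only the timing-related NAME=VALUE lines
# from `crsctl stat res <resource> -p` output read on stdin.
timing_attrs() {
    grep -E '^(CHECK_INTERVAL|CHECK_TIMEOUT|OFFLINE_CHECK_INTERVAL|SCRIPT_TIMEOUT|START_TIMEOUT|STOP_TIMEOUT)='
}

# Usage on a cluster node (needs crsctl from the GI home in PATH):
#   crsctl stat res ora.some.vip -p | timing_attrs
#   crsctl stat res ora.net1.network -p | timing_attrs
```

The filter only reads the profile, so it does not touch any of the attributes.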

Cheers

Riyaj Shamsudeen
Principal DBA,
Ora!nternals - http://www.orainternals.com - Specialists in Performance, RAC and EBS
Blog: http://orainternals.wordpress.com
OakTable member http://www.oaktable.com and Oracle ACE Director

Co-author of the books: Expert Oracle Practices <http://tinyurl.com/book-expert-oracle-practices/>,
Pro Oracle SQL, Expert PL/SQL Practices <http://tinyurl.com/book-expert-plsql-practices>

On Mon, Feb 6, 2012 at 12:28 PM, D'Hooge Freek <Freek.DHooge_at_uptime.be> wrote:

> Hi,
>
> I think that sets only the interconnect timeout, not the vip timeout.
> On linux the vip connectivity is checked by using ethtool (or mii-tool on
> older linux versions) to verify the status of the network port. If ethtool
> reports that the link is down a ping is done to the public gateway. If the
> ping test fails two times, the failover is initiated.
> Looking at the script, I think the timeout of the ping is set to 1 second
> (-w 1 -c 1).
>
> The script responsible for the vip test is $ORACLE_HOME/bin/racgvip (where
> $ORACLE_HOME is the GI home, not the db home).
> On linux this is just a shell script, so you can modify it to change the
> number of retries.
> But I can't directly find where the sleep time between two vip checks is
> defined.
>
>
> Regards,
>
>
> Freek D'Hooge
> Uptime
> Oracle Database Administrator
> email: freek.dhooge_at_uptime.be
> tel +32(0)3 451 23 82
> http://www.uptime.be
> disclaimer: www.uptime.be/disclaimer
>
> -----Original Message-----
> From: oracle-l-bounce_at_freelists.org [mailto:oracle-l-bounce_at_freelists.org]
> On Behalf Of Phillip Jones
> Sent: maandag 6 februari 2012 16:13
> To: puravc_at_gmail.com
> Cc: oracle-l_at_freelists.org
> Subject: Re: 11gR2 RAC Network Failure - Timeout
>
> Hi,
> To view the timeout:
>
> crsctl get css misscount
>
> To set it:
>
> crsctl set css misscount 45
>
> "crsctl get css" will give you other possible settings that are
> viewable/settable.
>
> Note that Oracle advise to not set misscount to a value lower than 30
> seconds.
>
> Hope that helps,
>
> Phil
>
> On Mon, Feb 6, 2012 at 2:36 PM, Purav Chovatia <puravc_at_gmail.com> wrote:
>
> >
> >
> > Is there any crsctl (crsctl status resource ora.net1.network -f does not
> > show) or srvctl or any other command to display this timeout?
> > Is this timeout modifiable? i.e. can the VIP be made to failover sooner to
> > the surviving node?
> >
>
>
> --
> http://www.freelists.org/webpage/oracle-l
>
>

Received on Mon Feb 06 2012 - 13:08:51 CST
