Re: Long connect time when one node in RAC goes down

From: Martin Klier <usn_at_usn-it.de>
Date: Thu, 04 Sep 2008 16:50:39 +0200
Message-ID: <48BFF5BF.3080907@usn-it.de>


Hi Yechiel,

Yechiel Adar wrote:
> After that it tries to convert the name to a TCP/IP address.
> When sqlnet tries to convert server2-vip to a TCP/IP address, it gets stuck.
>
> It seems that somewhere in the network something is not updated
> when the VIP is moved to the other node, and it takes about 6 (or 6*2)
> seconds until sqlnet gets an error from the network. It then tries to
> connect with the second entry, server-vip1, and this works.
>
> Have you heard anything about this problem?

It sounds a bit like an ARP refresh issue. On large networks, the ARP caches (and the MAC address tables of the switches) can take a while to pick up the new location of the VIP, and the more intermediate switches and routers sit on the path between the client and the DB, the longer it may take until all of them have been updated.

Can you or your network admin trace the managed switches that are immediately involved on client and server side?
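
If it helps to narrow this down from the client side, a small script could time the name lookup and the TCP connect separately, so you can see which of the two steps actually eats the seconds. What follows is only a minimal sketch under my own assumptions: the function names are made up for illustration, port 1521 is just the usual listener default and may differ on your system, and the 15 second timeout is arbitrary.

```python
import socket
import time


def time_resolve(host):
    """Return (addresses, seconds) for a name lookup; addresses is [] on failure."""
    start = time.perf_counter()
    try:
        infos = socket.getaddrinfo(host, None, socket.AF_INET, socket.SOCK_STREAM)
        addrs = sorted({info[4][0] for info in infos})
    except socket.gaierror:
        addrs = []
    return addrs, time.perf_counter() - start


def time_connect(addr, port, timeout=15.0):
    """Return (ok, seconds) for one TCP connect attempt to addr:port."""
    start = time.perf_counter()
    try:
        # Open and immediately close the connection; only the timing matters.
        with socket.create_connection((addr, port), timeout=timeout):
            ok = True
    except OSError:
        # Covers refused connections, unreachable hosts and timeouts.
        ok = False
    return ok, time.perf_counter() - start


def probe(hosts, port=1521, timeout=15.0):
    """Print lookup and connect timings for each host, in the given order."""
    for host in hosts:
        addrs, t_dns = time_resolve(host)
        print(f"{host}: lookup {t_dns:.2f}s -> {addrs or 'unresolved'}")
        for addr in addrs:
            ok, t_conn = time_connect(addr, port, timeout)
            state = "ok" if ok else "FAILED"
            print(f"  {addr}:{port} connect {state} in {t_conn:.2f}s")
```

Calling probe(["server2-vip", "server-vip1"]) on an affected client right after the failover, and again a few minutes later, should show whether the delay is in the lookup or in the connect. A fast lookup followed by a slow, failing connect to the moved VIP would fit the ARP theory, but it would not prove it.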

Regards
Martin

-- 
Usn's IT Blog for Linux, Oracle, Asterisk
http://www.usn-it.de

--
http://www.freelists.org/webpage/oracle-l
Received on Thu Sep 04 2008 - 09:50:39 CDT
