Failover testing with 10g RAC
Date: Fri, 30 May 2008 11:21:39 -0400
Solaris 10, RAC 10.2.0.3. Using IPMP groups for NIC redundancy.
We've been conducting failover testing: disabling an HBA port, powering off
a switch, yanking an IC link, etc.
In every single case, CRS rebooted the server where the dire deed was performed,
and when the server came back up, the repair was successful, e.g. it had failed
over to the secondary HBA port, or the physical IP for the IPMP group had
floated to the standby NIC, and so forth.
The other server stayed up and all Oracle components remained available. In
the switch power off test, the physical IP for the IC actually floated over to
the standby NIC with no outage on this server.
Is this to be expected? Will CRS always reboot a server to repair itself when
an underlying hardware failure is detected?