RE: CRS-1615:voting device hang at 50% fatal, termination in 99620 ms

From: D'Hooge Freek <Freek.DHooge_at_uptime.be>
Date: Thu, 25 Aug 2011 11:08:01 +0200
Message-ID: <4814386347E41145AAE79139EAA39898150E4F481D_at_ws03-exch07.iconos.be>



Marco,

I don't know the error timings for the other node, but I think the heartbeat fatal messages are coming after the first node has terminated due to the missing voting disk.

This would indicate that there is no general problem with the voting disk itself, but that the problem is specific to the first node. The cause would then be either that node's connection to the storage, the load on that node, or an OCFS2 bug.
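
To check the ordering, the heartbeat warnings from the surviving node can be lined up against the CRS-1615 messages on the first node. Below is a rough Python sketch of how I would do that. It assumes the alert log layout seen in the excerpt quoted further down (a timestamp on one line, the cssd message on a following line). The function name, the regex and the SAMPLE text are my own illustration, not an Oracle tool.

```python
import re
from datetime import datetime, timedelta

# Matches the cssd heartbeat warnings as they appear in the quoted excerpt.
# This pattern is an assumption based on that excerpt only.
HEARTBEAT_RE = re.compile(
    r"CRS-(?P<code>\d{4}):node (?P<node>\S+) \(\d+\) at (?P<pct>\d+)% "
    r"heartbeat fatal, eviction in (?P<secs>\d+\.\d+) seconds"
)
TS_FORMAT = "%Y-%m-%d %H:%M:%S.%f"

# A few lines copied from the quoted excerpt, used as sample input.
SAMPLE = """\
2011-08-25 10:38:33.563
[cssd(18117)]CRS-1612:node l01ora3 (1) at 50% heartbeat fatal, eviction in 14.000 seconds
2011-08-25 10:38:40.558
[cssd(18117)]CRS-1611:node l01ora3 (1) at 75% heartbeat fatal, eviction in 7.010 seconds
2011-08-25 10:38:47.562
[cssd(18117)]CRS-1610:node l01ora3 (1) at 90% heartbeat fatal, eviction in 0.010 seconds
"""


def parse_heartbeat_warnings(text):
    """Return (logged_at, node, percent, projected_eviction) per warning."""
    results = []
    last_ts = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        try:
            # A line holding only a timestamp belongs to the next message.
            last_ts = datetime.strptime(line, TS_FORMAT)
            continue
        except ValueError:
            pass
        match = HEARTBEAT_RE.search(line)
        if match and last_ts is not None:
            remaining = timedelta(seconds=float(match.group("secs")))
            results.append(
                (last_ts, match.group("node"), int(match.group("pct")),
                 last_ts + remaining)
            )
    return results


if __name__ == "__main__":
    for logged_at, node, pct, eviction in parse_heartbeat_warnings(SAMPLE):
        print(f"{logged_at}  {node}  {pct}%  projected eviction {eviction}")
```

If the first CRS-1615 message on l01ora3 is older than the earliest warning time found this way, that would support the idea that the voting disk problem came first and the missed heartbeats were only a consequence.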

Do you know if the other OCFS2 volumes were still accessible at the time of the failure? Are your voting disks placed on the same LUNs as your database files, or are they on a separate OCFS2 volume?

Regards,

Freek D'Hooge
Uptime
Oracle Database Administrator
email: freek.dhooge_at_uptime.be
tel +32(0)3 451 23 82
http://www.uptime.be
disclaimer: www.uptime.be/disclaimer

---
From: Marko Sutic [mailto:marko.sutic_at_gmail.com] 
Sent: Thursday 25 August 2011 10:51
To: D'Hooge Freek
Cc: oracle-l_at_freelists.org
Subject: Re: CRS-1615:voting device hang at 50% fatal, termination in 99620 ms

Error messages from the other node:

2011-08-25 10:38:33.563
[cssd(18117)]CRS-1612:node l01ora3 (1) at 50% heartbeat fatal, eviction in 14.000 seconds
2011-08-25 10:38:40.558
[cssd(18117)]CRS-1611:node l01ora3 (1) at 75% heartbeat fatal, eviction in 7.010 seconds
2011-08-25 10:38:41.560
[cssd(18117)]CRS-1611:node l01ora3 (1) at 75% heartbeat fatal, eviction in 6.010 seconds
2011-08-25 10:38:45.558
[cssd(18117)]CRS-1610:node l01ora3 (1) at 90% heartbeat fatal, eviction in 2.010 seconds
2011-08-25 10:38:46.560
[cssd(18117)]CRS-1610:node l01ora3 (1) at 90% heartbeat fatal, eviction in 1.010 seconds
2011-08-25 10:38:47.562
[cssd(18117)]CRS-1610:node l01ora3 (1) at 90% heartbeat fatal, eviction in 0.010 seconds
2011-08-25 10:38:47.574
[cssd(18117)]CRS-1607:CSSD evicting node l01ora3. Details in /u01/app/crs/log/l01ora4/cssd/ocssd.log.
2011-08-25 10:39:01.579
[cssd(18117)]CRS-1601:CSSD Reconfiguration complete. Active nodes are l01ora4 .

Regards,
Marko
--
http://www.freelists.org/webpage/oracle-l
Received on Thu Aug 25 2011 - 04:08:01 CDT
