[Pacemaker] Backup ring is marked faulty

Sebastian Kaps sebastian.kaps at imail.de
Thu Aug 4 18:43:32 UTC 2011


Hi Steven,

On 04.08.2011, at 18:27, Steven Dake wrote:

> redundant ring is only supported upstream in corosync 1.4.1 or later.

What does "supported" mean in this context, exactly? 

I'm asking because we've been having serious issues with these systems since 
they went into production (the testing phase did not show any problems, 
but we also couldn't use real workloads then).

Since the cluster went into production, we've been having issues with seemingly 
random STONITH events that seem to be related to a high I/O load on a 
DRBD-mirrored OCFS2 volume - but I don't see any pattern yet. We've had these 
machines running for nearly two weeks without major problems, and then suddenly 
they went back to killing each other :-(

> The retransmit list message issue you are having is fixed in corosync
> 1.3.3 and later. This is what is triggering the redundant ring faulty
> error.

Could it also cause the instability problems we're seeing?
Thanks again for helping!

-- 
Sebastian



