[ClusterLabs] Antw: clearing failed actions
lists at alteeve.ca
Wed May 31 18:03:54 EDT 2017
On 31/05/17 11:02 PM, Attila Megyeri wrote:
>>>> What type of failure do you have, and what is the status after that? Do
>>>> have fencing enabled?
>>> Typically a failed start, or a failed monitor.
>>> Fencing is disabled as we have multiple nodes / quorum.
>> Stonith and quorum solve different problems. Stonith is required, quorum
>> is optional.
> I see your point, but does it relate to the failcount issue? By turning stonith off, the fail counters will not be removed even if the service recovers immediately after restart?
I don't know, but according to Ken's last email, what you're seeing is
expected. I replied because of the miss understanding of the rolls
quorum and fencing play. Running a cluster without fencing is dangerous.
Papers and Projects: https://alteeve.com/w/
"I am, somehow, less interested in the weight and convolutions of
Einstein’s brain than in the near certainty that people of equal talent
have lived and died in cotton fields and sweatshops." - Stephen Jay Gould
More information about the Users