[ClusterLabs] Increasing fence timeout

Tue Aug 13 15:50:51 EDT 2019

Thank you, I reached the same conclusion after reaching through the script.

Another question - I am no longer seeing the error quoted below as I've increased shell_timeout to 30 seconds, but failovers are still happening.  From the logs, it appears that the cluster simply loses communication with one of the nodes.  Is there a way to increase a timeout such that it waits a while to see if it can re-establish the connection before performing a failover?

Thank you,
-- 
Casey

> On Aug 12, 2019, at 1:28 AM, Oyvind Albrigtsen <oalbrigt at redhat.com> wrote:
> 
> You should be able to increase this timeout by running:
> pcs stonith update <device> shell_timeout=10
> 
> Oyvind
> 
> On 08/08/19 12:13 -0600, Casey & Gina wrote:
>> Hi, I'm currently running into periodic premature killing of nodes due to the fence monitor timeout being set to 5 seconds.  Here is an example message from the logs:
>> 
>> fence_vmware_rest[22334] stderr: [ Exception: Operation timed out after 5001 milliseconds with 0 bytes received ]
>> 
>> How can I increase this timeout using PCS?
>> 
>> Thank you,
>> -- 
>> Casey
>> _______________________________________________
>> Manage your subscription:
>> https://lists.clusterlabs.org/mailman/listinfo/users
>> 
>> ClusterLabs home: https://www.clusterlabs.org/