[ClusterLabs] Increasing fence timeout
Casey & Gina
caseyandgina at icloud.com
Tue Aug 13 15:50:51 EDT 2019
Thank you, I reached the same conclusion after reaching through the script.
Another question - I am no longer seeing the error quoted below as I've increased shell_timeout to 30 seconds, but failovers are still happening. From the logs, it appears that the cluster simply loses communication with one of the nodes. Is there a way to increase a timeout such that it waits a while to see if it can re-establish the connection before performing a failover?
> On Aug 12, 2019, at 1:28 AM, Oyvind Albrigtsen <oalbrigt at redhat.com> wrote:
> You should be able to increase this timeout by running:
> pcs stonith update <device> shell_timeout=10
> On 08/08/19 12:13 -0600, Casey & Gina wrote:
>> Hi, I'm currently running into periodic premature killing of nodes due to the fence monitor timeout being set to 5 seconds. Here is an example message from the logs:
>> fence_vmware_rest stderr: [ Exception: Operation timed out after 5001 milliseconds with 0 bytes received ]
>> How can I increase this timeout using PCS?
>> Thank you,
>> Manage your subscription:
>> ClusterLabs home: https://www.clusterlabs.org/
More information about the Users