[ClusterLabs] [External] : Re: Fence Agent tests

Valentin Vidić vvidic at valentin-vidic.from.hr
Sat Nov 5 14:06:46 EDT 2022


On Sat, Nov 05, 2022 at 05:20:47PM +0000, Robert Hayden wrote:
> The OCI compute instances don't have a hardware watchdog, only the software watchdog.
> So, when the network goes completely hung (e.g. firewall-cmd panic-on), all network 
> traffic stops which implies that IO to the SBD device also stops.  I do not see the software
> watchdog take any action in response to the network hang.

It seems like the watchdog is not working or is not configured with a
correct timeout here. sbd will not refresh the watchdog if it fails to
read from the disk, so the watchdog should eventually expire and reset
the node.

-- 
Valentin


More information about the Users mailing list