<div dir="ltr">Hi,<div><br></div><div>I'm using Redhat Cluster Suite 7with watchdog timer based fence agent. I understand this is a really bad setup but this is what the end-user wants.</div><div><br></div><div>ATB => auto_tie_breaker</div><div><br></div><div>"When the auto_tie_breaker is used in even-number member clusters, then the failure of the partition containing the auto_tie_breaker_node (by default the node with lowest ID) will cause other partition to become inquorate and it will self-fence. In 2-node clusters with auto_tie_breaker this means that failure of node favoured by auto_tie_breaker_node (typically nodeid 1) will result in reboot of other node (typically nodeid 2) that detects the inquorate state. If this is undesirable then corosync-qdevice can be used instead of the auto_tie_breaker to provide additional vote to quorum making behaviour closer to odd-number member clusters."</div><div><br></div><div>Thanks</div><div><br></div></div><br><div class="gmail_quote"><div dir="ltr">On Sun, 29 Apr 2018 at 02:15, Digimer <<a href="mailto:lists@alteeve.ca">lists@alteeve.ca</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">On 2018-04-28 09:06 PM, Wei Shan wrote:<br>
> Hi all,<br>
> <br>
> If I have a 2 node cluster with ATB enabled and the lowest node ID node<br>
> has failed. What will happen? My assumption is that the higher node ID<br>
> node will self fence and be rebooted. What happens after that?<br>
> <br>
> Thanks!<br>
> <br>
> -- <br>
> Regards,<br>
> Ang Wei Shan<br>
<br>
Which cluster stack is this? I am not familiar with the term "ATB".<br>
<br>
If it's a standard pacemaker or cman/rgmanager cluster, then on node<br>
failure, the good node should block and request a fence (a lost node is<br>
not allowed to be assumed gone via self fence, except when using a<br>
watchdog timer based fence agent). If the fence doesn't work, the<br>
survivor should remain blocked (better to hang than risk corruption). If<br>
the fence succeeds, then the survivor node will recover any lost<br>
services based on the configuration of those services (usually a simple<br>
(re)start on the good node).<br>
<br>
-- <br>
Digimer<br>
Papers and Projects: <a href="https://alteeve.com/w/" rel="noreferrer" target="_blank">https://alteeve.com/w/</a><br>
"I am, somehow, less interested in the weight and convolutions of<br>
Einstein’s brain than in the near certainty that people of equal talent<br>
have lived and died in cotton fields and sweatshops." - Stephen Jay Gould<br>
</blockquote></div><br clear="all"><div><br></div>-- <br><div dir="ltr" class="gmail_signature" data-smartmail="gmail_signature">Regards,<br>Ang Wei Shan</div>