<html><header></header><body><div style="font-family: tahoma,arial,helvetica,sans-serif; font-size: 14px;">Thanks, I will experiment this.</div>
<div style="font-family: tahoma,arial,helvetica,sans-serif; font-size: 14px;"> </div>
<div style="font-family: tahoma,arial,helvetica,sans-serif; font-size: 14px;">Now, I have a last issue about stonith.</div>
<div style="font-family: tahoma,arial,helvetica,sans-serif; font-size: 14px;">I tried to reproduce a stonith situation, by disabling the network interface used for HA on node 1.</div>
<div style="font-family: tahoma,arial,helvetica,sans-serif; font-size: 14px;">Stonith is configured with ipmi poweroff.</div>
<div style="font-family: tahoma,arial,helvetica,sans-serif; font-size: 14px;">What happens, is that once the interface is down, both nodes tries to stonith the other node, causing both to poweroff...</div>
<div style="font-family: tahoma,arial,helvetica,sans-serif; font-size: 14px;">I would like the node running all resources (zpool and nfs ip) to be the first trying to stonith the other node.</div>
<div style="font-family: tahoma,arial,helvetica,sans-serif; font-size: 14px;">Or is there anything else better?</div>
<div style="font-family: tahoma,arial,helvetica,sans-serif; font-size: 14px;"> </div>
<div style="font-family: tahoma,arial,helvetica,sans-serif; font-size: 14px;">Here is the current crm config show:</div>
<div style="font-family: tahoma,arial,helvetica,sans-serif; font-size: 14px;"> </div>
<div style="font-family: tahoma,arial,helvetica,sans-serif; font-size: 14px;">
<pre class="language-markup"><code>node 1: xstha1 \
attributes standby=off maintenance=off
node 2: xstha2 \
attributes standby=off maintenance=off
primitive xstha1-stonith stonith:external/ipmi \
params hostname=xstha1 ipaddr=192.168.221.18 userid=ADMIN passwd="******" interface=lanplus \
op monitor interval=25 timeout=25 start-delay=25 \
meta target-role=Started
primitive xstha1_san0_IP IPaddr \
params ip=10.10.10.1 cidr_netmask=255.255.255.0 nic=san0
primitive xstha2-stonith stonith:external/ipmi \
params hostname=xstha2 ipaddr=192.168.221.19 userid=ADMIN passwd="******" interface=lanplus \
op monitor interval=25 timeout=25 start-delay=25 \
meta target-role=Started
primitive xstha2_san0_IP IPaddr \
params ip=10.10.10.2 cidr_netmask=255.255.255.0 nic=san0
primitive zpool_data ZFS \
params pool=test \
op start timeout=90 interval=0 \
op stop timeout=90 interval=0 \
meta target-role=Started
location xstha1-stonith-pref xstha1-stonith -inf: xstha1
location xstha1_san0_IP_pref xstha1_san0_IP 100: xstha1
location xstha2-stonith-pref xstha2-stonith -inf: xstha2
location xstha2_san0_IP_pref xstha2_san0_IP 100: xstha2
order zpool_data_order inf: zpool_data ( xstha1_san0_IP )
location zpool_data_pref zpool_data 100: xstha1
colocation zpool_data_with_IPs inf: zpool_data xstha1_san0_IP
property cib-bootstrap-options: \
have-watchdog=false \
dc-version=1.1.15-e174ec8 \
cluster-infrastructure=corosync \
stonith-action=poweroff \
no-quorum-policy=stop
</code></pre>
</div>
<div style="font-family: tahoma,arial,helvetica,sans-serif; font-size: 14px;"> </div>
<div style="font-family: tahoma,arial,helvetica,sans-serif; font-size: 14px;">Thanks!<br />Gabriele</div>
<div style="font-family: tahoma,arial,helvetica,sans-serif; font-size: 14px;"> </div>
<div id="wt-mailcard">
<div> </div>
<div><span style="font-size: 14px; font-family: Helvetica;"><strong>Sonicle S.r.l. </strong>: <a href="http://www.sonicle.com/" target="_new">http://www.sonicle.com</a></span></div>
<div><span style="font-size: 14px; font-family: Helvetica;"><strong>Music: </strong><a href="http://www.gabrielebulfon.com/" target="_new">http://www.gabrielebulfon.com</a></span></div>
<div><span style="font-size: 14px; font-family: Helvetica;"><strong>eXoplanets : </strong><a href="https://gabrielebulfon.bandcamp.com/album/exoplanets">https://gabrielebulfon.bandcamp.com/album/exoplanets</a></span></div>
<div> </div>
</div>
<div style="font-family: tahoma,arial,helvetica,sans-serif; font-size: 14px;"><tt><br /><br /><br />----------------------------------------------------------------------------------<br /><br />Da: Andrei Borzenkov <arvidjaar@gmail.com><br />A: users@clusterlabs.org <br />Data: 11 dicembre 2020 18.30.29 CET<br />Oggetto: Re: [ClusterLabs] Antw: [EXT] Recoveing from node failure<br /><br /></tt></div>
<blockquote style="border-left: #000080 2px solid; margin-left: 5px; padding-left: 5px;"><tt>11.12.2020 18:37, Gabriele Bulfon пишет:<br />> I found I can do this temporarily:<br />> <br />> crm config property cib-bootstrap-options: no-quorum-policy=ignore<br />> <br /><br />All two node clusters I remember run with setting forever :)<br /><br />> then once node 2 is up again:<br />> <br />> crm config property cib-bootstrap-options: no-quorum-policy=stop<br />> <br />> so that I make sure nodes will not mount in another strange situation.<br />> <br />> Is there any better way? <br /><br />"better" us subjective, but ...<br /><br />> (such as ignore until everything is back to normal then conisder top again)<br />> <br /><br />That is what stonith does. Because quorum is pretty much useless in two<br />node cluster, as I already said all clusters I have seem used<br />no-quorum-policy=ignore and stonith-enabled=true. It means when node<br />boots and other node is not available stonith is attempted; if stonith<br />succeeds pacemaker continues with starting resources; if stonith fails,<br />node is stuck.<br /><br />_______________________________________________<br />Manage your subscription:<br />https://lists.clusterlabs.org/mailman/listinfo/users<br /><br />ClusterLabs home: https://www.clusterlabs.org/<br /><br /><br /></tt></blockquote></body></html>