<div dir="ltr"><div dir="ltr"><br></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Fri, Aug 14, 2020 at 6:10 AM Gabriele Bulfon <<a href="mailto:gbulfon@sonicle.com">gbulfon@sonicle.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div><div style="font-family:Tahoma;font-size:14px;color:rgb(0,0,0)">Thanks to all your suggestions, I now have the systems with stonith configured on ipmi.</div>
<div style="font-family:Tahoma;font-size:14px;color:rgb(0,0,0)"> </div>
<div style="font-family:Tahoma;font-size:14px;color:rgb(0,0,0)">Two questions:</div>
<div style="font-family:Tahoma;font-size:14px;color:rgb(0,0,0)">- how can I simulate a stonith situation to check that everything is ok?<br></div></div></blockquote><div><br></div><div>You can run `stonith_admin -B <node>` to tell Pacemaker to reboot the node using the configured stonith devices. If you want to test a network failure, you can have iptables block inbound and outbound traffic on the heartbeat IP address on one node.<br> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div><div style="font-family:Tahoma;font-size:14px;color:rgb(0,0,0)">- considering that I have both nodes with stonith against the other node, once the two nodes can communicate, how can I be sure the two nodes will not try to stonith each other?</div></div></blockquote><div><br></div><div>The simplest option is to add a delay attribute (e.g., delay=10) to one of the stonith devices. That way, if both nodes want to fence each other, the node whose stonith device has a delay configured will wait for the delay to expire before executing the reboot action.</div><div><br></div><div>Alternatively, you can set up corosync-qdevice, using a separate system running qnetd server as a quorum arbitrator.<br></div><div> <br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div>
<div style="font-family:Tahoma;font-size:14px;color:rgb(0,0,0)"> </div>
<div style="font-family:Tahoma;font-size:14px;color:rgb(0,0,0)">:)<br>Thanks!<br>Gabriele<br><br>
<div id="gmail-m_-3141077643952048964wt-mailcard">
<div> </div>
<div> </div>
<div><span style="font-size:14px;font-family:Helvetica"><b>Sonicle S.r.l. </b>: <a href="http://www.sonicle.com/" target="_blank">http://www.sonicle.com</a></span></div>
<div><span style="font-size:14px;font-family:Helvetica"><b>Music: </b><a href="http://www.gabrielebulfon.com/" target="_blank">http://www.gabrielebulfon.com</a></span></div>
<div><span style="font-size:14px;font-family:Helvetica"><b>Quantum Mechanics : </b><a href="http://www.cdbaby.com/cd/gabrielebulfon" target="_blank">http://www.cdbaby.com/cd/gabrielebulfon</a></span></div>
</div>
<br><hr><br><br><span style="font-family:Arial,Helvetica,sans-serif;font-size:small"><b>From:</b> Gabriele Bulfon <<a href="mailto:gbulfon@sonicle.com" target="_blank">gbulfon@sonicle.com</a>><br><b>To:</b> Cluster Labs - All topics related to open-source clustering welcomed <<a href="mailto:users@clusterlabs.org" target="_blank">users@clusterlabs.org</a>><br><b>Date:</b> 29 July 2020 14:22:42 CEST<br><b>Subject:</b> Re: [ClusterLabs] Antw: [EXT] Stonith failing<br></span><br><br>
<blockquote style="border-left:2px solid rgb(0,0,128);margin-left:5px;padding-left:5px">
<div> </div>
<div style="font-family:Tahoma;font-size:14px;color:rgb(0,0,0)">It is a ZFS-based illumos system.</div>
<div style="font-family:Tahoma;font-size:14px;color:rgb(0,0,0)">I don't think SBD is an option.</div>
<div style="font-family:Tahoma;font-size:14px;color:rgb(0,0,0)">Is there a reliable ZFS-based stonith?</div>
<div style="font-family:Tahoma;font-size:14px;color:rgb(0,0,0)"> </div>
<div style="font-family:Tahoma;font-size:14px;color:rgb(0,0,0)">Gabriele<br><br>
<br><hr><br><br><span style="font-family:Arial,Helvetica,sans-serif;font-size:small"><b>From:</b> Andrei Borzenkov <<a href="mailto:arvidjaar@gmail.com" target="_blank">arvidjaar@gmail.com</a>><br><b>To:</b> Cluster Labs - All topics related to open-source clustering welcomed <<a href="mailto:users@clusterlabs.org" target="_blank">users@clusterlabs.org</a>><br><b>Date:</b> 29 July 2020 09:46:09 CEST<br><b>Subject:</b> Re: [ClusterLabs] Antw: [EXT] Stonith failing<br></span><br><br>
<blockquote style="border-left:2px solid rgb(0,0,128);margin-left:5px;padding-left:5px">
<div dir="ltr">
<div dir="ltr">
<div style="font-family:arial,sans-serif"> </div>
</div>
<br>
<div class="gmail_quote">
<div class="gmail_attr" dir="ltr">On Wed, Jul 29, 2020 at 9:01 AM Gabriele Bulfon <<a href="mailto:gbulfon@sonicle.com" target="_blank">gbulfon@sonicle.com</a>> wrote:</div>
<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<div>
<div style="font-family:Tahoma;font-size:14px;color:rgb(0,0,0)">That one was taken from a specific implementation on Solaris 11.</div>
<div style="font-family:Tahoma;font-size:14px;color:rgb(0,0,0)">The situation is a dual node server with shared storage controller: both nodes see the same disks concurrently.</div>
<div style="font-family:Tahoma;font-size:14px;color:rgb(0,0,0)">Here we must be sure that the two nodes are not going to import/mount the same zpool at the same time, or we will encounter data corruption:</div>
</div>
</blockquote>
<div> </div>
<div>
<div style="font-family:arial,sans-serif">An SSH-based "stonith" cannot guarantee that, because it only works while the target node is still reachable and responsive.</div>
</div>
<div> </div>
<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<div>
<div style="font-family:Tahoma;font-size:14px;color:rgb(0,0,0)">Node 1 will be preferred for pool 1 and node 2 for pool 2; only if one of the nodes goes down or is taken offline should its resources first be freed by the leaving node and then taken over by the other node.</div>
<div style="font-family:Tahoma;font-size:14px;color:rgb(0,0,0)"> </div>
<div style="font-family:Tahoma;font-size:14px;color:rgb(0,0,0)">Would you suggest one of the available stonith in this case?</div>
<div style="font-family:Tahoma;font-size:14px;color:rgb(0,0,0)"> </div>
</div>
</blockquote>
<div> </div>
<div>
<div style="font-family:arial,sans-serif">IPMI, managed PDU, SBD ...</div>
</div>
<div style="font-family:arial,sans-serif">In practice, the only stonith method that works in case of complete node outage including any power supply is SBD.</div>
</div>
</div>
<pre>_______________________________________________
Manage your subscription:
<a href="https://lists.clusterlabs.org/mailman/listinfo/users" target="_blank">https://lists.clusterlabs.org/mailman/listinfo/users</a>
ClusterLabs home: <a href="https://www.clusterlabs.org/" target="_blank">https://www.clusterlabs.org/</a>
</pre>
</blockquote>
</div>
</blockquote>
</div></div>
</blockquote></div><br clear="all"><br>-- <br><div dir="ltr" class="gmail_signature"><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><div><div>Regards,<br><br></div>Reid Wahl, RHCA<br></div><div>Software Maintenance Engineer, Red Hat<br></div>CEE - Platform Support Delivery - ClusterHA</div></div></div></div></div></div></div></div></div></div></div></div></div></div></div>