<html><body><div style="color:#000; background-color:#fff; font-family:verdana, helvetica, sans-serif;font-size:12pt"><div><span>Its part of the requirement given to me to support this solution on servers without stonith devices. So I cannot enable the stonith.<br></span></div><div><br></div><div style="font-family: verdana,helvetica,sans-serif; font-size: 12pt;"><div style="font-family: times new roman,new york,times,serif; font-size: 12pt;"><font face="Arial" size="2"><hr size="1"><b><span style="font-weight: bold;">From:</span></b> Alan Robertson <alanr@unix.sh><br><b><span style="font-weight: bold;">To:</span></b> ihjaz Mohamed <ihjazmohamed@yahoo.co.in>; The Pacemaker cluster resource manager <pacemaker@oss.clusterlabs.org><br><b><span style="font-weight: bold;">Sent:</span></b> Monday, 24 October 2011 8:22 PM<br><b><span style="font-weight: bold;">Subject:</span></b> Re: [Pacemaker] Cluster goes to (unmanaged) Failed state when
 both nodes are rebooted together<br></font><br><div id="yiv1561605340">

  

    
    <title></title>
  <div>
    Setting no-quorum-policy to ignore and disabling stonith is not a
    good idea.  You're sort of inviting the cluster to do screwed up
    things.<br>
    <br>
    <br>
    On 10/24/2011 08:23 AM, ihjaz Mohamed wrote:
    <blockquote type="cite">
      <div style="color: rgb(0, 0, 0); background-color: rgb(255, 255, 255); font-family: verdana,helvetica,sans-serif; font-size: 14pt;">
        <div><font size="3">Hi All,</font></div>
        <div><font size="2"><br>
          </font></div>
        <div><font size="3">I 've pacemaker running with corosync.
            Following is my </font><font size="3">CRM configuration.</font></div>
        <div><br>
        </div>
        <div> <font size="2">node soalaba56<br>
            node soalaba63<br>
            primitive FloatingIP ocf:heartbeat:IPaddr2 \<br>
                    params ip="<floating_ip>" nic="eth0:0"<br>
            primitive acestatus lsb:acestatus \<br>
            primitive pingd ocf:pacemaker:ping \<br>
                    params host_list="<gateway_ip>"
            multiplier="100" \<br>
                    op monitor interval="15s" timeout="5s"<br>
            group HAService FloatingIP acestatus \<br>
                    meta target-role="Started"<br>
            clone pingdclone pingd \<br>
                    meta globally-unique="false"<br>
            location ip1_location FloatingIP \<br>
                    rule $id="ip1_location-rule" pingd: defined pingd<br>
            property $id="cib-bootstrap-options" \<br>
                   
            dc-version="1.1.5-5.el6-01e86afaaa6d4a8c4836f68df80ababd6ca3902f"
            \<br>
                    cluster-infrastructure="openais" \<br>
                    expected-quorum-votes="2" \<br>
                    stonith-enabled="false" \<br>
                    no-quorum-policy="ignore" \<br>
                    last-lrm-refresh="1305736421"</font></div>
        <div><font size="2">----------------------------------------------------------------------</font></div>
        <div><br>
        </div>
        <div><font size="3">When I reboot both the nodes together,
            cluster goes into an (unmanaged) Failed state as shown
            below.</font></div>
        <div><font size="2"><br>
          </font></div>
        <div><br>
        </div>
        <div><font size="2">============<br>
            Last updated: Mon Oct 24 08:10:42 2011<br>
            Stack: openais<br>
            Current DC: soalaba63 - partition with quorum<br>
            Version:
            1.1.5-5.el6-01e86afaaa6d4a8c4836f68df80ababd6ca3902f<br>
            2 Nodes configured, 2 expected votes<br>
            2 Resources configured.<br>
            ============<br>
            <br>
            Online: [ soalaba56 soalaba63 ]<br>
            <br>
             Resource Group: HAService<br>
                 FloatingIP (ocf::heartbeat:IPaddr2) Started 
            (unmanaged) FAILED[   soalaba63       soalaba56 ]<br>
                 acestatus  (lsb:acestatus):        Stopped<br>
             Clone Set: pingdclone [pingd]<br>
                 Started: [ soalaba56 soalaba63 ]<br>
            <br>
            Failed actions:<br>
                FloatingIP_stop_0 (node=soalaba63, call=7, rc=1,
            status=complete): unknown error<br>
                FloatingIP_stop_0 (node=soalaba56, call=7, rc=1,
            status=complete): unknown error<br>
          </font></div>
        <div><font size="2">------------------------------------------------------------------------------<br>
          </font></div>
        <div><br>
        </div>
        <div><font size="3">This happens only when the reboot is done
            simultaneously on both the nodes. If reboot is done with
            some interval in between this is not seen. Looking into the
            logs I see that  when the nodes come up resources are
            started on both the nodes and then it tries to stop the
            started resources and fails there. </font></div>
        <div><font size="3"><br>
          </font></div>
        <div><font size="3">I've attached the logs.</font><br>
        </div>
        <div><font size="2"><br>
          </font></div>
        <div><font size="2"><br>
          </font></div>
      </div>
      <pre><fieldset class="yiv1561605340mimeAttachmentHeader"></fieldset>
_______________________________________________
Pacemaker mailing list: <a rel="nofollow" class="yiv1561605340moz-txt-link-abbreviated" ymailto="mailto:Pacemaker@oss.clusterlabs.org" target="_blank" href="mailto:Pacemaker@oss.clusterlabs.org">Pacemaker@oss.clusterlabs.org</a>
<a rel="nofollow" class="yiv1561605340moz-txt-link-freetext" target="_blank" href="http://oss.clusterlabs.org/mailman/listinfo/pacemaker">http://oss.clusterlabs.org/mailman/listinfo/pacemaker</a>

Project Home: <a rel="nofollow" class="yiv1561605340moz-txt-link-freetext" target="_blank" href="http://www.clusterlabs.org">http://www.clusterlabs.org</a>
Getting started: <a rel="nofollow" class="yiv1561605340moz-txt-link-freetext" target="_blank" href="http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf">http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf</a>
Bugs: <a rel="nofollow" class="yiv1561605340moz-txt-link-freetext" target="_blank" href="http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker">http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker</a>
</pre>
    </blockquote>
    <br>
    <br>
    <pre class="yiv1561605340moz-signature">-- 
    Alan Robertson <a rel="nofollow" class="yiv1561605340moz-txt-link-rfc2396E" ymailto="mailto:alanr@unix.sh" target="_blank" href="mailto:alanr@unix.sh"><alanr@unix.sh></a>

"Openness is the foundation and preservative of friendship...  Let me claim from you at all times your undisguised opinions." - William Wilberforce
</pre>
  </div>
</div><br><br></div></div></div></body></html>