<div dir="ltr">This is my messages log.<br><div><br>Jul 27 08:02:46 vmx-occ-005 apache(WebSite)[32477]: INFO: apache not running<br>Jul 27 08:02:46 vmx-occ-005 crmd[31424]:   notice: process_lrm_event: Operation WebSite_monitor_60000: not running (node=node1, call=11, rc=7, cib-update=15, confirmed=false)<br>Jul 27 08:02:46 vmx-occ-005 attrd[31422]:   notice: attrd_cs_dispatch: Update relayed from node2<br>Jul 27 08:02:46 vmx-occ-005 attrd[31422]:   notice: attrd_trigger_update: Sending flush op to all hosts for: fail-count-WebSite (1)<br>Jul 27 08:02:46 vmx-occ-005 attrd[31422]:   notice: attrd_perform_update: Sent update 12: fail-count-WebSite=1<br>Jul 27 08:02:46 vmx-occ-005 attrd[31422]:   notice: attrd_cs_dispatch: Update relayed from node2<br>Jul 27 08:02:46 vmx-occ-005 attrd[31422]:   notice: attrd_trigger_update: Sending flush op to all hosts for: last-failure-WebSite (1437976962)<br>Jul 27 08:02:46 vmx-occ-005 attrd[31422]:   notice: attrd_perform_update: Sent update 14: last-failure-WebSite=1437976962<br>Jul 27 08:02:46 vmx-occ-005 apache(WebSite)[32511]: INFO: apache is not running.<br>Jul 27 08:02:46 vmx-occ-005 crmd[31424]:   notice: process_lrm_event: Operation WebSite_stop_0: ok (node=node1, call=14, rc=0, cib-update=16, confirmed=true)<br><br></div><div>this is my corosync log:<br><br><br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_process_request:  Forwarding cib_modify operation for section status to master (origin=local/crmd/15)<br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_perform_op:       Diff: --- 0.38.65 2<br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_perform_op:       Diff: +++ 0.38.66 (null)<br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_perform_op:       +  /cib:  @num_updates=66<br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_perform_op:       +  /cib/status/node_state[@id=&#39;node1&#39;]/lrm[@id=&#39;node1&#39;]/lrm_resources/lrm_resource[@id=&#39;WebSite&#39;]/lrm_rsc_op[@id=&#39;WebSite_last_failure_0&#39;]:  @operation_key=WebSite_monitor_60000, @transition-key=9:119038:0:a5b747ee-4fbc-4f65-a690-29276791fd19, @transition-magic=0:7;9:119038:0:a5b747ee-4fbc-4f65-a690-29276791fd19, @call-id=11, @rc-code=7, @interval=60000, @last-rc-change=1437976966, @exec-time=0, @op-digest=eddc33bef3f1592ad847638ee4<br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_process_request:  Completed cib_modify operation for section status: OK (rc=0, origin=node1/crmd/15, version=0.38.66)<br>Jul 27 08:02:46 [31422] vmx-occ-005      attrd:   notice: attrd_cs_dispatch:    Update relayed from node2<br>Jul 27 08:02:46 [31422] vmx-occ-005      attrd:   notice: attrd_trigger_update:         Sending flush op to all hosts for: fail-count-WebSite (1)<br>Jul 27 08:02:46 [31422] vmx-occ-005      attrd:   notice: attrd_perform_update:         Sent update 12: fail-count-WebSite=1<br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_process_request:  Forwarding cib_modify operation for section status to master (origin=local/attrd/12)<br>Jul 27 08:02:46 [31422] vmx-occ-005      attrd:   notice: attrd_cs_dispatch:    Update relayed from node2<br>Jul 27 08:02:46 [31422] vmx-occ-005      attrd:   notice: attrd_trigger_update:         Sending flush op to all hosts for: last-failure-WebSite (1437976962)<br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_perform_op:       Diff: --- 0.38.66 2<br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_perform_op:       Diff: +++ 0.38.67 (null)<br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_perform_op:       +  /cib:  @num_updates=67<br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_perform_op:       ++ /cib/status/node_state[@id=&#39;node1&#39;]/transient_attributes[@id=&#39;node1&#39;]/instance_attributes[@id=&#39;status-node1&#39;]:  &lt;nvpair id=&quot;status-node1-fail-count-WebSite&quot; name=&quot;fail-count-WebSite&quot; value=&quot;1&quot;/&gt;<br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_process_request:  Completed cib_modify operation for section status: OK (rc=0, origin=node1/attrd/12, version=0.38.67)<br>Jul 27 08:02:46 [31422] vmx-occ-005      attrd:   notice: attrd_perform_update:         Sent update 14: last-failure-WebSite=1437976962<br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_process_request:  Forwarding cib_modify operation for section status to master (origin=local/attrd/14)<br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_perform_op:       Diff: --- 0.38.67 2<br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_perform_op:       Diff: +++ 0.38.68 (null)<br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_perform_op:       +  /cib:  @num_updates=68<br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_perform_op:       ++ /cib/status/node_state[@id=&#39;node1&#39;]/transient_attributes[@id=&#39;node1&#39;]/instance_attributes[@id=&#39;status-node1&#39;]:  &lt;nvpair id=&quot;status-node1-last-failure-WebSite&quot; name=&quot;last-failure-WebSite&quot; value=&quot;1437976962&quot;/&gt;<br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_process_request:  Completed cib_modify operation for section status: OK (rc=0, origin=node1/attrd/14, version=0.38.68)<br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_process_request:  Completed cib_modify operation for section status: OK (rc=0, origin=node2/attrd/404, version=0.38.68)<br>Jul 27 08:02:46 [31421] vmx-occ-005       lrmd:     info: cancel_recurring_action:      Cancelling operation WebSite_monitor_60000<br>Jul 27 08:02:46 [31424] vmx-occ-005       crmd:     info: do_lrm_rsc_op:        Performing key=3:119728:0:a5b747ee-4fbc-4f65-a690-29276791fd19 op=WebSite_stop_0<br>Jul 27 08:02:46 [31421] vmx-occ-005       lrmd:     info: log_execute:  executing - rsc:WebSite action:stop call_id:14<br>Jul 27 08:02:46 [31424] vmx-occ-005       crmd:     info: process_lrm_event:    Operation WebSite_monitor_60000: Cancelled (node=node1, call=11, confirmed=true)<br>apache(WebSite)[32511]: 2015/07/27_08:02:46 INFO: apache is not running.<br>Jul 27 08:02:46 [31421] vmx-occ-005       lrmd:     info: log_finished:         finished - rsc:WebSite action:stop call_id:14 pid:32511 exit-code:0 exec-time:167ms queue-time:0ms<br>Jul 27 08:02:46 [31424] vmx-occ-005       crmd:   notice: process_lrm_event:    Operation WebSite_stop_0: ok (node=node1, call=14, rc=0, cib-update=16, confirmed=true)<br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_process_request:  Forwarding cib_modify operation for section status to master (origin=local/crmd/16)<br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_perform_op:       Diff: --- 0.38.68 2<br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_perform_op:       Diff: +++ 0.38.69 (null)<br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_perform_op:       +  /cib:  @num_updates=69<br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_perform_op:       +  /cib/status/node_state[@id=&#39;node1&#39;]/lrm[@id=&#39;node1&#39;]/lrm_resources/lrm_resource[@id=&#39;WebSite&#39;]/lrm_rsc_op[@id=&#39;WebSite_last_0&#39;]:  @operation_key=WebSite_stop_0, @operation=stop, @transition-key=3:119728:0:a5b747ee-4fbc-4f65-a690-29276791fd19, @transition-magic=0:0;3:119728:0:a5b747ee-4fbc-4f65-a690-29276791fd19, @call-id=14, @last-run=1437976966, @last-rc-change=1437976966, @exec-time=167<br>Jul 27 08:02:46 [31419] vmx-occ-005        cib:     info: cib_process_request:  Completed cib_modify operation for section status: OK (rc=0, origin=node1/crmd/16, version=0.38.69)<br>Jul 27 08:02:51 [31419] vmx-occ-005        cib:     info: cib_process_ping:     Reporting our current digest to node2: 608e7e54d63c1f66c39c9b4162a189d3 for 0.38.69 (0x846320 0)<br><br></div><div>These are the logs after i have triggered the failure. Pacemaker doesnt restarts the service automatically, even if i start the httpd service , the status i get is stopped on node 1. If i restart the cluster it works fine.<br><br></div><div><br></div><div><br></div></div><div class="gmail_extra"><br><div class="gmail_quote">On Mon, Jul 27, 2015 at 11:30 AM, Vijay Partha <span dir="ltr">&lt;<a href="mailto:vijaysarathy94@gmail.com" target="_blank">vijaysarathy94@gmail.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr">Could you help me out in configuring stonith properly. I am new to pacemaker and I have been working for a few days. What all logs do you require?<br></div><div class="gmail_extra"><br><div class="gmail_quote"><div><div class="h5">On Mon, Jul 27, 2015 at 11:22 AM, Digimer <span dir="ltr">&lt;<a href="mailto:lists@alteeve.ca" target="_blank">lists@alteeve.ca</a>&gt;</span> wrote:<br></div></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div><div class="h5"><div><div>On 27/07/15 01:35 AM, Vijay Partha wrote:<br>
&gt; HI .<br>
&gt;<br>
&gt; My configuration file looks like this:<br>
&gt;<br>
&gt; &lt;cib crm_feature_set=&quot;3.0.9&quot; validate-with=&quot;pacemaker-2.0&quot; epoch=&quot;38&quot;<br>
&gt; num_updates=&quot;0&quot; admin_epoch=&quot;0&quot; cib-last-written=&quot;Fri Jul 24 15:57:06<br>
&gt; 2015&quot; have-quorum=&quot;1&quot; dc-uuid=&quot;node2&quot;&gt;<br>
&gt;   &lt;configuration&gt;<br>
&gt;     &lt;crm_config&gt;<br>
&gt;       &lt;cluster_property_set id=&quot;cib-bootstrap-options&quot;&gt;<br>
&gt;         &lt;nvpair id=&quot;cib-bootstrap-options-dc-version&quot; name=&quot;dc-version&quot;<br>
&gt; value=&quot;1.1.11-97629de&quot;/&gt;<br>
&gt;         &lt;nvpair id=&quot;cib-bootstrap-options-cluster-infrastructure&quot;<br>
&gt; name=&quot;cluster-infrastructure&quot; value=&quot;cman&quot;/&gt;<br>
&gt;         &lt;nvpair id=&quot;cib-bootstrap-options-stonith-enabled&quot;<br>
&gt; name=&quot;stonith-enabled&quot; value=&quot;false&quot;/&gt;<br>
&gt;         &lt;nvpair id=&quot;cib-bootstrap-options-no-quorum-policy&quot;<br>
&gt; name=&quot;no-quorum-policy&quot; value=&quot;ignore&quot;/&gt;<br>
&gt;         &lt;nvpair id=&quot;cib-bootstrap-options-cluster-recheck-interval&quot;<br>
&gt; name=&quot;cluster-recheck-interval&quot; value=&quot;2s&quot;/&gt;<br>
&gt;       &lt;/cluster_property_set&gt;<br>
&gt;     &lt;/crm_config&gt;<br>
&gt;     &lt;nodes&gt;<br>
&gt;       &lt;node id=&quot;node1&quot; uname=&quot;node1&quot;/&gt;<br>
&gt;       &lt;node id=&quot;node2&quot; uname=&quot;node2&quot;/&gt;<br>
&gt;     &lt;/nodes&gt;<br>
&gt;     &lt;resources&gt;<br>
&gt;       &lt;primitive class=&quot;ocf&quot; id=&quot;my_first_svc&quot; provider=&quot;heartbeat&quot;<br>
&gt; type=&quot;Dummy&quot;&gt;<br>
&gt;         &lt;instance_attributes id=&quot;my_first_svc-instance_attributes&quot;/&gt;<br>
&gt;         &lt;operations&gt;<br>
&gt;           &lt;op id=&quot;my_first_svc-start-timeout-20&quot; interval=&quot;0s&quot;<br>
&gt; name=&quot;start&quot; timeout=&quot;20&quot;/&gt;<br>
&gt;           &lt;op id=&quot;my_first_svc-stop-timeout-20&quot; interval=&quot;0s&quot;<br>
&gt; name=&quot;stop&quot; timeout=&quot;20&quot;/&gt;<br>
&gt;           &lt;op id=&quot;my_first_svc-monitor-interval-120s&quot; interval=&quot;120s&quot;<br>
&gt; name=&quot;monitor&quot;/&gt;<br>
&gt;         &lt;/operations&gt;<br>
&gt;       &lt;/primitive&gt;<br>
&gt;       &lt;primitive class=&quot;ocf&quot; id=&quot;WebSite&quot; provider=&quot;heartbeat&quot;<br>
&gt; type=&quot;apache&quot;&gt;<br>
&gt;         &lt;instance_attributes id=&quot;WebSite-instance_attributes&quot;&gt;<br>
&gt;           &lt;nvpair id=&quot;WebSite-instance_attributes-configfile&quot;<br>
&gt; name=&quot;configfile&quot; value=&quot;/etc/httpd/conf/httpd.conf&quot;/&gt;<br>
&gt;           &lt;nvpair id=&quot;WebSite-instance_attributes-statusurl&quot;<br>
</div></div></div></div>&gt; name=&quot;statusurl&quot; value=&quot;<a href="http://localhost/server-status" rel="noreferrer" target="_blank">http://localhost/server-status</a>&quot;/&gt;<div><div class="h5"><br>
<div><div>&gt;         &lt;/instance_attributes&gt;<br>
&gt;         &lt;operations&gt;<br>
&gt;           &lt;op id=&quot;WebSite-start-timeout-40s&quot; interval=&quot;0s&quot; name=&quot;start&quot;<br>
&gt; timeout=&quot;40s&quot; on-fail=&quot;restart&quot;/&gt;<br>
&gt;           &lt;op id=&quot;WebSite-stop-timeout-60s&quot; interval=&quot;0s&quot; name=&quot;stop&quot;<br>
&gt; timeout=&quot;60s&quot; on-fail=&quot;restart&quot;/&gt;<br>
&gt;           &lt;op id=&quot;WebSite-monitor-interval-1min&quot; interval=&quot;1min&quot;<br>
&gt; name=&quot;monitor&quot; on-fail=&quot;restart&quot;/&gt;<br>
&gt;         &lt;/operations&gt;<br>
&gt;         &lt;meta_attributes id=&quot;WebSite-meta_attributes&quot;/&gt;<br>
&gt;       &lt;/primitive&gt;<br>
&gt;     &lt;/resources&gt;<br>
&gt;     &lt;constraints&gt;<br>
&gt;       &lt;rsc_location id=&quot;location-WebSite-node2-50&quot; node=&quot;node2&quot;<br>
&gt; rsc=&quot;WebSite&quot; score=&quot;50&quot;/&gt;<br>
&gt;   &lt;/constraints&gt;<br>
&gt;     &lt;rsc_defaults&gt;<br>
&gt;       &lt;meta_attributes id=&quot;rsc_defaults-options&quot;&gt;<br>
&gt;         &lt;nvpair id=&quot;rsc_defaults-options-migration-threshold&quot;<br>
&gt; name=&quot;migration-threshold&quot; value=&quot;1&quot;/&gt;<br>
&gt;       &lt;/meta_attributes&gt;<br>
&gt;     &lt;/rsc_defaults&gt;<br>
&gt;     &lt;op_defaults&gt;<br>
&gt;       &lt;meta_attributes id=&quot;op_defaults-options&quot;&gt;<br>
&gt;         &lt;nvpair id=&quot;op_defaults-options-timeout&quot; name=&quot;timeout&quot;<br>
&gt; value=&quot;240s&quot;/&gt;<br>
&gt;       &lt;/meta_attributes&gt;<br>
&gt;     &lt;/op_defaults&gt;<br>
&gt;   &lt;/configuration&gt;<br>
&gt; &lt;/cib&gt;<br>
&gt;<br>
&gt; Once i stop the httpd service the pacemaker does not restarts it<br>
&gt; automatically.<br>
<br>
</div></div>As mentioned, logs help a lot. The logs from all nodes starting before<br>
you trigger the failure until after the logs stop printing please.<br>
<br>
Also, you must use stonith. Please configure and test it. Often problems<br>
go away when stonith is configured and working properly.<br>
</div></div><span><br>
--<br>
Digimer<br>
Papers and Projects: <a href="https://alteeve.ca/w/" rel="noreferrer" target="_blank">https://alteeve.ca/w/</a><br>
</span><span class=""><span>What if the cure for cancer is trapped in the mind of a person without<br>
access to education?<br>
<br>
_______________________________________________<br>
</span></span>Users mailing list: <a href="mailto:Users@clusterlabs.org" target="_blank">Users@clusterlabs.org</a><span class=""><br>
<span><a href="http://clusterlabs.org/mailman/listinfo/users" rel="noreferrer" target="_blank">http://clusterlabs.org/mailman/listinfo/users</a><br>
<br>
Project Home: <a href="http://www.clusterlabs.org" rel="noreferrer" target="_blank">http://www.clusterlabs.org</a><br>
</span>Getting started: <a href="http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf" rel="noreferrer" target="_blank">http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf</a><br>
Bugs: <a href="http://bugs.clusterlabs.org" rel="noreferrer" target="_blank">http://bugs.clusterlabs.org</a><br>
</span></blockquote></div><span class="HOEnZb"><font color="#888888"><br><br clear="all"><br>-- <br><div><div dir="ltr"><div>With Regards<br></div>P.Vijay<br></div></div>
</font></span></div>
</blockquote></div><br><br clear="all"><br>-- <br><div class="gmail_signature"><div dir="ltr"><div>With Regards<br></div>P.Vijay<br></div></div>
</div>