<div dir="ltr"><div>Hi Klaus</div><div><br></div>service mycustomprog status returns fine that is  no errors.   It does not hang.<div><br></div><div>Suresh</div><div><br><div class="gmail_extra"><br><div class="gmail_quote">On Sun, Jan 1, 2017 at 9:43 PM, Klaus Wenninger <span dir="ltr">&lt;<a href="mailto:kwenning@redhat.com" target="_blank">kwenning@redhat.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Hi Suresh!<br>
<br>
Have you tried lsb-status in a shell?<br>
Does it show anything interesting or is it hanging?<br>
<br>
Regards,<br>
Klaus<br>
<br>
On 12/30/2016 08:45 AM, Suresh Rajagopalan wrote:<br>
&gt; Cluster running centos 6.8 with pacemaker/corosync.    This config was<br>
&gt; running well for quite sometime. All of a sudden we see regular<br>
&gt; restarts of the monitored process where corosync thinks it has<br>
&gt; failed(even though it really has not failed).  I am showing the<br>
&gt; relevant logs and config below.  Any pointers appreciated as it is not<br>
&gt; clear why this would occur.<br>
&gt;<br>
&gt; Thanks<br>
&gt; Suresh<br>
&gt;<br>
&gt; Dec 28 13:18:20 [2198] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;    pengine:     info:<br>
&gt; LogActions:  Leave   mycustomprog       (Started <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;)<br>
&gt; Dec 28 13:22:03 [2199] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;       crmd:     info:<br>
&gt; process_lrm_event:   Operation mycustomprog_monitor_10000: not running<br>
&gt; (node=<a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;, call=29, rc=7, cib-update=1427,<br>
&gt; confirmed=false)<br>
&gt; Dec 28 13:22:03 [2199] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;       crmd:   notice:<br>
&gt; process_lrm_event:   a.b.com-mycustomprog_monitor_<wbr>10000:29 [<br>
&gt; mycustomprogram (pid  15657) is running...\n ]<br>
&gt; Dec 28 13:22:03 [2194] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;        cib:     info:<br>
&gt; cib_perform_op:      ++ /cib/status/node_state[@id=&#39;<a href="http://a.b.com" rel="noreferrer" target="_blank">a.<wbr>b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;&#39;]/lrm[@id=&#39;<a href="http://a.b.com" rel="noreferrer" target="_blank">a.<wbr>b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;&#39;]/lrm_resourc<wbr>es/lrm_resource[@id=&#39;mycustomp<wbr>rog&#39;]:<br>
&gt;  &lt;lrm_rsc_op id=&quot;mycustomprog_last_failure_<wbr>0&quot;<br>
&gt; operation_key=&quot;mycustomprog_mo<wbr>nitor_10000&quot; operation=&quot;monitor&quot;<br>
&gt; crm-debug-origin=&quot;do_update_re<wbr>source&quot; crm_feature_set=&quot;3.0.10&quot;<br>
&gt; transition-key=&quot;7:462:0:a9dbbd<wbr>47-975b-4aee-8b4a-de56e0a8e7a7<wbr>&quot;<br>
&gt; transition-magic=&quot;0<br>
&gt; Dec 28 13:22:03 [2199] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;       crmd:     info:<br>
&gt; abort_transition_graph:      Transition aborted by<br>
&gt; mycustomprog_monitor_10000 &#39;create&#39; on <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;: Old<br>
&gt; event (magic=0:7;7:462:0:a9dbbd47-97<wbr>5b-4aee-8b4a-de56e0a8e7a7,<br>
&gt; cib=0.48.2038786, source=process_graph_event:605<wbr>, 1)<br>
&gt; Dec 28 13:22:03 [2199] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;       crmd:     info:<br>
&gt; update_failcount:    Updating failcount for mycustomprog on <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt; after failed monitor: rc=7 (update=value++,<br>
&gt; time=1482931323)<br>
&gt; Dec 28 13:22:03 [2199] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;       crmd:     info:<br>
&gt; process_graph_event: Detected action (462.7)<br>
&gt; mycustomprog_monitor_10000.29=<wbr>not running: failed<br>
&gt; Dec 28 13:22:03 [2197] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;      attrd:   notice:<br>
&gt; attrd_trigger_update:        Sending flush op to all hosts for:<br>
&gt; fail-count-mycustomprog (1)<br>
&gt; Dec 28 13:22:03 [2197] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;      attrd:   notice:<br>
&gt; attrd_perform_update:        Sent update 18: fail-count-mycustomprog=1<br>
&gt; Dec 28 13:22:03 [2194] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;        cib:     info:<br>
&gt; cib_perform_op:      ++ /cib/status/node_state[@id=&#39;<a href="http://a.b.com" rel="noreferrer" target="_blank">a.<wbr>b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;&#39;]/transient_a<wbr>ttributes[@id=&#39;<a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;&#39;]/instance_at<wbr>tributes[@id=&#39;<a href="http://status-a.b.com" rel="noreferrer" target="_blank">status-a.b.com</a><br>
&gt; &lt;<a href="http://status-a.b.com" rel="noreferrer" target="_blank">http://status-a.b.com</a>&gt;&#39;]:  &lt;nvpair<br>
&gt; id=&quot;status-a.b.com-fail-count-<wbr>mycustomprog&quot;<br>
&gt; name=&quot;fail-count-mycustomprog&quot; value=&quot;1&quot;/&gt;<br>
&gt; Dec 28 13:22:03 [2197] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;      attrd:   notice:<br>
&gt; attrd_trigger_update:        Sending flush op to all hosts for:<br>
&gt; last-failure-mycustomprog (1482931323)<br>
&gt; Dec 28 13:22:03 [2197] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;      attrd:   notice:<br>
&gt; attrd_perform_update:        Sent update 20:<br>
&gt; last-failure-mycustomprog=1482<wbr>931323<br>
&gt; Dec 28 13:22:03 [2197] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;      attrd:   notice:<br>
&gt; attrd_perform_update:        Sent update 20:<br>
&gt; last-failure-mycustomprog=1482<wbr>931323<br>
&gt; Dec 28 13:22:03 [2194] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;        cib:     info:<br>
&gt; cib_perform_op:      ++ /cib/status/node_state[@id=&#39;<a href="http://a.b.com" rel="noreferrer" target="_blank">a.<wbr>b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;&#39;]/transient_a<wbr>ttributes[@id=&#39;<a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;&#39;]/instance_at<wbr>tributes[@id=&#39;<a href="http://status-a.b.com" rel="noreferrer" target="_blank">status-a.b.com</a><br>
&gt; &lt;<a href="http://status-a.b.com" rel="noreferrer" target="_blank">http://status-a.b.com</a>&gt;&#39;]:  &lt;nvpair<br>
&gt; id=&quot;status-a.b.com-last-failur<wbr>e-mycustomprog&quot;<br>
&gt; name=&quot;last-failure-mycustompro<wbr>g&quot; value=&quot;1482931323&quot;/&gt;<br>
&gt; Dec 28 13:22:04 [2199] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;       crmd:     info:<br>
&gt; abort_transition_graph:      Transition aborted by<br>
&gt; status-a.b.com-fail-count-mycu<wbr>stomprog, fail-count-mycustomprog=1:<br>
&gt; Transient attribute change (create cib=0.48.2038787,<br>
&gt; source=abort_unless_down:329, path=/cib/status/node_state[@i<wbr>d=&#39;<a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;&#39;]/transient_a<wbr>ttributes[@id=&#39;<a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;&#39;]/instance_at<wbr>tributes[@id=&#39;<a href="http://status-a.b.com" rel="noreferrer" target="_blank">status-a.b.com</a><br>
&gt; &lt;<a href="http://status-a.b.com" rel="noreferrer" target="_blank">http://status-a.b.com</a>&gt;<br>
&gt; Dec 28 13:22:04 [2199] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;       crmd:     info:<br>
&gt; abort_transition_graph:      Transition aborted by<br>
&gt; status-a.b.com-last-failure-my<wbr>customprog,<br>
&gt; last-failure-mycustomprog=1482<wbr>931323: Transient attribute change<br>
&gt; (create cib=0.48.2038788, source=abort_unless_down:329,<br>
&gt; path=/cib/status/node_state[@i<wbr>d=&#39;<a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;&#39;]/transient_a<wbr>ttributes[@id=&#39;<a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;&#39;]/instance_at<wbr>tributes[@id=&#39;status-macshii00<wbr>002-hva.gs.r11.<br>
&gt; Dec 28 13:22:04 [2198] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;    pengine:  warning:<br>
&gt; unpack_rsc_op_failure:       Processing failed op monitor for<br>
&gt; mycustomprog on <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;: not running (7)<br>
&gt; Dec 28 13:22:04 [2198] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;    pengine:     info:<br>
&gt; native_print:        mycustomprog       (lsb:mycustomprog):<br>
&gt;  FAILED <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;<br>
&gt; Dec 28 13:22:04 [2198] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;    pengine:     info:<br>
&gt; get_failcount_full:  mycustomprog has failed 1 times on <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;<br>
&gt; Dec 28 13:22:04 [2198] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;    pengine:     info:<br>
&gt; common_apply_stickiness:     mycustomprog can fail 999999 more times<br>
&gt; on <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt; before being forced off<br>
&gt; Dec 28 13:22:04 [2198] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;    pengine:     info:<br>
&gt; RecurringOp:  Start recurring monitor (10s) for mycustomprog on<br>
&gt; <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;<br>
&gt; Dec 28 13:22:04 [2198] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;    pengine:   notice:<br>
&gt; LogActions:  Recover mycustomprog       (Started <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;)<br>
&gt; Dec 28 13:22:04 [2199] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;       crmd:   notice:<br>
&gt; te_rsc_command:      Initiating action 5: stop mycustomprog_stop_0 on<br>
&gt; <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt; (local)<br>
&gt; Dec 28 13:22:04 [2196] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;       lrmd:     info:<br>
&gt; cancel_recurring_action:     Cancelling lsb operation<br>
&gt; mycustomprog_status_10000Dec 28 13:18:20 [2198] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;    pengine:     info: LogActions:  Leave<br>
&gt; mycustomprog       (Started <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;)<br>
&gt; Dec 28 13:22:03 [2199] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;       crmd:     info:<br>
&gt; process_lrm_event:   Operation mycustomprog_monitor_10000: not running<br>
&gt; (node=<a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;, call=29, rc=7, cib-update=1427,<br>
&gt; confirmed=false)<br>
&gt; Dec 28 13:22:03 [2199] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;       crmd:   notice:<br>
&gt; process_lrm_event:   a.b.com-mycustomprog_monitor_<wbr>10000:29 [<br>
&gt; mycustomprogram (pid  15657) is running...\n ]<br>
&gt; Dec 28 13:22:03 [2194] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;        cib:     info:<br>
&gt; cib_perform_op:      ++ /cib/status/node_state[@id=&#39;<a href="http://a.b.com" rel="noreferrer" target="_blank">a.<wbr>b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;&#39;]/lrm[@id=&#39;<a href="http://a.b.com" rel="noreferrer" target="_blank">a.<wbr>b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;&#39;]/lrm_resourc<wbr>es/lrm_resource[@id=&#39;mycustomp<wbr>rog&#39;]:<br>
&gt;  &lt;lrm_rsc_op id=&quot;mycustomprog_last_failure_<wbr>0&quot;<br>
&gt; operation_key=&quot;mycustomprog_mo<wbr>nitor_10000&quot; operation=&quot;monitor&quot;<br>
&gt; crm-debug-origin=&quot;do_update_re<wbr>source&quot; crm_feature_set=&quot;3.0.10&quot;<br>
&gt; transition-key=&quot;7:462:0:a9dbbd<wbr>47-975b-4aee-8b4a-de56e0a8e7a7<wbr>&quot;<br>
&gt; transition-magic=&quot;0<br>
&gt; Dec 28 13:22:03 [2199] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;       crmd:     info:<br>
&gt; abort_transition_graph:      Transition aborted by<br>
&gt; mycustomprog_monitor_10000 &#39;create&#39; on <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;: Old<br>
&gt; event (magic=0:7;7:462:0:a9dbbd47-97<wbr>5b-4aee-8b4a-de56e0a8e7a7,<br>
&gt; cib=0.48.2038786, source=process_graph_event:605<wbr>, 1)<br>
&gt; Dec 28 13:22:03 [2199] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;       crmd:     info:<br>
&gt; update_failcount:    Updating failcount for mycustomprog on <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt; after failed monitor: rc=7 (update=value++,<br>
&gt; time=1482931323)<br>
&gt; Dec 28 13:22:03 [2199] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;       crmd:     info:<br>
&gt; process_graph_event: Detected action (462.7)<br>
&gt; mycustomprog_monitor_10000.29=<wbr>not running: failed<br>
&gt; Dec 28 13:22:03 [2197] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;      attrd:   notice:<br>
&gt; attrd_trigger_update:        Sending flush op to all hosts for:<br>
&gt; fail-count-mycustomprog (1)<br>
&gt; Dec 28 13:22:03 [2197] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;      attrd:   notice:<br>
&gt; attrd_perform_update:        Sent update 18: fail-count-mycustomprog=1<br>
&gt; Dec 28 13:22:03 [2194] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;        cib:     info:<br>
&gt; cib_perform_op:      ++ /cib/status/node_state[@id=&#39;<a href="http://a.b.com" rel="noreferrer" target="_blank">a.<wbr>b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;&#39;]/transient_a<wbr>ttributes[@id=&#39;<a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;&#39;]/instance_at<wbr>tributes[@id=&#39;<a href="http://status-a.b.com" rel="noreferrer" target="_blank">status-a.b.com</a><br>
&gt; &lt;<a href="http://status-a.b.com" rel="noreferrer" target="_blank">http://status-a.b.com</a>&gt;&#39;]:  &lt;nvpair<br>
&gt; id=&quot;status-a.b.com-fail-count-<wbr>mycustomprog&quot;<br>
&gt; name=&quot;fail-count-mycustomprog&quot; value=&quot;1&quot;/&gt;<br>
&gt; Dec 28 13:22:03 [2197] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;      attrd:   notice:<br>
&gt; attrd_trigger_update:        Sending flush op to all hosts for:<br>
&gt; last-failure-mycustomprog (1482931323)<br>
&gt; Dec 28 13:22:03 [2197] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;      attrd:   notice:<br>
&gt; attrd_perform_update:        Sent update 20:<br>
&gt; last-failure-mycustomprog=1482<wbr>931323<br>
&gt; Dec 28 13:22:03 [2197] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;      attrd:   notice:<br>
&gt; attrd_perform_update:        Sent update 20:<br>
&gt; last-failure-mycustomprog=1482<wbr>931323<br>
&gt; Dec 28 13:22:03 [2194] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;        cib:     info:<br>
&gt; cib_perform_op:      ++ /cib/status/node_state[@id=&#39;<a href="http://a.b.com" rel="noreferrer" target="_blank">a.<wbr>b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;&#39;]/transient_a<wbr>ttributes[@id=&#39;<a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;&#39;]/instance_at<wbr>tributes[@id=&#39;<a href="http://status-a.b.com" rel="noreferrer" target="_blank">status-a.b.com</a><br>
&gt; &lt;<a href="http://status-a.b.com" rel="noreferrer" target="_blank">http://status-a.b.com</a>&gt;&#39;]:  &lt;nvpair<br>
&gt; id=&quot;status-a.b.com-last-failur<wbr>e-mycustomprog&quot;<br>
&gt; name=&quot;last-failure-mycustompro<wbr>g&quot; value=&quot;1482931323&quot;/&gt;<br>
&gt; Dec 28 13:22:04 [2199] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;       crmd:     info:<br>
&gt; abort_transition_graph:      Transition aborted by<br>
&gt; status-a.b.com-fail-count-mycu<wbr>stomprog, fail-count-mycustomprog=1:<br>
&gt; Transient attribute change (create cib=0.48.2038787,<br>
&gt; source=abort_unless_down:329, path=/cib/status/node_state[@i<wbr>d=&#39;<a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;&#39;]/transient_a<wbr>ttributes[@id=&#39;<a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;&#39;]/instance_at<wbr>tributes[@id=&#39;<a href="http://status-a.b.com" rel="noreferrer" target="_blank">status-a.b.com</a><br>
&gt; &lt;<a href="http://status-a.b.com" rel="noreferrer" target="_blank">http://status-a.b.com</a>&gt;<br>
&gt; Dec 28 13:22:04 [2199] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;       crmd:     info:<br>
&gt; abort_transition_graph:      Transition aborted by<br>
&gt; status-a.b.com-last-failure-my<wbr>customprog,<br>
&gt; last-failure-mycustomprog=1482<wbr>931323: Transient attribute change<br>
&gt; (create cib=0.48.2038788, source=abort_unless_down:329,<br>
&gt; path=/cib/status/node_state[@i<wbr>d=&#39;<a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;&#39;]/transient_a<wbr>ttributes[@id=&#39;<a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;&#39;]/instance_at<wbr>tributes[@id=&#39;<a href="http://status-a.b.com" rel="noreferrer" target="_blank">status-a.b.com</a><br>
&gt; &lt;<a href="http://status-a.b.com" rel="noreferrer" target="_blank">http://status-a.b.com</a>&gt;<br>
&gt; Dec 28 13:22:04 [2198] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;    pengine:  warning:<br>
&gt; unpack_rsc_op_failure:       Processing failed op monitor for<br>
&gt; mycustomprog on <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;: not running (7)<br>
&gt; Dec 28 13:22:04 [2198] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;    pengine:     info:<br>
&gt; native_print:        mycustomprog       (lsb:mycustomprog):<br>
&gt;  FAILED <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;<br>
&gt; Dec 28 13:22:04 [2198] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;    pengine:     info:<br>
&gt; get_failcount_full:  mycustomprog has failed 1 times on <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a><br>
&gt; &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;<br>
&gt; Dec 28 13:22:04 [2198] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;    pengine:     info:<br>
&gt; common_apply_stickiness:     mycustomprog can fail 999999 more times<br>
&gt; on <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt; before being forced off<br>
&gt; Dec 28 13:22:04 [2198] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;    pengine:     info:<br>
&gt; RecurringOp:  Start recurring monitor (10s) for mycustomprog on<br>
&gt; <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;<br>
&gt; Dec 28 13:22:04 [2198] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;    pengine:   notice:<br>
&gt; LogActions:  Recover mycustomprog       (Started <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;)<br>
&gt; Dec 28 13:22:04 [2199] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;       crmd:   notice:<br>
&gt; te_rsc_command:      Initiating action 5: stop mycustomprog_stop_0 on<br>
&gt; <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt; (local)<br>
&gt; Dec 28 13:22:04 [2196] <a href="http://a.b.com" rel="noreferrer" target="_blank">a.b.com</a> &lt;<a href="http://a.b.com" rel="noreferrer" target="_blank">http://a.b.com</a>&gt;       lrmd:     info:<br>
&gt; cancel_recurring_action:     Cancelling lsb operation<br>
&gt; mycustomprog_status_10000<br>
&gt;<br>
&gt;<br>
&gt;<br>
&gt; pcs config:<br>
&gt;<br>
&gt; source settings.rc<br>
&gt;<br>
&gt; pcs property set stonith-enabled=false<br>
&gt; pcs property set no-quorum-policy=ignore<br>
&gt; pcs resource create ClusterIP2 IPaddr2 ip=$MYVIP cidr_netmask=$NETMASKVIP1<br>
&gt; pcs resource create ClusterIP3 IPaddr2 ip=$MYVIP2<br>
&gt; cidr_netmask=$NETMASKVIP2<br>
&gt; pcs resource create mycustomprog lsb:mycustomprog op monitor<br>
&gt; interval=&quot;10s&quot;<br>
&gt; pcs constraint colocation add ClusterIP3 with ClusterIP2 INFINITY<br>
&gt; pcs constraint colocation add mycustomprog with ClusterIP2 INFINITY<br>
&gt; pcs property set start-failure-is-fatal=false<br>
&gt; pcs resource defaults resource-stickiness=100<br>
&gt; pcs constraint colocation add chkhealth with ClusterIP2 INFINITY<br>
&gt;<br>
&gt;<br>
&gt;<br>
&gt; ______________________________<wbr>_________________<br>
&gt; Users mailing list: <a href="mailto:Users@clusterlabs.org" target="_blank">Users@clusterlabs.org</a><br>
&gt; <a href="http://lists.clusterlabs.org/mailman/listinfo/users" rel="noreferrer" target="_blank">http://lists.clusterlabs.org/m<wbr>ailman/listinfo/users</a><br>
&gt;<br>
&gt; Project Home: <a href="http://www.clusterlabs.org" rel="noreferrer" target="_blank">http://www.clusterlabs.org</a><br>
&gt; Getting started: <a href="http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf" rel="noreferrer" target="_blank">http://www.clusterlabs.org/doc<wbr>/Cluster_from_Scratch.pdf</a><br>
&gt; Bugs: <a href="http://bugs.clusterlabs.org" rel="noreferrer" target="_blank">http://bugs.clusterlabs.org</a><br>
<br>
<br>
<br>
______________________________<wbr>_________________<br>
Users mailing list: <a href="mailto:Users@clusterlabs.org" target="_blank">Users@clusterlabs.org</a><br>
<a href="http://lists.clusterlabs.org/mailman/listinfo/users" rel="noreferrer" target="_blank">http://lists.clusterlabs.org/m<wbr>ailman/listinfo/users</a><br>
<br>
Project Home: <a href="http://www.clusterlabs.org" rel="noreferrer" target="_blank">http://www.clusterlabs.org</a><br>
Getting started: <a href="http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf" rel="noreferrer" target="_blank">http://www.clusterlabs.org/doc<wbr>/Cluster_from_Scratch.pdf</a><br>
Bugs: <a href="http://bugs.clusterlabs.org" rel="noreferrer" target="_blank">http://bugs.clusterlabs.org</a><br>
</blockquote></div><br></div></div></div>