<div dir="ltr">Thanks, <br></div><div class="gmail_extra"><br><div class="gmail_quote">On Thu, May 18, 2017 at 4:37 PM, Ken Gaillot <span dir="ltr">&lt;<a href="mailto:kgaillot@redhat.com" target="_blank">kgaillot@redhat.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><span class="">On 05/17/2017 06:10 PM, Radoslaw Garbacz wrote:<br>

&gt; Hi,<br>

&gt;<br>

&gt; I have a question regarding &#39;&lt;lrm_resource&gt;&lt;lrm_rsc_op&gt; &#39;op-status<br>

&gt; attribute getting value 4.<br>

&gt;<br>

&gt; In my case I have a strange behavior, when resources get those &quot;monitor&quot;<br>

&gt; operation entries in the CIB with op-status=4, and they do not seem to<br>

&gt; be called (exec-time=0).<br>

&gt;<br>

&gt; What does &#39;op-status&#39; = 4 mean?<br>

<br>

</span>The action had an error status<br>

<span class=""><br>

&gt;<br>

&gt; I would appreciate some elaboration regarding this, since this is<br>

&gt; interpreted by pacemaker as an error, which causes logs:<br>

&gt; crm_mon:    error: unpack_rsc_op:    Preventing dbx_head_head from<br>

&gt; re-starting anywhere: operation monitor failed &#39;not configured&#39; (6)<br>

<br>

</span>The rc-code=&quot;6&quot; is the more interesting number; it&#39;s the result returned<br>

by the resource agent. As you can see above, it means &quot;not configured&quot;.<br>

What that means exactly is up to the resource agent&#39;s interpretation.<br>

<span class=""><br>

&gt; and I am pretty sure the resource agent was not called (no logs,<br>

&gt; exec-time=0)<br>

<br>

</span>Normally this could only come from the resource agent.<br>

<br>

However there are two cases where pacemaker generates this error itself:<br>

if the resource definition in the CIB is invalid; and if your version of<br>

pacemaker was compiled with support for reading sensitive parameter<br>

values from a file, but that file could not be read.<br>

<br>

It doesn&#39;t sound like your case is either one of those though, since<br>

they would prevent the resource from even starting. Most likely it&#39;s<br>

coming from the resource agent. I&#39;d look at the resource agent source<br>

code and see where it can return OCF_ERR_CONFIGURED.<br>

<span class=""><br>

&gt; There are two aspects of this:<br>

&gt;<br>

&gt; 1) harmless (pacemaker seems to not bother about it), which I guess<br>

&gt; indicates cancelled monitoring operations:<br>

&gt; op-status=4, rc-code=189<br>

<br>

</span>This error means the connection between the crmd and lrmd daemons was<br>

lost -- most commonly, that shows up for operations that were pending at<br>

shutdown.<br>

<div><div class="h5"><br>

&gt;<br>

&gt; * Example:<br>

&gt; &lt;lrm_rsc_op id=&quot;dbx_first_datas_last_<wbr>failure_0&quot;<br>

&gt; operation_key=&quot;dbx_first_<wbr>datas_monitor_0&quot; operation=&quot;monitor&quot;<br>

&gt; crm-debug-origin=&quot;do_update_<wbr>resource&quot; crm_feature_set=&quot;3.0.12&quot;<br>

&gt; transition-key=&quot;38:0:7:<wbr>c8b63d9d-9c70-4f99-aa1b-<wbr>e993de6e4739&quot;<br>

&gt; transition-magic=&quot;4:189;38:0:<wbr>7:c8b63d9d-9c70-4f99-aa1b-<wbr>e993de6e4739&quot;<br>

&gt; on_node=&quot;olegdbx61-vm000001&quot; call-id=&quot;10&quot; rc-code=&quot;189&quot; op-status=&quot;4&quot;<br>

&gt; interval=&quot;0&quot; last-run=&quot;1495057378&quot; last-rc-change=&quot;1495057378&quot;<br>

&gt; exec-time=&quot;0&quot; queue-time=&quot;0&quot; op-digest=&quot;<wbr>f6bd1386a336e8e6ee25ecb651a9ef<wbr>b6&quot;/&gt;<br>

&gt;<br>

&gt;<br>

&gt; 2) error level one (op-status=4, rc-code=6), which generates logs:<br>

&gt; crm_mon:    error: unpack_rsc_op:    Preventing dbx_head_head from<br>

&gt; re-starting anywhere: operation monitor failed &#39;not configured&#39; (6)<br>

&gt;<br>

&gt; * Example:<br>

&gt; &lt;lrm_rsc_op id=&quot;dbx_head_head_last_<wbr>failure_0&quot;<br>

&gt; operation_key=&quot;dbx_head_head_<wbr>monitor_0&quot; operation=&quot;monitor&quot;<br>

&gt; crm-debug-origin=&quot;do_update_<wbr>resource&quot; crm_feature_set=&quot;3.0.12&quot;<br>

&gt; transition-key=&quot;39:0:7:<wbr>c8b63d9d-9c70-4f99-aa1b-<wbr>e993de6e4739&quot;<br>

&gt; transition-magic=&quot;4:6;39:0:7:<wbr>c8b63d9d-9c70-4f99-aa1b-<wbr>e993de6e4739&quot;<br>

&gt; on_node=&quot;olegdbx61-vm000001&quot; call-id=&quot;999999999&quot; rc-code=&quot;6&quot;<br>

&gt; op-status=&quot;4&quot; interval=&quot;0&quot; last-run=&quot;1495057389&quot;<br>

&gt; last-rc-change=&quot;1495057389&quot; exec-time=&quot;0&quot; queue-time=&quot;0&quot;<br>

&gt; op-digest=&quot;<wbr>60cdc9db1c5b77e8dba698d3d0c8cd<wbr>a8&quot;/&gt;<br>

&gt;<br>

&gt;<br>

&gt; Could it be some hardware (VM hyperviser) issue?<br>

&gt;<br>

&gt;<br>

&gt; Thanks in advance,<br>

&gt;<br>

&gt; --<br>

&gt; Best Regards,<br>

&gt;<br>

&gt; Radoslaw Garbacz<br>

&gt; XtremeData Incorporated<br>

<br>

</div></div>______________________________<wbr>_________________<br>

Users mailing list: <a href="mailto:Users@clusterlabs.org">Users@clusterlabs.org</a><br>

<a href="http://lists.clusterlabs.org/mailman/listinfo/users" rel="noreferrer" target="_blank">http://lists.clusterlabs.org/<wbr>mailman/listinfo/users</a><br>

<br>

Project Home: <a href="http://www.clusterlabs.org" rel="noreferrer" target="_blank">http://www.clusterlabs.org</a><br>

Getting started: <a href="http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf" rel="noreferrer" target="_blank">http://www.clusterlabs.org/<wbr>doc/Cluster_from_Scratch.pdf</a><br>

Bugs: <a href="http://bugs.clusterlabs.org" rel="noreferrer" target="_blank">http://bugs.clusterlabs.org</a><br>

</blockquote></div><br><br clear="all"><br>-- <br><div class="gmail_signature" data-smartmail="gmail_signature"><div dir="ltr"><div><div dir="ltr"><div>Best Regards,<br><br>Radoslaw Garbacz<br></div>XtremeData Incorporated<br></div></div></div></div>

</div>