<div dir="ltr"><div><br></div>Hi All,<div><br></div><div>I have 2 node HA setup. I have added migration_threshold=5 and failure-timeout=120s for my resources. When migration threshold is reached to 5 resources are migrated to other node. But once observed fail-count is not reset back to zero after 2 mins. The setup was in the same state almost for 3 hours but still fail-count did not reset to zero.</div><div><br></div><div>Then I tried the same test again but could not reproduce this.When compared the logs of success scenario with failed scenario found that pengine did not take action to clear failcount.</div><div><br></div><div><br></div><div><br></div><div>Success logs</div><div><b><span style="font-size:13px">Nov 19 15:27:08 [16409] sc-node-1 pengine: notice: unpack_rsc_op: Clearing expired failcount for oc-service-mana</span><span style="font-size:13px">ger on sc-node-1</span></b><br style="font-size:13px"><span style="font-size:13px">Nov 19 15:27:08 [16409] sc-node-1 pengine: info: get_failcount_f</span><span style="font-size:13px">ull: oc-service-mana</span><span style="font-size:13px">ger has failed 5 times on sc-node-1</span><br style="font-size:13px"><span style="font-size:13px">Nov 19 15:27:08 [16409] sc-node-1 pengine: notice: unpack_rsc_op: Clearing expired failcount for oc-service-mana</span><span style="font-size:13px">ger on sc-node-1</span><br style="font-size:13px"><span style="font-size:13px">Nov 19 15:27:08 [16409] sc-node-1 pengine: notice: unpack_rsc_op: Re-initiated expired calculated failure oc-service-mana</span><span style="font-size:13px">ger_last_failur</span><span style="font-size:13px">e_0 (rc=7, magic=0:7;3:145</span><span style="font-size:13px">:0:258ae879-832</span><span style="font-size:13px">f-4126-a7d7-e57</span><span style="font-size:13px">bd3fdcdb1) on sc-node-1</span><br style="font-size:13px"><span style="font-size:13px">4:58 PM</span><br></div><div><span style="font-size:13px"><br></span></div><div><span style="font-size:13px"><br></span></div><div><span style="font-size:13px">Failure logs</span></div><div><span style="font-size:13px">Nov 04 22:23:39 [6831] sc-HA2 pengine: warning: unpack_rsc_op: Processing failed op monitor for oc-service-mana</span><span style="font-size:13px">ger on sc-HA1: not running (7)</span><br style="font-size:13px"><span style="font-size:13px">Nov 04 22:23:39 [6831] sc-HA2 pengine: info: native_print: oc-service-mana</span><span style="font-size:13px">ger (upstart:oc-ser</span><span style="font-size:13px">vice-manager): Started sc-HA2</span><br style="font-size:13px"><b><span style="font-size:13px">Nov 04 22:23:39 [6831] sc-HA2 pengine: info: get_failcount_f</span><span style="font-size:13px">ull: oc-service-mana</span><span style="font-size:13px">ger has failed 5 times on sc-HA1</span></b><br style="font-size:13px"><span style="font-size:13px">Nov 04 22:23:39 [6831] sc-HA2 pengine: warning: common_apply_st</span><span style="font-size:13px">ickiness: Forcing oc-service-mana</span><span style="font-size:13px">ger away from sc-HA1 after 5 failures (max=5)</span><br style="font-size:13px"><span style="font-size:13px">Nov 04 22:23:39 [6831] sc-HA2 pengine: info: rsc_merge_weigh</span><span style="font-size:13px">ts: oc-service-mana</span><span style="font-size:13px">ger: Rolling back scores from oc-fw-agent</span><br style="font-size:13px"><span style="font-size:13px">Nov 04 22:23:39 [6831] sc-HA2 pengine: info: LogActions: Leave oc-service-mana</span><span style="font-size:13px">ger (Started sc-HA2)</span><span style="font-size:13px"><br></span></div><div><div><br></div><div><br></div><div>What might be the reason of - in failure case this action did not take place ?</div><div><b><span style="font-size:13px">notice: unpack_rsc_op: Clearing expired failcount for oc-service-mana</span><span style="font-size:13px">ger </span></b><br></div><div><b><span style="font-size:13px"><br></span></b></div><div><b><span style="font-size:13px"><br></span></b></div>-- <br><div class="gmail_signature">Thanks and Regards,<br></div><div class="gmail_signature">Pritam Kharat.<br></div>
</div></div>