<div dir="ltr"><div dir="ltr"><br></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Mon, Aug 12, 2019 at 4:12 PM Michael Powell <<a href="mailto:Michael.Powell@harmonicinc.com">Michael.Powell@harmonicinc.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<div lang="EN-US">
<div class="gmail-m_-1033704192409098613WordSection1">
<p class="MsoNormal">At 07:44:49, the ss agent discovers that the master instance has failed on node
<b><i>mgraid…-0</i></b> as a result of a failed <b><span style="font-family:"Courier New"">ssadm</span></b> request in response to an
<b><span style="font-family:"Courier New"">ss_monitor()</span></b> operation. It issues a
<b><span style="font-family:"Courier New"">crm_master -Q -D</span></b> command with the intent of demoting the master and promoting the slave, on the other node, to master. The
<b><span style="font-family:"Courier New"">ss_demote()</span></b> function finds that the application is no longer running and returns
<b><span style="font-family:"Courier New"">OCF_NOT_RUNNING</span></b> (7). In the older product, this was sufficient to promote the other instance to master, but in the current product, that does not happen. Currently, the failed application is restarted,
as expected, and is promoted to master, but this takes 10’s of seconds.<u></u><u></u></p>
<p class="MsoNormal"><u></u> </p></div></div></blockquote><div><br></div><div>Did you try to disable resource stickiness for this ms?<br></div><div> </div></div></div>