<div style="line-height:1.7;color:#000000;font-size:14px;font-family:Arial"><div><br>I have two pacemaker resources, which I will call A and B. For environmental reasons, their start and monitor methods always return failure (OCF_ERR_GENERIC). Their configurations are below (the cluster property start-failure-is-fatal is set to false):<br><br>primitive A A \<br>        op monitor interval=20 timeout=120 \<br>        op stop interval=0 timeout=120 on-fail=restart \<br>        op start interval=0 timeout=240 on-fail=restart \<br>        meta failure-timeout=60s<br>primitive B B \<br>        op monitor interval=20 timeout=120 \<br>        op stop interval=0 timeout=120 on-fail=restart \<br>        op start interval=0 timeout=240 on-fail=restart \<br>        meta failure-timeout=60s<br>clone A_cl A<br>clone B_cl B<br><br>Their methods take different amounts of time to run:<br>A:<br>start = 60s       monitor &lt; 1s        stop = 80s<br>B:<br>start &lt; 1s        monitor &lt; 1s        stop &lt; 1s<br><br>Resource A is scheduled normally: it is repeatedly started and stopped. For resource B, however, only the recurring monitor fails over and over, and no start or stop is ever run. In addition, no fail-count is shown for B in "crm status -f".<br><br>Either of the following two changes solves the problem of B not being scheduled:<br>1. Increase the failure-timeout of B from 60s to 600s<br>2. Modify the OCF agent of A so that its stop method returns as quickly as possible<br><br>I tested this several times, and the results were the same. Why is the resource not scheduled when failure-timeout is set too short? And what does this have to do with the time-consuming stop of another resource? Is this a bug?<br><br>My pacemaker version is 1.1.16. Any suggestions are welcome. Thank you!<br><br><br>James<br>2018-05-20<br></div></div>