<div dir="ltr"><div>Was trying to simulate the "Corosync main process was not scheduled for xx ms" issue.</div><div>Have a 2 node cluster with with a clone resource(Master-Slave).</div><div><br></div><div><b>TEST #1</b></div><div>  Initially state of cluster resource: </div><div>    NodeA - Master</div><div>    NodeB - Slave</div><div>    Current DC - NodeA</div><div><br></div><div>  Now I freeze the corosync main process on the Master using command : </div><div>  # kill -STOP $(pidof corosync)</div><div><br></div><div>  State of cluster resource:</div><div>    NodeA - Master</div><div>    NodeB - Master</div><div>    Current DC - NodeA NodeB</div><div><br></div><div>  Now continue the corosync process on the Master :</div><div>  # kill -CONT $(pidof corosync)</div><div><br></div><div>  State of cluster resource:</div><div>    NodeA - Master</div><div>    NodeB - Slave</div><div>    Current DC - NodeA</div><div><br></div><div><b>TEST #2</b></div><div>  Did the same test with the below initial state of cluster resource:</div><div>    NodeA - Slave</div><div>    NodeB - Master</div><div>    Current DC - NodeA</div><div><br></div><div>  The final state in this case: </div><div>    NodeA - Master</div><div>    NodeB - Slave</div><div>    Current DC - NodeA</div><div><br></div><div>In 1st test NodeA started as Master and ended as Master.</div><div>In 2nd test NodeB started as Master and ended as Slave.</div><div>NodeA is the current DC in both the tests.</div><div>The behavior is not well defined in these tests.</div><div>This is always reproducible.</div><div><br></div><div>Cluster Properties:</div><div> cluster-infrastructure: corosync</div><div> cluster-name: APPHA</div><div> cluster-recheck-interval: 2s</div><div> dc-deadtime: 5</div><div> dc-version: 1.1.16-12.el7-94ff4df</div><div> have-watchdog: false</div><div> load-threshold: 1000%</div><div> no-quorum-policy: ignore</div><div> start-failure-is-fatal: false</div><div> stonith-enabled: false</div><div><br></div><div>Resource properties: </div><div>  Operations: demote interval=0s timeout=30 (APPHA-demote-interval-0s)</div><div>              monitor interval=1 role=Master timeout=40 (APPHA-monitor-interval-1)</div><div>              monitor interval=2 role=Slave timeout=40 (APPHA-monitor-interval-2)</div><div>              promote interval=0s timeout=30 (APPHA-promote-interval-0s)</div><div>              start interval=0s timeout=180 (APPHA-start-interval-0s)</div><div>              stop interval=0s timeout=180 (APPHA-stop-interval-0s)</div><div><br></div><div><br></div><div>Versions of the rpms installed:</div><div>pacemaker-cli-1.1.16-12.el7.x86_64</div><div>resource-agents-3.9.5-105.el7.x86_64</div><div>pacemaker-libs-1.1.16-12.el7.x86_64</div><div>pacemaker-cluster-libs-1.1.16-12.el7.x86_64</div><div>pcs-0.9.158-6.el7.centos.x86_64</div><div>corosync-2.4.0-9.el7.x86_64</div><div>pacemaker-1.1.16-12.el7.x86_64</div><div>corosynclib-2.4.0-9.el7.x86_64</div><div><br></div><div>Linux ditro : </div><div>CentOS Linux release 7.4.1708 (Core)</div><div><br></div><div>Regards,</div><div>Abhay</div><div><br></div></div>