<div dir="ltr">Hi All, <div>We are seeing an issue where we replaced no-quorum-policy=ignore with other options in corosync.conf order to simulate the same behaviour :</div><div><br> <b> wait_for_all: 0<br></b></div><b> last_man_standing: 1<br> last_man_standing_window: 20000</b><div><b><br></b></div><div>There was another property (auto-tie-breaker) tried but couldn't configure it as crm did not recognise this property.<br><div><br></div><div>But even after using these options, we are seeing that system is not quorate if at least half of the nodes are not up. </div><div><br></div><div>Some properties from crm config are as follows: </div><div><br></div><div><b>primitive stonith-sbd stonith:external/sbd \<br> params pcmk_delay_base=5s<br>.</b></div><div><b>.<br>property cib-bootstrap-options: \<br> have-watchdog=true \<br> dc-version="2.1.2+20211124.ada5c3b36-150400.2.43-2.1.2+20211124.ada5c3b36" \<br> cluster-infrastructure=corosync \<br> cluster-name=FILE \<br> stonith-enabled=true \<br> stonith-timeout=172 \<br> stonith-action=reboot \<br> stop-all-resources=false \<br> no-quorum-policy=ignore<br>rsc_defaults build-resource-defaults: \<br> resource-stickiness=1<br>rsc_defaults rsc-options: \<br> resource-stickiness=100 \<br> migration-threshold=3 \<br> failure-timeout=1m \<br> cluster-recheck-interval=10min<br>op_defaults op-options: \<br> timeout=600 \<br> record-pending=true</b><br></div><div><b><br></b></div><div>On a 4-node setup when the whole cluster is brought up together we see error logs like: </div><div><br></div><div><p style="margin:0in;font-family:Calibri;font-size:11pt"><b>2023-06-26T11:35:17.231104+00:00
FILE-1 pacemaker-schedulerd[26359]:
warning: <font color="#000000">Fencing and resource management disabled due to lack of quorum</font></b></p>
<p style="margin:0in;font-family:Calibri;font-size:11pt"><b>2023-06-26T11:35:17.231338+00:00
FILE-1 pacemaker-schedulerd[26359]:
warning: Ignoring malformed node_state entry without uname</b></p>
<p style="margin:0in;font-family:Calibri;font-size:11pt"><b>2023-06-26T11:35:17.233771+00:00
FILE-1 pacemaker-schedulerd[26359]:
warning: Node FILE-2 is unclean!</b></p>
<p style="margin:0in;font-family:Calibri;font-size:11pt"><b>2023-06-26T11:35:17.233857+00:00
FILE-1 pacemaker-schedulerd[26359]:
warning: Node FILE-3 is unclean!</b></p>
<p style="margin:0in;font-family:Calibri;font-size:11pt"><b>2023-06-26T11:35:17.233957+00:00
FILE-1 pacemaker-schedulerd[26359]:
warning: Node FILE-4 is unclean!</b></p><p style="margin:0in;font-family:Calibri;font-size:11pt"><b><br></b></p></div><div>Kindly help correct the configuration to make the system function normally with all resources up, even if there is just one node up. </div><div><br></div><div>Please let me know if any more info is needed. </div><div><br></div><div>Thanks</div><div>Priyanka</div><div><br></div><div><br></div></div></div>