<div dir="ltr">Thanks for the clarifications guys!</div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">Il giorno mar 27 apr 2021 alle ore 18:24 Jehan-Guillaume de Rorthais <<a href="mailto:jgdr@dalibo.com">jgdr@dalibo.com</a>> ha scritto:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">On Mon, 26 Apr 2021 18:04:41 +0000 (UTC)<br>

Strahil Nikolov <<a href="mailto:hunter86_bg@yahoo.com" target="_blank">hunter86_bg@yahoo.com</a>> wrote:<br>

<br>

> I prefer that the stack is auto enabled. Imagine that you got a DB that is<br>

> replicated and primary DB node is fenced. You would like that node to join<br>

> the cluster and if possible to sync with the new primary instead of staying<br>

> down.<br>

<br>

In the case of PostgreSQL, the failing primary may not be able to failback<br>

automatically with the new primary. Worse, if it actually enters in<br>

replication, it might just silently become a corrupted standby, giving a wrong<br>

feeling of safety, until a new failover occurs.<br>

<br>

PAF doesn't handle auto-failback (eg. pg_rewind) per design, to avoid code<br>

complexity. We don't want to give a wrong feeling of perfect<br>

full-availability/failback/fully-automated-admin'ed PgSQL cluster. If something<br>

went wrong with your DB, you better need to check and fix it. You need both<br>

system and DBA guy on board to take care of the availability and safety of your<br>

cluster.<br>

<br>

Note that auto-failback of secondary nodes is safe, as far as they are able<br>

to actually follow up with the production. Maybe we can imaginer some safety<br>

belts in PAF's code to allow Pacemaker auto-start on boot, but refuse<br>

to start a badly shaped PostgreSQL.<br>

<br>

> One such example is the SAP HANA DB. Imagine that the current primary<br>

> node looses storage and it failed to commit all transactions to disk. Without<br>

> replication you will endure data loss for the last 1-2 minutes (depends on<br>

> your monitoring interval) unless you got a replication.<br>

<br>

PAF is a shared-nothing approach, it requires replication between nodes.<br>

<br>

<br>

_______________________________________________<br>

Manage your subscription:<br>

<a href="https://lists.clusterlabs.org/mailman/listinfo/users" rel="noreferrer" target="_blank">https://lists.clusterlabs.org/mailman/listinfo/users</a><br>

<br>

ClusterLabs home: <a href="https://www.clusterlabs.org/" rel="noreferrer" target="_blank">https://www.clusterlabs.org/</a><br>

</blockquote></div>