[ClusterLabs] PAF with postgresql 13?
CHAMPAGNE Julie
julie.champagne at pm.gouv.fr
Mon Mar 7 09:32:46 EST 2022
Hi,
Thanks for your quick answer. Here are my answers:
I tried 2 different tests to create a problem in the cluster:
- killing postgres processes on node1: pkill postgres
- putting node1 in standby in the cluster with the following command: pcs node standby node1
Both gave me the same result regarding the error message with lsn location. In the pacemaker logs:
pgsqlms(pgsqld)[5180]: Mar 07 15:29:48 ERROR: Can not get current node LSN location
Mar 07 15:29:48 node2 pacemaker-execd [692] (log_op_output) notice: pgsqld_promote_0[5180] error output [ Could not query value of cancel_switchover-pgsqld: attribute does not exist ]
Mar 07 15:29:48 node2 pacemaker-execd [692] (log_op_output) notice: pgsqld_promote_0[5180] error output [ Could not query value of recover_master-pgsqld: attribute does not exist ]
Mar 07 15:29:48 node2 pacemaker-execd [692] (log_op_output) notice: pgsqld_promote_0[5180] error output [ Could not query value of nodes-pgsqld: attribute does not exist ]
Mar 07 15:29:48 node2 pacemaker-execd [692] (log_op_output) notice: pgsqld_promote_0[5180] error output [ Could not query value of lsn_location-pgsqld: attribute does not exist ]
Mar 07 15:29:48 node2 pacemaker-execd [692] (log_op_output) notice: pgsqld_promote_0[5180] error output [ ocf-exit-reason:Can not get current node LSN location ]
Mar 07 15:29:48 node2 pacemaker-execd [692] (log_finished) info: pgsqld promote (call 233, PID 5180) exited with status 1 (execution time 183ms, queue time 0ms)
Mar 07 15:29:48 node2 pacemaker-controld [695] (process_lrm_event) notice: Result of promote operation for pgsqld on node2: error | rc=1 call=233 key=pgsqld_promote_0 confirmed=true cib-update=382
Mar 07 15:29:48 node2 pacemaker-controld [695] (process_lrm_event) notice: node2-pgsqld_promote_0:233 [ Could not query value of cancel_switchover-pgsqld: attribute does not exist\nCould not query value of recover_master-pgsqld: attribute does not exist\nCould not query value of nodes-pgsqld: attribute does not exist\nCould not query value of lsn_location-pgsqld: attribute does not exist\nocf-exit-reason:Can not get current node LSN location\n ]
Our versions are Debian 11.2 and PAF 2.3.0 (installed from debian repository).
The attrd_updater command returns:
root at node1 ~ > attrd_updater --private --lifetime reboot --name "lsn_location-pgsqld" --query
Could not query value of lsn_location-pgsqld: attribute does not exist
Thanks !
-----Message d'origine-----
De : Jehan-Guillaume de Rorthais <jgdr at dalibo.com>
Envoyé : lundi 7 mars 2022 15:15
À : CHAMPAGNE Julie <julie.champagne at pm.gouv.fr>
Cc : Cluster Labs - All topics related to open-source clustering welcomed <users at clusterlabs.org>
Objet : Re: [ClusterLabs] PAF with postgresql 13?
Hi,
Caution, this is an english spoken mailing list :)
Bellow my answer.
On Mon, 7 Mar 2022 12:31:07 +0000
CHAMPAGNE Julie <julie.champagne at pm.gouv.fr> wrote:
> Lorsque je crée un problème sur le noeud1,
What's the issue you are testing precisely?
> * pgsqld_promote_0 on node2 'error' (1): call=24, status='complete',
> exitreason='Can not get current node LSN location',
It seems the agent had some trouble getting some private attributes from the cluster. Could you give exact:
* Debian version
* PAF version
Do you find any error in logs about setting/getting lsn_location attribute ?
What is the result of the following command:
attrd_updater --private --lifetime reboot --name "lsn_location-pgsqld" --query
Thanks,
More information about the Users
mailing list