<!DOCTYPE html>
<html>
<head>
<meta http-equiv="content-type" content="text/html; charset=UTF-8">
</head>
<body>
<font size="2" face="Courier 10 Pitch">hi guys<br>
<br>
Having 3-node pgSQL cluster with PAF - when all three systems are
shutdown at virtually the same time then PAF fails to start when
HA cluster is operational again.<br>
<br>
from status:<br>
...<br>
Migration Summary:<br>
* Node: ubusrv2 (2):<br>
* PGSQL-PAF-5433: migration-threshold=1000000
fail-count=1000000 last-failure='Tue Nov 7 17:52:38 2023'<br>
* Node: ubusrv3 (3):<br>
* PGSQL-PAF-5433: migration-threshold=1000000
fail-count=1000000 last-failure='Tue Nov 7 17:52:38 2023'<br>
* Node: ubusrv1 (1):<br>
* PGSQL-PAF-5433: migration-threshold=1000000
fail-count=1000000 last-failure='Tue Nov 7 17:52:38 2023'<br>
<br>
Failed Resource Actions:<br>
* PGSQL-PAF-5433_stop_0 on ubusrv2 'error' (1): call=90,
status='complete', exitreason='Unexpected state for instance
"PGSQL-PAF-5433" (returned 1)', last-rc-change='Tue Nov 7
17:52:38 2023', queued=0ms, exec=84ms<br>
* PGSQL-PAF-5433_stop_0 on ubusrv3 'error' (1): call=82,
status='complete', exitreason='Unexpected state for instance
"PGSQL-PAF-5433" (returned 1)', last-rc-change='Tue Nov 7
17:52:38 2023', queued=0ms, exec=82ms<br>
* PGSQL-PAF-5433_stop_0 on ubusrv1 'error' (1): call=86,
status='complete', exitreason='Unexpected state for instance
"PGSQL-PAF-5433" (returned 1)', last-rc-change='Tue Nov 7
17:52:38 2023', queued=0ms, exec=108ms<br>
<br>
and all three pgSQLs show virtually identical logs:<br>
...<br>
2023-11-07 16:54:45.532 UTC [24936] LOG: starting PostgreSQL 14.9
(Ubuntu 14.9-0ubuntu0.22.04.1) on x86_64-pc-linux-gnu, compiled by
gcc (Ubuntu 11.4.0-1ubuntu1~22.04) 11.4.0, 64-bit<br>
2023-11-07 16:54:45.532 UTC [24936] LOG: listening on IPv4
address "0.0.0.0", port 5433<br>
2023-11-07 16:54:45.532 UTC [24936] LOG: listening on IPv6
address "::", port 5433<br>
2023-11-07 16:54:45.535 UTC [24936] LOG: listening on Unix socket
"/var/run/postgresql/.s.PGSQL.5433"<br>
2023-11-07 16:54:45.547 UTC [24938] LOG: database system was
interrupted while in recovery at log time 2023-11-07 15:30:56 UTC<br>
2023-11-07 16:54:45.547 UTC [24938] HINT: If this has occurred
more than once some data might be corrupted and you might need to
choose an earlier recovery target.<br>
2023-11-07 16:54:45.819 UTC [24938] LOG: entering standby mode<br>
2023-11-07 16:54:45.824 UTC [24938] FATAL: could not open
directory "/var/run/postgresql/14-paf.pg_stat_tmp": No such file
or directory<br>
2023-11-07 16:54:45.825 UTC [24936] LOG: startup process (PID
24938) exited with exit code 1<br>
2023-11-07 16:54:45.825 UTC [24936] LOG: aborting startup due to
startup process failure<br>
2023-11-07 16:54:45.826 UTC [24936] LOG: database system is shut
down<br>
<br>
Is this "test" case's result, as I showed above, expected? It
reproduces every time.<br>
If not - what might it be I'm missing?<br>
<br>
many thanks, L.<br>
</font>
</body>
</html>