[ClusterLabs] Error observed while starting cluster

Roshni Chatterjee roshni.chatterjee at india.nec.com
Tue Mar 20 01:59:12 EDT 2018


Hi ,

Error observed in pacemaker and pcs status
Error: cluster is not currently running on this node

!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!

I have built the source  code of corosync (2.4.2) and pacemaker (1.1.16)  and  have followed the below steps for building a 2 node cluster .


1.       Download source  code of corosync and pacemaker (versions as mentioned above ) and compile .

2.       Install pcsd using “yum install pcs”

3.       Allow cluster services through firewall using #firewall-cmd --permanent --add-service=high-availability

4.       Start and enable pcsd #systemctl start pcsd and #systemctl enable pcsd

5.       Change password for user hacluster

6.       pcs cluster auth pcmk3 node2

7.       pcs cluster setup --name mycluster pcmk3 node2

8.       pcs cluster start -all

9.       pcs status


It is observed that the no error is received till step 8 . At step 9 when pcs status is checked error is received (highlighted below)
[root at node2 ~]# pacemakerd --features
Pacemaker 1.1.16 (Build: 94ff4df51a)
Supporting v3.0.11:  agent-manpages libqb-logging libqb-ipc nagios  corosync-native atomic-attrd acls
[root at node2 ~]# pcs cluster start --all
pcmk3: Starting Cluster...
node2: Starting Cluster...
[root at node2 ~]# pcs status
Error: cluster is not currently running on this node

On checking pacemaker status the following issue is found -
[root at pcmk3 ~]# systemctl pacemaker status -l
Unknown operation 'pacemaker'.
[root at pcmk3 ~]# systemctl status pacemaker -l
● pacemaker.service - Pacemaker High Availability Cluster Manager
   Loaded: loaded (/usr/lib/systemd/system/pacemaker.service; disabled; vendor preset: disabled)
   Active: active (running) since Tue 2018-03-20 10:55:44 IST; 13min ago
     Docs: man:pacemakerd
           http://clusterlabs.org/doc/en-US/Pacemaker/1.1-pcs/html/Pacemaker_Explained/index.html
Main PID: 26932 (pacemakerd)
   CGroup: /system.slice/pacemaker.service
           ├─26932 /usr/sbin/pacemakerd -f
           ├─26933 /usr/libexec/pacemaker/cib
           ├─26934 /usr/libexec/pacemaker/stonithd
           ├─26935 /usr/libexec/pacemaker/lrmd
           ├─26936 /usr/libexec/pacemaker/attrd
           └─26937 /usr/libexec/pacemaker/pengine

Mar 20 10:55:45 pcmk3 pacemakerd[26932]:   notice: Respawning failed child process: crmd
Mar 20 10:55:45 pcmk3 pacemakerd[26932]:    error: The crmd process (27035) exited: Key has expired (127)
Mar 20 10:55:45 pcmk3 pacemakerd[26932]:   notice: Respawning failed child process: crmd
Mar 20 10:55:45 pcmk3 pacemakerd[26932]:    error: The crmd process (27036) exited: Key has expired (127)
Mar 20 10:55:45 pcmk3 pacemakerd[26932]:   notice: Respawning failed child process: crmd
Mar 20 10:55:45 pcmk3 pacemakerd[26932]:    error: The crmd process (27037) exited: Key has expired (127)
Mar 20 10:55:45 pcmk3 pacemakerd[26932]:   notice: Respawning failed child process: crmd
Mar 20 10:55:45 pcmk3 pacemakerd[26932]:    error: The crmd process (27038) exited: Key has expired (127)
Mar 20 10:55:45 pcmk3 pacemakerd[26932]:    error: Child respawn count exceeded by crmd
Mar 20 10:56:21 pcmk3 cib[26933]:    error: Operation ignored, cluster configuration is invalid. Please repair and restart: Update does not conform to the configured schema
[root at pcmk3 ~]#

Corosync.log
Mar 20 10:55:45 [26932] pcmk3 pacemakerd:     info: start_child:        Forked child 27035 for process crmd
Mar 20 10:55:45 [26932] pcmk3 pacemakerd:     info: mcp_cpg_deliver:    Ignoring process list sent by peer for local node
Mar 20 10:55:45 [26932] pcmk3 pacemakerd:     info: mcp_cpg_deliver:    Ignoring process list sent by peer for local node
Mar 20 10:55:45 [26932] pcmk3 pacemakerd:    error: pcmk_child_exit:    The crmd process (27035) exited: Key has expired (127)
Mar 20 10:55:45 [26932] pcmk3 pacemakerd:   notice: pcmk_process_exit:  Respawning failed child process: crmd
Mar 20 10:55:45 [26932] pcmk3 pacemakerd:     info: start_child:        Using uid=189 and group=189 for process crmd
Mar 20 10:55:45 [26932] pcmk3 pacemakerd:     info: start_child:        Forked child 27036 for process crmd
Mar 20 10:55:45 [26932] pcmk3 pacemakerd:     info: mcp_cpg_deliver:    Ignoring process list sent by peer for local node
Mar 20 10:55:45 [26932] pcmk3 pacemakerd:     info: mcp_cpg_deliver:    Ignoring process list sent by peer for local node
Mar 20 10:55:45 [26932] pcmk3 pacemakerd:    error: pcmk_child_exit:    The crmd process (27036) exited: Key has expired (127)
Mar 20 10:55:45 [26932] pcmk3 pacemakerd:   notice: pcmk_process_exit:  Respawning failed child process: crmd
Mar 20 10:55:45 [26932] pcmk3 pacemakerd:     info: start_child:        Using uid=189 and group=189 for process crmd
Mar 20 10:55:45 [26932] pcmk3 pacemakerd:     info: start_child:        Forked child 27037 for process crmd
Mar 20 10:55:45 [26932] pcmk3 pacemakerd:     info: mcp_cpg_deliver:    Ignoring process list sent by peer for local node
Mar 20 10:55:45 [26932] pcmk3 pacemakerd:     info: mcp_cpg_deliver:    Ignoring process list sent by peer for local node
Mar 20 10:55:45 [26932] pcmk3 pacemakerd:    error: pcmk_child_exit:    The crmd process (27037) exited: Key has expired (127)
Mar 20 10:55:45 [26932] pcmk3 pacemakerd:   notice: pcmk_process_exit:  Respawning failed child process: crmd


Regards,
Roshni

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.clusterlabs.org/pipermail/users/attachments/20180320/983cf2bc/attachment-0001.html>


More information about the Users mailing list