[ClusterLabs] Pacemaker newbie needs advice for adding a new cluster node

thl at it-hluchnik.de thl at it-hluchnik.de
Thu Aug 11 07:20:24 EDT 2016


Hello all,

maybe someone can help me adding a node in my test cluster.

I have a working three-node test Cluster in my VirtualBox, using OEL7.2 and now I try to add another node but I have no plan and I did some kind of try & error which does not work.

This is my three-node cluster:

# pcs status
Cluster name: zweiter
Last updated: Thu Aug 11 12:53:55 2016          Last change: Thu Aug 11 12:53:47 2016 by root via cibadmin on knoten01-hb
Stack: corosync
Current DC: knoten01-hb (version 1.1.13-10.el7-44eb2dd) - partition with quorum
3 nodes and 12 resources configured

Online: [ knoten01-hb knoten02-hb knoten03-hb ]

Full list of resources:

 Clone Set: ResO2CB-clone [ResO2CB]
     Started: [ knoten01-hb knoten02-hb knoten03-hb ]
 Clone Set: ResOCFS2_00-clone [ResOCFS2_00]
     Started: [ knoten01-hb knoten02-hb knoten03-hb ]
 Clone Set: ClusterIP-clone [ClusterIP] (unique)
     ClusterIP:0        (ocf::heartbeat:IPaddr2):       Started knoten01-hb
     ClusterIP:1        (ocf::heartbeat:IPaddr2):       Started knoten02-hb
     ClusterIP:2        (ocf::heartbeat:IPaddr2):       Started knoten03-hb
 Clone Set: WebSite-clone [WebSite]
     Started: [ knoten01-hb knoten02-hb knoten03-hb ]

PCSD Status:
  knoten01-hb: Online
  knoten02-hb: Online
  knoten03-hb: Online

Daemon Status:
  corosync: active/disabled
  pacemaker: active/disabled
  pcsd: active/enabled




I setup another host with name knoten04-hb, pcsd is started by systemd. My first idea was modifying /etc/corosync/corosync.conf on all four hosts, adding the new host. When done, without any stop/start action, pcs shows me this:

# pcs status
Cluster name: zweiter
Last updated: Thu Aug 11 13:02:22 2016          Last change: Thu Aug 11 12:53:47 2016 by root via cibadmin on knoten01-hb
Stack: corosync
Current DC: knoten01-hb (version 1.1.13-10.el7-44eb2dd) - partition with quorum
3 nodes and 12 resources configured

Online: [ knoten01-hb knoten02-hb knoten03-hb ]

Full list of resources:

 Clone Set: ResO2CB-clone [ResO2CB]
     Started: [ knoten01-hb knoten02-hb knoten03-hb ]
 Clone Set: ResOCFS2_00-clone [ResOCFS2_00]
     Started: [ knoten01-hb knoten02-hb knoten03-hb ]
 Clone Set: ClusterIP-clone [ClusterIP] (unique)
     ClusterIP:0        (ocf::heartbeat:IPaddr2):       Started knoten01-hb
     ClusterIP:1        (ocf::heartbeat:IPaddr2):       Started knoten02-hb
     ClusterIP:2        (ocf::heartbeat:IPaddr2):       Started knoten03-hb
 Clone Set: WebSite-clone [WebSite]
     Started: [ knoten01-hb knoten02-hb knoten03-hb ]

PCSD Status:
  knoten01-hb: Online
  knoten02-hb: Online
  knoten03-hb: Online
  knoten04-hb: Online

Daemon Status:
  corosync: active/disabled
  pacemaker: active/disabled
  pcsd: active/enabled


PCSD Status shows the new node, everything else is unchanged. I guess the corosync part is OK.




Next step is getting pacemaker configured. I tried:

# cibadmin --query > add_knoten04-hb_cfg
# vi add_knoten04-hb_cfg
# cibadmin --replace --xml-file add_knoten04-hb_cfg

The only entry I changed was this:

    <nodes>
      <node id="1" uname="knoten01-hb"/>
      <node id="2" uname="knoten02-hb"/>
      <node id="3" uname="knoten03-hb"/>
      <node id="4" uname="knoten04-hb"/>        <==============
    </nodes>




Again, this looks good:

# pcs status
Cluster name: zweiter
Last updated: Thu Aug 11 13:08:19 2016          Last change: Thu Aug 11 13:08:11 2016 by root via cibadmin on knoten01-hb
Stack: corosync
Current DC: knoten01-hb (version 1.1.13-10.el7-44eb2dd) - partition with quorum
4 nodes and 13 resources configured

Online: [ knoten01-hb knoten02-hb knoten03-hb ]
OFFLINE: [ knoten04-hb ]

Full list of resources:

 Clone Set: ResO2CB-clone [ResO2CB]
     Started: [ knoten01-hb knoten02-hb knoten03-hb ]
 Clone Set: ResOCFS2_00-clone [ResOCFS2_00]
     Started: [ knoten01-hb knoten02-hb knoten03-hb ]
 Clone Set: ClusterIP-clone [ClusterIP] (unique)
     ClusterIP:0        (ocf::heartbeat:IPaddr2):       Started knoten01-hb
     ClusterIP:1        (ocf::heartbeat:IPaddr2):       Started knoten02-hb
     ClusterIP:2        (ocf::heartbeat:IPaddr2):       Started knoten03-hb
 Clone Set: WebSite-clone [WebSite]
     Started: [ knoten01-hb knoten02-hb knoten03-hb ]
     Stopped: [ knoten04-hb ]

PCSD Status:
  knoten01-hb: Online
  knoten02-hb: Online
  knoten03-hb: Online
  knoten04-hb: Online

Daemon Status:
  corosync: active/disabled
  pacemaker: active/disabled
  pcsd: active/enabled


knoten04-hb is known but offline, for sure. OK, let's start it:




# pcs cluster start knoten04-hb
knoten04-hb: Starting Cluster...

The funny thing: pcs status on the old nodes tell me:

[root at knoten01 ~]# pcs status
Cluster name: zweiter
Last updated: Thu Aug 11 13:10:45 2016          Last change: Thu Aug 11 13:08:11 2016 by root via cibadmin on knoten01-hb
Stack: corosync
Current DC: knoten01-hb (version 1.1.13-10.el7-44eb2dd) - partition with quorum
4 nodes and 13 resources configured

Online: [ knoten01-hb knoten02-hb knoten03-hb ]
OFFLINE: [ knoten04-hb ]

Full list of resources:

 Clone Set: ResO2CB-clone [ResO2CB]
     Started: [ knoten01-hb knoten02-hb knoten03-hb ]
 Clone Set: ResOCFS2_00-clone [ResOCFS2_00]
     Started: [ knoten01-hb knoten02-hb knoten03-hb ]
 Clone Set: ClusterIP-clone [ClusterIP] (unique)
     ClusterIP:0        (ocf::heartbeat:IPaddr2):       Started knoten01-hb
     ClusterIP:1        (ocf::heartbeat:IPaddr2):       Started knoten02-hb
     ClusterIP:2        (ocf::heartbeat:IPaddr2):       Started knoten03-hb
 Clone Set: WebSite-clone [WebSite]
     Started: [ knoten01-hb knoten02-hb knoten03-hb ]
     Stopped: [ knoten04-hb ]

PCSD Status:
  knoten01-hb: Online
  knoten02-hb: Online
  knoten03-hb: Online
  knoten04-hb: Online

Daemon Status:
  corosync: active/disabled
  pacemaker: active/disabled
  pcsd: active/enabled


And pcs on the new node tells me that:

[root at knoten04 ~]# pcs status
Cluster name: zweiter
Last updated: Thu Aug 11 13:10:25 2016          Last change: Thu Aug 11 12:40:38 2016 by hacluster via crmd on knoten04-hb
Stack: corosync
Current DC: knoten04-hb (version 1.1.13-10.el7-44eb2dd) - partition WITHOUT quorum
4 nodes and 13 resources configured

Online: [ knoten04-hb ]
OFFLINE: [ knoten01-hb knoten02-hb knoten03-hb ]

Full list of resources:

 Clone Set: ResO2CB-clone [ResO2CB]
     Stopped: [ knoten01-hb knoten02-hb knoten03-hb knoten04-hb ]
 Clone Set: ResOCFS2_00-clone [ResOCFS2_00]
     Stopped: [ knoten01-hb knoten02-hb knoten03-hb knoten04-hb ]
 Clone Set: ClusterIP-clone [ClusterIP] (unique)
     ClusterIP:0        (ocf::heartbeat:IPaddr2):       Stopped
     ClusterIP:1        (ocf::heartbeat:IPaddr2):       Stopped
     ClusterIP:2        (ocf::heartbeat:IPaddr2):       Stopped
 Clone Set: WebSite-clone [WebSite]
     Stopped: [ knoten01-hb knoten02-hb knoten03-hb knoten04-hb ]

PCSD Status:
  knoten01-hb: Online
  knoten02-hb: Online
  knoten03-hb: Online
  knoten04-hb: Online

Daemon Status:
  corosync: active/disabled
  pacemaker: active/disabled
  pcsd: active/enabled


This is obviously no valid cluster. So what am I doing wrong? How to add the node, getting a working four-node cluster?

Thanks in advance for any help.

Thomas
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 819 bytes
Desc: This is a digitally signed message part.
URL: <http://lists.clusterlabs.org/pipermail/users/attachments/20160811/150ddcee/attachment-0002.sig>


More information about the Users mailing list