[ClusterLabs] Pacemaker newbie needs advice for adding a new	cluster node
    thl at it-hluchnik.de 
    thl at it-hluchnik.de
       
    Thu Aug 11 11:20:24 UTC 2016
    
    
  
Hello all,
maybe someone can help me adding a node in my test cluster.
I have a working three-node test Cluster in my VirtualBox, using OEL7.2 and now I try to add another node but I have no plan and I did some kind of try & error which does not work.
This is my three-node cluster:
# pcs status
Cluster name: zweiter
Last updated: Thu Aug 11 12:53:55 2016          Last change: Thu Aug 11 12:53:47 2016 by root via cibadmin on knoten01-hb
Stack: corosync
Current DC: knoten01-hb (version 1.1.13-10.el7-44eb2dd) - partition with quorum
3 nodes and 12 resources configured
Online: [ knoten01-hb knoten02-hb knoten03-hb ]
Full list of resources:
 Clone Set: ResO2CB-clone [ResO2CB]
     Started: [ knoten01-hb knoten02-hb knoten03-hb ]
 Clone Set: ResOCFS2_00-clone [ResOCFS2_00]
     Started: [ knoten01-hb knoten02-hb knoten03-hb ]
 Clone Set: ClusterIP-clone [ClusterIP] (unique)
     ClusterIP:0        (ocf::heartbeat:IPaddr2):       Started knoten01-hb
     ClusterIP:1        (ocf::heartbeat:IPaddr2):       Started knoten02-hb
     ClusterIP:2        (ocf::heartbeat:IPaddr2):       Started knoten03-hb
 Clone Set: WebSite-clone [WebSite]
     Started: [ knoten01-hb knoten02-hb knoten03-hb ]
PCSD Status:
  knoten01-hb: Online
  knoten02-hb: Online
  knoten03-hb: Online
Daemon Status:
  corosync: active/disabled
  pacemaker: active/disabled
  pcsd: active/enabled
I setup another host with name knoten04-hb, pcsd is started by systemd. My first idea was modifying /etc/corosync/corosync.conf on all four hosts, adding the new host. When done, without any stop/start action, pcs shows me this:
# pcs status
Cluster name: zweiter
Last updated: Thu Aug 11 13:02:22 2016          Last change: Thu Aug 11 12:53:47 2016 by root via cibadmin on knoten01-hb
Stack: corosync
Current DC: knoten01-hb (version 1.1.13-10.el7-44eb2dd) - partition with quorum
3 nodes and 12 resources configured
Online: [ knoten01-hb knoten02-hb knoten03-hb ]
Full list of resources:
 Clone Set: ResO2CB-clone [ResO2CB]
     Started: [ knoten01-hb knoten02-hb knoten03-hb ]
 Clone Set: ResOCFS2_00-clone [ResOCFS2_00]
     Started: [ knoten01-hb knoten02-hb knoten03-hb ]
 Clone Set: ClusterIP-clone [ClusterIP] (unique)
     ClusterIP:0        (ocf::heartbeat:IPaddr2):       Started knoten01-hb
     ClusterIP:1        (ocf::heartbeat:IPaddr2):       Started knoten02-hb
     ClusterIP:2        (ocf::heartbeat:IPaddr2):       Started knoten03-hb
 Clone Set: WebSite-clone [WebSite]
     Started: [ knoten01-hb knoten02-hb knoten03-hb ]
PCSD Status:
  knoten01-hb: Online
  knoten02-hb: Online
  knoten03-hb: Online
  knoten04-hb: Online
Daemon Status:
  corosync: active/disabled
  pacemaker: active/disabled
  pcsd: active/enabled
PCSD Status shows the new node, everything else is unchanged. I guess the corosync part is OK.
Next step is getting pacemaker configured. I tried:
# cibadmin --query > add_knoten04-hb_cfg
# vi add_knoten04-hb_cfg
# cibadmin --replace --xml-file add_knoten04-hb_cfg
The only entry I changed was this:
    <nodes>
      <node id="1" uname="knoten01-hb"/>
      <node id="2" uname="knoten02-hb"/>
      <node id="3" uname="knoten03-hb"/>
      <node id="4" uname="knoten04-hb"/>        <==============
    </nodes>
Again, this looks good:
# pcs status
Cluster name: zweiter
Last updated: Thu Aug 11 13:08:19 2016          Last change: Thu Aug 11 13:08:11 2016 by root via cibadmin on knoten01-hb
Stack: corosync
Current DC: knoten01-hb (version 1.1.13-10.el7-44eb2dd) - partition with quorum
4 nodes and 13 resources configured
Online: [ knoten01-hb knoten02-hb knoten03-hb ]
OFFLINE: [ knoten04-hb ]
Full list of resources:
 Clone Set: ResO2CB-clone [ResO2CB]
     Started: [ knoten01-hb knoten02-hb knoten03-hb ]
 Clone Set: ResOCFS2_00-clone [ResOCFS2_00]
     Started: [ knoten01-hb knoten02-hb knoten03-hb ]
 Clone Set: ClusterIP-clone [ClusterIP] (unique)
     ClusterIP:0        (ocf::heartbeat:IPaddr2):       Started knoten01-hb
     ClusterIP:1        (ocf::heartbeat:IPaddr2):       Started knoten02-hb
     ClusterIP:2        (ocf::heartbeat:IPaddr2):       Started knoten03-hb
 Clone Set: WebSite-clone [WebSite]
     Started: [ knoten01-hb knoten02-hb knoten03-hb ]
     Stopped: [ knoten04-hb ]
PCSD Status:
  knoten01-hb: Online
  knoten02-hb: Online
  knoten03-hb: Online
  knoten04-hb: Online
Daemon Status:
  corosync: active/disabled
  pacemaker: active/disabled
  pcsd: active/enabled
knoten04-hb is known but offline, for sure. OK, let's start it:
# pcs cluster start knoten04-hb
knoten04-hb: Starting Cluster...
The funny thing: pcs status on the old nodes tell me:
[root at knoten01 ~]# pcs status
Cluster name: zweiter
Last updated: Thu Aug 11 13:10:45 2016          Last change: Thu Aug 11 13:08:11 2016 by root via cibadmin on knoten01-hb
Stack: corosync
Current DC: knoten01-hb (version 1.1.13-10.el7-44eb2dd) - partition with quorum
4 nodes and 13 resources configured
Online: [ knoten01-hb knoten02-hb knoten03-hb ]
OFFLINE: [ knoten04-hb ]
Full list of resources:
 Clone Set: ResO2CB-clone [ResO2CB]
     Started: [ knoten01-hb knoten02-hb knoten03-hb ]
 Clone Set: ResOCFS2_00-clone [ResOCFS2_00]
     Started: [ knoten01-hb knoten02-hb knoten03-hb ]
 Clone Set: ClusterIP-clone [ClusterIP] (unique)
     ClusterIP:0        (ocf::heartbeat:IPaddr2):       Started knoten01-hb
     ClusterIP:1        (ocf::heartbeat:IPaddr2):       Started knoten02-hb
     ClusterIP:2        (ocf::heartbeat:IPaddr2):       Started knoten03-hb
 Clone Set: WebSite-clone [WebSite]
     Started: [ knoten01-hb knoten02-hb knoten03-hb ]
     Stopped: [ knoten04-hb ]
PCSD Status:
  knoten01-hb: Online
  knoten02-hb: Online
  knoten03-hb: Online
  knoten04-hb: Online
Daemon Status:
  corosync: active/disabled
  pacemaker: active/disabled
  pcsd: active/enabled
And pcs on the new node tells me that:
[root at knoten04 ~]# pcs status
Cluster name: zweiter
Last updated: Thu Aug 11 13:10:25 2016          Last change: Thu Aug 11 12:40:38 2016 by hacluster via crmd on knoten04-hb
Stack: corosync
Current DC: knoten04-hb (version 1.1.13-10.el7-44eb2dd) - partition WITHOUT quorum
4 nodes and 13 resources configured
Online: [ knoten04-hb ]
OFFLINE: [ knoten01-hb knoten02-hb knoten03-hb ]
Full list of resources:
 Clone Set: ResO2CB-clone [ResO2CB]
     Stopped: [ knoten01-hb knoten02-hb knoten03-hb knoten04-hb ]
 Clone Set: ResOCFS2_00-clone [ResOCFS2_00]
     Stopped: [ knoten01-hb knoten02-hb knoten03-hb knoten04-hb ]
 Clone Set: ClusterIP-clone [ClusterIP] (unique)
     ClusterIP:0        (ocf::heartbeat:IPaddr2):       Stopped
     ClusterIP:1        (ocf::heartbeat:IPaddr2):       Stopped
     ClusterIP:2        (ocf::heartbeat:IPaddr2):       Stopped
 Clone Set: WebSite-clone [WebSite]
     Stopped: [ knoten01-hb knoten02-hb knoten03-hb knoten04-hb ]
PCSD Status:
  knoten01-hb: Online
  knoten02-hb: Online
  knoten03-hb: Online
  knoten04-hb: Online
Daemon Status:
  corosync: active/disabled
  pacemaker: active/disabled
  pcsd: active/enabled
This is obviously no valid cluster. So what am I doing wrong? How to add the node, getting a working four-node cluster?
Thanks in advance for any help.
Thomas
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 819 bytes
Desc: This is a digitally signed message part.
URL: <https://lists.clusterlabs.org/pipermail/users/attachments/20160811/150ddcee/attachment-0003.sig>
    
    
More information about the Users
mailing list