[Pacemaker] Cluster with fc18 and fc17 nodes.

Andrew Beekhof andrew at beekhof.net
Tue Aug 27 23:38:03 UTC 2013


On 27/08/2013, at 6:11 PM, Francis SOUYRI <francis.souyri at apec.fr> wrote:

> Hello Andrew,
> 
> 	I do a new test, sorry you are right cibadmin is clever enough.
> 
> But why il have this:

Because they are left over from previous incarnations.
Shut down the cluster and run:

  CIB_file=/path/to/cib.xml cibadmin --replace --xml-text '<nodes/>'

That should clear out the nodes section

> 
> on FC18:
>    <nodes>
>       <node id="-1062731267" uname="noeud1.apec.fr" type="normal"/>
>       <node id="-33445696" uname="noeud2.apec.fr" type="normal"/>
>       <node id="3232236029" uname="noeud1.apec.fr"/>
>       <node id="4261521600" uname="noeud2.apec.fr"/>
>       <node id="1" uname="noeud1.apec.fr"/>
>       <node id="2" uname="noeud2.apec.fr"/>
>     </nodes>
> 
> on FC17:
>     <nodes>
>       <node id="-1062731267" uname="noeud1.apec.fr" type="normal"/>
>       <node id="-33445696" uname="noeud2.apec.fr" type="normal"/>
>       <node id="2" uname="noeud2.apec.fr" type="normal"/>
>       <node id="1" uname="noeud1.apec.fr" type="normal"/>
>     </nodes>
> 
> My corosync conf is like this:
> 
> totem {
> version: 2
> secauth: off
> cluster_name: cluster
>  interface {
>        ringnumber: 0
>        bindnetaddr: 192.168.1.0
>        ttl: 1
>  }
> transport: udpu
> }
> 
> nodelist {
>  node {
>        ring0_addr: noeud1.apec.fr
> 	nodeid: 1
>       }
>  node {
>        ring0_addr: noeud2.apec.fr
> 	nodeid: 2
>       }
> }
> 
> 
> Best regards.
> 
> Francis
> 
> On 08/27/2013 03:11 AM, Andrew Beekhof wrote:
>> 
>> On 26/08/2013, at 7:48 PM, Francis SOUYRI <francis.souyri at apec.fr> wrote:
>> 
>>> Hello Andrew,
>>> 
>>> About your document "http://blog.clusterlabs.org/blog/2013/mixing-pacemaker-versions/"
>>> 
>>> 
>>>    1. stop the cluster on both nodes
>>>    2. on both nodes, run:
>>> 
>>>    CIB_file=/path/to/cib.xml cibadmin -M -X '<cib crm_feature_set="3.0.7"/>'
>>> 
>>>    3. start node1 and wait until it is elected as the DC
>>>    4. start node2
>>> 
>>> How can I execute the command "cibadmin -M -X '<cib crm_feature_set="3.0.6"/>'" when the cluster is stopped ?
>> 
>> cibadmin is clever enough to talk directly to the contents of CIB_file
>> 
>>> 
>>> I started fc18 node, execute "cibadmin -M -X '<cib crm_feature_set="3.0.6"/>'", started fc17 node, now the nodes talk but I have this.
>>> 
>>> on FC18:
>>>   <nodes>
>>>      <node id="-1062731267" uname="noeud1.apec.fr" type="normal"/>
>>>      <node id="-33445696" uname="noeud2.apec.fr" type="normal"/>
>>>      <node id="3232236029" uname="noeud1.apec.fr"/>
>>>      <node id="4261521600" uname="noeud2.apec.fr"/>
>>>      <node id="1" uname="noeud1.apec.fr"/>
>>>      <node id="2" uname="noeud2.apec.fr"/>
>>>    </nodes>
>>> 
>>> on FC17:
>>>    <nodes>
>>>      <node id="-1062731267" uname="noeud1.apec.fr" type="normal"/>
>>>      <node id="-33445696" uname="noeud2.apec.fr" type="normal"/>
>>>      <node id="2" uname="noeud2.apec.fr" type="normal"/>
>>>      <node id="1" uname="noeud1.apec.fr" type="normal"/>
>>>    </nodes>
>>> 
>>> The nodes have two networks, 192.168.1.0/24 for external communication and 10.1.1.0/24 with bonding for drbd. Corosync used 192.168.1.0 with udpu.
>>> 
>>> Best regards.
>>> 
>>> Francis
>>> 
>>> On 08/26/2013 01:42 AM, Andrew Beekhof wrote:
>>>> 
>>>> On 23/08/2013, at 7:18 PM, Francis SOUYRI <francis.souyri at apec.fr> wrote:
>>>> 
>>>>> Hello,
>>>>> 
>>>>> For a long time I used heartbeat/drbd for 2 nodes clusters with Fedora, I used the internal crm of heartbeat not pacemaker.
>>>>> 
>>>>> I planned to upgrade from the fc17 to the fc18, but on fc18 heartbeat is obsolete and I have to change to corosync/pacemaker.
>>>>> For information the heartbeat fc17 package work fine on fc18 and the cluster with a node fc17 and the other fc18 (without the firewall activated by default !!!) work perfectly (The final configuration is to have the both node in fc18).
>>>>> 
>>>>> But the corosync/pacemaker do not work with a fc17 node and a fc18 node.
>>>>> 
>>>>> I have these packages.
>>>>> 
>>>>> drbd-pacemaker-8.4.2-1.fc17.i686
>>>>> pacemaker-libs-1.1.7-2.fc17.i686
>>>>> pacemaker-1.1.7-2.fc17.i686
>>>>> corosync-2.3.0-1.fc17.i686
>>>>> corosynclib-2.3.0-1.fc17.i686
>>>>> pacemaker-cli-1.1.7-2.fc17.i686
>>>>> pacemaker-cluster-libs-1.1.7-2.fc17.i686
>>>>> 
>>>>> pacemaker-libs-1.1.9-0.1.70ad9fa.git.fc18.i686
>>>>> pacemaker-1.1.9-0.1.70ad9fa.git.fc18.i686
>>>>> drbd-pacemaker-8.4.2-1.fc18.i686
>>>>> pacemaker-cluster-libs-1.1.9-0.1.70ad9fa.git.fc18.i686
>>>>> pacemaker-cli-1.1.9-0.1.70ad9fa.git.fc18.i686
>>>>> corosynclib-2.3.1-1.fc18.i686
>>>>> corosync-2.3.1-1.fc18.i686
>>>>> 
>>>>> The corosync config :
>>>>> 
>>>>> totem {
>>>>> version: 2
>>>>> secauth: off
>>>>> cluster_name: cluster
>>>>>  interface {
>>>>>        ringnumber: 0
>>>>>        bindnetaddr: 192.168.1.0
>>>>>        ttl: 1
>>>>>  }
>>>>> transport: udpu
>>>>> }
>>>>> 
>>>>> nodelist {
>>>>>  node {
>>>>>        ring0_addr: noeud1.xxxx.fr
>>>>>       }
>>>>>  node {
>>>>>        ring0_addr: noeud2.xxxx.fr
>>>>>       }
>>>>> }
>>>>> 
>>>>> quorum {
>>>>> provider: corosync_votequorum
>>>>> }
>>>>> 
>>>>> logging {
>>>>> to_syslog: yes
>>>>> debug: off
>>>>> }
>>>>> 
>>>>> A short time after starting pacemaker I have this.
>>>>> 
>>>>> FC18 node:
>>>>> 
>>>>> Corosync Nodes:
>>>>> noeud1.xxxx.fr noeud2.xxxx.fr
>>>>> Pacemaker Nodes:
>>>>> noeud1.xxxx.fr noeud1.xxxx.fr noeud2.xxxx.fr noeud2.xxxx.fr
>>>>> 
>>>>> <node id="-33445696" uname="noeud2.xxxx.fr" type="normal"/>
>>>>> <node id="-1062731267" uname="noeud1.xxxx.fr" type="normal"/>
>>>>> <node id="3232236029" uname="noeud1.xxxx.fr"/>
>>>>> <node id="4261521600" uname="noeud2.xxxx.fr"/>
>>>>> 
>>>>> Why four nodes ?!? What are the nodes 3232236029 and 4261521600 ?
>>>> 
>>>> The same as the other two but stored as %u (unsigned int) instead of %d (signed int).
>>>> This was a bug in older versions, you can work around it by specifying a (small) nodeid in corosync.conf
>>>> 
>>>>> 
>>>>> FC17 node:
>>>>> 
>>>>> Corosync Nodes:
>>>>> noeud1.xxxx.fr noeud2.xxxx.fr
>>>>> Pacemaker Nodes:
>>>>> noeud1.xxxx.fr noeud2.xxxx.fr
>>>>> 
>>>>> <node id="-33445696" uname="noeud2.xxxx.fr" type="normal"/>
>>>>> <node id="-1062731267" uname="noeud1.xxxx.fr" type="normal"/>
>>>>> 
>>>>> On the FC17 I have some messages like this "error: cib_perform_op: Discarding update with feature set '3.0.7' greater than our own '3.0.6'".
>>>>> On the FC18 "warning: cib_process_replace: Replacement 0.5.4 from noeud2.xxxx.fr not applied to 0.9.0: current epoch is greater than the replacement"
>>>>> 
>>>>> Pacemaker 1.1.7 and 1.1.9 are not compatible ?
>>>> 
>>>> This should provide some more information:
>>>>    http://blog.clusterlabs.org/blog/2013/mixing-pacemaker-versions/
>>>> 
>>>> 
>>> 
>>> 
>>> _______________________________________________
>>> Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
>>> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
>>> 
>>> Project Home: http://www.clusterlabs.org
>>> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
>>> Bugs: http://bugs.clusterlabs.org
>> 
> 
> 
> _______________________________________________
> Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
> 
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs: http://bugs.clusterlabs.org

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 841 bytes
Desc: Message signed with OpenPGP using GPGMail
URL: <https://lists.clusterlabs.org/pipermail/pacemaker/attachments/20130828/ad5c65dd/attachment-0004.sig>


More information about the Pacemaker mailing list