[Pacemaker] Pacemaker with corosync/openais 1.0.0
    Andrew Beekhof 
    andrew at beekhof.net
       
    Wed Jul 15 09:21:22 UTC 2009
    
    
  
Are you using the very latest version from mercurial?
2009/7/15 Ante Karamatić <ivoks at ubuntu.com>:
> Hi
>
> I've been trying to get pacemaker to work with corosync 1.0.0. Pacemaker
> i used was snapshot of stable tree from yesterday. I've changed the
> source of corosync; renaming CRM_SERVICE into PCMK_SERVICE in
> include/corosync/corodefs.h.
>
> Whole thing compiles nicely and for most part it works. When started, I
> can see all pacemaker services running except cib; stonithd, lrmd,
> attrd, pengine and crmd.
>
> Corosync logs are looking all right, except for the 'cib' part:
>
> Jul 15 10:04:11 corosync [pcmk  ] notice: pcmk_peer_update: Transitional
> membership event on ring 8: memb=0, new=0, lost=0
> Jul 15 10:04:11 corosync [pcmk  ] notice: pcmk_peer_update: Stable
> membership event on ring 8: memb=1, new=1, lost=0
> Jul 15 10:04:11 corosync [pcmk  ] info: pcmk_peer_update: NEW:  pace-1
> 1984866496
> Jul 15 10:04:11 corosync [pcmk  ] info: pcmk_peer_update: MEMB: pace-1
> 1984866496
> Jul 15 10:04:11 corosync [pcmk  ] info: update_member: Node pace-1 now
> has process list: 00000000000000000000000000013312 (78610)
> Jul 15 10:04:11 corosync [TOTEM ] A processor joined or left the
> membership and a new membership was formed.
> Jul 15 10:04:11 corosync [MAIN  ] Completed service synchronization,
> ready to provide service.
> Jul 15 10:04:11 corosync [pcmk  ] ERROR: pcmk_wait_dispatch: Child
> process cib exited (pid=2280, rc=100)
> Jul 15 10:04:11 corosync [pcmk  ] notice: pcmk_wait_dispatch: Child
> process cib no longer wishes to be respawned
> Jul 15 10:04:11 corosync [pcmk  ] info: update_member: Node pace-1 now
> has process list: 00000000000000000000000000013212 (78354)
>
> Looking at the syslog messages from pacemaker I find this:
>
> Jul 15 10:04:11 pace-1 cib: [2280]: WARN: retrieveCib: Cluster
> configuration not found: /var/lib/heartbeat/crm/cib.xml
> Jul 15 10:04:11 pace-1 cib: [2280]: WARN: readCibXmlFile: Primary
> configuration corrupt or unusable, trying backup...
> Jul 15 10:04:11 pace-1 cib: [2280]: WARN: readCibXmlFile: Continuing
> with an empty configuration.
> ...
> Jul 15 10:04:11 pace-1 cib: [2280]: ERROR: init_ais_connection: No
> context created, but connection reported 'ok'
> Jul 15 10:04:11 pace-1 lrmd: [2281]: info: G_main_add_SignalHandler:
> Added signal handler for signal 17
> Jul 15 10:04:11 pace-1 attrd: [2282]: info: main: Starting up
> Jul 15 10:04:11 pace-1 pengine: [2283]: info: main: Starting pengine
> Jul 15 10:04:11 pace-1 crmd: [2284]: info: main: CRM Hg Version:
> a37d901f0276113d88667cd6b00257e96dbc267e
> Jul 15 10:04:11 pace-1 cib: [2280]: info: init_ais_connection:
> Connection to our AIS plugin (9) failed: Library error (2)
> Jul 15 10:04:11 pace-1 lrmd: [2281]: info: G_main_add_SignalHandler:
> Added signal handler for signal 10
> Jul 15 10:04:11 pace-1 attrd: [2282]: info: crm_cluster_connect:
> Connecting to OpenAIS
> Jul 15 10:04:11 pace-1 crmd: [2284]: info: crmd_init: Starting crmd
> Jul 15 10:04:11 pace-1 cib: [2280]: CRIT: cib_init: Cannot sign in to
> the cluster... terminating
> ...
> Jul 15 10:04:48 pace-1 crmd: [2284]: info: do_cib_control: Could not
> connect to the CIB service: connection failed
> Jul 15 10:04:48 pace-1 crmd: [2284]: WARN: do_cib_control: Couldn't
> complete CIB registration 13 times... pause and retry
>
> corosync.conf is copy of corosync.conf.example width additional
> pacemaker service:
>
> service {
>        # Load the Pacemaker Cluster Resource Manager
>        ver:       0
>        name:      pacemaker
> }
>
> Any ideas what is going on?
>
> _______________________________________________
> Pacemaker mailing list
> Pacemaker at oss.clusterlabs.org
> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
>
    
    
More information about the Pacemaker
mailing list