[Pacemaker] Pacemaker with corosync/openais 1.0.0

Ante Karamatić ivoks at ubuntu.com
Wed Jul 15 04:23:47 EDT 2009


Hi

I've been trying to get pacemaker to work with corosync 1.0.0. Pacemaker
i used was snapshot of stable tree from yesterday. I've changed the
source of corosync; renaming CRM_SERVICE into PCMK_SERVICE in
include/corosync/corodefs.h.

Whole thing compiles nicely and for most part it works. When started, I
can see all pacemaker services running except cib; stonithd, lrmd,
attrd, pengine and crmd.

Corosync logs are looking all right, except for the 'cib' part:

Jul 15 10:04:11 corosync [pcmk  ] notice: pcmk_peer_update: Transitional
membership event on ring 8: memb=0, new=0, lost=0
Jul 15 10:04:11 corosync [pcmk  ] notice: pcmk_peer_update: Stable
membership event on ring 8: memb=1, new=1, lost=0
Jul 15 10:04:11 corosync [pcmk  ] info: pcmk_peer_update: NEW:  pace-1
1984866496
Jul 15 10:04:11 corosync [pcmk  ] info: pcmk_peer_update: MEMB: pace-1
1984866496
Jul 15 10:04:11 corosync [pcmk  ] info: update_member: Node pace-1 now
has process list: 00000000000000000000000000013312 (78610)
Jul 15 10:04:11 corosync [TOTEM ] A processor joined or left the
membership and a new membership was formed.
Jul 15 10:04:11 corosync [MAIN  ] Completed service synchronization,
ready to provide service.
Jul 15 10:04:11 corosync [pcmk  ] ERROR: pcmk_wait_dispatch: Child
process cib exited (pid=2280, rc=100)
Jul 15 10:04:11 corosync [pcmk  ] notice: pcmk_wait_dispatch: Child
process cib no longer wishes to be respawned
Jul 15 10:04:11 corosync [pcmk  ] info: update_member: Node pace-1 now
has process list: 00000000000000000000000000013212 (78354)

Looking at the syslog messages from pacemaker I find this:

Jul 15 10:04:11 pace-1 cib: [2280]: WARN: retrieveCib: Cluster
configuration not found: /var/lib/heartbeat/crm/cib.xml
Jul 15 10:04:11 pace-1 cib: [2280]: WARN: readCibXmlFile: Primary
configuration corrupt or unusable, trying backup...
Jul 15 10:04:11 pace-1 cib: [2280]: WARN: readCibXmlFile: Continuing
with an empty configuration.
...
Jul 15 10:04:11 pace-1 cib: [2280]: ERROR: init_ais_connection: No
context created, but connection reported 'ok'
Jul 15 10:04:11 pace-1 lrmd: [2281]: info: G_main_add_SignalHandler:
Added signal handler for signal 17
Jul 15 10:04:11 pace-1 attrd: [2282]: info: main: Starting up
Jul 15 10:04:11 pace-1 pengine: [2283]: info: main: Starting pengine
Jul 15 10:04:11 pace-1 crmd: [2284]: info: main: CRM Hg Version:
a37d901f0276113d88667cd6b00257e96dbc267e
Jul 15 10:04:11 pace-1 cib: [2280]: info: init_ais_connection:
Connection to our AIS plugin (9) failed: Library error (2)
Jul 15 10:04:11 pace-1 lrmd: [2281]: info: G_main_add_SignalHandler:
Added signal handler for signal 10
Jul 15 10:04:11 pace-1 attrd: [2282]: info: crm_cluster_connect:
Connecting to OpenAIS
Jul 15 10:04:11 pace-1 crmd: [2284]: info: crmd_init: Starting crmd
Jul 15 10:04:11 pace-1 cib: [2280]: CRIT: cib_init: Cannot sign in to
the cluster... terminating
...
Jul 15 10:04:48 pace-1 crmd: [2284]: info: do_cib_control: Could not
connect to the CIB service: connection failed
Jul 15 10:04:48 pace-1 crmd: [2284]: WARN: do_cib_control: Couldn't
complete CIB registration 13 times... pause and retry

corosync.conf is copy of corosync.conf.example width additional
pacemaker service:

service {
        # Load the Pacemaker Cluster Resource Manager
        ver:       0
        name:      pacemaker
}

Any ideas what is going on?




More information about the Pacemaker mailing list