[Pacemaker] problem starting new instance of pacemaker (via corosync)

John White jwhite at lbl.gov
Fri Sep 7 11:28:58 EDT 2012


An odd update to this: we run in a stateless environment (nodes are PXE-booted and have NFS roots, etc.).  The same install on a VM works just fine.  Does anyone have experience with Pacemaker on stateless nodes?
----------------
John White
HPC Systems Engineer
(510) 486-7307
One Cyclotron Rd, MS: 50C-3209C
Lawrence Berkeley National Lab
Berkeley, CA 94720

On Sep 6, 2012, at 2:49 PM, John White <jwhite at lbl.gov> wrote:

> Hello Folks,
> 	I'm having a very hard time getting a basic Pacemaker setup going.  I've gotten corosync up and running just fine from what I can tell, but once I start running pacemaker commands, I get CIB errors everywhere:
> 
> -bash-4.1# crm configure
> Signon to CIB failed: connection failed
> Init failed, could not perform requested operations
> ERROR: cannot parse xml: no element found: line 1, column 0
> crm(live)configure#
> 
> Digging deeper, I see both attrd and cib failing to connect to the AIS plugin:
> 
> Sep 06 14:42:52 n0014.lustre attrd: [13225]: notice: crm_cluster_connect: Connecting to cluster infrastructure: classic openais (with plugin)
> Sep 06 14:42:52 n0014.lustre attrd: [13225]: ERROR: main: HA Signon failed
> Sep 06 14:42:52 n0014.lustre attrd: [13225]: ERROR: main: Aborting startup
> -snip-
> Sep 06 14:42:52 n0014.lustre cib: [13223]: info: get_cluster_type: Cluster type is: 'openais'
> Sep 06 14:42:52 n0014.lustre cib: [13223]: notice: crm_cluster_connect: Connecting to cluster infrastructure: classic openais (with plugin)
> Sep 06 14:42:52 n0014.lustre cib: [13223]: info: init_ais_connection_classic: Creating connection to our Corosync plugin
> Sep 06 14:42:52 n0014.lustre cib: [13223]: info: init_ais_connection_classic: Connection to our AIS plugin (10) failed: Library error (2)
> Sep 06 14:42:52 n0014.lustre cib: [13223]: CRIT: cib_init: Cannot sign in to the cluster… terminating
> 
> 
> I'm really at a loss here after 3 days.  Any ideas or hints as to where I might find a solution?  More logging is available upon request.
> 
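For comparing the stateless node against the working VM, it may help to reduce each node's log to just the daemons that report a failure. The following is a minimal sketch in Python, not part of Pacemaker or corosync. It assumes the log line layout seen in the excerpt quoted above (timestamp, host, daemon, [pid], level, function, message). The names LOG_LINE, FAILURE_LEVELS, and failures_by_daemon are hypothetical, and the sample lines are copied from the quoted message.

```python
import re
from collections import defaultdict

# Assumed layout, taken from the quoted excerpt:
#   "Sep 06 14:42:52 n0014.lustre cib: [13223]: CRIT: cib_init: message"
# An optional leading ">" (mail quoting) is tolerated.
LOG_LINE = re.compile(
    r"^(?:>\s*)?"
    r"(?P<timestamp>\w{3} \d{2} \d{2}:\d{2}:\d{2}) "
    r"(?P<host>\S+) "
    r"(?P<daemon>[\w-]+): \[(?P<pid>\d+)\]: "
    r"(?P<level>\w+): "
    r"(?P<function>\w+): "
    r"(?P<message>.*)$"
)

# Levels treated as failures in this sketch (an assumption, not an official list).
FAILURE_LEVELS = {"ERROR", "CRIT"}


def failures_by_daemon(lines):
    """Group failure lines by daemon name.

    A line counts as a failure if its level is ERROR or CRIT, or if its
    message contains the word "failed" (some failures are logged at info).
    Returns {daemon: [(level, function, message), ...]}.
    """
    failures = defaultdict(list)
    for line in lines:
        match = LOG_LINE.match(line.strip())
        if not match:
            continue  # not in the assumed layout, e.g. "-snip-"
        level = match.group("level")
        message = match.group("message")
        if level in FAILURE_LEVELS or "failed" in message.lower():
            failures[match.group("daemon")].append(
                (level, match.group("function"), message)
            )
    return dict(failures)


# Sample lines copied from the quoted message.
SAMPLE = [
    "> Sep 06 14:42:52 n0014.lustre attrd: [13225]: ERROR: main: HA Signon failed",
    "> Sep 06 14:42:52 n0014.lustre attrd: [13225]: ERROR: main: Aborting startup",
    "> -snip-",
    "> Sep 06 14:42:52 n0014.lustre cib: [13223]: info: get_cluster_type: Cluster type is: 'openais'",
    "> Sep 06 14:42:52 n0014.lustre cib: [13223]: info: init_ais_connection_classic: Connection to our AIS plugin (10) failed: Library error (2)",
    "> Sep 06 14:42:52 n0014.lustre cib: [13223]: CRIT: cib_init: Cannot sign in to the cluster… terminating",
]

if __name__ == "__main__":
    for daemon, entries in failures_by_daemon(SAMPLE).items():
        for level, function, message in entries:
            print(f"{daemon} [{level}] {function}: {message}")
```

Matching on the word "failed" as well as on the level is a deliberate choice here: in the excerpt, the line carrying the underlying "Library error (2)" is logged at info level and would be missed by a level-only filter. Running the same script over logs from the VM and from the stateless node would show which daemons fail only on the stateless node.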




