[Pacemaker] pacemaker-1.1.10 results in Failed to sign on to the LRM 7

David Vossel dvossel at redhat.com
Wed May 15 11:18:41 EDT 2013


----- Original Message -----
> From: "Andrew Widdersheim" <awiddersheim at hotmail.com>
> To: pacemaker at oss.clusterlabs.org
> Sent: Wednesday, May 15, 2013 7:53:56 AM
> Subject: [Pacemaker] pacemaker-1.1.10 results in Failed to sign on to the LRM 7
> 
> I am running the following versions:
> 
> pacemaker-1.1.10-rc2
> cluster-glue-1.0.11
> heartbeat-3.0.5

What libqb version do you have?

> 
> I was running pacemaker-1.1.6 and things were working fine but after updating
> to the latest I could not get pacemaker to start with the following message
> repeated in the logs:
> 
> crmd[8456]:  warning: do_lrm_control: Failed to sign on to the LRM 7 (30 max)
> times
> 
> Here is strace output from the crmd process:
>
> 0.000023 recvfrom(5, 0xc513f9, 2487, 64, 0, 0) = -1 EAGAIN (Resource
> temporarily unavailable)
> 0.000021 poll([{fd=5, events=0}], 1, 0) = 0 (Timeout)
> 0.000574 socket(PF_FILE, SOCK_STREAM, 0) = 6
> 0.000042 fcntl(6, F_GETFD)         = 0
> 0.000025 fcntl(6, F_SETFD, FD_CLOEXEC) = 0
> 0.000021 fcntl(6, F_SETFL, O_RDONLY|O_NONBLOCK) = 0
> 0.000055 connect(6, {sa_family=AF_FILE, path=@"lrmd"}, 110) = -1 ECONNREFUSED
> (Connection refused)
> 0.000050 close(6)                  = 0
> 0.000031 shutdown(4294967295, 2 /* send and receive */) = -1 EBADF (Bad file
> descriptor)
> 0.000024 close(4294967295)         = -1 EBADF (Bad file descriptor)
> 0.000039 write(2, "Could not establish lrmd connect"..., 62) = 62
> 0.000058 sendto(3, "<28>May 14 18:54:51 crmd[8456]: "..., 104, MSG_NOSIGNAL,
> NULL, 0) = 104
> 0.000327 times({tms_utime=0, tms_stime=1, tms_cutime=0, tms_cstime=0}) =
> 430616237
> 0.000028 recvfrom(5, 0xc513f9, 2487, 64, 0, 0) = -1 EAGAIN (Resource
> temporarily unavailable)
> 0.000025 poll([{fd=5, events=0}], 1, 0) = 0 (Timeout)
> 0.000026 recvfrom(5, 0xc513f9, 2487, 64, 0, 0) = -1 EAGAIN (Resource
> temporarily unavailable)
> 0.000023 poll([{fd=5, events=0}], 1, 0) = 0 (Timeout)
> 0.000023 recvfrom(5, 0xc513f9, 2487, 64, 0, 0) = -1 EAGAIN (Resource
> temporarily unavailable)
> 0.000023 poll([{fd=5, events=0}], 1, 0) = 0 (Timeout)
> 
> I'm not quite sure what the issue is. At first I thought it might have been
> some type of permissions issue, but I'm not quite sure that is the case
> anymore. Any help would be appreciated. I can forward along any more
> details to help in troubleshooting.

Is there anything in the logs that indicates a problem with the lrmd component? Do you see lrmd listed in 'ps -axf' output?
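
The ECONNREFUSED on connect() to @"lrmd" in the strace output is what one would typically see when nothing is listening on that abstract socket. Below is a minimal sketch, in Python rather than anything Pacemaker ships, for probing that socket by hand. The helper names (abstract_address, probe_abstract_socket) are hypothetical. Padding the name with NUL bytes to the full sun_path size is an assumption inferred from the address length of 110 in the strace output, and the whole thing is Linux-only.

```python
import errno
import socket

# Size of sun_path in struct sockaddr_un on Linux (assumption: Linux only).
SUN_PATH_LEN = 108


def abstract_address(name: str, pad: bool = True) -> bytes:
    """Build a Linux abstract-namespace address for `name`.

    A leading NUL byte selects the abstract namespace. With pad=True the
    name is NUL-padded to the full sun_path size, which gives the address
    length of 110 (2 + 108) seen in the strace output. The padding is an
    assumption drawn from that length, not from Pacemaker documentation.
    """
    raw = b"\0" + name.encode("ascii")
    if len(raw) > SUN_PATH_LEN:
        raise ValueError("socket name too long")
    return raw.ljust(SUN_PATH_LEN, b"\0") if pad else raw


def probe_abstract_socket(name: str, pad: bool = True) -> str:
    """Return 'listening', 'refused', or 'error: <errno name>'."""
    sock = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
    try:
        sock.connect(abstract_address(name, pad))
        return "listening"
    except OSError as exc:
        if exc.errno == errno.ECONNREFUSED:
            return "refused"
        return "error: " + errno.errorcode.get(exc.errno, str(exc))
    finally:
        sock.close()
```

Calling probe_abstract_socket("lrmd") on the affected node and getting "refused" would be consistent with lrmd not running, or never having bound its socket. A result of "listening" would point the investigation elsewhere.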

-- Vossel

> _______________________________________________
> Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
> 
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs: http://bugs.clusterlabs.org
>



