[ClusterLabs] [Problem] The crmd fails to connect with pengine.

renayama19661014 at ybb.ne.jp renayama19661014 at ybb.ne.jp
Thu Dec 27 15:51:37 EST 2018


Hi All,

This problem occurred with our users.

The following problem occurred in a two-node cluster that does not set STONITH.

The problem seems to have occurred in the following procedure.

Step 1) Configure the cluster with 2 nodes. The DC node is the second node.
Step 2) Several resources are running on the first node.
Step 3) It stops almost at the same time in order of 2nd node and 1st node.
Step 4) After the second node stops, the first node tries to calculate the state transition for the resource stop.

However, crmd fails to connect with pengine and does not calculate state transitions.

-----
Dec 27 08:36:00 rh74-01 crmd[12997]: warning: Setup of client connection failed, not adding channel to mainloop
-----

As a result, Pacemaker will stop without stopping the resource.

The problem seems to have occurred in the following environment.

 - libqb 1.0
 - corosync 2.4.1
 - Pacemaker 1.1.15

I tried to reproduce this problem, but for now it can not be reproduced.

Do you know the cause of this problem?

Best Regards,
Hideo Yamacuhi.


More information about the Users mailing list