[ClusterLabs] Resources not monitored in SLES11 SP4 (1.1.12-f47ea56)

Ulrich Windl Ulrich.Windl at rz.uni-regensburg.de
Tue Jun 26 02:14:11 EDT 2018


Hi!

We just observed a strange effect that we cannot explain in SLES 11 SP4 (pacemaker 1.1.12-f47ea56):
We run about a dozen Xen PVMs on a three-node cluster (plus some infrastructure and monitoring stuff). It has all worked well so far, and there was no significant change recently.
However, when a colleague stopped one VM for maintenance via a cluster command, the cluster did not notice when the PVM was actually running again (it had been started outside the cluster, which is a bad idea, I know).
Examining the logs, it seems that the recheck timer popped periodically, but no monitor action was run for the VM (the action is configured to run every 10 minutes).

Actually the only monitor operations found were:
May 23 08:04:13
Jun 13 08:13:03
Jun 25 09:29:04
Then a manual "reprobe" was done, and several monitor operations were run.
After that, I again see no more monitor actions in syslog.
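
For scale, the gaps between the three logged monitor operations can be compared with the configured 10-minute interval. This is only a minimal sketch in Python; the year 2018 is an assumption taken from the date of this mail (the syslog timestamps carry no year), and the variable names are made up for illustration.

```python
from datetime import datetime, timedelta

# The three monitor timestamps listed above; the year 2018 is assumed.
logged = [
    datetime(2018, 5, 23, 8, 4, 13),
    datetime(2018, 6, 13, 8, 13, 3),
    datetime(2018, 6, 25, 9, 29, 4),
]

# Monitor interval as configured for the VM resource (every 10 minutes).
interval = timedelta(minutes=10)

# Time elapsed between each pair of consecutive logged monitor operations.
gaps = [later - earlier for earlier, later in zip(logged, logged[1:])]

# How many full 10-minute intervals fit into each gap, i.e. roughly how many
# monitor runs one would have expected to see in the log during that time.
expected = [gap // interval for gap in gaps]

for gap, count in zip(gaps, expected):
    print(f"gap: {gap}, full monitor intervals in gap: {count}")
```

So each gap spans thousands of intervals rather than one or two, which is why a few delayed or skipped runs would not explain it.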

What could be the reasons for this? Too many operations defined?

The other message I don't understand looks like "<other-resource>: Rolling back scores from <vm-resource>".

Could it be a new bug introduced in pacemaker, or could it be some configuration problem (the status is completely clean, however)?

According to the package changelog, there has been no change since Nov 2016...

Regards,
Ulrich



