[ClusterLabs] two virtual domains start and stop every 15 minutes

Ken Gaillot kgaillot at redhat.com
Fri Jun 14 15:20:12 EDT 2019


On Fri, 2019-06-14 at 18:27 +0200, Lentes, Bernd wrote:
> Hi,
> 
> i had that problem already once but still it's not clear for me what
> really happens.
> I had this problem some days ago:
> I have a 2-node cluster with several virtual domains as resources. I
> put one node (ha-idg-2) into standby, and two running virtual domains
> were migrated to the other node (ha-idg-1). The other virtual domains
> were already running on ha-idg-1.
> Since then the two virtual domains which migrated (vm_idcc_devel and
> vm_severin) start or stop every 15 minutes on ha-idg-1.
> ha-idg-2 resides in standby.
> I know that the 15 minutes interval is related to the "cluster-
> recheck-interval".
> But why are these two domains started and stopped ?
> I looked around much in the logs, checked the pe-input files, watched
> some graphs created by crm_simulate with dotty ...
> I always see that the domains are started and 15 minutes later
> stopped and 15 minutes later started ...
> but i don't see WHY. I would really like to know that.
> And why are the domains not started from the monitor resource
> operation ? It should recognize that the domain is stopped and starts
> it again. My monitor interval is 30 seconds.
> I had two errors pending concerning these domains, a failed migrate
> from ha-idg-1 to ha-idg-2, form some time before.
> Could that be the culprit ?
> 
> I still have all the logs from that time, if you need information
> just let me know.

Yes the logs and pe-input files would be helpful. It sounds like a bug
in the scheduler. What version of pacemaker are you running?

> 
> Thanks.
> 
> 
> Bernd
-- 
Ken Gaillot <kgaillot at redhat.com>



More information about the Users mailing list