[ClusterLabs] Xen Migration/resource cleanup problem in SLES11 SP3

Dejan Muhamedagic dejanmm at fastmail.fm
Thu Oct 8 10:13:57 EDT 2015


Hi,

On Thu, Oct 08, 2015 at 02:29:08PM +0200, Ulrich Windl wrote:
> Hi!
> 
> I'd like to report an "interesting problem" with SLES11 SP3+HAE (latest updates):
> 
> When doing "rcopenais stop" on node "h10" with three Xen-VMs running, the cluster tried to migrate those VMs to other nodes (OK).
> 
> However migration failed on the remote nodes, but the cluster thought migration was successfully. Later the cluster restarted the VMs (BAD).
> 
> Oct  8 13:19:17 h10 Xen(prm_xen_v07)[16537]: INFO: v07: xm migrate to h01 succeeded.
> Oct  8 13:20:38 h01 Xen(prm_xen_v07)[9027]: ERROR: v07: Not active locally, migration failed!

xm did report success in migrate_to, but the overall migration
should've been considered failed, because migrate_from failed. Do
you have a too low timeout? The failure msg is logged 81 second
later, provided the clocks are in sync.

> Oct  8 13:44:53 h01 pengine[18985]:  warning: unpack_rsc_op_failure: Processing failed op migrate_from for prm_xen_v07 on h01: unknown error (1)
> 
> Things are really bad after h10 was rebooted eventually: The cluster restarted the three VMs again, because it thought those VMs were still running on h10! (VERY BAD)
> During startup, the cluster did nor probe the three VMs.

If a node restarted, how could anything think that there was
anything there still running. Strange.

But anyway, the if the migrate_from fails, then the resource
should still be running at the origin host, right?

Thanks,

Dejan

> Oct  8 14:14:20 h01 pengine[18985]:  warning: unpack_rsc_op_failure: Processing failed op migrate_from for prm_xen_v07 on h01: unknown error (1)
> 
> Oct  8 14:14:20 h01 pengine[18985]:   notice: LogActions: Restart prm_xen_v07 (Started h10)
> 
> Oct  8 14:14:20 h01 crmd[18986]:   notice: te_rsc_command: Initiating action 89: stop prm_xen_v07_stop_0 on h01 (local)
> 
> ...
> 
> Regards,
> Ulrich
> 
> 
> 
> _______________________________________________
> Users mailing list: Users at clusterlabs.org
> http://clusterlabs.org/mailman/listinfo/users
> 
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs: http://bugs.clusterlabs.org




More information about the Users mailing list