[ClusterLabs] Xen Migration/resource cleanup problem in SLES11 SP3

Cleber Paiva de Souza cleberps at gmail.com
Thu Oct 8 13:20:54 EDT 2015


Are both machines identical hardware/version/model? We found that machines
with different CPU features crash while migrating from the machine with
more features to one with few features.
Also are your STONITH ok? STONITH protects from that muti-running behavior.


On Thu, Oct 8, 2015 at 9:29 AM, Ulrich Windl <
Ulrich.Windl at rz.uni-regensburg.de> wrote:

> Hi!
>
> I'd like to report an "interesting problem" with SLES11 SP3+HAE (latest
> updates):
>
> When doing "rcopenais stop" on node "h10" with three Xen-VMs running, the
> cluster tried to migrate those VMs to other nodes (OK).
>
> However migration failed on the remote nodes, but the cluster thought
> migration was successfully. Later the cluster restarted the VMs (BAD).
>
> Oct  8 13:19:17 h10 Xen(prm_xen_v07)[16537]: INFO: v07: xm migrate to h01
> succeeded.
> Oct  8 13:20:38 h01 Xen(prm_xen_v07)[9027]: ERROR: v07: Not active
> locally, migration failed!
>
> Oct  8 13:44:53 h01 pengine[18985]:  warning: unpack_rsc_op_failure:
> Processing failed op migrate_from for prm_xen_v07 on h01: unknown error (1)
>
> Things are really bad after h10 was rebooted eventually: The cluster
> restarted the three VMs again, because it thought those VMs were still
> running on h10! (VERY BAD)
> During startup, the cluster did nor probe the three VMs.
>
> Oct  8 14:14:20 h01 pengine[18985]:  warning: unpack_rsc_op_failure:
> Processing failed op migrate_from for prm_xen_v07 on h01: unknown error (1)
>
> Oct  8 14:14:20 h01 pengine[18985]:   notice: LogActions: Restart
> prm_xen_v07 (Started h10)
>
> Oct  8 14:14:20 h01 crmd[18986]:   notice: te_rsc_command: Initiating
> action 89: stop prm_xen_v07_stop_0 on h01 (local)
>
> ...
>
> Regards,
> Ulrich
>
>
>
> _______________________________________________
> Users mailing list: Users at clusterlabs.org
> http://clusterlabs.org/mailman/listinfo/users
>
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs: http://bugs.clusterlabs.org
>



-- 
Cleber Paiva de Souza
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.clusterlabs.org/pipermail/users/attachments/20151008/0b2d2f01/attachment-0003.html>


More information about the Users mailing list