[ClusterLabs] Xen Migration/resource cleanup problem in SLES11 SP3
Ulrich.Windl at rz.uni-regensburg.de
Thu Oct 8 08:29:08 EDT 2015
I'd like to report an "interesting problem" with SLES11 SP3+HAE (latest updates):
When doing "rcopenais stop" on node "h10" with three Xen-VMs running, the cluster tried to migrate those VMs to other nodes (OK).
However migration failed on the remote nodes, but the cluster thought migration was successfully. Later the cluster restarted the VMs (BAD).
Oct 8 13:19:17 h10 Xen(prm_xen_v07): INFO: v07: xm migrate to h01 succeeded.
Oct 8 13:20:38 h01 Xen(prm_xen_v07): ERROR: v07: Not active locally, migration failed!
Oct 8 13:44:53 h01 pengine: warning: unpack_rsc_op_failure: Processing failed op migrate_from for prm_xen_v07 on h01: unknown error (1)
Things are really bad after h10 was rebooted eventually: The cluster restarted the three VMs again, because it thought those VMs were still running on h10! (VERY BAD)
During startup, the cluster did nor probe the three VMs.
Oct 8 14:14:20 h01 pengine: warning: unpack_rsc_op_failure: Processing failed op migrate_from for prm_xen_v07 on h01: unknown error (1)
Oct 8 14:14:20 h01 pengine: notice: LogActions: Restart prm_xen_v07 (Started h10)
Oct 8 14:14:20 h01 crmd: notice: te_rsc_command: Initiating action 89: stop prm_xen_v07_stop_0 on h01 (local)
More information about the Users