[Pacemaker] clone resource doesn't stop during node standby

Andrew Beekhof andrew at beekhof.net
Thu Mar 18 06:35:26 EDT 2010


2010/3/18 Junko IKEDA <ikedaj at intellilink.co.jp>:
> Hi,
>
> I run the following resources on two nodes.
>
> # crm_mon -1
>
> ============
> Last updated: Thu Mar 18 11:26:54 2010
> Stack: openais
> Current DC: cspm01 - partition with quorum
> Version: 1.0.8-2a76c6ac04bc stable-1.0 tip
> 2 Nodes configured, 2 expected votes
> 2 Resources configured.
> ============
>
> Online: [ cspm01 cspm02 ]
>
>     Resource Group: UMgroup01
>         UmDummy03  (ocf::heartbeat:Dummy): Started cspm01
>         UmDummy04  (ocf::heartbeat:Dummy01):       Started cspm01
>     Clone Set: clnUMgroup01
>         Started: [ cspm01 cspm02 ]
>
>
>
>
> I edited the Dummy01 RA, which runs as UmDummy04 in UMgroup01, to create a
> stop NG (stop failure) situation.
> After the stop failure of UmDummy04, the clone resource keeps running on
> both nodes.
>
>
>
> # crm_mon -1
>
> ============
> Last updated: Thu Mar 18 11:27:25 2010
> Stack: openais
> Current DC: cspm01 - partition with quorum
> Version: 1.0.8-2a76c6ac04bc stable-1.0 tip
> 2 Nodes configured, 2 expected votes
> 2 Resources configured.
> ============
>
> Online: [ cspm01 cspm02 ]
>
>     Resource Group: UMgroup01
>         UmDummy03  (ocf::heartbeat:Dummy): Started cspm01
>         UmDummy04  (ocf::heartbeat:Dummy01):       Started cspm01 (unmanaged) FAILED
>     Clone Set: clnUMgroup01
>         Started: [ cspm01 cspm02 ]
>
> Failed actions:
>        UmDummy04_monitor_10000 (node=cspm01, call=13, rc=7, status=complete): not running
>        UmDummy04_stop_0 (node=cspm01, call=14, rc=1, status=complete): unknown error
>
>
>
> I then tried to change the status of node cspm02 to "standby".
>
> The standby action succeeded,
> but the clone resource on the standby node was not stopped.
>
> # crm_mon -1
>
> ============
> Stack: openais
> Current DC: cspm01 - partition with quorum
> Version: 1.0.8-2a76c6ac04bc stable-1.0 tip
> 2 Nodes configured, 2 expected votes
> 2 Resources configured.
> ============
>
> Node cspm02: standby
> Online: [ cspm01 ]
>
>     Resource Group: UMgroup01
>         UmDummy03      (ocf::heartbeat:Dummy): Started cspm01
>         UmDummy04      (ocf::heartbeat:Dummy01):       Started cspm01 (unmanaged) FAILED
>     Clone Set: clnUMgroup01
>         Started: [ cspm01 cspm02 ]
>
> Failed actions:
>        UmDummy04_monitor_10000 (node=cspm01, call=13, rc=7, status=complete): not running
>        UmDummy04_stop_0 (node=cspm01, call=14, rc=1, status=complete): unknown error
>
>
> It seems that the constraint settings prevent the stop action because there
> is an unmanaged resource.

No. UmDummy04 is unmanaged because it failed to stop when we asked it to.
This therefore prevents UmDummy03 from being stopped (as required by
the semantics of group resources).
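
As a rough illustration of that group rule only (this is a toy model, not
Pacemaker's policy engine; the function name plan_group_stop and its arguments
are made up for this sketch), members of a group stop in the reverse of their
start order, and a member whose stop failed blocks everything that would have
to stop after it:

```python
def plan_group_stop(members, stop_failed):
    """Toy model of group stop ordering (hypothetical helper, not a Pacemaker API).

    members:     resource names in group (start) order
    stop_failed: set of resource names whose stop action failed
    Returns (stopped, blocked), both listed in the order stops are attempted.
    """
    stopped, blocked = [], []
    stuck = False
    # A group stops its members in the reverse of the start order.
    for name in reversed(members):
        if stuck or name in stop_failed:
            # Once one member cannot be stopped, every member that would
            # stop after it is blocked as well.
            stuck = True
            blocked.append(name)
        else:
            stopped.append(name)
    return stopped, blocked


if __name__ == "__main__":
    stopped, blocked = plan_group_stop(["UmDummy03", "UmDummy04"], {"UmDummy04"})
    print("stopped:", stopped)
    print("blocked:", blocked)
```

With UmDummy04 marked as failed, the sketch reports both UmDummy04 and
UmDummy03 as blocked, which matches the crm_mon output above where UmDummy03
stays Started on cspm01.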

> Is this expected?
>
> I think that the resources on cspm02 are not "unmanaged",
> so they should be stopped during standby.
>
> I attached hb_report.
>
>
> Thanks,
> Junko