[Pacemaker] Tomcat and "unmanaged failed"

Andreas Kurz andreas at hastexo.com
Wed May 30 12:55:44 UTC 2012


Hi Andreas,

On 05/29/2012 04:14 PM, Stallmann, Andreas wrote:
> Hi there,
> 
>  
> 
> we have here a corosync/pacemaker cluster running tomcat. Sometimes our
> application running inside tomcat fails and tomcat dies.
> 
>  
> 
> This – for some reason I don’t understand – leads to an “unmanaged
> failed” state for tomcat diplayed in crm_mon. This would not been to
> bad, but at this point the cluster “decides” not to failover the
> resource to the second node.
> 
>  
> 
> My questions:
> 
>  
> 
> 1.       Is this a standard behaviour? Should a failover stop (or not
> take place at all), if a resource runs into an unmanaged failed state?

Yes, that is default behaviour ... Pacemaker tries to stop, that fails
so it must assume (worst case) it is still running, now STONITH would
trigger to make sure the node including the resource is definitely down
... without STONITH it stays unmanaged until cleared.

> 
> 2.       What conditions have to apply, before a resource is called
> “unmanaged failed”?

e.g. stop failures ;-)

> 
> 3.       Is there any way of an “automatic recover” of a resource that
> ran into an “unmanaged failed” state?

First attempt should be to fix your application. There is also the
"failure-timeout" resource meta-attribute ... in combination with the
cluster-recheck-interval cluster property, this clears resource failures
on a regular base.

Regards,
Andreas

-- 
Need help with Pacemaker?
http://www.hastexo.com/now


> 
>  
> 
> Cheers,
> 
>  
> 
> Andreas
> 
> --
> CONET Solutions GmbH
> Andreas Stallmann,
> Theodor-Heuss-Allee 19, 53773 Hennef
> Tel.: +49 2242 939-677, Fax: +49 2242 939-393
> Mobil: +49 172 2455051
> Internet: http://www.conet.de, mailto: AStallmann at CONET.DE
> <mailto:AStallmann at CONET.DE>
> 
>  
> 
> ----------------------------
> CONET Solutions GmbH, Theodor-Heuss-Allee 19, 53773 Hennef.
> Registergericht/Registration Court: Amtsgericht Siegburg (HRB Nr. 9136)
> Geschäftsführer/Managing Director: Anke Höfer 
> 
>  ----------------------------
> 
>  
> 
> 
> 
> _______________________________________________
> Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
> 
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs: http://bugs.clusterlabs.org



-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 222 bytes
Desc: OpenPGP digital signature
URL: <https://lists.clusterlabs.org/pipermail/pacemaker/attachments/20120530/c0888604/attachment-0004.sig>


More information about the Pacemaker mailing list