[ClusterLabs] Antw: Re: Problem with stonith and starting services
Ulrich.Windl at rz.uni-regensburg.de
Thu Jul 6 01:54:14 EDT 2017
>>> Cesar Hernandez <c.hernandez at medlabmg.com> wrote on 05.07.2017 at 08:50 in
message <AEF93306-C409-43C2-8222-CD457CC153C4 at medlabmg.com>:
>> Might be kind of a strange race as well ... but without knowing what the
>> script actually does ...
> The script first tries to reboot the node using ssh, something like ssh $NODE
> reboot -f, then runs a remote reboot using the AWS API
AFAIK that's not proper fencing. SunOS once had a "fasthalt" command. In Linux, "halt -nf" might do something similar, or you could trigger an immediate reboot via sysrq (echo b > /proc/sysrq-trigger).
Fencing is anything but a clean shutdown. The specific problem is that a shutdown may be performed asynchronously: the command returns, but the system is still in the process of shutting down (and may hang indefinitely).
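
To illustrate the point about asynchronous shutdown: a fence script should not report success just because the reboot command returned. A minimal sketch, assuming there is some out-of-band command that prints the node's power state (the name my_power_query below is hypothetical, as is the function wait_for_state itself), could poll until the expected state is actually reported:

```shell
#!/bin/sh
# wait_for_state QUERY_CMD WANTED TRIES DELAY
# QUERY_CMD is an assumed command that prints the node's power state as
# seen from outside the node (cloud API, hypervisor, management board).
# Returns 0 only once QUERY_CMD prints WANTED; returns 1 after TRIES polls.
wait_for_state() {
    query=$1
    wanted=$2
    tries=$3
    delay=$4
    n=0
    while [ "$n" -lt "$tries" ]; do
        # $query is left unquoted on purpose so it may carry arguments
        state=$($query 2>/dev/null)
        if [ "$state" = "$wanted" ]; then
            return 0    # confirmed from outside the node
        fi
        n=$((n + 1))
        sleep "$delay"
    done
    return 1            # never confirmed: fencing must count as failed
}

# Hypothetical usage inside a fence script:
#   wait_for_state "my_power_query $NODE" stopped 30 2 || exit 1
```

If the state is never confirmed, the script should exit non-zero so that the cluster does not treat the node as fenced and start services elsewhere.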