Would you be able to direct me to some documentation for configuring STONITH based on my environment? The Clusters from Scratch document talks about a fence_ipmilan driver, which I seem to not have on my centos 6 install. I only show the fence_pcmk driver.<br>
<br>Reading the metadata for fence_pcmk, I don't quite understand how it should be configured.<br><br>Thanks!<br><br>--Cal<br><br><div class="gmail_quote">On Fri, Jul 27, 2012 at 10:56 AM, Phil Frost <span dir="ltr"><<a href="mailto:phil@macprofessionals.com" target="_blank">phil@macprofessionals.com</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div class="im">On 07/27/2012 11:48 AM, Cal Heldenbrand wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Why wouldn't my mem3 failover happen if it timed out stopping the cluster IP?<br>
</blockquote>
<br></div>
If a stop action fails, pacemaker can't know if the resource is running, not running, or in some other broken state. The cluster is in an unknown state, and there's no reasonable thing pacemaker can do. Since pacemaker thinks a node is broken (it failed to stop a resource, as requested) but isn't sure, the solution is to transition to a known state by powering the node off, resetting it, or otherwise fencing it. Configure a STONITH resource to do this. Without STONITH, your only option is to manually address the cause of the failure (high load, in this case), then issue "crm resource cleanup ..." on any failed resources to instruct pacemaker that it is safe to try again.<br>
<br>
</blockquote></div><br>