[Pacemaker] Query regarding configuring STONITH device

sachin garg sachingarg2k1 at gmail.com
Mon Jul 2 12:19:38 UTC 2012


Hi,

I am using IPMI plugin for configuring STONITH with heartbeat cluster.
If a resource fails on one node then the other node STONITHs that node. But
when the failed node comes back after the reboot, the STONITH device itself
fails on the node which has started again. Logs indicate that IPMI start
operation returned 1 (i.e. unknown error). I suspect that this may be due
to some initialization delays at network level. But I am not sure about
this. What could be the best way to overcome this issue? I consider adding
a start delay to stonith device but can't say if that is the right
approach.

Moreover, how should one configure start/monitor operation failure for a
STONITH device? I have currently configured pacemaker to fence the node if
start/monitor operation fails for STONITH device. Is this the right
configuration?

And what should be the monitoring frequency for STONITH device?

Regards
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.clusterlabs.org/pipermail/pacemaker/attachments/20120702/615f333a/attachment-0003.html>


More information about the Pacemaker mailing list