[Pacemaker] Howto recover from node state UNCLEAN (online)

Andreas Mock andreas.mock at web.de
Thu Sep 5 10:23:23 UTC 2013


Hi all,

is there a way to recover from node state UNCLEAN (online) without
rebooting?

Background: 
- RHEL6.4
- cman-cluster with pacemaker
- stonith enabled and working

- resource monitoring failed on node 1
  => stop of resource on node 1 failed 
  => stonith off node 1 worked
- more or less parallel as resource is clone resource
  resource monitoring failed on node 2
  => stop of resource on node 2 failed
  => stonith of node 2 failed as stonith resource agent on
     node 1 is unreachable caused by stonithing of node1

- Error message stating, giving up stonithing.
=> node 2 in the state above

Interestingly: a "service stop pacemaker" doesn't work
as pacemaker seems to be blocked by this node state.

The questions:
1) How to recover from this state without rebooting?
2) Is self-stonithing allowed meanwhile, so that
a self-stonithing device could be added in a fencing
topology?

Best regards
Andreas Mock

   





More information about the Pacemaker mailing list