[Pacemaker] Pacemaker hang with hardware reset

Damiano Scaramuzza cesello at daimonlab.it
Wed Jul 4 11:20:08 UTC 2012


Hi Emmanuel,
yes I use drbd level fence as in linbit user guide

disk {
    fencing resource-only;
    ...
  }
  handlers {
    fence-peer "/usr/lib/drbd/crm-fence-peer.sh";
    after-resync-target "/usr/lib/drbd/crm-unfence-peer.sh";
    ...
  }

and
the drbd driver correctly marks the peer as Outdated as soon as I reset
the node and I suppose calls the crm-fence-peer.sh script
to notify crm the fencing phase.
The problem is: I see the drbd peer driver Uknown/Outdated but in
crm_mon I still seeing both resources in Master state.
This happens only with hard reset. If I stop and start the node I see
teh correct phase Outdated-->UpToDate after a re-sink.
My guess is that drbd notify, the crm try to notify but the message does
not arrives to other components (dlm,clvm,ocfs2 )




On 04/07/2012 12:22, emmanuel segura wrote:
> Hello Damiano
>
> Do you use drbd fence + pacemaker fence?
>
> 2012/7/4 Damiano Scaramuzza <cesello at daimonlab.it
> <mailto:cesello at daimonlab.it>>
>
>     Hi all, my first post in this ML.
>     I've used in 2008 heartbeat for a big project and now I'm back with
>     pacemaker for a smaller one.
>     ....
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.clusterlabs.org/pipermail/pacemaker/attachments/20120704/6b297c18/attachment.htm>


More information about the Pacemaker mailing list