[ClusterLabs] epic fail

Digimer lists at alteeve.ca
Sun Jul 23 14:19:15 EDT 2017

On 2017-07-23 08:27 AM, Dmitri Maziuk wrote:
> So yesterday I ran yum update that puled in the new pacemaker and tried
> to restart it. The node went into its usual "can't unmount drbd because
> kernel is using it" and got stonith'ed in the middle of yum transaction.
> The end result: DRBD reports split brain, HA daemons don't start on
> boot, RPM database is FUBAR. I've had enough. I'm rebuilding this
> cluster as centos 6 + heartbeat R1.
> Centos 7 + DRBD 8.4 + pacemaker + NFS server: FAIL. You have been warned.
> Dima

Is DRBD set to 'fencing resource-and-stonith'? If so, then the only way
to get a split-brain is if something is configured wrong in pacemaker or
if something caused crm-fence-peer.sh to report success when it didn't
actually succeed...

Papers and Projects: https://alteeve.com/w/
"I am, somehow, less interested in the weight and convolutions of
Einstein’s brain than in the near certainty that people of equal talent
have lived and died in cotton fields and sweatshops." - Stephen Jay Gould

More information about the Users mailing list