[Pacemaker] Getting split brain after all reboot of a cluster node

Anne Nicolas ennael1 at gmail.com
Wed Mar 5 05:54:55 EST 2014


On 05/03/2014 11:26, emmanuel segura wrote:
> because you don't have fencing configured

As I said, I've added it to /etc/drbd.d/global_common.conf, which now reads:

     $ cat global_common.conf
     global {
             usage-count no;
             disable-ip-verification;
     }
     common {
             handlers {
                     fence-peer "/usr/lib/drbd/crm-fence-peer.sh";
                     after-resync-target "/usr/lib/drbd/crm-unfence-peer.sh";
             }
             syncer { rate 500M; }
     }

but there is no improvement.
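
For comparison, here is a minimal sketch of a common section that also sets a
fencing policy. It rests on an assumption that has not been tested on this
cluster: in DRBD 8.3 the fence-peer handler is only called when the disk
section sets "fencing" to resource-only or resource-and-stonith (the default
is dont-care). The disk block is the only part that does not come from the
configuration quoted in this message.

     common {
             disk {
                     # assumption: this policy is not in the posted config
                     fencing resource-only;
             }
             handlers {
                     fence-peer "/usr/lib/drbd/crm-fence-peer.sh";
                     after-resync-target "/usr/lib/drbd/crm-unfence-peer.sh";
             }
             syncer { rate 500M; }
     }

A change like this would presumably have to be made on both nodes and applied
with "drbdadm adjust server" before it has any effect.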

> 
> 
> 2014-03-05 9:28 GMT+01:00 Anne Nicolas <ennael1 at gmail.com>:
> 
>     Hi
> 
>     I'm having trouble setting up a very simple cluster with 2 nodes. After
>     every reboot I get a split brain that I then have to resolve by hand.
>     I'm looking for a solution to that...
> 
>     Both nodes have 4 network interfaces. We use 3 of them: one for the
>     cluster IP, one for a bridge for a VM, and the last one for the private
>     network of the cluster.
> 
>     I'm using
>     drbd : 8.3.9
>     drbd-utils: 8.3.9
> 
>     DRBD configuration:
>     ============
>     $ cat global_common.conf
>     global {
>             usage-count no;
>             disable-ip-verification;
>      }
>     common { syncer { rate 500M; } }
> 
>     $ cat server.res
>     resource server {
>             protocol C;
>             net {
>                      cram-hmac-alg sha1;
>                      shared-secret "eafcupps";
>                 }
>      on dzacupsvr {
>         device     /dev/drbd0;
>         disk       /dev/vg0/server;
>         address    172.16.1.1:7788;
>         flexible-meta-disk  internal;
>       }
>       on dzacupsvr2 {
>         device     /dev/drbd0;
>         disk       /dev/vg0/server;
>         address    172.16.1.2:7788;
>         flexible-meta-disk  internal;
>       }
>     }
> 
>     Pacemaker configuration
>     ===============
>     node $id="16847020" dzacupsvr
>     node $id="33624236" dzacupsvr2
>     primitive apache ocf:heartbeat:apache \
>             params configfile="/etc/httpd/conf/httpd.conf" \
>             op start interval="0" timeout="40s" \
>             op stop interval="0" timeout="60s"
>     primitive clusterip ocf:heartbeat:IPaddr2 \
>             params ip="192.168.1.5" cidr_netmask="24" nic="eth0"
>     primitive drbdserv ocf:linbit:drbd \
>             params drbd_resource="server" \
>             op monitor interval="60s"
>     primitive fsserv ocf:heartbeat:Filesystem \
>             params device="/dev/drbd/by-res/server" directory="/EdgeServer" fstype="ext4"
>     primitive libvirt-guests lsb:libvirt-guests
>     primitive libvirtd lsb:libvirtd
>     primitive mysql ocf:heartbeat:mysql \
>             params binary="/usr/bin/mysqld_safe" config="/etc/my.cnf" datadir="/EdgeServer/mysql" \
>             op start interval="0" timeout="40s" \
>             op stop interval="0" timeout="60s" \
>             meta target-role="Started"
>     primitive named lsb:named
>     primitive samba lsb:smb
>     group services fsserv clusterip libvirtd samba apache mysql
>     ms drbdservClone drbdserv \
>             meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true"
>     colocation fs_on_drbd inf: fsserv drbdservClone:Master
>     order fsserv-after-drbdserv inf: drbdservClone:promote fsserv:start
>     property $id="cib-bootstrap-options" \
>             dc-version="1.1.7-2.mga1-ee0730e13d124c3d58f00016c3376a1de5323cff" \
>             cluster-infrastructure="corosync" \
>             stonith-enabled="false" \
>             no-quorum-policy="ignore"
> 
>     and here are the logs
>     http://pastebin.com/DcV2WM3M
> 
>     After looking for more information, I've added fencing handlers to the
>     DRBD configuration:
> 
>     handlers {
>         fence-peer "/usr/lib/drbd/crm-fence-peer.sh";
>         after-resync-target "/usr/lib/drbd/crm-unfence-peer.sh";
>       }
>     but still without any success...
> 
>     Any help appreciated
> 
>     Cheers
> 
>     --
>     Anne
>     http://mageia.org
> 
>     _______________________________________________
>     Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
>     http://oss.clusterlabs.org/mailman/listinfo/pacemaker
> 
>     Project Home: http://www.clusterlabs.org
>     Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
>     Bugs: http://bugs.clusterlabs.org
> 
> 
> 
> 
> -- 
> esta es mi vida e me la vivo hasta que dios quiera
> 
> 
> 


-- 
Anne
http://mageia.org



