<div dir="ltr"><div>Hello<br><br></div>How do you configure your cluster network? are you using a private network for the cluster and one public for the services?<br></div><div class="gmail_extra"><br><br><div class="gmail_quote">
2013/5/15 Andrew Widdersheim <span dir="ltr"><<a href="mailto:awiddersheim@hotmail.com" target="_blank">awiddersheim@hotmail.com</a>></span><br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Sorry to bring up old issues but I am having the exact same problem as the original poster. A simultaneous disconnect on my two node cluster causes the resources to start to transition to the other node but mid flight the transition is aborted and resources are started again on the original node when the cluster realizes connectivity is same between the two nodes.<br>
<br>
I have tried various dampen settings without having any luck. Seems like the nodes report the outages at slightly different times which results in a partial transition of resources instead of waiting to know the connectivity of all of the nodes in the cluster before taking action which is what I would have thought dampen would help solve.<br>
<br>
Ideally the cluster wouldn't start the transition if another cluster node is having a connectivity issue as well and connectivity status is shared between all cluster nodes. Find my configuration below. Let me know there is something I can change to fix or if this behavior is expected.<br>
<br>
primitive p_drbd ocf:linbit:drbd \<br>
params drbd_resource="r1" \<br>
op monitor interval="30s" role="Slave" \<br>
op monitor interval="10s" role="Master"<br>
primitive p_fs ocf:heartbeat:Filesystem \<br>
params device="/dev/drbd/by-res/r1" directory="/drbd/r1" fstype="ext4" options="noatime" \<br>
op start interval="0" timeout="60s" \<br>
op stop interval="0" timeout="180s" \<br>
op monitor interval="30s" timeout="40s"<br>
primitive p_mysql ocf:heartbeat:mysql \<br>
params binary="/usr/libexec/mysqld" config="/drbd/r1/mysql/my.cnf" datadir="/drbd/r1/mysql" \<br>
op start interval="0" timeout="120s" \<br>
op stop interval="0" timeout="120s" \<br>
op monitor interval="30s" \<br>
meta target-role="Started"<br>
primitive p_ping ocf:pacemaker:ping \<br>
params host_list="192.168.5.1" dampen="30s" multiplier="1000" debug="true" \<br>
op start interval="0" timeout="60s" \<br>
op stop interval="0" timeout="60s" \<br>
op monitor interval="5s" timeout="10s"<br>
group g_mysql_group p_fs p_mysql \<br>
meta target-role="Started"<br>
ms ms_drbd p_drbd \<br>
meta notify="true" master-max="1" clone-max="2" target-role="Started"<br>
clone cl_ping p_ping<br>
location l_connected g_mysql \<br>
rule $id="l_connected-rule" pingd: defined pingd<br>
colocation c_mysql_on_drbd inf: g_mysql ms_drbd:Master<br>
order o_drbd_before_mysql inf: ms_drbd:promote g_mysql:start<br>
property $id="cib-bootstrap-options" \<br>
dc-version="1.1.6-1.el6-8b6c6b9b6dc2627713f870850d20163fad4cc2a2" \<br>
cluster-infrastructure="Heartbeat" \<br>
no-quorum-policy="ignore" \<br>
stonith-enabled="false" \<br>
cluster-recheck-interval="5m" \<br>
last-lrm-refresh="1368632470"<br>
rsc_defaults $id="rsc-options" \<br>
migration-threshold="5" \<br>
resource-stickiness="200"<br>
_______________________________________________<br>
Pacemaker mailing list: <a href="mailto:Pacemaker@oss.clusterlabs.org">Pacemaker@oss.clusterlabs.org</a><br>
<a href="http://oss.clusterlabs.org/mailman/listinfo/pacemaker" target="_blank">http://oss.clusterlabs.org/mailman/listinfo/pacemaker</a><br>
<br>
Project Home: <a href="http://www.clusterlabs.org" target="_blank">http://www.clusterlabs.org</a><br>
Getting started: <a href="http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf" target="_blank">http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf</a><br>
Bugs: <a href="http://bugs.clusterlabs.org" target="_blank">http://bugs.clusterlabs.org</a><br>
</blockquote></div><br><br clear="all"><br>-- <br>esta es mi vida e me la vivo hasta que dios quiera
</div>