[ClusterLabs] pacemaker pingd with ms drbd = double masters short time when disconnected networks.
Прокопов Павел
prokopov at experium.ru
Wed Dec 20 02:39:44 EST 2017
Stonith - the kill other node. But I need another.
So that a node can not become a master if it is not connected to network.
I solved my question by increasing the parameter token in corosync.conf
default token value: 3000. New value: 30000.
And then the rule (location location_master_ms_drbd1 ms_drbd1 rule
$role=Master -inf: not_defined pingd or pingd lte 0) has time to work
correctly.
Thanks to Adrián Gómez <a.gomez at medlabmg.com> for his question, which
turned out to be a response to my.
On 19.12.2017 13:00, emmanuel segura wrote:
> You need to configure the stonith and drbd stonith handler
>
> 2017-12-19 8:19 GMT+01:00 Прокопов Павел <prokopov at experium.ru
> <mailto:prokopov at experium.ru>>:
>
> Hello!
>
> pacemaker pingd with ms drbd = double masters short time when
> disconnected networks.
>
> My crm config:
>
> node 168885811 <tel:168885811>: pp-pacemaker1.heliosoft.ru
> <http://pp-pacemaker1.heliosoft.ru>
> node 168885812 <tel:168885812>: pp-pacemaker2.heliosoft.ru
> <http://pp-pacemaker2.heliosoft.ru>
> primitive drbd1 ocf:linbit:drbd \
> params drbd_resource=drbd1 \
> op monitor interval=60s \
> op start interval=15 timeout=240s \
> op stop interval=15 timeout=240s \
> op monitor role=Master interval=30s \
> op monitor role=Slave interval=60s
> primitive fs_drbd1 Filesystem \
> params device="/dev/drbd1" directory="/mnt/drbd1" fstype=ext4
> options=noatime
> primitive pinger ocf:pacemaker:ping \
> params host_list=10.16.4.1 multiplier=100 \
> op monitor interval=15s \
> op start interval=0 timeout=5s \
> op stop interval=0
> primitive vip IPaddr2 \
> params ip=10.16.5.227 nic=eth0 \
> op monitor interval=10s
> primitive vip2 IPaddr2 \
> params ip=10.16.254.50 nic=eth1 \
> op monitor interval=10s
> group group_master fs_drbd1 vip vip2
> ms ms_drbd1 drbd1 \
> meta master-max=1 master-node-max=1 clone-max=2
> clone-node-max=1 notify=true
> clone pingerclone pinger \
> meta globally-unique=false
> colocation colocation_master inf: ms_drbd1:Master group_master
> location location_master_ms_drbd1 ms_drbd1 \
> rule $role=Master -inf: not_defined pingd or pingd lte 0
> order main_order Mandatory: pingerclone:start ms_drbd1:promote
> group_master:start
> property cib-bootstrap-options: \
> stonith-enabled=false \
> no-quorum-policy=ignore \
> default-resource-stickiness=500 \
> cluster-name=pp1
>
> root at pp-pacemaker2:~# crm_mon -1
> Stack: corosync
> Current DC: pp-pacemaker2.heliosoft.ru
> <http://pp-pacemaker2.heliosoft.ru> (version 1.1.16-94ff4df) -
> partition with quorum
> Last updated: Fri Dec 15 13:48:10 2017
> Last change: Fri Dec 15 13:46:38 2017 by root via cibadmin on
> pp-pacemaker1.heliosoft.ru <http://pp-pacemaker1.heliosoft.ru>
>
> 2 nodes configured
> 7 resources configured
>
> Online: [ pp-pacemaker1.heliosoft.ru
> <http://pp-pacemaker1.heliosoft.ru> pp-pacemaker2.heliosoft.ru
> <http://pp-pacemaker2.heliosoft.ru> ]
>
> Active resources:
>
> Resource Group: group_master
> fs_drbd1 (ocf::heartbeat:Filesystem): Started
> pp-pacemaker1.heliosoft.ru <http://pp-pacemaker1.heliosoft.ru>
> vip (ocf::heartbeat:IPaddr2): Started
> pp-pacemaker1.heliosoft.ru <http://pp-pacemaker1.heliosoft.ru>
> vip2 (ocf::heartbeat:IPaddr2): Started
> pp-pacemaker1.heliosoft.ru <http://pp-pacemaker1.heliosoft.ru>
> Master/Slave Set: ms_drbd1 [drbd1]
> Masters: [ pp-pacemaker1.heliosoft.ru
> <http://pp-pacemaker1.heliosoft.ru> ]
> Slaves: [ pp-pacemaker2.heliosoft.ru
> <http://pp-pacemaker2.heliosoft.ru> ]
> Clone Set: pingerclone [pinger]
> Started: [ pp-pacemaker1.heliosoft.ru
> <http://pp-pacemaker1.heliosoft.ru> pp-pacemaker2.heliosoft.ru
> <http://pp-pacemaker2.heliosoft.ru> ]
> #end crm_mon
>
> When I disconnect pp-pacemaker2 from all networks, I have:
> root at pp-pacemaker2:~# crm_mon -1
> Stack: corosync
> Current DC: pp-pacemaker2.heliosoft.ru
> <http://pp-pacemaker2.heliosoft.ru> (version 1.1.16-94ff4df) -
> partition with quorum
> Last updated: Fri Dec 15 13:53:15 2017
> Last change: Fri Dec 15 13:53:00 2017 by root via cibadmin on
> pp-pacemaker2.heliosoft.ru <http://pp-pacemaker2.heliosoft.ru>
>
> 2 nodes configured
> 7 resources configured
>
> Online: [ pp-pacemaker2.heliosoft.ru
> <http://pp-pacemaker2.heliosoft.ru>]
> OFFLINE: [pp-pacemaker1.heliosoft.ru
> <http://pp-pacemaker1.heliosoft.ru> ]
>
> Active resources:
>
> Resource Group: group_master
> fs_drbd1 (ocf::heartbeat:Filesystem): Started
> pp-pacemaker2.heliosoft.ru <http://pp-pacemaker2.heliosoft.ru>
> vip (ocf::heartbeat:IPaddr2): Started
> pp-pacemaker2.heliosoft.ru <http://pp-pacemaker2.heliosoft.ru>
> vip2 (ocf::heartbeat:IPaddr2): Started
> pp-pacemaker2.heliosoft.ru <http://pp-pacemaker2.heliosoft.ru>
> Master/Slave Set: ms_drbd1 [drbd1]
> Masters: [ pp-pacemaker2.heliosoft.ru
> <http://pp-pacemaker2.heliosoft.ru> ]
> Clone Set: pingerclone [pinger]
> Started: [ pp-pacemaker2.heliosoft.ru
> <http://pp-pacemaker2.heliosoft.ru> ]
> #end crm_mon
>
> Wait 5 seconds.
>
> root at pp-pacemaker2:~# crm_mon -1
> Stack: corosync
> Current DC: pp-pacemaker2.heliosoft.ru
> <http://pp-pacemaker2.heliosoft.ru> (version 1.1.16-94ff4df) -
> partition with quorum
> Last updated: Fri Dec 15 13:48:10 2017
> Last change: Fri Dec 15 13:46:38 2017 by root via cibadmin on
> pp-pacemaker1.heliosoft.ru <http://pp-pacemaker1.heliosoft.ru>
>
> 2 nodes configured
> 7 resources configured
>
> Online: [ pp-pacemaker2.heliosoft.ru
> <http://pp-pacemaker2.heliosoft.ru>
> OFFLINE: [pp-pacemaker1.heliosoft.ru
> <http://pp-pacemaker1.heliosoft.ru> ]
>
> Active resources:
>
> Master/Slave Set: ms_drbd1 [drbd1]
> Slaves: [ pp-pacemaker2.heliosoft.ru
> <http://pp-pacemaker2.heliosoft.ru> ]
> Clone Set: pingerclone [pinger]
> Started: [ pp-pacemaker2.heliosoft.ru
> <http://pp-pacemaker2.heliosoft.ru> ]
> #end crm_mon
>
> Why pp-pacemaker2 first become a master? It breaks drdb.
>
>
>
>
> _______________________________________________
> Users mailing list: Users at clusterlabs.org
> <mailto:Users at clusterlabs.org>
> http://lists.clusterlabs.org/mailman/listinfo/users
> <http://lists.clusterlabs.org/mailman/listinfo/users>
>
> Project Home: http://www.clusterlabs.org
> Getting started:
> http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> <http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf>
> Bugs: http://bugs.clusterlabs.org
>
>
>
>
> --
> .~.
> /V\
> // \\
> /( )\
> ^`~'^
>
>
> _______________________________________________
> Users mailing list: Users at clusterlabs.org
> http://lists.clusterlabs.org/mailman/listinfo/users
>
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs: http://bugs.clusterlabs.org
--
Павел Прокопов
Старший системный администратор
Отдел системного администрирования
Адрес: 109004, Москва,
Большой факельный переулок, д. 3, стр. 2,
3 этаж
Тел.: +7 (495) 783-66-03, доб. 1143
Факс: +7 (495) 789-89-58
Почта: prokopov at experium.ru <mailto:prokopov at experium.ru>
Сайт: www.experium.ru <http://www.experium.ru>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.clusterlabs.org/pipermail/users/attachments/20171220/bef15659/attachment-0003.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: jighifkkeghckhnk.gif
Type: image/gif
Size: 1745 bytes
Desc: not available
URL: <https://lists.clusterlabs.org/pipermail/users/attachments/20171220/bef15659/attachment-0003.gif>
More information about the Users
mailing list