<html>

  <head>


    <meta http-equiv="content-type" content="text/html; charset=UTF-8">

  </head>

  <body>

    <p>Hello,</p>

    <p>I have a strange problem: when I reset the node on which my
      resources are running, they are correctly migrated to the other
      node. But when I bring the failed node back, all resources move
      back to it as soon as it is up. I have set the
      resource-stickiness default to 100. When this did not help, I
      also set the resource-stickiness meta attribute to 100 on every
      resource. Still, when the failed node recovers, the resources are
      migrated back to it! Where should I look to understand this
      behaviour?</p>
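    <p>To make my expectation concrete, here is a toy sketch of how I
      understand placement scoring: each node gets a score,
      resource-stickiness is credited to the node currently running the
      resource, and the highest total wins. This is a simplified model,
      not Pacemaker's actual scheduler; the function name pick_node and
      the location score of 200 are hypothetical and do not come from my
      configuration.</p>
<pre>
```python
# Toy model of node placement scores (simplified; NOT Pacemaker's real scheduler).
def pick_node(current_node, stickiness, location_scores):
    """Return the node with the highest total score.

    current_node:    node the resource is running on right now
    stickiness:      resource-stickiness value, credited to current_node only
    location_scores: dict mapping node name to a (hypothetical) location score
    """
    totals = dict(location_scores)
    totals[current_node] = totals.get(current_node, 0) + stickiness
    # In this sketch, ties are broken in favour of the current node.
    return max(totals, key=lambda node: (totals[node], node == current_node))


# No location preference: stickiness keeps the resource on node2.
print(pick_node("node2", 100, {"node1": 0, "node2": 0}))    # node2
# A hypothetical location score of 200 for node1 outweighs stickiness 100.
print(pick_node("node2", 100, {"node1": 200, "node2": 0}))  # node1
```
</pre>
    <p>By this reasoning a stickiness of 100 should keep the resources
      on node2 unless something scores node1 higher, and that is what I
      would like to track down.</p>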

    <p>Here is the current status of my cluster:<br>

    </p>

    root@node1# pcs status<br>

    Cluster name: gcluster<br>

    Cluster Summary:<br>

      * Stack: corosync<br>

      * Current DC: node1 (version 2.0.4-2deceaa3ae) - partition with

    quorum<br>

      * Last updated: Sat Sep 26 11:12:34 2020<br>

      * Last change:  Sat Sep 26 10:39:16 2020 by root via cibadmin on

    node1<br>

      * 2 nodes configured<br>

      * 14 resource instances configured (1 DISABLED)<br>

    <br>

    Node List:<br>

      * Online: [ node1 node2 ]<br>

    <br>

    Full List of Resources:<br>

      * ilo5_node1    (stonith:fence_ilo5_ssh):     Started node2<br>

      * ilo5_node2    (stonith:fence_ilo5_ssh):     Started node1<br>

      * Resource Group: VirtIP:<br>

        * PrimaryIP    (ocf::heartbeat:IPaddr2):     Started node2<br>

        * PrimaryIP6    (ocf::heartbeat:IPv6addr):     Started node2<br>

        * AliasIP    (ocf::heartbeat:IPaddr2):     Started node2<br>

      * BackupFS    (ocf::redhat:netfs.sh):     Started node2<br>

      * Clone Set: MailVolume-clone [MailVolume] (promotable):<br>

        * Masters: [ node2 ]<br>

        * Slaves: [ node1 ]<br>

      * MailFS    (ocf::heartbeat:Filesystem):     Started node2<br>

      * apache    (ocf::heartbeat:apache):     Started node2<br>

      * postfix    (ocf::heartbeat:postfix):     Started node2<br>

      * amavis    (service:amavis):     Started node2<br>

      * dovecot    (service:dovecot):     Started node2<br>

      * openvpn    (service:openvpn):     Stopped (disabled)<br>

    <br>

    And the resource configuration:<br>

    <br>

    root@node1# pcs resource config<br>

     Group: VirtIP<br>

      Meta Attrs: resource-stickiness=100<br>

      Resource: PrimaryIP (class=ocf provider=heartbeat type=IPaddr2)<br>

       Attributes: cidr_netmask=16 ip=xx.xx.xx.20 nic=br0<br>

       Meta Attrs: resource-stickiness=100<br>

       Operations: monitor interval=30s (PrimaryIP-monitor-interval-30s)<br>

                   start interval=0s timeout=20s

    (PrimaryIP-start-interval-0s)<br>

                   stop interval=0s timeout=20s

    (PrimaryIP-stop-interval-0s)<br>

      Resource: PrimaryIP6 (class=ocf provider=heartbeat type=IPv6addr)<br>

       Attributes: cidr_netmask=64 ipv6addr=xxxx:xxxx:xxxx:xxxx:0:0:0:20

    nic=br0<br>

       Meta Attrs: resource-stickiness=100<br>

       Operations: monitor interval=30s

    (PrimaryIP6-monitor-interval-30s)<br>

                   start interval=0s timeout=15s

    (PrimaryIP6-start-interval-0s)<br>

                   stop interval=0s timeout=15s

    (PrimaryIP6-stop-interval-0s)<br>

      Resource: AliasIP (class=ocf provider=heartbeat type=IPaddr2)<br>

       Attributes: cidr_netmask=16 ip=xx.xx.yy.20 nic=br0<br>

       Meta Attrs: resource-stickiness=100<br>

       Operations: monitor interval=30s (AliasIP-monitor-interval-30s)<br>

                   start interval=0s timeout=20s

    (AliasIP-start-interval-0s)<br>

                   stop interval=0s timeout=20s

    (AliasIP-stop-interval-0s)<br>

     Resource: BackupFS (class=ocf provider=redhat type=netfs.sh)<br>

      Attributes: export=/Backup/Gateway fstype=nfs host=atlas

    mountpoint=/Backup options=noatime,async<br>

      Meta Attrs: resource-stickiness=100<br>

      Operations: monitor interval=1m timeout=10

    (BackupFS-monitor-interval-1m)<br>

                  monitor interval=5m timeout=30 OCF_CHECK_LEVEL=10

    (BackupFS-monitor-interval-5m)<br>

                  monitor interval=10m timeout=30 OCF_CHECK_LEVEL=20

    (BackupFS-monitor-interval-10m)<br>

                  start interval=0s timeout=900

    (BackupFS-start-interval-0s)<br>

                  stop interval=0s timeout=30

    (BackupFS-stop-interval-0s)<br>

     Clone: MailVolume-clone<br>

      Meta Attrs: clone-max=2 clone-node-max=1 notify=true

    promotable=true promoted-max=1 promoted-node-max=1

    resource-stickiness=100<br>

      Resource: MailVolume (class=ocf provider=linbit type=drbd)<br>

       Attributes: drbd_resource=mail<br>

       Meta Attrs: resource-stickiness=100<br>

       Operations: demote interval=0s timeout=90

    (MailVolume-demote-interval-0s)<br>

                   monitor interval=60s

    (MailVolume-monitor-interval-60s)<br>

                   notify interval=0s timeout=90

    (MailVolume-notify-interval-0s)<br>

                   promote interval=0s timeout=90

    (MailVolume-promote-interval-0s)<br>

                   reload interval=0s timeout=30

    (MailVolume-reload-interval-0s)<br>

                   start interval=0s timeout=240

    (MailVolume-start-interval-0s)<br>

                   stop interval=0s timeout=100

    (MailVolume-stop-interval-0s)<br>

     Resource: MailFS (class=ocf provider=heartbeat type=Filesystem)<br>

      Attributes: device=/dev/drbd0 directory=/var/mail fstype=btrfs<br>

      Meta Attrs: resource-stickiness=100<br>

      Operations: monitor interval=20s timeout=40s

    (MailFS-monitor-interval-20s)<br>

                  start interval=0s timeout=60s

    (MailFS-start-interval-0s)<br>

                  stop interval=0s timeout=60s (MailFS-stop-interval-0s)<br>

     Resource: apache (class=ocf provider=heartbeat type=apache)<br>

      Attributes: client=wget statusurl=<a class="moz-txt-link-freetext" href="https://localhost/server-status">https://localhost/server-status</a><br>

      Meta Attrs: resource-stickiness=100<br>

      Operations: monitor interval=1min (apache-monitor-interval-1min)<br>

                  start interval=0s timeout=40s

    (apache-start-interval-0s)<br>

                  stop interval=0s timeout=60s (apache-stop-interval-0s)<br>

     Resource: postfix (class=ocf provider=heartbeat type=postfix)<br>

      Meta Attrs: resource-stickiness=100<br>

      Operations: monitor interval=60s timeout=20s

    (postfix-monitor-interval-60s)<br>

                  reload interval=0s timeout=20s

    (postfix-reload-interval-0s)<br>

                  start interval=0s timeout=20s

    (postfix-start-interval-0s)<br>

                  stop interval=0s timeout=20s

    (postfix-stop-interval-0s)<br>

     Resource: amavis (class=service type=amavis)<br>

      Meta Attrs: resource-stickiness=100<br>

      Operations: force-reload interval=0s timeout=15

    (amavis-force-reload-interval-0s)<br>

                  monitor interval=15 timeout=15

    (amavis-monitor-interval-15)<br>

                  restart interval=0s timeout=15

    (amavis-restart-interval-0s)<br>

                  start interval=0s timeout=15

    (amavis-start-interval-0s)<br>

                  stop interval=0s timeout=15 (amavis-stop-interval-0s)<br>

     Resource: dovecot (class=service type=dovecot)<br>

      Meta Attrs: resource-stickiness=100<br>

      Operations: force-reload interval=0s timeout=15

    (dovecot-force-reload-interval-0s)<br>

                  monitor interval=15 timeout=15

    (dovecot-monitor-interval-15)<br>

                  restart interval=0s timeout=15

    (dovecot-restart-interval-0s)<br>

                  start interval=0s timeout=15

    (dovecot-start-interval-0s)<br>

                  stop interval=0s timeout=15 (dovecot-stop-interval-0s)<br>

     Resource: openvpn (class=service type=openvpn)<br>

      Meta Attrs: resource-stickiness=100 target-role=Stopped<br>

      Operations: force-reload interval=0s timeout=15

    (openvpn-force-reload-interval-0s)<br>

                  monitor interval=15 timeout=15

    (openvpn-monitor-interval-15)<br>

                  restart interval=0s timeout=15

    (openvpn-restart-interval-0s)<br>

                  start interval=0s timeout=15

    (openvpn-start-interval-0s)<br>

                  stop interval=0s timeout=15 (openvpn-stop-interval-0s)<br>

    <p>The DRBD resource is configured as follows:</p>

    <p>root@node1# cat /etc/drbd.d/mail.res <br>

      resource mail {<br>

        protocol  B;<br>

        device    /dev/drbd0;<br>

        disk      /dev/sys/mail;<br>

        meta-disk internal;<br>

      <br>

        net {<br>

          csums-alg sha1;<br>

          after-sb-0pri discard-zero-changes;<br>

          after-sb-1pri discard-secondary;<br>

          after-sb-2pri disconnect;<br>

          rr-conflict disconnect;<br>

        }<br>

      <br>

        handlers {<br>

          fence-peer            "/usr/lib/drbd/crm-fence-peer.sh";<br>

          after-resync-target   "/usr/lib/drbd/crm-unfence-peer.sh";<br>

          split-brain           "/usr/lib/drbd/notify-split-brain.sh

      <a class="moz-txt-link-abbreviated" href="mailto:admin@logit-ag.de">admin@logit-ag.de</a>";<br>

        }<br>

      <br>

        on node1 {<br>

          address 192.168.0.102:7789;<br>

        }<br>

        on node2 {<br>

          address 192.168.0.103:7789;<br>

        }<br>

      }<br>

    </p>

    <p>Best regards,<br>

    </p>

    <pre class="moz-signature" cols="128">-- 

 \   / |                                   |

 (OvO) |  Mikhail Iwanow                   |

 (^^^) |                                   |

  \^/  |      E-mail:  <a class="moz-txt-link-abbreviated" href="mailto:ivans@logit-ag.de">ivans@logit-ag.de</a>   |

  ^ ^  |                                   |

</pre>

  </body>

</html>