[Pacemaker] weird drbd/cluster behaviour

Саша Александров shurrman at gmail.com
Tue Jun 25 09:25:47 EDT 2013


Hi all!

I am setting up a new cluster on OracleLinux 6.4 (well, it is CentOS 6.4).
I went through http://clusterlabs.org/quickstart-redhat.html
Then I installed DRBD 8.4.2 from elrepo.
This setup is unusable :-( with DRBD 8.4.2.
I created three DRBD resources:

cat /proc/drbd
version: 8.4.2 (api:1/proto:86-101)
GIT-hash: 7ad5f850d711223713d6dcadc3dd48860321070c build by root at flashfon1,
2013-06-24 22:08:41
 0: cs:Connected ro:Primary/Secondary ds:UpToDate/UpToDate C r-----
    ns:97659171 nr:0 dw:36 dr:97660193 al:1 bm:5961 lo:0 pe:0 ua:0 ap:0
ep:1 wo:f oos:0
 1: cs:Connected ro:Primary/Secondary ds:UpToDate/UpToDate C r-----
    ns:292421653 nr:16 dw:16 dr:292422318 al:0 bm:17848 lo:0 pe:0 ua:0 ap:0
ep:1 wo:f oos:0
 2: cs:Connected ro:Primary/Secondary ds:UpToDate/UpToDate C r-----
    ns:292421600 nr:8 dw:8 dr:292422265 al:0 bm:17848 lo:0 pe:0 ua:0 ap:0
ep:1 wo:f oos:0

It appeared that drbd resource-agent script did not work. Debugging showed
that check_crm_feature_set() function always returned zeroes. Ok, just
added 'exit' as its first line for now.

Next, I created three drbd resources in pacemaker, three master-slave sets,
three filesystem resources (and ip resources, but they are no problem):

 pcs status
Last updated: Tue Jun 25 21:20:17 2013
Last change: Tue Jun 25 02:46:25 2013 via crm_resource on flashfon1
Stack: cman
Current DC: flashfon1 - partition with quorum
Version: 1.1.8-7.el6-394e906
2 Nodes configured, unknown expected votes
11 Resources configured.


Online: [ flashfon1 flashfon2 ]

Full list of resources:

 Master/Slave Set: ms_wsoft [drbd_wsoft]
     Masters: [ flashfon1 ]
     Slaves: [ flashfon2 ]
 Master/Slave Set: ms_oradata [drbd_oradata]
     Slaves: [ flashfon1 flashfon2 ]
 Master/Slave Set: ms_flash [drbd_flash]
     Slaves: [ flashfon1 flashfon2 ]
 Resource Group: WcsGroup
     wcs_vip_local      (ocf::heartbeat:IPaddr2):       Started flashfon1
     wcs_fs     (ocf::heartbeat:Filesystem):    Started flashfon1
 Resource Group: OraGroup
     ora_vip_local      (ocf::heartbeat:IPaddr2):       Started flashfon1
     oradata_fs (ocf::heartbeat:Filesystem):    Stopped
     oraflash_fs        (ocf::heartbeat:Filesystem):    Stopped

See, only one master-slave set is recognizing DRBD state!

Resources are configured identically in CIB (except for drbd resource name
parameter):

      <master id="ms_wsoft">
        <primitive class="ocf" id="drbd_wsoft" provider="linbit"
type="drbd">
          <instance_attributes id="drbd_wsoft-instance_attributes">
            <nvpair id="drbd_wsoft-instance_attributes-drbd_resource"
name="drbd_resource" value="wsoft"/>
          </instance_attributes>
          <operations>
            <op id="drbd_wsoft-interval-60s" interval="60s" name="monitor"/>
          </operations>
        </primitive>
        <meta_attributes id="ms_wsoft-meta_attributes">
          <nvpair id="ms_wsoft-meta_attributes-master-max"
name="master-max" value="1"/>
          <nvpair id="ms_wsoft-meta_attributes-master-node-max"
name="master-node-max" value="1"/>
          <nvpair id="ms_wsoft-meta_attributes-clone-max" name="clone-max"
value="2"/>
          <nvpair id="ms_wsoft-meta_attributes-clone-node-max"
name="clone-node-max" value="1"/>
          <nvpair id="ms_wsoft-meta_attributes-notify" name="notify"
value="true"/>
        </meta_attributes>
      </master>
      <master id="ms_oradata">
        <primitive class="ocf" id="drbd_oradata" provider="linbit"
type="drbd">
          <instance_attributes id="drbd_oradata-instance_attributes">
            <nvpair id="drbd_oradata-instance_attributes-drbd_resource"
name="drbd_resource" value="oradata"/>
          </instance_attributes>
          <operations>
            <op id="drbd_oradata-interval-60s" interval="60s"
name="monitor"/>
          </operations>
        </primitive>
        <meta_attributes id="ms_oradata-meta_attributes">
          <nvpair id="ms_oradata-meta_attributes-master-max"
name="master-max" value="1"/>
          <nvpair id="ms_oradata-meta_attributes-master-node-max"
name="master-node-max" value="1"/>
          <nvpair id="ms_oradata-meta_attributes-clone-max"
name="clone-max" value="2"/>
          <nvpair id="ms_oradata-meta_attributes-clone-node-max"
name="clone-node-max" value="1"/>
          <nvpair id="ms_oradata-meta_attributes-notify" name="notify"
value="true"/>
        </meta_attributes>
      </master>

I am stuck. :-((((

Best regards,
Alexandr A. Alexandrov
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.clusterlabs.org/pipermail/pacemaker/attachments/20130625/c061c2e2/attachment-0002.html>


More information about the Pacemaker mailing list