[ClusterLabs] MS Promotion Not Working

Andrei Borzenkov arvidjaar at gmail.com
Sun Jun 14 11:26:52 UTC 2015


В Sun, 14 Jun 2015 13:16:18 +0200
"Brian D. Lees" <brian at fides.me.uk> пишет:

> Andrei,
> 
> Thanks for the suggestion;  however it sadly has the same outcome!  I think the key to this is understanding how the promotion scores are calculated as that will point us towards the items which are making the scores both -1.  Do you have any idea how this works?
> 

As far as I know there is no default master score. Resource agent is
responsible for deciding which instance should (can) be promoted and
setting scores accordingly. 

> node 1084751972: ACL001 \
>         attributes standby=off
> node 1084752072: ACL002 \
>         attributes standby=off
> primitive res_drbd_1 ocf:linbit:drbd \
>         params drbd_resource=acl_shared \
>         operations $id=res_drbd_1-operations \
>         op start interval=0 timeout=240 \
>         op promote interval=0 timeout=90 \
>         op demote interval=0 timeout=90 \
>         op stop interval=0 timeout=100 \
>         op monitor interval=10 timeout=20 role=Master start-delay=0 \
>         op monitor interval=11 timeout=20 role=Slave start-delay=0 \
>         op notify interval=0 timeout=90 \
>         meta
> ms ms_drbd_1 res_drbd_1 \
>         meta master-max=1 master-node-max=1 clone-max=2 clone-node-max=1 notify=true target-role=master
> property cib-bootstrap-options: \
>         symmetric-cluster=true \
>         stonith-enabled=false \
>         no-quorum-policy=ignore \
>         dc-version=1.1.12-1.1.12.git20140904.266d5c2 \
>         cluster-infrastructure=corosync \
>         cluster-name=aclcluster
> rsc_defaults rsc-options: \
>         resource-stickiness=0
> 
> Current cluster status:
> Online: [ ACL001 ACL002 ]
> 
>  Master/Slave Set: ms_drbd_1 [res_drbd_1]
>      Slaves: [ ACL001 ACL002 ]
> 
> Allocation scores:
> clone_color: ms_drbd_1 allocation score on ACL001: 0
> clone_color: ms_drbd_1 allocation score on ACL002: 0
> clone_color: res_drbd_1:0 allocation score on ACL001: 0
> clone_color: res_drbd_1:0 allocation score on ACL002: 0
> clone_color: res_drbd_1:1 allocation score on ACL001: 0
> clone_color: res_drbd_1:1 allocation score on ACL002: 0
> native_color: res_drbd_1:0 allocation score on ACL001: 0
> native_color: res_drbd_1:0 allocation score on ACL002: 0
> native_color: res_drbd_1:1 allocation score on ACL001: -INFINITY
> native_color: res_drbd_1:1 allocation score on ACL002: 0
> res_drbd_1:0 promotion score on ACL001: -1
> res_drbd_1:1 promotion score on ACL002: -1
> 
> Transition Summary:
> 
> 
>    debug: qb_rb_open_2:         shm size:524301; real_size:528384; rb->word_size:132096
>    debug: qb_rb_open_2:         shm size:524301; real_size:528384; rb->word_size:132096
>    debug: qb_rb_open_2:         shm size:524301; real_size:528384; rb->word_size:132096
>    debug: cib_native_signon_raw:        Connection to CIB successful
>    debug: cib_native_signoff:   Signing out of the CIB Service
>    debug: qb_ipcc_disconnect:   qb_ipcc_disconnect()
>    debug: qb_rb_close:  Closing ringbuffer: /dev/shm/qb-cib_rw-request-1929-14545-13-header
>    debug: qb_rb_close:  Closing ringbuffer: /dev/shm/qb-cib_rw-response-1929-14545-13-header
>    debug: qb_rb_close:  Closing ringbuffer: /dev/shm/qb-cib_rw-event-1929-14545-13-header
>     info: validate_with_relaxng:        Creating RNG parser context
>    debug: cib_file_signon:      crm_simulate: Opened connection to local file '/var/lib/pacemaker/cib/shadow.14545'
>     info: cib_file_perform_op_delegate:         cib_query on (null)
>    debug: cib_acl_enabled:      CIB ACL is disabled
>    debug: unpack_config:        STONITH timeout: 60000
>    debug: unpack_config:        STONITH of failed nodes is disabled
>    debug: unpack_config:        Stop all active resources: false
>    debug: unpack_config:        Cluster is symmetric - resources can run anywhere by default
>    debug: unpack_config:        Default stickiness: 0
>   notice: unpack_config:        On loss of CCM Quorum: Ignore
>    debug: unpack_config:        Node scores: 'red' = -INFINITY, 'yellow' = 0, 'green' = 0
>     info: determine_online_status:      Node ACL001 is online
>     info: determine_online_status:      Node ACL002 is online
>    debug: find_anonymous_clone:         Internally renamed res_drbd_1 on ACL001 to res_drbd_1:0
>    debug: find_anonymous_clone:         Internally renamed res_drbd_1 on ACL002 to res_drbd_1:1
> 
> Current cluster status:
> Online: [ ACL001 ACL002 ]
> 
>  Master/Slave Set: ms_drbd_1 [res_drbd_1]
>    debug: native_active:        Resource res_drbd_1:0 active on ACL001
>    debug: native_active:        Resource res_drbd_1:0 active on ACL001
>    debug: native_active:        Resource res_drbd_1:1 active on ACL002
>    debug: native_active:        Resource res_drbd_1:1 active on ACL002
>      Slaves: [ ACL001 ACL002 ]
> 
>     info: clone_print:   Master/Slave Set: ms_drbd_1 [res_drbd_1]
>    debug: native_active:        Resource res_drbd_1:0 active on ACL001
>    debug: native_active:        Resource res_drbd_1:0 active on ACL001
>    debug: native_active:        Resource res_drbd_1:1 active on ACL002
>    debug: native_active:        Resource res_drbd_1:1 active on ACL002
>     info: short_print:       Slaves: [ ACL001 ACL002 ]
>    debug: native_assign_node:   Assigning ACL001 to res_drbd_1:0
>    debug: native_assign_node:   Assigning ACL002 to res_drbd_1:1
>    debug: clone_color:  Allocated 2 ms_drbd_1 instances of a possible 2
>    debug: master_color:         res_drbd_1:0 master score: -1
>    debug: master_color:         res_drbd_1:1 master score: -1
>     info: master_color:         ms_drbd_1: Promoted 0 instances of a possible 1 to master
>    debug: master_create_actions:        Creating actions for ms_drbd_1
>     info: LogActions:   Leave   res_drbd_1:0    (Slave ACL001)
>     info: LogActions:   Leave   res_drbd_1:1    (Slave ACL002)
> Transition Summary:
>     info: LogActions:   Leave   res_drbd_1:0    (Slave ACL001)
>     info: LogActions:   Leave   res_drbd_1:1    (Slave ACL002)
>    debug: cib_file_signoff:     Signing out of the CIB Service
>     info: cib_file_signoff:     Wrote CIB to /var/lib/pacemaker/cib/shadow.14545
>     info: crm_xml_cleanup:      Cleaning up memory from libxml2
> 
> -----Original Message-----
> From: Andrei Borzenkov [mailto:arvidjaar at gmail.com] 
> Sent: 14 June 2015 06:34
> To: Brian D. Lees
> Cc: 'Takehiro Matsushima'; 'Cluster Labs - All topics related to open-source clustering welcomed'
> Subject: Re: [ClusterLabs] MS Promotion Not Working
> 
> В Sat, 13 Jun 2015 13:43:46 +0200
> "Brian D. Lees" <brian at fides.me.uk> пишет:
> 
> > 
> > primitive res_drbd_1 ocf:linbit:drbd \
> > 
> >         params drbd_resource=acl_shared \
> > 
> >         operations $id=res_drbd_1-operations \
> > 
> >         op start interval=0 timeout=240 \
> > 
> >         op promote interval=0 timeout=90 \
> > 
> >         op demote interval=0 timeout=90 \
> > 
> >         op stop interval=0 timeout=100 \
> > 
> >         op monitor interval=10 timeout=20 role=Master \
> > 
> >         op monitor interval=11 timeout=20 role=Slave \
> > 
> >         op notify interval=0 timeout=90
> > 
> > ms ms_drbd_1 res_drbd_1 \
> > 
> >         meta clone-max=2 notify=true interleave=true target-role=Started
> 
> According to pacemaker documentation
> 
> Started - Allow the resource to be started (In the case of multi-state resources, they will not promoted to master) 
> 
> You probably want to have Master here.
> 





More information about the Users mailing list