Hi,<br><br>I'm making some headway finally with my pacemaker install but now that crm_mon doesn't return errors any more and crm_verify is clear, I'm having a problem where my master won't get promoted.  Not sure what to do with this one, any suggestions?   Here's the log snippet and config files:<br>
<br>Sep 26 04:06:12 staging1 crmd: [1686]: info: crm_timer_popped: PEngine Recheck Timer (I_PE_CALC) just popped!<br>Sep 26 04:06:12 staging1 crmd: [1686]: info: do_state_transition: State transition S_IDLE -> S_POLICY_ENGINE [ input=I_PE_CALC cause=C_TIMER_POPPED origin=crm_timer_popped ]<br>
Sep 26 04:06:12 staging1 crmd: [1686]: info: do_state_transition: Progressed to state S_POLICY_ENGINE after C_TIMER_POPPED<br>Sep 26 04:06:12 staging1 crmd: [1686]: info: do_state_transition: All 2 cluster nodes are eligible to run resources.<br>
Sep 26 04:06:12 staging1 crmd: [1686]: info: do_pe_invoke: Query 106: Requesting the current CIB: S_POLICY_ENGINE<br>Sep 26 04:06:12 staging1 crmd: [1686]: info: do_pe_invoke_callback: Invoking the PE: query=106, ref=pe_calc-dc-1317020772-95, seq=2564, quorate=1<br>
Sep 26 04:06:12 staging1 pengine: [1685]: info: unpack_config: Startup probes: enabled<br>Sep 26 04:06:12 staging1 pengine: [1685]: notice: unpack_config: On loss of CCM Quorum: Ignore<br>Sep 26 04:06:12 staging1 pengine: [1685]: info: unpack_config: Node scores: 'red' = -INFINITY, 'yellow' = 0, 'green' = 0<br>
Sep 26 04:06:12 staging1 pengine: [1685]: info: unpack_domains: Unpacking domains<br>Sep 26 04:06:12 staging1 pengine: [1685]: info: determine_online_status: Node <a href="http://staging1.dev.applepeak.com">staging1.dev.applepeak.com</a> is online<br>
Sep 26 04:06:12 staging1 pengine: [1685]: info: determine_online_status: Node <a href="http://staging2.dev.applepeak.com">staging2.dev.applepeak.com</a> is online<br>Sep 26 04:06:12 staging1 pengine: [1685]: notice: group_print:  Resource Group: mysql<br>
Sep 26 04:06:12 staging1 pengine: [1685]: notice: native_print:      fs_mysql#011(ocf::heartbeat:Filesystem):#011Stopped<br>Sep 26 04:06:12 staging1 pengine: [1685]: notice: native_print:      ip_mysql#011(ocf::heartbeat:IPaddr2):#011Stopped<br>
Sep 26 04:06:12 staging1 pengine: [1685]: notice: native_print:      mysqld#011(lsb:mysqld):#011Stopped<br>Sep 26 04:06:12 staging1 pengine: [1685]: notice: clone_print:  Master/Slave Set: ms_drbd_mysql<br>Sep 26 04:06:12 staging1 pengine: [1685]: notice: short_print:      Stopped: [ drbd_mysql:0 drbd_mysql:1 ]<br>
Sep 26 04:06:12 staging1 pengine: [1685]: info: master_color: ms_drbd_mysql: Promoted 0 instances of a possible 1 to master<br>Sep 26 04:06:12 staging1 pengine: [1685]: info: native_merge_weights: fs_mysql: Rolling back scores from ip_mysql<br>
Sep 26 04:06:12 staging1 pengine: [1685]: info: native_merge_weights: ip_mysql: Rolling back scores from mysqld<br>Sep 26 04:06:12 staging1 pengine: [1685]: info: master_color: ms_drbd_mysql: Promoted 0 instances of a possible 1 to master<br>
Sep 26 04:06:12 staging1 pengine: [1685]: notice: LogActions: Leave resource fs_mysql#011(Stopped)<br>Sep 26 04:06:12 staging1 pengine: [1685]: notice: LogActions: Leave resource ip_mysql#011(Stopped)<br>Sep 26 04:06:12 staging1 pengine: [1685]: notice: LogActions: Leave resource mysqld#011(Stopped)<br>
Sep 26 04:06:12 staging1 pengine: [1685]: notice: LogActions: Leave resource drbd_mysql:0#011(Stopped)<br>Sep 26 04:06:12 staging1 pengine: [1685]: notice: LogActions: Leave resource drbd_mysql:1#011(Stopped)<br>Sep 26 04:06:12 staging1 crmd: [1686]: info: do_state_transition: State transition S_POLICY_ENGINE -> S_TRANSITION_ENGINE [ input=I_PE_SUCCESS cause=C_IPC_MESSAGE origin=handle_response ]<br>
Sep 26 04:06:12 staging1 crmd: [1686]: info: unpack_graph: Unpacked transition 72: 0 actions in 0 synapses<br>Sep 26 04:06:12 staging1 crmd: [1686]: info: do_te_invoke: Processing graph 72 (ref=pe_calc-dc-1317020772-95) derived from /var/lib/pengine/pe-input-3802.bz2<br>
Sep 26 04:06:12 staging1 crmd: [1686]: info: run_graph: ====================================================<br>Sep 26 04:06:12 staging1 crmd: [1686]: notice: run_graph: Transition 72 (Complete=0, Pending=0, Fired=0, Skipped=0, Incomplete=0, Source=/var/lib/pengine/pe-input-3802.bz2): Complete<br>
Sep 26 04:06:12 staging1 crmd: [1686]: info: te_graph_trigger: Transition 72 is now complete<br>Sep 26 04:06:12 staging1 crmd: [1686]: info: notify_crmd: Transition 72 status: done - <null><br>Sep 26 04:06:12 staging1 crmd: [1686]: info: do_state_transition: State transition S_TRANSITION_ENGINE -> S_IDLE [ input=I_TE_SUCCESS cause=C_FSA_INTERNAL origin=notify_crmd ]<br>
Sep 26 04:06:12 staging1 crmd: [1686]: info: do_state_transition: Starting PEngine Recheck Timer<br>Sep 26 04:06:12 staging1 pengine: [1685]: info: process_pe_message: Transition 72: PEngine Input stored in: /var/lib/pengine/pe-input-3802.bz2<br>
Sep 26 04:15:09 staging1 cib: [1682]: info: cib_stats: Processed 1 operations (0.00us average, 0% utilization) in the last 10min<br><br>My drbd config file:<br><br>resource mysqld {<br><br>protocol C;<br><br>startup { wfc-timeout 0; degr-wfc-timeout 120; }<br>
<br>disk { on-io-error detach; }<br><br><br>on staging1 {<br><br>device /dev/drbd0;<br><br>disk /dev/vg_staging1/lv_data;<br><br>meta-disk internal;<br><br>address <a href="http://10.10.20.1:7788">10.10.20.1:7788</a>;<br>
<br>}<br><br>on staging2 {<br><br>device /dev/drbd0;<br><br>disk /dev/vg_staging2/lv_data;<br><br>meta-disk internal;<br><br>address <a href="http://10.10.20.2:7788">10.10.20.2:7788</a>;<br><br>}<br><br>}<br><br>corosync.conf:<br>
<br>compatibility: whitetank<br><br>aisexec {<br>  user: root<br>  group: root<br>}<br><br>totem {<br>        version: 2<br>        secauth: off<br>        threads: 0<br>        interface {<br>                ringnumber: 0<br>
                bindnetaddr: 10.10.10.0<br>                mcastaddr: 226.94.1.1<br>                mcastport: 5405<br>        }<br>}<br><br>logging {<br>        fileline: off<br>        to_stderr: no<br>        to_logfile: no<br>
        to_syslog: yes<br>        logfile: /var/log/cluster/corosync.log<br>        debug: off<br>        timestamp: on<br>        logger_subsys {<br>                subsys: AMF<br>                debug: off<br>        }<br>
}<br><br>amf {<br>        mode: disabled<br>}<br><br>service {<br>#Load Pacemaker<br>name: pacemaker<br>ver: 0<br>use_mgmtd: yes<br>}<br><br>And my crm config:<br><br>node <a href="http://staging1.dev.applepeak.com">staging1.dev.applepeak.com</a><br>
node <a href="http://staging2.dev.applepeak.com">staging2.dev.applepeak.com</a><br>primitive drbd_mysql ocf:linbit:drbd \<br>        params drbd_resource="mysqld" \<br>        op monitor interval="15s" \<br>
        op start interval="0" timeout="240s" \<br>        op stop interval="0" timeout="100s"<br>primitive fs_mysql ocf:heartbeat:Filesystem \<br>        params device="/dev/drbd0" directory="/opt/data/mysql/data/mysql" fstype="ext4" \<br>
        op start interval="0" timeout="60s" \<br>        op stop interval="0" timeout="60s"<br>primitive ip_mysql ocf:heartbeat:IPaddr2 \<br>        params ip="10.10.10.31" nic="eth0"<br>
primitive mysqld lsb:mysqld<br>group mysql fs_mysql ip_mysql mysqld<br>ms ms_drbd_mysql drbd_mysql \<br>        meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true"<br>
colocation mysql_on_drbd inf: mysql ms_drbd_mysql:Master<br>order mysql_after_drbd inf: ms_drbd_mysql:promote mysql:start<br>property $id="cib-bootstrap-options" \<br>        dc-version="1.1.2-f059ec7ced7a86f18e5490b67ebf4a0b963bccfe" \<br>
        cluster-infrastructure="openais" \<br>        expected-quorum-votes="2" \<br>        stonith-enabled="false" \<br>        last-lrm-refresh="1316961847" \<br>        stop-all-resources="true" \<br>
        no-quorum-policy="ignore"<br>rsc_defaults $id="rsc-options" \<br>        resource-stickiness="100"<br><br>Thanks,<br>Charles<br>