[Pacemaker] Dual-primary DRBD problem: Promoted 0 instances of a possible 2 to master

Matt Anderson tuxfan at hotmail.com
Tue Aug 2 05:08:30 EDT 2011


Hi!

Sorry for the repost, but the links in my previous message expired.
Now these new ones shouldn't do that. I also added the DC's log at the end
of this message.

I've been trying to make a simple HA cluster with 3 servers (the 3rd server
is there only to maintain quorum if one node fails). The idea is to run two
virtual domains over dedicated DRBD devices in dual-primary mode (so that
live migration would be possible).

Things worked well for a while, but somewhere during my tests something
went wrong and now the DRBD devices don't get promoted to primary mode by
pacemaker. Pacemaker just keeps starting and stopping the devices in a loop.
If I start DRBD from the init script, both devices are started and
automaticly synced. At first I had this problem only with one device, but
now it's the same with both devices under pacemaker.

Pacemaker and DRBD write a lot of logs [1] [2] [3] (these are made when I
try to start ms_drbd_www2, but I don't see a reason why pacemaker doesn't
promote any masters.

My guess is that this has something to do with my fencing rules in DRBD [4]
or then just in my pacemaker config [5]. I used to have STONITH enabled, but
since my STONITH devices share the power supply with the server, I've then
removed those settings from my pacemaker config.

I'm running Debian squeeze on amd64 with pacemaker (1.0.11-1~bpo60+1) and
corosync (1.3.0-3~bpo60+1) from backports.

Any ideas what's wrong and how to fix it?


[1] http://paste.debian.net/124836/ (DRBD log from on node)

[2] http://paste.debian.net/124838/ (pacemaker log from the same node as above)

[3] http://paste.debian.net/124839/ (pacemaker log from DC) 

[4] http://paste.debian.net/124845/ (DRBD common config)

[5] http://paste.debian.net/124846/ (pacemaker config)

Pacemaker log from DC [3]:

Jul 28 22:28:01 s3-1 cibadmin: [10292]: info: Invoked: cibadmin -Ql -o resources 
Jul 28 22:28:01 s3-1 cibadmin: [10295]: info: Invoked: cibadmin -p -R -o resources 
Jul 28 22:28:01 s3-1 cib: [21918]: info: log_data_element: cib:diff: - <cib admin_epoch="0" epoch="439" num_updates="10" >
Jul 28 22:28:01 s3-1 cib: [21918]: info: log_data_element: cib:diff: -   <configuration >
Jul 28 22:28:01 s3-1 cib: [21918]: info: log_data_element: cib:diff: -     <resources >
Jul 28 22:28:01 s3-1 cib: [21918]: info: log_data_element: cib:diff: -       <master id="ms_drbd_www2" >
Jul 28 22:28:01 s3-1 cib: [21918]: info: log_data_element: cib:diff: -         <meta_attributes id="ms_drbd_www2-meta_attributes" >
Jul 28 22:28:01 s3-1 cib: [21918]: info: log_data_element: cib:diff: -           <nvpair value="Stopped" id="ms_drbd_www2-meta_attributes-target-role" />
Jul 28 22:28:01 s3-1 cib: [21918]: info: log_data_element: cib:diff: -         </meta_attributes>
Jul 28 22:28:01 s3-1 cib: [21918]: info: log_data_element: cib:diff: -       </master>
Jul 28 22:28:01 s3-1 cib: [21918]: info: log_data_element: cib:diff: -     </resources>
Jul 28 22:28:01 s3-1 cib: [21918]: info: log_data_element: cib:diff: -   </configuration>
Jul 28 22:28:01 s3-1 cib: [21918]: info: log_data_element: cib:diff: - </cib>
Jul 28 22:28:01 s3-1 cib: [21918]: info: log_data_element: cib:diff: + <cib admin_epoch="0" epoch="440" num_updates="1" >
Jul 28 22:28:01 s3-1 cib: [21918]: info: log_data_element: cib:diff: +   <configuration >
Jul 28 22:28:01 s3-1 cib: [21918]: info: log_data_element: cib:diff: +     <resources >
Jul 28 22:28:01 s3-1 cib: [21918]: info: log_data_element: cib:diff: +       <master id="ms_drbd_www2" >
Jul 28 22:28:01 s3-1 cib: [21918]: info: log_data_element: cib:diff: +         <meta_attributes id="ms_drbd_www2-meta_attributes" >
Jul 28 22:28:01 s3-1 cib: [21918]: info: log_data_element: cib:diff: +           <nvpair value="Started" id="ms_drbd_www2-meta_attributes-target-role" />
Jul 28 22:28:01 s3-1 cib: [21918]: info: log_data_element: cib:diff: +         </meta_attributes>
Jul 28 22:28:01 s3-1 cib: [21918]: info: log_data_element: cib:diff: +       </master>
Jul 28 22:28:01 s3-1 cib: [21918]: info: log_data_element: cib:diff: +     </resources>
Jul 28 22:28:01 s3-1 cib: [21918]: info: log_data_element: cib:diff: +   </configuration>
Jul 28 22:28:01 s3-1 cib: [21918]: info: log_data_element: cib:diff: + </cib>
Jul 28 22:28:01 s3-1 cib: [21918]: info: cib_process_request: Operation complete: op cib_replace for section resources (origin=local/cibadmin/2, version=0.440.1): ok (rc=0)
Jul 28 22:28:01 s3-1 crmd: [21922]: info: abort_transition_graph: need_abort:59 - Triggered transition abort (complete=1) : Non-status change
Jul 28 22:28:01 s3-1 crmd: [21922]: info: need_abort: Aborting on change to admin_epoch
Jul 28 22:28:01 s3-1 crmd: [21922]: info: do_state_transition: State transition S_IDLE -> S_POLICY_ENGINE [ input=I_PE_CALC cause=C_FSA_INTERNAL origin=abort_transition_graph ]
Jul 28 22:28:01 s3-1 crmd: [21922]: info: do_state_transition: All 3 cluster nodes are eligible to run resources.
Jul 28 22:28:01 s3-1 crmd: [21922]: info: do_pe_invoke: Query 1845: Requesting the current CIB: S_POLICY_ENGINE
Jul 28 22:28:01 s3-1 crmd: [21922]: info: do_pe_invoke_callback: Invoking the PE: query=1845, ref=pe_calc-dc-1311881281-3699, seq=190040, quorate=1
Jul 28 22:28:01 s3-1 pengine: [21921]: info: unpack_config: Node scores: 'red' = -INFINITY, 'yellow' = 0, 'green' = 0
Jul 28 22:28:01 s3-1 pengine: [21921]: info: determine_online_status: Node s3 is online
Jul 28 22:28:01 s3-1 pengine: [21921]: info: determine_online_status: Node s1 is online
Jul 28 22:28:01 s3-1 pengine: [21921]: info: determine_online_status: Node s2 is online
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: clone_print:  Master/Slave Set: ms_drbd_www
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: short_print:      Stopped: [ drbd_www:0 drbd_www:1 ]
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: clone_print:  Master/Slave Set: ms_drbd_www2
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: short_print:      Stopped: [ drbd_www2:0 drbd_www2:1 ]
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: native_print: www-server#011(ocf::heartbeat:VirtualDomain):#011Stopped 
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: native_print: www2-server#011(ocf::heartbeat:VirtualDomain):#011Stopped 
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: native_print: www2-mailto#011(ocf::heartbeat:MailTo):#011Stopped 
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: native_print: www-mailto#011(ocf::heartbeat:MailTo):#011Stopped 
Jul 28 22:28:01 s3-1 pengine: [21921]: info: native_color: Resource drbd_www:0 cannot run anywhere
Jul 28 22:28:01 s3-1 pengine: [21921]: info: native_color: Resource drbd_www:1 cannot run anywhere
Jul 28 22:28:01 s3-1 pengine: [21921]: info: master_color: ms_drbd_www: Promoted 0 instances of a possible 2 to master
Jul 28 22:28:01 s3-1 pengine: [21921]: info: master_color: ms_drbd_www2: Promoted 0 instances of a possible 2 to master
Jul 28 22:28:01 s3-1 pengine: [21921]: info: master_color: ms_drbd_www: Promoted 0 instances of a possible 2 to master
Jul 28 22:28:01 s3-1 pengine: [21921]: info: rsc_merge_weights: www-server: Rolling back scores from www-mailto
Jul 28 22:28:01 s3-1 pengine: [21921]: info: native_color: Resource www-server cannot run anywhere
Jul 28 22:28:01 s3-1 pengine: [21921]: info: master_color: ms_drbd_www2: Promoted 0 instances of a possible 2 to master
Jul 28 22:28:01 s3-1 pengine: [21921]: info: rsc_merge_weights: www2-server: Rolling back scores from www2-mailto
Jul 28 22:28:01 s3-1 pengine: [21921]: info: native_color: Resource www2-server cannot run anywhere
Jul 28 22:28:01 s3-1 pengine: [21921]: info: native_color: Resource www2-mailto cannot run anywhere
Jul 28 22:28:01 s3-1 pengine: [21921]: info: native_color: Resource www-mailto cannot run anywhere
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: RecurringOp:  Start recurring monitor (15s) for drbd_www2:0 on s2
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: RecurringOp:  Start recurring monitor (15s) for drbd_www2:1 on s1
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: RecurringOp:  Start recurring monitor (15s) for drbd_www2:0 on s2
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: RecurringOp:  Start recurring monitor (15s) for drbd_www2:1 on s1
Jul 28 22:28:01 s3-1 pengine: [21921]: ERROR: clone_rsc_order_rh_non_clone: Unknown action: www-server_demote_0
Jul 28 22:28:01 s3-1 pengine: [21921]: ERROR: clone_rsc_order_rh_non_clone: Unknown action: www2-server_demote_0
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: LogActions: Leave   resource drbd_www:0#011(Stopped)
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: LogActions: Leave   resource drbd_www:1#011(Stopped)
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: LogActions: Start   drbd_www2:0#011(s2)
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: LogActions: Start   drbd_www2:1#011(s1)
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: LogActions: Leave   resource www-server#011(Stopped)
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: LogActions: Leave   resource www2-server#011(Stopped)
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: LogActions: Leave   resource www2-mailto#011(Stopped)
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: LogActions: Leave   resource www-mailto#011(Stopped)
Jul 28 22:28:01 s3-1 crmd: [21922]: info: do_state_transition: State transition S_POLICY_ENGINE -> S_TRANSITION_ENGINE [ input=I_PE_SUCCESS cause=C_IPC_MESSAGE origin=handle_response ]
Jul 28 22:28:01 s3-1 crmd: [21922]: info: unpack_graph: Unpacked transition 1548: 12 actions in 12 synapses
Jul 28 22:28:01 s3-1 crmd: [21922]: info: do_te_invoke: Processing graph 1548 (ref=pe_calc-dc-1311881281-3699) derived from /var/lib/pengine/pe-input-9218.bz2
Jul 28 22:28:01 s3-1 crmd: [21922]: info: te_pseudo_action: Pseudo action 36 fired and confirmed
Jul 28 22:28:01 s3-1 crmd: [21922]: info: te_pseudo_action: Pseudo action 37 fired and confirmed
Jul 28 22:28:01 s3-1 crmd: [21922]: info: te_pseudo_action: Pseudo action 34 fired and confirmed
Jul 28 22:28:01 s3-1 crmd: [21922]: info: te_rsc_command: Initiating action 30: start drbd_www2:0_start_0 on s2
Jul 28 22:28:01 s3-1 crmd: [21922]: info: match_graph_event: Action drbd_www2:0_start_0 (30) confirmed on s2 (rc=0)
Jul 28 22:28:01 s3-1 crmd: [21922]: info: te_rsc_command: Initiating action 32: start drbd_www2:1_start_0 on s1
Jul 28 22:28:01 s3-1 crmd: [21922]: info: abort_transition_graph: te_update_diff:150 - Triggered transition abort (complete=0, tag=nvpair, id=status-s1-master-drbd_www2:1, magic=NA, cib=0.440.3) : Transient attribute: update
Jul 28 22:28:01 s3-1 crmd: [21922]: info: update_abort_priority: Abort priority upgraded from 0 to 1000000
Jul 28 22:28:01 s3-1 crmd: [21922]: info: update_abort_priority: Abort action done superceeded by restart
Jul 28 22:28:01 s3-1 crmd: [21922]: info: match_graph_event: Action drbd_www2:1_start_0 (32) confirmed on s1 (rc=0)
Jul 28 22:28:01 s3-1 crmd: [21922]: info: te_pseudo_action: Pseudo action 35 fired and confirmed
Jul 28 22:28:01 s3-1 crmd: [21922]: info: te_pseudo_action: Pseudo action 38 fired and confirmed
Jul 28 22:28:01 s3-1 crmd: [21922]: info: te_rsc_command: Initiating action 80: notify drbd_www2:0_post_notify_start_0 on s2
Jul 28 22:28:01 s3-1 crmd: [21922]: info: match_graph_event: Action drbd_www2:0_post_notify_start_0 (80) confirmed on s2 (rc=0)
Jul 28 22:28:01 s3-1 crmd: [21922]: info: te_rsc_command: Initiating action 81: notify drbd_www2:1_post_notify_start_0 on s1
Jul 28 22:28:01 s3-1 crmd: [21922]: info: match_graph_event: Action drbd_www2:1_post_notify_start_0 (81) confirmed on s1 (rc=0)
Jul 28 22:28:01 s3-1 crmd: [21922]: info: te_pseudo_action: Pseudo action 39 fired and confirmed
Jul 28 22:28:01 s3-1 crmd: [21922]: info: run_graph: ====================================================
Jul 28 22:28:01 s3-1 crmd: [21922]: notice: run_graph: Transition 1548 (Complete=10, Pending=0, Fired=0, Skipped=2, Incomplete=0, Source=/var/lib/pengine/pe-input-9218.bz2): Stopped
Jul 28 22:28:01 s3-1 crmd: [21922]: info: te_graph_trigger: Transition 1548 is now complete
Jul 28 22:28:01 s3-1 crmd: [21922]: info: do_state_transition: State transition S_TRANSITION_ENGINE -> S_POLICY_ENGINE [ input=I_PE_CALC cause=C_FSA_INTERNAL origin=notify_crmd ]
Jul 28 22:28:01 s3-1 crmd: [21922]: info: do_state_transition: All 3 cluster nodes are eligible to run resources.
Jul 28 22:28:01 s3-1 crmd: [21922]: info: do_pe_invoke: Query 1846: Requesting the current CIB: S_POLICY_ENGINE
Jul 28 22:28:01 s3-1 crmd: [21922]: info: do_pe_invoke_callback: Invoking the PE: query=1846, ref=pe_calc-dc-1311881281-3704, seq=190040, quorate=1
Jul 28 22:28:01 s3-1 cib: [10296]: info: write_cib_contents: Archived previous version as /var/lib/heartbeat/crm/cib-77.raw
Jul 28 22:28:01 s3-1 pengine: [21921]: info: process_pe_message: Transition 1548: PEngine Input stored in: /var/lib/pengine/pe-input-9218.bz2
Jul 28 22:28:01 s3-1 pengine: [21921]: info: unpack_config: Node scores: 'red' = -INFINITY, 'yellow' = 0, 'green' = 0
Jul 28 22:28:01 s3-1 pengine: [21921]: info: determine_online_status: Node s3 is online
Jul 28 22:28:01 s3-1 pengine: [21921]: info: determine_online_status: Node s1 is online
Jul 28 22:28:01 s3-1 pengine: [21921]: info: determine_online_status: Node s2 is online
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: clone_print:  Master/Slave Set: ms_drbd_www
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: short_print:      Stopped: [ drbd_www:0 drbd_www:1 ]
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: clone_print:  Master/Slave Set: ms_drbd_www2
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: short_print:      Slaves: [ s2 s1 ]
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: native_print: www-server#011(ocf::heartbeat:VirtualDomain):#011Stopped 
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: native_print: www2-server#011(ocf::heartbeat:VirtualDomain):#011Stopped 
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: native_print: www2-mailto#011(ocf::heartbeat:MailTo):#011Stopped 
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: native_print: www-mailto#011(ocf::heartbeat:MailTo):#011Stopped 
Jul 28 22:28:01 s3-1 pengine: [21921]: info: native_color: Resource drbd_www:0 cannot run anywhere
Jul 28 22:28:01 s3-1 pengine: [21921]: info: native_color: Resource drbd_www:1 cannot run anywhere
Jul 28 22:28:01 s3-1 pengine: [21921]: info: master_color: ms_drbd_www: Promoted 0 instances of a possible 2 to master
Jul 28 22:28:01 s3-1 pengine: [21921]: info: master_color: ms_drbd_www2: Promoted 0 instances of a possible 2 to master
Jul 28 22:28:01 s3-1 pengine: [21921]: info: master_color: ms_drbd_www: Promoted 0 instances of a possible 2 to master
Jul 28 22:28:01 s3-1 pengine: [21921]: info: rsc_merge_weights: www-server: Rolling back scores from www-mailto
Jul 28 22:28:01 s3-1 pengine: [21921]: info: native_color: Resource www-server cannot run anywhere
Jul 28 22:28:01 s3-1 pengine: [21921]: info: master_color: ms_drbd_www2: Promoted 0 instances of a possible 2 to master
Jul 28 22:28:01 s3-1 pengine: [21921]: info: rsc_merge_weights: www2-server: Rolling back scores from www2-mailto
Jul 28 22:28:01 s3-1 pengine: [21921]: info: native_color: Resource www2-server cannot run anywhere
Jul 28 22:28:01 s3-1 pengine: [21921]: info: native_color: Resource www2-mailto cannot run anywhere
Jul 28 22:28:01 s3-1 pengine: [21921]: info: native_color: Resource www-mailto cannot run anywhere
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: RecurringOp:  Start recurring monitor (15s) for drbd_www2:0 on s1
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: RecurringOp:  Start recurring monitor (15s) for drbd_www2:1 on s2
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: RecurringOp:  Start recurring monitor (15s) for drbd_www2:0 on s1
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: RecurringOp:  Start recurring monitor (15s) for drbd_www2:1 on s2
Jul 28 22:28:01 s3-1 pengine: [21921]: ERROR: clone_rsc_order_rh_non_clone: Unknown action: www-server_demote_0
Jul 28 22:28:01 s3-1 pengine: [21921]: ERROR: clone_rsc_order_rh_non_clone: Unknown action: www2-server_demote_0
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: LogActions: Leave   resource drbd_www:0#011(Stopped)
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: LogActions: Leave   resource drbd_www:1#011(Stopped)
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: LogActions: Move    resource drbd_www2:0#011(Slave s2 -> s1)
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: LogActions: Move    resource drbd_www2:1#011(Slave s1 -> s2)
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: LogActions: Leave   resource www-server#011(Stopped)
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: LogActions: Leave   resource www2-server#011(Stopped)
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: LogActions: Leave   resource www2-mailto#011(Stopped)
Jul 28 22:28:01 s3-1 pengine: [21921]: notice: LogActions: Leave   resource www-mailto#011(Stopped)
Jul 28 22:28:01 s3-1 crmd: [21922]: info: do_state_transition: State transition S_POLICY_ENGINE -> S_TRANSITION_ENGINE [ input=I_PE_SUCCESS cause=C_IPC_MESSAGE origin=handle_response ]
Jul 28 22:28:01 s3-1 crmd: [21922]: info: unpack_graph: Unpacked transition 1549: 23 actions in 23 synapses
Jul 28 22:28:01 s3-1 crmd: [21922]: info: do_te_invoke: Processing graph 1549 (ref=pe_calc-dc-1311881281-3704) derived from /var/lib/pengine/pe-input-9219.bz2
Jul 28 22:28:01 s3-1 crmd: [21922]: info: te_pseudo_action: Pseudo action 46 fired and confirmed
Jul 28 22:28:01 s3-1 crmd: [21922]: info: te_rsc_command: Initiating action 82: notify drbd_www2:0_pre_notify_stop_0 on s2
Jul 28 22:28:01 s3-1 pengine: [21921]: info: process_pe_message: Transition 1549: PEngine Input stored in: /var/lib/pengine/pe-input-9219.bz2
Jul 28 22:28:01 s3-1 cib: [10296]: info: write_cib_contents: Wrote version 0.440.0 of the CIB to disk (digest: 3fa86d20299acf9247c14b5760f9b9c3)
Jul 28 22:28:01 s3-1 crmd: [21922]: info: match_graph_event: Action drbd_www2:0_pre_notify_stop_0 (82) confirmed on s2 (rc=0)
Jul 28 22:28:01 s3-1 crmd: [21922]: info: te_rsc_command: Initiating action 83: notify drbd_www2:1_pre_notify_stop_0 on s1
Jul 28 22:28:01 s3-1 cib: [10296]: info: retrieveCib: Reading cluster configuration from: /var/lib/heartbeat/crm/cib.sLA4uT (digest: /var/lib/heartbeat/crm/cib.tXdeLK)
Jul 28 22:28:01 s3-1 crmd: [21922]: info: match_graph_event: Action drbd_www2:1_pre_notify_stop_0 (83) confirmed on s1 (rc=0)
Jul 28 22:28:01 s3-1 crmd: [21922]: info: te_pseudo_action: Pseudo action 47 fired and confirmed
Jul 28 22:28:01 s3-1 crmd: [21922]: info: te_pseudo_action: Pseudo action 44 fired and confirmed
Jul 28 22:28:01 s3-1 crmd: [21922]: info: te_rsc_command: Initiating action 31: stop drbd_www2:0_stop_0 on s2
Jul 28 22:28:02 s3-1 crmd: [21922]: info: match_graph_event: Action drbd_www2:0_stop_0 (31) confirmed on s2 (rc=0)
Jul 28 22:28:02 s3-1 crmd: [21922]: info: te_rsc_command: Initiating action 35: stop drbd_www2:1_stop_0 on s1
Jul 28 22:28:02 s3-1 crmd: [21922]: info: abort_transition_graph: te_update_diff:164 - Triggered transition abort (complete=0, tag=transient_attributes, id=s1, magic=NA, cib=0.440.10) : Transient attribute: removal
Jul 28 22:28:02 s3-1 crmd: [21922]: info: update_abort_priority: Abort priority upgraded from 0 to 1000000
Jul 28 22:28:02 s3-1 crmd: [21922]: info: update_abort_priority: Abort action done superceeded by restart
Jul 28 22:28:02 s3-1 crmd: [21922]: info: match_graph_event: Action drbd_www2:1_stop_0 (35) confirmed on s1 (rc=0)
Jul 28 22:28:02 s3-1 crmd: [21922]: info: te_pseudo_action: Pseudo action 45 fired and confirmed
Jul 28 22:28:02 s3-1 crmd: [21922]: info: te_pseudo_action: Pseudo action 48 fired and confirmed
Jul 28 22:28:02 s3-1 crmd: [21922]: info: te_pseudo_action: Pseudo action 49 fired and confirmed
Jul 28 22:28:02 s3-1 crmd: [21922]: info: run_graph: ====================================================
Jul 28 22:28:02 s3-1 crmd: [21922]: notice: run_graph: Transition 1549 (Complete=10, Pending=0, Fired=0, Skipped=8, Incomplete=5, Source=/var/lib/pengine/pe-input-9219.bz2): Stopped
Jul 28 22:28:02 s3-1 crmd: [21922]: info: te_graph_trigger: Transition 1549 is now complete
Jul 28 22:28:02 s3-1 crmd: [21922]: info: do_state_transition: State transition S_TRANSITION_ENGINE -> S_POLICY_ENGINE [ input=I_PE_CALC cause=C_FSA_INTERNAL origin=notify_crmd ]
Jul 28 22:28:02 s3-1 crmd: [21922]: info: do_state_transition: All 3 cluster nodes are eligible to run resources.
Jul 28 22:28:02 s3-1 crmd: [21922]: info: do_pe_invoke: Query 1847: Requesting the current CIB: S_POLICY_ENGINE
Jul 28 22:28:02 s3-1 crmd: [21922]: info: do_pe_invoke_callback: Invoking the PE: query=1847, ref=pe_calc-dc-1311881282-3709, seq=190040, quorate=1
Jul 28 22:28:02 s3-1 pengine: [21921]: info: unpack_config: Node scores: 'red' = -INFINITY, 'yellow' = 0, 'green' = 0
Jul 28 22:28:02 s3-1 pengine: [21921]: info: determine_online_status: Node s3 is online
Jul 28 22:28:02 s3-1 pengine: [21921]: info: determine_online_status: Node s1 is online
Jul 28 22:28:02 s3-1 pengine: [21921]: info: determine_online_status: Node s2 is online
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: clone_print:  Master/Slave Set: ms_drbd_www
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: short_print:      Stopped: [ drbd_www:0 drbd_www:1 ]
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: clone_print:  Master/Slave Set: ms_drbd_www2
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: short_print:      Stopped: [ drbd_www2:0 drbd_www2:1 ]
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: native_print: www-server#011(ocf::heartbeat:VirtualDomain):#011Stopped 
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: native_print: www2-server#011(ocf::heartbeat:VirtualDomain):#011Stopped 
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: native_print: www2-mailto#011(ocf::heartbeat:MailTo):#011Stopped 
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: native_print: www-mailto#011(ocf::heartbeat:MailTo):#011Stopped 
Jul 28 22:28:02 s3-1 pengine: [21921]: info: native_color: Resource drbd_www:0 cannot run anywhere
Jul 28 22:28:02 s3-1 pengine: [21921]: info: native_color: Resource drbd_www:1 cannot run anywhere
Jul 28 22:28:02 s3-1 pengine: [21921]: info: master_color: ms_drbd_www: Promoted 0 instances of a possible 2 to master
Jul 28 22:28:02 s3-1 pengine: [21921]: info: master_color: ms_drbd_www2: Promoted 0 instances of a possible 2 to master
Jul 28 22:28:02 s3-1 pengine: [21921]: info: master_color: ms_drbd_www: Promoted 0 instances of a possible 2 to master
Jul 28 22:28:02 s3-1 pengine: [21921]: info: rsc_merge_weights: www-server: Rolling back scores from www-mailto
Jul 28 22:28:02 s3-1 pengine: [21921]: info: native_color: Resource www-server cannot run anywhere
Jul 28 22:28:02 s3-1 pengine: [21921]: info: master_color: ms_drbd_www2: Promoted 0 instances of a possible 2 to master
Jul 28 22:28:02 s3-1 pengine: [21921]: info: rsc_merge_weights: www2-server: Rolling back scores from www2-mailto
Jul 28 22:28:02 s3-1 pengine: [21921]: info: native_color: Resource www2-server cannot run anywhere
Jul 28 22:28:02 s3-1 pengine: [21921]: info: native_color: Resource www2-mailto cannot run anywhere
Jul 28 22:28:02 s3-1 pengine: [21921]: info: native_color: Resource www-mailto cannot run anywhere
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: RecurringOp:  Start recurring monitor (15s) for drbd_www2:0 on s2
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: RecurringOp:  Start recurring monitor (15s) for drbd_www2:1 on s1
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: RecurringOp:  Start recurring monitor (15s) for drbd_www2:0 on s2
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: RecurringOp:  Start recurring monitor (15s) for drbd_www2:1 on s1
Jul 28 22:28:02 s3-1 pengine: [21921]: ERROR: clone_rsc_order_rh_non_clone: Unknown action: www-server_demote_0
Jul 28 22:28:02 s3-1 pengine: [21921]: ERROR: clone_rsc_order_rh_non_clone: Unknown action: www2-server_demote_0
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: LogActions: Leave   resource drbd_www:0#011(Stopped)
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: LogActions: Leave   resource drbd_www:1#011(Stopped)
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: LogActions: Start   drbd_www2:0#011(s2)
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: LogActions: Start   drbd_www2:1#011(s1)
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: LogActions: Leave   resource www-server#011(Stopped)
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: LogActions: Leave   resource www2-server#011(Stopped)
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: LogActions: Leave   resource www2-mailto#011(Stopped)
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: LogActions: Leave   resource www-mailto#011(Stopped)
Jul 28 22:28:02 s3-1 crmd: [21922]: info: do_state_transition: State transition S_POLICY_ENGINE -> S_TRANSITION_ENGINE [ input=I_PE_SUCCESS cause=C_IPC_MESSAGE origin=handle_response ]
Jul 28 22:28:02 s3-1 crmd: [21922]: info: unpack_graph: Unpacked transition 1550: 12 actions in 12 synapses
Jul 28 22:28:02 s3-1 crmd: [21922]: info: do_te_invoke: Processing graph 1550 (ref=pe_calc-dc-1311881282-3709) derived from /var/lib/pengine/pe-input-9220.bz2
Jul 28 22:28:02 s3-1 crmd: [21922]: info: te_pseudo_action: Pseudo action 36 fired and confirmed
Jul 28 22:28:02 s3-1 crmd: [21922]: info: te_pseudo_action: Pseudo action 37 fired and confirmed
Jul 28 22:28:02 s3-1 crmd: [21922]: info: te_pseudo_action: Pseudo action 34 fired and confirmed
Jul 28 22:28:02 s3-1 crmd: [21922]: info: te_rsc_command: Initiating action 30: start drbd_www2:0_start_0 on s2
Jul 28 22:28:02 s3-1 pengine: [21921]: info: process_pe_message: Transition 1550: PEngine Input stored in: /var/lib/pengine/pe-input-9220.bz2
Jul 28 22:28:02 s3-1 crmd: [21922]: info: match_graph_event: Action drbd_www2:0_start_0 (30) confirmed on s2 (rc=0)
Jul 28 22:28:02 s3-1 crmd: [21922]: info: te_rsc_command: Initiating action 32: start drbd_www2:1_start_0 on s1
Jul 28 22:28:02 s3-1 crmd: [21922]: info: abort_transition_graph: te_update_diff:150 - Triggered transition abort (complete=0, tag=nvpair, id=status-s1-master-drbd_www2:1, magic=NA, cib=0.440.13) : Transient attribute: update
Jul 28 22:28:02 s3-1 crmd: [21922]: info: update_abort_priority: Abort priority upgraded from 0 to 1000000
Jul 28 22:28:02 s3-1 crmd: [21922]: info: update_abort_priority: Abort action done superceeded by restart
Jul 28 22:28:02 s3-1 crmd: [21922]: info: match_graph_event: Action drbd_www2:1_start_0 (32) confirmed on s1 (rc=0)
Jul 28 22:28:02 s3-1 crmd: [21922]: info: te_pseudo_action: Pseudo action 35 fired and confirmed
Jul 28 22:28:02 s3-1 crmd: [21922]: info: te_pseudo_action: Pseudo action 38 fired and confirmed
Jul 28 22:28:02 s3-1 crmd: [21922]: info: te_rsc_command: Initiating action 80: notify drbd_www2:0_post_notify_start_0 on s2
Jul 28 22:28:02 s3-1 crmd: [21922]: info: match_graph_event: Action drbd_www2:0_post_notify_start_0 (80) confirmed on s2 (rc=0)
Jul 28 22:28:02 s3-1 crmd: [21922]: info: te_rsc_command: Initiating action 81: notify drbd_www2:1_post_notify_start_0 on s1
Jul 28 22:28:02 s3-1 crmd: [21922]: info: match_graph_event: Action drbd_www2:1_post_notify_start_0 (81) confirmed on s1 (rc=0)
Jul 28 22:28:02 s3-1 crmd: [21922]: info: te_pseudo_action: Pseudo action 39 fired and confirmed
Jul 28 22:28:02 s3-1 crmd: [21922]: info: run_graph: ====================================================
Jul 28 22:28:02 s3-1 crmd: [21922]: notice: run_graph: Transition 1550 (Complete=10, Pending=0, Fired=0, Skipped=2, Incomplete=0, Source=/var/lib/pengine/pe-input-9220.bz2): Stopped
Jul 28 22:28:02 s3-1 crmd: [21922]: info: te_graph_trigger: Transition 1550 is now complete
Jul 28 22:28:02 s3-1 crmd: [21922]: info: do_state_transition: State transition S_TRANSITION_ENGINE -> S_POLICY_ENGINE [ input=I_PE_CALC cause=C_FSA_INTERNAL origin=notify_crmd ]
Jul 28 22:28:02 s3-1 crmd: [21922]: info: do_state_transition: All 3 cluster nodes are eligible to run resources.
Jul 28 22:28:02 s3-1 crmd: [21922]: info: do_pe_invoke: Query 1848: Requesting the current CIB: S_POLICY_ENGINE
Jul 28 22:28:02 s3-1 crmd: [21922]: info: do_pe_invoke_callback: Invoking the PE: query=1848, ref=pe_calc-dc-1311881282-3714, seq=190040, quorate=1
Jul 28 22:28:02 s3-1 pengine: [21921]: info: unpack_config: Node scores: 'red' = -INFINITY, 'yellow' = 0, 'green' = 0
Jul 28 22:28:02 s3-1 pengine: [21921]: info: determine_online_status: Node s3 is online
Jul 28 22:28:02 s3-1 pengine: [21921]: info: determine_online_status: Node s1 is online
Jul 28 22:28:02 s3-1 pengine: [21921]: info: determine_online_status: Node s2 is online
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: clone_print:  Master/Slave Set: ms_drbd_www
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: short_print:      Stopped: [ drbd_www:0 drbd_www:1 ]
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: clone_print:  Master/Slave Set: ms_drbd_www2
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: short_print:      Slaves: [ s2 s1 ]
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: native_print: www-server#011(ocf::heartbeat:VirtualDomain):#011Stopped 
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: native_print: www2-server#011(ocf::heartbeat:VirtualDomain):#011Stopped 
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: native_print: www2-mailto#011(ocf::heartbeat:MailTo):#011Stopped 
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: native_print: www-mailto#011(ocf::heartbeat:MailTo):#011Stopped 
Jul 28 22:28:02 s3-1 pengine: [21921]: info: native_color: Resource drbd_www:0 cannot run anywhere
Jul 28 22:28:02 s3-1 pengine: [21921]: info: native_color: Resource drbd_www:1 cannot run anywhere
Jul 28 22:28:02 s3-1 pengine: [21921]: info: master_color: ms_drbd_www: Promoted 0 instances of a possible 2 to master
Jul 28 22:28:02 s3-1 pengine: [21921]: info: master_color: ms_drbd_www2: Promoted 0 instances of a possible 2 to master
Jul 28 22:28:02 s3-1 pengine: [21921]: info: master_color: ms_drbd_www: Promoted 0 instances of a possible 2 to master
Jul 28 22:28:02 s3-1 pengine: [21921]: info: rsc_merge_weights: www-server: Rolling back scores from www-mailto
Jul 28 22:28:02 s3-1 pengine: [21921]: info: native_color: Resource www-server cannot run anywhere
Jul 28 22:28:02 s3-1 pengine: [21921]: info: master_color: ms_drbd_www2: Promoted 0 instances of a possible 2 to master
Jul 28 22:28:02 s3-1 pengine: [21921]: info: rsc_merge_weights: www2-server: Rolling back scores from www2-mailto
Jul 28 22:28:02 s3-1 pengine: [21921]: info: native_color: Resource www2-server cannot run anywhere
Jul 28 22:28:02 s3-1 pengine: [21921]: info: native_color: Resource www2-mailto cannot run anywhere
Jul 28 22:28:02 s3-1 pengine: [21921]: info: native_color: Resource www-mailto cannot run anywhere
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: RecurringOp:  Start recurring monitor (15s) for drbd_www2:0 on s1
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: RecurringOp:  Start recurring monitor (15s) for drbd_www2:1 on s2
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: RecurringOp:  Start recurring monitor (15s) for drbd_www2:0 on s1
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: RecurringOp:  Start recurring monitor (15s) for drbd_www2:1 on s2
Jul 28 22:28:02 s3-1 pengine: [21921]: ERROR: clone_rsc_order_rh_non_clone: Unknown action: www-server_demote_0
Jul 28 22:28:02 s3-1 pengine: [21921]: ERROR: clone_rsc_order_rh_non_clone: Unknown action: www2-server_demote_0
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: LogActions: Leave   resource drbd_www:0#011(Stopped)
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: LogActions: Leave   resource drbd_www:1#011(Stopped)
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: LogActions: Move    resource drbd_www2:0#011(Slave s2 -> s1)
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: LogActions: Move    resource drbd_www2:1#011(Slave s1 -> s2)
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: LogActions: Leave   resource www-server#011(Stopped)
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: LogActions: Leave   resource www2-server#011(Stopped)
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: LogActions: Leave   resource www2-mailto#011(Stopped)
Jul 28 22:28:02 s3-1 pengine: [21921]: notice: LogActions: Leave   resource www-mailto#011(Stopped)
Jul 28 22:28:02 s3-1 crmd: [21922]: info: do_state_transition: State transition S_POLICY_ENGINE -> S_TRANSITION_ENGINE [ input=I_PE_SUCCESS cause=C_IPC_MESSAGE origin=handle_response ]
Jul 28 22:28:02 s3-1 crmd: [21922]: info: unpack_graph: Unpacked transition 1551: 23 actions in 23 synapses
Jul 28 22:28:02 s3-1 crmd: [21922]: info: do_te_invoke: Processing graph 1551 (ref=pe_calc-dc-1311881282-3714) derived from /var/lib/pengine/pe-input-9221.bz2
Jul 28 22:28:02 s3-1 crmd: [21922]: info: te_pseudo_action: Pseudo action 46 fired and confirmed
Jul 28 22:28:02 s3-1 crmd: [21922]: info: te_rsc_command: Initiating action 82: notify drbd_www2:0_pre_notify_stop_0 on s2
Jul 28 22:28:02 s3-1 pengine: [21921]: info: process_pe_message: Transition 1551: PEngine Input stored in: /var/lib/pengine/pe-input-9221.bz2
Jul 28 22:28:02 s3-1 crmd: [21922]: info: match_graph_event: Action drbd_www2:0_pre_notify_stop_0 (82) confirmed on s2 (rc=0)
Jul 28 22:28:02 s3-1 crmd: [21922]: info: te_rsc_command: Initiating action 83: notify drbd_www2:1_pre_notify_stop_0 on s1
Jul 28 22:28:02 s3-1 crmd: [21922]: info: match_graph_event: Action drbd_www2:1_pre_notify_stop_0 (83) confirmed on s1 (rc=0)
Jul 28 22:28:02 s3-1 crmd: [21922]: info: te_pseudo_action: Pseudo action 47 fired and confirmed
Jul 28 22:28:02 s3-1 crmd: [21922]: info: te_pseudo_action: Pseudo action 44 fired and confirmed
Jul 28 22:28:02 s3-1 crmd: [21922]: info: te_rsc_command: Initiating action 31: stop drbd_www2:0_stop_0 on s2
Jul 28 22:28:03 s3-1 crmd: [21922]: info: match_graph_event: Action drbd_www2:0_stop_0 (31) confirmed on s2 (rc=0)
Jul 28 22:28:03 s3-1 crmd: [21922]: info: te_rsc_command: Initiating action 35: stop drbd_www2:1_stop_0 on s1
Jul 28 22:28:03 s3-1 crmd: [21922]: info: abort_transition_graph: te_update_diff:164 - Triggered transition abort (complete=0, tag=transient_attributes, id=s1, magic=NA, cib=0.440.20) : Transient attribute: removal
Jul 28 22:28:03 s3-1 crmd: [21922]: info: update_abort_priority: Abort priority upgraded from 0 to 1000000
Jul 28 22:28:03 s3-1 crmd: [21922]: info: update_abort_priority: Abort action done superceeded by restart
Jul 28 22:28:03 s3-1 crmd: [21922]: info: match_graph_event: Action drbd_www2:1_stop_0 (35) confirmed on s1 (rc=0)
Jul 28 22:28:03 s3-1 crmd: [21922]: info: te_pseudo_action: Pseudo action 45 fired and confirmed
Jul 28 22:28:03 s3-1 crmd: [21922]: info: te_pseudo_action: Pseudo action 48 fired and confirmed
Jul 28 22:28:03 s3-1 crmd: [21922]: info: te_pseudo_action: Pseudo action 49 fired and confirmed
Jul 28 22:28:03 s3-1 crmd: [21922]: info: run_graph: ====================================================

 		 	   		  



More information about the Pacemaker mailing list