<div dir="ltr">Guys, can anyone help please?</div><div class="gmail_extra"><br><br><div class="gmail_quote">2014-05-30 11:32 GMT+03:00 Виталий Туровец <span dir="ltr"><<a href="mailto:corebug@corebug.net" target="_blank">corebug@corebug.net</a>></span>:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr">Hello there, people!<div>I am new to this list, so please excuse me if i'm posting to the wrong place.</div>
<div><br></div><div>I've got a pacemaker cluster with such a configuration: <a href="http://pastebin.com/1SbWWh4n" target="_blank">http://pastebin.com/1SbWWh4n</a>.</div>
<div><br></div><div>Output of "crm status":</div><div><div>============</div><div>Last updated: Fri May 30 11:22:59 2014</div><div>Last change: Thu May 29 03:22:38 2014 via crmd on wb-db2</div><div>Stack: openais</div>
<div>Current DC: wb-db2 - partition with quorum</div><div>Version: 1.1.7-6.el6-148fccfd5985c5590cc601123c6c16e966b85d14</div><div>2 Nodes configured, 2 expected votes</div><div>7 Resources configured.</div><div>============</div>
<div><br></div><div>Online: [ wb-db2 wb-db1 ]</div><div><br></div><div> ClusterIP (ocf::heartbeat:IPaddr2): Started wb-db2</div><div> MySQL_Reader_VIP (ocf::heartbeat:IPaddr2): Started wb-db2</div>
<div>
resMON (ocf::pacemaker:ClusterMon): Started wb-db2</div><div> Master/Slave Set: MySQL_MasterSlave [MySQL]</div><div> Masters: [ wb-db2 ]</div><div> Stopped: [ MySQL:1 ]</div><div> Clone Set: pingclone [ping-gateway]</div>
<div> Started: [ wb-db1 wb-db2 ]</div><div><br></div><div>There was an unclean shutdown of a cluster and after that i've faced a problem that a slave of MySQL_MasterSlave resource does not come up.</div><div>When i try to do a "cleanup MySQL_MasterSlave" i see such thing in logs:</div>
<div><br></div><div><div>May 29 03:22:22 [4423] wb-db1 crmd: warning: decode_transition_key: Bad UUID (crm-resource-4819) in sscanf result (3) for 0:0:crm-resource-4819 </div><div>May 29 03:22:22 [4423] wb-db1 crmd: warning: decode_transition_key: Bad UUID (crm-resource-4819) in sscanf result (3) for 0:0:crm-resource-4819 </div>
<div>May 29 03:22:22 [4423] wb-db1 crmd: info: ais_dispatch_message: Membership 408: quorum retained </div><div>May 29 03:22:22 [4418] wb-db1 cib: info: set_crm_log_level: New log level: 3 0 </div>
<div>May 29 03:22:38 [4421] wb-db1 attrd: notice: attrd_ais_dispatch: Update relayed from wb-db2 </div><div>May 29 03:22:38 [4421] wb-db1 attrd: notice: attrd_ais_dispatch: Update relayed from wb-db2 </div>
<div>May 29 03:22:38 [4418] wb-db1 cib: info: apply_xml_diff: Digest mis-match: expected 2f5bc3d7f673df3cf37f774211976d69, calculated b8a7adf0e34966242551556aab605286 </div><div>May 29 03:22:38 [4418] wb-db1 cib: notice: cib_process_diff: Diff 0.243.4 -> 0.243.5 not applied to 0.243.4: Failed application of an update diff </div>
<div>May 29 03:22:38 [4418] wb-db1 cib: info: cib_server_process_diff: Requesting re-sync from peer </div><div>May 29 03:22:38 [4418] wb-db1 cib: notice: cib_server_process_diff: Not applying diff 0.243.4 -> 0.243.5 (sync in progress) </div>
<div>May 29 03:22:38 [4418] wb-db1 cib: info: cib_replace_notify: Replaced: -1.-1.-1 -> 0.243.5 from wb-db2 </div><div>May 29 03:22:38 [4421] wb-db1 attrd: notice: attrd_trigger_update: Sending flush op to all hosts for: pingd (100) </div>
<div>May 29 03:22:38 [4421] wb-db1 attrd: notice: attrd_trigger_update: Sending flush op to all hosts for: probe_complete (true) </div><div>May 29 03:22:38 [4418] wb-db1 cib: info: set_crm_log_level: New log level: 3 0 </div>
<div>May 29 03:22:38 [4418] wb-db1 cib: info: apply_xml_diff: Digest mis-match: expected 754ed3b1d999e34d93e0835b310fd98a, calculated c322686deb255936ab54e064c696b6b8 </div><div>May 29 03:22:38 [4418] wb-db1 cib: notice: cib_process_diff: Diff 0.244.5 -> 0.244.6 not applied to 0.244.5: Failed application of an update diff </div>
<div>May 29 03:22:38 [4418] wb-db1 cib: info: cib_server_process_diff: Requesting re-sync from peer </div><div>May 29 03:22:38 [4423] wb-db1 crmd: info: delete_resource: Removing resource MySQL:0 for 4996_crm_resource (internal) on wb-db2 </div>
<div>May 29 03:22:38 [4423] wb-db1 crmd: info: notify_deleted: Notifying 4996_crm_resource on wb-db2 that MySQL:0 was deleted </div><div>May 29 03:22:38 [4418] wb-db1 cib: notice: cib_server_process_diff: Not applying diff 0.244.5 -> 0.244.6 (sync in progress) </div>
<div>May 29 03:22:38 [4423] wb-db1 crmd: warning: decode_transition_key: Bad UUID (crm-resource-4996) in sscanf result (3) for 0:0:crm-resource-4996 </div><div>May 29 03:22:38 [4418] wb-db1 cib: notice: cib_server_process_diff: Not applying diff 0.244.6 -> 0.244.7 (sync in progress) </div>
<div>May 29 03:22:38 [4418] wb-db1 cib: notice: cib_server_process_diff: Not applying diff 0.244.7 -> 0.244.8 (sync in progress) </div><div>May 29 03:22:38 [4418] wb-db1 cib: info: cib_replace_notify: Replaced: -1.-1.-1 -> 0.244.8 from wb-db2 </div>
<div>May 29 03:22:38 [4421] wb-db1 attrd: notice: attrd_trigger_update: Sending flush op to all hosts for: pingd (100) </div><div>May 29 03:22:38 [4421] wb-db1 attrd: notice: attrd_trigger_update: Sending flush op to all hosts for: probe_complete (true) </div>
<div>May 29 03:22:38 [4423] wb-db1 crmd: notice: do_lrm_invoke: Not creating resource for a delete event: (null) </div><div>May 29 03:22:38 [4423] wb-db1 crmd: info: notify_deleted: Notifying 4996_crm_resource on wb-db2 that MySQL:1 was deleted </div>
<div>May 29 03:22:38 [4423] wb-db1 crmd: warning: decode_transition_key: Bad UUID (crm-resource-4996) in sscanf result (3) for 0:0:crm-resource-4996 </div><div>May 29 03:22:38 [4423] wb-db1 crmd: warning: decode_transition_key: Bad UUID (crm-resource-4996) in sscanf result (3) for 0:0:crm-resource-4996 </div>
<div>May 29 03:22:38 [4418] wb-db1 cib: info: set_crm_log_level: New log level: 3 0 </div><div>May 29 03:22:38 [4423] wb-db1 crmd: info: ais_dispatch_message: Membership 408: quorum retained </div>

Here is the cibadmin -Q output from the node that is alive: http://pastebin.com/aeqfTaCe
And here is the output from the failed node: http://pastebin.com/ME2U5vjK

The question is: how do I clean things up so that the master/slave resource MySQL_MasterSlave starts working properly again?

Thank you!

--
~~~
WBR,
Vitaliy Turovets
Lead Operations Engineer
Global Message Services Ukraine
+38(093)265-70-55
VITU-RIPE