Hi! Thanks for your reply!<br><br>I make some mistakes in configuration, and now i have againt segfault on lastest sources(pacemaker-1-0_b0266dd5ffa9) :<br><br><div style="margin-left: 40px;">Dec 9 16:51:29 storage0 kernel: [ 407.923417] pengine[891]: segfault at 8 ip b77289b8 sp bfe38120 error 4 in libpengine.so.3.0.0[b771d000+33000]<br>
Dec 9 16:52:32 storage0 kernel: [ 470.943739] pengine[958]: segfault at 8 ip b78479b8 sp bff05cd0 error 4 in libpengine.so.3.0.0[b783c000+33000]<br>Dec 9 16:53:35 storage0 kernel: [ 534.044403] pengine[962]: segfault at 8 ip b77fc9b8 sp bfd1fd50 error 4 in libpengine.so.3.0.0[b77f1000+33000]<br>
</div><br>i do foollow:<br> order o1 inf: ms_drbd_web iscsi<br><br>After that, pacemaker go to the segfault on one node(in my case storage1). As i understated pacemaker try to commit bad changes and fault, how can i discard this changes?<br>
<br><br><br><div class="gmail_quote">2010/12/9 Andrew Beekhof <span dir="ltr"><<a href="mailto:andrew@beekhof.net">andrew@beekhof.net</a>></span><br><blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">
<div><div></div><div class="h5">On Wed, Dec 8, 2010 at 12:26 PM, ruslan usifov <<a href="mailto:ruslan.usifov@gmail.com">ruslan.usifov@gmail.com</a>> wrote:<br>
> hello<br>
><br>
> I have 2 node cluster with follow conf:<br>
> node storage0<br>
> node storage1<br>
> primitive drbd_web ocf:linbit:drbd \<br>
> params drbd_resource="web" \<br>
> op monitor interval="30s" timeout="60s"<br>
> primitive iscsi_ip ocf:heartbeat:IPaddr2 \<br>
> params ip="192.168.17.19" nic="eth1:1" cidr_netmask="24" \<br>
> op monitor interval="10s" \<br>
> meta target-role="Started"<br>
> primitive iscsi_web_target ocf:heartbeat:iSCSITarget \<br>
> params iqn="iqn.2010-06.playrix.local:san.web" implementation="iet"<br>
> \<br>
> op monitor interval="10s" timeout="30s" depth="0" \<br>
> meta target-role="Started"<br>
> primitive iscsi_web_target_lun1 ocf:heartbeat:iSCSILogicalUnit \<br>
> params lun="1" path="/dev/drbd1"<br>
> target_iqn="iqn.2010-06.playrix.local:san.web" implementation="iet" \<br>
> op monitor interval="10s" timeout="30s"<br>
> group iscsi iscsi_ip iscsi_web_target iscsi_web_target_lun1<br>
> ms ms_drbd_web drbd_web \<br>
> meta master-max="1" master-node-max="1" clone-max="2"<br>
> clone-node-max="1" notify="true" target-role="Started"<br>
> colocation iscsi_on_drbd inf: ms_drbd_web:Master iscsi<br>
> order iscsi_target_after_drbd inf: ms_drbd_web:promote iscsi_web_target<br>
> order iscsi_target_lun_after_iscsi_target inf: iscsi_web_target<br>
> iscsi_web_target_lun1<br>
> property $id="cib-bootstrap-options" \<br>
> dc-version="1.0.10-b0266dd5ffa9c51377c68b1f29d6bc84367f51dd" \<br>
> cluster-infrastructure="openais" \<br>
> expected-quorum-votes="2" \<br>
> stonith-enabled="false" \<br>
> no-quorum-policy="ignore"<br>
> rsc_defaults $id="rsc-options" \<br>
> resource-stickiness="100"<br>
><br>
><br>
> after some throbles with pacemaker(segfault in older version in ubuntu) I<br>
> can not get to work ms_drbd_web. It always show only slaves status:<br>
<br>
</div></div>This says only promote drbd where iscsi group is running:<br>
<div class="im"> colocation iscsi_on_drbd inf: ms_drbd_web:Master iscsi<br>
<br>
</div>And since its only partially active, drbd wont be made a master.<br>
Perhaps you want:<br>
colocation iscsi_on_drbd inf: iscsi ms_drbd_web:Master<br>
<div><div></div><div class="h5"><br>
> ============<br>
> Last updated: Tue Dec 7 14:00:13 2010<br>
> Stack: openais<br>
> Current DC: storage0 - partition with quorum<br>
> Version: 1.0.10-b0266dd5ffa9c51377c68b1f29d6bc84367f51dd<br>
> 2 Nodes configured, 2 expected votes<br>
> 2 Resources configured.<br>
> ============<br>
><br>
> Online: [ storage1 storage0 ]<br>
><br>
> Master/Slave Set: ms_drbd_web<br>
> Slaves: [ storage0 storage1 ]<br>
> Resource Group: iscsi<br>
> iscsi_ip (ocf::heartbeat:IPaddr2): Started storage1<br>
> iscsi_web_target (ocf::heartbeat:iSCSITarget): Started storage1<br>
> iscsi_web_target_lun1 (ocf::heartbeat:iSCSILogicalUnit):<br>
> Stopped<br>
><br>
> Failed actions:<br>
> iscsi_web_target_lun1_monitor_0 (node=storage0, call=5, rc=5,<br>
> status=complete): not installed<br>
> iscsi_web_target_monitor_0 (node=storage0, call=4, rc=5,<br>
> status=complete): not installed<br>
> iscsi_web_target_lun1_start_0 (node=storage1, call=13, rc=1,<br>
> status=complete): unknown error<br>
><br>
> but no split brain situation(nothing about in logs)<br>
> If i do master selection myself,<br>
><br>
> drbdadm -- --overwrite-data-of-peer primary all<br>
><br>
> pacemaker still switch move to [Slave, Slave]<br>
><br>
> I found follow errors in my corosync.log:<br>
><br>
> ERROR: clone_rsc_order_rh_non_clone: Unknown action:<br>
> iscsi_web_target_demote_0<br>
><br>
><br>
> What i do wrong, and how can i restore drdb to work?<br>
><br>
</div></div>> _______________________________________________<br>
> Pacemaker mailing list: <a href="mailto:Pacemaker@oss.clusterlabs.org">Pacemaker@oss.clusterlabs.org</a><br>
> <a href="http://oss.clusterlabs.org/mailman/listinfo/pacemaker" target="_blank">http://oss.clusterlabs.org/mailman/listinfo/pacemaker</a><br>
><br>
> Project Home: <a href="http://www.clusterlabs.org" target="_blank">http://www.clusterlabs.org</a><br>
> Getting started: <a href="http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf" target="_blank">http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf</a><br>
> Bugs:<br>
> <a href="http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker" target="_blank">http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker</a><br>
><br>
><br>
<br>
_______________________________________________<br>
Pacemaker mailing list: <a href="mailto:Pacemaker@oss.clusterlabs.org">Pacemaker@oss.clusterlabs.org</a><br>
<a href="http://oss.clusterlabs.org/mailman/listinfo/pacemaker" target="_blank">http://oss.clusterlabs.org/mailman/listinfo/pacemaker</a><br>
<br>
Project Home: <a href="http://www.clusterlabs.org" target="_blank">http://www.clusterlabs.org</a><br>
Getting started: <a href="http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf" target="_blank">http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf</a><br>
Bugs: <a href="http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker" target="_blank">http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker</a><br>
</blockquote></div><br>