hello<br><br>I run pacemaker on ubuntu (Ubuntu 10.04.1 LTS) with corosync, i installed it from apt, and my pacemaker version is:<br><br><div style="margin-left: 40px;">root@storage0:/var/log# dpkg -l | grep 'pacemaker'<br>
ii pacemaker 1.0.8+hg15494-2ubuntu2 HA cluster resource manager<br></div><br><br>and have follow problem with pacemaker, with follow configration:<br><div style="margin-left: 40px;">root@storage0:/var/log# crm configure show<br>
node storage0<br>node storage1<br>primitive drbd_web ocf:linbit:drbd \<br> params drbd_resource="web" \<br> op monitor interval="10s" timeout="60s"<br>primitive iscsi_ip ocf:heartbeat:IPaddr2 \<br>
params ip="192.168.17.19" nic="eth1:1" cidr_netmask="24" \<br> op monitor interval="10s" \<br> meta target-role="Started"<br>primitive iscsi_web_target ocf:heartbeat:iSCSITarget \<br>
params iqn="iqn.2010-06.playrix.local:san.web" implementation="iet" \<br> op monitor interval="10s" timeout="30s" depth="0" \<br> meta target-role="Started"<br>
primitive iscsi_web_target_lun1 ocf:heartbeat:iSCSILogicalUnit \<br> params lun="1" path="/dev/drbd1" target_iqn="iqn.2010-06.playrix.local:san.web" implementation="iet" \<br>
op monitor interval="10s" timeout="30s"<br>group iscsi iscsi_ip iscsi_web_target iscsi_web_target_lun1<br>ms ms_drbd_web drbd_web \<br> meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true"<br>
colocation iscsi_on_drbd inf: ms_drbd_web:Master iscsi<br>order iscsi_target_after_drbd inf: ms_drbd_web:promote iscsi_web_target<br>order iscsi_target_lun_after_iscsi_target inf: iscsi_web_target iscsi_web_target_lun1<br>
property $id="cib-bootstrap-options" \<br> dc-version="1.0.8-042548a451fce8400660f6031f4da6f0223dd5dd" \<br> cluster-infrastructure="openais" \<br> expected-quorum-votes="2" \<br>
stonith-enabled="false" \<br> no-quorum-policy="ignore"<br>rsc_defaults $id="rsc-options" \<br> resource-stickiness="100"<br></div><br><br>When i shutdown node storage1, node storage0 doesn't accept Master drbd role, so output from crm_mon -1 lokks like this:<br>
<div style="margin-left: 40px;">============<br>Last updated: Mon Dec 6 15:04:18 2010<br>Stack: openais<br>Current DC: storage0 - partition WITHOUT quorum<br>Version: 1.0.8-042548a451fce8400660f6031f4da6f0223dd5dd<br>2 Nodes configured, 2 expected votes<br>
2 Resources configured.<br>============<br><br>Online: [ storage0 ]<br>OFFLINE: [ storage1 ]<br><br> Master/Slave Set: ms_drbd_web<br> Slaves: [ storage0 ]<br> Stopped: [ drbd_web:1 ]<br> Resource Group: iscsi<br>
iscsi_ip (ocf::heartbeat:IPaddr2): Started storage0<br> iscsi_web_target (ocf::heartbeat:iSCSITarget): Started storage0<br> iscsi_web_target_lun1 (ocf::heartbeat:iSCSILogicalUnit): Started storage0 FAILED<br>
<br>Failed actions:<br> iscsi_web_target_lun1_start_0 (node=storage0, call=91, rc=1, status=complete): unknown error<br><br></div><br>and when try to promote node got folow error:<br><div style="margin-left: 40px;">crm(live)resource# promote ms_drbd_web<br>
Error performing operation: Remote node did not respond<br><br></div><br>and periodicaly in /var/log/messages, i see folow error:<br><div style="margin-left: 40px;">Dec 6 14:49:35 storage0 kernel: [ 5048.618562] pengine[8584]: segfault at 8 ip b76ad094 sp bf8261d0 error 4 in libpengine.so.3.0.0[b76a2000+32000]<br>
Dec 6 14:50:37 storage0 kernel: [ 5111.505491] pengine[8681]: segfault at 0 ip b7831ef3 sp bfd28b30 error 4 in libpengine.so.3.0.0[b7821000+32000]<br>Dec 6 14:51:41 storage0 kernel: [ 5174.746349] pengine[8770]: segfault at 8 ip b7751094 sp bfe1ccb0 error 4 in libpengine.so.3.0.0[b7746000+32000]<br>
</div><br><br><br>Why pacemacker doesn't switch role of live node to master? And why segfault happens? <br>Please help<br>