<div dir="ltr"><div><div>Hi,<br><br>I hope I've got the right list, I'm still a little confused about where CMAN ends and Pacemaker begins! We're using Pacemaker and CMAN on Oracle 6.3 to try and create an active/standby failover pair, but seem to have some annoying conditions that are making this tricky.<br>
<br></div><div>We hae 2 nodes, vfontopensips1 and vfontopensips2 that we want a VIP to float between based on the availability of a single daemon we have called OSP<br></div><div><br>We have a daemon, called OSP that we want running at all times on both nodes, we dont want Pacemaker to stop/start this so I believe the correct thing to do is configure it as unmanaged?<br>
<br></div>We have one virtual IP that we want on one of the nodes running the OSP daemon.<br><br></div><div>The ideal condition is vfontopensips1 has an instance of OSP and the VIP on it and vfontopensips2 has a running instance of OSP on it. If OSP dies or fails on vfontopensips1 we want the VIP to move to vfontopensips2 immediately, we wont want Pacemaker/CMAN to try and restart it.<br>
<br>If OSP is then restarted/fixed manually on vfontopensips1 we'd like Pacemaker/CMAN to detect that monitor events are now working and mark the node as available but to not move the VIP back to it unless there's a a failure on vfontopensips2.<br>
<br></div><div>Here's the output of crm configure show<br><br>node vfontopensips1<br>node vfontopensips2<br>primitive ClusterIPPres ocf:heartbeat:IPaddr2 \<br> params ip="10.30.0.176" cidr_netmask="32" \<br>
op monitor interval="5s"<br>primitive osp ocf:netdev:osp \<br> params interval="1s" \<br> op monitor interval="5s" \<br> meta is-managed="false" migration-threshold="1" on-fail="standby"<br>
colocation osp-with-ip 200: osp ClusterIPPres<br>property $id="cib-bootstrap-options" \<br> dc-version="1.1.8-7.el6-394e906" \<br> cluster-infrastructure="cman" \<br> stonith-enabled="false" \<br>
no-quorum-policy="ignore" \<br> last-lrm-refresh="1365509847"<br>rsc_defaults $id="rsc-options" \<br> resource-stickiness="100" \<br> migration-threshold="1" \<br>
allow-migrate="true" \<br> failure-timeout="5s"<br><br></div><div>I've attached the OSP OCF file to this email.<br><br></div><div>This is the output of crm_mon when both OSP instances are up and vfontopensips1 has the VIP<br>
<br>*********************<br><br>Last updated: Wed Apr 10 10:21:11 2013<br>Last change: Tue Apr 9 16:39:00 2013 via cibadmin on vfontopensips1<br>Stack: cman<br>Current DC: vfontopensips1 - partition with quorum<br>Version: 1.1.8-7.el6-394e906<br>
2 Nodes configured, unknown expected votes<br>2 Resources configured.<br><br><br>Online: [ vfontopensips1 vfontopensips2 ]<br><br>ClusterIPPres (ocf::heartbeat:IPaddr2): Started vfontopensips1<br>osp (ocf::netdev:osp): Started (unmanaged) [ vfontopensips1 vfontopensips2 ]<br>
<br>*********************<br><br></div><div>If OSP fails vfontopensips1 loses the VIP but the VIP doesnt move to vfontopensips2 .. crm_mon outputs<br><br>*********************<br><br>Last updated: Wed Apr 10 10:22:20 2013<br>
Last change: Tue Apr 9 16:39:00 2013 via cibadmin on vfontopensips1<br>Stack: cman<br>Current DC: vfontopensips1 - partition with quorum<br>Version: 1.1.8-7.el6-394e906<br>2 Nodes configured, unknown expected votes<br>2 Resources configured.<br>
<br><br>Online: [ vfontopensips1 vfontopensips2 ]<br><br>osp (ocf::netdev:osp): Started (unmanaged) FAILED [ vfontopensips1 vfontopensips2 ]<br><br>Failed actions:<br> osp_monitor_5000 (node=vfontopensips1, call=96, rc=7, status=complete): not running<br>
ClusterIPPres_migrate_to_0 (node=vfontopensips1, call=161, rc=3, status=complete): unimplemented feature<br> ClusterIPPres_migrate_from_0 (node=vfontopensips2, call=138, rc=3, status=complete): unimplemented feature<br>
<br>*********************<br><br></div><div>If we fix OSP the VIP then comes back up on vfontopensips1<br><br>*********************<br><br>Last updated: Wed Apr 10 10:23:26 2013<br>Last change: Tue Apr 9 16:39:00 2013 via cibadmin on vfontopensips1<br>
Stack: cman<br>Current DC: vfontopensips1 - partition with quorum<br>Version: 1.1.8-7.el6-394e906<br>2 Nodes configured, unknown expected votes<br>2 Resources configured.<br><br><br>Online: [ vfontopensips1 vfontopensips2 ]<br>
<br>ClusterIPPres (ocf::heartbeat:IPaddr2): Started vfontopensips1<br>osp (ocf::netdev:osp): Started (unmanaged) [ vfontopensips1 vfontopensips2 ]<br><br></div><div>*********************</div><div><br></div><div>
Obviously this isn't the behaviour I'm after as OSP is up and available on vfontopensips2 so we'd like the VIP to move there..<br><br></div><div>Any hints would be great please as this has been confusing me for a few days now!<br>
<br></div><div>The versions we are using are:<br><br>cman-3.0.12.1-49.el6.x86_64<br>pacemaker-libs-1.1.8-7.el6.x86_64<br>pacemaker-1.1.8-7.el6.x86_64<br>pacemaker-cluster-libs-1.1.8-7.el6.x86_64<br>pacemaker-cli-1.1.8-7.el6.x86_64<br>
<br>Linux vfontopensips1 2.6.32-279.el6.x86_64 #1 SMP Thu Jun 21 15:00:18 EDT 2012 x86_64 x86_64 x86_64 GNU/Linux<br><br></div><div><br></div><div>Cheers,<br><br>Rus<br></div></div>