[Pacemaker] stonith resource issue

Shravan Mishra shravan.mishra at gmail.com
Wed Oct 6 11:04:34 EDT 2010


Hi guys,

I'm having a weird problem with my stonith resources. They are
constantly starting and stopping.

I'm using:

pacemaker=1.1.3
corosync=1.2.8
glue=glue_1.0-10

kernel=2.6.29.6-0.6.smp.gcc4.1.x86_64

My configuration looks like this:

=======================
node ha1.itactics.com
node ha2.itactics.com
primitive ha1.itactics.com-stonith stonith:external/safe/ipmi \
	op monitor interval="20" timeout="180" \
	params target_role="started" hostname="ha1.itactics.com" ipaddr="192.168.2.3"
primitive ha2.itactics.com-stonith stonith:external/safe/ipmi \
	op monitor interval="20" timeout="180" \
	params target_role="started" hostname="ha2.itactics.com" ipaddr="192.168.2.7"
location ha1.itactics.com-stonith-placement ha1.itactics.com-stonith \
	rule $id="ri-ha1.itactics.com-stonith-placement-1" -inf: #uname eq ha1.itactics.com
location ha2.itactics.com-stonith-placement ha2.itactics.com-stonith \
	rule $id="ri-ha2.itactics.com-stonith-placement-1" -inf: #uname eq ha2.itactics.com
property $id="cib-bootstrap-options" \
	dc-version="1.1.2-e0d731c2b1be446b27a73327a53067bf6230fb6a" \
	cluster-infrastructure="openais" \
	expected-quorum-votes="2" \
	stonith-enabled="true"
=========================

An excerpt from /var/log/messages on ha1:
==========================
Oct  6 11:00:02 ha1 lrmd: [5994]: info:
rsc:ha2.itactics.com-stonith:1150: monitor
Oct  6 11:00:03 ha1 lrmd: [5994]: info: cancel_op: operation
monitor[1150] on stonith::external/safe/ipmi::ha2.itactics.com-stonith
for client 5997, its parameters: CRM_meta_interval=[20000]
target_role=[started] ipaddr=[192.168.2.7] CRM_meta_timeout=[180000]
crm_feature_set=[3.0.2] CRM_meta_name=[monitor]
hostname=[ha2.itactics.com]  cancelled
Oct  6 11:00:03 ha1 lrmd: [5994]: info: rsc:ha2.itactics.com-stonith:1151: stop
Oct  6 11:00:03 ha1 lrmd: [5994]: info: rsc:ha2.itactics.com-stonith:1152: start
Oct  6 11:00:03 ha1 lrmd: [5994]: info:
rsc:ha2.itactics.com-stonith:1153: monitor
Oct  6 11:00:04 ha1 lrmd: [5994]: info: cancel_op: operation
monitor[1153] on stonith::external/safe/ipmi::ha2.itactics.com-stonith
for client 5997, its parameters: CRM_meta_interval=[20000]
target_role=[started] ipaddr=[192.168.2.7] CRM_meta_timeout=[180000]
crm_feature_set=[3.0.2] CRM_meta_name=[monitor]
hostname=[ha2.itactics.com]  cancelled
Oct  6 11:00:04 ha1 lrmd: [5994]: info: rsc:ha2.itactics.com-stonith:1154: stop
Oct  6 11:00:04 ha1 lrmd: [5994]: info: rsc:ha2.itactics.com-stonith:1155: start
Oct  6 11:00:04 ha1 lrmd: [5994]: info:
rsc:ha2.itactics.com-stonith:1156: monitor
Oct  6 11:00:06 ha1 lrmd: [5994]: info: cancel_op: operation
monitor[1156] on stonith::external/safe/ipmi::ha2.itactics.com-stonith
for client 5997, its parameters: CRM_meta_interval=[20000]
target_role=[started] ipaddr=[192.168.2.7] CRM_meta_timeout=[180000]
crm_feature_set=[3.0.2] CRM_meta_name=[monitor]
hostname=[ha2.itactics.com]  cancelled
Oct  6 11:00:06 ha1 lrmd: [5994]: info: rsc:ha2.itactics.com-stonith:1157: stop
Oct  6 11:00:06 ha1 lrmd: [5994]: info: rsc:ha2.itactics.com-stonith:1158: start
Oct  6 11:00:06 ha1 lrmd: [5994]: info:
rsc:ha2.itactics.com-stonith:1159: monitor
Oct  6 11:00:07 ha1 lrmd: [5994]: info: cancel_op: operation
monitor[1159] on stonith::external/safe/ipmi::ha2.itactics.com-stonith
for client 5997, its parameters: CRM_meta_interval=[20000]
target_role=[started] ipaddr=[192.168.2.7] CRM_meta_timeout=[180000]
crm_feature_set=[3.0.2] CRM_meta_name=[monitor]
hostname=[ha2.itactics.com]  cancelled
Oct  6 11:00:07 ha1 lrmd: [5994]: info: rsc:ha2.itactics.com-stonith:1160: stop
Oct  6 11:00:08 ha1 lrmd: [5994]: info: rsc:ha2.itactics.com-stonith:1161: start
Oct  6 11:00:08 ha1 lrmd: [5994]: info:
rsc:ha2.itactics.com-stonith:1162: monitor
Oct  6 11:00:09 ha1 lrmd: [5994]: info: cancel_op: operation
monitor[1162] on stonith::external/safe/ipmi::ha2.itactics.com-stonith
for client 5997, its parameters: CRM_meta_interval=[20000]
target_role=[started] ipaddr=[192.168.2.7] CRM_meta_timeout=[180000]
crm_feature_set=[3.0.2] CRM_meta_name=[monitor]
hostname=[ha2.itactics.com]  cancelled
Oct  6 11:00:09 ha1 lrmd: [5994]: info: rsc:ha2.itactics.com-stonith:1163: stop
Oct  6 11:00:09 ha1 lrmd: [5994]: info: rsc:ha2.itactics.com-stonith:1164: start
Oct  6 11:00:12 ha1 lrmd: [5994]: info: stonithRA plugin: got
metadata: <?xml version="1.0"?> <!DOCTYPE resource-agent SYSTEM
"ra-api-1.dtd"> <resource-agent name="external/safe/ipmi">
<version>1.0</version>   <longdesc lang="en"> <!-- no value -->
 </longdesc>   <shortdesc lang="en"><!-- no value
--></shortdesc> <!-- no value -->   <actions>     <action
name="start"   timeout="15" />     <action name="stop"    timeout="15"
/>     <action name="status"  timeout="15" />     <action
name="monitor" timeout="15" interval="15" start-delay="15" />
<action name="meta-data"  timeout="15" />   </actions>   <special
tag="heartbeat">     <version>2.0</version>   </special>
</resource-agent>
Oct  6 11:00:12 ha1 lrmd: [5994]: info: G_SIG_dispatch: started at
1726679412 should have started at 1726679110
Oct  6 11:00:12 ha1 lrmd: [5994]: info:
rsc:ha2.itactics.com-stonith:1165: monitor
Oct  6 11:00:13 ha1 lrmd: [5994]: info: cancel_op: operation
monitor[1165] on stonith::external/safe/ipmi::ha2.itactics.com-stonith
for client 5997, its parameters: CRM_meta_interval=[20000]
target_role=[started] ipaddr=[192.168.2.7] CRM_meta_timeout=[180000]
crm_feature_set=[3.0.2] CRM_meta_name=[monitor]
hostname=[ha2.itactics.com]  cancelled
Oct  6 11:00:13 ha1 lrmd: [5994]: info: rsc:ha2.itactics.com-stonith:1166: stop
Oct  6 11:00:13 ha1 lrmd: [5994]: info: rsc:ha2.itactics.com-stonith:1167: start
Oct  6 11:00:13 ha1 lrmd: [5994]: info:
rsc:ha2.itactics.com-stonith:1168: monitor

===========================
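
To put a number on how often the resource is being cycled, here is a minimal Python sketch that tallies the lrmd operations in an excerpt like the one above. The names `OP_RE` and `count_ops` are hypothetical helpers written for illustration and are not part of Pacemaker or cluster-glue; the pattern assumes the `rsc:<resource>:<call-id>: <operation>` form seen in these log lines.

```python
import re
from collections import Counter

# Matches lrmd entries such as "rsc:ha2.itactics.com-stonith:1151: stop".
# Assumption: resource IDs contain no colons or whitespace.
OP_RE = re.compile(r"rsc:([^:\s]+):(\d+):\s+(start|stop|monitor)\b")

def count_ops(log_text):
    """Return a Counter keyed by (resource, operation)."""
    return Counter((m.group(1), m.group(3)) for m in OP_RE.finditer(log_text))

# A few lines copied from the excerpt above, including the mail-wrapped ones.
sample = """\
Oct  6 11:00:02 ha1 lrmd: [5994]: info:
rsc:ha2.itactics.com-stonith:1150: monitor
Oct  6 11:00:03 ha1 lrmd: [5994]: info: rsc:ha2.itactics.com-stonith:1151: stop
Oct  6 11:00:03 ha1 lrmd: [5994]: info: rsc:ha2.itactics.com-stonith:1152: start
Oct  6 11:00:03 ha1 lrmd: [5994]: info:
rsc:ha2.itactics.com-stonith:1153: monitor
"""

counts = count_ops(sample)
for (rsc, op), n in sorted(counts.items()):
    print(rsc, op, n)
```

Running the same function over the full /var/log/messages text would show how many stop/start pairs occur for each stonith resource in a given window.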


I'm on the critical path for this release, so I would really
appreciate some quick help with this.



Thanks a lot.


Shravan



