[Pacemaker] Bug? failed to stonith with fence_ipmilan on CentOS6.2

Xiaomin Zhang zhangxiaomin at gmail.com
Tue Oct 8 17:09:56 EDT 2013


Hi:
I configure stonith on CentOS 6.2 with fence-ipmilan agent:
primitive node2-stonith stonith:fence_ipmilan \
        params pcmk_host_list="node2" pcmk_host_check="static-list"
ipaddr="192.168.170.1" login="root" passwd="123" lanplus="false"
power_wait="1"

The IPaddr for IPMI and credentials are verified to be correct with raw
ipmitool command.

While I test the stonith, I just found that the node1-stonith seem not
working at all, and I also found some strange log on another node which is
expected to kill node1:

Oct  9 04:39:05 node1 stonith-ng: [3705]: info: stonith_fence: Exec
<stonith_command t="stonith-ng"
st_async_id="4ca92d0e-9a2a-4fdd-8968-c91eb89e8cbe" st_op="st_fence"
st_callid="0" st_callopt="0"
st_remote_op="4ca92d0e-9a2a-4fdd-8968-c91eb89e8cbe" st_target="node2"
st_device_action="reboot" st_timeout="54000" src="node3" seq="12" />
Oct  9 04:39:05 node1 stonith-ng: [3705]: info: can_fence_host_with_device:
node2-stonith can fence node2: static-list
Oct  9 04:39:05 node1 stonith-ng: [3705]: info: stonith_fence: Found 1
matching devices for 'node2'
Oct  9 04:39:05 node1 stonith-ng: [3705]: info: stonith_command: Processed
st_fence from node3: rc=-1
Oct  9 04:39:05 node1 stonith-ng: [3705]: info: make_args: reboot-ing node
'node2' as 'port=node2'
Oct  9 04:39:05 node1 crmd: [3710]: info: send_direct_ack: ACK'ing resource
op drbd_hadoop:1_notify_0 from 77:4:0:ee8de687-92c9-4123-8efb-befd45814a3b:
lrm_invoke-lrmd-1381264745-30
Oct  9 04:39:05 node1 crmd: [3710]: info: process_lrm_event: LRM operation
drbd_hadoop:1_notify_0 (call=20, rc=0, cib-update=0, confirmed=true) ok
Oct  9 04:39:05 node1 stonith-ng: [3705]: ERROR: log_operation: Operation
'reboot' [22346] (call 0 from (null)) for host 'node2' with device
'node2-stonith' returned: -2
Oct  9 04:39:05 node1 stonith-ng: [3705]: ERROR: log_operation:
node2-stonith: Rebooting machine @ IPMI:192.168.170.1...Failed

The log shows that stonith failed with return value (-2). However, what
does this mean? Is there any configure issue?
Thanks.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.clusterlabs.org/pipermail/pacemaker/attachments/20131009/87744d27/attachment-0002.html>


More information about the Pacemaker mailing list