[ClusterLabs] fence_scsi won't start
Duncan Mortimer
duncan.mortimer at ndcn.ox.ac.uk
Mon Apr 27 10:18:08 UTC 2015
Hi,
I’m trying to setup a simple two host Apache cluster on Centos 7 using a shared iSCSI backing store (hosted on a third Centos 7 machine).
Following several how tos I have created two iSCSI devices, one for the shared filesystem and one for fencing. I have created /var/target/pr (for persistent reservation support) and confirmed that both cluster hosts can see both iSCSI targets (both read and write).
I then built the cluster with:
$ pcs cluster auth wwwa wwwb
Username: …….
Password: ……..
$ pcs cluster setup --start --name web-ha wwwa wwwb
Shutting down pacemaker/corosync services...
Redirecting to /bin/systemctl stop pacemaker.service
Redirecting to /bin/systemctl stop corosync.service
Killing any remaining services...
Removing all cluster configuration files...
wwwa: Succeeded
wwwb: Succeeded
Starting cluster on nodes: wwwa, wwwb...
wwwb: Starting Cluster...
wwwa: Starting Cluster...
$ pcs cluster enable --all
wwwa: Cluster Enabled
wwwb: Cluster Enabled
and add the stonith device…
$ pcs stonith create iscsi-stonith-device fence_scsi devices=/dev/disk/by-id/wwn-0x6001405d8aef6fc57c4428b8dbf646b2 meta provides=unfencing
Cluster name: web-ha
Last updated: Mon Apr 27 10:27:13 2015
Last change: Mon Apr 27 10:27:05 2015
Stack: corosync
Current DC: wwwa (1) - partition with quorum
Version: 1.1.12-a14efad
2 Nodes configured
1 Resources configured
Online: [ wwwa wwwb ]
Full list of resources:
iscsi-stonith-device (stonith:fence_scsi): Stopped
PCSD Status:
wwwa: Online
wwwb: Online
Daemon Status:
corosync: active/enabled
pacemaker: active/enabled
pcsd: active/enabled
The STONITH device never starts.
$ pcs stonith show iscsi-stonith-device
Resource: iscsi-stonith-device (class=stonith type=fence_scsi)
Attributes: devices=/dev/disk/by-id/wwn-0x6001405d8aef6fc57c4428b8dbf646b2
Meta Attrs: provides=unfencing
Operations: monitor interval=60s (iscsi-stonith-device-monitor-interval-60s)
Wait for 10-15 minutes…
$ pcs status
Cluster name: web-ha
Last updated: Mon Apr 27 11:12:36 2015
Last change: Mon Apr 27 10:54:06 2015
Stack: corosync
Current DC: wwwb (2) - partition with quorum
Version: 1.1.12-a14efad
2 Nodes configured
1 Resources configured
Online: [ wwwa wwwb ]
Full list of resources:
iscsi-stonith-device (stonith:fence_scsi): Stopped
Failed actions:
iscsi-stonith-device_start_0 on wwwb 'unknown error' (1): call=20, status=Error, exit-reason='none', last-rc-change='Mon Apr 27 11:09:12 2015', queued=0ms, exec=1081ms
iscsi-stonith-device_start_0 on wwwa 'unknown error' (1): call=6, status=Error, exit-reason='none', last-rc-change='Mon Apr 27 11:09:09 2015', queued=0ms, exec=1171ms
PCSD Status:
wwwa: Online
wwwb: Online
Daemon Status:
corosync: active/enabled
pacemaker: active/enabled
pcsd: active/enabled
I’ve attached /var/log/pacemaker.log for the node that the pcs commands were run on.
Any assistance with getting this to work gratefully received.
Regards,
Duncan
--
Duncan Mortimer
-------------- next part --------------
A non-text attachment was scrubbed...
Name: pacemaker.log
Type: application/octet-stream
Size: 61269 bytes
Desc: pacemaker.log
URL: <https://lists.clusterlabs.org/pipermail/users/attachments/20150427/f6fe1f43/attachment-0003.obj>
More information about the Users
mailing list