[ClusterLabs] fence_scsi won't start

Duncan Mortimer duncan.mortimer at ndcn.ox.ac.uk
Mon Apr 27 06:18:08 EDT 2015


Hi,

I’m trying to setup a simple two host Apache cluster on Centos 7 using a shared iSCSI backing store (hosted on a third Centos 7 machine).

Following several how tos I have created two iSCSI devices, one for the shared filesystem and one for fencing. I have created /var/target/pr (for persistent reservation support) and confirmed that both cluster hosts can see both iSCSI targets (both read and write).

I then built the cluster with:

$ pcs cluster auth wwwa wwwb
Username: …….
Password: ……..
$ pcs cluster setup --start --name web-ha wwwa wwwb
Shutting down pacemaker/corosync services...
Redirecting to /bin/systemctl stop  pacemaker.service
Redirecting to /bin/systemctl stop  corosync.service
Killing any remaining services...
Removing all cluster configuration files...
wwwa: Succeeded
wwwb: Succeeded
Starting cluster on nodes: wwwa, wwwb...
wwwb: Starting Cluster...
wwwa: Starting Cluster...
$ pcs cluster enable --all
wwwa: Cluster Enabled
wwwb: Cluster Enabled

and add the stonith device…

$ pcs stonith create iscsi-stonith-device fence_scsi devices=/dev/disk/by-id/wwn-0x6001405d8aef6fc57c4428b8dbf646b2 meta provides=unfencing

Cluster name: web-ha
Last updated: Mon Apr 27 10:27:13 2015
Last change: Mon Apr 27 10:27:05 2015
Stack: corosync
Current DC: wwwa (1) - partition with quorum
Version: 1.1.12-a14efad
2 Nodes configured
1 Resources configured


Online: [ wwwa wwwb ]

Full list of resources:

 iscsi-stonith-device	(stonith:fence_scsi):	Stopped 

PCSD Status:
  wwwa: Online
  wwwb: Online

Daemon Status:
  corosync: active/enabled
  pacemaker: active/enabled
  pcsd: active/enabled

The STONITH device never starts.

$ pcs stonith show iscsi-stonith-device
 Resource: iscsi-stonith-device (class=stonith type=fence_scsi)
  Attributes: devices=/dev/disk/by-id/wwn-0x6001405d8aef6fc57c4428b8dbf646b2 
  Meta Attrs: provides=unfencing 
  Operations: monitor interval=60s (iscsi-stonith-device-monitor-interval-60s)

Wait for 10-15 minutes…

$ pcs status
Cluster name: web-ha
Last updated: Mon Apr 27 11:12:36 2015
Last change: Mon Apr 27 10:54:06 2015
Stack: corosync
Current DC: wwwb (2) - partition with quorum
Version: 1.1.12-a14efad
2 Nodes configured
1 Resources configured


Online: [ wwwa wwwb ]

Full list of resources:

 iscsi-stonith-device	(stonith:fence_scsi):	Stopped 

Failed actions:
    iscsi-stonith-device_start_0 on wwwb 'unknown error' (1): call=20, status=Error, exit-reason='none', last-rc-change='Mon Apr 27 11:09:12 2015', queued=0ms, exec=1081ms
    iscsi-stonith-device_start_0 on wwwa 'unknown error' (1): call=6, status=Error, exit-reason='none', last-rc-change='Mon Apr 27 11:09:09 2015', queued=0ms, exec=1171ms


PCSD Status:
  wwwa: Online
  wwwb: Online

Daemon Status:
  corosync: active/enabled
  pacemaker: active/enabled
  pcsd: active/enabled


I’ve attached /var/log/pacemaker.log for the node that the pcs commands were run on.

Any assistance with getting this to work gratefully received.

Regards,

Duncan


-- 
Duncan Mortimer
-------------- next part --------------
A non-text attachment was scrubbed...
Name: pacemaker.log
Type: application/octet-stream
Size: 61269 bytes
Desc: pacemaker.log
URL: <http://lists.clusterlabs.org/pipermail/users/attachments/20150427/f6fe1f43/attachment-0002.obj>


More information about the Users mailing list