[ClusterLabs] replacing SBD devices on SLES12

Schaefer, Diane E diane.schaefer at unisys.com
Mon Aug 3 11:18:40 EDT 2015


We are having trouble replacing SBD devices on a live cluster running SLES 12 with the following package versions:

pacemaker 1.1.12-7.1
sbd 1.2.1-8.7
corosync 2.3.3-7.12

We sometimes see the errors below.  We are not sure which device is "device 4" or why sbd thinks the header is bad.  The replacement sometimes works OK, so the problem seems transient.

Aug 03 11:04:38 usrv-tsegp1 sbd[2987]: [2987]: info: Watchdog enabled.
Aug 03 11:04:38 usrv-tsegp1 sbd[2987]: [2987]: ERROR: Header magic does not match.
Aug 03 11:04:38 usrv-tsegp1 sbd[2987]: [2987]: ERROR: header on device 4 is not valid.

We also see dependency errors in pacemaker where it says corosync fails to start.
A reboot seems to fix it.

Our SBD config is:

SBD_DEVICE="/dev/disk/by-id/scsi-14945540000000000633a8f8ff9e6d4a36268317b4175f84e;/dev/disk/by-id/scsi-1494554000000000065418950b11404a6b7e8a6ba6a82a05c;/dev/disk/by-id/scsi-149455400000000009ad2f567db1a0c3280b1efc6d7feb0dd"
SBD_WATCHDOG="yes"

A dump of the devices is:
+ sbd -d /dev/disk/by-id/scsi-14945540000000000633a8f8ff9e6d4a36268317b4175f84e dump
==Dumping header on disk /dev/disk/by-id/scsi-14945540000000000633a8f8ff9e6d4a36268317b4175f84e
Header version     : 2.1
UUID               : f74bf73f-a6b6-4541-b192-e36dacbae3d6
Number of slots    : 255
Sector size        : 512
Timeout (watchdog) : 5
Timeout (allocate) : 2
Timeout (loop)     : 1
Timeout (msgwait)  : 10
==Header on disk /dev/disk/by-id/scsi-14945540000000000633a8f8ff9e6d4a36268317b4175f84e is dumped
+ sbd -d /dev/disk/by-id/scsi-1494554000000000065418950b11404a6b7e8a6ba6a82a05c dump
==Dumping header on disk /dev/disk/by-id/scsi-1494554000000000065418950b11404a6b7e8a6ba6a82a05c
Header version     : 2.1
UUID               : 52b1539f-f18e-4266-8065-26ab49c472c0
Number of slots    : 255
Sector size        : 512
Timeout (watchdog) : 5
Timeout (allocate) : 2
Timeout (loop)     : 1
Timeout (msgwait)  : 10
==Header on disk /dev/disk/by-id/scsi-1494554000000000065418950b11404a6b7e8a6ba6a82a05c is dumped
+ sbd -d /dev/disk/by-id/scsi-149455400000000009ad2f567db1a0c3280b1efc6d7feb0dd dump
==Dumping header on disk /dev/disk/by-id/scsi-149455400000000009ad2f567db1a0c3280b1efc6d7feb0dd
Header version     : 2.1
UUID               : ad21ba2f-f09e-49c1-a48b-119a032dc9b2
Number of slots    : 255
Sector size        : 512
Timeout (watchdog) : 5
Timeout (allocate) : 2
Timeout (loop)     : 1
Timeout (msgwait)  : 10
==Header on disk /dev/disk/by-id/scsi-149455400000000009ad2f567db1a0c3280b1efc6d7feb0dd is dumped
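
The per-device dump above can be scripted so that a failing header is reported next to its device path. This is a minimal sketch, assuming SBD_DEVICE holds the semicolon-separated list shown in the config and that `sbd -d <device> dump` exits non-zero when the header cannot be read. The function names split_sbd_devices and dump_all_sbd are hypothetical helpers, not part of sbd.

```shell
#!/bin/sh
# Hypothetical helper: print each device from a semicolon-separated
# SBD_DEVICE string on its own line.
split_sbd_devices() {
    printf '%s\n' "$1" | tr ';' '\n'
}

# Hypothetical helper: dump the header of every configured device.
# Assumption: "sbd -d DEV dump" returns non-zero on an unreadable header,
# in which case the device path is reported on stderr.
dump_all_sbd() {
    split_sbd_devices "$1" | while IFS= read -r dev; do
        # Skip empty entries (e.g. from a trailing ';').
        [ -n "$dev" ] || continue
        if ! sbd -d "$dev" dump; then
            echo "dump failed for $dev" >&2
        fi
    done
}

# Usage (assumes SBD_DEVICE is already set in the environment):
# dump_all_sbd "$SBD_DEVICE"
```

Running this on each node while the error is occurring may help show which path sbd is complaining about, since the log only gives a device number.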

Some of our SBD devices are shared between SLES 11 and SLES 12 clusters.  Is this allowed?

Thanks for any help that can be shared,
Diane Schaefer