[Pacemaker] 2 sbd devices and stonith-ng is showing (1 active devices)

Lars Marowsky-Bree lmb at suse.com
Thu Mar 15 17:08:48 EDT 2012


On 2012-03-15T15:39:23, "Janec, Jozef" <jozef.janec at hp.com> wrote:

> We have configured pacemaker on HAE  from novell:
> 
> cat /etc/sysconfig/sbd
> SBD_DEVICE="/dev/mapper/SHARED1_part1;/dev/mapper/SHARED2_part1"
> SBD_OPTS="-W"
> 
> I'm running  2 instances of watcher
> 
> root      9157     1  0 11:00 pts/0    00:00:00 sbd: inquisitor
> root      9158  9157  0 11:00 pts/0    00:00:00 sbd: watcher: /dev/mapper/SHARED1_part1 - slot: 0
> root      9159  9157  0 11:00 pts/0    00:00:00 sbd: watcher: /dev/mapper/SHARED2_part1 - slot: 1

That looks fine, but did you read
http://www.linux-ha.org/wiki/SBD_Fencing about the limitations of using
2 devices?

> I have running one resource per node
> 
> Online: [ b300ple0 b400ple0 ]
> 
> sbd_fense_b400  (stonith:external/sbd): Started b400ple0
> sbd_fense_b300  (stonith:external/sbd): Started b300ple0

Why is that? Did you read http://www.linux-ha.org/wiki/SBD_Fencing?

You only need one external/sbd resource per cluster. A single primitive
is sufficient. There is no need to run several, nor to clone them.

> Mar 17 11:03:51 b400ple0 stonith-ng: [9467]: info: stonith_device_register: Added 'sbd_fense_b400' to the device list (1 active devices)

Yes, because from stonith-ng's point of view there is only one
"sbd" device. Internally it uses two storage devices, but stonith-ng
doesn't know about those.
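
To illustrate how one "sbd" device fans out to two storage devices,
here is a rough sketch. It is not taken from the sbd init script, and
the helper name sbd_device_args is made up for this example. It only
splits the SBD_DEVICE value from your /etc/sysconfig/sbd on ";" and
prints the resulting command line with one -d per device, without
running it.

```shell
#!/bin/sh
# Hypothetical helper, not part of sbd: turn the semicolon-separated
# SBD_DEVICE value into repeated "-d <device>" arguments.
sbd_device_args() {
    old_ifs=$IFS
    IFS=';'
    args=""
    for dev in $1; do
        args="$args -d $dev"
    done
    IFS=$old_ifs
    # Strip the leading space before printing.
    echo "${args# }"
}

SBD_DEVICE="/dev/mapper/SHARED1_part1;/dev/mapper/SHARED2_part1"
# Print (do not run) the single sbd command that covers both devices.
echo "sbd $(sbd_device_args "$SBD_DEVICE") list"
```

The point is that both devices end up behind a single sbd invocation,
which is why stonith-ng counts one device.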

> I got:
> 
> 15 11:15:33 b300ple0 stonith-ng: [8546]: debug: exec_child_done: Got 60 more bytes: Performing: stonith -t external/sbd -S  failed:  0.05859375
> Mar 15 11:15:33 b300ple0 stonith-ng: [8546]: notice: log_operation: Operation 'monitor' [15803] for device 'sbd_fense_b300' returned: 1
> Mar 15 11:15:33 b300ple0 stonith-ng: [8546]: debug: log_operation: sbd_fense_b300 output: Performing: stonith -t external/sbd -S
> Mar 15 11:15:33 b300ple0 stonith-ng: [8546]: debug: log_operation: sbd_fense_b300 output: failed:  0.05859375
> Mar 15 11:15:33 b300ple0 lrm-stonith: [15802]: debug: execra: sbd_fense_b300_monitor returned 1
> Mar 15 11:15:33 b300ple0 stonith-ng: [8546]: debug: log_operation: sbd_fense_b300 output:  (total 60 bytes)

The agent itself should also have logged something.

Have you, by chance, configured one external/sbd instance per device?
That would be wrong; you need to run one external/sbd instance per
cluster for all devices.
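
A minimal sketch of what that single instance might look like follows.
The resource name "stonith-sbd" is an arbitrary choice, and I am
assuming your external/sbd version accepts a semicolon-separated
sbd_device list; check "stonith -t external/sbd -n" and the wiki page
before relying on it. The script only prints a snippet that could be
pasted into "crm configure"; it does not touch the cluster.

```shell
#!/bin/sh
# Both storage devices go into ONE primitive; the resource name
# "stonith-sbd" is an arbitrary choice for this example.
SBD_DEVICE="/dev/mapper/SHARED1_part1;/dev/mapper/SHARED2_part1"

# Hypothetical helper: emit a crm configure snippet for one primitive.
crm_sbd_snippet() {
    cat <<EOF
primitive stonith-sbd stonith:external/sbd \\
    params sbd_device="$1"
EOF
}

# Only prints the snippet; nothing is applied to the cluster.
crm_sbd_snippet "$SBD_DEVICE"
```

The two existing per-node primitives would have to be removed first,
so that this is the only external/sbd resource in the cluster.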

Did you read http://www.linux-ha.org/wiki/SBD_Fencing?

Regards,
    Lars

-- 
Architect Storage/HA
SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, HRB 21284 (AG Nürnberg)
"Experience is the name everyone gives to their mistakes." -- Oscar Wilde




