[ClusterLabs] Pacemaker / ubuntu doesn't see my sbd device: what am I missing?
Tavanyar, Simon
Simon.Tavanyar at stratus.com
Wed Apr 6 15:34:18 EDT 2022
This is my first time using Pacemaker, and I wanted to try watchdog-only fencing with SBD.
I’m running on Ubuntu 21.10 and Pacemaker v2.0.5
My cluster is up just fine with Dummy services on two nodes.
Systemd says my sbd device is active and running.
But the ‘stonith’ command that Pacemaker uses won’t find it, so the resource fails to start in the cluster.
Help much appreciated!
Thanks
Simon
$ sudo stonith -t external/sbd -E -S
external/sbd[361914]: ERROR: No sbd device(s) found in the configuration.
WARN: external_status: 'sbd status' failed with rc 1
ERROR: external/sbd device not accessible.
$ systemctl status sbd
● sbd.service - Shared-storage based fencing daemon
Loaded: loaded (/lib/systemd/system/sbd.service; enabled; vendor preset: enabled)
Active: active (running) since Fri 2022-04-01 15:18:04 EDT; 4 days ago
Docs: man:sbd(8)
Process: 2474278 ExecStart=/usr/sbin/sbd $SBD_OPTS -p /var/run/sbd.pid watch (code=exited, status=0/SUCCESS)
Main PID: 2474279 (sbd)
Tasks: 3 (limit: 38258)
Memory: 11.2M
CPU: 4min 7.329s
CGroup: /system.slice/sbd.service
├─2474279 sbd: inquisitor
├─2474280 sbd: watcher: Pacemaker
└─2474281 sbd: watcher: Cluster
$ sudo pcs status
Cluster name: Axx
Cluster Summary:
* Stack: corosync
* Current DC: node0 (version 2.0.5-ba59be7122) - partition with quorum
* Last updated: Wed Apr 6 14:38:44 2022
* Last change: Wed Apr 6 14:38:35 2022 by root via cibadmin on node0
* 2 nodes configured
* 6 resource instances configured
Node List:
* Online: [ node0 node1 ]
Full List of Resources:
* Resource Group: AxxDummy:
* p_Dummy_1 (ocf::heartbeat:Dummy): Started node0
* p_Dummy_2 (ocf::heartbeat:Dummy): Started node0
* p_Dummy_3 (ocf::heartbeat:Dummy): Started node0
* ClusterIP (ocf::heartbeat:IPaddr2): Started node0
* p_Dummy_4 (ocf::heartbeat:Dummy): Started node0
* fence-sbd (stonith:external/sbd): Stopped
Failed Resource Actions:
* fence-sbd_start_0 on node0 'error' (1): call=51, status='complete', exitreason='', last-rc-change='2022-04-06 14:38:13 -04:00', queued=0ms, exec=3102ms
* fence-sbd_start_0 on node1 'error' (1): call=41, status='complete', exitreason='', last-rc-change='2022-04-06 14:38:09 -04:00', queued=0ms, exec=3094ms
Daemon Status:
corosync: active/enabled
pacemaker: active/enabled
pcsd: active/enabled
sbd: active/enabled
This is from /var/log/syslog
Apr 6 14:40:43 ubuntuserver pacemaker-controld[349716]: notice: Requesting local execution of start operation for fence-sbd on node0
Apr 6 14:40:43 ubuntuserver external/sbd[349924]: [349930]: ERROR: No sbd device(s) found in the configuration.
Apr 6 14:40:46 ubuntuserver pacemaker-fenced[349712]: notice: Operation 'monitor' [349931] for device 'fence-sbd' returned: -61 (No data available)
Apr 6 14:40:46 ubuntuserver pacemaker-fenced[349712]: warning: fence-sbd:349931 [ Performing: stonith -t external/sbd -E -S ]
Apr 6 14:40:46 ubuntuserver pacemaker-fenced[349712]: warning: fence-sbd:349931 [ failed: 1 ]
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.clusterlabs.org/pipermail/users/attachments/20220406/882df516/attachment-0001.htm>
More information about the Users
mailing list