arvidjaar at gmail.com
Sat May 8 02:12:23 EDT 2021
On 07.05.2021 13:36, Kyle O'Donnell wrote:
> Hi Everyone.
> We've setup fencing with our ilo/idrac interfaces and things generally work well but during some of our failover scenario testing we ran into issues when we "failed' the switches in which those ilo/idrac interfaces were connected. The issue was that resources were migrated away from any node with an offline fencing device. I can see how that is desirable, but in our case this is essentially a single point of failure. How are others managing this?
I am not sure I understand the issue. So node did not fail and remained
online but pacemaker migrated resources off this node? And what exactly
"offline fencing device" means?
Sounds you have some constraints that do it. You need to post logs at
least from DC from the point stonith resource failed as well as your
actual configuration with all constraints.
> In one of our sites we have "smart" APC power strips so we can setup multiple fencing devices, but in another site we do not. I tried increasing the timeout= value on the fencing devices but that did not seem to work.
> Manage your subscription:
> ClusterLabs home: https://www.clusterlabs.org/
More information about the Users