[ClusterLabs] Q: fence_kdump and fence_kdump_send

Ulrich Windl Ulrich.Windl at rz.uni-regensburg.de
Thu Feb 24 07:21:58 EST 2022


Hi!

After reading about fence_kdump and fence_kdump_send I wonder:
Does anybody use that in production?
Having the networking and bonding in initrd does not sound like a good idea to me.
Wouldn't it be easier to integrate that functionality into sbd?
I mean: Let sbd wait for a "kdump-ed" message that initrd could send when kdump is complete.
Basically that would be the same mechanism, but using storage instead of networking.

If I get it right, the original fence_kdump would also introduce an extra fencing delay, and I wonder what happens with a hardware watchdog while a kdump is in progress...

The background of all this is that our nodes kernel-panic, and support says the kdumps are all incomplete.
The events are most likely:
node1: panics (kdump)
other_node: seens node1 had failed and fences it (via sbd).

However sbd fencing wont work while kdump is executing (IMHO)

So what happens most likely is that the watchdog terminates the kdump.
In that case all the mess with fence_kdump won't help, right?

Regards,
Ulrich






More information about the Users mailing list