[ClusterLabs] [EXT] Rebuild of failed node

Windl, Ulrich u.windl at ukr.de
Mon May 12 06:41:26 UTC 2025


Maybe explain what “failed node” and “rebuild” actually mean:
Was it fenced, was it reinstalled, or did you have a fatal disk failure?
Usually a backup is your best friend.

Kind regards,
Ulrich Windl

From: Users <users-bounces at clusterlabs.org> On Behalf Of Fabrizio Ermini
Sent: Friday, May 9, 2025 4:26 PM
To: users at clusterlabs.org
Subject: [EXT] [ClusterLabs] Rebuild of failed node

Hi all! Freshman here, just joined.

I currently need to rebuild a failed node on a two-node Pacemaker 2.1 / Corosync 3.1 cluster with DRBD storage.
I've searched the Pacemaker docs and the list archives, but I haven't found a clear guide on how to proceed with this task. So far, I've installed a new server, configured it with the same IP and hostname as the failed one, and installed all the software. I've also fixed the DRBD layer and started the resync of the volumes. But it's not clear to me how to proceed from here: I've found some hints online pointing to the need to manually copy the corosync config, but they were quite old and probably obsolete. I'm using pcs as a shell, and I haven't found a command designed to replace a node, only commands to add or remove one.
It seems really strange to me that there isn't a guide, since this should be a very basic operation and it's quite important to know how to do it - HW breaks, as a matter of fact :D
So I'll be very grateful if anyone can point me in the right direction.
Thanks in advance, and best regards

Fabrizio


