[ClusterLabs] Two-Node OCFS2 cluster keep rebooting each other

Jonathan Vargas jonathan.vargas at alkaid.cr
Wed Jun 10 05:50:32 UTC 2015


2015-06-09 23:26 GMT-06:00 Digimer <lists at alteeve.ca>:

> On 10/06/15 01:19 AM, Jonathan Vargas wrote:
> > Thanks Andrei, Digimer.
> >
> > I see. Since I need to steer this discussion toward a definitive solution,
> > I am sharing a diagram of how we are designing this HA architecture,
> > to clarify the problem we are trying to solve:
> >
> > http://i.imgur.com/BFPcZSx.png
>
> Last block is DRBD. If DRBD will be managed by the cluster, it must have
> fencing.
>
> This is your definitive answer.
>
> Without it, you *will* get a split-brain. That leads to, at best, data
> divergence or data loss.
>
> > The first layer, Load Balancer, and the third layer, Database, are both
> > already set up. The Load Balancer cluster uses only a VIP resource,
> > while the Database cluster uses DRBD+VIP resources. They are in production
> > and work fine, tests passed :-)
> >
> > Now we are handling the Web Server layer, which I am discussing with
> > experts like you. These servers all need to be active and see the same
> > data for reads and writes, as fast as possible, mostly reads.
> >
> > *So, if we stay with OCFS2: *Since we need to protect service
> > availability and keep most nodes up, what choices do I have to avoid
> > reboots of both Web nodes caused by a split-brain situation when one of
> > them is disconnected from the network?
>
> None of this matters relative to the importance of working, tested
> fencing for replicated storage.
>
> In any HA setup, the reboot of a node should not matter. If you are
> afraid of rebooting a node, you need to reconsider your design.
>
>

Well, the problem is caused by a pretty common scenario: a simple network
disconnection on node 1 causes both nodes to reboot, and even while node 1
is still offline it keeps rebooting the active node 2. There were no disk
issues, but service availability was lost. *That's the main
complaint now :-/*
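
From what I've read so far, two things should at least break that loop in a
two-node cluster: enabling wait_for_all, so a freshly rebooted node refuses
to act until it has seen its peer again, and putting a delay on the fence
devices, so the two nodes cannot shoot each other at the same instant. A
rough sketch of what I have in mind, assuming corosync 2.x with votequorum
and an IPMI-style stonith agent (node names, addresses and credentials below
are made up):

    # corosync.conf -- two-node quorum settings
    quorum {
        provider: corosync_votequorum
        two_node: 1          # stay quorate with only one of two nodes
        wait_for_all: 1      # after boot, wait until both nodes are seen
    }

    # Pacemaker (crm shell): fence device for node web1, with a random
    # delay so both nodes do not fire simultaneously after a split
    primitive fence-web1 stonith:external/ipmi \
        params hostname=web1 ipaddr=192.168.10.11 userid=admin \
               passwd=secret pcmk_delay_max=15s

If I understand it right, with wait_for_all the node that comes back from a
reboot just waits for its peer instead of immediately fencing it again, and
the delay only decides who wins the initial shoot-out; neither replaces
fencing itself.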



> > Correct me if I'm wrong:
> >
> > *1. Redundant Channel:* This is pretty difficult, since we would have to
> > add two new physical network cards to the virtual machine hosts, and
> > that changes the network configuration a lot in the virtualization platform.
>
> High Availability must put priorities like hassle and cost second to
> what makes a system more resilient. If you choose not to spend the extra
> money or time, then you must accept the risks.
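
One option I had not considered before: corosync can apparently run a
second, redundant ring over an already existing separate network (e.g. the
storage VLAN), which would avoid adding physical NICs. A minimal totem
sketch for corosync 2.x with RRP; both subnets below are placeholders:

    totem {
        version: 2
        rrp_mode: passive            # use ring 1 only when ring 0 fails
        interface {
            ringnumber: 0
            bindnetaddr: 192.168.10.0    # hypothetical primary cluster net
            mcastport: 5405
        }
        interface {
            ringnumber: 1
            bindnetaddr: 10.0.20.0       # hypothetical second network
            mcastport: 5407
        }
    }

Of course a second ring only reduces the chance of a membership split; it
does not remove the need for fencing.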


> > *2. Three Node Cluster:* This is possible, but it will consume more
> > resources. We could use it only for cluster communication though, not for
> > web processing, which would keep its load down.
>
> Quorum is NOT a substitution for fencing. They solve different problems.
>
> Quorum is a tool for when all nodes are behaving properly. Fencing is a
> tool for when a node is not behaving properly.
>


Yes, but adding a 3rd node would help determine which node is actually
failing and which are healthy, so that the right one gets fenced. Right?
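
And the third vote would not have to carry any web load at all; it could be
kept as a quorum-only member, e.g. (crm shell syntax; the node name web3 and
the resource group g_web are hypothetical):

    # keep the arbitrator node free of resources...
    crm node standby web3

    # ...or forbid the web resources from ever running on it
    crm configure location no-web-on-web3 g_web -inf: web3

As I understand it, a standby node still contributes its quorum vote, so
when one web node is cut off it simply loses quorum and gets fenced once,
instead of the two web nodes trading reboots.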



>
> > *3. Disable Fencing:* You said this should not happen at all if we use a
> > shared disk like OCFS. So I am discarding it.
>
> Correct.
>
> > *4. Use NFS: *Yes, this will cause a SPoF, and to solve it we would have
> > to set up another cluster with DRBD as described here
> > <https://www.suse.com/documentation/sle_ha/singlehtml/book_sleha_techguides/book_sleha_techguides.html>,
> > and add more infrastructure resources. Or can we set up NFS over OCFS2?
>
> ... Which would require fencing anyway, so you gain nothing but another
> layer of things to break. First rule of HA: keep it simple.
>
> Complexity is the enemy of availability.
>


Sure, fencing would have to be added in that case too.
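
For the NFS-on-DRBD variant, I understand that usually means DRBD's
resource-and-stonith fencing policy plus the crm-fence-peer handler, so DRBD
freezes I/O and sets a constraint in the CIB until the peer is confirmed
fenced. A minimal sketch of the relevant pieces of a DRBD 8.4-style resource
file (the resource name "web" is made up, and on some DRBD versions the
fencing keyword lives in the net section instead of disk):

    resource web {
        disk {
            fencing resource-and-stonith;    # freeze I/O until peer is fenced
        }
        handlers {
            fence-peer          "/usr/lib/drbd/crm-fence-peer.sh";
            after-resync-target "/usr/lib/drbd/crm-unfence-peer.sh";
        }
        # ... volumes, on <host> sections, etc. as usual
    }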



>
> > Thanks in advance.
> >
> > *Jonathan Vargas Rodríguez*
> > Founder and Solution Engineer
> > Alkaid <https://alkaid.cr/> | Open Source Software
> >
> >  mail   jonathan.vargas at alkaid.cr
> >  telf   +506 4001 6259 Ext. 01
> >  mobi   +506 4001 6259 Ext. 51
> >
> >
> >
> > 2015-06-09 22:03 GMT-06:00 Andrei Borzenkov <arvidjaar at gmail.com
> > <mailto:arvidjaar at gmail.com>>:
> >
> >     On Tue, 9 Jun 2015 21:53:41 -0600,
> >     Jonathan Vargas <jonathan.vargas at alkaid.cr
> >     <mailto:jonathan.vargas at alkaid.cr>> wrote:
> >
> >     > Thanks,
> >     >
> >     > Those nodes do not need coordination between them. They have been
> >     > working fine until now without HA or OCFS2. A load balancer
> >     > distributes the requests between both nodes; they do not know about
> >     > the existence of each other.
> >     >
> >     > However, they do require shared storage to work with the same data.
> >     > Before setting up the OCFS2 cluster, we had been syncing disks with
> >     > rsync, but it syncs each minute, not in real time.
> >     >
> >     > So, our requirement depends on OCFS2, and it works, but I don't
> >     > think it depends on an HA and stonith setup. I see no way it could
> >     > add value to the required solution. Or does it?
> >     >
> >
> >     You need coordination between nodes on write, and even if you mount
> >     your filesystem read-only you still have at least boot-time journal
> >     replay. So no, your nodes cannot free-run.
> >
> >     You probably want to use NFS for this.
> >
>
>
> --
> Digimer
> Papers and Projects: https://alteeve.ca/w/
> What if the cure for cancer is trapped in the mind of a person without
> access to education?
>
> _______________________________________________
> Users mailing list: Users at clusterlabs.org
> http://clusterlabs.org/mailman/listinfo/users
>
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs: http://bugs.clusterlabs.org
>