[Pacemaker] stonith q

Andrei Borzenkov arvidjaar at gmail.com
Mon Nov 3 00:41:42 EST 2014


В Sun, 2 Nov 2014 20:47:22 +0000
Alex Samad - Yieldbroker <Alex.Samad at yieldbroker.com> пишет:

> 
> 
> > -----Original Message-----
> > From: Digimer [mailto:lists at alteeve.ca]
> > Sent: Monday, 3 November 2014 3:26 AM
> > To: The Pacemaker cluster resource manager; Alex Samad - Yieldbroker
> > Subject: Re: [Pacemaker] stonith q
> > 
> > On 02/11/14 06:45 AM, Andrei Borzenkov wrote:
> > > В Sun, 2 Nov 2014 10:01:59 +0000
> > > Alex Samad - Yieldbroker <Alex.Samad at yieldbroker.com> пишет:
> > >
> > >>
> > >>
> > >>> -----Original Message-----
> > >>> From: Digimer [mailto:lists at alteeve.ca]
> > >>> Sent: Sunday, 2 November 2014 9:49 AM
> > >>> To: The Pacemaker cluster resource manager
> > >>> Subject: Re: [Pacemaker] stonith q
> > >>>
> > >>> On 01/11/14 06:27 PM, Alex Samad - Yieldbroker wrote:
> > >>>> Hi
> {snip}
> > >
> > > That hardly makes sense except in pure test environment. Stonith is
> > > needed when you do not know state of partner node, in which case you
> > > cannot be sure your reboot/shutdown command will be executed, nor that
> > > you can reach your partner at all.
> > >
> > > If you are running under Vmware, use stonith/vmware or stonith/vcenter.
> > 
> > Andrei is correct. A stonith method must be external to the node and work
> > regardless of the state of a node. Try this; 'echo c > /proc/sysrq-trigger' will
> > crash the node. Any stonith method that requires the OS to respond will fail
> > and your cluster will hang.
> 
> Yes but vmware will restart the node in that circumstance.
> I have had issues with my 2 node cluster where 1 node will remove itself because of lack of communication. Very hard to track down when it happens every now and then and only at 1am, I believe because of backup traffic or because its starved of cpu cycles.
> 
> What I would like to see happen in that situation if for a reboot to be issued, I know that the node would respond and I know that it would reconnect.
> 
> I read that there was a suicide option module that did what I wanted but its not available.
> 
> I don't want to setup useid for vmware for each node and configure that. I just want fenced to do a reboot via the os of the node instead of just killing cman 
> 
> What I am hearing is that its not available. Is it possible to hook to a custom script on that event, I can write my own restart
>  

Sure you can write your own external stonith script.




More information about the Pacemaker mailing list