[Pacemaker] Question on resource groups

Ken Dechick kend at medent.com
Tue Jan 19 11:38:06 EST 2010


Hello all,

Quick question here today. Please forgive me if this has been answered, I have searched for a couple days and not been able to come up with the answer. I am working on a standard 2 node cluster using DRBD and I have my resources in a group. All in working well, but my question has to do with what happens when there is a problem with an individual service. Consider the following example using heartbeat (3.0.1-1) drbd (8.3.6) and pacemaker (1.0.6):

Cluster with one reosurce group which contains these resources in this order: 
   
   -drbd master/slave
   -virtual file system
   -openvpn
   -samba
   -apache webserver
   -cupsd

Problem I am running into is if there is a problem with openvpn in this example (VPN goes down and keys are missing so it CANNOT restart without intervention), watching the cluster with crm_mon, I see that all the services under openvpn in order (samba,apache, cupsd) will all starta "rolling restart". In other words, I see openvpn fail, then samba goes down, then apache goes down, then cups goes down. Next cups comes up, apache comes up, samba comes up, then openvpn tries to start but fails so the progress starts over - smba, apache and cups stop then start again. What I end up with is a system where those last 3 services which runs fine alone keep coming up then going down again, over and over. Only way I can change this is to fix the openvpn issue, then things restart and stay restarted.

My question is: is this normal (expected) behavior? If so how do I change this? I have tried several on-fail options in the monitors for those services (tried: stop, restart, and block) but this doesn't change the behavior. I would like to just have the one service stop without affecting the others. Do I need to re-think using a resource group?? Any assistance would be greatly appreciated. The pacemaker site has a lot of documentation but it's not the clearest explainations at times.

-Thanks

Kenneth M DeChick
Linux Systems Administrator
Community Computer Service, Inc.
(315)-255-1751 ext154
http://www.medent.com
kend at medent.com
Registered Linux User #497318
-- -- -- -- -- -- -- -- -- -- --
"You canna change the laws of physics, Captain; I've got to have thirtyminutes! "

.
 
This message has been scanned for viruses and dangerous content by MailScanner, SpamAssassin  & ClamAV. <BR>
 <BR>
This message and any attachments may contain information that is protected by law as privileged and confidential, and is transmitted for the sole use <BR>
of the intended recipient(s). If you are not the intended recipient, you are hereby notified that any use, dissemination, copying or retention of this e-mail <BR>
or the information contained herein is strictly prohibited. If you received this e-mail in error, please immediately notify the sender by e-mail, and permanently <BR>
delete this e-mail. <BR>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.clusterlabs.org/pipermail/pacemaker/attachments/20100119/d2d766dc/attachment-0001.html>


More information about the Pacemaker mailing list