[Pacemaker] Strange DRBD error in cluster operation

Lars Ellenberg lars.ellenberg at linbit.com
Mon Sep 5 12:49:55 EDT 2011


On Mon, Sep 05, 2011 at 04:40:31PM +0200, Michael Schwartzkopff wrote:
> > On Thu, Sep 01, 2011 at 02:59:56PM +0200, Michael Schwartzkopff wrote:
> > > Hi,
> > > 
> > > from time to time we see the DRBD M/S resource failing on one of our
> > > clusters.
> > > 
> > > From the logs we see that the monitoring fails with rc=5 (not_installed)
> > > and the log entry:
> > > 
> > > lrmd: [2454]: info: RA output: (resDRBD:1:monitor:stderr)
> > > /etc/drbd.conf:3: Failed to open include file
> > > 'drbd.d/global_common.conf'.
> > > 
> > > This happens about once per week and causes constant trouble.
> > > 
> > > Any ideas what might be the reason for this behavior?
> > 
> > You periodically re-create that file from some "recipe",
> > and it so happens that at the time of the monitor,
> > it is not there?
> 
> Of course, this also was my first thought.
> 
> The file is managed by cfengine, but the guys in charge of cfengine swear that
> it does not interfere with the monitoring.
> 
> So I wanted to ask if there is another known reason for this behavior, besides 
> the obvious.

Monitor the inode, mtime and ctime of that file.
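
A minimal sketch of such a watcher, in case it helps. It simply polls
os.stat() and prints a timestamped line whenever the inode, mtime or ctime
changes, or the file appears or disappears. The path is an assumption
(the include in /etc/drbd.conf is relative, so I take it to resolve to
/etc/drbd.d/global_common.conf), and the function names and the one-second
interval are made up for illustration. Polling can miss a gap shorter than
the interval, but a file that was removed and re-created will normally show
up as a new inode or ctime.

```python
import os
import time

# Assumed path: /etc/drbd.conf includes 'drbd.d/global_common.conf',
# which should resolve relative to /etc. Adjust if your layout differs.
DEFAULT_PATH = "/etc/drbd.d/global_common.conf"


def snapshot(path):
    """Return (inode, mtime_ns, ctime_ns), or None if the file is missing."""
    try:
        st = os.stat(path)
    except FileNotFoundError:
        return None
    return (st.st_ino, st.st_mtime_ns, st.st_ctime_ns)


def describe(old, new):
    """Describe what changed between two snapshots, or None if nothing did."""
    if old == new:
        return None
    if new is None:
        return "file disappeared"
    if old is None:
        return "file appeared"
    names = ("inode", "mtime", "ctime")
    changed = [n for n, a, b in zip(names, old, new) if a != b]
    return "changed: " + ", ".join(changed)


def watch(path, interval=1.0):
    """Poll forever, printing a timestamped line on every change."""
    last = snapshot(path)
    stamp = time.strftime("%Y-%m-%d %H:%M:%S")
    print(stamp, "initial", last, flush=True)
    while True:
        time.sleep(interval)
        cur = snapshot(path)
        msg = describe(last, cur)
        if msg:
            stamp = time.strftime("%Y-%m-%d %H:%M:%S")
            print(stamp, msg, cur, flush=True)
        last = cur
```

Run it with watch(DEFAULT_PATH) and redirect the output to a log, then
compare the timestamps against the failed monitor operations and the
cfengine runs.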

Or make it immutable with chattr +i, and see if someone notices :)


-- 
: Lars Ellenberg
: LINBIT | Your Way to High Availability
: DRBD/HA support and consulting http://www.linbit.com

DRBD® and LINBIT® are registered trademarks of LINBIT, Austria.



