[ClusterLabs] Resource Parameter Change Not Honoring Constraints

Marc Smith msmith626 at gmail.com
Mon Mar 16 13:26:30 EDT 2020


On Thu, Mar 12, 2020 at 10:51 AM Ken Gaillot <kgaillot at redhat.com> wrote:
>
> On Wed, 2020-03-11 at 17:24 -0400, Marc Smith wrote:
> > Hi,
> >
> > I'm using Pacemaker 1.1.20 (yes, I know, a bit dated now). I noticed
>
> I'd still consider that recent :)
>
> > when I modify a resource parameter (eg, update the value), this
> > causes
> > the resource itself to restart. And that's fine, but when this
> > resource is restarted, it doesn't appear to honor the full set of
> > constraints for that resource.
> >
> > I see the output like this (right after the resource parameter
> > change):
> > ...
> > Mar 11 20:43:25 localhost crmd[1943]:   notice: State transition
> > S_IDLE -> S_POL
> > ICY_ENGINE
> > Mar 11 20:43:25 localhost crmd[1943]:   notice: Current ping state:
> > S_POLICY_ENG
> > INE
> > Mar 11 20:43:25 localhost pengine[1942]:   notice: Clearing failure
> > of
> > p_bmd_140c58-1 on 140c58-1 because resource parameters have changed
> > Mar 11 20:43:25 localhost pengine[1942]:   notice:  * Restart
> > p_bmd_140c58-1             (                   140c58-1 )   due to
> > resource definition change
> > Mar 11 20:43:25 localhost pengine[1942]:   notice:  * Restart
> > p_dummy_g_lvm_140c58-1     (                   140c58-1 )   due to
> > required g_md_140c58-1 running
> > Mar 11 20:43:25 localhost pengine[1942]:   notice:  * Restart
> > p_lvm_140c58_vg_01         (                   140c58-1 )   due to
> > required p_dummy_g_lvm_140c58-1 start
> > Mar 11 20:43:25 localhost pengine[1942]:   notice: Calculated
> > transition 41, saving inputs in
> > /var/lib/pacemaker/pengine/pe-input-173.bz2
> > Mar 11 20:43:25 localhost crmd[1943]:   notice: Initiating stop
> > operation p_lvm_140c58_vg_01_stop_0 on 140c58-1
> > Mar 11 20:43:25 localhost crmd[1943]:   notice: Transition aborted by
> > deletion of lrm_rsc_op[@id='p_bmd_140c58-1_last_failure_0']: Resource
> > operation removal
> > Mar 11 20:43:25 localhost crmd[1943]:   notice: Current ping state:
> > S_TRANSITION_ENGINE
> > ...
> >
> > The stop on 'p_lvm_140c58_vg_01' then times out, because the other
> > constraint (to stop the service above LVM) is never executed. I can
> > see from the messages it never even tries to demote the resource
> > above
> > that.
> >
> > Yet, if I use crmsh at the shell, and do a restart on that same
> > resource, it works correctly, and all constraints are honored: crm
> > resource restart p_bmd_140c58-1
> >
> > I can certainly provide my full cluster config if needed, but hoping
> > to keep this email concise for clarity. =)
> >
> > I guess my questions are: 1) Is the difference in restart behavior
> > expected, and not all constraints are followed when resource
> > parameters change (or some other restart event that originated
> > internally like this)? 2) Or perhaps this is known bug that was
> > already resolved in newer versions of Pacemaker?
>
> No to both. Can you attach that pe-input-173.bz2 file (with any
> sensitive info removed)?

Thanks; that system got wiped, so I reproduced it on another system
and I am attaching that pe-input file. Log snippet is below for
completeness:

Mar 16 17:16:50 localhost crmd[1340]:   notice: State transition S_IDLE -> S_POL
ICY_ENGINE
Mar 16 17:16:50 localhost pengine[1339]:   notice:  * Restart
p_bmd_126c4f-1             (                   126c4f-1 )   due to
resource definition change
Mar 16 17:16:50 localhost pengine[1339]:   notice:  * Restart
p_dummy_g_lvm_126c4f-1     (                   126c4f-1 )   due to
required g_md_126c4f-1 running
Mar 16 17:16:50 localhost pengine[1339]:   notice:  * Restart
p_lvm_126c4f_vg_01         (                   126c4f-1 )   due to
required p_dummy_g_lvm_126c4f-1 start
Mar 16 17:16:50 localhost pengine[1339]:   notice: Calculated
transition 149, saving inputs in
/var/lib/pacemaker/pengine/pe-input-46.bz2


--Marc


> >
> > I searched a bit for #2 but I didn't get many (well any) hits on
> > other
> > users experiencing this behavior.
> >
> > Many thanks in advance.
> >
> > --Marc
> --
> Ken Gaillot <kgaillot at redhat.com>
>
> _______________________________________________
> Manage your subscription:
> https://lists.clusterlabs.org/mailman/listinfo/users
>
> ClusterLabs home: https://www.clusterlabs.org/
-------------- next part --------------
A non-text attachment was scrubbed...
Name: pe-input-46.bz2
Type: application/x-bzip
Size: 4294 bytes
Desc: not available
URL: <http://lists.clusterlabs.org/pipermail/users/attachments/20200316/d5090ab6/attachment.bin>


More information about the Users mailing list