[Pacemaker] crm resource restart fails to restart the service

Dejan Muhamedagic dejanmm at fastmail.fm
Wed Nov 17 13:03:47 EST 2010


On Wed, Nov 17, 2010 at 09:56:25AM -0500, Vadym Chepkov wrote:
> 
> On Nov 17, 2010, at 9:46 AM, Dejan Muhamedagic wrote:
> 
> > On Wed, Nov 17, 2010 at 08:30:36AM -0500, Vadym Chepkov wrote:
> >> On Wed, Nov 17, 2010 at 8:01 AM, Dejan Muhamedagic <dejanmm at fastmail.fm> wrote:
> >> 
> >>>> RA doesn't support restart action? Most likely you get OCF_ERR_UNIMPLEMENTED in the log
> >>> 
> >>> It's actually a resource stop followed by start. It says so in
> >>> the help too. Perhaps the start precludes the stop action. The
> >>> logs should give a hint. We need a sleep in between.
> >>> 
> >> 
> >> In that case the command is not working at all: I have tried it in
> >> the past on many resources and it never worked, so I just assumed it
> >> had to be implemented by the RA.
> > 
> > Funny, it worked here for me every time for apache, Dummy,
> > Delay, stonith resources. With both pacemaker 1.0 and 1.1.
> > 
> >> To test it right now I issued a command
> >> # crm resource restart xen_vbuild
> > 
> > Can you try inserting a sleep to see if that helps? It's in
> > /usr/lib64/python2.6/site-packages/crm/ui.py:
> > 
> >     def restart(self,cmd,rsc):
> >         "usage: restart <rsc>"
> >         if not is_name_sane(rsc):
> >             return False
> >         if not self.stop("stop",rsc):
> >             return False
> >         time.sleep(1)  # pause between the stop and the start
> >         return self.start("start",rsc)
> > 
> > Thanks,
> > 
> > Dejan
> 
> 
> Yep, that did the trick

OK. These nodes are faster than what I have (or the other way
around), i.e. this seems to be a timing issue.
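
A fixed one-second pause is only a guess at how long the stop needs to
register. As an illustration, the same stop, wait, start sequence can wait
for a condition instead of a fixed interval. This is a minimal sketch, not
the crm shell's code: restart_resource and its stop, start and is_stopped
callables are hypothetical names, and how is_stopped is implemented is
left to the caller.

```python
import time


def restart_resource(stop, start, is_stopped,
                     timeout=60.0, interval=1.0,
                     sleep=time.sleep, clock=time.monotonic):
    """Stop a resource, wait until it is reported stopped, then start it.

    stop, start and is_stopped are caller-supplied callables (hypothetical
    here); each returns True on success. Returns False if the stop fails
    or the resource is still running when the timeout expires.
    """
    if not stop():
        return False

    deadline = clock() + timeout
    # Poll instead of sleeping for a fixed time, so that fast and slow
    # nodes behave the same way.
    while not is_stopped():
        if clock() >= deadline:
            return False
        sleep(interval)

    return start()
```

The callables could wrap "crm resource stop <rsc>" and "crm resource start
<rsc>"; the timeout and interval defaults are arbitrary and would need to
match the stop timeout configured for the resource.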

Thanks,

Dejan

> Now I see this:
> 
> Nov 17 14:52:39 xen-11 Xen[14444]: INFO: Xen domain vbuild will be stopped (timeout: 220s)
> Nov 17 14:52:40 xen-11 Xen[14444]: DEBUG: vbuild still not stopped. Waiting...
> Nov 17 14:52:44 xen-11 Xen[14444]: DEBUG: vbuild still not stopped. Waiting...
> Nov 17 14:52:45 xen-11 Xen[14444]: DEBUG: vbuild still not stopped. Waiting...
> Nov 17 14:52:47 xen-11 Xen[14444]: DEBUG: vbuild still not stopped. Waiting...
> Nov 17 14:52:48 xen-11 Xen[14444]: DEBUG: vbuild still not stopped. Waiting...
> Nov 17 14:52:50 xen-11 Xen[14444]: DEBUG: vbuild still not stopped. Waiting...
> Nov 17 14:52:54 xen-11 Xen[14444]: DEBUG: vbuild still not stopped. Waiting...
> Nov 17 14:52:55 xen-11 Xen[14444]: DEBUG: vbuild still not stopped. Waiting...
> Nov 17 14:53:00 xen-11 Xen[14444]: INFO: Xen domain vbuild stopped.
> 
> [root@xen-11 ~]# xm list|grep build
> vbuild                                    18      511     2 -b----     12.0
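
The log above shows the stop now really happening: the domain ID changes
from 3 to 18, and the Xen agent reports the stop from start to finish. As
a small worked example, the time the stop took can be read off the two
INFO lines. This is a sketch under assumptions: the helper names are made
up for illustration, and the year 2010 is supplied by hand because syslog
timestamps do not carry one.

```python
from datetime import datetime

# The two INFO lines from the log above, copied verbatim.
LOG = """\
Nov 17 14:52:39 xen-11 Xen[14444]: INFO: Xen domain vbuild will be stopped (timeout: 220s)
Nov 17 14:53:00 xen-11 Xen[14444]: INFO: Xen domain vbuild stopped.
"""


def stamp(line, year=2010):
    """Parse the leading 'Mon DD HH:MM:SS' syslog timestamp of a line."""
    return datetime.strptime(f"{year} {line[:15]}", "%Y %b %d %H:%M:%S")


def stop_duration(log_text):
    """Return the seconds between the stop request and its completion."""
    begin = end = None
    for line in log_text.splitlines():
        if "will be stopped" in line:
            begin = stamp(line)
        elif line.rstrip().endswith("stopped."):
            end = stamp(line)
    if begin is None or end is None:
        return None
    return (end - begin).total_seconds()


print(stop_duration(LOG))  # prints 21.0
```

The stop took 21 seconds, well within the 220s timeout the agent reports.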
> 
> 
> 
> > 
> >> where xen_vbuild is a Xen VM, and there were no results whatsoever.
> >> 
> >> Here is output of the log
> >> 
> >> Nov 17 13:04:13 xen-11 lrmd: [4295]: debug: rsc:xen_vbuild:101: monitor
> >> Nov 17 13:05:14 xen-11 lrmd: [4295]: debug: rsc:xen_vbuild:101: monitor
> >> Nov 17 13:06:14 xen-11 lrmd: [4295]: debug: rsc:xen_vbuild:101: monitor
> >> Nov 17 13:07:15 xen-11 lrmd: [4295]: debug: rsc:xen_vbuild:101: monitor
> >> Nov 17 13:07:44 xen-11 cib: [4294]: info: log_data_element: cib:diff:
> >> +       <primitive id="xen_vbuild" >
> >> Nov 17 13:07:44 xen-11 cib: [4294]: info: log_data_element: cib:diff:
> >> +         <meta_attributes id="xen_vbuild-meta_attributes"
> >> __crm_diff_marker__="added:top" >
> >> Nov 17 13:07:44 xen-11 cib: [4294]: info: log_data_element: cib:diff:
> >> +           <nvpair id="xen_vbuild-meta_attributes-target-role"
> >> name="target-role" value="Stopped" />
> >> Nov 17 13:07:44 xen-11 cib: [4294]: info: log_data_element: cib:diff:
> >> -       <primitive id="xen_vbuild" >
> >> Nov 17 13:07:44 xen-11 cib: [4294]: info: log_data_element: cib:diff:
> >> -         <meta_attributes id="xen_vbuild-meta_attributes" >
> >> Nov 17 13:07:44 xen-11 cib: [4294]: info: log_data_element: cib:diff:
> >> -           <nvpair value="Stopped"
> >> id="xen_vbuild-meta_attributes-target-role" />
> >> Nov 17 13:07:44 xen-11 pengine: [22958]: debug: unpack_lrm_rsc_state:
> >> xen_vbuild: Overwriting calculated next role Unknown with requested
> >> next role Stopped
> >> Nov 17 13:07:44 xen-11 cib: [4294]: info: log_data_element: cib:diff:
> >> +       <primitive id="xen_vbuild" >
> >> Nov 17 13:07:44 xen-11 cib: [4294]: info: log_data_element: cib:diff:
> >> +         <meta_attributes id="xen_vbuild-meta_attributes" >
> >> Nov 17 13:07:44 xen-11 cib: [4294]: info: log_data_element: cib:diff:
> >> +           <nvpair value="Started"
> >> id="xen_vbuild-meta_attributes-target-role" />
> >> Nov 17 13:07:45 xen-11 pengine: [22958]: debug: unpack_lrm_rsc_state:
> >> xen_vbuild: Overwriting calculated next role Unknown with requested
> >> next role Stopped
> >> Nov 17 13:07:45 xen-11 pengine: [22958]: notice: native_print:
> >> xen_vbuild	(ocf::heartbeat:Xen):	Started xen-11
> >> Nov 17 13:07:45 xen-11 pengine: [22958]: debug: native_assign_node:
> >> All nodes for resource xen_vbuild are unavailable, unclean or shutting
> >> down (xen-11: 1, -1000000)
> >> Nov 17 13:07:45 xen-11 pengine: [22958]: debug: native_assign_node:
> >> Could not allocate a node for xen_vbuild
> >> Nov 17 13:07:45 xen-11 pengine: [22958]: info: native_color: Resource
> >> xen_vbuild cannot run anywhere
> >> Nov 17 13:07:45 xen-11 pengine: [22958]: notice: LogActions: Stop
> >> resource xen_vbuild	(xen-11)
> >> Nov 17 13:07:46 xen-11 pengine: [22958]: notice: native_print:
> >> xen_vbuild	(ocf::heartbeat:Xen):	Started xen-11
> >> Nov 17 13:07:46 xen-11 pengine: [22958]: debug: native_assign_node:
> >> Assigning xen-11 to xen_vbuild
> >> Nov 17 13:07:46 xen-11 pengine: [22958]: notice: LogActions: Leave
> >> resource xen_vbuild	(Started xen-11)
> >> Nov 17 13:08:15 xen-11 lrmd: [4295]: debug: rsc:xen_vbuild:101: monitor
> >> Nov 17 13:09:15 xen-11 lrmd: [4295]: debug: rsc:xen_vbuild:101: monitor
> >> Nov 17 13:10:16 xen-11 lrmd: [4295]: debug: rsc:xen_vbuild:101: monitor
> >> Nov 17 13:11:16 xen-11 lrmd: [4295]: debug: rsc:xen_vbuild:101: monitor
> >> Nov 17 13:12:17 xen-11 lrmd: [4295]: debug: rsc:xen_vbuild:101: monitor
> >> Nov 17 13:13:17 xen-11 lrmd: [4295]: debug: rsc:xen_vbuild:101: monitor
> >> Nov 17 13:14:17 xen-11 lrmd: [4295]: debug: rsc:xen_vbuild:101: monitor
> >> Nov 17 13:15:18 xen-11 lrmd: [4295]: debug: rsc:xen_vbuild:101: monitor
> >> Nov 17 13:16:18 xen-11 lrmd: [4295]: debug: rsc:xen_vbuild:101: monitor
> >> Nov 17 13:17:19 xen-11 lrmd: [4295]: debug: rsc:xen_vbuild:101: monitor
> >> Nov 17 13:17:47 xen-11 pengine: [22958]: notice: native_print:
> >> xen_vbuild	(ocf::heartbeat:Xen):	Started xen-11
> >> Nov 17 13:17:47 xen-11 pengine: [22958]: debug: native_assign_node:
> >> Assigning xen-11 to xen_vbuild
> >> Nov 17 13:17:47 xen-11 pengine: [22958]: notice: LogActions: Leave
> >> resource xen_vbuild	(Started xen-11)
> >> Nov 17 13:18:19 xen-11 lrmd: [4295]: debug: rsc:xen_vbuild:101: monitor
> >> Nov 17 13:19:19 xen-11 lrmd: [4295]: debug: rsc:xen_vbuild:101: monitor
> >> Nov 17 13:20:20 xen-11 lrmd: [4295]: debug: rsc:xen_vbuild:101: monitor
> >> 
> >> but VM never stopped:
> >> 
> >> 
> >> [root@xen-11 ~]# xm list|grep vbuild
> >> vbuild                                     3      511     2 -b----    352.4
> >> 
> >> 
> >> still ID 3 as it was before
> >> 
> >> Vadym
> >> 
> >> _______________________________________________
> >> Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> >> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
> >> 
> >> Project Home: http://www.clusterlabs.org
> >> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> >> Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker
> > 



