[ClusterLabs] op stop timeout update causes monitor op to fail?

Dennis Jacobfeuerborn dennisml at conversis.de
Tue Sep 10 03:54:00 EDT 2019


Hi,
I just updated the timeout for the stop operation on an nfs cluster and
while the timeout was update the status suddenly showed this:

Failed Actions:
* nfsserver_monitor_10000 on nfs1aqs1 'unknown error' (1): call=41,
status=Timed Out, exitreason='none',
    last-rc-change='Tue Aug 13 14:14:28 2019', queued=0ms, exec=0ms

The command used:
pcs resource update nfsserver op stop timeout=30s

I can't imagine that this is expected to happen. Is there another way to
update the timeout that doesn't cause this?

I attached the log of the transition.

Regards,
  Dennis
-------------- next part --------------
Sep 10 09:39:29 [2378] nfs1a-qs1        cib:     info: cib_process_request:	Forwarding cib_replace operation for section configuration to all (origin=local/cibadmin/2)
Sep 10 09:39:29 [2378] nfs1a-qs1        cib:     info: cib_perform_op:	Diff: --- 0.76.14 2
Sep 10 09:39:29 [2378] nfs1a-qs1        cib:     info: cib_perform_op:	Diff: +++ 0.77.0 8b73092b4ee9744fc4eaff60f8ba8388
Sep 10 09:39:29 [2378] nfs1a-qs1        cib:     info: cib_perform_op:	+  /cib:  @epoch=77, @num_updates=0
Sep 10 09:39:29 [2378] nfs1a-qs1        cib:     info: cib_perform_op:	+  /cib/configuration/resources/primitive[@id='nfsserver']/operations/op[@id='nfsserver-stop-interval-0s']:  @timeout=30s
Sep 10 09:39:29 [2378] nfs1a-qs1        cib:     info: cib_perform_op:	++ /cib/configuration/resources/primitive[@id='nfsserver']:  <meta_attributes id="nfsserver-meta_attributes"/>
Sep 10 09:39:29 [2378] nfs1a-qs1        cib:     info: cib_process_request:	Completed cib_replace operation for section configuration: OK (rc=0, origin=nfs1aqs1/cibadmin/2, version=0.77.0)
Sep 10 09:39:29 [2383] nfs1a-qs1       crmd:     info: abort_transition_graph:	Transition aborted by op.nfsserver-stop-interval-0s 'modify': Configuration change | cib=0.77.0 source=te_update_diff:456 path=/cib/configuration/resources/primitive[@id='nfsserver']/operations/op[@id='nfsserver-stop-interval-0s'] complete=true
Sep 10 09:39:29 [2383] nfs1a-qs1       crmd:   notice: do_state_transition:	State transition S_IDLE -> S_POLICY_ENGINE | input=I_PE_CALC cause=C_FSA_INTERNAL origin=abort_transition_graph
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:   notice: unpack_config:	On loss of CCM Quorum: Ignore
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:     info: determine_online_status:	Node nfs1bqs1 is online
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:     info: determine_online_status:	Node nfs1aqs1 is online
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:  warning: unpack_rsc_op_failure:	Processing failed op monitor for nfsserver on nfs1aqs1: unknown error (1)
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:     info: unpack_node_loop:	Node 2 is already processed
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:     info: unpack_node_loop:	Node 1 is already processed
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:     info: unpack_node_loop:	Node 2 is already processed
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:     info: unpack_node_loop:	Node 1 is already processed
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:     info: clone_print:	 Master/Slave Set: drbd-clone [drbd]
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:     info: short_print:	     Masters: [ nfs1aqs1 ]
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:     info: short_print:	     Slaves: [ nfs1bqs1 ]
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:     info: common_print:	metadata-fs	(ocf::heartbeat:Filesystem):	Started nfs1aqs1
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:     info: common_print:	medias-fs	(ocf::heartbeat:Filesystem):	Started nfs1aqs1
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:     info: common_print:	nfsserver	(ocf::heartbeat:nfsserver):	Started nfs1aqs1
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:     info: common_print:	vip	(ocf::heartbeat:IPaddr2):	Started nfs1aqs1
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:     info: get_failcount_full:	nfsserver has failed 1 times on nfs1aqs1
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:     info: check_migration_threshold:	nfsserver can fail 999999 more times on nfs1aqs1 before being forced off
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:     info: master_color:	Promoting drbd:1 (Master nfs1aqs1)
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:     info: master_color:	drbd-clone: Promoted 1 instances of a possible 1 to master
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:     info: LogActions:	Leave   drbd:0	(Slave nfs1bqs1)
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:     info: LogActions:	Leave   drbd:1	(Master nfs1aqs1)
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:     info: LogActions:	Leave   metadata-fs	(Started nfs1aqs1)
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:     info: LogActions:	Leave   medias-fs	(Started nfs1aqs1)
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:     info: LogActions:	Leave   nfsserver	(Started nfs1aqs1)
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:     info: LogActions:	Leave   vip	(Started nfs1aqs1)
Sep 10 09:39:29 [2382] nfs1a-qs1    pengine:   notice: process_pe_message:	Calculated transition 52373, saving inputs in /var/lib/pacemaker/pengine/pe-input-121.bz2
Sep 10 09:39:29 [2383] nfs1a-qs1       crmd:     info: do_state_transition:	State transition S_POLICY_ENGINE -> S_TRANSITION_ENGINE | input=I_PE_SUCCESS cause=C_IPC_MESSAGE origin=handle_response
Sep 10 09:39:29 [2383] nfs1a-qs1       crmd:     info: do_te_invoke:	Processing graph 52373 (ref=pe_calc-dc-1568101169-53552) derived from /var/lib/pacemaker/pengine/pe-input-121.bz2
Sep 10 09:39:29 [2383] nfs1a-qs1       crmd:   notice: run_graph:	Transition 52373 (Complete=0, Pending=0, Fired=0, Skipped=0, Incomplete=0, Source=/var/lib/pacemaker/pengine/pe-input-121.bz2): Complete
Sep 10 09:39:29 [2383] nfs1a-qs1       crmd:     info: do_log:	Input I_TE_SUCCESS received in state S_TRANSITION_ENGINE from notify_crmd
Sep 10 09:39:29 [2383] nfs1a-qs1       crmd:   notice: do_state_transition:	State transition S_TRANSITION_ENGINE -> S_IDLE | input=I_TE_SUCCESS cause=C_FSA_INTERNAL origin=notify_crmd
Sep 10 09:39:29 [2378] nfs1a-qs1        cib:     info: cib_file_backup:	Archived previous version as /var/lib/pacemaker/cib/cib-68.raw
Sep 10 09:39:29 [2378] nfs1a-qs1        cib:     info: cib_file_write_with_digest:	Wrote version 0.77.0 of the CIB to disk (digest: ed0d56723649ac978fa191c204e70c55)
Sep 10 09:39:29 [2378] nfs1a-qs1        cib:     info: cib_file_write_with_digest:	Reading cluster configuration file /var/lib/pacemaker/cib/cib.1Gv7Xi (digest: /var/lib/pacemaker/cib/cib.wbZZfK)
Sep 10 09:39:34 [2378] nfs1a-qs1        cib:     info: cib_process_ping:	Reporting our current digest to nfs1aqs1: 8b73092b4ee9744fc4eaff60f8ba8388 for 0.77.0 (0x5601ffa0e990 0)



More information about the Users mailing list