[ClusterLabs] Constant stop/start of resource in spite of interval=0
Kadlecsik József
kadlecsik.jozsef at wigner.mta.hu
Sat May 18 11:34:18 EDT 2019
Hello,
We have a resource agent which creates IP tunnels. In spite of the
configuration setting
primitive tunnel-eduroam ocf:local:tunnel \
        params .... \
        op start timeout=120s interval=0 \
        op stop timeout=300s interval=0 \
        op monitor timeout=30s interval=30s depth=0 \
        meta target-role=Started
order bifur-eduroam-ipv4-before-tunnel-eduroam \
        Mandatory: bifur-eduroam-ipv4 tunnel-eduroam
colocation tunnel-eduroam-on-bifur-eduroam-ipv4 inf: tunnel-eduroam \
        bifur-eduroam-ipv4:Started
the resource is restarted again and again. According to the debug logs:
May 16 14:20:35 [3052] bifur1 lrmd: debug: recurring_action_timer:
Scheduling another invocation of tunnel-eduroam_monitor_30000
May 16 14:20:35 [3052] bifur1 lrmd: debug: operation_finished:
tunnel-eduroam_monitor_30000:62066 - exited with rc=0
May 16 14:20:35 [3052] bifur1 lrmd: debug: operation_finished:
tunnel-eduroam_monitor_30000:62066:stderr [ -- empty -- ]
May 16 14:20:35 [3052] bifur1 lrmd: debug: operation_finished:
tunnel-eduroam_monitor_30000:62066:stdout [ -- empty -- ]
May 16 14:20:35 [3052] bifur1 lrmd: debug: log_finished:
finished - rsc:tunnel-eduroam action:monitor call_id:1045 pid:62066
exit-code:0 exec-time:0ms queue-time:0ms
May 16 14:21:04 [3054] bifur1 pengine: info: native_print:
tunnel-eduroam (ocf::local:tunnel): Started bifur1
May 16 14:21:04 [3054] bifur1 pengine: info:
check_action_definition:
Parameters to tunnel-eduroam_start_0 on bifur1 changed: was
94afff0ff7cfc62f7cb1d5bf5b4d83aa vs. now f2317cad3d54cec5d7d7aa7d0bf35cf8
(restart:3.0.11) 0:0;48:3:0:73562fd6-1fe2-4930-8c6e-5953b82ebb32
May 16 14:21:04 [3054] bifur1 pengine: debug: native_assign_node:
Assigning bifur1 to tunnel-eduroam
May 16 14:21:04 [3054] bifur1 pengine: info: RecurringOp:
Start recurring monitor (30s) for tunnel-eduroam on bifur1
May 16 14:21:04 [3054] bifur1 pengine: notice: LogActions: Restart
tunnel-eduroam (Started bifur1)
May 16 14:21:04 [3055] bifur1 crmd: notice: te_rsc_command:
Initiating stop operation tunnel-eduroam_stop_0 locally on bifur1 | action
50
May 16 14:21:04 [3055] bifur1 crmd: debug:
stop_recurring_action_by_rsc: Cancelling op 1045 for tunnel-eduroam
(tunnel-eduroam:1045)
May 16 14:21:04 [3055] bifur1 crmd: debug: cancel_op: Cancelling
op 1045 for tunnel-eduroam (tunnel-eduroam:1045)
May 16 14:21:04 [3052] bifur1 lrmd: info:
cancel_recurring_action: Cancelling ocf operation
tunnel-eduroam_monitor_30000
May 16 14:21:04 [3052] bifur1 lrmd: debug: log_finished:
finished - rsc:tunnel-eduroam action:monitor call_id:1045 exit-code:0
exec-time:0ms queue-time:0ms
May 16 14:21:04 [3055] bifur1 crmd: debug: cancel_op: Op 1045
for tunnel-eduroam (tunnel-eduroam:1045): cancelled
May 16 14:21:04 [3055] bifur1 crmd: info: do_lrm_rsc_op:
Performing key=50:4:0:73562fd6-1fe2-4930-8c6e-5953b82ebb32
op=tunnel-eduroam_stop_0
May 16 14:21:04 [3052] bifur1 lrmd: info: log_execute:
executing - rsc:tunnel-eduroam action:stop call_id:1047
May 16 14:21:04 [3055] bifur1 crmd: info: process_lrm_event:
Result of monitor operation for tunnel-eduroam on bifur1: Cancelled |
call=1045 key=tunnel-eduroam_monitor_30000 confirmed=true
...
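The check_action_definition message above reports a changed parameter digest for the start operation immediately before the restart is scheduled. To see how often this happens over a longer log, those messages can be pulled out with a short script. The following is only a minimal sketch, assuming the log format shown above; the function name find_digest_changes and the sample string are illustrative and not part of Pacemaker.

```python
import re

# Matches the pengine message once the hard-wrapped log text has been
# collapsed onto a single line (format assumed from the excerpt above).
DIGEST_CHANGE = re.compile(
    r"Parameters to (?P<op>\S+) on (?P<node>\S+) changed: "
    r"was (?P<old>[0-9a-f]{32}) vs\. now (?P<new>[0-9a-f]{32})"
)


def find_digest_changes(log_text):
    """Return (operation, node, old_digest, new_digest) tuples found in log_text."""
    # The archived log is wrapped across lines, so normalise all whitespace first.
    flat = " ".join(log_text.split())
    return [m.group("op", "node", "old", "new") for m in DIGEST_CHANGE.finditer(flat)]


# Sample copied from the log excerpt in this message.
sample = """May 16 14:21:04 [3054] bifur1 pengine: info:
check_action_definition:
Parameters to tunnel-eduroam_start_0 on bifur1 changed: was
94afff0ff7cfc62f7cb1d5bf5b4d83aa vs. now f2317cad3d54cec5d7d7aa7d0bf35cf8
(restart:3.0.11) 0:0;48:3:0:73562fd6-1fe2-4930-8c6e-5953b82ebb32"""

for op, node, old, new in find_digest_changes(sample):
    print(f"{op} on {node}: {old} -> {new}")
```

Run against the full log file, a list with one entry per policy engine run would suggest that the digest comparison fails every time, rather than once after a configuration change.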
Where does the restart operation come from, and why does it happen? The IP
address is on the same node where the tunnel resource is already running.
Best regards,
Jozsef
--
E-mail : kadlecsik.jozsef at wigner.mta.hu
PGP key: http://www.kfki.hu/~kadlec/pgp_public_key.txt
Address: Wigner Research Centre for Physics, Hungarian Academy of Sciences
H-1525 Budapest 114, POB. 49, Hungary