[ClusterLabs] Antw: [EXT] Re: maximum token value (knet)
Jan Friesse
jfriesse at redhat.com
Fri Mar 12 03:31:16 EST 2021
Ulrich,
>>>> Jan Friesse <jfriesse at redhat.com> schrieb am 11.03.2021 um 18:12 in Nachricht
> <e6bf446a-186e-053c-8bdc-921c6f7e9feb at redhat.com>:
>> Strahil,
>>> Hello all,
>>> I'm building a test cluster on RHEL8.2 and I have noticed that the cluster
>> fails to assemble ( nodes stay inquorate as if the network is not working) if
>> I set the token at 30000 or more (30s+).
>
>
> Hi!
>
> I know you will be bored when I say this, but anyway:
> In old HP Service Guard the node connectivity was checked with ping/pong too, and you could specify the interval and the number lost responses that declare a node unreachable. The good thing (as
Yes, and this is exactly what
knet_ping_interval/knet_ping_timeout/knet_pong_count is doing :) What I
was describing are default timeouts.
opposed to the TOTEM protocol I know) was that single missed responses
were logged, so you did not
Yup, missed responses are logged.
just have an OK/BAD status, but also an indicator how far you are away
from BAD status.
>
> So you have a token timeout of 30s, and we had 3 lost responses at an interval of 7 seconds (at that time the 100Mb NIC needed about 5 seconds to renegotiate after a link failure (like unplug/replug). Recent hard- and software is somewhat faster AFAIK.
Indeed. But sadly recent hard- and software are quite usually running in
VM on super overprovisioned bare-metal so bad things are happening,
that's why token timeouts are (by default) set to quite big numbers.
Regards,
Honza
>
> Regards,
> Ulrich
>
>>
>> Knet waits for enough pong replies for other nodes before it marks them
>> as alive and starts sending/receiving packets from them. By default it
>> needs to receive 2 pongs and ping is sent 4 times in token timeout so it
>> means 15 sec until node is considered up for 30 sec token timeout.
>>
>>> What is the maximum token value with knet ?On SLES12 (I think it was
>> corosync 1) , I used to set the token/consensus with far greater values on
>> some of our clusters.
>>
>> I'm really not aware about any arbitrary limits.
>>
>>> Best Regards,Strahil Nikolov
>>>
>>
>> Regards,
>> Honza
>>
>>>
>>>
>>> _______________________________________________
>>> Manage your subscription:
>>> https://lists.clusterlabs.org/mailman/listinfo/users
>>>
>>> ClusterLabs home: https://www.clusterlabs.org/
>>>
>>
>> _______________________________________________
>> Manage your subscription:
>> https://lists.clusterlabs.org/mailman/listinfo/users
>>
>> ClusterLabs home: https://www.clusterlabs.org/
>
>
>
>
> _______________________________________________
> Manage your subscription:
> https://lists.clusterlabs.org/mailman/listinfo/users
>
> ClusterLabs home: https://www.clusterlabs.org/
>
More information about the Users
mailing list