[ClusterLabs] Cannot ping a secondary address apart from the server which it is assigned to (on Azure)
Andrei Borzenkov
arvidjaar at gmail.com
Mon Nov 1 01:36:11 EDT 2021
On 01.11.2021 01:56, Paul Warwicker wrote:
> On 28/10/2021 14:30, Andrei Borzenkov wrote:
>> For virtual IP you can (should?) use Azure
>> load balancers - basically, you create a pool of one address, Azure
>> probes each node and detects which node has IP active.
>>
>> See as example this RH documentation:
>>
>> https://access.redhat.com/documentation/en-us/red_hat_enterprise_linux/8/html/deploying_red_hat_enterprise_linux_8_on_public_cloud_platforms/configuring-rhel-high-availability-on-azure_cloud-content#azure-create-internal-load-balancer-in-azure-ha_configuring-rhel-high-availability-on-azure
>>
>>
> I have configured a load balancer as suggested but it is still not
> exposing the floating IP address.
>
> Status looks okay:
>
> [root at haswmfs-vm-lin-000 ~]# pcs status
> Cluster name: haswmfs
> Cluster Summary:
> * Stack: corosync
> * Current DC: haswmfs-vm-lin-001 (version 2.1.0-8.el8-7c3f660707) -
> partition with quorum
> * Last updated: Sun Oct 31 22:52:00 2021
> * Last change: Sun Oct 31 20:38:57 2021 by root via cibadmin on
> haswmfs-vm-lin-000
> * 2 nodes configured
> * 5 resource instances configured
>
> Node List:
> * Online: [ haswmfs-vm-lin-000 haswmfs-vm-lin-001 ]
>
> Full List of Resources:
> * Resource Group: haswmfs-service:
> * haswmfs-ip(ocf::heartbeat:IPaddr2): Started haswmfs-vm-lin-000
> * haswmfs-daemon(lsb:smallworld_GIS): Started haswmfs-vm-lin-000
> * haswmfs-fs(ocf::heartbeat:Filesystem): Started haswmfs-vm-lin-000
> * haswmfs-lb(ocf::heartbeat:azure-lb): Started haswmfs-vm-lin-000
> * haswmfs-fence(stonith:fence_azure_arm): Started haswmfs-vm-lin-001
>
> Daemon Status:
> corosync: active/enabled
> pacemaker: active/enabled
> pcsd: active/enabled
> [root at haswmfs-vm-lin-000 ~]#
>
> Resources created using:
>
> pcs resource create haswmfs-ip ocf:heartbeat:IPaddr2 ip=172.16.31.5
> cidr_netmask=24 nic=eth0 iflabel=haswmfs op monitor interval=30s
> pcs resource create haswmfs-lb ocf:heartbeat:azure-lb port=61000
>
> The IP address 172.16.31.5 is the frontend IP address which is
> dynamically assigned to the load balancer
>
> The load balancer is as follows:
>
> Internal (aka private)
> Basic SKU
> Dynamic IP address assignment and floating IP enable (using the
> frontend IP address)
> Backend pool to which all the nodes in the cluster are allocated
> Health probe added for port 61000
>
> I have also enable full debug logging in the pacemaker log
>
> Oct 31 20:38:54 IPaddr2(haswmfs-ip)[6291]: INFO: Adding inet address
> 172.16.31.5/24 with broadcast address 172.16.31.255 to device eth0 (with
> label eth0:haswmfs)
> Oct 31 20:38:54 IPaddr2(haswmfs-ip)[6291]: INFO: Bringing device
> eth0 up
> Oct 31 20:38:54 IPaddr2(haswmfs-ip)[6291]: INFO:
> /usr/libexec/heartbeat/send_arp -i 200 -r 5 -p
> /run/resource-agents/send_arp-172.16.31.5 eth0 172.16.31.5 auto not_used
> not_used
> ...
> Oct 31 20:38:58 IPaddr2(haswmfs-ip)[6291]: INFO: ARPING 172.16.31.5
> from 172.16.31.5 eth0
>
> Also multiple repetitions of this:
>
> Oct 31 20:39:08 haswmfs-vm-lin-000 pacemaker-execd [5974]
> (recurring_action_timer) debug: Scheduling another invocation of
> haswmfs-lb_monitor_10000
> Oct 31 20:39:08 haswmfs-vm-lin-000 pacemaker-execd [5974]
> (operation_finished) debug: haswmfs-lb_monitor_10000[6878] exited with
> status 0
> Oct 31 20:39:08 haswmfs-vm-lin-000 pacemaker-execd [5974]
> (log_finished) debug: haswmfs-lb monitor (call 24, PID 6878) exited with
> status 0 (execution time 0ms, queue time 0ms)
> Oct 31 20:39:18 haswmfs-vm-lin-000 pacemaker-execd [5974]
> (recurring_action_timer) debug: Scheduling another invocation of
> haswmfs-lb_monitor_10000
> Oct 31 20:39:18 haswmfs-vm-lin-000 pacemaker-execd [5974]
> (operation_finished) debug: haswmfs-lb_monitor_10000[7002] exited with
> status 0
> Oct 31 20:39:18 haswmfs-vm-lin-000 pacemaker-execd [5974]
> (log_finished) debug: haswmfs-lb monitor (call 24, PID 7002) exited with
> status 0 (execution time 0ms, queue time 0ms)
>
> Any further advice?
>
Do you see any traffic to this configured address (tcpdump, wireshark)?
Is firewall active?
More information about the Users
mailing list