[ClusterLabs] node name issues (Could not obtain a node name for corosync nodeid 739512332)

Ulrich Windl Ulrich.Windl at rz.uni-regensburg.de
Thu Aug 22 03:07:03 EDT 2019


Hi!

When starting pacemaker (1.1.19+20181105.ccd6b5b10-3.10.1) on a node that had been down for a while, I noticed some unexpected messages about the node name:

pacemakerd:   notice: get_node_name:   Could not obtain a node name for corosync nodeid 739512332
pacemakerd:     info: crm_get_peer:    Created entry a21bf687-045b-4fd7-9340-0562ef595883/0x18752f0 for node (null)/739512332 (1 total)
pacemakerd:     info: crm_get_peer:    Node 739512332 has uuid 739512332

Seems UUID and node ID is mixed up in the message at least...

pacemakerd:     info: crm_update_peer_proc: cluster_connect_cpg: Node (null)[739512332] - corosync-cpg is now online
pacemakerd:   notice: cluster_connect_quorum: Quorum acquired
pacemakerd:     info: corosync_node_name: Unable to get node name for nodeid 739512332
pacemakerd:   notice: get_node_name:   Defaulting to uname -n for the local corosync node name
pacemakerd:     info: crm_get_peer:    Node 739512332 is now known as h12
...
pacemakerd:     info: main:    Starting mainloop
pacemakerd:     info: pcmk_quorum_notification:        Quorum retained | membership=172 members=2
pacemakerd:     info: corosync_node_name:      Unable to get node name for nodeid 739512331
pacemakerd:   notice: get_node_name:   Could not obtain a node name for corosync nodeid 739512331
pacemakerd:     info: crm_get_peer:    Created entry f4ef35e4-1b49-4e48-916b-bb0fab7c52c9/0x1876820 for node (null)/739512331 (2 total)
pacemakerd:     info: crm_get_peer:    Node 739512331 has uuid 739512331
...
pacemakerd:     info: corosync_node_name:      Unable to get node name for nodeid 739512331
...
pacemakerd:   notice: get_node_name:   Could not obtain a node name for corosync nodeid 739512331
pacemakerd:   notice: crm_update_peer_state_iter:      Node (null) state is now member | nodeid=739512331 previous=unknown source=pcmk_quorum_notification
pacemakerd:   notice: crm_update_peer_state_iter:      Node 12 state is now member | nodeid=739512332 previous=unknown source=pcmk_quorum_notification
pacemakerd:     info: pcmk_cpg_membership:     Node 739512332 joined group pacemakerd (counter=0.0, pid=32766, unchecked for rivals)
stonith-ng:     info: corosync_node_name:      Unable to get node name for nodeid 739512332
stonith-ng:   notice: get_node_name:   Could not obtain a node name for corosync nodeid 739512332

What's that? The ID had been resolved before!

stonith-ng:     info: crm_get_peer:    Created entry 155a30a0-ddd3-4b31-9f76-46313ffa9824/0x1bff130 for node (null)/739512332 (1 total)
stonith-ng:     info: crm_get_peer:    Node 739512332 has uuid 739512332
...
stonith-ng:   notice: crm_update_peer_state_iter:      Node (null) state is now member | nodeid=739512332 previous=unknown source=crm_update_peer_proc
...
attrd:   notice: get_node_name:   Could not obtain a node name for corosync nodeid 739512332
attrd:     info: crm_get_peer:    Created entry 961e718f-ad71-479a-ae04-c2ec5ba29858/0x256ca40 for node (null)/739512332 (1 total)
attrd:     info: crm_get_peer:    Node 739512332 has uuid 739512332
attrd:     info: crm_update_peer_proc:    cluster_connect_cpg: Node (null)[739512332] - corosync-cpg is now online
attrd:   notice: crm_update_peer_state_iter:      Node (null) state is now member | nodeid=739512332 previous=unknown source=crm_update_peer_proc
...
pacemakerd:   notice: get_node_name:   Could not obtain a node name for corosync nodeid 739512331
pacemakerd:     info: pcmk_cpg_membership:     Node 739512331 still member of group pacemakerd (peer=(null):7275, counter=0.0, at least once)
stonith-ng:   notice: get_node_name:   Defaulting to uname -n for the local corosync node name
...
pacemakerd:     info: crm_get_peer:    Node 739512331 is now known as h11
...
attrd:     info: corosync_node_name:      Unable to get node name for nodeid 739512332
attrd:   notice: get_node_name:   Defaulting to uname -n for the local corosync node name
attrd:     info: crm_get_peer:    Node 739512332 is now known as h12
stonith-ng:     info: corosync_node_name:      Unable to get node name for nodeid 739512332
stonith-ng:   notice: get_node_name:   Defaulting to uname -n for the local corosync node name
stonith-ng:     info: crm_get_peer:    Node 739512332 is now known as h12
cib:     info: corosync_node_name:      Unable to get node name for nodeid 739512332
cib:   notice: get_node_name:   Could not obtain a node name for corosync nodeid 739512332
cib:     info: crm_get_peer:    Created entry 287bf9d9-b9f7-44d5-997f-89fd3ee038de/0x24d2740 for node (null)/739512332 (1 total)
cib:     info: crm_get_peer:    Node 739512332 has uuid 739512332
cib:     info: crm_update_peer_proc:    cluster_connect_cpg: Node (null)[739512332] - corosync-cpg is now online
cib:   notice: crm_update_peer_state_iter:      Node (null) state is now member | nodeid=739512332 previous=unknown source=crm_update_peer_proc
...

This doesn't look right in my eyes.

cib:     info: cib_init:        Starting cib mainloop
cib:     info: pcmk_cpg_membership:     Node 739512332 joined group cib (counter=0.0, pid=0, unchecked for rivals)
cib:     info: corosync_node_name:      Unable to get node name for nodeid 739512331
cib:   notice: get_node_name:   Could not obtain a node name for corosync nodeid 739512331
cib:     info: crm_get_peer:    Created entry a3a97ea4-27b0-474b-9052-37892bbb3eb2/0x24d3250 for node (null)/739512331 (2 total)
cib:     info: crm_get_peer:    Node 739512331 has uuid 739512331
cib:     info: pcmk_cpg_membership:     Node 739512331 still member of group cib (peer=(null):7276, counter=0.0, at least once)
cib:     info: crm_update_peer_proc:    pcmk_cpg_membership: Node (null)[739512331] - corosync-cpg is now online
cib:   notice: crm_update_peer_state_iter:      Node (null) state is now member | nodeid=739512331 previous=unknown source=crm_update_peer_proc
cib:     info: pcmk_cpg_membership:     Node 739512332 still member of group cib (peer=h12:40550, counter=0.1, at least once)
cib:     info: cib_file_backup: Archived previous version as /var/lib/pacemaker/cib/cib-39.raw
cib:     info: cib_file_write_with_digest:      Wrote version 0.212.0 of the CIB to disk (digest: 8ca1ed7121bc34a2f81c25eb952b843a)
...
crmd:     info: corosync_node_name:      Unable to get node name for nodeid 739512332
crmd:   notice: get_node_name:   Could not obtain a node name for corosync nodeid 739512332
crmd:     info: crm_get_peer:    Created entry 14984fcd-a050-4e09-890e-6eee7be7d459/0x1d3a010 for node (null)/739512332 (1 total)
crmd:     info: crm_get_peer:    Node 739512332 has uuid 739512332
crmd:     info: crm_update_peer_proc:    cluster_connect_cpg: Node (null)[739512332] - corosync-cpg is now online
crmd:     info: init_cs_connection_once: Connection to 'corosync': established
crmd:     info: corosync_node_name:      Unable to get node name for nodeid 739512332
crmd:   notice: get_node_name:   Defaulting to uname -n for the local corosync node name
crmd:     info: crm_get_peer:    Node 739512332 is now known as h12
crmd:     info: peer_update_callback:    Cluster node h12 is now in unknown state
cib:     info: corosync_node_name:      Unable to get node name for nodeid 739512332
cib:   notice: get_node_name:   Defaulting to uname -n for the local corosync node name
cib:     info: crm_get_peer:    Node 739512331 is now known as h11
...
crmd:   notice: cluster_connect_quorum:  Quorum acquired
crmd:     info: do_ha_control:   Connected to the cluster
...
crmd:     info: corosync_node_name:      Unable to get node name for nodeid 739512331
crmd:   notice: get_node_name:   Could not obtain a node name for corosync nodeid 739512331
crmd:     info: crm_get_peer:    Created entry 0a6fdb02-7a25-4c0d-b496-60bb7287168e/0x1e7e500 for node (null)/739512331 (2 total)
crmd:     info: crm_get_peer:    Node 739512331 has uuid 739512331
crmd:     info: corosync_node_name:      Unable to get node name for nodeid 739512331
crmd:     info: pcmk_quorum_notification:        Obtaining name for new node 739512331
crmd:     info: corosync_node_name:      Unable to get node name for nodeid 739512331
crmd:   notice: get_node_name:   Could not obtain a node name for corosync nodeid 739512331
crmd:   notice: crm_update_peer_state_iter:      Node (null) state is now member | nodeid=739512331 previous=unknown source=pcmk_quorum_notification
crmd:   notice: crm_update_peer_state_iter:      Node h12 state is now member | nodeid=739512332 previous=unknown source=pcmk_quorum_notification
crmd:     info: peer_update_callback:    Cluster node h12 is now member (was in unknown state)
crmd:     info: corosync_node_name:      Unable to get node name for nodeid 739512332
crmd:   notice: get_node_name:   Defaulting to uname -n for the local corosync node name
...

???

attrd:     info: corosync_node_name:      Unable to get node name for nodeid 739512332
attrd:   notice: get_node_name:   Defaulting to uname -n for the local corosync node name
attrd:     info: main:    CIB connection active
...
stonith-ng:   notice: get_node_name:   Could not obtain a node name for corosync nodeid 739512331
stonith-ng:     info: crm_get_peer:    Created entry 956e8bf0-5634-4535-aa72-cdd6cf319d5b/0x1d04440 for node (null)/739512331 (2 total)
stonith-ng:     info: crm_get_peer:    Node 739512331 has uuid 739512331
stonith-ng:     info: pcmk_cpg_membership:     Node 739512331 still member of group stonith-ng (peer=(null):7277, counter=0.0, at least once)
stonith-ng:     info: crm_update_peer_proc:    pcmk_cpg_membership: Node (null)[739512331] - corosync-cpg is now online
stonith-ng:   notice: crm_update_peer_state_iter:      Node (null) state is now member | nodeid=739512331 previous=unknown source=crm_update_peer_proc
...
attrd:     info: corosync_node_name:      Unable to get node name for nodeid 739512331
attrd:   notice: get_node_name:   Could not obtain a node name for corosync nodeid 739512331
attrd:     info: crm_get_peer:    Created entry 40380a43-c1e2-498a-bc9e-d68968acf4d6/0x2572850 for node (null)/739512331 (2 total)
attrd:     info: crm_get_peer:    Node 739512331 has uuid 739512331
attrd:     info: pcmk_cpg_membership:     Node 739512331 still member of group attrd (peer=(null):7279, counter=0.0, at least once)
attrd:     info: crm_update_peer_proc:    pcmk_cpg_membership: Node (null)[739512331] - corosync-cpg is now online
attrd:   notice: crm_update_peer_state_iter:      Node (null) state is now member | nodeid=739512331 previous=unknown source=crm_update_peer_proc
attrd:     info: pcmk_cpg_membership:     Node 739512332 still member of group attrd (peer=h12:40553, counter=0.1, at least once)
attrd:     info: crm_get_peer:    Node 739512331 is now known as h11
attrd:   notice: attrd_check_for_new_writer:      Recorded new attribute writer: h11 (was unset)
...
crmd:     info: pcmk_cpg_membership:     Node 739512332 joined group crmd (counter=0.0, pid=0, unchecked for rivals)
crmd:     info: corosync_node_name:      Unable to get node name for nodeid 739512331
crmd:   notice: get_node_name:   Could not obtain a node name for corosync nodeid 739512331
crmd:     info: pcmk_cpg_membership:     Node 739512331 still member of group crmd (peer=(null):7281, counter=0.0, at least once)
crmd:     info: crm_update_peer_proc:    pcmk_cpg_membership: Node (null)[739512331] - corosync-cpg is now online

???

crmd:     info: pcmk_cpg_membership:     Node 739512332 still member of group crmd (peer=h12:40555, counter=0.1, at least once)
crmd:     info: crm_get_peer:    Node 739512331 is now known as h11
crmd:     info: peer_update_callback:    Cluster node h11 is now member
crmd:     info: update_dc:       Set DC to h11 (3.0.14)
crmd:     info: crm_update_peer_expected:        update_dc: Node h11[739512331] - expected state is now member (was (null))
...

I feel this mess with determining the node name is overly complicated...

Regards,
Ulrich





More information about the Users mailing list