[ClusterLabs] Pacemaker fatal shutdown
Priyanka Balotra
priyanka.14balotra at gmail.com
Fri Jul 21 00:36:40 EDT 2023
Hi All,
Any updates on this issue?
Regards
Priyanka
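
For anyone reading the quoted FILE-6 log below, the per-node join phases reported by crmd_join_phase_log (integrated, finalized, confirmed, none) are easier to follow when reduced to their changes. The following is only a minimal sketch, assuming the log is available as plain text in the same form as quoted here (with or without the ">>" prefixes). The function name join_phase_changes and the regular expression are illustrative assumptions, not part of Pacemaker.

```python
import re

# Matches entries such as:
#   (crmd_join_phase_log)  debug: join-1: FILE-6=integrated
# The pattern is an assumption based on the log lines quoted in this thread.
JOIN_PHASE = re.compile(
    r"\(crmd_join_phase_log\)\s+debug:\s+join-(\d+):\s+([\w.-]+)=(\w+)"
)


def join_phase_changes(log_text):
    """Return (latest phase per node, list of phase changes in log order).

    Each change is a tuple (node, previous_phase, new_phase); previous_phase
    is None the first time a node is seen.
    """
    last = {}
    changes = []
    for match in JOIN_PHASE.finditer(log_text):
        _join_id, node, phase = match.groups()
        if last.get(node) != phase:
            changes.append((node, last.get(node), phase))
            last[node] = phase
    return last, changes


if __name__ == "__main__":
    # Entries copied from the quoted log, shortened to the relevant part.
    sample = """\
>> (crmd_join_phase_log)  debug: join-1: FILE-6=integrated
>> (crmd_join_phase_log)  debug: join-1: FILE-2=integrated
>> (crmd_join_phase_log)  debug: join-1: FILE-1=none
>> (crmd_join_phase_log)  debug: join-1: FILE-6=finalized
>> (crmd_join_phase_log)  debug: join-1: FILE-2=confirmed
>> (crmd_join_phase_log)  debug: join-1: FILE-1=none
"""
    latest, changes = join_phase_changes(sample)
    for node, old, new in changes:
        print(f"{node}: {old} -> {new}")
```

Run against the full log, this would show at a glance which nodes never progress past a given phase (for example, a node that stays at "none"), which is a reasonable place to start looking before reading the surrounding FSA debug output.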
On Thu, 20 Jul 2023 at 12:43 PM, Priyanka Balotra <
priyanka.14balotra at gmail.com> wrote:
> What I mainly want to understand is:
> - why the "fatal failure" occurs
> - why Pacemaker does not start on a node when the node boots after a
> "pacemaker fatal failure"
> - how this can be handled
>
> Thanks
> Priyanka
>
> On Thu, Jul 20, 2023 at 12:41 PM Priyanka Balotra <
> priyanka.14balotra at gmail.com> wrote:
>
>> Hi,
>>
>> Here are FILE-6 logs:
>>
>> 65710:Jul 17 14:16:51.517 FILE-6 pacemaker-controld [19415]
>> (throttle_mode) debug: Current load is 0.760000 across 10 core(s)
>> 65711:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (throttle_update) debug: Node FILE-2 has negligible load and supports at
>> most 20 jobs; new job limit 20
>> 65712:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (handle_request) debug: The throttle changed. Trigger a graph.
>> 65713:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (pcmk__set_flags_as) debug: FSA action flags 0x00020000 (new_actions)
>> for controller set by s_crmd_fsa:198
>> 65714:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415] (s_crmd_fsa)
>> debug: Processing I_JOIN_REQUEST: [ state=S_INTEGRATION
>> cause=C_HA_MESSAGE origin=route_message ]
>> 65715:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x00020000 (an_action)
>> for controller cleared by do_fsa_action:108
>> 65716:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (do_dc_join_filter_offer) debug: Accepting join-1 request from FILE-2 |
>> ref=join_request-crmd-1689603392-8
>> 65717:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (pcmk__update_peer_expected) info: do_dc_join_filter_offer: Node
>> FILE-2[2] - expected state is now member (was (null))
>> 65718:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (do_dc_join_filter_offer) debug: 2 nodes currently integrated in join-1
>> 65719:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (check_join_state) debug: join-1: Integration of 2 peers complete |
>> state=S_INTEGRATION for=do_dc_join_filter_offer
>> 65720:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (pcmk__set_flags_as) debug: FSA action flags 0x00040000 (new_actions)
>> for controller set by s_crmd_fsa:198
>> 65721:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415] (s_crmd_fsa)
>> debug: Processing I_INTEGRATED: [ state=S_INTEGRATION
>> cause=C_FSA_INTERNAL origin=check_join_state ]
>> 65722:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (do_state_transition) info: State transition S_INTEGRATION ->
>> S_FINALIZE_JOIN | input=I_INTEGRATED cause=C_FSA_INTERNAL
>> origin=check_join_state
>> 65723:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (pcmk__set_flags_as) debug: FSA action flags 0x00000020
>> (A_INTEGRATE_TIMER_STOP) for controller set by do_state_transition:559
>> 65724:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (pcmk__set_flags_as) debug: FSA action flags 0x00000040
>> (A_FINALIZE_TIMER_START) for controller set by do_state_transition:563
>> 65725:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (pcmk__set_flags_as) debug: FSA action flags 0x00000200
>> (A_DC_TIMER_STOP) for controller set by do_state_transition:569
>> 65726:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (do_state_transition) debug: All cluster nodes (2) responded to join
>> offer
>> 65727:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x00000200 (an_action)
>> for controller cleared by do_fsa_action:108
>> 65728:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x00000020 (an_action)
>> for controller cleared by do_fsa_action:108
>> 65729:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x00000040 (an_action)
>> for controller cleared by do_fsa_action:108
>> 65730:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (controld_start_timer) debug: Started Finalization Timer (inject
>> I_ELECTION if pops after 1800000ms, source=119)
>> 65731:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x00040000 (an_action)
>> for controller cleared by do_fsa_action:108
>> 65732:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (do_dc_join_finalize) debug: Finalizing join-1 for 2 nodes (sync'ing
>> from local CIB)
>> 65733:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (do_dc_join_finalize) debug: Requested CIB version <generation_tuple
>> crm_feature_set="3.11.0" validate-with="pacemaker-3.7" epoch="24"
>> num_updates="72" admin_epoch="0" cib-last-written="Thu Jul 13 13:11:46
>> 2023" update-origin="FILE-1" update-client="cibadmin" update-user="root"
>> have-quorum="1" dc-uuid="6"/>
>> 65734:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (crmd_join_phase_log) debug: join-1: FILE-6=integrated
>> 65735:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (crmd_join_phase_log) debug: join-1: FILE-2=integrated
>> 65736:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (crmd_join_phase_log) debug: join-1: FILE-3=confirmed
>> 65737:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (crmd_join_phase_log) debug: join-1: FILE-1=none
>> 65738:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (crmd_join_phase_log) debug: join-1: FILE-5=confirmed
>> 65739:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (crmd_join_phase_log) debug: join-1: FILE-4=confirmed
>> 65740:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (pcmk__set_flags_as) debug: FSA action flags 0x1000000000000
>> (fsa_data->actions) for controller set by s_crmd_fsa:193
>> 65741:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (pcmk__set_flags_as) debug: FSA action flags 0x1000000000000000
>> (new_actions) for controller set by s_crmd_fsa:198
>> 65742:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415] (s_crmd_fsa)
>> debug: Processing I_WAIT_FOR_EVENT: [ state=S_FINALIZE_JOIN
>> cause=C_HA_MESSAGE origin=do_te_invoke ]
>> 65743:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x1000000000000000
>> (an_action) for controller cleared by do_fsa_action:108
>> 65744:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415] (do_log)
>> info: Input I_WAIT_FOR_EVENT received in state S_FINALIZE_JOIN from
>> do_te_invoke
>> 65745:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415] (do_log)
>> debug: do_log <create_request_adv origin="do_cl_join_query" t="crmd"
>> version="3.11.0" subt="request" reference="join_announce-crmd-1689603376-2"
>> crm_task="join_announce" crm_sys_to="dc" crm_sys_from="crmd" src="FILE-1"/>
>> 65746:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x1000000000000
>> (an_action) for controller cleared by do_fsa_action:108
>> 65747:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (abort_transition_graph) info: Transition 0 aborted: Peer Halt |
>> source=do_te_invoke:135 complete=false
>> 65748:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (register_fsa_input_adv) debug: Stalling the FSA pending further input:
>> source=do_te_invoke cause=C_HA_MESSAGE data=0x55c6194ed4c0 queue=0
>> 65749:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415] (s_crmd_fsa)
>> debug: Exiting the FSA: queue=1, fsa_actions=0x0, stalled=true
>> 65750:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (fsa_dump_queue) debug: queue[0.72]: input I_WAIT_FOR_EVENT raised by
>> do_te_invoke(0x55c619869580.1) (cause=C_HA_MESSAGE)
>> 65751:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (pcmk__set_flags_as) debug: FSA action flags 0x1000000000000
>> (fsa_data->actions) for controller set by s_crmd_fsa:193
>> 65752:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (pcmk__set_flags_as) debug: FSA action flags 0x1000000000000000
>> (new_actions) for controller set by s_crmd_fsa:198
>> 65753:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415] (s_crmd_fsa)
>> debug: Processing I_WAIT_FOR_EVENT: [ state=S_FINALIZE_JOIN
>> cause=C_HA_MESSAGE origin=do_te_invoke ]
>> 65754:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x1000000000000000
>> (an_action) for controller cleared by do_fsa_action:108
>> 65755:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415] (do_log)
>> info: Input I_WAIT_FOR_EVENT received in state S_FINALIZE_JOIN from
>> do_te_invoke
>> 65756:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415] (do_log)
>> debug: do_log <create_request_adv origin="do_cl_join_query" t="crmd"
>> version="3.11.0" subt="request" reference="join_announce-crmd-1689603376-2"
>> crm_task="join_announce" crm_sys_to="dc" crm_sys_from="crmd" src="FILE-1"/>
>> 65757:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x1000000000000
>> (an_action) for controller cleared by do_fsa_action:108
>> 65758:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (abort_transition_graph) info: Transition 0 aborted: Peer Halt |
>> source=do_te_invoke:135 complete=false
>> 65759:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (register_fsa_input_adv) debug: Stalling the FSA pending further input:
>> source=do_te_invoke cause=C_HA_MESSAGE data=0x55c619869580 queue=0
>> 65760:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415] (s_crmd_fsa)
>> debug: Exiting the FSA: queue=1, fsa_actions=0x0, stalled=true
>> 65761:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (fsa_dump_queue) debug: queue[0.73]: input I_WAIT_FOR_EVENT raised by
>> do_te_invoke(0x55c6194ed4c0.1) (cause=C_HA_MESSAGE)
>> 65762:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (pcmk__execute_graph) debug: Transition 0 (Complete=33, Pending=2,
>> Fired=0, Skipped=0, Incomplete=24,
>> Source=/var/lib/pacemaker/pengine/pe-warn-0.bz2): In progress
>> 65764:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (check_join_state) debug: join-1: Still waiting on 2 integrated
>> nodes | state=S_FINALIZE_JOIN for=finalize_sync_callback
>> 65765:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (crmd_join_phase_log) debug: join-1: FILE-6=integrated
>> 65766:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (crmd_join_phase_log) debug: join-1: FILE-2=integrated
>> 65767:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (crmd_join_phase_log) debug: join-1: FILE-3=confirmed
>> 65768:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (crmd_join_phase_log) debug: join-1: FILE-1=none
>> 65769:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (crmd_join_phase_log) debug: join-1: FILE-5=confirmed
>> 65770:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (crmd_join_phase_log) debug: join-1: FILE-4=confirmed
>> 65771:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (finalize_sync_callback) debug: Notifying 2 nodes of join-1 results
>> 65772:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (finalize_join_for) debug: Acknowledging join-1 request from FILE-6
>> 65773:Jul 17 14:16:55.085 FILE-6 pacemaker-controld [19415]
>> (finalize_join_for) debug: Acknowledging join-1 request from FILE-2
>> 65776:Jul 17 14:16:55.093 FILE-6 pacemaker-controld [19415]
>> (handle_request) debug: Raising I_JOIN_RESULT: join-1
>> 65777:Jul 17 14:16:55.093 FILE-6 pacemaker-controld [19415]
>> (pcmk__set_flags_as) debug: FSA action flags 0x1000000000000
>> (fsa_data->actions) for controller set by s_crmd_fsa:193
>> 65778:Jul 17 14:16:55.093 FILE-6 pacemaker-controld [19415]
>> (pcmk__set_flags_as) debug: FSA action flags 0x1000000000000000
>> (new_actions) for controller set by s_crmd_fsa:198
>> 65779:Jul 17 14:16:55.093 FILE-6 pacemaker-controld [19415] (s_crmd_fsa)
>> debug: Processing I_WAIT_FOR_EVENT: [ state=S_FINALIZE_JOIN
>> cause=C_HA_MESSAGE origin=do_te_invoke ]
>> 65780:Jul 17 14:16:55.093 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x1000000000000000
>> (an_action) for controller cleared by do_fsa_action:108
>> 65781:Jul 17 14:16:55.093 FILE-6 pacemaker-controld [19415] (do_log)
>> info: Input I_WAIT_FOR_EVENT received in state S_FINALIZE_JOIN from
>> do_te_invoke
>> 65782:Jul 17 14:16:55.093 FILE-6 pacemaker-controld [19415] (do_log)
>> debug: do_log <create_request_adv origin="do_cl_join_query" t="crmd"
>> version="3.11.0" subt="request" reference="join_announce-crmd-1689603376-2"
>> crm_task="join_announce" crm_sys_to="dc" crm_sys_from="crmd" src="FILE-1"/>
>> 65783:Jul 17 14:16:55.093 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x1000000000000
>> (an_action) for controller cleared by do_fsa_action:108
>> 65784:Jul 17 14:16:55.093 FILE-6 pacemaker-controld [19415]
>> (abort_transition_graph) info: Transition 0 aborted: Peer Halt |
>> source=do_te_invoke:135 complete=false
>> 65785:Jul 17 14:16:55.093 FILE-6 pacemaker-controld [19415]
>> (register_fsa_input_adv) debug: Stalling the FSA pending further input:
>> source=do_te_invoke cause=C_HA_MESSAGE data=0x55c6194ed4c0 queue=1
>> 65786:Jul 17 14:16:55.093 FILE-6 pacemaker-controld [19415] (s_crmd_fsa)
>> debug: Exiting the FSA: queue=2, fsa_actions=0x0, stalled=true
>> 65787:Jul 17 14:16:55.093 FILE-6 pacemaker-controld [19415]
>> (fsa_dump_queue) debug: queue[0.74]: input I_JOIN_RESULT raised by
>> route_message(0x55c619861a90.1) (cause=C_HA_MESSAGE)
>> 65788:Jul 17 14:16:55.093 FILE-6 pacemaker-controld [19415]
>> (fsa_dump_queue) debug: queue[1.75]: input I_WAIT_FOR_EVENT raised by
>> do_te_invoke(0x55c61986ed80.1) (cause=C_HA_MESSAGE)
>> 65789:Jul 17 14:16:55.093 FILE-6 pacemaker-controld [19415]
>> (pcmk__execute_graph) debug: Transition 0 (Complete=33, Pending=2,
>> Fired=0, Skipped=0, Incomplete=24,
>> Source=/var/lib/pacemaker/pengine/pe-warn-0.bz2): In progress
>> 65792:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__set_flags_as) debug: FSA action flags 0x00880000 (new_actions)
>> for controller set by s_crmd_fsa:198
>> 65793:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415] (s_crmd_fsa)
>> debug: Processing I_JOIN_RESULT: [ state=S_FINALIZE_JOIN
>> cause=C_HA_MESSAGE origin=route_message ]
>> 65794:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x00800000 (an_action)
>> for controller cleared by do_fsa_action:108
>> 65795:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__create_history_xml) debug: build_active_RAs: Updating
>> resource stonith-sbd after monitor op complete (interval=0)
>> 65796:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__create_history_xml) debug: build_active_RAs: Updating
>> resource FILE_Filesystem after monitor op complete (interval=0)
>> 65797:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__create_history_xml) debug: build_active_RAs: Updating
>> resource Service_pfile after monitor op complete (interval=0)
>> 65798:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__create_history_xml) debug: build_active_RAs: Updating
>> resource Service_Postgresql after monitor op complete (interval=0)
>> 65799:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__create_history_xml) debug: build_active_RAs: Updating
>> resource Service_esm_primary after monitor op complete (interval=0)
>> 65800:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__create_history_xml) debug: build_active_RAs: Updating
>> resource Service_Postgrest after monitor op complete (interval=0)
>> 65801:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__create_history_xml) debug: build_active_RAs: Updating
>> resource IP_Floating after monitor op complete (interval=0)
>> 65802:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__create_history_xml) debug: build_active_RAs: Updating
>> resource Shared_Cluster_Backup after monitor op complete (interval=0)
>> 65803:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (do_cl_join_finalize_respond) debug: Confirming join-1: sending local
>> operation history to FILE-6
>> 65804:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x00080000 (an_action)
>> for controller cleared by do_fsa_action:108
>> 65805:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (do_dc_join_ack) debug: Ignoring 'join_ack_nack' message from FILE-6
>> while waiting for 'join_confirm'
>> 65806:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__set_flags_as) debug: FSA action flags 0x1000000000000
>> (fsa_data->actions) for controller set by s_crmd_fsa:193
>> 65807:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__set_flags_as) debug: FSA action flags 0x1000000000000000
>> (new_actions) for controller set by s_crmd_fsa:198
>> 65808:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415] (s_crmd_fsa)
>> debug: Processing I_WAIT_FOR_EVENT: [ state=S_FINALIZE_JOIN
>> cause=C_HA_MESSAGE origin=do_te_invoke ]
>> 65809:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x1000000000000000
>> (an_action) for controller cleared by do_fsa_action:108
>> 65810:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415] (do_log)
>> info: Input I_WAIT_FOR_EVENT received in state S_FINALIZE_JOIN from
>> do_te_invoke
>> 65811:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415] (do_log)
>> debug: do_log <create_request_adv origin="do_cl_join_query" t="crmd"
>> version="3.11.0" subt="request" reference="join_announce-crmd-1689603376-2"
>> crm_task="join_announce" crm_sys_to="dc" crm_sys_from="crmd" src="FILE-1"/>
>> 65812:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x1000000000000
>> (an_action) for controller cleared by do_fsa_action:108
>> 65813:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (abort_transition_graph) info: Transition 0 aborted: Peer Halt |
>> source=do_te_invoke:135 complete=false
>> 65814:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (register_fsa_input_adv) debug: Stalling the FSA pending further input:
>> source=do_te_invoke cause=C_HA_MESSAGE data=0x55c61986ed80 queue=1
>> 65815:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415] (s_crmd_fsa)
>> debug: Exiting the FSA: queue=2, fsa_actions=0x0, stalled=true
>> 65816:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (fsa_dump_queue) debug: queue[0.76]: input I_JOIN_RESULT raised by
>> route_message(0x55c619871630.1) (cause=C_HA_MESSAGE)
>> 65817:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (fsa_dump_queue) debug: queue[1.77]: input I_WAIT_FOR_EVENT raised by
>> do_te_invoke(0x55c619861a90.1) (cause=C_HA_MESSAGE)
>> 65818:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__execute_graph) debug: Transition 0 (Complete=33, Pending=2,
>> Fired=0, Skipped=0, Incomplete=24,
>> Source=/var/lib/pacemaker/pengine/pe-warn-0.bz2): In progress
>> 65821:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__set_flags_as) debug: FSA action flags 0x00880000 (new_actions)
>> for controller set by s_crmd_fsa:198
>> 65822:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415] (s_crmd_fsa)
>> debug: Processing I_JOIN_RESULT: [ state=S_FINALIZE_JOIN
>> cause=C_HA_MESSAGE origin=route_message ]
>> 65823:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x00800000 (an_action)
>> for controller cleared by do_fsa_action:108
>> 65824:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x00080000 (an_action)
>> for controller cleared by do_fsa_action:108
>> 65825:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (controld_delete_node_state) info: Deleting resource history for node
>> FILE-2 (via CIB call 71) | xpath=//node_state[@uname='FILE-2']/lrm
>> 65826:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (do_dc_join_ack) debug: Updating node history for FILE-2 from join-1
>> confirmation (via CIB call 72)
>> 65827:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__set_flags_as) debug: FSA action flags 0x1000000000000
>> (fsa_data->actions) for controller set by s_crmd_fsa:193
>> 65828:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__set_flags_as) debug: FSA action flags 0x1000000000000000
>> (new_actions) for controller set by s_crmd_fsa:198
>> 65829:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415] (s_crmd_fsa)
>> debug: Processing I_WAIT_FOR_EVENT: [ state=S_FINALIZE_JOIN
>> cause=C_HA_MESSAGE origin=do_te_invoke ]
>> 65830:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x1000000000000000
>> (an_action) for controller cleared by do_fsa_action:108
>> 65831:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415] (do_log)
>> info: Input I_WAIT_FOR_EVENT received in state S_FINALIZE_JOIN from
>> do_te_invoke
>> 65832:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415] (do_log)
>> debug: do_log <create_request_adv origin="do_cl_join_query" t="crmd"
>> version="3.11.0" subt="request" reference="join_announce-crmd-1689603376-2"
>> crm_task="join_announce" crm_sys_to="dc" crm_sys_from="crmd" src="FILE-1"/>
>> 65833:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x1000000000000
>> (an_action) for controller cleared by do_fsa_action:108
>> 65834:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (abort_transition_graph) info: Transition 0 aborted: Peer Halt |
>> source=do_te_invoke:135 complete=false
>> 65835:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (register_fsa_input_adv) debug: Stalling the FSA pending further input:
>> source=do_te_invoke cause=C_HA_MESSAGE data=0x55c619861a90 queue=1
>> 65836:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415] (s_crmd_fsa)
>> debug: Exiting the FSA: queue=2, fsa_actions=0x0, stalled=true
>> 65837:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (fsa_dump_queue) debug: queue[0.78]: input I_JOIN_RESULT raised by
>> route_message(0x55c6198798d0.1) (cause=C_HA_MESSAGE)
>> 65838:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (fsa_dump_queue) debug: queue[1.79]: input I_WAIT_FOR_EVENT raised by
>> do_te_invoke(0x55c619871630.1) (cause=C_HA_MESSAGE)
>> 65839:Jul 17 14:16:55.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__execute_graph) debug: Transition 0 (Complete=33, Pending=2,
>> Fired=0, Skipped=0, Incomplete=24,
>> Source=/var/lib/pacemaker/pengine/pe-warn-0.bz2): In progress
>> 65851:Jul 17 14:16:55.109 FILE-6 pacemaker-controld [19415]
>> (cib_delete_callback) debug: Deletion of resource history for node
>> FILE-2 (via CIB call 71) succeeded
>> 65861:Jul 17 14:16:55.109 FILE-6 pacemaker-controld [19415]
>> (te_update_diff) debug: Processing (cib_modify) diff: 0.24.72 -> 0.24.73
>> (S_FINALIZE_JOIN)
>> 65862:Jul 17 14:16:55.109 FILE-6 pacemaker-controld [19415]
>> (join_update_complete_callback) debug: join-1 node history update (via
>> CIB call 72) complete
>> 65863:Jul 17 14:16:55.109 FILE-6 pacemaker-controld [19415]
>> (check_join_state) debug: join-1: Still waiting on 1 finalized node
>> | state=S_FINALIZE_JOIN for=join_update_complete_callback
>> 65864:Jul 17 14:16:55.109 FILE-6 pacemaker-controld [19415]
>> (crmd_join_phase_log) debug: join-1: FILE-6=finalized
>> 65865:Jul 17 14:16:55.109 FILE-6 pacemaker-controld [19415]
>> (crmd_join_phase_log) debug: join-1: FILE-2=confirmed
>> 65866:Jul 17 14:16:55.109 FILE-6 pacemaker-controld [19415]
>> (crmd_join_phase_log) debug: join-1: FILE-3=confirmed
>> 65867:Jul 17 14:16:55.109 FILE-6 pacemaker-controld [19415]
>> (crmd_join_phase_log) debug: join-1: FILE-1=none
>> 65868:Jul 17 14:16:55.109 FILE-6 pacemaker-controld [19415]
>> (crmd_join_phase_log) debug: join-1: FILE-5=confirmed
>> 65869:Jul 17 14:16:55.109 FILE-6 pacemaker-controld [19415]
>> (crmd_join_phase_log) debug: join-1: FILE-4=confirmed
>> 65876:Jul 17 14:17:21.517 FILE-6 pacemaker-controld [19415]
>> (throttle_cib_load) debug: cib load: 0.001000 (3 ticks in 30s)
>> 65877:Jul 17 14:17:21.517 FILE-6 pacemaker-controld [19415]
>> (throttle_mode) debug: Current load is 0.960000 across 10 core(s)
>> 65878:Jul 17 14:17:51.517 FILE-6 pacemaker-controld [19415]
>> (throttle_cib_load) debug: cib load: 0.000333 (1 ticks in 30s)
>> 65879:Jul 17 14:17:51.517 FILE-6 pacemaker-controld [19415]
>> (throttle_mode) debug: Current load is 0.580000 across 10 core(s)
>> 65883:Jul 17 14:18:20.085 FILE-6 pacemaker-fenced [19411]
>> (process_remote_stonith_exec) debug: Finalizing action 'reboot'
>> targeting FILE-2 on behalf of pacemaker-controld.19415 at FILE-6: OK | rc=0
>> id=4e523b34
>> 65884:Jul 17 14:18:20.085 FILE-6 pacemaker-fenced [19411]
>> (remote_op_done) notice: Operation 'reboot' targeting FILE-2 by FILE-4
>> for pacemaker-controld.19415 at FILE-6: OK | id=4e523b34
>> 65886:Jul 17 14:18:20.085 FILE-6 pacemaker-controld [19415]
>> (tengine_stonith_callback) notice: Stonith operation
>> 3/63:0:0:232e6505-2e98-4a79-b6ce-5f26d9cba645: OK (0)
>> 65887:Jul 17 14:18:20.085 FILE-6 pacemaker-controld [19415]
>> (tengine_stonith_callback) info: Stonith operation 3 for FILE-2
>> passed
>> 65888:Jul 17 14:18:20.085 FILE-6 pacemaker-controld [19415]
>> (pcmk__update_peer_expected) info: crmd_peer_down: Node FILE-2[2] -
>> expected state is now down (was member)
>> 65889:Jul 17 14:18:20.085 FILE-6 pacemaker-controld [19415]
>> (send_stonith_update) debug: Sending fencing update 73 for FILE-2
>> 65890:Jul 17 14:18:20.085 FILE-6 pacemaker-controld [19415]
>> (controld_delete_node_state) info: Deleting all state for node FILE-2
>> (via CIB call 74) | xpath=//node_state[@uname='FILE-2']/*
>> 65892:Jul 17 14:18:20.089 FILE-6 pacemaker-controld [19415]
>> (exec_alert_list) info: Sending fencing alert via pf-ha-alert to (null)
>> 65896:Jul 17 14:18:20.089 FILE-6 pacemaker-controld [19415]
>> (tengine_stonith_notify) notice: Peer FILE-2 was terminated (reboot) by
>> FILE-4 on behalf of pacemaker-controld.19415: OK | initiator=FILE-6
>> ref=4e523b34-dcb1-40bc-a296-5e984b4e6b00
>> 65897:Jul 17 14:18:20.089 FILE-6 pacemaker-controld [19415]
>> (send_stonith_update) debug: Sending fencing update 75 for FILE-2
>> 65898:Jul 17 14:18:20.089 FILE-6 pacemaker-controld [19415]
>> (controld_delete_node_state) info: Deleting all state for node FILE-2
>> (via CIB call 76) | xpath=//node_state[@uname='FILE-2']/*
>> 65899:Jul 17 14:18:20.089 FILE-6 pacemaker-controld [19415]
>> (pcmk__execute_graph) debug: Transition 0 (Complete=34, Pending=1,
>> Fired=0, Skipped=0, Incomplete=24,
>> Source=/var/lib/pacemaker/pengine/pe-warn-0.bz2): In progress
>> 65907:Jul 17 14:18:20.089 FILE-6 pacemaker-controld [19415]
>> (te_update_diff) debug: Processing (cib_modify) diff: 0.24.73 -> 0.24.74
>> (S_FINALIZE_JOIN)
>> 65908:Jul 17 14:18:20.089 FILE-6 pacemaker-controld [19415]
>> (cib_fencing_updated) info: Fencing update 73 for FILE-2: complete
>> 65916:Jul 17 14:18:20.093 FILE-6 pacemaker-controld [19415]
>> (te_update_diff) debug: Processing (cib_delete) diff: 0.24.74 -> 0.24.75
>> (S_FINALIZE_JOIN)
>> 65919:Jul 17 14:18:20.093 FILE-6 pacemaker-controld [19415]
>> (match_down_event) debug: Shutdown action 63
>> (stonith-FILE-2-reboot) found for node 2
>> 65920:Jul 17 14:18:20.093 FILE-6 pacemaker-controld [19415]
>> (cib_delete_callback) debug: Deletion of all state for node FILE-2
>> (via CIB call 74) succeeded
>> 65921:Jul 17 14:18:20.093 FILE-6 pacemaker-controld [19415]
>> (cib_fencing_updated) info: Fencing update 75 for FILE-2: complete
>> 65924:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (cib_delete_callback) debug: Deletion of all state for node FILE-2
>> (via CIB call 76) succeeded
>> 65927:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415] (node_left)
>> info: Group crmd event 5: FILE-2 (node 2 pid 15962) left for unknown
>> reason
>> 65928:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (crm_update_peer_proc) info: node_left: Node FILE-2[2] - corosync-cpg
>> is now offline
>> 65929:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (peer_update_callback) info: Node FILE-2 is no longer a peer | DC=true
>> old=0x4000000 new=0x0000000
>> 65930:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (controld_delete_node_state) info: Deleting transient attributes for
>> node FILE-2 (via CIB call 77) |
>> xpath=//node_state[@uname='FILE-2']/transient_attributes
>> 65932:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (match_down_event) debug: Shutdown action 63
>> (stonith-FILE-2-reboot) found for node 2
>> 65933:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (pcmk_cpg_membership) info: Group crmd event 5: FILE-3 (node 3 pid
>> 19250) is member
>> 65934:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (pcmk_cpg_membership) info: Group crmd event 5: FILE-4 (node 4 pid
>> 19122) is member
>> 65935:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (pcmk_cpg_membership) info: Group crmd event 5: FILE-5 (node 5 pid
>> 19273) is member
>> 65936:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (pcmk_cpg_membership) info: Group crmd event 5: FILE-6 (node 6 pid
>> 19415) is member
>> 65938:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__set_flags_as) debug: FSA action flags 0x00880000 (new_actions)
>> for controller set by s_crmd_fsa:198
>> 65939:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415] (s_crmd_fsa)
>> debug: Processing I_JOIN_RESULT: [ state=S_FINALIZE_JOIN
>> cause=C_HA_MESSAGE origin=route_message ]
>> 65940:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x00800000 (an_action)
>> for controller cleared by do_fsa_action:108
>> 65941:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x00080000 (an_action)
>> for controller cleared by do_fsa_action:108
>> 65942:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (controld_delete_node_state) info: Deleting resource history for node
>> FILE-6 (via CIB call 79) | xpath=//node_state[@uname='FILE-6']/lrm
>> 65943:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__create_history_xml) debug: build_active_RAs: Updating
>> resource stonith-sbd after monitor op complete (interval=0)
>> 65945:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__create_history_xml) debug: build_active_RAs: Updating
>> resource FILE_Filesystem after monitor op complete (interval=0)
>> 65946:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__create_history_xml) debug: build_active_RAs: Updating
>> resource Service_pfile after monitor op complete (interval=0)
>> 65947:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__create_history_xml) debug: build_active_RAs: Updating
>> resource Service_Postgresql after monitor op complete (interval=0)
>> 65948:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__create_history_xml) debug: build_active_RAs: Updating
>> resource Service_esm_primary after monitor op complete (interval=0)
>> 65949:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__create_history_xml) debug: build_active_RAs: Updating
>> resource Service_Postgrest after monitor op complete (interval=0)
>> 65950:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__create_history_xml) debug: build_active_RAs: Updating
>> resource IP_Floating after monitor op complete (interval=0)
>> 65951:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__create_history_xml) debug: build_active_RAs: Updating
>> resource Shared_Cluster_Backup after monitor op complete (interval=0)
>> 65952:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (do_dc_join_ack) debug: Updating local node history for join-1 from query
>> result (via CIB call 80)
>> 65954:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__set_flags_as) debug: FSA action flags 0x1000000000000
>> (fsa_data->actions) for controller set by s_crmd_fsa:193
>> 65955:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__set_flags_as) debug: FSA action flags 0x1000000000000000
>> (new_actions) for controller set by s_crmd_fsa:198
>> 65956:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415] (s_crmd_fsa)
>> debug: Processing I_WAIT_FOR_EVENT: [ state=S_FINALIZE_JOIN
>> cause=C_HA_MESSAGE origin=do_te_invoke ]
>> 65957:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x1000000000000000
>> (an_action) for controller cleared by do_fsa_action:108
>> 65958:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415] (do_log)
>> info: Input I_WAIT_FOR_EVENT received in state S_FINALIZE_JOIN from
>> do_te_invoke
>> 65959:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415] (do_log)
>> debug: do_log <create_request_adv origin="do_cl_join_query" t="crmd"
>> version="3.11.0" subt="request" reference="join_announce-crmd-1689603376-2"
>> crm_task="join_announce" crm_sys_to="dc" crm_sys_from="crmd" src="FILE-1"/>
>> 65960:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x1000000000000
>> (an_action) for controller cleared by do_fsa_action:108
>> 65961:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (abort_transition_graph) info: Transition 0 aborted: Peer Halt |
>> source=do_te_invoke:135 complete=false
>> 65962:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (register_fsa_input_adv) debug: Stalling the FSA pending further input:
>> source=do_te_invoke cause=C_HA_MESSAGE data=0x55c619871630 queue=0
>> 65963:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415] (s_crmd_fsa)
>> debug: Exiting the FSA: queue=1, fsa_actions=0x0, stalled=true
>> 65964:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (fsa_dump_queue) debug: queue[0.80]: input I_WAIT_FOR_EVENT raised by
>> do_te_invoke(0x55c6198798d0.1) (cause=C_HA_MESSAGE)
>> 65966:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__execute_graph) debug: Transition 0 (Complete=34, Pending=1,
>> Fired=0, Skipped=0, Incomplete=24,
>> Source=/var/lib/pacemaker/pengine/pe-warn-0.bz2): In progress
>> 65967:Jul 17 14:18:20.097 FILE-6 pacemaker-fenced [19411]
>> (process_remote_stonith_exec) debug: Finalizing action 'reboot'
>> targeting FILE-1 on behalf of pacemaker-controld.19415@FILE-6: OK | rc=0
>> id=446afc42
>> 65968:Jul 17 14:18:20.097 FILE-6 pacemaker-fenced [19411]
>> (remote_op_done) notice: Operation 'reboot' targeting FILE-1 by FILE-5
>> for pacemaker-controld.19415@FILE-6: OK | id=446afc42
>> 65970:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (tengine_stonith_callback) notice: Stonith operation
>> 4/62:0:0:232e6505-2e98-4a79-b6ce-5f26d9cba645: OK (0)
>> 65971:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (tengine_stonith_callback) info: Stonith operation 4 for FILE-1
>> passed
>> 65972:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (pcmk__update_peer_expected) info: crmd_peer_down: Node FILE-1[1] -
>> expected state is now down (was pending)
>> 65973:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (send_stonith_update) debug: Sending fencing update 81 for FILE-1
>> 65974:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (controld_delete_node_state) info: Deleting all state for node FILE-1
>> (via CIB call 82) | xpath=//node_state[@uname='FILE-1']/*
>> 65975:Jul 17 14:18:20.097 FILE-6 pacemaker-controld [19415]
>> (exec_alert_list) info: Sending fencing alert via pf-ha-alert to (null)
>> 65979:Jul 17 14:18:20.101 FILE-6 pacemaker-controld [19415]
>> (tengine_stonith_notify) notice: Peer FILE-1 was terminated (reboot) by
>> FILE-5 on behalf of pacemaker-controld.19415: OK | initiator=FILE-6
>> ref=446afc42-b46e-47af-9fac-0fa87c1c5e57
>> 65980:Jul 17 14:18:20.101 FILE-6 pacemaker-controld [19415]
>> (send_stonith_update) debug: Sending fencing update 83 for FILE-1
>> 65982:Jul 17 14:18:20.101 FILE-6 pacemaker-controld [19415]
>> (controld_delete_node_state) info: Deleting all state for node FILE-1
>> (via CIB call 84) | xpath=//node_state[@uname='FILE-1']/*
>> 65983:Jul 17 14:18:20.101 FILE-6 pacemaker-controld [19415]
>> (cib_delete_callback) debug: Deletion of transient attributes for node
>> FILE-2 (via CIB call 77) succeeded
>> 65984:Jul 17 14:18:20.101 FILE-6 pacemaker-controld [19415]
>> (pcmk__execute_graph) notice: Transition 0 (Complete=35, Pending=0,
>> Fired=0, Skipped=3, Incomplete=24,
>> Source=/var/lib/pacemaker/pengine/pe-warn-0.bz2): Stopped
>> 65985:Jul 17 14:18:20.101 FILE-6 pacemaker-controld [19415]
>> (te_graph_trigger) debug: Transition 0 is now complete
>> 65986:Jul 17 14:18:20.101 FILE-6 pacemaker-controld [19415]
>> (notify_crmd) debug: Processing transition completion in state
>> S_FINALIZE_JOIN
>> 65987:Jul 17 14:18:20.101 FILE-6 pacemaker-controld [19415]
>> (notify_crmd) debug: Transition 0 status: restart - Node join
>> 65988:Jul 17 14:18:20.101 FILE-6 pacemaker-controld [19415]
>> (pcmk__set_flags_as) debug: FSA action flags 0x1000000000000
>> (fsa_data->actions) for controller set by s_crmd_fsa:193
>> 65989:Jul 17 14:18:20.101 FILE-6 pacemaker-controld [19415]
>> (pcmk__set_flags_as) debug: FSA action flags 0x1000000000000000
>> (new_actions) for controller set by s_crmd_fsa:198
>> 65990:Jul 17 14:18:20.101 FILE-6 pacemaker-controld [19415] (s_crmd_fsa)
>> debug: Processing I_WAIT_FOR_EVENT: [ state=S_FINALIZE_JOIN
>> cause=C_HA_MESSAGE origin=do_te_invoke ]
>> 65991:Jul 17 14:18:20.101 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x1000000000000000
>> (an_action) for controller cleared by do_fsa_action:108
>> 65992:Jul 17 14:18:20.101 FILE-6 pacemaker-controld [19415] (do_log)
>> info: Input I_WAIT_FOR_EVENT received in state S_FINALIZE_JOIN from
>> do_te_invoke
>> 65993:Jul 17 14:18:20.101 FILE-6 pacemaker-controld [19415] (do_log)
>> debug: do_log <create_request_adv origin="do_cl_join_query" t="crmd"
>> version="3.11.0" subt="request" reference="join_announce-crmd-1689603376-2"
>> crm_task="join_announce" crm_sys_to="dc" crm_sys_from="crmd" src="FILE-1"/>
>> 65994:Jul 17 14:18:20.101 FILE-6 pacemaker-controld [19415]
>> (pcmk__clear_flags_as) debug: FSA action flags 0x1000000000000
>> (an_action) for controller cleared by do_fsa_action:108
>> 65995:Jul 17 14:18:20.101 FILE-6 pacemaker-controld [19415]
>> (abort_transition_graph) info: Transition 0 aborted: Peer Halt |
>> source=do_te_invoke:135 complete=true
>> 65996:Jul 17 14:18:20.101 FILE-6 pacemaker-controld [19415] (s_crmd_fsa)
>> debug: Processing I_PE_CALC: [ state=S_FINALIZE_JOIN
>> cause=C_FSA_INTERNAL origin=abort_transition_graph ]
>> 66024:Jul 17 14:18:20.101 FILE-6 pacemaker-controld [19415]
>> (cib_delete_callback) debug: Deletion of resource history for node
>> FILE-6 (via CIB call 79) succeeded
>> 66063:Jul 17 14:18:20.105 FILE-6 pacemaker-controld [19415]
>> (join_update_complete_callback) debug: join-1 node history update (via
>> CIB call 80) complete
>> 66064:Jul 17 14:18:20.105 FILE-6 pacemaker-controld [19415]
>> (check_join_state) debug: join-1: Complete | state=S_FINALIZE_JOIN
>> for=join_update_complete_callback
>> 66068:Jul 17 14:18:20.105 FILE-6 pacemaker-controld [19415]
>> (pcmk__set_flags_as) debug: FSA action flags 0x800400000000
>> (new_actions) for controller set by s_crmd_fsa:198
>>
>> Thanks
>> Priyanka
>>
>> On Thu, Jul 20, 2023 at 11:53 AM Reid Wahl <nwahl at redhat.com> wrote:
>>
>>> On Wed, Jul 19, 2023 at 8:33 PM Priyanka Balotra
>>> <priyanka.14balotra at gmail.com> wrote:
>>> >
>>> > Sure,
>>> > Here are the logs:
>>> >
>>> >
>>> > 63138:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (post_cache_update) debug: Updated cache after membership event 44.
>>> > 63139:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (pcmk__set_flags_as) debug: FSA action flags 0x200000000
>>> (A_ELECTION_CHECK) for controller set by post_cache_update:81
>>> > 63140:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (pcmk__clear_flags_as) debug: FSA action flags 0x00000002 (an_action)
>>> for controller cleared by do_fsa_action:108
>>> > 63141:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (do_started) info: Delaying start, Config not read (0000000000000040)
>>> > 63142:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (register_fsa_input_adv) debug: Stalling the FSA pending further input:
>>> source=do_started cause=C_FSA_INTERNAL data=(nil) queue=0
>>> > 63143:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (pcmk__set_flags_as) debug: FSA action flags 0x00000002
>>> (with_actions) for controller set by register_fsa_input_adv:88
>>> > 63144:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (s_crmd_fsa) debug: Exiting the FSA: queue=0,
>>> fsa_actions=0x200000002, stalled=true
>>> > 63145:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (config_query_callback) debug: Call 3 : Parsing CIB options
>>> > 63146:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (config_query_callback) debug: Shutdown escalation occurs if DC has not
>>> responded to request in 1200000ms
>>> > 63147:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (config_query_callback) debug: Re-run scheduler after 900000ms of
>>> inactivity
>>> > 63148:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (pe_unpack_alerts) debug: Alert pf-ha-alert:
>>> path=/usr/lib/ocf/resource.d/pacemaker/pf_ha_alert.sh timeout=30000ms
>>> tstamp-format='%H:%M:%S.%06N' 0 vars
>>> > 63149:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (pcmk__clear_flags_as) debug: FSA action flags 0x00000002 (an_action)
>>> for controller cleared by do_fsa_action:108
>>> > 63150:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (do_started) debug: Init server comms
>>> > 63151:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (qb_ipcs_us_publish) info: server name: crmd
>>> > 63152:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (do_started) notice: Pacemaker controller successfully started and
>>> accepting connections
>>> > 63153:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (pcmk__clear_flags_as) debug: FSA action flags 0x200000000 (an_action)
>>> for controller cleared by do_fsa_action:108
>>> > 63154:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (do_election_check) debug: Ignoring election check because we are
>>> not in an election
>>> > 63155:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (pcmk__set_flags_as) debug: FSA action flags 0x1000000000100100
>>> (new_actions) for controller set by s_crmd_fsa:198
>>> > 63156:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (s_crmd_fsa) debug: Processing I_PENDING: [ state=S_STARTING
>>> cause=C_FSA_INTERNAL origin=do_started ]
>>> > 63157:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (pcmk__clear_flags_as) debug: FSA action flags 0x1000000000000000
>>> (an_action) for controller cleared by do_fsa_action:108
>>> > 63158:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962] (do_log)
>>> info: Input I_PENDING received in state S_STARTING from do_started
>>> > 63159:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (do_state_transition) notice: State transition S_STARTING -> S_PENDING
>>> | input=I_PENDING cause=C_FSA_INTERNAL origin=do_started
>>> > 63160:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (pcmk__set_flags_as) debug: FSA action flags 0x00000020
>>> (A_INTEGRATE_TIMER_STOP) for controller set by do_state_transition:559
>>> > 63161:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (pcmk__set_flags_as) debug: FSA action flags 0x00000080
>>> (A_FINALIZE_TIMER_STOP) for controller set by do_state_transition:565
>>> > 63162:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (pcmk__clear_flags_as) debug: FSA action flags 0x00000020 (an_action)
>>> for controller cleared by do_fsa_action:108
>>> > 63163:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (pcmk__clear_flags_as) debug: FSA action flags 0x00000080 (an_action)
>>> for controller cleared by do_fsa_action:108
>>> > 63164:Jul 17 14:16:25.132 FILE-2 pacemaker-controld [15962]
>>> (pcmk__clear_flags_as) debug: FSA action flags 0x00100000 (an_action)
>>> for controller cleared by do_fsa_action:108
>>> > 63165:Jul 17 14:16:26.132 FILE-2 pacemaker-controld [15962]
>>> (do_cl_join_query) debug: Querying for a DC
>>> > 63166:Jul 17 14:16:26.132 FILE-2 pacemaker-controld [15962]
>>> (pcmk__clear_flags_as) debug: FSA action flags 0x00000100 (an_action)
>>> for controller cleared by do_fsa_action:108
>>> > 63167:Jul 17 14:16:26.132 FILE-2 pacemaker-controld [15962]
>>> (controld_start_timer) debug: Started Election Trigger (inject
>>> I_DC_TIMEOUT if pops after 20000ms, source=18)
>>> > 63168:Jul 17 14:16:26.132 FILE-2 pacemaker-controld [15962]
>>> (stonith_api_signon) debug: Attempting fencer connection by
>>> pacemaker-controld with mainloop
>>> > 63175:Jul 17 14:16:26.132 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:131085; real_size:135168;
>>> rb->word_size:33792
>>> > 63176:Jul 17 14:16:26.132 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:131085; real_size:135168;
>>> rb->word_size:33792
>>> > 63177:Jul 17 14:16:26.132 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:131085; real_size:135168;
>>> rb->word_size:33792
>>> > 63178:Jul 17 14:16:26.132 FILE-2 pacemaker-fenced [15958]
>>> (stonith_command) debug: Processing register 8 from client
>>> pacemaker-controld.15962 with call options 0x00000000
>>> > 63179:Jul 17 14:16:26.132 FILE-2 pacemaker-fenced [15958]
>>> (stonith_command) debug: Processed register from client
>>> pacemaker-controld.15962: OK (rc=0)
>>> > 63180:Jul 17 14:16:26.132 FILE-2 pacemaker-controld [15962]
>>> (stonith_api_signon) debug: Connection to fencer by
>>> pacemaker-controld succeeded (registration token:
>>> 5552b1b4-f725-46ac-b239-e404cadd8d94)
>>> > 63181:Jul 17 14:16:26.132 FILE-2 pacemaker-fenced [15958]
>>> (stonith_command) debug: Processing st_notify 9 from client
>>> pacemaker-controld.15962 with call options 0x00000000
>>> > 63182:Jul 17 14:16:26.132 FILE-2 pacemaker-fenced [15958]
>>> (handle_request) debug: Enabling st_notify_disconnect callbacks for
>>> client pacemaker-controld.15962
>>> > 63183:Jul 17 14:16:26.132 FILE-2 pacemaker-fenced [15958]
>>> (stonith_command) debug: Processed st_notify from client
>>> pacemaker-controld.15962: OK (rc=0)
>>> > 63184:Jul 17 14:16:26.132 FILE-2 pacemaker-fenced [15958]
>>> (stonith_command) debug: Processing st_notify 10 from client
>>> pacemaker-controld.15962 with call options 0x00000000
>>> > 63185:Jul 17 14:16:26.132 FILE-2 pacemaker-fenced [15958]
>>> (handle_request) debug: Enabling st_notify_fence callbacks for client
>>> pacemaker-controld.15962
>>> > 63186:Jul 17 14:16:26.132 FILE-2 pacemaker-fenced [15958]
>>> (stonith_command) debug: Processed st_notify from client
>>> pacemaker-controld.15962: OK (rc=0)
>>> > 63187:Jul 17 14:16:26.132 FILE-2 pacemaker-fenced [15958]
>>> (stonith_command) debug: Processing st_notify 11 from client
>>> pacemaker-controld.15962 with call options 0x00000000
>>> > 63188:Jul 17 14:16:26.132 FILE-2 pacemaker-fenced [15958]
>>> (handle_request) debug: Enabling st_notify_history_synced callbacks for
>>> client pacemaker-controld.15962
>>> > 63189:Jul 17 14:16:26.132 FILE-2 pacemaker-fenced [15958]
>>> (stonith_command) debug: Processed st_notify from client
>>> pacemaker-controld.15962: OK (rc=0)
>>> > 63190:Jul 17 14:16:26.132 FILE-2 pacemaker-controld [15962]
>>> (te_trigger_stonith_history_sync) info: Fence history will be synchronized
>>> cluster-wide within 30 seconds
>>> > 63191:Jul 17 14:16:26.132 FILE-2 pacemaker-controld [15962]
>>> (te_connect_stonith) notice: Fencer successfully connected
>>> > 63192:Jul 17 14:16:32.664 FILE-2 pacemaker-controld [15962]
>>> (quorum_notification_cb) info: Quorum retained | membership=48 members=5
>>> > 63193:Jul 17 14:16:32.664 FILE-2 pacemaker-controld [15962]
>>> (quorum_notification_cb) debug: Member[0] 2
>>> > 63194:Jul 17 14:16:32.664 FILE-2 pacemaker-controld [15962]
>>> (quorum_notification_cb) debug: Member[1] 4
>>> > 63195:Jul 17 14:16:32.668 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63196:Jul 17 14:16:32.668 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63197:Jul 17 14:16:32.668 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63198:Jul 17 14:16:32.668 FILE-2 pacemaker-controld [15962]
>>> (qb_ipcc_disconnect) debug: qb_ipcc_disconnect()
>>> > 63199:Jul 17 14:16:32.668 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-e4qK7U/qb-request-cmap-header
>>> > 63200:Jul 17 14:16:32.668 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-e4qK7U/qb-response-cmap-header
>>> > 63201:Jul 17 14:16:32.668 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-e4qK7U/qb-event-cmap-header
>>> > 63202:Jul 17 14:16:32.668 FILE-2 pacemaker-controld [15962]
>>> (pcmk__corosync_name) info: Unable to get node name for nodeid 4
>>> > 63203:Jul 17 14:16:32.668 FILE-2 pacemaker-controld [15962]
>>> (get_node_name) notice: Could not obtain a node name for corosync node
>>> with id 4
>>> > 63204:Jul 17 14:16:32.672 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63205:Jul 17 14:16:32.672 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63206:Jul 17 14:16:32.672 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63209:Jul 17 14:16:32.676 FILE-2 pacemaker-controld [15962]
>>> (qb_ipcc_disconnect) debug: qb_ipcc_disconnect()
>>> > 63210:Jul 17 14:16:32.676 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-YYxILU/qb-request-cmap-header
>>> > 63211:Jul 17 14:16:32.676 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-YYxILU/qb-response-cmap-header
>>> > 63212:Jul 17 14:16:32.676 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-YYxILU/qb-event-cmap-header
>>> > 63213:Jul 17 14:16:32.676 FILE-2 pacemaker-controld [15962]
>>> (pcmk__corosync_name) info: Unable to get node name for nodeid 4
>>> > 63214:Jul 17 14:16:32.676 FILE-2 pacemaker-controld [15962]
>>> (quorum_notification_cb) info: Obtaining name for new node 4
>>> > 63218:Jul 17 14:16:32.684 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63222:Jul 17 14:16:32.684 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63225:Jul 17 14:16:32.684 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63240:Jul 17 14:16:32.688 FILE-2 pacemaker-controld [15962]
>>> (qb_ipcc_disconnect) debug: qb_ipcc_disconnect()
>>> > 63241:Jul 17 14:16:32.688 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-Cy8QVV/qb-request-cmap-header
>>> > 63242:Jul 17 14:16:32.688 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-Cy8QVV/qb-response-cmap-header
>>> > 63243:Jul 17 14:16:32.688 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-Cy8QVV/qb-event-cmap-header
>>> > 63244:Jul 17 14:16:32.688 FILE-2 pacemaker-controld [15962]
>>> (pcmk__corosync_name) info: Unable to get node name for nodeid 4
>>> > 63245:Jul 17 14:16:32.688 FILE-2 pacemaker-controld [15962]
>>> (get_node_name) notice: Could not obtain a node name for corosync node
>>> with id 4
>>> > 63246:Jul 17 14:16:32.688 FILE-2 pacemaker-controld [15962]
>>> (quorum_notification_cb) debug: Member[2] 3
>>> > 63259:Jul 17 14:16:32.700 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63265:Jul 17 14:16:32.700 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63267:Jul 17 14:16:32.700 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63298:Jul 17 14:16:32.712 FILE-2 pacemaker-controld [15962]
>>> (qb_ipcc_disconnect) debug: qb_ipcc_disconnect()
>>> > 63299:Jul 17 14:16:32.712 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-34-0DHKhX/qb-request-cmap-header
>>> > 63300:Jul 17 14:16:32.712 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-34-0DHKhX/qb-response-cmap-header
>>> > 63301:Jul 17 14:16:32.712 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-34-0DHKhX/qb-event-cmap-header
>>> > 63302:Jul 17 14:16:32.712 FILE-2 pacemaker-controld [15962]
>>> (pcmk__corosync_name) info: Unable to get node name for nodeid 3
>>> > 63303:Jul 17 14:16:32.712 FILE-2 pacemaker-controld [15962]
>>> (get_node_name) notice: Could not obtain a node name for corosync node
>>> with id 3
>>> > 63307:Jul 17 14:16:32.720 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63313:Jul 17 14:16:32.720 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63320:Jul 17 14:16:32.720 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63351:Jul 17 14:16:32.728 FILE-2 pacemaker-controld [15962]
>>> (qb_ipcc_disconnect) debug: qb_ipcc_disconnect()
>>> > 63352:Jul 17 14:16:32.728 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-34-V0bQlV/qb-request-cmap-header
>>> > 63353:Jul 17 14:16:32.728 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-34-V0bQlV/qb-response-cmap-header
>>> > 63355:Jul 17 14:16:32.728 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-34-V0bQlV/qb-event-cmap-header
>>> > 63356:Jul 17 14:16:32.728 FILE-2 pacemaker-controld [15962]
>>> (pcmk__corosync_name) info: Unable to get node name for nodeid 3
>>> > 63357:Jul 17 14:16:32.728 FILE-2 pacemaker-controld [15962]
>>> (quorum_notification_cb) info: Obtaining name for new node 3
>>> > 63365:Jul 17 14:16:32.736 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63372:Jul 17 14:16:32.736 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63374:Jul 17 14:16:32.736 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63415:Jul 17 14:16:32.748 FILE-2 pacemaker-controld [15962]
>>> (qb_ipcc_disconnect) debug: qb_ipcc_disconnect()
>>> > 63416:Jul 17 14:16:32.748 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-34-EAFzTX/qb-request-cmap-header
>>> > 63417:Jul 17 14:16:32.748 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-34-EAFzTX/qb-response-cmap-header
>>> > 63418:Jul 17 14:16:32.748 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-34-EAFzTX/qb-event-cmap-header
>>> > 63419:Jul 17 14:16:32.748 FILE-2 pacemaker-controld [15962]
>>> (pcmk__corosync_name) info: Unable to get node name for nodeid 3
>>> > 63420:Jul 17 14:16:32.748 FILE-2 pacemaker-controld [15962]
>>> (get_node_name) notice: Could not obtain a node name for corosync node
>>> with id 3
>>> > 63421:Jul 17 14:16:32.748 FILE-2 pacemaker-controld [15962]
>>> (quorum_notification_cb) debug: Member[3] 6
>>> > 63425:Jul 17 14:16:32.752 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63426:Jul 17 14:16:32.752 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63427:Jul 17 14:16:32.752 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63479:Jul 17 14:16:32.756 FILE-2 pacemaker-controld [15962]
>>> (qb_ipcc_disconnect) debug: qb_ipcc_disconnect()
>>> > 63480:Jul 17 14:16:32.756 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-33-q3mFYU/qb-request-cmap-header
>>> > 63481:Jul 17 14:16:32.756 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-33-q3mFYU/qb-response-cmap-header
>>> > 63482:Jul 17 14:16:32.756 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-33-q3mFYU/qb-event-cmap-header
>>> > 63483:Jul 17 14:16:32.756 FILE-2 pacemaker-controld [15962]
>>> (pcmk__corosync_name) info: Unable to get node name for nodeid 6
>>> > 63484:Jul 17 14:16:32.756 FILE-2 pacemaker-controld [15962]
>>> (get_node_name) notice: Could not obtain a node name for corosync node
>>> with id 6
>>> > 63485:Jul 17 14:16:32.760 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63486:Jul 17 14:16:32.760 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63487:Jul 17 14:16:32.760 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63490:Jul 17 14:16:32.760 FILE-2 pacemaker-controld [15962]
>>> (qb_ipcc_disconnect) debug: qb_ipcc_disconnect()
>>> > 63491:Jul 17 14:16:32.760 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-EcEbfV/qb-request-cmap-header
>>> > 63492:Jul 17 14:16:32.760 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-EcEbfV/qb-response-cmap-header
>>> > 63493:Jul 17 14:16:32.760 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-EcEbfV/qb-event-cmap-header
>>> > 63494:Jul 17 14:16:32.760 FILE-2 pacemaker-controld [15962]
>>> (pcmk__corosync_name) info: Unable to get node name for nodeid 6
>>> > 63495:Jul 17 14:16:32.760 FILE-2 pacemaker-controld [15962]
>>> (quorum_notification_cb) info: Obtaining name for new node 6
>>> > 63499:Jul 17 14:16:32.764 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63502:Jul 17 14:16:32.764 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63505:Jul 17 14:16:32.764 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63508:Jul 17 14:16:32.764 FILE-2 pacemaker-controld [15962]
>>> (qb_ipcc_disconnect) debug: qb_ipcc_disconnect()
>>> > 63509:Jul 17 14:16:32.764 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-fLk4xW/qb-request-cmap-header
>>> > 63510:Jul 17 14:16:32.764 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-fLk4xW/qb-response-cmap-header
>>> > 63511:Jul 17 14:16:32.764 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-fLk4xW/qb-event-cmap-header
>>> > 63512:Jul 17 14:16:32.764 FILE-2 pacemaker-controld [15962]
>>> (pcmk__corosync_name) info: Unable to get node name for nodeid 6
>>> > 63513:Jul 17 14:16:32.764 FILE-2 pacemaker-controld [15962]
>>> (get_node_name) notice: Could not obtain a node name for corosync node
>>> with id 6
>>> > 63514:Jul 17 14:16:32.764 FILE-2 pacemaker-controld [15962]
>>> (quorum_notification_cb) debug: Member[4] 5
>>> > 63517:Jul 17 14:16:32.768 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63518:Jul 17 14:16:32.768 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63521:Jul 17 14:16:32.768 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63528:Jul 17 14:16:32.768 FILE-2 pacemaker-controld [15962]
>>> (qb_ipcc_disconnect) debug: qb_ipcc_disconnect()
>>> > 63529:Jul 17 14:16:32.768 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-ushXmW/qb-request-cmap-header
>>> > 63530:Jul 17 14:16:32.768 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-ushXmW/qb-response-cmap-header
>>> > 63531:Jul 17 14:16:32.768 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-ushXmW/qb-event-cmap-header
>>> > 63532:Jul 17 14:16:32.768 FILE-2 pacemaker-controld [15962]
>>> (pcmk__corosync_name) info: Unable to get node name for nodeid 5
>>> > 63533:Jul 17 14:16:32.768 FILE-2 pacemaker-controld [15962]
>>> (get_node_name) notice: Could not obtain a node name for corosync node
>>> with id 5
>>> > 63534:Jul 17 14:16:32.772 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63535:Jul 17 14:16:32.772 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63536:Jul 17 14:16:32.772 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63537:Jul 17 14:16:32.772 FILE-2 pacemaker-controld [15962]
>>> (qb_ipcc_disconnect) debug: qb_ipcc_disconnect()
>>> > 63538:Jul 17 14:16:32.772 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-x3qVkW/qb-request-cmap-header
>>> > 63539:Jul 17 14:16:32.772 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-x3qVkW/qb-response-cmap-header
>>> > 63540:Jul 17 14:16:32.772 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-x3qVkW/qb-event-cmap-header
>>> > 63541:Jul 17 14:16:32.772 FILE-2 pacemaker-controld [15962]
>>> (pcmk__corosync_name) info: Unable to get node name for nodeid 5
>>> > 63542:Jul 17 14:16:32.772 FILE-2 pacemaker-controld [15962]
>>> (quorum_notification_cb) info: Obtaining name for new node 5
>>> > 63543:Jul 17 14:16:32.776 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63544:Jul 17 14:16:32.776 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63545:Jul 17 14:16:32.776 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63546:Jul 17 14:16:32.776 FILE-2 pacemaker-controld [15962]
>>> (qb_ipcc_disconnect) debug: qb_ipcc_disconnect()
>>> > 63547:Jul 17 14:16:32.776 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-gUNSFU/qb-request-cmap-header
>>> > 63548:Jul 17 14:16:32.776 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-gUNSFU/qb-response-cmap-header
>>> > 63549:Jul 17 14:16:32.776 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-gUNSFU/qb-event-cmap-header
>>> > 63550:Jul 17 14:16:32.776 FILE-2 pacemaker-controld [15962]
>>> (pcmk__corosync_name) info: Unable to get node name for nodeid 5
>>> > 63551:Jul 17 14:16:32.776 FILE-2 pacemaker-controld [15962]
>>> (get_node_name) notice: Could not obtain a node name for corosync node
>>> with id 5
>>> > 63552:Jul 17 14:16:32.776 FILE-2 pacemaker-controld [15962]
>>> (update_peer_state_iter) notice: Node (null) state is now lost | nodeid=1
>>> previous=member source=pcmk__reap_unseen_nodes
>>> > 63553:Jul 17 14:16:32.776 FILE-2 pacemaker-controld [15962]
>>> (post_cache_update) debug: Updated cache after membership event 48.
>>> > 63554:Jul 17 14:16:32.776 FILE-2 pacemaker-controld [15962]
>>> (pcmk__set_flags_as) debug: FSA action flags 0x200000000
>>> (A_ELECTION_CHECK) for controller set by post_cache_update:81
>>> > 63555:Jul 17 14:16:32.776 FILE-2 pacemaker-controld [15962]
>>> (pcmk__clear_flags_as) debug: FSA action flags 0x200000000 (an_action)
>>> for controller cleared by do_fsa_action:108
>>> > 63556:Jul 17 14:16:32.776 FILE-2 pacemaker-controld [15962]
>>> (do_election_check) debug: Ignoring election check because we are
>>> not in an election
>>> > 63557:Jul 17 14:16:32.776 FILE-2 pacemaker-controld [15962]
>>> (pcmk_cpg_membership) info: Group crmd event 0: node 2 pid 15962
>>> joined via cpg_join
>>> > 63558:Jul 17 14:16:32.776 FILE-2 pacemaker-controld [15962]
>>> (pcmk_cpg_membership) info: Group crmd event 0: FILE-2 (node 2 pid
>>> 15962) is member
>>> > 63559:Jul 17 14:16:32.780 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63560:Jul 17 14:16:32.780 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63561:Jul 17 14:16:32.780 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63564:Jul 17 14:16:32.780 FILE-2 pacemaker-controld [15962]
>>> (qb_ipcc_disconnect) debug: qb_ipcc_disconnect()
>>> > 63565:Jul 17 14:16:32.780 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-5PH1gV/qb-request-cmap-header
>>> > 63566:Jul 17 14:16:32.780 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-5PH1gV/qb-response-cmap-header
>>> > 63567:Jul 17 14:16:32.780 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-5PH1gV/qb-event-cmap-header
>>> > 63568:Jul 17 14:16:32.780 FILE-2 pacemaker-controld [15962]
>>> (pcmk__corosync_name) info: Unable to get node name for nodeid 3
>>> > 63569:Jul 17 14:16:32.780 FILE-2 pacemaker-controld [15962]
>>> (get_node_name) notice: Could not obtain a node name for corosync node
>>> with id 3
>>> > 63570:Jul 17 14:16:32.780 FILE-2 pacemaker-controld [15962]
>>> (pcmk_cpg_membership) info: Group crmd event 0: peer node (node 3 pid
>>> 19250) is member
>>> > 63571:Jul 17 14:16:32.780 FILE-2 pacemaker-controld [15962]
>>> (crm_update_peer_proc) info: pcmk_cpg_membership: Node (null)[3] -
>>> corosync-cpg is now online
>>> > 63572:Jul 17 14:16:32.780 FILE-2 pacemaker-controld [15962]
>>> (peer_update_callback) debug: Sending hello to node 3 so that it learns
>>> our node name
>>> > 63573:Jul 17 14:16:32.784 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63574:Jul 17 14:16:32.784 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63575:Jul 17 14:16:32.784 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63576:Jul 17 14:16:32.784 FILE-2 pacemaker-controld [15962]
>>> (qb_ipcc_disconnect) debug: qb_ipcc_disconnect()
>>> > 63577:Jul 17 14:16:32.784 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-QATDEV/qb-request-cmap-header
>>> > 63578:Jul 17 14:16:32.784 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-QATDEV/qb-response-cmap-header
>>> > 63579:Jul 17 14:16:32.784 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-QATDEV/qb-event-cmap-header
>>> > 63580:Jul 17 14:16:32.784 FILE-2 pacemaker-controld [15962]
>>> (pcmk__corosync_name) info: Unable to get node name for nodeid 4
>>> > 63581:Jul 17 14:16:32.784 FILE-2 pacemaker-controld [15962]
>>> (get_node_name) notice: Could not obtain a node name for corosync node
>>> with id 4
>>> > 63582:Jul 17 14:16:32.784 FILE-2 pacemaker-controld [15962]
>>> (pcmk_cpg_membership) info: Group crmd event 0: peer node (node 4 pid
>>> 19122) is member
>>> > 63583:Jul 17 14:16:32.784 FILE-2 pacemaker-controld [15962]
>>> (crm_update_peer_proc) info: pcmk_cpg_membership: Node (null)[4] -
>>> corosync-cpg is now online
>>> > 63584:Jul 17 14:16:32.784 FILE-2 pacemaker-controld [15962]
>>> (peer_update_callback) debug: Sending hello to node 4 so that it learns
>>> our node name
>>> > 63585:Jul 17 14:16:32.788 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63586:Jul 17 14:16:32.788 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63587:Jul 17 14:16:32.788 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63588:Jul 17 14:16:32.788 FILE-2 pacemaker-controld [15962]
>>> (qb_ipcc_disconnect) debug: qb_ipcc_disconnect()
>>> > 63589:Jul 17 14:16:32.788 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-TVzR1T/qb-request-cmap-header
>>> > 63590:Jul 17 14:16:32.788 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-TVzR1T/qb-response-cmap-header
>>> > 63591:Jul 17 14:16:32.788 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-TVzR1T/qb-event-cmap-header
>>> > 63592:Jul 17 14:16:32.788 FILE-2 pacemaker-controld [15962]
>>> (pcmk__corosync_name) info: Unable to get node name for nodeid 5
>>> > 63593:Jul 17 14:16:32.788 FILE-2 pacemaker-controld [15962]
>>> (get_node_name) notice: Could not obtain a node name for corosync node
>>> with id 5
>>> > 63594:Jul 17 14:16:32.788 FILE-2 pacemaker-controld [15962]
>>> (pcmk_cpg_membership) info: Group crmd event 0: peer node (node 5 pid
>>> 19273) is member
>>> > 63595:Jul 17 14:16:32.788 FILE-2 pacemaker-controld [15962]
>>> (crm_update_peer_proc) info: pcmk_cpg_membership: Node (null)[5] -
>>> corosync-cpg is now online
>>> > 63596:Jul 17 14:16:32.788 FILE-2 pacemaker-controld [15962]
>>> (peer_update_callback) debug: Sending hello to node 5 so that it learns
>>> our node name
>>> > 63597:Jul 17 14:16:32.792 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63598:Jul 17 14:16:32.792 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63599:Jul 17 14:16:32.792 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_open_2) debug: shm size:1048589; real_size:1052672;
>>> rb->word_size:263168
>>> > 63600:Jul 17 14:16:32.792 FILE-2 pacemaker-controld [15962]
>>> (qb_ipcc_disconnect) debug: qb_ipcc_disconnect()
>>> > 63601:Jul 17 14:16:32.792 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-8LRaoV/qb-request-cmap-header
>>> > 63602:Jul 17 14:16:32.792 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-8LRaoV/qb-response-cmap-header
>>> > 63603:Jul 17 14:16:32.792 FILE-2 pacemaker-controld [15962]
>>> (qb_rb_close_helper) debug: Closing ringbuffer:
>>> /dev/shm/qb-13142-15962-31-8LRaoV/qb-event-cmap-header
>>> > 63604:Jul 17 14:16:32.792 FILE-2 pacemaker-controld [15962]
>>> (pcmk__corosync_name) info: Unable to get node name for nodeid 6
>>> > 63605:Jul 17 14:16:32.792 FILE-2 pacemaker-controld [15962]
>>> (get_node_name) notice: Could not obtain a node name for corosync node
>>> with id 6
>>> > 63606:Jul 17 14:16:32.792 FILE-2 pacemaker-controld [15962]
>>> (pcmk_cpg_membership) info: Group crmd event 0: peer node (node 6 pid
>>> 19415) is member
>>> > 63607:Jul 17 14:16:32.792 FILE-2 pacemaker-controld [15962]
>>> (crm_update_peer_proc) info: pcmk_cpg_membership: Node (null)[6] -
>>> corosync-cpg is now online
>>> > 63608:Jul 17 14:16:32.792 FILE-2 pacemaker-controld [15962]
>>> (peer_update_callback) debug: Sending hello to node 6 so that it learns
>>> our node name
>>> > 63609:Jul 17 14:16:32.792 FILE-2 pacemaker-controld [15962]
>>> (get_xpath_object) debug: No match for //st_notify_history_synced
>>> in /notify
>>> > 63610:Jul 17 14:16:32.792 FILE-2 pacemaker-controld [15962]
>>> (stonith_api_del_notification) debug: Removing callback for
>>> st_notify_history_synced events
>>> > 63611:Jul 17 14:16:32.792 FILE-2 pacemaker-fenced [15958]
>>> (stonith_command) debug: Processing st_notify 12 from client
>>> pacemaker-controld.15962 with call options 0x00000000
>>> > 63612:Jul 17 14:16:32.792 FILE-2 pacemaker-fenced [15958]
>>> (handle_request) debug: Disabling st_notify_history_synced callbacks for
>>> client pacemaker-controld.15962
>>> > 63613:Jul 17 14:16:32.792 FILE-2 pacemaker-fenced [15958]
>>> (stonith_command) debug: Processed st_notify from client
>>> pacemaker-controld.15962: OK (rc=0)
>>> > 63614:Jul 17 14:16:32.792 FILE-2 pacemaker-controld [15962]
>>> (tengine_stonith_history_synced) debug: Fence-history synced - cancel all
>>> timers
>>> > 63615:Jul 17 14:16:32.792 FILE-2 pacemaker-controld [15962]
>>> (crm_get_peer) info: Node 4 is now known as FILE-4
>>> > 63616:Jul 17 14:16:32.792 FILE-2 pacemaker-controld [15962]
>>> (update_peer_uname) warning: Node names with capitals are
>>> discouraged, consider changing 'FILE-4'
>>> > 63617:Jul 17 14:16:32.796 FILE-2 pacemaker-controld [15962]
>>> (peer_update_callback) info: Cluster node FILE-4 is now member
>>> > 63618:Jul 17 14:16:32.796 FILE-2 pacemaker-controld [15962]
>>> (crm_get_peer) info: Node 3 is now known as FILE-3
>>> > 63619:Jul 17 14:16:32.796 FILE-2 pacemaker-controld [15962]
>>> (update_peer_uname) warning: Node names with capitals are
>>> discouraged, consider changing 'FILE-3'
>>> > 63620:Jul 17 14:16:32.796 FILE-2 pacemaker-controld [15962]
>>> (peer_update_callback) info: Cluster node FILE-3 is now member
>>> > 63621:Jul 17 14:16:32.796 FILE-2 pacemaker-controld [15962]
>>> (crm_get_peer) info: Node 5 is now known as FILE-5
>>> > 63622:Jul 17 14:16:32.796 FILE-2 pacemaker-controld [15962]
>>> (update_peer_uname) warning: Node names with capitals are
>>> discouraged, consider changing 'FILE-5'
>>> > 63623:Jul 17 14:16:32.796 FILE-2 pacemaker-controld [15962]
>>> (peer_update_callback) info: Cluster node FILE-5 is now member
>>> > 63640:Jul 17 14:16:32.880 FILE-2 pacemaker-controld [15962]
>>> (crm_get_peer) info: Node 6 is now known as FILE-6
>>> > 63641:Jul 17 14:16:32.880 FILE-2 pacemaker-controld [15962]
>>> (update_peer_uname) warning: Node names with capitals are
>>> discouraged, consider changing 'FILE-6'
>>> > 63642:Jul 17 14:16:32.880 FILE-2 pacemaker-controld [15962]
>>> (peer_update_callback) info: Cluster node FILE-6 is now member
>>> > 63643:Jul 17 14:16:32.880 FILE-2 pacemaker-controld [15962]
>>> (handle_request) debug: Raising I_JOIN_OFFER: join-1
>>> > 63644:Jul 17 14:16:32.880 FILE-2 pacemaker-controld [15962]
>>> (pcmk__set_flags_as) debug: FSA action flags 0x00400200 (new_actions)
>>> for controller set by s_crmd_fsa:198
>>> > 63645:Jul 17 14:16:32.880 FILE-2 pacemaker-controld [15962]
>>> (s_crmd_fsa) debug: Processing I_JOIN_OFFER: [ state=S_PENDING
>>> cause=C_HA_MESSAGE origin=route_message ]
>>> > 63646:Jul 17 14:16:32.880 FILE-2 pacemaker-controld [15962]
>>> (pcmk__clear_flags_as) debug: FSA action flags 0x00000200 (an_action)
>>> for controller cleared by do_fsa_action:108
>>> > 63647:Jul 17 14:16:32.880 FILE-2 pacemaker-controld [15962]
>>> (pcmk__clear_flags_as) debug: FSA action flags 0x00400000 (an_action)
>>> for controller cleared by do_fsa_action:108
>>> > 63648:Jul 17 14:16:32.880 FILE-2 pacemaker-controld [15962]
>>> (update_dc) info: Set DC to FILE-6 (3.11.0)
>>> > 63649:Jul 17 14:16:32.880 FILE-2 pacemaker-controld [15962]
>>> (pcmk__update_peer_expected) info: update_dc: Node FILE-6[6] -
>>> expected state is now member (was (null))
>>> > 63650:Jul 17 14:16:32.880 FILE-2 pacemaker-controld [15962]
>>> (pcmk__set_flags_as) debug: FSA action flags 0x00000200
>>> (A_DC_TIMER_STOP) for controller set by do_cl_join_offer_respond:147
>>> > 63651:Jul 17 14:16:32.880 FILE-2 pacemaker-controld [15962]
>>> (pcmk__clear_flags_as) debug: FSA action flags 0x00000200 (an_action)
>>> for controller cleared by do_fsa_action:108
>>> > 63788:Jul 17 14:16:32.884 FILE-2 pacemaker-controld [15962]
>>> (do_cib_replaced) debug: Updating the CIB after a replace: DC=false
>>> > 63811:Jul 17 14:16:32.892 FILE-2 pacemaker-controld [15962]
>>> (join_query_callback) debug: Respond to join offer join-1 from FILE-6
>>> > 63819:Jul 17 14:16:55.080 FILE-2 pacemaker-controld [15962]
>>> (pcmk__procfs_pid_of) info: Found pacemaker-based active as process
>>> 15957
>>> > 63820:Jul 17 14:16:55.080 FILE-2 pacemaker-controld [15962]
>>> (throttle_cib_load) debug: Init 6 + 2 ticks at 1689603415 (100 tps)
>>> > 63821:Jul 17 14:16:55.080 FILE-2 pacemaker-controld [15962]
>>> (throttle_mode) debug: Current load is 0.980000 across 10 core(s)
>>> > 63822:Jul 17 14:16:55.080 FILE-2 pacemaker-controld [15962]
>>> (throttle_send_command) info: New throttle mode: negligible load (was
>>> undetermined)
>>> > 63823:Jul 17 14:16:55.080 FILE-2 pacemaker-controld [15962]
>>> (throttle_update) debug: Node FILE-2 has negligible load and supports at
>>> most 20 jobs; new job limit 20
>>> > 63824:Jul 17 14:16:55.092 FILE-2 pacemaker-controld [15962]
>>> (handle_request) debug: Raising I_JOIN_RESULT: join-1
>>> > 63825:Jul 17 14:16:55.092 FILE-2 pacemaker-controld [15962]
>>> (pcmk__set_flags_as) debug: FSA action flags 0x00800000 (new_actions)
>>> for controller set by s_crmd_fsa:198
>>> > 63826:Jul 17 14:16:55.092 FILE-2 pacemaker-controld [15962]
>>> (s_crmd_fsa) debug: Processing I_JOIN_RESULT: [ state=S_PENDING
>>> cause=C_HA_MESSAGE origin=route_message ]
>>> > 63827:Jul 17 14:16:55.092 FILE-2 pacemaker-controld [15962]
>>> (pcmk__clear_flags_as) debug: FSA action flags 0x00800000 (an_action)
>>> for controller cleared by do_fsa_action:108
>>> > 63828:Jul 17 14:16:55.092 FILE-2 pacemaker-controld [15962]
>>> (do_cl_join_finalize_respond) debug: Confirming join-1: sending local
>>> operation history to FILE-6
>>> > 63829:Jul 17 14:16:55.092 FILE-2 pacemaker-controld [15962]
>>> (pcmk__set_flags_as) debug: FSA action flags 0x1000000000000200
>>> (new_actions) for controller set by s_crmd_fsa:198
>>> > 63830:Jul 17 14:16:55.092 FILE-2 pacemaker-controld [15962]
>>> (s_crmd_fsa) debug: Processing I_NOT_DC: [ state=S_PENDING
>>> cause=C_HA_MESSAGE origin=do_cl_join_finalize_respond ]
>>> > 63831:Jul 17 14:16:55.092 FILE-2 pacemaker-controld [15962]
>>> (pcmk__clear_flags_as) debug: FSA action flags 0x1000000000000000
>>> (an_action) for controller cleared by do_fsa_action:108
>>> > 63832:Jul 17 14:16:55.092 FILE-2 pacemaker-controld [15962] (do_log)
>>> info: Input I_NOT_DC received in state S_PENDING from
>>> do_cl_join_finalize_respond
>>> > 63833:Jul 17 14:16:55.092 FILE-2 pacemaker-controld [15962]
>>> (do_state_transition) notice: State transition S_PENDING -> S_NOT_DC |
>>> input=I_NOT_DC cause=C_HA_MESSAGE origin=do_cl_join_finalize_respond
>>> > 63834:Jul 17 14:16:55.092 FILE-2 pacemaker-controld [15962]
>>> (pcmk__set_flags_as) debug: FSA action flags 0x00000020
>>> (A_INTEGRATE_TIMER_STOP) for controller set by do_state_transition:559
>>> > 63835:Jul 17 14:16:55.092 FILE-2 pacemaker-controld [15962]
>>> (pcmk__set_flags_as) debug: FSA action flags 0x00000080
>>> (A_FINALIZE_TIMER_STOP) for controller set by do_state_transition:565
>>> > 63836:Jul 17 14:16:55.092 FILE-2 pacemaker-controld [15962]
>>> (pcmk__clear_flags_as) debug: FSA action flags 0x00000200 (an_action)
>>> for controller cleared by do_fsa_action:108
>>> > 63837:Jul 17 14:16:55.092 FILE-2 pacemaker-controld [15962]
>>> (pcmk__clear_flags_as) debug: FSA action flags 0x00000020 (an_action)
>>> for controller cleared by do_fsa_action:108
>>> > 63838:Jul 17 14:16:55.092 FILE-2 pacemaker-controld [15962]
>>> (pcmk__clear_flags_as) debug: FSA action flags 0x00000080 (an_action)
>>> for controller cleared by do_fsa_action:108
>>> > 63863:Jul 17 14:17:25.073 FILE-2 pacemaker-controld [15962]
>>> (throttle_cib_load) debug: cib load: 0.000667 (2 ticks in 30s)
>>> > 63864:Jul 17 14:17:25.073 FILE-2 pacemaker-controld [15962]
>>> (throttle_mode) debug: Current load is 0.650000 across 10 core(s)
>>> > 63865:Jul 17 14:17:55.073 FILE-2 pacemaker-controld [15962]
>>> (throttle_cib_load) debug: cib load: 0.000333 (1 ticks in 30s)
>>> > 63866:Jul 17 14:17:55.073 FILE-2 pacemaker-controld [15962]
>>> (throttle_mode) debug: Current load is 0.850000 across 10 core(s)
>>> > 63868:Jul 17 14:18:20.085 FILE-2 pacemaker-fenced [15958]
>>> (process_remote_stonith_exec) debug: Finalizing action 'reboot'
>>> targeting FILE-2 on behalf of pacemaker-controld.19415 at FILE-6: OK |
>>> rc=0 id=4e523b34
>>> > 63869:Jul 17 14:18:20.085 FILE-2 pacemaker-fenced [15958]
>>> (remote_op_done) notice: Operation 'reboot' targeting FILE-2 by FILE-4
>>> for pacemaker-controld.19415 at FILE-6: OK | id=4e523b34
>>> > 63872:Jul 17 14:18:20.089 FILE-2 pacemaker-controld [15962]
>>> (exec_alert_list) info: Sending fencing alert via pf-ha-alert to (null)
>>> > 63875:Jul 17 14:18:20.089 FILE-2 pacemaker-controld [15962]
>>> (tengine_stonith_notify) crit: We were allegedly just fenced by FILE-4
>>> for FILE-6!
>>> > 63876:Jul 17 14:18:20.089 FILE-2 pacemaker-controld [15962]
>>> (crm_xml_cleanup) info: Cleaning up memory from libxml2
>>> > 63877:Jul 17 14:18:20.089 FILE-2 pacemaker-controld [15962]
>>> (crm_exit) info: Exiting pacemaker-controld | with status 100
>>> > 63900:Jul 17 14:18:20.093 FILE-2 pacemakerd [15956]
>>> (pcmk_child_exit) warning: Shutting cluster down because
>>> pacemaker-controld[15962] had fatal failure
>>> > 63902:Jul 17 14:18:20.093 FILE-2 pacemakerd [15956]
>>> (pcmk_shutdown_worker) debug: pacemaker-controld confirmed stopped
>>> > 63956:Jul 17 14:18:20.101 FILE-2 pacemaker-fenced [15958]
>>> (process_remote_stonith_exec) debug: Finalizing action 'reboot'
>>> targeting FILE-1 on behalf of pacemaker-controld.19415 at FILE-6: OK |
>>> rc=0 id=446afc42
>>> > 63957:Jul 17 14:18:20.101 FILE-2 pacemaker-fenced [15958]
>>> (remote_op_done) notice: Operation 'reboot' targeting FILE-1 by FILE-5
>>> for pacemaker-controld.19415 at FILE-6: OK | id=446afc42
>>> > Thanks
>>> > Priyanka
>>>
>>> Hi, node FILE-6 requested that node FILE-2 be fenced by node FILE-4.
>>> FILE-2's controller daemon received notification that it was being
>>> fenced, and it shut down. You'd want to check the logs on FILE-6 to
>>> determine why FILE-2 was fenced.
>>>
>>> >
>>> > On Thu, Jul 20, 2023 at 12:07 AM Ken Gaillot <kgaillot at redhat.com> wrote:
>>> >>
>>> >> On Wed, 2023-07-19 at 23:49 +0530, Priyanka Balotra wrote:
>>> >> > Hi All,
>>> >> > I am using SLES 15 SP4. One of the cluster nodes was brought down and
>>> >> > booted up again after some time. The Pacemaker service came up at first,
>>> >> > but later shut down with a fatal failure, so the crm service is now down.
>>> >> >
>>> >> > The logs from /var/log/pacemaker/pacemaker.log are as follows:
>>> >> >
>>> >> > Jul 17 14:18:20.093 FILE-2 pacemakerd [15956]
>>> >> > (pcmk_child_exit) warning: Shutting cluster down because
>>> >> > pacemaker-controld[15962] had fatal failure
>>> >>
>>> >> The interesting messages will be before this. The ones with
>>> >> "pacemaker-controld" will be the most relevant, at least initially.
>>> >>
>>> >> > Jul 17 14:18:20.093 FILE-2 pacemakerd [15956]
>>> >> > (pcmk_shutdown_worker) notice: Shutting down Pacemaker
>>> >> > Jul 17 14:18:20.093 FILE-2 pacemakerd [15956]
>>> >> > (pcmk_shutdown_worker) debug: pacemaker-controld confirmed stopped
>>> >> > Jul 17 14:18:20.093 FILE-2 pacemakerd [15956] (stop_child)
>>> >> > notice: Stopping pacemaker-schedulerd | sent signal 15 to process
>>> >> > 15961
>>> >> > Jul 17 14:18:20.093 FILE-2 pacemaker-schedulerd[15961]
>>> >> > (crm_signal_dispatch) notice: Caught 'Terminated' signal | 15
>>> >> > (invoking handler)
>>> >> > Jul 17 14:18:20.093 FILE-2 pacemaker-schedulerd[15961]
>>> >> > (qb_ipcs_us_withdraw) info: withdrawing server sockets
>>> >> > Jul 17 14:18:20.093 FILE-2 pacemaker-schedulerd[15961]
>>> >> > (qb_ipcs_unref) debug: qb_ipcs_unref() - destroying
>>> >> > Jul 17 14:18:20.093 FILE-2 pacemaker-schedulerd[15961]
>>> >> > (crm_xml_cleanup) info: Cleaning up memory from libxml2
>>> >> > Jul 17 14:18:20.093 FILE-2 pacemaker-schedulerd[15961] (crm_exit)
>>> >> > info: Exiting pacemaker-schedulerd | with status 0
>>> >> > Jul 17 14:18:20.093 FILE-2 pacemaker-based [15957]
>>> >> > (qb_ipcs_event_sendv) debug: new_event_notification (/dev/shm/qb-
>>> >> > 15957-15962-12-RDPw6O/qb): Broken pipe (32)
>>> >> > Jul 17 14:18:20.093 FILE-2 pacemaker-based [15957]
>>> >> > (cib_notify_send_one) warning: Could not notify client crmd:
>>> >> > Broken pipe | id=e29d175e-7e91-4b6a-bffb-fabfdd7a33bf
>>> >> > Jul 17 14:18:20.093 FILE-2 pacemaker-based [15957]
>>> >> > (cib_process_request) info: Completed cib_delete operation for
>>> >> > section //node_state[@uname='FILE-2']/*: OK (rc=0, origin=FILE-
>>> >> > 6/crmd/74, version=0.24.75)
>>> >> > Jul 17 14:18:20.093 FILE-2 pacemaker-fenced [15958]
>>> >> > (xml_patch_version_check) debug: Can apply patch 0.24.75 to
>>> >> > 0.24.74
>>> >> > Jul 17 14:18:20.093 FILE-2 pacemakerd [15956]
>>> >> > (pcmk_child_exit) info: pacemaker-schedulerd[15961] exited
>>> >> > with status 0 (OK)
>>> >> > Jul 17 14:18:20.093 FILE-2 pacemaker-based [15957]
>>> >> > (cib_process_request) info: Completed cib_modify operation for
>>> >> > section status: OK (rc=0, origin=FILE-6/crmd/75, version=0.24.75)
>>> >> > Jul 17 14:18:20.093 FILE-2 pacemakerd [15956]
>>> >> > (pcmk_shutdown_worker) debug: pacemaker-schedulerd confirmed
>>> >> > stopped
>>> >> > Jul 17 14:18:20.093 FILE-2 pacemakerd [15956] (stop_child)
>>> >> > notice: Stopping pacemaker-attrd | sent signal 15 to process 15960
>>> >> > Jul 17 14:18:20.093 FILE-2 pacemaker-attrd [15960]
>>> >> > (crm_signal_dispatch) notice: Caught 'Terminated' signal | 15
>>> >> > (invoking handler)
>>> >> >
>>> >> > Could you please help me understand the issue here?
>>> >> >
>>> >> > Regards
>>> >> > Priyanka
>>> >> > _______________________________________________
>>> >> > Manage your subscription:
>>> >> > https://lists.clusterlabs.org/mailman/listinfo/users
>>> >> >
>>> >> > ClusterLabs home: https://www.clusterlabs.org/
>>> >> --
>>> >> Ken Gaillot <kgaillot at redhat.com>
>>> >>
>>> >
>>>
>>>
>>>
>>> --
>>> Regards,
>>>
>>> Reid Wahl (He/Him)
>>> Senior Software Engineer, Red Hat
>>> RHEL High Availability - Pacemaker
>>>
>>>
>>
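For anyone following the advice quoted above (look at the messages before the "fatal failure" line, and check FILE-6's log for why FILE-2 was fenced), here is a minimal sketch of a filter that keeps only the fencing-related messages at notice level or higher. It is an illustration, not a Pacemaker tool: the line layout in the regular expression is inferred from the log lines pasted in this thread, and the names `parse_line`, `fencing_events` and `SAMPLE` are made up for the example. The sample lines are copied from the quoted logs with the mail wrapping undone ("at" is how the archive renders "@").

```python
import re

# Layout assumed from the lines quoted in this thread:
#   <Mon DD HH:MM:SS.mmm> <node> <daemon> [<pid>] (<function>) <level>: <message>
LINE_RE = re.compile(
    r"^(?P<ts>\w{3}\s+\d+ \d{2}:\d{2}:\d{2}\.\d+) "
    r"(?P<node>\S+) "
    r"(?P<daemon>[\w-]+)\s*\[(?P<pid>\d+)\]\s+"
    r"\((?P<func>\w+)\)\s+"
    r"(?P<level>\w+):\s+(?P<msg>.*)$"
)

# Levels treated as noise for a first pass (hypothetical choice).
QUIET_LEVELS = {"trace", "debug", "info"}

# Keywords that suggest a fencing-related message (hypothetical choice).
FENCE_RE = re.compile(r"fenc|stonith|reboot", re.IGNORECASE)


def parse_line(line):
    """Return the fields of one log line as a dict, or None if it does not match."""
    match = LINE_RE.match(line.strip())
    return match.groupdict() if match else None


def fencing_events(lines):
    """Keep messages at notice level or above that mention fencing."""
    events = []
    for line in lines:
        rec = parse_line(line)
        if rec is None or rec["level"] in QUIET_LEVELS:
            continue
        if FENCE_RE.search(rec["func"]) or FENCE_RE.search(rec["msg"]):
            events.append(rec)
    return events


SAMPLE = [
    "Jul 17 14:17:55.073 FILE-2 pacemaker-controld [15962] (throttle_mode) "
    "debug: Current load is 0.850000 across 10 core(s)",
    "Jul 17 14:18:20.085 FILE-2 pacemaker-fenced [15958] (remote_op_done) "
    "notice: Operation 'reboot' targeting FILE-2 by FILE-4 for "
    "pacemaker-controld.19415 at FILE-6: OK | id=4e523b34",
    "Jul 17 14:18:20.089 FILE-2 pacemaker-controld [15962] "
    "(tengine_stonith_notify) crit: We were allegedly just fenced by FILE-4 "
    "for FILE-6!",
    "Jul 17 14:18:20.093 FILE-2 pacemakerd [15956] (pcmk_child_exit) "
    "warning: Shutting cluster down because pacemaker-controld[15962] had "
    "fatal failure",
]

if __name__ == "__main__":
    for rec in fencing_events(SAMPLE):
        print(rec["ts"], rec["daemon"], rec["level"], rec["msg"])
```

To use it on a real log, pass an open file instead of `SAMPLE`. Running it on FILE-6's log for the minutes before 14:18:20 should narrow things down to the messages where the DC decided to fence FILE-2.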