[Pacemaker] OpenAIS + mgmtd [Not Working]

Andrew Beekhof beekhof at gmail.com
Wed Nov 19 15:15:40 EST 2008


I just installed the new openais packages (0.80.3-12.1 or higher is
required) and it seems to be working.

c001n02:~ # grep mgmtd /var/log/messages
Nov 19 21:09:44 c001n02 openais[16495]: [MAIN ] info: get_config_opt:
Found 'yes' for option: use_mgmtd
Nov 19 21:09:44 c001n02 openais[16495]: [MAIN ] info: spawn_child:
Forked child 16506 for process mgmtd
Nov 19 21:09:44 c001n02 mgmtd: [16506]: info:
G_main_add_SignalHandler: Added signal handler for signal 15
Nov 19 21:09:44 c001n02 mgmtd: [16506]: debug: Enabling coredumps
Nov 19 21:09:44 c001n02 mgmtd: [16506]: info:
G_main_add_SignalHandler: Added signal handler for signal 10
Nov 19 21:09:44 c001n02 mgmtd: [16506]: info:
G_main_add_SignalHandler: Added signal handler for signal 12
Nov 19 21:09:44 c001n02 lrmd: [16502]: debug: on_msg_register:client
mgmtd [16506] registered
Nov 19 21:09:44 c001n02 mgmtd: [16506]: info: init_crm
Nov 19 21:09:44 c001n02 mgmtd: [16506]: info: login to cib: 0, ret:-10
Nov 19 21:09:46 c001n02 mgmtd: [16506]: debug: main: run the loop...
Nov 19 21:09:46 c001n02 mgmtd: [16506]: info: Started.
Nov 19 21:10:01 c001n02 mgmtd: [16506]: debug: update cib finished
Nov 19 21:10:01 c001n02 mgmtd: [16506]: debug: update cib finished

Here is the directive in openais.conf

service {
        # Load the Pacemaker Cluster Resource Manager
        name: pacemaker
        ver:  0
        use_mgmtd: yes
}

On Wed, Nov 19, 2008 at 13:51, Bret Palsson <bretep at gmail.com> wrote:
> I know there is some testing going on right now. Here is what I am getting.
>
> With the latest unstable I'm getting the following:
>
> # /etc/init.d/openais start
> # cat /var/log/messages
>
> Nov 19 05:42:45 m-nfs2 openais[5725]: [MAIN ] AIS Executive Service RELEASE
> 'subrev 1152 version 0.80'
> Nov 19 05:42:45 m-nfs2 openais[5725]: [MAIN ] Copyright (C) 2002-2006
> MontaVista Software, Inc and contributors.
> Nov 19 05:42:45 m-nfs2 openais[5725]: [MAIN ] Copyright (C) 2006 Red Hat,
> Inc.
> Nov 19 05:42:45 m-nfs2 openais[5725]: [MAIN ] AIS Executive Service: started
> and ready to provide service.
> Nov 19 05:42:45 m-nfs2 openais[5725]: [TOTEM] Token Timeout (10000 ms)
> retransmit timeout (495 ms)
> Nov 19 05:42:45 m-nfs2 openais[5725]: [TOTEM] token hold (386 ms)
> retransmits before loss (20 retrans)
> Nov 19 05:42:45 m-nfs2 openais[5725]: [TOTEM] join (60 ms) send_join (0 ms)
> consensus (4800 ms) merge (200 ms)
> Nov 19 05:42:45 m-nfs2 openais[5725]: [TOTEM] downcheck (1000 ms) fail to
> recv const (50 msgs)
> Nov 19 05:42:45 m-nfs2 openais[5725]: [TOTEM] seqno unchanged const (30
> rotations) Maximum network MTU 1500
> Nov 19 05:42:45 m-nfs2 openais[5725]: [TOTEM] window size per rotation (50
> messages) maximum messages per rotation (20 messages)
> Nov 19 05:42:45 m-nfs2 openais[5725]: [TOTEM] send threads (0 threads)
> Nov 19 05:42:45 m-nfs2 openais[5725]: [TOTEM] RRP token expired timeout (495
> ms)
> Nov 19 05:42:45 m-nfs2 openais[5725]: [TOTEM] RRP token problem counter
> (2000 ms)
> Nov 19 05:42:45 m-nfs2 openais[5725]: [TOTEM] RRP threshold (10 problem
> count)
> Nov 19 05:42:45 m-nfs2 openais[5725]: [TOTEM] RRP mode set to none.
> Nov 19 05:42:45 m-nfs2 openais[5725]: [TOTEM] heartbeat_failures_allowed (0)
> Nov 19 05:42:45 m-nfs2 openais[5725]: [TOTEM] max_network_delay (50 ms)
> Nov 19 05:42:45 m-nfs2 openais[5725]: [TOTEM] HeartBeat is Disabled. To
> enable set heartbeat_failures_allowed > 0
> Nov 19 05:42:45 m-nfs2 openais[5725]: [TOTEM] Receive multicast socket recv
> buffer size (262142 bytes).
> Nov 19 05:42:45 m-nfs2 openais[5725]: [TOTEM] Transmit multicast socket send
> buffer size (262142 bytes).
> Nov 19 05:42:45 m-nfs2 openais[5725]: [TOTEM] The network interface
> [10.128.6.3] is now up.
> Nov 19 05:42:45 m-nfs2 openais[5725]: [TOTEM] Created or loaded sequence id
> 0.10.128.6.3 for this ring.
> Nov 19 05:42:45 m-nfs2 openais[5725]: [TOTEM] entering GATHER state from 15.
> Nov 19 05:42:45 m-nfs2 openais[5725]: [SERV ] Service initialized 'openais
> extended virtual synchrony service'
> Nov 19 05:42:45 m-nfs2 openais[5725]: [SERV ] Service initialized 'openais
> cluster membership service B.01.01'
> Nov 19 05:42:45 m-nfs2 openais[5725]: [SERV ] Service initialized 'openais
> availability management framework B.01.01'
> Nov 19 05:42:45 m-nfs2 openais[5725]: [SERV ] Service initialized 'openais
> checkpoint service B.01.01'
> Nov 19 05:42:45 m-nfs2 openais[5725]: [SERV ] Service initialized 'openais
> event service B.01.01'
> Nov 19 05:42:45 m-nfs2 openais[5725]: [SERV ] Service initialized 'openais
> distributed locking service B.01.01'
> Nov 19 05:42:45 m-nfs2 openais[5725]: [SERV ] Service initialized 'openais
> message service B.01.01'
> Nov 19 05:42:45 m-nfs2 openais[5725]: [SERV ] Service initialized 'openais
> configuration service'
> Nov 19 05:42:45 m-nfs2 openais[5725]: [SERV ] Service initialized 'openais
> cluster closed process group service v1.01'
> Nov 19 05:42:45 m-nfs2 openais[5725]: [crm  ] info: process_ais_conf:
> Reading configure
> Nov 19 05:42:45 m-nfs2 openais[5725]: [crm  ] info: get_config_section:
> Processing additional logging options...
> Nov 19 05:42:45 m-nfs2 openais[5725]: [MAIN ] info: get_config_opt: Found
> 'off' for option: debug
> Nov 19 05:42:45 m-nfs2 openais[5725]: [MAIN ] info: get_config_opt: Found
> 'yes' for option: to_syslog
> Nov 19 05:42:45 m-nfs2 openais[5725]: [MAIN ] info: get_config_opt: Found
> 'daemon' for option: syslog_facility
> Nov 19 05:42:45 m-nfs2 openais[5725]: [MAIN ] info: get_config_opt:
> Defaulting to 'off' for option: to_file
> Nov 19 05:42:45 m-nfs2 openais[5725]: [crm  ] info: get_config_section: No
> additional configuration supplied for: pacemaker
> Nov 19 05:42:45 m-nfs2 openais[5725]: [MAIN ] info: get_config_opt:
> Defaulting to '2' for option: expected_nodes
> Nov 19 05:42:45 m-nfs2 openais[5725]: [MAIN ] info: get_config_opt:
> Defaulting to '2' for option: expected_votes
> Nov 19 05:42:45 m-nfs2 openais[5725]: [MAIN ] info: get_config_opt:
> Defaulting to '1' for option: quorum_votes
>
>
> # HA_cluster_type="openais" /usr/lib64/heartbeat/mgmtd
> # cat /var/log/messages
>
> Nov 19 05:44:59 m-nfs2 mgmtd: [5740]: info: G_main_add_SignalHandler: Added
> signal handler for signal 15
> Nov 19 05:44:59 m-nfs2 mgmtd: [5740]: info: G_main_add_SignalHandler: Added
> signal handler for signal 10
> Nov 19 05:44:59 m-nfs2 mgmtd: [5740]: info: G_main_add_SignalHandler: Added
> signal handler for signal 12
> Nov 19 05:44:59 m-nfs2 mgmtd: [5740]: WARN: lrm_signon: can not initiate
> connection
> Nov 19 05:44:59 m-nfs2 mgmtd: [5740]: info: login to lrm: 0, ret:0
> Nov 19 05:45:00 m-nfs2 mgmtd: [5740]: WARN: lrm_signon: can not initiate
> connection
> Nov 19 05:45:00 m-nfs2 mgmtd: [5740]: info: login to lrm: 1, ret:0
> Nov 19 05:45:01 m-nfs2 mgmtd: [5740]: WARN: lrm_signon: can not initiate
> connection
> Nov 19 05:45:01 m-nfs2 mgmtd: [5740]: info: login to lrm: 2, ret:0
> Nov 19 05:45:02 m-nfs2 mgmtd: [5740]: WARN: lrm_signon: can not initiate
> connection
> Nov 19 05:45:02 m-nfs2 mgmtd: [5740]: info: login to lrm: 3, ret:0
> Nov 19 05:45:03 m-nfs2 mgmtd: [5740]: WARN: lrm_signon: can not initiate
> connection
> Nov 19 05:45:03 m-nfs2 mgmtd: [5740]: info: login to lrm: 4, ret:0
> Nov 19 05:45:04 m-nfs2 mgmtd: [5740]: info: login to lrm failed
> Nov 19 05:45:04 m-nfs2 mgmtd: [5740]: ERROR: Can't initialize management
> library.Shutting down.(-1)
>
>
>
> On Nov 19, 2008, at 4:14 AM, Andrew Beekhof wrote:
>
>> On Wed, Nov 19, 2008 at 07:22, Steven Dake <sdake at redhat.com> wrote:
>>>
>>> is there some problem with whitetank branch that you are having?
>>
>> My fault (and not in the upstream branch).
>> I had one of those "this can't possibly break anything" moments when I
>> modifed the start order (so that custom services were loaded after
>> default ones)
>>
>> Should be fixed now - packages are rebuilding
>>
>>>
>>> Works for me from tip + cman.  I'm on vacation haven't tried other
>>> plugins.
>>>
>>> Regards
>>> -steve
>>>
>>> On Tue, 2008-11-18 at 16:09 +0100, Andrew Beekhof wrote:
>>>>
>>>> On Tue, Nov 18, 2008 at 16:04, Yan Gao <ygao at novell.com> wrote:
>>>>>
>>>>> On Tue, 2008-11-18 at 11:00 +0100, Andrew Beekhof wrote:
>>>>>>
>>>>>> On Tue, Nov 18, 2008 at 10:16, Yan Gao <ygao at novell.com> wrote:
>>>>>>>
>>>>>>> On Tue, 2008-11-18 at 08:38 +0100, Andrew Beekhof wrote:
>>>>>>>>>>
>>>>>>>>>> Does anyone know what might be wrong here? I shouldn't have to run
>>>>>>>>>> the
>>>>>>>>>> heartbeat stack when I am running the OpenAIS stack.
>>>>>>>>>
>>>>>>>>> You shouldn't.
>>>>>>>>> It seems that something is wrong with openais. And several daemons
>>>>>>>>> (at
>>>>>>>>> least lrmd) could not be started.
>>>>>>>>>
>>>>>>>>> I've just tested the latest build (on Nov 14th) of openais and met
>>>>>>>>> the
>>>>>>>>> similar issue.
>>>>>>>>
>>>>>>>> Any reason you didn't tell anyone?
>>>>>>>
>>>>>>> I haven't got a clue either.
>>>>>>
>>>>>> Ok, well next time you have a problem that results in the machine
>>>>>> locking up - please report it so someone can fix it.
>>>>>
>>>>> OK, I see. I hadn't tried the lastest build until this afternoon I saw
>>>>> the mail from Bret.
>>>>>
>>>>>> What rpm versions (and from what location) are you running.
>>>>>
>>>>> Packages are from:
>>>>>
>>>>> http://download.opensuse.org/repositories/server:/ha-clustering:/UNSTABLE/openSUSE_11.0/i586/
>>>>
>>>> They've been changing a lot recently - which exact version did you
>>>> install?
>>>> _______________________________________________
>>>> Pacemaker mailing list
>>>> Pacemaker at clusterlabs.org
>>>> http://list.clusterlabs.org/mailman/listinfo/pacemaker
>>>
>>>
>>> _______________________________________________
>>> Pacemaker mailing list
>>> Pacemaker at clusterlabs.org
>>> http://list.clusterlabs.org/mailman/listinfo/pacemaker
>>>
>> _______________________________________________
>> Pacemaker mailing list
>> Pacemaker at clusterlabs.org
>> http://list.clusterlabs.org/mailman/listinfo/pacemaker
>
>
> _______________________________________________
> Pacemaker mailing list
> Pacemaker at clusterlabs.org
> http://list.clusterlabs.org/mailman/listinfo/pacemaker
>




More information about the Pacemaker mailing list