[Pacemaker] Can not use multicast, Any Ideas?

Angie T. Muhammad angie.tawfik at gmail.com
Tue Jan 5 18:54:39 EST 2010


Everything is working fine now.. Thanks for every one who helped :)

On Wed, Jan 6, 2010 at 12:53 AM, Angie T. Muhammad
<angie.tawfik at gmail.com>wrote:

> well, I am not sure if what I did is right or not, but:
>
>
> # vim /etc/ha.d/ha.cf
> crm on
>
> // now crm_mon displays things as usual !!
> # crm_mon -i5
>
> ============
> Last updated: Wed Jan  6 00:49:04 2010
> Stack: Heartbeat
> Current DC: node2.mydomain.com (8e8ca99f-ff34-45c7-814b-d73d69889441) -
> partition with quorum
> Version: 1.0.6-f709c638237cdff7556cb6ab615f32826c0f8c06
> 2 Nodes configured, unknown expected votes
> 0 Resources configured.
> ============
>
> Online: [ node1.mydomain.com node2.mydomain.com ]
>
>
> Now, I 'll configure my resources under pacemaker as I always did and let
> you know of any progress / problems.
> Thank you Dejan for keeping up with me on this issue :)
>
> =====================================================================================
>
>
>
>
>
> On Wed, Jan 6, 2010 at 12:08 AM, Angie T. Muhammad <angie.tawfik at gmail.com
> > wrote:
>
>> Hello,
>> Thank you for the prompt reply.
>>
>> All permissions are correct, and here is the output of ulimit:
>> # cd /var/lib/heartbeat/cores/
>> # ulimit -a
>> core file size          (blocks, -c) 0
>> data seg size           (kbytes, -d) unlimited
>> scheduling priority             (-e) 0
>> file size               (blocks, -f) unlimited
>> pending signals                 (-i) 73728
>> max locked memory       (kbytes, -l) 32
>> max memory size         (kbytes, -m) unlimited
>> open files                      (-n) 1024
>> pipe size            (512 bytes, -p) 8
>> POSIX message queues     (bytes, -q) 819200
>> real-time priority              (-r) 0
>> stack size              (kbytes, -s) 10240
>> cpu time               (seconds, -t) unlimited
>> max user processes              (-u) 73728
>> virtual memory          (kbytes, -v) unlimited
>> file locks                      (-x) unlimited
>>
>> *
>> what should I do in this respect?*
>>
>>
>> On Tue, Jan 5, 2010 at 10:37 PM, Dejan Muhamedagic <dejanmm at fastmail.fm>wrote:
>>
>>> Hi,
>>>
>>> On Tue, Jan 05, 2010 at 09:47:46PM +0200, Angie T. Muhammad wrote:
>>> > mmm, I truncated the logs to re-genrate the error and send you the
>>> file, but
>>> > the error no longer appears at /var/log/messages now. There were the
>>> words
>>> > "kernel" and "segfault" on the last line !!!
>>>
>>> Did you enabled coredumps (ulimit -c)? Please check
>>> /var/lib/heartbeat/cores/*.
>>>
>>> > Any way, I'll try to regenerate the error at /var/log/messages and send
>>> it.
>>> > Till then, would you please let me know which files exactly you mean
>>> have
>>> > wrong permissions?
>>>
>>> d /var/lib/heartbeat 0755 root root
>>> d /var/lib/pengine 0750 hacluster haclient
>>> d /var/lib/heartbeat/crm 0750 hacluster haclient
>>> d /var/run/crm 0750 hacluster haclient
>>>
>>> Thanks,
>>>
>>> Dejan
>>>
>>> > Thank you
>>> >
>>> >
>>> >
>>> > On Tue, Jan 5, 2010 at 9:29 PM, Dejan Muhamedagic <dejanmm at fastmail.fm
>>> >wrote:
>>> >
>>> > > Hi,
>>> > >
>>> > > On Tue, Jan 05, 2010 at 09:19:16PM +0200, Angie T. Muhammad wrote:
>>> > > > Hello all
>>> > > >
>>> > > > Thank you Dejan and Dr. Schwartzkopff
>>> > > > But please bear with me because I'm still suffering a problem. Here
>>> is
>>> > > what
>>> > > > I did:
>>> > > >
>>> > > > #  wget -O /etc/yum.repos.d/clusterlabs.repo
>>> > > > http://clusterlabs.org/rpm/epel-5/clusterlabs.repo
>>> > > > # yum install pacemaker pacemaker-libs cluster-glue
>>> cluster-glue-libs
>>> > > > resource-agents heartbeat
>>> > > >
>>> > >
>>> =============================================================================================================================================================
>>> > > >  Package                                    Arch
>>> > > > Version                               Repository
>>> > > > Size
>>> > > >
>>> > >
>>> =============================================================================================================================================================
>>> > > > Installing:
>>> > > >  cluster-glue                               x86_64
>>> > > > 1.0.1-1.el5                           clusterlabs
>>> > > > 262 k
>>> > > >  cluster-glue-libs                          x86_64
>>> > > > 1.0.1-1.el5                           clusterlabs
>>> > > > 130 k
>>> > > >  heartbeat                                  x86_64
>>> > > > 3.0.1-1.el5                           clusterlabs
>>> > > > 193 k
>>> > > >  pacemaker                                  x86_64
>>> > > > 1.0.6-1.el5                           clusterlabs
>>> > > > 689 k
>>> > > >  pacemaker-libs                             x86_64
>>> > > > 1.0.6-1.el5                           clusterlabs
>>> > > > 310 k
>>> > > >  resource-agents                            x86_64
>>> > > > 1.0.1-1.el5                           clusterlabs
>>> > > > 179 k
>>> > > > Installing for dependencies:
>>> > > >  corosync                                   x86_64
>>> > > > 1.1.2-1.el5                           clusterlabs
>>> > > > 163 k
>>> > > >  corosynclib                                x86_64
>>> > > > 1.1.2-1.el5                           clusterlabs
>>> > > > 163 k
>>> > > >  heartbeat-libs                             x86_64
>>> > > > 3.0.1-1.el5                           clusterlabs
>>> > > > 292 k
>>> > > >  libesmtp                                   x86_64
>>> > > > 1.0.4-5.el5                           epel
>>> > > > 60 k
>>> > > >  libibverbs                                 x86_64
>>> > > > 1.1.2-4.el5                           base
>>> > > > 44 k
>>> > > >  librdmacm                                  x86_64
>>> > > > 1.0.8-5.el5                           base
>>> > > > 22 k
>>> > > >  openhpi-libs                               x86_64
>>> > > > 2.14.0-5.el5                          base
>>> > > > 168 k
>>> > > >  openib                                     noarch
>>> > > > 1.4.1-3.el5                           base
>>> > > > 20 k
>>> > > >
>>> > > > Transaction Summary
>>> > > >
>>> > >
>>> =============================================================================================================================================================
>>> > > > Install     14 Package(s)
>>> > > > Update       0 Package(s)
>>> > > > Remove       0 Package(s)
>>> > > >
>>> > > > Total download size: 2.6 M
>>> > > >
>>> > > > # vim /etc/ha.d/ha.cf
>>> > > > keepalive       2
>>> > > > deadtime        30
>>> > > > warntime        10
>>> > > > initdead        120
>>> > > > udpport         694
>>> > > > ucast eth1      10.0.0.101
>>> > > > auto_failback   on
>>> > > > node            node1.mydomain.com
>>> > > > node            node2.mydomain.com
>>> > > > use_logd        yes
>>> > > >
>>> > > > // and I changed the ucast directive properly for each node
>>> > > >
>>> > > > # vim /etc/ha.d/authkeys
>>> > > > # chmod 600 /etc/ha.d/authkeys
>>> > > > # /etc/init.d/heartbeat start
>>> > > > Starting High-Availability services:                       [  OK  ]
>>> > > > // started properly on both nodes
>>> > > >
>>> > > > # crm_mon -i5
>>> > > > Attempting connection to the cluster....
>>> > > >
>>> > > > # strace -o hb-again crm_mon -i5
>>> > > > // the file is attached
>>> > > >
>>> > > > // I didn't find perl on the system , so I installed it
>>> > > > # yum install perl
>>> > > >
>>> > > > // indeed, i believe the error is at around 92% of the strace
>>> output file
>>> > > > when it attempts to:
>>> > > >
>>> > > > connect(3, {sa_family=AF_FILE, path="/var/run/crm/cib_ro"...}, 110)
>>> = -1
>>> > > > ENOENT (No such file or directory)
>>> > > > close(3)                                = 0
>>> > > > socket(PF_FILE, SOCK_STREAM, 0)         = 3
>>> > > > fcntl(3, F_GETFL)                       = 0x2 (flags O_RDWR)
>>> > > > fcntl(3, F_SETFL, O_RDWR|O_NONBLOCK)    = 0
>>> > > > connect(3, {sa_family=AF_FILE,
>>> path="/var/run/crm/cib_callback"...}, 110)
>>> > > =
>>> > > > -1 ENOENT (No such file or directory)
>>> > >
>>> > > Looks like cib didn't start. The logs should say why. Perhaps
>>> > > there are permission problems?
>>> > >
>>> > > Thanks,
>>> > >
>>> > > Dejan
>>> > >
>>> > > > I can't understand why it can not run :( ..
>>> > > > Version 1.0.5 of pace maker and openais 0.80.5 worked like a charm
>>> on the
>>> > > > same nodes.
>>> > > > Now I have to shift to heartbeat because of unicast directive.
>>> Please
>>> > > help!
>>> > > >
>>> > > > Thank you in advance
>>> > > >
>>> > > >
>>> > > > On Tue, Jan 5, 2010 at 2:17 PM, Michael Schwartzkopff <
>>> misch at multinet.de
>>> > > >wrote:
>>> > > >
>>> > > > > Am Dienstag, 5. Januar 2010 13:00:44 schrieb Dejan Muhamedagic:
>>> > > > > > Hi,
>>> > > > > >
>>> > > > > > On Tue, Jan 05, 2010 at 01:51:38PM +0200, Angie T. Muhammad
>>> wrote:
>>> > > > > > > Hello all,
>>> > > > > > > Hope you spent good time on holidays!
>>> > > > > > >
>>> > > > > > > Our data center does not support multicast and I have been
>>> googling
>>> > > > > > > "unicast site:openais.org" but now results.
>>> > > > > > > And changing our data center is not an option at the moment.
>>> > > > > > >
>>> > > > > > > I wonder does any beta version of openais support unicast?
>>> > > > > >
>>> > > > > > I think that the latest corosync (1.2.0) supports broadcast.
>>> > > > > >
>>> > > > > > > If not, do you have any link to pacemaker installation with
>>> > > heartbeat
>>> > > > > > > stack?
>>> > > > > >
>>> > > > > > clusterlabs.org has some installation docs and there are also
>>> > > > > > brand new docs at http://linux-ha.org/wiki/Documentation
>>> > > > > >
>>> > > > > > Thanks,
>>> > > > > >
>>> > > > > > Dejan
>>> > > > > >
>>> > > > > > > Indeed, I would be very grateful if you could suggest me any
>>> other
>>> > > > > > > solution?
>>> > > > >
>>> > > > >
>>> > > > > Perhaps you could use a tunnel (gre, ...) to route the multicast.
>>> > > > >
>>> > > > > --
>>> > > > > Dr. Michael Schwartzkopff
>>> > > > > MultiNET Services GmbH
>>> > > > > Addresse: Bretonischer Ring 7; 85630 Grasbrunn; Germany
>>> > > > > Tel: +49 - 89 - 45 69 11 0
>>> > > > > Fax: +49 - 89 - 45 69 11 21
>>> > > > > mob: +49 - 174 - 343 28 75
>>> > > > >
>>> > > > > mail: misch at multinet.de
>>> > > > > web: www.multinet.de
>>> > > > >
>>> > > > > Sitz der Gesellschaft: 85630 Grasbrunn
>>> > > > > Registergericht: Amtsgericht München HRB 114375
>>> > > > > Geschäftsführer: Günter Jurgeneit, Hubert Martens
>>> > > > >
>>> > > > > ---
>>> > > > >
>>> > > > > PGP Fingerprint: F919 3919 FF12 ED5A 2801 DEA6 AA77 57A4 EDD8
>>> 979B
>>> > > > > Skype: misch42
>>> > > > >
>>> > > > > _______________________________________________
>>> > > > > Pacemaker mailing list
>>> > > > > Pacemaker at oss.clusterlabs.org
>>> > > > > http://oss.clusterlabs.org/mailman/listinfo/pacemaker
>>> > > > >
>>> > > >
>>> > > >
>>> > > >
>>> > > > --
>>> > > > All the best,
>>> > > > Angie
>>> > >
>>> > >
>>> > > > _______________________________________________
>>> > > > Pacemaker mailing list
>>> > > > Pacemaker at oss.clusterlabs.org
>>> > > > http://oss.clusterlabs.org/mailman/listinfo/pacemaker
>>> > >
>>> > >
>>> > > _______________________________________________
>>> > > Pacemaker mailing list
>>> > > Pacemaker at oss.clusterlabs.org
>>> > > http://oss.clusterlabs.org/mailman/listinfo/pacemaker
>>> > >
>>> >
>>> >
>>> >
>>> > --
>>> > All the best,
>>> > Angie
>>>
>>> > _______________________________________________
>>> > Pacemaker mailing list
>>> > Pacemaker at oss.clusterlabs.org
>>> > http://oss.clusterlabs.org/mailman/listinfo/pacemaker
>>>
>>>
>>> _______________________________________________
>>> Pacemaker mailing list
>>> Pacemaker at oss.clusterlabs.org
>>> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
>>>
>>
>>
>>
>> --
>> All the best,
>> Angie
>>
>
>
>
> --
> All the best,
> Angie
>



-- 
All the best,
Angie
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.clusterlabs.org/pipermail/pacemaker/attachments/20100106/f75900f5/attachment-0001.html>


More information about the Pacemaker mailing list