[Pacemaker] Master Slave resource

ruslan usifov ruslan.usifov at gmail.com
Sat Dec 11 06:58:25 EST 2010


2010/12/11 Andrew Beekhof <andrew at beekhof.net>

> On Fri, Dec 10, 2010 at 4:59 PM, ruslan usifov <ruslan.usifov at gmail.com>
> wrote:
> > and to me what to do?
>
> Nothing yet, there looks to be some memory corruption going on.
> With that file I've been able to reproduce locally.  I'll let you know
> when there is a fix (hopefully very soon).
>
> >
> > 2010/12/10 Andrew Beekhof <andrew at beekhof.net>
> >>
> >> On Fri, Dec 10, 2010 at 11:16 AM, ruslan usifov <
> ruslan.usifov at gmail.com>
> >> wrote:
> >> > you mean some think like this:
> >> >
> >> > Dec 07 15:14:05 storage1 crmd: [16003]: notice: save_cib_contents:
> Saved
> >> > CIB
> >> > contents after PE crash to /var/lib/pengine/pe-core
> >> > -121f59f1-ca5c-4ce4-996c-53f4aa617ac3.bz2
> >>
> >> perfect
> >>
> >> >
> >> >
> >> > ??
> >> > If so, i attached it to this email
> >> >
> >> >
> >> > 2010/12/10 Andrew Beekhof <andrew at beekhof.net>
> >> >>
> >> >> On Fri, Dec 10, 2010 at 10:18 AM, ruslan usifov
> >> >> <ruslan.usifov at gmail.com>
> >> >> wrote:
> >> >> > I don't know how to see version of pacemaker, crm doesn't provide
> -v
> >> >> > (or
> >> >> > -V
> >> >> > or --version) option, but I got source from here
> >> >> > http://hg.clusterlabs.org/pacemaker/stable-1.0/archive/tip.tar.bz2
> ,
> >> >> > as
> >> >> > result I download Pacemaker-1-0-b0266dd5ffa9.tar.bz2
> >> >>
> >> >> Ah, b0266dd5ffa9 tells me exactly what you're running.
> >> >>
> >> >> I'm very concerned that can_run_resources() is crashing, looking at
> >> >> the code it should not be possible.
> >> >> Do you see any logs containing the text "save_cib_contents"?  If so,
> >> >> it should mention a filename, could you attach it?
> >> >>
> >> >> >
> >> >> >
> >> >> > and here is my backtrace:
> >> >> >
> >> >> > gdb /usr/lib/heartbeat/pengine /tmp/pengine.940.core
> >> >> > (gdb) bt
> >> >> > #0  0xb76879b8 in can_run_resources () from
> /usr/lib/libpengine.so.3
> >> >> > #1  0xb768dcd7 in node_list_attr_score () from
> >> >> > /usr/lib/libpengine.so.3
> >> >> > #2  0xb768de91 in ?? () from /usr/lib/libpengine.so.3
> >> >> > #3  0xb768e193 in rsc_merge_weights () from
> /usr/lib/libpengine.so.3
> >> >> > #4  0xb7681676 in native_merge_weights () from
> >> >> > /usr/lib/libpengine.so.3
> >> >> > #5  0xb7698654 in group_merge_weights () from
> >> >> > /usr/lib/libpengine.so.3
> >> >> > #6  0xb76a063a in ?? () from /usr/lib/libpengine.so.3
> >> >> > #7  0xb76a17ad in master_color () from /usr/lib/libpengine.so.3
> >> >> > #8  0xb7683f3e in stage5 () from /usr/lib/libpengine.so.3
> >> >> > #9  0xb76813aa in do_calculations () from /usr/lib/libpengine.so.3
> >> >> > #10 0xb7680991 in process_pe_message () from
> /usr/lib/libpengine.so.3
> >> >> > #11 0x08048ed5 in ?? ()
> >> >> > #12 0xb7611f4e in G_CH_dispatch_int () from /usr/lib/libplumb.so.2
> >> >> > #13 0xb727f5e5 in g_main_context_dispatch () from
> >> >> > /lib/libglib-2.0.so.0
> >> >> > #14 0xb72832d8 in ?? () from /lib/libglib-2.0.so.0
> >> >> > #15 0xb7283817 in g_main_loop_run () from /lib/libglib-2.0.so.0
> >> >> > #16 0x080494f8 in ?? ()
> >> >> > #17 0xb7337bd6 in __libc_start_main () from
> >> >> > /lib/tls/i686/cmov/libc.so.6
> >> >> > #18 0x08048dc1 in ?? ()
> >> >> >
> >> >> > _______________________________________________
> >> >> > Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> >> >> > http://oss.clusterlabs.org/mailman/listinfo/pacemaker
> >> >> >
> >> >> > Project Home: http://www.clusterlabs.org
> >> >> > Getting started:
> >> >> > http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> >> >> > Bugs:
> >> >> >
> >> >> >
> >> >> >
> http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker
> >> >> >
> >> >> >
> >> >>
> >> >> _______________________________________________
> >> >> Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> >> >> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
> >> >>
> >> >> Project Home: http://www.clusterlabs.org
> >> >> Getting started:
> >> >> http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> >> >> Bugs:
> >> >>
> >> >>
> http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker
> >> >
> >> >
> >> > _______________________________________________
> >> > Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> >> > http://oss.clusterlabs.org/mailman/listinfo/pacemaker
> >> >
> >> > Project Home: http://www.clusterlabs.org
> >> > Getting started:
> http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> >> > Bugs:
> >> >
> >> >
> http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker
> >> >
> >> >
> >>
> >> _______________________________________________
> >> Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> >> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
> >>
> >> Project Home: http://www.clusterlabs.org
> >> Getting started:
> http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> >> Bugs:
> >>
> http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker
> >
> >
> > _______________________________________________
> > Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> > http://oss.clusterlabs.org/mailman/listinfo/pacemaker
> >
> > Project Home: http://www.clusterlabs.org
> > Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> > Bugs:
> >
> http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker
> >
> >
>
> _______________________________________________
> Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
>
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs:
> http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker
>


Thanks, but i have a question

I change my configuration in test inviroment (VMware) to   follow:

root at ubuntu:/home/ruslan# crm configure show
node ubuntu
node ubuntu1
primitive drbd_web ocf:linbit:drbd \
        params drbd_resource="web" \
        op monitor interval="15s" \
        op stop interval="0" timeout="300s" \
        op start interval="0" timeout="300s"
primitive iscsi_ip ocf:heartbeat:IPaddr2 \
        params ip="192.168.83.5" nic="eth1:1" cidr_netmask="24" \
        op monitor interval="10s" \
        meta target-role="Started"
primitive iscsi_lun1_web_target ocf:heartbeat:iSCSILogicalUnit \
        params lun="1" path="/dev/drbd0"
target_iqn="iqn.2010-06.playrix.local:san.web" implementation="iet"
primitive iscsi_target lsb:iscsitarget
primitive iscsi_web_target ocf:heartbeat:iSCSITarget \
        params iqn="iqn.2010-06.playrix.local:san.web" implementation="iet"
\
        op monitor interval="10s" timeout="30s" depth="0" \
        meta target-role="Started"
group iscsi iscsi_ip iscsi_target iscsi_web_target iscsi_lun1_web_target \
        meta target-role="Started"
ms ms_drbd_web drbd_web \
        meta master-max="1" master-node-max="1" clone-max="2"
clone-node-max="1" notify="true" target-role="Started" is-managed="true"
colocation ms_drbd_with_iscsi inf: iscsi ms_drbd_web:Master
order o1 0: ms_drbd_web:promote iscsi_ip
order o2 inf: iscsi_ip iscsi_target:start
order o3 inf: iscsi_target iscsi_web_target
order o4 inf: iscsi_web_target iscsi_lun1_web_target
property $id="cib-bootstrap-options" \
        dc-version="1.0.10-b0266dd5ffa9c51377c68b1f29d6bc84367f51dd" \
        cluster-infrastructure="openais" \
        expected-quorum-votes="2" \
        stonith-enabled="false" \
        no-quorum-policy="ignore" \
        last-lrm-refresh="1292028155"
rsc_defaults $id="rsc-options" \
        resource-stickness="100"


and when start one node (without second) i got follow output:

root at ubuntu:/home/ruslan# crm_mon -1
Stack: openais
Current DC: ubuntu - partition WITHOUT quorum
Version: 1.0.10-b0266dd5ffa9c51377c68b1f29d6bc84367f51dd
2 Nodes configured, 2 expected votes
2 Resources configured.
============

Online: [ ubuntu ]
OFFLINE: [ ubuntu1 ]

 Master/Slave Set: ms_drbd_web
     Slaves: [ ubuntu ]
     Stopped: [ drbd_web:1 ]

Failed actions:
    iscsi_target_monitor_0 (node=ubuntu, call=4, rc=1, status=complete):
unknown error
    iscsi_web_target_monitor_0 (node=ubuntu, call=5, rc=5, status=complete):
not installed
    iscsi_lun1_web_target_monitor_0 (node=ubuntu, call=6, rc=5,
status=complete): not installed


Hm it very strange. As i expected this rules:

          order o1 inf: ms_drbd_web:promote iscsi_ip
order o2 inf: iscsi_ip iscsi_target:start
order o3 inf: iscsi_target iscsi_web_target
order o4 inf: iscsi_web_target iscsi_lun1_web_target

prevent launch and monitor of all resource that depends on drbd, but with
Failed actions I confused, why they monitored?
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.clusterlabs.org/pipermail/pacemaker/attachments/20101211/8f21fd9f/attachment-0001.html>


More information about the Pacemaker mailing list