<div dir="ltr">Thanks for clarifying.<div><br></div><div>Regards,</div><div>Sriram.</div></div><div class="gmail_extra"><br><div class="gmail_quote">On Mon, Aug 14, 2017 at 7:34 PM, Klaus Wenninger <span dir="ltr"><<a href="mailto:kwenning@redhat.com" target="_blank">kwenning@redhat.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
  
    
  
  <div text="#000000" bgcolor="#FFFFFF"><span class="">
    <div class="m_-5828583773566347351moz-cite-prefix">On 08/14/2017 03:19 PM, Sriram wrote:<br>
    </div>
    <blockquote type="cite">
      <div dir="ltr">Yes, I had pre-created the script file with the
        required permissions. 
        <div>
          <div><br>
          </div>
          <div>[root@<b>node1</b> alerts]# ls -l
            /usr/share/pacemaker/alert_<wbr>file.sh</div>
          <div>-rwxr-xr-x. 1 root root 4140 Aug 14 01:51
            /usr/share/pacemaker/alert_<wbr>file.sh</div>
        </div>
        <div>
          <div>
            <div> [root@<b>node2</b> alerts]# ls -l
              /usr/share/pacemaker/alert_<wbr>file.sh</div>
            <div>-rwxr-xr-x. 1 root root 4139 Aug 14 01:51
              /usr/share/pacemaker/alert_<wbr>file.sh</div>
          </div>
          <div>[root@<b>node3</b> alerts]# ls -l
            /usr/share/pacemaker/alert_<wbr>file.sh</div>
          <div>-rwxr-xr-x. 1 root root 4139 Aug 14 01:51
            /usr/share/pacemaker/alert_<wbr>file.sh</div>
        </div>
        <div><br>
        </div>
        <div>Later I noticed that the user "hacluster" was not able to
          create the log file <span style="font-size:12.8px">/usr/share/pacemaker/</span><span style="font-size:12.8px">ale<wbr>rt_file.log.</span></div>
        <div>I am sorry, I should have spotted this in the log before
          posting the query. I then changed the path to
          /tmp/alert_file.log, and the file is now created.</div>
        <div>
          <div>Thanks for pointing it out.</div>
        </div>
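        <div>The permission problem described above can be checked ahead of time. A minimal sketch, assuming a hypothetical helper named can_create_log that is run as the hacluster user; it only tests ordinary file permissions for the calling user and does not account for SELinux policy:</div>
<pre>
```shell
#!/bin/sh
# Hypothetical helper: succeeds when the calling user could append to the
# log file at $1, either because the file exists and is writable or because
# its directory allows creating it.
can_create_log() {
    log_path="$1"
    if [ -e "$log_path" ]; then
        [ -w "$log_path" ]
    else
        log_dir=$(dirname "$log_path")
        # Creating a file needs write and search permission on the directory.
        [ -d "$log_dir" ] || return 1
        [ -w "$log_dir" ] || return 1
        [ -x "$log_dir" ]
    fi
}

# Check the path given as the first argument, if any.
if [ "$#" -gt 0 ]; then
    if can_create_log "$1"; then
        echo "writable: $1"
    else
        echo "not writable: $1"
    fi
fi
```
</pre>
        <div>Saved under a hypothetical name such as check_log.sh, it could be run as the cluster user, for example: sudo -u hacluster sh check_log.sh /usr/share/pacemaker/alert_file.log</div>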
        <div><br>
        </div>
        <div>I have one more question.</div>
        <div><br>
        </div>
        <div>If the resource is running on node2,</div>
        <div>[root@node2 tmp]# pcs resource<br>
        </div>
        <div>
          <div> TRR    (ocf::heartbeat:<wbr>TimingRedundancyRA):    Started
            node2</div>
        </div>
        <div><br>
        </div>
        <div>I then ran the command below to put node2 into standby.</div>
        <div>[root@node2 tmp]# pcs node standby node2</div>
        <div><br>
        </div>
        <div>The resource moved to node3 because of its higher location
          constraint score.</div>
        <div>
          <div>
            <div>[root@node2 tmp]# pcs resource</div>
            <div> TRR    (ocf::heartbeat:<wbr>TimingRedundancyRA):    Started
              node3.</div>
          </div>
        </div>
        <div><br>
        </div>
        <div><br>
        </div>
        <div>The log file was created on node2 (resource stopped)
          and node3 (resource started). </div>
        <div><br>
        </div>
        <div>Node1 was not notified about the resource move; that is, no
          log file was created there.</div>
        <div>I assume this is because alerts are designed to notify external
          agents about cluster events. They are not meant for internal
          notifications.</div>
        <div><br>
        </div>
        <div>Is my understanding correct?</div>
      </div>
    </blockquote>
    <br></span>
    Quite simple: the crmd on node1 just didn't have anything to do with
    moving the resource<br>
    from node2 -> node3. No additional information is passed
    between the nodes<br>
    just to create a full set of notifications on every node. If you
    want to have a full log<br>
    (or whatever your alert-agent is doing) in one place, this would be
    up to your alert-agent.<div><div class="h5"><br>
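    <br>
    A minimal sketch of an agent along those lines, assuming Pacemaker's documented CRM_alert_* environment variables. The function names and the /tmp/alert_file.log fallback are hypothetical, and copying the lines to one central place (for example a remote syslog server) is left out here:<br>
<pre>
```shell
#!/bin/sh
# Hypothetical alert agent sketch; not the alert_file.sh sample shipped with Pacemaker.
# Pacemaker hands event details to alert agents in CRM_alert_* environment variables.

format_alert_line() {
    # Tag every line with the reporting host so that logs merged from
    # several nodes still show which node's crmd raised the event.
    printf '%s reporter=%s kind=%s node=%s rsc=%s task=%s desc=%s\n' \
        "${CRM_alert_timestamp:-no-timestamp}" \
        "$(uname -n)" \
        "${CRM_alert_kind:-unknown}" \
        "${CRM_alert_node:-}" \
        "${CRM_alert_rsc:-}" \
        "${CRM_alert_task:-}" \
        "${CRM_alert_desc:-}"
}

record_alert() {
    # CRM_alert_recipient carries the recipient value configured for the alert;
    # the fallback path is an assumption for illustration only.
    target="${CRM_alert_recipient:-/tmp/alert_file.log}"
    format_alert_line >> "$target"
}

# Act only when Pacemaker invoked the agent with an event to report.
if [ -n "${CRM_alert_kind:-}" ]; then
    record_alert
fi
```
</pre>
    Each node still only reports the events its own crmd handled, so collecting the per-node lines in one place would be the job of whatever the agent writes to.<br>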
    <br>
    Regards,<br>
    Klaus<br>
    <br>
    <blockquote type="cite">
      <div dir="ltr">
        <div> </div>
        <div>Regards,<br>
        </div>
        <div>Sriram.</div>
        <div><br>
        </div>
        <div><br>
        </div>
      </div>
      <div class="gmail_extra"><br>
        <div class="gmail_quote">On Mon, Aug 14, 2017 at 5:42 PM, Klaus
          Wenninger <span dir="ltr"><<a href="mailto:kwenning@redhat.com" target="_blank">kwenning@redhat.com</a>></span>
          wrote:<br>
          <blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
            <div text="#000000" bgcolor="#FFFFFF">
              <div>
                <div class="m_-5828583773566347351h5">
                  <div class="m_-5828583773566347351m_-4802990479422366187moz-cite-prefix">On
                    08/14/2017 12:32 PM, Sriram wrote:<br>
                  </div>
                  <blockquote type="cite">
                    <div dir="ltr">Hi Ken,
                      <div><br>
                      </div>
                      <div>I tried the alerts as well, but they do not seem
                        to be working.</div>
                      <div><br>
                      </div>
                      <div>Please check the configuration below:</div>
                      <div>
                        <div>[root@node1 alerts]# pcs config show</div>
                        <div>Cluster Name:</div>
                        <div>Corosync Nodes:</div>
                        <div>Pacemaker Nodes:</div>
                        <div> node1 node2 node3</div>
                        <div><br>
                        </div>
                        <div>Resources:</div>
                        <div> Resource: TRR (class=ocf
                          provider=heartbeat type=TimingRedundancyRA)</div>
                        <div>  Operations: start interval=0s timeout=60s
                          (TRR-start-interval-0s)</div>
                        <div>              stop interval=0s timeout=20s
                          (TRR-stop-interval-0s)</div>
                        <div>              monitor interval=10
                          timeout=20 (TRR-monitor-interval-10)</div>
                        <div><br>
                        </div>
                        <div>Stonith Devices:</div>
                        <div>Fencing Levels:</div>
                        <div><br>
                        </div>
                        <div>Location Constraints:</div>
                        <div>  Resource: TRR</div>
                        <div>    Enabled on: node1 (score:100)
                          (id:location-TRR-node1-100)</div>
                        <div>    Enabled on: node2 (score:200)
                          (id:location-TRR-node2-200)</div>
                        <div>    Enabled on: node3 (score:300)
                          (id:location-TRR-node3-300)</div>
                        <div>Ordering Constraints:</div>
                        <div>Colocation Constraints:</div>
                        <div>Ticket Constraints:</div>
                        <div><br>
                        </div>
                        <div>Alerts:</div>
                        <div> Alert: alert_file
                          (path=/usr/share/pacemaker/ale<wbr>rt_file.sh)</div>
                        <div>  Options: debug_exec_order=false</div>
                        <div>  Meta options: timeout=15s</div>
                        <div>  Recipients:</div>
                        <div>   Recipient: recipient_alert_file_id
                          (value=/usr/share/pacemaker/al<wbr>ert_file.log)</div>
                      </div>
                    </div>
                  </blockquote>
                  <br>
                </div>
              </div>
              Did you pre-create the file with proper rights? Be aware
              that the alert-agent<br>
              is called as user hacluster.<span><br>
                <br>
                <blockquote type="cite">
                  <div dir="ltr">
                    <div>
                      <div><br>
                      </div>
                      <div>Resources Defaults:</div>
                      <div> resource-stickiness: INFINITY</div>
                      <div>Operations Defaults:</div>
                      <div> No defaults set</div>
                      <div><br>
                      </div>
                      <div>Cluster Properties:</div>
                      <div> cluster-infrastructure: corosync</div>
                      <div> dc-version: 1.1.15-11.el7_3.4-e174ec8</div>
                      <div> default-action-timeout: 240</div>
                      <div> have-watchdog: false</div>
                      <div> no-quorum-policy: ignore</div>
                      <div> placement-strategy: balanced</div>
                      <div> stonith-enabled: false</div>
                      <div> symmetric-cluster: false</div>
                      <div><br>
                      </div>
                      <div>Quorum:</div>
                      <div>  Options:</div>
                    </div>
                    <div><br>
                    </div>
                    <div><br>
                    </div>
                    <div>/usr/share/pacemaker/alert_fil<wbr>e.sh does
                      not get called whenever I trigger a scenario for
                      failover.<br>
                    </div>
                    <div>Please let me know if I'm missing anything. <br>
                    </div>
                  </div>
                </blockquote>
                <br>
              </span> Do you get any logs - like for startup of
              resources - or nothing at all?<br>
              <br>
              Regards,<br>
              Klaus
              <div>
                <div class="m_-5828583773566347351h5"><br>
                  <br>
                  <blockquote type="cite">
                    <div dir="ltr">
                      <div class="gmail_extra"><br>
                      </div>
                      <div class="gmail_extra"><br>
                      </div>
                      <div class="gmail_extra">Regards,</div>
                      <div class="gmail_extra">Sriram.</div>
                      <div class="gmail_extra"><br>
                        <div class="gmail_quote">On Tue, Aug 8, 2017 at
                          8:29 PM, Ken Gaillot <span dir="ltr"><<a href="mailto:kgaillot@redhat.com" target="_blank">kgaillot@redhat.com</a>></span>
                          wrote:<br>
                          <blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
                            <div class="m_-5828583773566347351m_-4802990479422366187gmail-HOEnZb">
                              <div class="m_-5828583773566347351m_-4802990479422366187gmail-h5">On
                                Tue, 2017-08-08 at 17:40 +0530, Sriram
                                wrote:<br>
                                > Hi Ulrich,<br>
                                ><br>
                                ><br>
                                > Please see inline.<br>
                                ><br>
                                > On Tue, Aug 8, 2017 at 2:01 PM,
                                Ulrich Windl<br>
                                > <<a href="mailto:Ulrich.Windl@rz.uni-regensburg.de" target="_blank">Ulrich.Windl@rz.uni-regensbur<wbr>g.de</a>>
                                wrote:<br>
                                >         >>> Sriram <<a href="mailto:sriram.ec@gmail.com" target="_blank">sriram.ec@gmail.com</a>>
                                wrote on 08.08.2017
                                at<br>
                                >         09:30 in message<br>
                                >       
                                 <CAMvdjurcQc6t=ZfGr=cRL25Xq0J<wbr>e9h9F_TvZXyxVAn3n<br>
                                >         +<a href="mailto:Dvcgw@mail.gmail.com" target="_blank">Dvcgw@mail.gmail.com</a>>:<br>
                                >         > Hi Ken & Jan,<br>
                                >         ><br>
                                >         > In the cluster we
                                have, there is only one resource
                                running.<br>
                                >         It's an OPT-IN<br>
                                >         > cluster with
                                resource-stickiness set to INFINITY.<br>
                                >         ><br>
                                >         > Just to clarify my
                                question, let's take a scenario where<br>
                                >         there are four<br>
                                >         > nodes N1, N2, N3, N4<br>
                                >         > a. N1 comes up first,
                                starts the cluster.<br>
                                ><br>
                                >         The cluster will start once
                                it has a quorum.<br>
                                ><br>
                                >         > b. N1 Checks that
                                there is no resource running, so it will<br>
                                >         add the<br>
                                >         > resource (R) with
                                some location constraint (let's say score<br>
                                >         100)<br>
                                >         > c. So Resource(R) runs
                                in N1 now.<br>
                                >         > d. N2 comes up next,
                                checks that resource(R) is already<br>
                                >         running in N1, so<br>
                                >         > it will update the
                                location constraint (let's say score 200)<br>
                                >         > e. N3 comes up next,
                                checks that resource(R) is already<br>
                                >         running in N1, so<br>
                                >         > it will update the
                                location constraint (let's say score 300)<br>
                                ><br>
                                >         See my remark on quorum
                                above.<br>
                                ><br>
                                > Yes you are right, I forgot to
                                mention it.<br>
                                ><br>
                                ><br>
                                >         > f.  N4 comes up next,
                                checks that resource(R) is already<br>
                                >         running in N1, so<br>
                                >         > it will update the
                                location constraint (let's say score 400)<br>
                                >         > g. If N1 goes down for some
                                reason, resource (R)
                                moves<br>
                                >         to N4 (as its<br>
                                >         > score is higher than
                                any other node's).<br>
                                >         ><br>
                                >         > In this case is it
                                possible to notify the nodes N2, N3 that<br>
                                >         newly elected<br>
                                >         > active node is N4 ?<br>
                                ><br>
                                >         What type of notification,
                                and what would the node do with it?<br>
                                >         Any node in the cluster
                                always has up to date configuration<br>
                                >         information. So it knows
                                the status of the other nodes also.<br>
                                ><br>
                                ><br>
                                > I agree that the node always has
                                up-to-date configuration information,<br>
                                > but an application or a thread
                                needs to poll for that information. Is<br>
                                > there any way to receive the
                                notifications through some<br>
                                > action function in the RA?<br>
                                <br>
                              </div>
                            </div>
                            Ah, I misunderstood your situation, I
                            thought you had a cloned resource.<br>
                            <br>
                            For that, the alerts feature (available in
                            Pacemaker 1.1.15 and later)<br>
                            might be useful:<br>
                            <br>
                            <a href="http://clusterlabs.org/doc/en-US/Pacemaker/1.1-pcs/html-single/Pacemaker_Explained/index.html#idm139900098676896" rel="noreferrer" target="_blank">http://clusterlabs.org/doc/en-<wbr>US/Pacemaker/1.1-pcs/html-sing<wbr>le/Pacemaker_Explained/index.h<wbr>tml#idm139900098676896</a><br>
                            <div class="m_-5828583773566347351m_-4802990479422366187gmail-HOEnZb">
                              <div class="m_-5828583773566347351m_-4802990479422366187gmail-h5"><br>
                                <br>
                                ><br>
                                ><br>
                                > Regards,<br>
                                > Sriram.<br>
                                ><br>
                                >         ><br>
                                >         > I went through clone
                                notifications and master-slave, looks<br>
                                >         like it either<br>
                                >         > requires identical
                                resources(Anonymous) or Unique or<br>
                                >         Stateful resources to<br>
                                >         > be running<br>
                                >         > in all the nodes of
                                the cluster, whereas in our case there<br>
                                >         is only one<br>
                                >         > resource running in
                                the whole cluster.<br>
                                ><br>
                                >         Maybe the main reason for
                                not having notifications is that if<br>
                                >         a node fails hard, it won't
                                be able to send out much status<br>
                                >         information to the other
                                nodes.<br>
                                ><br>
                                >         Regards,<br>
                                >         Ulrich<br>
                                ><br>
                                >         ><br>
                                >         > Regards,<br>
                                >         > Sriram.<br>
                                >         ><br>
                                >         ><br>
                                >         ><br>
                                >         ><br>
                                >         > On Mon, Aug 7, 2017 at
                                11:28 AM, Sriram<br>
                                >         <<a href="mailto:sriram.ec@gmail.com" target="_blank">sriram.ec@gmail.com</a>>
                                wrote:<br>
                                >         ><br>
                                >         >><br>
                                >         >> Thanks Ken, Jan.
                                Will look into the clone notifications.<br>
                                >         >><br>
                                >         >> Regards,<br>
                                >         >> Sriram.<br>
                                >         >><br>
                                >         >> On Sat, Aug 5,
                                2017 at 1:25 AM, Ken Gaillot<br>
                                >         <<a href="mailto:kgaillot@redhat.com" target="_blank">kgaillot@redhat.com</a>>
                                wrote:<br>
                                >         >><br>
                                >         >>> On Thu,
                                2017-08-03 at 12:31 +0530, Sriram wrote:<br>
                                >         >>> ><br>
                                >         >>> > Hi Team,<br>
                                >         >>> ><br>
                                >         >>> ><br>
                                >         >>> > We have a
                                four node cluster (1 active : 3 standby)
                                in<br>
                                >         our lab for a<br>
                                >         >>> >
                                particular service. If the active node
                                goes down, one of<br>
                                >         the three<br>
                                >         >>> > standby
                                nodes becomes active. Now there will be
                                (1<br>
                                >         active :  2<br>
                                >         >>> > standby :
                                1 offline).<br>
                                >         >>> ><br>
                                >         >>> ><br>
                                >         >>> > Is there
                                any way where this newly elected node
                                sends<br>
                                >         notification to<br>
                                >         >>> > the
                                remaining 2 standby nodes about its new
                                status ?<br>
                                >         >>><br>
                                >         >>> Hi Sriram,<br>
                                >         >>><br>
                                >         >>> This depends
                                on how your service is configured in the<br>
                                >         cluster.<br>
                                >         >>><br>
                                >         >>> If you have a
                                clone or master/slave resource, then
                                clone<br>
                                >         notifications<br>
                                >         >>> is probably
                                what you want (not alerts, which is the
                                path<br>
                                >         you were going<br>
                                >         >>> down -- alerts
                                are designed to e.g. email a system<br>
                                >         administrator after<br>
                                >         >>> an important
                                event).<br>
                                >         >>><br>
                                >         >>> For details
                                about clone notifications, see:<br>
                                >         >>><br>
                                >         >>><br>
                                >         <a href="http://clusterlabs.org/doc/en-US/Pacemaker/1.1-pcs/html-sing" rel="noreferrer" target="_blank">http://clusterlabs.org/doc/en<wbr>-US/Pacemaker/1.1-pcs/html-sin<wbr>g</a><br>
                                >         >>><br>
                                >       
                                 le/Pacemaker_Explained/index.<wbr>html#_clone_resource_agent_req<wbr>uirements<br>
                                >         >>><br>
                                >         >>> The RA must
                                support the "notify" action, which will
                                be<br>
                                >         called when a<br>
                                >         >>> clone instance
                                is started or stopped. See the similar<br>
                                >         section later for<br>
                                >         >>> master/slave
                                resources for additional information.
                                See the<br>
                                >         mysql or<br>
                                >         >>> pgsql resource
                                agents for examples of notify<br>
                                >         implementations.<br>
                                >         >>><br>
                                >         >>> > I was
                                exploring "notification agent" and
                                "notification<br>
                                >         recipient"<br>
                                >         >>> > features,
                                but that doesn't seem to<br>
                                >         work.
                                /etc/sysconfig/notify.sh<br>
                                >         >>> > doesn't
                                get invoked even in the newly elected
                                active<br>
                                >         node.<br>
                                >         >>><br>
                                >         >>> Yep, that's
                                something different altogether -- it's
                                only<br>
                                >         enabled on RHEL<br>
                                >         >>> systems, and
                                solely for backward compatibility with
                                an<br>
                                >         early<br>
                                >         >>> implementation
                                of the alerts interface. The new alerts<br>
                                >         interface is more<br>
                                >         >>> flexible, but
                                it's not designed to send information<br>
                                >         between cluster<br>
                                >         >>> nodes -- it's
                                designed to send information to
                                something<br>
                                >         external to the<br>
                                >         >>> cluster, such
                                as a human, or an SNMP server, or a<br>
                                >         monitoring system.<br>
                                >         >>><br>
                                >         >>><br>
                                >         >>> > Cluster
                                Properties:<br>
                                >         >>> > 
                                cluster-infrastructure: corosync<br>
                                >         >>> > 
                                dc-version: 1.1.17-e2e6cdce80<br>
                                >         >>> > 
                                default-action-timeout: 240<br>
                                >         >>> > 
                                have-watchdog: false<br>
                                >         >>> > 
                                no-quorum-policy: ignore<br>
                                >         >>> > 
                                notification-agent:
                                /etc/sysconfig/notify.sh<br>
                                >         >>> > 
                                notification-recipient:
                                /var/log/notify.log<br>
                                >         >>> > 
                                placement-strategy: balanced<br>
                                >         >>> > 
                                stonith-enabled: false<br>
                                >         >>> > 
                                symmetric-cluster: false<br>
                                >         >>> ><br>
                                >         >>> ><br>
                                >         >>> ><br>
                                >         >>> ><br>
                                >         >>> > I'm using
                                the following versions of pacemaker and<br>
                                >         corosync.<br>
                                >         >>> ><br>
                                >         >>> ><br>
                                >         >>> > /usr/sbin
                                # ./pacemakerd --version<br>
                                >         >>> > Pacemaker
                                1.1.17<br>
                                >         >>> > Written
                                by Andrew Beekhof<br>
                                >         >>> > /usr/sbin
                                # ./corosync -v<br>
                                >         >>> > Corosync
                                Cluster Engine, version '2.3.5'<br>
                                >         >>> > Copyright
                                (c) 2006-2009 Red Hat, Inc.<br>
                                >         >>> ><br>
                                >         >>> ><br>
                                >         >>> > Can you
                                please suggest if I'm doing anything
                                wrong or if<br>
                                >         there are any<br>
                                >         >>> > other
                                mechanisms to achieve this ?<br>
                                >         >>> ><br>
                                >         >>> ><br>
                                >         >>> > Regards,<br>
                                >         >>> > Sriram.<br>
                                <br>
                              </div>
                            </div>
                            <div class="m_-5828583773566347351m_-4802990479422366187gmail-HOEnZb">
                              <div class="m_-5828583773566347351m_-4802990479422366187gmail-h5">--<br>
                                Ken Gaillot <<a href="mailto:kgaillot@redhat.com" target="_blank">kgaillot@redhat.com</a>><br>
                                <br>
                                <br>
                                <br>
                                <br>
                                <br>
                                ______________________________<wbr>_________________<br>
                                Users mailing list: <a href="mailto:Users@clusterlabs.org" target="_blank">Users@clusterlabs.org</a><br>
                                <a href="http://lists.clusterlabs.org/mailman/listinfo/users" rel="noreferrer" target="_blank">http://lists.clusterlabs.org/m<wbr>ailman/listinfo/users</a><br>
                                <br>
                                Project Home: <a href="http://www.clusterlabs.org" rel="noreferrer" target="_blank">http://www.clusterlabs.org</a><br>
                                Getting started: <a href="http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf" rel="noreferrer" target="_blank">http://www.clusterlabs.org/doc<wbr>/Cluster_from_Scratch.pdf</a><br>
                                Bugs: <a href="http://bugs.clusterlabs.org" rel="noreferrer" target="_blank">http://bugs.clusterlabs.org</a><br>
                              </div>
                            </div>
                          </blockquote>
                        </div>
                        <br>
                      </div>
                    </div>
                    <br>
                  </blockquote>
                  <br>
                </div>
              </div>
            </div>
          </blockquote>
        </div>
        <br>
      </div>
    </blockquote>
    <br>
  </div></div></div>

</blockquote></div><br></div>