<div dir="ltr">Thanks for clarifying.<div><br></div><div>Regards,</div><div>Sriram.</div></div><div class="gmail_extra"><br><div class="gmail_quote">On Mon, Aug 14, 2017 at 7:34 PM, Klaus Wenninger <span dir="ltr"><<a href="mailto:kwenning@redhat.com" target="_blank">kwenning@redhat.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div text="#000000" bgcolor="#FFFFFF"><span class="">
<div class="m_-5828583773566347351moz-cite-prefix">On 08/14/2017 03:19 PM, Sriram wrote:<br>
</div>
<blockquote type="cite">
<div dir="ltr">Yes, I had precreated the script file with the
required permission.
<div>
<div><br>
</div>
<div>[root@<b>node1</b> alerts]# ls -l
/usr/share/pacemaker/alert_<wbr>file.sh</div>
<div>-rwxr-xr-x. 1 root root 4140 Aug 14 01:51
/usr/share/pacemaker/alert_<wbr>file.sh</div>
</div>
<div>
<div>
<div> [root@<b>node2</b> alerts]# ls -l
/usr/share/pacemaker/alert_<wbr>file.sh</div>
<div>-rwxr-xr-x. 1 root root 4139 Aug 14 01:51
/usr/share/pacemaker/alert_<wbr>file.sh</div>
</div>
<div>[root@<b>node3</b> alerts]# ls -l
/usr/share/pacemaker/alert_<wbr>file.sh</div>
<div>-rwxr-xr-x. 1 root root 4139 Aug 14 01:51
/usr/share/pacemaker/alert_<wbr>file.sh</div>
</div>
<div><br>
</div>
<div>Later I observed that the user "hacluster" was not able to
create the log file <span style="font-size:12.8px">/usr/share/pacemaker/</span><span style="font-size:12.8px">ale<wbr>rt_file.log</span>.</div>
<div>I am sorry, I should have spotted this in the log before
posting the query. I then changed the path to
/tmp/alert_file.log, and the file is now created.</div>
<div>
<div>Thanks for pointing it out.</div>
</div>
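<div>A minimal sketch of a pre-flight check for this situation, assuming a POSIX shell. The helper name can_create_log is hypothetical (it is not part of Pacemaker or pcs); it reports whether the user running it could append to or create a given log path. Because the alert agent runs as hacluster, the check is only meaningful when run as that user.</div>

```shell
#!/bin/sh
# Hypothetical helper, not part of Pacemaker or pcs.
# Succeeds if the current user can append to the log file, or can
# create it because its directory is writable and searchable.
can_create_log() {
    log_path=$1
    if [ -e "$log_path" ]; then
        [ -w "$log_path" ]
    else
        log_dir=$(dirname "$log_path")
        [ -w "$log_dir" ] && [ -x "$log_dir" ]
    fi
}
```

<div>Sourced and called as hacluster (for instance through sudo -u hacluster, if sudo is available on the node), this should show quickly whether a recipient path such as /tmp/alert_file.log is usable before the cluster ever calls the agent.</div>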
<div><br>
</div>
<div>I have one more question.</div>
<div><br>
</div>
<div>The resource is running on node2:</div>
<div>[root@node2 tmp]# pcs resource<br>
</div>
<div>
<div> TRR (ocf::heartbeat:<wbr>TimingRedundancyRA): Started
node2</div>
</div>
<div><br>
</div>
<div>I then executed the command below to put node2 into standby.</div>
<div>[root@node2 tmp] # pcs node standby node2</div>
<div><br>
</div>
<div>The resource shifted to node3 because of its higher location
constraint score.</div>
<div>
<div>
<div>[root@node2 tmp]# pcs resource</div>
<div> TRR (ocf::heartbeat:<wbr>TimingRedundancyRA): Started
node3.</div>
</div>
</div>
<div><br>
</div>
<div><br>
</div>
<div>The log file was created on node2 (resource stopped)
and node3 (resource started). </div>
<div><br>
</div>
<div>Node1 was not notified about the resource shift; that is, no
log file was created there.</div>
<div>That is because alerts are designed to notify external
agents about cluster events; they are not meant for internal
notifications.</div>
<div><br>
</div>
<div>Is my understanding correct?</div>
</div>
</blockquote>
<br></span>
Quite simple: the crmd on node1 just didn't have anything to do with
shifting the resource<br>
from node2 -> node3. No additional information is passed
between the nodes<br>
just to create a full set of notifications on every node. If you
want to have a full log<br>
(or whatever your alert-agent is doing) in one place, this would be
up to your alert-agent.<div><div class="h5"><br>
<br>
Regards,<br>
Klaus<br>
<br>
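<div>A minimal sketch of what such an alert agent could look like, assuming a POSIX shell. It is not the alert_file.sh used in this thread. Pacemaker passes event details to the agent in CRM_alert_* environment variables; the function name format_alert_line and the line layout are illustrative assumptions. Each node only logs the events its own crmd handled, so collecting everything in one place would still require the recipient to point at a location all nodes can reach, or the agent to forward each line somewhere central.</div>

```shell
#!/bin/sh
# Hypothetical alert agent sketch; not the alert_file.sh from this thread.
# Pacemaker passes event details in CRM_alert_* environment variables.

# Build one log line, prefixed with the name of the node that ran the agent.
format_alert_line() {
    local_node=$(uname -n)
    case "${CRM_alert_kind:-}" in
        resource)
            printf '%s %s: resource %s %s on %s (rc=%s)\n' \
                "${CRM_alert_timestamp:-}" "$local_node" \
                "${CRM_alert_rsc:-}" "${CRM_alert_task:-}" \
                "${CRM_alert_node:-}" "${CRM_alert_rc:-}"
            ;;
        *)
            printf '%s %s: %s event: %s\n' \
                "${CRM_alert_timestamp:-}" "$local_node" \
                "${CRM_alert_kind:-unknown}" "${CRM_alert_desc:-}"
            ;;
    esac
}

# Append only when Pacemaker supplied a recipient (here, the log file path).
if [ -n "${CRM_alert_recipient:-}" ]; then
    format_alert_line >> "$CRM_alert_recipient"
fi
```

<div>Prefixing every line with the local node name keeps the entries distinguishable if several nodes end up writing to the same destination.</div>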
<blockquote type="cite">
<div dir="ltr">
<div> </div>
<div>Regards,<br>
</div>
<div>Sriram.</div>
<div><br>
</div>
<div><br>
</div>
</div>
<div class="gmail_extra"><br>
<div class="gmail_quote">On Mon, Aug 14, 2017 at 5:42 PM, Klaus
Wenninger <span dir="ltr"><<a href="mailto:kwenning@redhat.com" target="_blank">kwenning@redhat.com</a>></span>
wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div text="#000000" bgcolor="#FFFFFF">
<div>
<div class="m_-5828583773566347351h5">
<div class="m_-5828583773566347351m_-4802990479422366187moz-cite-prefix">On
08/14/2017 12:32 PM, Sriram wrote:<br>
</div>
<blockquote type="cite">
<div dir="ltr">Hi Ken,
<div><br>
</div>
<div>I tried the alerts as well, but they do not seem to be
working.</div>
<div><br>
</div>
<div>Please check the configuration below:</div>
<div>
<div>[root@node1 alerts]# pcs config show</div>
<div>Cluster Name:</div>
<div>Corosync Nodes:</div>
<div>Pacemaker Nodes:</div>
<div> node1 node2 node3</div>
<div><br>
</div>
<div>Resources:</div>
<div> Resource: TRR (class=ocf
provider=heartbeat type=TimingRedundancyRA)</div>
<div> Operations: start interval=0s timeout=60s
(TRR-start-interval-0s)</div>
<div> stop interval=0s timeout=20s
(TRR-stop-interval-0s)</div>
<div> monitor interval=10
timeout=20 (TRR-monitor-interval-10)</div>
<div><br>
</div>
<div>Stonith Devices:</div>
<div>Fencing Levels:</div>
<div><br>
</div>
<div>Location Constraints:</div>
<div> Resource: TRR</div>
<div> Enabled on: node1 (score:100)
(id:location-TRR-node1-100)</div>
<div> Enabled on: node2 (score:200)
(id:location-TRR-node2-200)</div>
<div> Enabled on: node3 (score:300)
(id:location-TRR-node3-300)</div>
<div>Ordering Constraints:</div>
<div>Colocation Constraints:</div>
<div>Ticket Constraints:</div>
<div><br>
</div>
<div>Alerts:</div>
<div> Alert: alert_file
(path=/usr/share/pacemaker/ale<wbr>rt_file.sh)</div>
<div> Options: debug_exec_order=false</div>
<div> Meta options: timeout=15s</div>
<div> Recipients:</div>
<div> Recipient: recipient_alert_file_id
(value=/usr/share/pacemaker/al<wbr>ert_file.log)</div>
</div>
</div>
</blockquote>
<br>
</div>
</div>
Did you pre-create the file with proper rights? Be aware
that the alert-agent<br>
is called as user hacluster.<span><br>
<br>
<blockquote type="cite">
<div dir="ltr">
<div>
<div><br>
</div>
<div>Resources Defaults:</div>
<div> resource-stickiness: INFINITY</div>
<div>Operations Defaults:</div>
<div> No defaults set</div>
<div><br>
</div>
<div>Cluster Properties:</div>
<div> cluster-infrastructure: corosync</div>
<div> dc-version: 1.1.15-11.el7_3.4-e174ec8</div>
<div> default-action-timeout: 240</div>
<div> have-watchdog: false</div>
<div> no-quorum-policy: ignore</div>
<div> placement-strategy: balanced</div>
<div> stonith-enabled: false</div>
<div> symmetric-cluster: false</div>
<div><br>
</div>
<div>Quorum:</div>
<div> Options:</div>
</div>
<div><br>
</div>
<div><br>
</div>
<div>/usr/share/pacemaker/alert_fil<wbr>e.sh is
not called when I trigger a failover
scenario.<br>
</div>
<div>Please let me know if I am missing anything. <br>
</div>
</div>
</blockquote>
<br>
</span> Do you get any logs - like for startup of
resources - or nothing at all?<br>
<br>
Regards,<br>
Klaus
<div>
<div class="m_-5828583773566347351h5"><br>
<br>
<blockquote type="cite">
<div dir="ltr">
<div class="gmail_extra"><br>
</div>
<div class="gmail_extra"><br>
</div>
<div class="gmail_extra">Regards,</div>
<div class="gmail_extra">Sriram.</div>
<div class="gmail_extra"><br>
<div class="gmail_quote">On Tue, Aug 8, 2017 at
8:29 PM, Ken Gaillot <span dir="ltr"><<a href="mailto:kgaillot@redhat.com" target="_blank">kgaillot@redhat.com</a>></span>
wrote:<br>
<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<div class="m_-5828583773566347351m_-4802990479422366187gmail-HOEnZb">
<div class="m_-5828583773566347351m_-4802990479422366187gmail-h5">On
Tue, 2017-08-08 at 17:40 +0530, Sriram
wrote:<br>
> Hi Ulrich,<br>
><br>
><br>
> Please see inline.<br>
><br>
> On Tue, Aug 8, 2017 at 2:01 PM,
Ulrich Windl<br>
> <<a href="mailto:Ulrich.Windl@rz.uni-regensburg.de" target="_blank">Ulrich.Windl@rz.uni-regensbur<wbr>g.de</a>>
wrote:<br>
> >>> Sriram <<a href="mailto:sriram.ec@gmail.com" target="_blank">sriram.ec@gmail.com</a>>
wrote on 08.08.2017 at<br>
> 09:30 in message<br>
>
<CAMvdjurcQc6t=ZfGr=cRL25Xq0Je9h9F_TvZXyxVAn3n+Dvcgw@mail.gmail.com>:<br>
> > Hi Ken & Jan,<br>
> ><br>
> > In the cluster we
have, there is only one resource
running.<br>
> It's an opt-in<br>
> > cluster with
resource-stickiness set to INFINITY.<br>
> ><br>
> > Just to clarify my
question, let's take a scenario where<br>
> there are four<br>
> > nodes N1, N2, N3, N4<br>
> > a. N1 comes up first,
starts the cluster.<br>
><br>
> The cluster will start once
it has a quorum.<br>
><br>
> > b. N1 checks that
there is no resource running, so it will<br>
> add the<br>
> > resource (R) with
some location constraint (let's say score<br>
> 100)<br>
> > c. So Resource(R) runs
in N1 now.<br>
> > d. N2 comes up next,
checks that resource(R) is already<br>
> running in N1, so<br>
> > it will update the
location constraint (let's say score 200)<br>
> > e. N3 comes up next,
checks that resource(R) is already<br>
> running in N1, so<br>
> > it will update the
location constraint (let's say score 300)<br>
><br>
> See my remark on quorum
above.<br>
><br>
> Yes you are right, I forgot to
mention it.<br>
><br>
><br>
> > f. N4 comes up next,
checks that resource(R) is already<br>
> running in N1, so<br>
> > it will update the
location constraint (let's say score 400)<br>
> > g. If, for some
reason, N1 goes down, resource (R)
shifts<br>
> to N4 (as its<br>
> > score is higher than
any other node's).<br>
> ><br>
> > In this case, is it
possible to notify nodes N2 and N3 that the<br>
> newly elected<br>
> > active node is N4?<br>
><br>
> What type of notification,
and what would the node do with it?<br>
> Any node in the cluster
always has up to date configuration<br>
> information. So it knows
the status of the other nodes also.<br>
><br>
><br>
> I agree that the node always has
up-to-date configuration information,<br>
> but an application or a thread
needs to poll for that information. Is<br>
> there any way for the
notifications to be received through some<br>
> action function in the RA?<br>
<br>
</div>
</div>
Ah, I misunderstood your situation, I
thought you had a cloned resource.<br>
<br>
For that, the alerts feature (available in
Pacemaker 1.1.15 and later)<br>
might be useful:<br>
<br>
<a href="http://clusterlabs.org/doc/en-US/Pacemaker/1.1-pcs/html-single/Pacemaker_Explained/index.html#idm139900098676896" rel="noreferrer" target="_blank">http://clusterlabs.org/doc/en-<wbr>US/Pacemaker/1.1-pcs/html-sing<wbr>le/Pacemaker_Explained/index.h<wbr>tml#idm139900098676896</a><br>
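<div>A minimal sketch for checking that version requirement on a node, assuming a POSIX shell and a sort that supports -V (as GNU coreutils does). The helper names pacemaker_version and version_at_least are hypothetical; the parsing relies on the first line of the version output looking like "Pacemaker 1.1.17", as in the output quoted in this thread.</div>

```shell
#!/bin/sh
# Hypothetical helpers for checking the Pacemaker version.

# Read "pacemakerd --version" output on stdin and print the version
# from the first line, e.g. "Pacemaker 1.1.17" gives "1.1.17".
pacemaker_version() {
    awk 'NR == 1 { print $2 }'
}

# Succeed if version $1 is at least version $2.
# Assumes a sort that supports -V (version sort), as GNU coreutils does.
version_at_least() {
    lowest=$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)
    [ "$lowest" = "$2" ]
}
```

<div>On a cluster node this could be used as: version_at_least "$(pacemakerd --version | pacemaker_version)" 1.1.15 && echo "alerts available".</div>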
<div class="m_-5828583773566347351m_-4802990479422366187gmail-HOEnZb">
<div class="m_-5828583773566347351m_-4802990479422366187gmail-h5"><br>
<br>
><br>
><br>
> Regards,<br>
> Sriram.<br>
><br>
> ><br>
> > I went through clone
notifications and master-slave; it looks<br>
> like it either<br>
> > requires identical
resources (anonymous) or unique or<br>
> stateful resources to<br>
> > be running<br>
> > on all the nodes of
the cluster, whereas in our case there<br>
> is only one<br>
> > resource running in
the whole cluster.<br>
><br>
> Maybe the main reason for
not having notifications is that if<br>
> a node fails hard, it won't
be able to send out much status<br>
> information to the other
nodes.<br>
><br>
> Regards,<br>
> Ulrich<br>
><br>
> ><br>
> > Regards,<br>
> > Sriram.<br>
> ><br>
> ><br>
> ><br>
> ><br>
> > On Mon, Aug 7, 2017 at
11:28 AM, Sriram<br>
> <<a href="mailto:sriram.ec@gmail.com" target="_blank">sriram.ec@gmail.com</a>>
wrote:<br>
> ><br>
> >><br>
> >> Thanks Ken, Jan.
Will look into the clone notifications.<br>
> >><br>
> >> Regards,<br>
> >> Sriram.<br>
> >><br>
> >> On Sat, Aug 5,
2017 at 1:25 AM, Ken Gaillot<br>
> <<a href="mailto:kgaillot@redhat.com" target="_blank">kgaillot@redhat.com</a>>
wrote:<br>
> >><br>
> >>> On Thu,
2017-08-03 at 12:31 +0530, Sriram wrote:<br>
> >>> ><br>
> >>> > Hi Team,<br>
> >>> ><br>
> >>> ><br>
> >>> > We have a
four node cluster (1 active : 3 standby)
in<br>
> our lab for a<br>
> >>> >
particular service. If the active node
goes down, one of<br>
> the three<br>
> >>> > standby
nodes becomes active. Now there will be
(1<br>
> active : 2<br>
> >>> > standby :
1 offline).<br>
> >>> ><br>
> >>> ><br>
> >>> > Is there
any way for this newly elected node to
send a<br>
> notification to<br>
> >>> > the
remaining 2 standby nodes about its new
status?<br>
> >>><br>
> >>> Hi Sriram,<br>
> >>><br>
> >>> This depends
on how your service is configured in the<br>
> cluster.<br>
> >>><br>
> >>> If you have a
clone or master/slave resource, then
clone<br>
> notifications<br>
> >>> are probably
what you want (not alerts, which is the
path<br>
> you were going<br>
> >>> down -- alerts
are designed to e.g. email a system<br>
> administrator after<br>
> >>> an important
event).<br>
> >>><br>
> >>> For details
about clone notifications, see:<br>
> >>><br>
> >>><br>
> >>> <a href="http://clusterlabs.org/doc/en-US/Pacemaker/1.1-pcs/html-single/Pacemaker_Explained/index.html#_clone_resource_agent_requirements" rel="noreferrer" target="_blank">http://clusterlabs.org/doc/en-US/Pacemaker/1.1-pcs/html-single/Pacemaker_Explained/index.html#_clone_resource_agent_requirements</a><br>
> >>><br>
> >>> The RA must
support the "notify" action, which will
be<br>
> called when a<br>
> >>> clone instance
is started or stopped. See the similar<br>
> section later for<br>
> >>> master/slave
resources for additional information.
See the<br>
> mysql or<br>
> >>> pgsql resource
agents for examples of notify<br>
> implementations.<br>
> >>><br>
> >>> > I was
exploring "notification agent" and
"notification<br>
> recipient"<br>
> >>> > features,
but that doesn't seem to<br>
> work.
/etc/sysconfig/notify.sh<br>
> >>> > doesn't
get invoked even in the newly elected
active<br>
> node.<br>
> >>><br>
> >>> Yep, that's
something different altogether -- it's
only<br>
> enabled on RHEL<br>
> >>> systems, and
solely for backward compatibility with
an<br>
> early<br>
> >>> implementation
of the alerts interface. The new alerts<br>
> interface is more<br>
> >>> flexible, but
it's not designed to send information<br>
> between cluster<br>
> >>> nodes -- it's
designed to send information to
something<br>
> external to the<br>
> >>> cluster, such
as a human, or an SNMP server, or a<br>
> monitoring system.<br>
> >>><br>
> >>><br>
> >>> > Cluster
Properties:<br>
> >>> >
cluster-infrastructure: corosync<br>
> >>> >
dc-version: 1.1.17-e2e6cdce80<br>
> >>> >
default-action-timeout: 240<br>
> >>> >
have-watchdog: false<br>
> >>> >
no-quorum-policy: ignore<br>
> >>> >
notification-agent:
/etc/sysconfig/notify.sh<br>
> >>> >
notification-recipient:
/var/log/notify.log<br>
> >>> >
placement-strategy: balanced<br>
> >>> >
stonith-enabled: false<br>
> >>> >
symmetric-cluster: false<br>
> >>> ><br>
> >>> ><br>
> >>> ><br>
> >>> ><br>
> >>> > I am using
the following versions of pacemaker and<br>
> corosync.<br>
> >>> ><br>
> >>> ><br>
> >>> > /usr/sbin
# ./pacemakerd --version<br>
> >>> > Pacemaker
1.1.17<br>
> >>> > Written
by Andrew Beekhof<br>
> >>> > /usr/sbin
# ./corosync -v<br>
> >>> > Corosync
Cluster Engine, version '2.3.5'<br>
> >>> > Copyright
(c) 2006-2009 Red Hat, Inc.<br>
> >>> ><br>
> >>> ><br>
> >>> > Can you
please tell me if I am doing anything
wrong or if<br>
> there are any<br>
> >>> > other
mechanisms to achieve this?<br>
> >>> ><br>
> >>> ><br>
> >>> > Regards,<br>
> >>> > Sriram.<br>
<br>
</div>
</div>
<div class="m_-5828583773566347351m_-4802990479422366187gmail-HOEnZb">
<div class="m_-5828583773566347351m_-4802990479422366187gmail-h5">--<br>
Ken Gaillot <<a href="mailto:kgaillot@redhat.com" target="_blank">kgaillot@redhat.com</a>><br>
<br>
<br>
<br>
<br>
<br>
______________________________<wbr>_________________<br>
Users mailing list: <a href="mailto:Users@clusterlabs.org" target="_blank">Users@clusterlabs.org</a><br>
<a href="http://lists.clusterlabs.org/mailman/listinfo/users" rel="noreferrer" target="_blank">http://lists.clusterlabs.org/m<wbr>ailman/listinfo/users</a><br>
<br>
Project Home: <a href="http://www.clusterlabs.org" rel="noreferrer" target="_blank">http://www.clusterlabs.org</a><br>
Getting started: <a href="http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf" rel="noreferrer" target="_blank">http://www.clusterlabs.org/doc<wbr>/Cluster_from_Scratch.pdf</a><br>
Bugs: <a href="http://bugs.clusterlabs.org" rel="noreferrer" target="_blank">http://bugs.clusterlabs.org</a><br>
</div>
</div>
</blockquote>
</div>
<br>
</div>
</div>
<br>
</blockquote>
<br>
</div>
</div>
</div>
</blockquote>
</div>
<br>
</div>
</blockquote>
<br>
</div></div></div>
</blockquote></div><br></div>