<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=us-ascii">
<meta name="Generator" content="Microsoft Word 15 (filtered medium)">
<style><!--
/* Font Definitions */
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0in;
margin-bottom:.0001pt;
font-size:11.0pt;
font-family:"Calibri",sans-serif;}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:#0563C1;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{mso-style-priority:99;
color:#954F72;
text-decoration:underline;}
span.EmailStyle17
{mso-style-type:personal-compose;
font-family:"Calibri",sans-serif;
color:windowtext;}
.MsoChpDefault
{mso-style-type:export-only;
font-family:"Calibri",sans-serif;}
@page WordSection1
{size:8.5in 11.0in;
margin:1.0in 1.0in 1.0in 1.0in;}
div.WordSection1
{page:WordSection1;}
--></style>
</head>
<body lang="EN-US" link="#0563C1" vlink="#954F72">
<div class="WordSection1">
<p class="MsoNormal">I thought I would share an experience with the community. We have RHEL 7.4 clusters that use the heartbeat LVM resource (HA-LVM volume group). The LVM resource runs a "vgscan --cache" command as part of its monitoring routine.
<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">We have found that running pvmove with the "-i0" option blocks the vgscan command (and most likely any other LVM command). The pvmove does not have to target a physical volume managed by RHCS; running it against any physical volume on the node is enough. In
our case, the node where the pvmove was running was evicted from the cluster.<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Blocking command: <span style="font-family:'Courier New'">
pvmove -v -i0 -n /dev/testvg/testlv00 /dev/mapper/mpathd1 /dev/mapper/mpaths1</span><o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
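<p class="MsoNormal">One way to confirm the blocking while the pvmove is running is to put a time limit on the LVM command. The sketch below is only an illustration: the function name probe_blocked is hypothetical, it relies on the coreutils timeout command (exit status 124 when the limit is reached), and it assumes the blocked command can still be terminated by a signal.</p>
<pre>
```shell
#!/bin/sh
# Hypothetical probe: run a command under a time limit and report
# whether it finished or had to be stopped. Relies on coreutils `timeout`,
# which exits with status 124 when the limit is reached.
probe_blocked() {
    limit="$1"      # time limit in seconds
    shift
    timeout "$limit" "$@"
    rc=$?
    if [ "$rc" -eq 124 ]; then
        echo "BLOCKED: '$*' did not finish within ${limit}s"
        return 1
    fi
    echo "OK: '$*' exited with status $rc"
    return 0
}
```
</pre>
<p class="MsoNormal">For example, "probe_blocked 30 vgscan --cache" run in a second shell while the pvmove is active should print a BLOCKED line if the behavior described above is present.</p>
<p class="MsoNormal"><o:p>&nbsp;</o:p></p>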
<p class="MsoNormal">When we tested without the -i0 option, or with -iX where X is non-zero, the pvmove did not block vgscan commands.<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
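<p class="MsoNormal">Given that observation, a small guard around pvmove can catch a zero interval before it reaches a cluster node. This is a minimal sketch, not a tested fix: the function name guarded_pvmove and the PVMOVE_RUN variable are hypothetical, and it only recognizes the spellings -i0, -i 0, --interval 0 and --interval=0 (not combined short options such as -vi0).</p>
<pre>
```shell
#!/bin/sh
# Hypothetical helper: refuse to run pvmove with a zero polling interval.
# Recognizes -i0, "-i 0", "--interval 0" and --interval=0 only.
# By default it just prints the command; set PVMOVE_RUN=1 to execute it.
guarded_pvmove() {
    prev=""
    for arg in "$@"; do
        case "$arg" in
            -i0|--interval=0)
                echo "refusing: pvmove with interval 0"
                return 2
                ;;
        esac
        # Interval given as a separate argument after -i or --interval
        if [ "$arg" = "0" ]; then
            case "$prev" in
                -i|--interval)
                    echo "refusing: pvmove with interval 0"
                    return 2
                    ;;
            esac
        fi
        prev="$arg"
    done
    if [ "${PVMOVE_RUN:-0}" = "1" ]; then
        pvmove "$@"
    else
        echo "would run: pvmove $*"
    fi
}
```
</pre>
<p class="MsoNormal">For example, "guarded_pvmove -v -i5 -n /dev/testvg/testlv00 /dev/mapper/mpathd1 /dev/mapper/mpaths1" prints the command it would run, while the same call with -i0 is rejected with exit status 2.</p>
<p class="MsoNormal"><o:p>&nbsp;</o:p></p>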
<p class="MsoNormal">Associated errors in /var/log/messages:<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal" style="margin-left:.5in">Mar 26 14:03:27 nodeapp1 lvmpolld: W: LVMPOLLD: polling for output of the lvm cmd (PID 74134) has timed out<o:p></o:p></p>
<p class="MsoNormal" style="margin-left:.5in"><o:p> </o:p></p>
<p class="MsoNormal" style="margin-left:.5in">&lt;skipped lines&gt;<o:p></o:p></p>
<p class="MsoNormal" style="margin-left:.5in">Mar 26 14:04:27 nodeapp1 lvmpolld: W: LVMPOLLD: polling for output of the lvm cmd (PID 74134) has timed out<o:p></o:p></p>
<p class="MsoNormal" style="margin-left:.5in">Mar 26 14:04:32 nodeapp1 lrmd[81636]: warning: share1_vg_monitor_60000 process (PID 77254) timed out<o:p></o:p></p>
<p class="MsoNormal" style="margin-left:.5in">Mar 26 14:04:32 nodeapp1 lrmd[81636]: warning: share1_vg_monitor_60000:77254 - timed out after 90000ms<o:p></o:p></p>
<p class="MsoNormal" style="margin-left:.5in">Mar 26 14:04:32 nodeapp1 crmd[81641]: error: Result of monitor operation for share1_vg on nodeapp1: Timed Out<o:p></o:p></p>
<p class="MsoNormal" style="margin-left:.5in">Mar 26 14:04:32 nodeapp1 crmd[81641]: notice: State transition S_IDLE -> S_POLICY_ENGINE<o:p></o:p></p>
<p class="MsoNormal" style="margin-left:.5in"><o:p> </o:p></p>
<p class="MsoNormal" style="margin-left:.5in">&lt;skipped lines&gt;<o:p></o:p></p>
<p class="MsoNormal" style="margin-left:.5in">Mar 26 14:05:27 nodeapp1 LVM(share1_vg)[88723]: INFO: 0 logical volume(s) in volume group "share1vg" now active<o:p></o:p></p>
<p class="MsoNormal" style="margin-left:.5in">Mar 26 14:05:27 nodeapp1 lvmpolld: W: LVMPOLLD: polling for output of the lvm cmd (PID 74134) has timed out<o:p></o:p></p>
<p class="MsoNormal" style="margin-left:.5in">Mar 26 14:05:27 nodeapp1 lvmpolld[74130]: LVMPOLLD: LVM2 cmd is unresponsive too long (PID 74134) (no output for 180 seconds)<o:p></o:p></p>
<p class="MsoNormal" style="margin-left:.5in"><o:p> </o:p></p>
<p class="MsoNormal" style="margin-left:.5in">&lt;skipped lines&gt;<o:p></o:p></p>
<p class="MsoNormal" style="margin-left:.5in">Mar 26 14:05:55 nodeapp1 lrmd[81636]: warning: share1_vg_stop_0 process (PID 88723) timed out<o:p></o:p></p>
<p class="MsoNormal" style="margin-left:.5in">Mar 26 14:05:55 nodeapp1 lrmd[81636]: warning: share1_vg_stop_0:88723 - timed out after 30000ms<o:p></o:p></p>
<p class="MsoNormal" style="margin-left:.5in">Mar 26 14:05:55 nodeapp1 crmd[81641]: error: Result of stop operation for share1_vg on nodeapp1: Timed Out<o:p></o:p></p>
<p class="MsoNormal" style="margin-left:.5in">Mar 26 14:05:55 nodeapp1 crmd[81641]: warning: Action 6 (share1_vg_stop_0) on nodeapp1 failed (target: 0 vs. rc: 1): Error<o:p></o:p></p>
<p class="MsoNormal" style="margin-left:.5in">Mar 26 14:05:55 nodeapp1 crmd[81641]: notice: Transition aborted by operation share1_vg_stop_0 'modify' on nodeapp1: Event failed<o:p></o:p></p>
<p class="MsoNormal" style="margin-left:.5in">Mar 26 14:05:55 nodeapp1 crmd[81641]: warning: Action 6 (share1_vg_stop_0) on nodeapp1 failed (target: 0 vs. rc: 1): Error<o:p></o:p></p>
<p class="MsoNormal" style="margin-left:.5in"><o:p> </o:p></p>
<p class="MsoNormal" style="margin-left:.5in">&lt;skipped lines&gt;<o:p></o:p></p>
<p class="MsoNormal" style="margin-left:.5in">Mar 26 14:05:55 nodeapp1 pengine[81639]: warning: Processing failed op stop for share1_vg on nodeapp1: unknown error (1)<o:p></o:p></p>
<p class="MsoNormal" style="margin-left:.5in">Mar 26 14:05:55 nodeapp1 pengine[81639]: warning: Processing failed op stop for share1_vg on nodeapp1: unknown error (1)<o:p></o:p></p>
<p class="MsoNormal" style="margin-left:.5in">Mar 26 14:05:55 nodeapp1 pengine[81639]: warning: Cluster node nodeapp1 will be fenced: share1_vg failed there<o:p></o:p></p>
<p class="MsoNormal" style="margin-left:.5in">Mar 26 14:05:55 nodeapp1 pengine[81639]: warning: Forcing share1_vg away from nodeapp1 after 1000000 failures (max=1000000)<o:p></o:p></p>
<p class="MsoNormal" style="margin-left:.5in">Mar 26 14:05:55 nodeapp1 pengine[81639]: warning: Scheduling Node nodeapp1 for STONITH<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
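<p class="MsoNormal">To look for the same pattern on another node, the lvmpolld and lrmd timeout messages shown above can be pulled out of the log with grep. This is a rough sketch: the function name find_lvm_timeouts is hypothetical, and the patterns are taken only from the messages quoted in this post, so other releases may word them differently.</p>
<pre>
```shell
#!/bin/sh
# Hypothetical helper: list lvmpolld and lrmd timeout messages from a
# syslog-style file (defaults to /var/log/messages).
# The patterns are based only on the log lines quoted in this post.
find_lvm_timeouts() {
    logfile="${1:-/var/log/messages}"
    grep -E 'lvmpolld.*(timed out|unresponsive too long)|lrmd\[[0-9]+\]:.*timed out' "$logfile"
}
```
</pre>
<p class="MsoNormal">For example, "find_lvm_timeouts /var/log/messages" prints the matching lines, and grep exits non-zero when nothing matches.</p>
<p class="MsoNormal"><o:p>&nbsp;</o:p></p>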
<p class="MsoNormal">Hope this helps someone down the line.<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><b><span style="font-size:10.0pt;font-family:'Arial',sans-serif;color:navy">Robert</span></b><o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
</body>
</html>