<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=us-ascii">
<meta name="Generator" content="Microsoft Exchange Server">
<!-- converted from rtf -->
<style><!-- .EmailQuote { margin-left: 1pt; padding-left: 4pt; border-left: #800000 2px solid; } --></style>
</head>
<body>
<font face="Calibri" size="2"><span style="font-size:11pt;">
<div><font color="#1F497D">Hi All,</font></div>
<div><font color="#1F497D"> </font></div>
<div><font color="#1F497D">I have a two node cluster with multicast (udp) transport . The multicast IP used in 224.1.1.1 . </font></div>
<div><font color="#1F497D"> </font></div>
<div><font color="#1F497D">Whenever there is a CPU intensive task the pcs cluster goes into split brain scenario and doesn’t recover automatically . We have to do a manual restart of services to bring both nodes online again. Before the nodes goes into split
brain , the corosync log shows ,</font></div>
<div><font color="#1F497D"> </font></div>
<div><font face="Segoe UI" size="2"><span style="font-size:10pt;">May 24 15:10:02 server1 corosync[4745]: [TOTEM ] Retransmit List: 7c 7e</span></font></div>
<div><font face="Segoe UI" size="2"><span style="font-size:10pt;">May 24 15:10:02 server1 corosync[4745]: [TOTEM ] Retransmit List: 7c 7e</span></font></div>
<div><font face="Segoe UI" size="2"><span style="font-size:10pt;">May 24 15:10:02 server1 corosync[4745]: [TOTEM ] Retransmit List: 7c 7e</span></font></div>
<div><font face="Segoe UI" size="2"><span style="font-size:10pt;">May 24 15:10:02 server1 corosync[4745]: [TOTEM ] Retransmit List: 7c 7e</span></font></div>
<div><font face="Segoe UI" size="2"><span style="font-size:10pt;">May 24 15:10:02 server1 corosync[4745]: [TOTEM ] Retransmit List: 7c 7e</span></font></div>
<div><font face="Segoe UI" size="2"><span style="font-size:10pt;">May 24 15:51:42 server1 corosync[4745]: [TOTEM ] A processor failed, forming new configuration.</span></font></div>
<div><font face="Segoe UI" size="2"><span style="font-size:10pt;">May 24 16:41:42 server1 corosync[4745]: [TOTEM ] A new membership (10.241.31.12:29276) was formed. Members left: 1</span></font></div>
<div><font face="Segoe UI" size="2"><span style="font-size:10pt;">May 24 16:41:42 server1 corosync[4745]: [TOTEM ] Failed to receive the leave message. failed: 1</span></font></div>
<div><font color="#1F497D"> </font></div>
<div><font color="#1F497D">Is there any way we can overcome this or this may be due to any multicast issues in the network side.</font></div>
<div><font color="#1F497D"> </font></div>
<div><font color="#1F497D">With Regards</font></div>
<div><font color="#1F497D">Somanath Thilak J</font></div>
<div><font color="#1F497D"> </font></div>
<div> </div>
<div><font color="#1F497D"> </font></div>
<div> </div>
</span></font>
</body>
</html>