<div dir="ltr">Hi list <div>Need your help.</div><div>Got 2 servers use Pacemaker Corosync Drbd</div><div><br></div><div><div><div>[root@voipserver ~]# pcs config</div><div>Cluster Name: ClusterKrusher</div><div>Corosync Nodes:</div><div> voipserver.primary voipserver.backup</div><div>Pacemaker Nodes:</div><div> voipserver.backup voipserver.primary</div><div><br></div><div>Resources:</div><div> Resource: ClusterIP (class=ocf provider=heartbeat type=IPaddr2)</div><div> Attributes: cidr_netmask=32 ip=172.20.11.10</div><div> Operations: monitor interval=30s (ClusterIP-monitor-interval-30s)</div><div> start interval=0s timeout=20s (ClusterIP-start-interval-0s)</div><div> stop interval=0s timeout=20s (ClusterIP-stop-interval-0s)</div><div> Master: WebDataClone</div><div> Meta Attrs: master-node-max=1 clone-max=2 notify=true master-max=1 clone-node-max=1</div><div> Resource: WebData (class=ocf provider=linbit type=drbd)</div><div> Attributes: drbd_resource=r0</div><div> Operations: demote interval=0s timeout=90 (WebData-demote-interval-0s)</div><div> monitor interval=60s (WebData-monitor-interval-60s)</div><div> promote interval=0s timeout=90 (WebData-promote-interval-0s)</div><div> start interval=0s timeout=240 (WebData-start-interval-0s)</div><div> stop interval=0s timeout=100 (WebData-stop-interval-0s)</div><div> Resource: WebFS (class=ocf provider=heartbeat type=Filesystem)</div><div> Attributes: device=/dev/drbd1 directory=/replica fstype=ext3</div><div> Operations: monitor interval=20 timeout=40 (WebFS-monitor-interval-20)</div><div> start interval=0s timeout=60 (WebFS-start-interval-0s)</div><div> stop interval=0s timeout=60 (WebFS-stop-interval-0s)</div><div> Resource: Asterisk (class=lsb type=asterisk)</div><div> Operations: monitor interval=15 timeout=15 (Asterisk-monitor-interval-15)</div><div> start interval=0s timeout=15 (Asterisk-start-interval-0s)</div><div> stop interval=0s timeout=15 (Asterisk-stop-interval-0s)</div><div> Resource: MYSQL (class=lsb type=mysql)</div><div> Operations: monitor interval=15 timeout=15 (MYSQL-monitor-interval-15)</div><div> start interval=0s timeout=15 (MYSQL-start-interval-0s)</div><div> stop interval=0s timeout=15 (MYSQL-stop-interval-0s)</div><div><br></div><div>Stonith Devices:</div><div>Fencing Levels:</div><div><br></div><div>Location Constraints:</div><div>Ordering Constraints:</div><div> promote WebDataClone then start WebFS (kind:Mandatory)</div><div> start WebFS then start MYSQL (kind:Mandatory)</div><div> start ClusterIP then start Asterisk (kind:Mandatory)</div><div>Colocation Constraints:</div><div> WebFS with WebDataClone (score:INFINITY) (with-rsc-role:Master)</div><div> MYSQL with WebFS (score:INFINITY)</div><div> Asterisk with ClusterIP (score:INFINITY)</div><div>Ticket Constraints:</div><div><br></div><div>Alerts:</div><div> No alerts defined</div><div><br></div><div>Resources Defaults:</div><div> resource-stickiness: 100</div><div>Operations Defaults:</div><div> No defaults set</div><div><br></div><div>Cluster Properties:</div><div> cluster-infrastructure: corosync</div><div> cluster-name: ClusterKrusher</div><div> dc-version: 1.1.16-12.el7_4.2-94ff4df</div><div> have-watchdog: false</div><div> stonith-enabled: false</div><div><br></div><div>Quorum:</div><div> Options:</div></div></div><div>===================</div><div><br></div><div><br></div><div>After some tibe got in logs </div><div><div>[root@voipserver ~]# cat /var/log/messages |grep drbd</div><div><div>Dec 12 14:08:52 voipserver kernel: block drbd1: role( Secondary -> Primary )</div><div>Dec 12 14:08:52 voipserver Filesystem(WebFS)[64935]: INFO: Running start for /dev/drbd1 on /replica</div><div>Dec 12 14:08:52 voipserver kernel: EXT4-fs (drbd1): mounting ext3 file system using the ext4 subsystem</div><div>Dec 12 14:08:53 voipserver kernel: EXT4-fs (drbd1): mounted filesystem with ordered data mode. Opts: (null)</div><div>Dec 12 14:18:13 voipserver Filesystem(WebFS)[3134]: INFO: Running stop for /dev/drbd1 on /replica</div><div>Dec 12 14:18:17 voipserver Filesystem(WebFS)[3319]: INFO: Running start for /dev/drbd1 on /replica</div><div>Dec 12 14:18:17 voipserver kernel: EXT4-fs (drbd1): mounting ext3 file system using the ext4 subsystem</div><div>Dec 12 14:18:17 voipserver kernel: EXT4-fs (drbd1): mounted filesystem with ordered data mode. Opts: (null)</div><div>Dec 12 14:44:07 voipserver Filesystem(WebFS)[11669]: INFO: Running stop for /dev/drbd1 on /replica</div><div>Dec 12 14:44:07 voipserver kernel: block drbd1: role( Primary -> Secondary )</div><div>Dec 12 14:44:07 voipserver kernel: block drbd1: 3552 KB (888 bits) marked out-of-sync by on disk bit-map.</div><div>Dec 12 14:44:08 voipserver kernel: block drbd1: disk( UpToDate -> Failed )</div><div>Dec 12 14:44:08 voipserver kernel: block drbd1: 3552 KB (888 bits) marked out-of-sync by on disk bit-map.</div><div>Dec 12 14:44:08 voipserver kernel: block drbd1: disk( Failed -> Diskless )</div><div>Dec 12 14:44:08 voipserver kernel: drbd r0: Terminating drbd_w_r0</div><div>Dec 12 14:44:19 voipserver kernel: drbd: loading out-of-tree module taints kernel.</div><div>Dec 12 14:44:19 voipserver kernel: drbd: module verification failed: signature and/or required key missing - tainting kernel</div><div>Dec 12 14:44:19 voipserver systemd-modules-load: Inserted module 'drbd'</div><div>Dec 12 14:44:19 voipserver kernel: drbd: initialized. Version: 8.4.10-1 (api:1/proto:86-101)</div><div>Dec 12 14:44:19 voipserver kernel: drbd: GIT-hash: a4d5de01fffd7e4cde48a080e2c686f9e8cebf4c build by mockbuild@, 2017-09-15 14:23:22</div><div>Dec 12 14:44:19 voipserver kernel: drbd: registered as block device major 147</div><div>Dec 12 14:45:02 voipserver Filesystem(WebFS)[1400]: WARNING: Couldn't find device [/dev/drbd1]. Expected /dev/??? to exist</div><div>Dec 12 14:45:03 voipserver kernel: drbd r0: Starting worker thread (from drbdsetup-84 [1524])</div><div>Dec 12 14:45:03 voipserver kernel: block drbd1: disk( Diskless -> Attaching )</div><div>Dec 12 14:45:03 voipserver kernel: drbd r0: Method to ensure write ordering: flush</div><div>Dec 12 14:45:03 voipserver kernel: block drbd1: max BIO size = 524288</div><div>Dec 12 14:45:03 voipserver kernel: block drbd1: drbd_bm_resize called with capacity == 419153344</div><div>Dec 12 14:45:03 voipserver kernel: block drbd1: resync bitmap: bits=52394168 words=818659 pages=1599</div><div>Dec 12 14:45:03 voipserver kernel: block drbd1: size = 200 GB (209576672 KB)</div><div>Dec 12 14:45:03 voipserver kernel: block drbd1: recounting of set bits took additional 1 jiffies</div><div>Dec 12 14:45:03 voipserver kernel: block drbd1: 3552 KB (888 bits) marked out-of-sync by on disk bit-map.</div><div>Dec 12 14:45:03 voipserver kernel: block drbd1: disk( Attaching -> UpToDate )</div><div>Dec 12 14:45:03 voipserver kernel: block drbd1: attached to UUIDs FBA12F26BE1DEE73:EE5942173C75DE98:1BF4DECFE20D51E2:1BF3DECFE20D51E3</div><div>Dec 12 14:45:03 voipserver kernel: drbd r0: conn( StandAlone -> Unconnected )</div><div>Dec 12 14:45:03 voipserver kernel: drbd r0: Starting receiver thread (from drbd_w_r0 [1525])</div><div>Dec 12 14:45:03 voipserver kernel: drbd r0: receiver (re)started</div><div>Dec 12 14:45:03 voipserver kernel: drbd r0: conn( Unconnected -> WFConnection )</div><div>Dec 12 14:45:03 voipserver kernel: drbd r0: Handshake successful: Agreed network protocol version 101</div><div>Dec 12 14:45:03 voipserver kernel: drbd r0: Feature flags enabled on protocol level: 0x7 TRIM THIN_RESYNC WRITE_SAME.</div><div>Dec 12 14:45:03 voipserver kernel: drbd r0: conn( WFConnection -> WFReportParams )</div><div>Dec 12 14:45:03 voipserver kernel: drbd r0: Starting ack_recv thread (from drbd_r_r0 [1534])</div><div>Dec 12 14:45:03 voipserver kernel: block drbd1: drbd_sync_handshake:</div><div>Dec 12 14:45:03 voipserver kernel: block drbd1: self FBA12F26BE1DEE72:EE5942173C75DE98:1BF4DECFE20D51E2:1BF3DECFE20D51E3 bits:888 flags:0</div><div>Dec 12 14:45:03 voipserver kernel: block drbd1: peer 93BB6F0A5075345D:EE5942173C75DE99:1BF4DECFE20D51E3:1BF3DECFE20D51E3 bits:38004 flags:2</div><div>Dec 12 14:45:03 voipserver kernel: block drbd1: uuid_compare()=100 by rule 90</div><div>Dec 12 14:45:03 voipserver kernel: block drbd1: helper command: /sbin/drbdadm initial-split-brain minor-1</div><div>Dec 12 14:45:03 voipserver kernel: block drbd1: helper command: /sbin/drbdadm initial-split-brain minor-1 exit code 0 (0x0)</div><div>Dec 12 14:45:03 voipserver kernel: block drbd1: Split-Brain detected but unresolved, dropping connection!</div><div>Dec 12 14:45:03 voipserver kernel: block drbd1: helper command: /sbin/drbdadm split-brain minor-1</div><div>Dec 12 14:45:03 voipserver kernel: block drbd1: helper command: /sbin/drbdadm split-brain minor-1 exit code 0 (0x0)</div><div>Dec 12 14:45:03 voipserver kernel: drbd r0: conn( WFReportParams -> Disconnecting )</div><div>Dec 12 14:45:03 voipserver kernel: drbd r0: error receiving ReportState, e: -5 l: 0!</div><div>Dec 12 14:45:03 voipserver kernel: drbd r0: ack_receiver terminated</div><div>Dec 12 14:45:03 voipserver kernel: drbd r0: Terminating drbd_a_r0</div><div>Dec 12 14:45:03 voipserver kernel: drbd r0: Connection closed</div><div>Dec 12 14:45:03 voipserver kernel: drbd r0: conn( Disconnecting -> StandAlone )</div><div>Dec 12 14:45:03 voipserver kernel: drbd r0: receiver terminated</div><div>Dec 12 14:45:03 voipserver kernel: drbd r0: Terminating drbd_r_r0</div></div></div><div><br></div><div><br></div><div><br></div><div>So i need to decide the best way now to conf split brain recovery </div><div>config files appreciated. </div><div><br></div><div>Primary</div><div><div>[root@voipserver ~]# drbd-overview</div><div>NOTE: drbd-overview will be deprecated soon.</div><div>Please consider using drbdtop.</div><div><br></div><div> 1:r0/0 WFConnection Primary/Unknown UpToDate/DUnknown /replica ext3 197G 720M 186G 1%</div></div><div><div><br></div><div>Secondary</div><div><br></div><div>[root@voipserver ~]# drbd-overview</div><div>NOTE: drbd-overview will be deprecated soon.</div><div>Please consider using drbdtop.</div><div><br></div><div> 1:r0/0 StandAlone Secondary/Unknown UpToDate/DUnknown</div><div><br></div><div><br></div><div><div>So i need to decide the best way now to conf split brain recovery </div><div>config files appreciated. </div></div><div>THANKS</div><div><br></div>-- <br><div class="gmail_signature"><div dir="ltr"><div><div dir="ltr"><div><div>Best regards<br>Antony<br></div><div>tel. +380669197533</div><div>tel2. +380636564340<br></div><div>Paypal <a href="http://paypal.me/Satskiy?ppid=PPC000654&cnac=PL&rsta=en_PL(en_DK)&cust=NN8XJS9XEP22C&unptid=21db79ac-ef8d-11e5-9553-9c8e992ea258&t=&cal=4d776c21ca7d2&calc=4d776c21ca7d2&calf=4d776c21ca7d2&unp_tpcid=ppme-social-business-profile-created&page=main:email&pgrp=main:email&e=op&mchn=em&s=ci&mail=sys" target="_blank">http://paypal.me/Satskiy</a><br></div><div><a href="mailto:mail%3Asatskiy.a@gmail.com" target="_blank">satskiy.a@gmail.com</a></div></div></div></div></div></div>
</div></div>