[ClusterLabs] Antw: Re: SBD & Failed Peer

Ulrich Windl Ulrich.Windl at rz.uni-regensburg.de
Tue Sep 8 06:12:21 UTC 2015


>>> Jorge Fábregas <jorge.fabregas at gmail.com> schrieb am 08.09.2015 um 04:42
in
Nachricht <55EE4B27.20007 at gmail.com>:
> On 09/06/2015 09:28 PM, Jorge Fábregas wrote:
>> I found out there's the "-P" option to SBD (that's supposed to
>> prevent that as long as there's cluster communication) but it doesn't
>> work in my SLES 11 SP4 setup.  Maybe it's on SLES 12.
> 
> If I run sbd --help I see the option is there in SLES 11 SP4.  When I
> put SBD_OPTS="-P" in both nodes (and restart openais in both) I get
> these errors constantly (every 6 seconds or so):
> 
> Sep  7 22:28:41 sles11a kernel: [ 1434.159558] sbd[8628]: segfault at
> ffffffffffffff44 ip 00007fc43cdff9bb sp 00007ffe5b429270 error 5 in
> libcrmcommon.so.3.5.0[7fc43cdd0000+54000]

I guess if you'd enable core dumps (ulimit -c) temporarily, you'd get a core
dump from sbd. Then possibly you could use gdb to create a backtrace, maybe
even with symbolic debugging information.

> 
> I tried using "-W -P" as well but still the same error.  If I remove the
> SBD option, restart openais, the errors stop so no doubt whatsover that
> the -P options is causing the issue.
> 
> This is on two virtual machines using the i6300esb module for watchdog
> (qemu providing the virtual watchdog) running a simple two node cluster
> (active/passive) as simple as it gets with two primitives:  Filesystem
> (ext3) and sbd-stonith.
> 
> If anyone from SUSE here could recreate it that would be great.
> 
> Thanks,
> Jorge
> 
> _______________________________________________
> Users mailing list: Users at clusterlabs.org 
> http://clusterlabs.org/mailman/listinfo/users 
> 
> Project Home: http://www.clusterlabs.org 
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf 
> Bugs: http://bugs.clusterlabs.org 







More information about the Users mailing list