[ClusterLabs] epic fail

Kristián Feldsam admin at feldhost.cz
Mon Jul 24 12:14:10 EDT 2017


Is the NFS server/share also managed by Pacemaker, and is the ordering set correctly?
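
The reason for asking: if the filesystem on /raid is exported by the kernel NFS server, that might explain lsof/fuser reporting the holder as "kernel", since no user process is involved and TERM/KILL have nothing to signal. With an ordering constraint that starts the filesystem before the NFS server, Pacemaker stops them in the reverse order by default, so the export should be gone before the unmount is attempted. A quick way to check whether something under the mount point is still exported is sketched below. It assumes the export table is readable at /proc/fs/nfsd/exports and that each non-comment line begins with the exported path; the function names are made up for illustration, and paths containing escaped spaces are not handled.

```python
import os


def exported_paths(exports_text):
    """Return the set of paths listed in an exports-style table.

    Assumes one export per line, path in the first whitespace-separated
    field, and '#' starting a comment line.
    """
    paths = set()
    for line in exports_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        paths.add(line.split()[0])
    return paths


def is_exported(mount_point, exports_text):
    """True if mount_point itself, or any path beneath it, is exported."""
    base = mount_point.rstrip("/")
    prefix = base + "/"
    return any(p == base or p.startswith(prefix)
               for p in exported_paths(exports_text))


if __name__ == "__main__":
    table = "/proc/fs/nfsd/exports"  # assumed location of the kernel export table
    if os.path.exists(table):
        try:
            with open(table) as fh:
                print("/raid still exported:", is_exported("/raid", fh.read()))
        except OSError as err:
            print("could not read", table, "-", err)
```

If this reports the path as still exported at the time the Filesystem stop runs, the ordering between the NFS resource and drbd_filesystem would be the first thing to look at.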

Best regards, Kristián Feldsam
Tel.: +420 773 303 353, +421 944 137 535
E-mail: support at feldhost.cz

www.feldhost.cz - FeldHost™ – professional hosting and server services at reasonable prices.

FELDSAM s.r.o.
V rohu 434/3
Praha 4 – Libuš, postal code 142 00
Company ID: 290 60 958, VAT ID: CZ290 60 958
File C 200350, registered at the Municipal Court in Prague

Bank: Fio banka a.s.
Account number: 2400330446/2010
BIC: FIOBCZPPXX
IBAN: CZ82 2010 0000 0024 0033 0446

> On 24 Jul 2017, at 18:01, Dimitri Maziuk <dmaziuk at bmrb.wisc.edu> wrote:
> 
> On 07/24/2017 10:38 AM, Ken Gaillot wrote:
> 
>> A restart shouldn't lead to fencing in any case where something's not
>> going seriously wrong. I'm not familiar with the "kernel is using it"
>> message, I haven't run into that before.
> 
> I posted it at least once before.
> 
>> 
>> Jul 22 14:03:48 zebrafish Filesystem(drbd_filesystem)[6886]: INFO: Running stop for /dev/drbd0 on /raid
>> Jul 22 14:03:48 zebrafish Filesystem(drbd_filesystem)[6886]: INFO: Trying to unmount /raid
>> Jul 22 14:03:48 zebrafish Filesystem(drbd_filesystem)[6886]: ERROR: Couldn't unmount /raid; trying cleanup with TERM
>> Jul 22 14:03:48 zebrafish Filesystem(drbd_filesystem)[6886]: INFO: No processes on /raid were signalled. force_unmount is set to 'yes'
>> Jul 22 14:03:49 zebrafish Filesystem(drbd_filesystem)[6886]: ERROR: Couldn't unmount /raid; trying cleanup with TERM
>> Jul 22 14:03:49 zebrafish Filesystem(drbd_filesystem)[6886]: INFO: No processes on /raid were signalled. force_unmount is set to 'yes'
>> Jul 22 14:03:50 zebrafish ntpd[596]: Deleting interface #8 enp2s0f0, 144.92.167.221#123, interface stats: received=0, sent=0, dropped=0, active_time=260 secs
>> Jul 22 14:03:50 zebrafish Filesystem(drbd_filesystem)[6886]: ERROR: Couldn't unmount /raid; trying cleanup with TERM
>> Jul 22 14:03:50 zebrafish Filesystem(drbd_filesystem)[6886]: INFO: No processes on /raid were signalled. force_unmount is set to 'yes'
>> Jul 22 14:03:51 zebrafish Filesystem(drbd_filesystem)[6886]: ERROR: Couldn't unmount /raid; trying cleanup with KILL
>> Jul 22 14:03:51 zebrafish Filesystem(drbd_filesystem)[6886]: INFO: No processes on /raid were signalled. force_unmount is set to 'yes'
>> Jul 22 14:03:52 zebrafish Filesystem(drbd_filesystem)[6886]: ERROR: Couldn't unmount /raid; trying cleanup with KILL
>> Jul 22 14:03:53 zebrafish Filesystem(drbd_filesystem)[6886]: INFO: No processes on /raid were signalled. force_unmount is set to 'yes'
>> Jul 22 14:03:54 zebrafish Filesystem(drbd_filesystem)[6886]: ERROR: Couldn't unmount /raid; trying cleanup with KILL
>> Jul 22 14:03:54 zebrafish Filesystem(drbd_filesystem)[6886]: INFO: No processes on /raid were signalled. force_unmount is set to 'yes'
>> Jul 22 14:03:55 zebrafish Filesystem(drbd_filesystem)[6886]: ERROR: Couldn't unmount /raid, giving up!
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [ umount: /raid: target is busy. ]
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [         (In some cases useful info about processes that use ]
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [          the device is found by lsof(8) or fuser(1)) ]
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [ ocf-exit-reason:Couldn't unmount /raid; trying cleanup with TERM ]
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [ umount: /raid: target is busy. ]
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [         (In some cases useful info about processes that use ]
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [          the device is found by lsof(8) or fuser(1)) ]
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [ ocf-exit-reason:Couldn't unmount /raid; trying cleanup with TERM ]
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [ umount: /raid: target is busy. ]
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [         (In some cases useful info about processes that use ]
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [          the device is found by lsof(8) or fuser(1)) ]
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [ ocf-exit-reason:Couldn't unmount /raid; trying cleanup with TERM ]
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [ umount: /raid: target is busy. ]
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [         (In some cases useful info about processes that use ]
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [          the device is found by lsof(8) or fuser(1)) ]
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [ ocf-exit-reason:Couldn't unmount /raid; trying cleanup with KILL ]
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [ umount: /raid: target is busy. ]
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [         (In some cases useful info about processes that use ]
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [          the device is found by lsof(8) or fuser(1)) ]
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [ ocf-exit-reason:Couldn't unmount /raid; trying cleanup with KILL ]
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [ umount: /raid: target is busy. ]
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [         (In some cases useful info about processes that use ]
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [          the device is found by lsof(8) or fuser(1)) ]
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [ ocf-exit-reason:Couldn't unmount /raid; trying cleanup with KILL ]
>> Jul 22 14:03:55 zebrafish lrmd[1075]:  notice: drbd_filesystem_stop_0:6886:stderr [ ocf-exit-reason:Couldn't unmount /raid, giving up! ]
>> Jul 22 14:03:55 zebrafish crmd[1078]:  notice: Result of stop operation for drbd_filesystem on zebrafish: 1 (unknown error)
>> Jul 22 14:03:55 zebrafish crmd[1078]:  notice: zebrafish-drbd_filesystem_stop_0:101 [ umount: /raid: target is busy.\n        (In some cases useful info about processes that use\n         the device is found by lsof(8) or fuser(1))\nocf-exit-reason:Couldn't unmount /raid; trying cleanup with TERM\numount: /raid: target is busy.\n        (In some cases useful info about processes that use\n         the device is found by lsof(8) or fuser(1))\nocf-exit-reason:Couldn't unmount /raid; trying cleanup with TERM\numount: /raid: target is busy.\n
>> Jul 22 14:03:55 zebrafish crmd[1078]: warning: Action 45 (drbd_filesystem_stop_0) on zebrafish failed (target: 0 vs. rc: 1): Error
>> Jul 22 14:03:55 zebrafish crmd[1078]:  notice: Transition aborted by operation drbd_filesystem_stop_0 'modify' on zebrafish: Event failed
>> Jul 22 14:03:55 zebrafish crmd[1078]: warning: Action 45 (drbd_filesystem_stop_0) on zebrafish failed (target: 0 vs. rc: 1): Error
>> Jul 22 14:03:55 zebrafish crmd[1078]:  notice: Transition 2 (Complete=21, Pending=0, Fired=0, Skipped=0, Incomplete=43, Source=/var/lib/pacemaker/pengine/pe-input-256.bz2): Complete
> 
> Lsof/fuser show the PID of the process holding FS open as "kernel".
> 
> -- 
> Dimitri Maziuk
> Programmer/sysadmin
> BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu
> 
> _______________________________________________
> Users mailing list: Users at clusterlabs.org
> http://lists.clusterlabs.org/mailman/listinfo/users
> 
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs: http://bugs.clusterlabs.org
