flock is held after ceph-osd daemon being stopped - Dev

12 Feb 2020

Hi All,

I noticed a locking issue in kernel device.
When I stopped the ceph cluster and all daemons, the kernel device _lock somehow is still
held and this line below will return r < 0:

int KernelDevice::_lock()
{
  int r = ::flock(fd_directs[WRITE_LIFE_NOT_SET], LOCK_EX | LOCK_NB);
  …
}

The way I stop the cluster and daemons:

sudo ../src/stop.sh
sudo bin/init-ceph --verbose forcestop

This error happens even after the reboot when I try to use vstart:

bdev _lock flock failed on ceph/build/dev/osd0/block 
bdev open failed to lock /home/yzhan298/ceph/build/dev/osd0/block: (11) Resource
temporarily unavailable
OSD::mkfs: couldn't mount ObjectStore: error (11) Resource temporarily unavailable
** ERROR: error creating empty object store in ceph/build/dev/osd0: (11) Resource
temporarily unavailable

Please advice. (On master branch)

Thanks,
Yiming