Yes. After the time-out of 600 secs the OSDs got marked out, all PGs got remapped and
recovery/rebalancing started as usual. In the past, I have serviced servers with the
noout flag set, and I would expect mon_osd_down_out_subtree_limit=host to have the same
effect when shutting down an entire host. Unfortunately, in my case these two settings
behave differently.
If I understand the documentation correctly, the OSDs should not get marked out
automatically.
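The expectation above can be sketched as a small Python model. This is only a
simplified reading of the documented behaviour, not Ceph's monitor code: the function
name may_auto_mark_out, the ancestors/up data layout and the host names ceph-01 and
ceph-02 are hypothetical. Only mon_osd_down_out_interval,
mon_osd_down_out_subtree_limit and the noout flag come from this thread.

```python
# Simplified model of the expected behaviour; NOT Ceph's actual implementation.
# All names below are hypothetical and chosen for illustration only.

def may_auto_mark_out(osd, ancestors, up, subtree_limit,
                      down_seconds, down_out_interval=600, noout=False):
    """Return True if this model would mark `osd` out automatically.

    ancestors: maps OSD id -> {bucket type: bucket name}, e.g. {"host": "ceph-01"}
    up:        maps OSD id -> True if the OSD is up
    """
    # noout set, OSD still up, or interval not yet elapsed: never marked out.
    if noout or up[osd] or down_seconds < down_out_interval:
        return False
    bucket = ancestors[osd].get(subtree_limit)
    if bucket is None:
        # No enclosing bucket of the limit type is known, so the limit cannot apply.
        return True
    # All OSDs that sit under the same bucket of the limit type (e.g. the same host).
    siblings = [o for o, a in ancestors.items() if a.get(subtree_limit) == bucket]
    # If the whole subtree is down, the limit should prevent the automatic mark-out.
    return any(up[o] for o in siblings)


if __name__ == "__main__":
    ancestors = {0: {"host": "ceph-01"}, 1: {"host": "ceph-01"}, 2: {"host": "ceph-02"}}
    whole_host_down = {0: False, 1: False, 2: True}
    one_osd_down = {0: False, 1: True, 2: True}
    print(may_auto_mark_out(0, ancestors, whole_host_down, "host", 700))  # False
    print(may_auto_mark_out(0, ancestors, one_osd_down, "host", 700))     # True
```

Under this model, shutting down every OSD on a host with the limit set to host gives
the same outcome as noout for those OSDs, which is the behaviour the test did not show.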
Best regards,
=================
Frank Schilder
AIT Risø Campus
Bygning 109, rum S14
________________________________________
From: Anthony D'Atri <anthony.datri(a)gmail.com>
Sent: 14 July 2020 04:32:05
To: Frank Schilder
Subject: Re: [ceph-users] mon_osd_down_out_subtree_limit not working?
Did it start rebalancing?
On Jul 13, 2020, at 4:29 AM, Frank Schilder <frans(a)dtu.dk> wrote:
if I shut down all OSDs on this host, these OSDs should not be marked out automatically
after mon_osd_down_out_interval(=600) seconds. I did a test today and, unfortunately, the
OSDs do get marked as out. Ceph status was showing 1 host down as expected.