[ceph-users] Re: slow "rados ls"

11 Sep 2020

Hi Stefan

I can't recall that that was the case and unfortunately we do not have
enough history for our performance measurements to look back

We are on nautilus. Please let me know your findings when you do your pg
expansion on nautilus

Grtz

Marcel

...
  OK, I'm really curious if you observed the
following behaviour:

 During, or shortly after the rebalance, did you see high CPU usage of
 the OSDs? In particular the ones that hosted the PGs before they were
 moved to the new nodes? As in ~ 300 % CPU per OSD (increasing from a few
 percent to 300% non stop)? RocksDB is doing housekeeping, And we
 observed before, and today again, on Mimic 13.2.8, that with a lot of
 OMAP/META data the OSDs that have to clean up consume a ridiculous
 amount of CPU (for hours on end). Triggering loads of slow ops and
 latency spikes in the somtimes (tens) of seconds.

 Are you running nautilus? If you haven't seen this behaviour this might
 have been fixed in Nautlilus. Or you cluster is different from ours. We
 will do PG expansion after we have upgraded to Nautilus, so we'll
 definitely know by then.

 Thanks,

 Stefan

2024

2023

2022

2021

2020

2019

[ceph-users] Re: slow "rados ls"