MDS troubleshooting documentation: ceph daemon mds.<name> dump cache - ceph-users

31 Aug 2020

Hi list,

We had some stuck ops on our MDS. To figure out why, we looked
up the documentation. The first thing it mentions is the following:

ceph daemon mds.<name> dump cache /tmp/dump.txt

Our MDS had 170 GB in cache at that moment.

Turns out that is a sure way to get your active MDS replaced by a standby.

Is this supposed to work on an MDS with a large cache? If not, then a
big warning against running this on MDSes with large caches
would be appropriate.
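
Until the docs carry such a warning, something like the guard below might
help. It is only a sketch: the function names and the size limit are made up
for illustration, and the cache size has to be supplied by the caller (for
example read by hand from "ceph daemon mds.<name> cache status"). It only
prints the dump command from the documentation when the cache is under the
chosen limit; it does not run anything against the MDS itself.

```shell
#!/bin/sh
# Hypothetical guard around the MDS cache dump. Names and limits are
# illustrative assumptions, not part of Ceph.

# cache_dump_allowed BYTES LIMIT_BYTES
# Succeeds (exit 0) when the cache size is at or below the limit.
cache_dump_allowed() {
    [ "$1" -le "$2" ]
}

# guarded_dump NAME BYTES LIMIT_BYTES OUTFILE
# Prints the dump command when the cache is small enough; otherwise
# prints a refusal on stderr and returns 1.
guarded_dump() {
    if cache_dump_allowed "$2" "$3"; then
        echo "ceph daemon mds.$1 dump cache $4"
    else
        echo "refusing: cache is $2 bytes, limit is $3 bytes" >&2
        return 1
    fi
}
```

What limit is safe is something I don't know; it presumably depends on how
long the MDS can be busy dumping before the monitors give up on it.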

Gr. Stefan

P.S. I think our only option at that point was to get the active MDS
restarted, but still.