Any ideas or tips on how to debug further ?

On Mon, Oct 28, 2019 at 7:17 PM Kári Bertilsson <karibertils@gmail.com> wrote:
Hello Patrick,

Here is output from those commands
https://pastebin.com/yUmuQuYj

5 clients have the file system mounted, but only 2 of them have most of the activity.



On Mon, Oct 28, 2019 at 6:54 PM Patrick Donnelly <pdonnell@redhat.com> wrote:
Hello Kári,

On Mon, Oct 28, 2019 at 11:14 AM Kári Bertilsson <karibertils@gmail.com> wrote:
> This seems to happen mostly when listing folders containing 10k+ folders.
>
> The dirlisting hangs indefinitely or until i restart the active MDS and then the hanging "ls" command will finish running.
>
> Every time restarting the active MDS fixes the problem for a while.

Please share details about your cluster. `fs dump`, `ceph status`, and
`ceph versions`. How many clients are using the file system?

--
Patrick Donnelly, Ph.D.
He / Him / His
Senior Software Engineer
Red Hat Sunnyvale, CA
GPG: 19F28A586F808C2402351B93C3301A3E258DD79D