On Mon, Dec 2, 2019 at 10:27 AM Marc Roos <M.Roos(a)f1-outsourcing.eu> wrote:
I have asked about this before[1]. Since the Nautilus upgrade I have been
getting these errors, with a total node failure as a result(?). I was not
expecting this in my 'low load' setup. Maybe someone can now help resolve
this? I have also been waiting quite some time to get access to
https://tracker.ceph.com/issues.
Hi Marc,
ISTR there were some anti-spam measures put in place. Is your account
waiting for manual approval? If so, David should be able to help.
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9287 ffff911a9a26bd00 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9283 ffff911d34e69d00 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9276 ffff911d34e69c00 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c926c ffff912068b92c00 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9268 ffff912068b93000 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c926d ffff912068b92900 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c928a ffff912118e5be00 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9272 ffff9119950d9500 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9269 ffff911940f3d000 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9270 ffff911748427c00 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c926b ffff91169b000600 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9281 ffff91169b000500 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9288 ffff9115844d2500 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c927d ffff9115844d2e00 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9280 ffff91186401b000 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9267 ffff9121535ecc00 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c927c ffff9121cecb1e00 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9271 ffff9121cecb0400 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9279 ffff911d26646300 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c927f ffff911d26646900 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9275 ffff9121cecb1700 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9259 ffff91170c9f6600 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9257 ffff9118ef2a8000 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c924e ffff911a1e091800 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9262 ffff911a1e090c00 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9266 ffff9115e3859500 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c924f ffff9118aefd1300 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c925f ffff91170c9f6100 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9252 ffff9115e3859800 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9256 ffff912045dc5300 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9254 ffff91170c9f6900 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9261 ffff91170c9f7100 fail -12
Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020d4ec4 ffff9118aefd0000 fail -12
It is failing to allocate memory: -12 is -ENOMEM. "Low load" isn't very
specific. Can you describe the setup and the workload in more detail?
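For reference, the "fail -12" suffix in the kernel messages can be decoded
mechanically. The following is only a rough sketch: the function name
decode_failures and the regular expression are made up for illustration, and
the reading of the first hex field as the snap realm inode and the second as
a kernel pointer is an assumption about the message format.

```python
import errno
import os
import re

# Matches one kernel message. \s+ between fields also tolerates a
# mail-wrapped line break in the middle of a message.
# Assumption: first hex field = snap realm inode, second = kernel pointer.
PATTERN = re.compile(
    r"ceph: build_snap_context\s+(?P<ino>[0-9a-f]+)\s+"
    r"(?P<ptr>[0-9a-f]+)\s+fail\s+(?P<err>-?\d+)"
)


def decode_failures(log_text):
    """Return (inode, errno name, errno description) for every match."""
    results = []
    for m in PATTERN.finditer(log_text):
        code = abs(int(m.group("err")))  # kernel reports negative errno
        name = errno.errorcode.get(code, "UNKNOWN")
        results.append((m.group("ino"), name, os.strerror(code)))
    return results


sample = (
    "Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9287\n"
    "ffff911a9a26bd00 fail -12\n"
)

for ino, name, text in decode_failures(sample):
    print(ino, name, text)
```

Feeding it the whole excerpt should give one ENOMEM entry per message, which
is a quick way to confirm that every failure in the burst has the same cause.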
How many snapshots do you have?
Do you keep track of memory consumption on the node?
Finally, you say "crash" in the subject. Does the kernel actually
crash, or does it perhaps lock up? If it actually crashes, do you have
the panic message?
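If memory consumption is not being tracked yet, something along these lines,
run periodically on the node, would be a start. This is a minimal sketch, not
a monitoring solution: the helper names parse_meminfo and summarize are
hypothetical, and the choice of fields (including the slab counters, on the
assumption that kernel-side allocations are what matters here) is a guess.

```python
def parse_meminfo(text):
    """Parse /proc/meminfo-style text into {field name: integer value}.

    Most fields are reported in kB; a few (e.g. HugePages_Total) are counts.
    """
    fields = {}
    for line in text.splitlines():
        name, sep, rest = line.partition(":")
        parts = rest.split()
        if sep and parts and parts[0].isdigit():
            fields[name.strip()] = int(parts[0])
    return fields


def summarize(fields):
    """Keep only the fields assumed to be interesting for this problem."""
    keys = ("MemTotal", "MemAvailable", "Slab", "SUnreclaim")
    return {k: fields[k] for k in keys if k in fields}


if __name__ == "__main__":
    try:
        with open("/proc/meminfo") as f:
            print(summarize(parse_meminfo(f.read())))
    except OSError:
        # Not a Linux host, or /proc is not mounted.
        print("could not read /proc/meminfo")
```

Logging the output with a timestamp, say once a minute from cron, would show
whether available memory was already low before the 03:14:36 burst.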
Thanks,
Ilya