Hi Katarzyna,
Incomplete multipart uploads are not considered orphans.
With respect to the 404s…. Which version of ceph are you running? What tooling are you
using to list and cancel? Can you provide a console transcript of the listing and
cancelling?
Thanks,
Eric
--
J. Eric Ivancich
he / him / his
Red Hat Storage
Ann Arbor, Michigan, USA
On Apr 28, 2020, at 2:57 AM, Katarzyna Myrek
<katarzyna(a)myrek.pl> wrote:
Hi all
I am afraid that there is even more thrash available - running
rgw-orphan-list does not find everything. Like I still have broken
multiparts -> when I do s3cmd multipart I get a list of
"pending/interrupted multiparts". When I try to cancel such multipart
I get 404.
Does anyone have a method for cleanup of such things? Or even a list
of tasks which should be run regularly on clusters with rgw ?
Kind regards / Pozdrawiam,
Katarzyna Myrek
wt., 21 kwi 2020 o 09:57 Janne Johansson <icepic.dz(a)gmail.com> napisał(a):
Den tis 21 apr. 2020 kl 07:29 skrev Eric Ivancich <ivancich(a)redhat.com>om>:
Please be certain to read the associated docs in both:
doc/radosgw/orphans.rst
doc/man/8/rgw-orphan-list.rst
so you understand the limitations and potential pitfalls. Generally this tool will be a
precursor to a large delete job, so understanding what’s going on is important.
I look forward to your report! And please feel free to post additional questions in this
forum.
Where are those?
https://github.com/ceph/ceph/tree/master/doc/man/8
https://github.com/ceph/ceph/tree/master/doc/radosgw
don't seem to contain them in master. Nor in nautilus branch or octopus.
This whole issue feels weird, rgw (or its users) produces dead fragments of mulitparts,
orphans and whatnot that needs cleaning up sooner or later and the info we get is that the
old cleaner isn't meant to be used, it hasn't worked for a long while, there is no
fixed version, perhaps there is a script somewhere with caveats. This (slightly
frustrated) issue is of course on top of
"bi trim"
"bilog trim"
"mdlog trim"
"usage trim"
"datalog trim"
"sync error trim"
"gc process"
"reshard stale-instances rm"
that we rgw admins are supposed to know when to run, how often, what their quirks are and
so on.
'Docs' for rgw means "datalog trim" --help says "trims the
datalog", and the long version on the web would be "this operation trims the
datalog" or something that doesn't add anything more.
--
"Grumpy cat was an optimist"