Thank you for your email and for providing the solution to check for shadow
and multipart objects in CEPH. I have checked the objects in my CEPH
cluster and found the following results:
The command rados -p <pool> ls | grep --text -vE "shadow|multipart" | wc
-l
returns about 80 million objects.
The command radosgw-admin user stats --uid <user> 2>/dev/null | jq -r
'.stats | select(.num_objects > 0) | .num_objects' returns 889684340
objects, including all buckets for all users.
The data match and are sensible, but I still encounter an inconsistency
when trying to get objects per PG with the rados command. The sum of all
objects in all PGs without shadow and multipart objects is 18 million.
It appears that the S3 objects are stored in the RADOS layer as follows:
Small objects (less than 4 MB) are stored in RADOS without any prefix. If
an object is larger, it is split into multiple objects, each one 4 MB, and
the remaining part (less than 4 MB) is stored as a shadow file. However, in
any case, all S3 objects have a corresponding object in RADOS as a header
that holds the metadata of the object.
https://access.redhat.com/solutions/4177821
Please let me know if you have any further suggestions or if there is
anything else I can assist with.
On Mon, Feb 6, 2023 at 4:24 PM Robert Sander <r.sander(a)heinlein-support.de>
wrote:
On 04.02.23 20:54, Ramin Najjarbashi wrote:
ceph df | grep mypoo
--- POOLS ---
POOL OBJECTS
mypool 1.11G
---------------
and from this, I got 8.8M objects :
for item in `radosgw-admin user list | jq -r ".[]" | head`; do
B_OBJ=$(radosgw-admin user stats --uid $item 2>/dev/null | jq -r '.stats
|
select(.num_objects > 0) | .num_objects');
SUM=$((SUM + B_OBJ)); done
You have mixed RADOS objects and S3 objects.
These are two different layers. Only small (< 4MB) S3 objects are stored
in a single RADOS object. Larger S3 objects are split into multiple 4MB
sized RAOS objects by the rados-gateway.
This is why you see much more RADOS objects than S3 objects.
Regards
--
Robert Sander
Heinlein Support GmbH
Linux: Akademie - Support - Hosting
http://www.heinlein-support.de
Tel: 030-405051-43
Fax: 030-405051-19
Zwangsangaben lt. §35a GmbHG:
HRB 93818 B / Amtsgericht Berlin-Charlottenburg,
Geschäftsführer: Peer Heinlein -- Sitz: Berlin
_______________________________________________
ceph-users mailing list -- ceph-users(a)ceph.io
To unsubscribe send an email to ceph-users-leave(a)ceph.io