Compression of data in existing cephfs EC pool - ceph-users

4 Jan 2021

Hey everyone,

I've got an EC pool as part of our cephfs for colder data.  When we started using it,
compression was still marked experimental.  Since then it has become stable so I turned
compression on to "aggressive".  Using 'ceph df detail' I can see that
new data is getting compressed.  We have a good chunk of data (~1P) that hasn't been
through the compression side of things that I would love to get squished.  

I know there currently isn't a way to do this in place and requires a new copy to be
made so I made a simple script to "wiggle" the data.   If the file is older than
the date I turned compression on, and the file size is >64k: rsync the file to a new
file sitting right next to the old one, delete the old one then move the new file into
it's place.  New inode = new file = compressed data!   

In theory, this works awesome.  In practice however I'm not seeing the needles move as
my script goes through.  Does anyone have any ideas into what I may have missed for this
dance?  

-paul

Side note:  People with file names that include quotes, pipes and spaces should get their
accounts disabled by default.

--
Paul Mezzanini
Sr Systems Administrator / Engineer, Research Computing
Information & Technology Services
Finance & Administration
Rochester Institute of Technology
o:(585) 475-3245 | pfmeec(a)rit.edu

CONFIDENTIALITY NOTE: The information transmitted, including attachments, is
intended only for the person(s) or entity to which it is addressed and may
contain confidential and/or privileged material. Any review, retransmission,
dissemination or other use of, or taking of any action in reliance upon this
information by persons or entities other than the intended recipient is
prohibited. If you received this in error, please contact the sender and
destroy any copies of this information.
------------------------