Recently, we found that when we enable versioning, some object versions
may be lost; it seems this is related to resharding.
Here is the log output:
2021-04-27 19:35:03.308 7f6a317fa700 0 RGWReshardLock::lock failed to
acquire lock on
base-bucket2:ebe5dd13-0db4-4739-836c-1486abde1cde.4912.11 ret=-16
2021-04-27 19:35:03.764 7f6a32ffd700 0 RGWReshardLock::lock failed to
acquire lock on
base-bucket2:ebe5dd13-0db4-4739-836c-1486abde1cde.4912.11 ret=-16
2021-04-27 19:35:08.022 7f6a7ffff700 0 RGWReshardLock::lock failed to
acquire lock on
base-bucket2:ebe5dd13-0db4-4739-836c-1486abde1cde.4912.11 ret=-16
2021-04-27 19:35:08.022 7f6a33fff700 0 RGWReshardLock::lock failed to
acquire lock on
base-bucket2:ebe5dd13-0db4-4739-836c-1486abde1cde.4912.11 ret=-16
2021-04-27 19:35:08.256 7f6a30ff9700 0 RGWReshardLock::lock failed to
acquire lock on
base-bucket2:ebe5dd13-0db4-4739-836c-1486abde1cde.4912.11 ret=-16
2021-04-27 19:35:08.257 7f6b7a307700 0 RGWReshardLock::lock failed to
acquire lock on
base-bucket2:ebe5dd13-0db4-4739-836c-1486abde1cde.4912.11 ret=-16
2021-04-27 19:35:08.257 7f6a53fff700 0 RGWReshardLock::lock failed to
acquire lock on
base-bucket2:ebe5dd13-0db4-4739-836c-1486abde1cde.4912.11 ret=-16
2021-04-27 19:35:08.372 7f6a317fa700 0 RGWReshardLock::lock failed to
acquire lock on
base-bucket2:ebe5dd13-0db4-4739-836c-1486abde1cde.4912.11 ret=-16
2021-04-27 19:35:08.850 7f6a32ffd700 0 RGWReshardLock::lock failed to
acquire lock on
base-bucket2:ebe5dd13-0db4-4739-836c-1486abde1cde.4912.11 ret=-16
2021-04-27 19:35:14.251 7f6b7a307700 0 check_bucket_shards: resharding
needed: stats.num_objects=100070 shard max_objects=100000
2021-04-27 19:35:14.279 7f6a317fa700 0 cls_rgw_bucket_link_olh() returned r=-2
2021-04-27 19:35:14.279 7f6a317fa700 0 bucket_index_link_olh()
target_obj=base-bucket2:_:xkNfxlu774LD0RSWFgTwivoXhNOqAMQ_flow-ndg4zg/pcaps/2021042719/27171091938291920
delete_marker=0 returned -2
2021-04-27 19:35:14.616 7f6b79b06700 0 check_bucket_shards: resharding
needed: stats.num_objects=100074 shard max_objects=100000
2021-04-27 19:35:15.591 7f6a52ffd700 0 check_bucket_shards: resharding
needed: stats.num_objects=100075 shard max_objects=100000
2021-04-27 19:36:07.111 7f6a7d7fa700 0 check_bucket_shards: resharding
needed: stats.num_objects=100383 shard max_objects=100000
Does anyone have any clue about the cause of this? Thanks:-)
I also filed a ticket for this: https://tracker.ceph.com/issues/50552
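If anyone wants to poke at this, the reshard state can be inspected
along these lines (the bucket name is taken from the log above; this is
just a sketch of the usual radosgw-admin commands):

    # list pending/in-progress reshard operations
    radosgw-admin reshard list
    # show the reshard status of the affected bucket
    radosgw-admin reshard status --bucket=base-bucket2
    # compare the bucket's object counts and shard layout
    radosgw-admin bucket stats --bucket=base-bucket2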
Thanks, everyone.
There is a RAID HBA in each of the machines in our clusters, to which
all SATA disks are attached. We configured the RAID HBA cache mode to
"write through", but, as I checked yesterday, the BBUs of the RAID HBAs
are not charged. I'm not quite sure whether the BBU has anything to
do with the data loss; as far as I know, all data should be persisted
to the underlying disk before acknowledging the upper layers when the
cache mode is "write through". Am I missing anything? Thanks:-)
On Tue, 27 Apr 2021 at 23:43, Martin Verges <martin.verges(a)croit.io> wrote:
>
> What drives do you use? Do they have PLP (power loss protection)? Is
> there any form of raid controller involved?
>
> --
> Martin Verges
> Managing director
>
> Mobile: +49 174 9335695
> E-Mail: martin.verges(a)croit.io
> Chat: https://t.me/MartinVerges
>
> croit GmbH, Freseniusstr. 31h, 81247 Munich
> CEO: Martin Verges - VAT-ID: DE310638492
> Com. register: Amtsgericht Munich HRB 231263
>
> Web: https://croit.io
> YouTube: https://goo.gl/PGE1Bx
>
> On Tue, 27 Apr 2021 at 10:54, Xuehan Xu <xxhdx1985126(a)gmail.com> wrote:
> >
> > Hi, everyone.
> >
> > Recently, one of our online clusters experienced a whole-cluster power
> > outage, and after power was restored, many OSDs started to log the
> > following error:
> >
> > 2021-04-27 15:38:05.503 2b372b957700 -1
> > bluestore(/var/lib/ceph/osd/ceph-3) _verify_csum bad crc32c/0x1000
> > checksum at blob offset 0x36000, got 0x41fe1397, expected 0x8d7f5975,
> > device location [0xa7e76000~1000], logical extent 0x1b6000~1000,
> > object #9:45a4e02a:::rbd_data.3b35df93038d.0000000000000095:head#
> > 2021-04-27 15:38:05.504 2b372b957700 -1
> > bluestore(/var/lib/ceph/osd/ceph-3) _verify_csum bad crc32c/0x1000
> > checksum at blob offset 0x36000, got 0x41fe1397, expected 0x8d7f5975,
> > device location [0xa7e76000~1000], logical extent 0x1b6000~1000,
> > object #9:45a4e02a:::rbd_data.3b35df93038d.0000000000000095:head#
> > 2021-04-27 15:38:05.505 2b372b957700 -1
> > bluestore(/var/lib/ceph/osd/ceph-3) _verify_csum bad crc32c/0x1000
> > checksum at blob offset 0x36000, got 0x41fe1397, expected 0x8d7f5975,
> > device location [0xa7e76000~1000], logical extent 0x1b6000~1000,
> > object #9:45a4e02a:::rbd_data.3b35df93038d.0000000000000095:head#
> > 2021-04-27 15:38:05.506 2b372b957700 -1
> > bluestore(/var/lib/ceph/osd/ceph-3) _verify_csum bad crc32c/0x1000
> > checksum at blob offset 0x36000, got 0x41fe1397, expected 0x8d7f5975,
> > device location [0xa7e76000~1000], logical extent 0x1b6000~1000,
> > object #9:45a4e02a:::rbd_data.3b35df93038d.0000000000000095:head#
> > 2021-04-27 15:38:28.379 2b372c158700 -1
> > bluestore(/var/lib/ceph/osd/ceph-3) _verify_csum bad crc32c/0x1000
> > checksum at blob offset 0x40000, got 0xce935e16, expected 0x9b502da7,
> > device location [0xa9a80000~1000], logical extent 0x80000~1000, object
> > #9:c2a6d9ae:::rbd_data.3b35df93038d.0000000000000696:head#
> >
> > We are using Nautilus 14.2.10, and we put RocksDB on SSDs while the
> > BlueStore data is on SATA disks. It seems that BlueStore didn't
> > survive the power outage; is it supposed to behave this way? Is
> > there any way to prevent it?
> >
> > Thanks:-)
> > _______________________________________________
> > Dev mailing list -- dev(a)ceph.io
> > To unsubscribe send an email to dev-leave(a)ceph.io
Hi all,
In order to improve the handling of backports (3 stable releases means
x3 process amplification) and to focus attention on the ones that
really require thorough review, *what about using a label to indicate
that the backport was conflict-free (e.g. "clean-backport")?*
It could start as a manual action, but if useful it could also be
easily automated with a GitHub Action
<https://github.com/marketplace/actions/multi-labeler> or directly from
the ceph-backport script.
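As a rough sketch of the manual flow (the label name is just the
suggestion above, the PR number is a placeholder, and the GitHub CLI is
only one way to do it):

    # create the label once in the repo
    gh label create clean-backport --repo ceph/ceph \
        --description "Backport applied without conflicts"
    # mark a backport PR that cherry-picked cleanly
    gh pr edit 12345 --repo ceph/ceph --add-label clean-backport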
Additionally, adding a size label (XL, L, M, S, ...)
<https://github.com/marketplace?type=actions&query=label+size> might also
help everyone quickly understand the complexity and/or when searching PRs
in Github.
These are just two things we could start doing right now without much
effort. Further improvements might be launching the "ceph-backport"
script directly from the BackportBot or syncing GitHub and Redmine to
handle the backporting paperwork.
What do you think?
Kind Regards,
Ernesto
Hey folks, now that Pacific is out I wanted to bring up docs backports.
Today, docs.ceph.com shows master by default, with an appropriate
warning at the top that it represents a development version.
Since the primary audience of the docs is users, not developers, I
suggest that we switch the default branch to the latest stable, i.e.
pacific, and apply the normal backport process to docs that are
relevant to the latest stable release as well.
To kickstart things, I'll prepare a backport of the existing
doc changes since the pacific release.
What do folks think?
Josh
Hi Folks,
The performance meeting will be starting in about 40 minutes at 8AM
PST! Today there are a couple of quick Crimson updates and also a
couple of new PRs from Adam and Igor that try to improve BlueStore
cache pinning/trimming behavior. Please feel free to add your own
topics as well!
Etherpad:
https://pad.ceph.com/p/performance_weekly
Bluejeans:
https://bluejeans.com/908675367
Mark
Hey Josh, adding the dev list where you may get more input.
Generally I think your analysis is correct about the current behavior.
In particular if another copy of a shard is available, backfill or
recovery will read from just that copy, not the rest of the OSDs.
Otherwise, k shards must be read to reconstruct the data (for
Reed-Solomon family erasure codes).
IIRC it doesn't matter whether it's a data or parity shard, the
path is the same.
With respect to reservations, it seems like an oversight that
we don't reserve other shards for backfilling. We reserve all
shards for recovery [0].
On the other hand, overload from recovery is handled better in
pacific and beyond with mclock-based QoS, which provides much
more effective control of recovery traffic [1][2].
In prior versions, the osd_recovery_sleep option was the best
way to get fine-grained control of recovery and backfill
traffic, but it was not dynamic at all. osd_max_backfills
put a hard upper limit on parallelism. mclock supersedes both
of these when it's enabled, since it can handle bursting and
throttling itself.
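For reference, the knobs mentioned above can be set roughly like this
(the values are only examples; see the docs linked below for details):

    # pre-mclock throttling
    ceph config set osd osd_recovery_sleep 0.1
    ceph config set osd osd_max_backfills 1
    # Pacific and later: switch to mclock and favor recovery
    ceph config set osd osd_op_queue mclock_scheduler
    ceph config set osd osd_mclock_profile high_recovery_ops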
Josh
[0]
https://github.com/ceph/ceph/blob/v16.2.1/src/osd/PeeringState.cc#L5914-L59…
[1]
https://docs.ceph.com/en/latest/rados/configuration/osd-config-ref/#dmclock…
[2] https://docs.ceph.com/en/latest/rados/configuration/mclock-config-ref/
On 4/19/21 12:24 PM, Josh Baergen wrote:
> Hey all,
>
> I wanted to confirm my understanding of some of the mechanics of
> backfill in EC pools. I've yet to find a document that outlines this
> in detail; if there is one, please send it my way. :) Some of what I
> write below is likely in the "well, duh" category, but I tended
> towards completeness.
>
> First off, I understand that backfill reservations work the same way
> between replicated pools and EC pools. A local reservation is taken on
> the primary OSD, then a remote reservation on the backfill target(s),
> before the backfill is allowed to begin. Until this point, the
> backfill is in the backfill_wait state.
>
> When the backfill begins, though, is when the differences begin. Let's
> say we have an EC 3:2 PG that's backfilling from OSD 2 to OSD 5
> (formatted here like pgs_brief):
>
> 1.1 active+remapped+backfilling [0,1,5,3,4] 0 [0,1,2,3,4] 0
>
> The question in my mind was: Where is the data for this backfill
> coming from? In replicated pools, all reads come from the primary.
> However, in this case, the primary does not have the data in question;
> the primary has to either read the EC chunk from OSD 2, or it has to
> reconstruct it by reading from 3 of the OSDs in the acting set.
>
> Based on observation, I _think_ this is what happens:
> 1. As long as the PG is not degraded, the backfill read is simply
> forwarded by the primary to OSD 2.
> 2. Once the PG becomes degraded, the backfill read needs to use the
> reconstructing path, and begins reading from 3 of the OSDs in the
> acting set.
>
> Questions:
> 1. Can anyone confirm or correct my description of how EC backfill
> operates? In particular, in case 2 above, does it matter whether OSD 2
> is the cause of degradation, for example? Does the read still get
> forwarded to a single OSD when it's parity chunks that are being moved
> via backfill?
> 2. I'm curious as to why a 3rd reservation, for the source OSD, wasn't
> introduced as a part of EC in Ceph. We've occasionally seen an OSD
> become overloaded because several backfills were reading from it
> simultaneously, and there's no way to control this via the normal
> osd_max_backfills mechanism. Is anyone aware of discussions to this
> effect?
>
> Thanks!
> Josh
> _______________________________________________
> ceph-users mailing list -- ceph-users(a)ceph.io
> To unsubscribe send an email to ceph-users-leave(a)ceph.io
>