Cephalocon 2022 has ended
July 11 - 13, 2022 | Portland, Oregon + Virtual
View More Details & Registration

Please note: This schedule is automatically displayed in Pacific Daylight Time (PDT). To view the schedule at your preferred time, please choose your location on the right-hand navigation panel under ’Timezone.’
The schedule is subject to change.
Back To Schedule
Tuesday, July 12 • 11:00am - 11:40am
Improved Business Continuity for an Existing Large Scale Ceph Infrastructure: A Story from Practical Experience - Enrico Bocch & Arthur Outhenin-Chalandre, CERN

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
The IT Department at CERN (European Organization for Nuclear Research) operates a large-scale computing and storage infrastructure for processing scientific data and providing IT services to its user community. Ceph is a critical part of this picture as it provides: 1. Block storage for the OpenStack infrastructure (440k cores - 25 PB), 2. S3 object storage for cloud-native applications, HTTP-based software distribution, and backup needs (16 PB), 3. CephFS for shared filesystems in HPC clusters and storage persistency in OpenShift and Kubernetes (7 PB). In the past year, our Ceph infrastructure has been largely restructured with the goal of offering storage solutions for High(er) Availability and Disaster Recovery / Business Continuity. In this presentation we will detail how we transitioned from a single RBD zone to multiple Storage AZs; how we hardened and optimized RBD snapshot mirroring for OpenStack; how we integrated a restic-based CephFS backup orchestrator with Manila; and our experience merging two independent S3 clusters into a single multi-region Zonegroup, as well as experimentation with Maglev load balancing.

avatar for Enrico Bocchi

Enrico Bocchi

Computing Engineer, CERN
Enrico is a Computing Engineer at CERN, where he has worked in the past 5 years Distributed Storage Systems. He is responsible for the operating and evolving critical production services at the scale of 10's of PBs including Ceph block and object storage. Enrico holds a joint-PhD... Read More →
avatar for Arthur Outhenin-Chalandre

Arthur Outhenin-Chalandre

Computing Fellow, CERN
Arthur is a Computer Engineer at CERN where he started to work on Business Continuity for Ceph in early 2021. He is an active contributor to the Ceph project, especially in the context of RBD mirroring features, and shares responsability for operating production Ceph clusters at CERN... Read More →

Tuesday July 12, 2022 11:00am - 11:40am PDT
Regency Ballroom D