From terabytes to petabyte: scaling of the archiving system for FAIR

TUPD109
23 Sept 2025, 16:00
1h 30m
Palmer House Hilton Chicago

Palmer House Hilton Chicago

17 East Monroe Street Chicago, IL 60603, United States of America
Poster Presentation MC16: Data Management and Analytics TUPD Posters

Speaker

Vitaliy Rapp (GSI Helmholtz Centre for Heavy Ion Research)

Description

With the recent rise of AI and various machine learning models, the importance of storing and managing data generated by control systems is greater than ever before. In 2016, GSI began developing an archiving system to collect, store, and retrieve data from the diverse accelerator devices managed by the GSI control infrastructure. The system was successfully deployed in production in 2021. To evaluate its capabilities and suitability for operational needs, the system was initially launched with a limited storage capacity of 50 TB and reduced computing power. With a current data volume of over 100 GB per day, the archiving system quickly exceeded its initial limits. However, the experience gained in day-to-day operations thus far has allowed us to better understand our use-cases and identify areas for further improvement. In preparation for the anticipated start of FAIR operations in 2027, the system will require significant scaling to meet future demands. Therefore, this is an opportune moment to review and refine the system’s architecture based on the experience gained so far. This paper outlines the challenges encountered with the current implementation and presents the solutions that will be incorporated into the system for FAIR operations.

Author

Vitaliy Rapp (GSI Helmholtz Centre for Heavy Ion Research)

Co-authors

Jules Kerssemakers (GSI Helmholtz Centre for Heavy Ion Research) Krzysztof Klimczyk (S2Innovation Sp z o. o. [Ltd.]) Prof. Piotr Salabura (Jagiellonian University)

Presentation materials

There are no materials yet.