A long term storage solution for Tango attribute data at SKAO

THCR003
25 Sept 2025, 14:30
15m
Red Lacquer Room (Palmer House Hilton Chicago)

Red Lacquer Room

Palmer House Hilton Chicago

17 East Monroe Street Chicago, IL 60603, United States of America
Contributed Oral Presentation MC16: Data Management and Analytics THCR MC16 Data Management and Analytics

Speaker

Mauricio Zambrano (SKA Observatory)

Description

At the Square Kilometre Array Observatory (SKAO), monitoring data is ingested from distributed subsystems via the Tango Controls archiver, with attribute data stored in the Engineering Data Archive (EDA). The EDA uses a PostgreSQL database with the TimescaleDB extension, offering a performant solution for time-series storage. However, as SKAO infrastructure scales, PostgreSQL becomes impractical for long-term retention due to cost and operational complexity. This paper outlines a long-term storage strategy based on S3-compatible object storage. The solution decouples operational and archival storage by exporting and serializing Tango attribute data into efficient formats like Apache Parquet for storage in S3. Metadata indexing ensures the data remains discoverable and retrievable over time. The approach draws from the MeerKAT telescope's experience, a precursor to SKAO operated by SARAO. MeerKAT faced similar challenges archiving large volumes of telemetry data and adopted a database and long term storage model. We also describe supporting tools and processes for managing data lifecycle transitions. The paper concludes with open challenges and future directions for integrating this approach into observatory-wide data access frameworks, ensuring engineering telemetry remains accessible throughout the SKAO system lifecycle.

Manuscript formatting LaTeX

Author

Mauricio Zambrano (SKA Observatory)

Co-authors

Mr Johan Venter (South African Radio Astronomy Observatory) Mr Thomas Juerges (SKA Observatory)

Presentation materials

There are no materials yet.