Speaker
Description
For the past 20 years, the CERN Safety System Monitoring (SSM) framework has safeguarded the operational health of CERN’s access and personnel safety systems. Built on the Zabbix monitoring platform, Grafana, and in-house developments, SSM provides real-time diagnostics, alerts, issue escalation, and predictive analytics for a wide range of critical infrastructure, operating systems, network devices, storage, and specialized equipment like video cameras and UPS units. The objective of SSM is to enhance maintenance and operational efficiency by delivering timely and reliable system feedback, enabling rapid identification of both immediate failures and gradual degradation before they can impact operations. SSM also supports long-term data analysis for post-incident investigations, statistical evaluations, and trend forecasting, thereby contributing to the optimization of safety system designs. It also provides operational statistics and graphs to CERN management and site services offering valuable graphical insights. Ongoing developments aim to expand SSM's capabilities through integration of AI modules for predictive maintenance, enabling pre-emptive interventions and reducing downtime. These enhancements also include automatic generation of reports and notifications to operations teams. By continuously assessing the safety status of operational systems, SSM plays a crucial role in mitigating risks and ensuring long-term reliability of CERN’s technical infrastructure.