Description
Particle accelerators generate vast amounts of historical data in their operational logs, yet learning-based control still often relies on risky online optimisation. To make better use of this data and avoid online exploration, we present an offline reinforcement learning (RL) workflow. First, we use XSuite to generate high-fidelity trajectories for steering tasks across representative scenarios, including optics variations, alignment errors and jitter, yielding a synthetic dataset of expert and non-expert behaviour. Second, we learn an uncertainty-aware, Koopman-stabilised world model from this data, in which the nonlinear beam dynamics are lifted into a latent space with approximately linear, spectrally constrained evolution plus a regularised residual term. This structure yields numerically stable long-horizon rollouts and latent-space estimates of epistemic uncertainty.
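As a minimal mathematical sketch of this structure (the notation is ours; the abstract fixes neither the symbols nor the exact spectral constraint): writing $x_t$ for the measured beam state, $u_t$ for the corrector settings, and $\varphi_\theta$ for a learned encoder, the latent dynamics take the form

\[
z_t = \varphi_\theta(x_t), \qquad
z_{t+1} = A z_t + B u_t + r_\theta(z_t, u_t), \qquad
\sigma_{\max}(A) \le 1,
\]

where $A$ and $B$ define the approximately linear Koopman evolution, a spectral constraint such as the bound $\sigma_{\max}(A) \le 1$ prevents long-horizon rollouts from diverging, and $r_\theta$ is the regularised residual capturing dynamics the linear lift misses.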
The resulting surrogate environment enables model-based offline RL: policies are optimised entirely on pre-generated data, while epistemic uncertainty is used to detect distribution shift and enforce soft safety constraints. We benchmark these offline RL policies against a proximal policy optimisation (PPO) agent trained directly in simulation. Results show that policies trained purely offline on the Koopman world model can match online PPO performance without requiring any interaction with the real machine. This demonstrates a safe, reproducible pathway for turning historical accelerator data into effective learning-based control policies.
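One common way to turn such uncertainty estimates into a soft safety constraint (a hypothetical instantiation in the style of uncertainty-penalised model-based offline RL such as MOPO; the abstract does not specify the exact mechanism) is to penalise the surrogate reward:

\[
\tilde{r}(z_t, u_t) = r(z_t, u_t) - \lambda\, u_{\mathrm{epi}}(z_t, u_t),
\]

where $u_{\mathrm{epi}}$ is an epistemic-uncertainty estimate in latent space (e.g. disagreement across an ensemble of latent models), large values of which flag distribution shift, and $\lambda$ trades off return against staying close to the training data distribution.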
| In which format do you intend to submit your paper? | LaTeX |
|---|---|