Speaker
Thorsten Hellert
(Lawrence Berkeley National Laboratory)
Description
We present PhysBERT, a specialized sentence-embedding model trained on 1.2 million arXiv physics papers, and AccPhysBERT, its variant fine-tuned for accelerator physics. Evaluation across retrieval, clustering, and similarity tasks shows gains of up to 12% over general-purpose models on physics corpora and 18% on accelerator-specific tasks. Applications include semantic reviewer–paper matching, Retrieval-Augmented Generation for control-room logbooks, and rapid sub-domain adaptation. We analyze key design choices (data curation, masking objectives, and contrastive fine-tuning) and outline strategies for continual adaptation, providing a blueprint for domain-specific embeddings in the physical sciences.
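As a minimal sketch of the retrieval and reviewer–paper matching use cases described above, the snippet below embeds candidate papers and a query with a sentence-embedding model and ranks them by cosine similarity. The checkpoint name `thellert/physbert_cased` is an assumption about how the model might be published; substitute the actual released identifier.

```python
from sentence_transformers import SentenceTransformer, util

# Assumed checkpoint name -- the abstract does not state the published
# model identifier; replace with the actual PhysBERT/AccPhysBERT release.
model = SentenceTransformer("thellert/physbert_cased")

# Illustrative candidate abstracts and a query (hypothetical examples).
papers = [
    "Beam dynamics simulations for a fourth-generation storage ring.",
    "Topological phases in two-dimensional quantum materials.",
]
query = "orbit correction in synchrotron light sources"

# Encode both sides into a shared embedding space.
paper_emb = model.encode(papers, convert_to_tensor=True)
query_emb = model.encode(query, convert_to_tensor=True)

# Cosine similarity ranks candidates against the query -- the core
# operation behind semantic retrieval and reviewer-paper matching.
scores = util.cos_sim(query_emb, paper_emb)
print(scores)
```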
Region represented
America
Paper preparation format
LaTeX
Author
Thorsten Hellert
(Lawrence Berkeley National Laboratory)
Co-authors
Andrea Pollastro
(Lawrence Berkeley National Laboratory)
Mr João Montenegro
(Lawrence Berkeley National Laboratory)
Marco Venturini
(Lawrence Berkeley National Laboratory)