Presentation

P11 - Enabling Lattice QCD Normalizing Flows in HPC Infrastructures
PosterPDF
DescriptionThe Horizon Europe project interTwin aims at developing a prototype for a multidisciplinary Digital Twin Engine, applicable across a whole spectrum of scientific disciplines: High Energy Physics (HEP), Environment, Climate, etc. As part of this effort we explore the extent to which Machine Learning (ML) methods can speed up Lattice Gauge Theory Simulations in challenging areas of the parameter space where Monte Carlo methods suffer from severe critical slowing down. The overall goal is progressing towards designing the digital twin of a HEP detector, where Lattice QCD simulations could provide future realistic simulations of the Standard model. We are exploiting the advantages of the tools developed in the project interTwin, notably intertwinai, to scale up and support the deployment of our simulations in HPC systems, while enabling as well several code features. The itwinai toolkit provides functionalities for distributed machine learning on HPC, supporting different distributed frameworks (DeepSpeed, Horovod, and PyTorch DistributedDataParallel) implementing different communication protocols across different GPUs, suited to different infrastructures. Furthermore, itwinai also offers a profiling feature based on the PyTorch profiling backend, enabling it to identify communication and computation shares. This profiler will enable the identification of bottlenecks, and hence optimize the code to improve performance.
TimeMonday, June 1610:20 - 10:50 CEST
LocationCampussaal - Plenary Room
Event Type
PASC Poster