Otten Sydney, Caron Sascha, de Swart Wieske, van Beekveld Melissa, Hendriks Luc, van Leeuwen Caspar, Podareanu Damian, Ruiz de Austri Roberto, Verheyen Rob
Institute for Mathematics, Astro- and Particle Physics IMAPP Radboud Universiteit, Nijmegen, The Netherlands.
GRAPPA, University of Amsterdam, Amsterdam, The Netherlands.
Nat Commun. 2021 May 20;12(1):2985. doi: 10.1038/s41467-021-22616-z.
Simulating nature and in particular processes in particle physics require expensive computations and sometimes would take much longer than scientists can afford. Here, we explore ways to a solution for this problem by investigating recent advances in generative modeling and present a study for the generation of events from a physical process with deep generative models. The simulation of physical processes requires not only the production of physical events, but to also ensure that these events occur with the correct frequencies. We investigate the feasibility of learning the event generation and the frequency of occurrence with several generative machine learning models to produce events like Monte Carlo generators. We study three processes: a simple two-body decay, the processes ee → Z → ll and [Formula: see text] including the decay of the top quarks and a simulation of the detector response. By buffering density information of encoded Monte Carlo events given the encoder of a Variational Autoencoder we are able to construct a prior for the sampling of new events from the decoder that yields distributions that are in very good agreement with real Monte Carlo events and are generated several orders of magnitude faster. Applications of this work include generic density estimation and sampling, targeted event generation via a principal component analysis of encoded ground truth data, anomaly detection and more efficient importance sampling, e.g., for the phase space integration of matrix elements in quantum field theories.
模拟自然,尤其是粒子物理中的过程,需要耗费大量计算资源,有时所需时间远超科学家所能承受。在此,我们通过研究生成式建模的最新进展,探索解决该问题的方法,并展示一项利用深度生成模型从物理过程生成事件的研究。物理过程的模拟不仅需要生成物理事件,还需确保这些事件以正确的频率发生。我们研究了使用几种生成式机器学习模型来学习事件生成和发生频率以生成类似蒙特卡罗发生器的事件的可行性。我们研究了三个过程:一个简单的两体衰变、ee → Z → ll 过程以及包括顶夸克衰变的[公式:见正文]过程和探测器响应的模拟。通过利用变分自编码器的编码器缓冲编码后的蒙特卡罗事件的密度信息,我们能够为从解码器采样新事件构建一个先验,从而生成与真实蒙特卡罗事件非常吻合且生成速度快几个数量级的分布。这项工作的应用包括通用密度估计和采样、通过对编码的真实数据进行主成分分析进行目标事件生成、异常检测以及更高效的重要性采样,例如用于量子场论中矩阵元相空间积分。