Kasieczka Gregor, Shih David
Institut für Experimentalphysik, Universität Hamburg, 22761 Hamburg, Germany.
NHETC, Dept. of Physics and Astronomy, Rutgers University, Piscataway, New Jersey 08854 USA.
Phys Rev Lett. 2020 Sep 18;125(12):122001. doi: 10.1103/PhysRevLett.125.122001.
While deep learning has proven to be extremely successful at supervised classification tasks at the LHC and beyond, for practical applications, raw classification accuracy is often not the only consideration. One crucial issue is the stability of network predictions, either versus changes of individual features of the input data or against systematic perturbations. We present a new method based on a novel application of "distance correlation," a measure quantifying nonlinear correlations, that achieves equal performance to state-of-the-art adversarial decorrelation networks but is much simpler and more stable to train. To demonstrate the effectiveness of our method, we carefully recast a recent ATLAS study of decorrelation methods as applied to boosted, hadronic W tagging. We also show the feasibility of regularization with distance correlation for more powerful convolutional neural networks, as well as for the problem of hadronic top tagging.
虽然深度学习在大型强子对撞机及其他领域的监督分类任务中已被证明极其成功,但对于实际应用而言,原始分类准确率往往并非唯一考量因素。一个关键问题是网络预测的稳定性,无论是相对于输入数据单个特征的变化,还是针对系统性扰动。我们提出了一种基于“距离相关”新应用的新方法,“距离相关”是一种量化非线性相关性的度量,该方法实现了与当前最先进的对抗去相关网络相同的性能,但训练起来要简单得多且更稳定。为了证明我们方法的有效性,我们仔细地重新构建了最近ATLAS关于去相关方法应用于增强型强子W标记的研究。我们还展示了使用距离相关对更强大的卷积神经网络进行正则化的可行性,以及对强子顶标记问题的可行性。