Matlock Matthew K, Dang Na Le, Swamidass S Joshua
Department of Pathology and Immunology, School of Medicine, Washington University in St. Louis, St. Louis, Missouri 63130, United States.
ACS Cent Sci. 2018 Jan 24;4(1):52-62. doi: 10.1021/acscentsci.7b00405. Epub 2018 Jan 3.
A collection of new approaches to building and training neural networks, collectively referred to as deep learning, are attracting attention in theoretical chemistry. Several groups aim to replace computationally expensive quantum mechanics calculations with learned estimators. This raises questions about the representability of complex quantum chemical systems with neural networks. Can local-variable models efficiently approximate nonlocal quantum chemical features? Here, we find that convolutional architectures, those that only aggregate information locally, cannot efficiently represent aromaticity and conjugation in large systems. They cannot represent long-range nonlocality known to be important in quantum chemistry. This study uses aromatic and conjugated systems computed from molecule graphs, though reproducing quantum simulations is the ultimate goal. This task, by definition, is both computable and known to be important to chemistry. The failure of convolutional architectures on this focused task calls into question their use in modeling quantum mechanics. To remedy this heretofore unrecognized deficiency, we introduce a new architecture that propagates information back and forth in waves of nonlinear computation. This architecture is still a local-variable model, and it is both computationally and representationally efficient, processing molecules in sublinear time with far fewer parameters than convolutional networks. Wave-like propagation models aromatic and conjugated systems with high accuracy, and even models the impact of small structural changes on large molecules. This new architecture demonstrates that some nonlocal features of quantum chemistry can be efficiently represented in local variable models.
一系列构建和训练神经网络的新方法,统称为深度学习,正在理论化学领域引起关注。几个研究团队旨在用学习到的估计器取代计算成本高昂的量子力学计算。这引发了关于神经网络对复杂量子化学系统的表示能力的问题。局部变量模型能否有效地近似非局部量子化学特征?在这里,我们发现卷积架构,即那些仅在局部聚合信息的架构,无法有效地表示大型系统中的芳香性和共轭性。它们无法表示在量子化学中已知很重要的长程非局部性。本研究使用从分子图计算得到的芳香和共轭系统,尽管重现量子模拟是最终目标。根据定义,这项任务既是可计算的,也是对化学很重要的。卷积架构在这个特定任务上的失败,让人质疑它们在量子力学建模中的应用。为了弥补这一迄今未被认识到的缺陷,我们引入了一种新架构,该架构在非线性计算的波中来回传播信息。这种架构仍然是局部变量模型,并且在计算和表示方面都很高效,能以亚线性时间处理分子,所需参数比卷积网络少得多。波状传播模型能高精度地表示芳香和共轭系统,甚至能模拟小结构变化对大分子的影响。这种新架构表明,量子化学的一些非局部特征可以在局部变量模型中得到有效表示。