Davidson School of Chemical Engineering, Purdue University, West Lafayette, Indiana 47906, United States.
Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey 08540, United States.
J Chem Inf Model. 2021 Oct 25;61(10):5013-5027. doi: 10.1021/acs.jcim.1c00491. Epub 2021 Sep 17.
Force-field development has undergone a revolution in the past decade with the proliferation of quantum chemistry based parametrizations and the introduction of machine learning approximations of the atomistic potential energy surface. Nevertheless, transferable force fields with broad coverage of organic chemical space remain necessary for applications in materials and chemical discovery where throughput, consistency, and computational cost are paramount. Here, we introduce a force-field development framework called Topology Automated Force-Field Interactions (TAFFI) for developing transferable force fields of varying complexity against an extensible database of quantum chemistry calculations. TAFFI formalizes the concept of atom typing and makes it the basis for generating systematic training data that maintains a one-to-one correspondence with force-field terms. This feature makes TAFFI arbitrarily extensible to new chemistries while maintaining internal consistency and transferability. As a demonstration of TAFFI, we have developed a fixed-charge force-field, TAFFI-gen, from scratch that includes coverage for common organic functional groups that is comparable to established transferable force fields. The performance of TAFFI-gen was benchmarked against OPLS and GAFF for reproducing several experimental properties of 87 organic liquids. The consistent performance of these force fields, despite their distinct origins, validates the TAFFI framework while also providing evidence of the representability limitations of fixed-charge force fields.
在过去的十年中,随着基于量子化学的参数化和原子势能面的机器学习逼近方法的普及,力场的发展经历了一场革命。然而,对于材料和化学发现等应用,具有广泛的有机化学空间覆盖范围的可转移力场仍然是必要的,因为这些应用对通量、一致性和计算成本至关重要。在这里,我们引入了一种称为拓扑自动力场相互作用(TAFFI)的力场开发框架,用于针对可扩展的量子化学计算数据库开发具有不同复杂程度的可转移力场。TAFFI 形式化了原子类型化的概念,并使其成为生成与力场项保持一一对应关系的系统训练数据的基础。这一特性使得 TAFFI 可以任意扩展到新的化学领域,同时保持内部一致性和可转移性。作为 TAFFI 的演示,我们从零开始开发了一个固定电荷力场,TAFFI-gen,它包括常见有机官能团的覆盖范围,与已建立的可转移力场相当。TAFFI-gen 的性能与 OPLS 和 GAFF 进行了基准测试,以重现 87 种有机液体的几种实验性质。尽管这些力场起源不同,但它们的一致性能验证了 TAFFI 框架,同时也证明了固定电荷力场的代表性限制。