Nováček Martin, Řezáč Jan
Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, 160 00 Prague, Czech Republic.
J Chem Theory Comput. 2025 Jan 28;21(2):678-690. doi: 10.1021/acs.jctc.4c01330. Epub 2025 Jan 3.
Machine learning (ML) methods offer a promising route to the construction of universal molecular potentials with high accuracy and low computational cost. It is becoming evident that integrating physical principles into these models, or utilizing them in a Δ-ML scheme, significantly enhances their robustness and transferability. This paper introduces PM6-ML, a Δ-ML method that synergizes the semiempirical quantum-mechanical (SQM) method PM6 with a state-of-the-art ML potential applied as a universal correction. The method demonstrates superior performance over standalone SQM and ML approaches and covers a broader chemical space than its predecessors. It is scalable to systems with thousands of atoms, which makes it applicable to large biomolecular systems. Extensive benchmarking confirms PM6-ML's accuracy and robustness. Its practical application is facilitated by a direct interface to MOPAC. The code and parameters are available at https://github.com/Honza-R/mopac-ml.
机器学习(ML)方法为构建高精度、低计算成本的通用分子势能提供了一条很有前景的途径。越来越明显的是,将物理原理整合到这些模型中,或者在Δ-ML方案中使用它们,能显著提高其鲁棒性和可转移性。本文介绍了PM6-ML,这是一种Δ-ML方法,它将半经验量子力学(SQM)方法PM6与作为通用校正应用的最先进的ML势能相结合。该方法展示了优于独立SQM和ML方法的性能,并且比其前身涵盖了更广泛的化学空间。它可扩展到具有数千个原子的系统,这使其适用于大型生物分子系统。广泛的基准测试证实了PM6-ML的准确性和鲁棒性。与MOPAC的直接接口便于其实际应用。代码和参数可在https://github.com/Honza-R/mopac-ml上获取。