Patsalidis Nikolaos, Mohammadi Mohsen Doust, Bhowmick Somnath, Biskos George, Harmandaris Vagelis
Computation-based Science and Technology Research Center, The Cyprus Institute, Aglantzia 2121, Cyprus.
Climate & Atmosphere Research Centre, The Cyprus Institute, Aglantzia 2121, Cyprus.
J Chem Inf Model. 2025 Sep 8;65(17):9009-9021. doi: 10.1021/acs.jcim.5c01193. Epub 2025 Aug 12.
We propose an active learning (AL) framework to develop classical force fields (FFs) that accurately model the potential energy surfaces (PES) of gas/solid atomic-scale complexes. A central challenge is integrating AL with flexible, computationally efficient physics-aware potentials to achieve quantum-level accuracy for complex interfacial systems. Our approach trains physics-aware potentials, with incorporated flexibility and smoothness, on actively sampled density functional theory (DFT) data to describe interactions between undercoordinated atomic silver (Ag) clusters and gaseous pollutants (CO, CO, SO), relevant for environmental applications like sensing. The AL process follows three stages: (1) FFs are trained using adaptable physics aware potentials of semiempirical descriptors, optimized via a Pareto analysis scheme; (2) new candidate structures are generated through the use of the refined FFs in Metropolis Hastings Monte Carlo (MHMC) or stochastic molecular dynamics (sMD) simulations; (3) a subset of candidates is selected for DFT computations based on an outlier score (OS), which utilizes the existing data descriptor distributions, ensuring diverse PES exploration. This framework produces FFs capable of capturing cohesive, physisorption, and chemisorption interactions with admirable accuracy, close to ab initio methods, while retaining the efficiency of semiempirical potentials. To demonstrate, produced FFs are utilized in molecular dynamics (MD) simulations of single Ag clusters embedded in bulk gas phases, examining condensation characteristics. Our methodology is highly versatile, easily accommodating various choices of descriptors, model basis sets, and sampling techniques.
我们提出了一种主动学习(AL)框架,用于开发经典力场(FFs),以精确模拟气体/固体原子尺度复合物的势能面(PES)。一个核心挑战是将主动学习与灵活、计算效率高的物理感知势相结合,以实现复杂界面系统的量子级精度。我们的方法在主动采样的密度泛函理论(DFT)数据上训练具有灵活性和平滑性的物理感知势,以描述欠配位原子银(Ag)簇与气态污染物(CO、CO、SO)之间的相互作用,这与传感等环境应用相关。主动学习过程包括三个阶段:(1)使用半经验描述符的适应性物理感知势训练力场,通过帕累托分析方案进行优化;(2)通过在Metropolis Hastings蒙特卡罗(MHMC)或随机分子动力学(sMD)模拟中使用优化后的力场生成新的候选结构;(3)根据异常值分数(OS)选择一部分候选结构进行DFT计算,该分数利用现有数据描述符分布,确保对势能面进行多样化探索。该框架生成的力场能够以令人钦佩的精度捕捉内聚、物理吸附和化学吸附相互作用,接近从头算方法,同时保留半经验势的效率。为了进行演示,将生成的力场用于嵌入大量气相中的单个Ag簇的分子动力学(MD)模拟,以研究凝聚特性。我们的方法具有高度的通用性,能够轻松适应描述符、模型基组和采样技术的各种选择。