Suppr超能文献

利用机器学习为西地那非和含 H、C、N、O、S 的药物构建高效的大规模原子间势能。

Harnessing machine learning for efficient large-scale interatomic potential for sildenafil and pharmaceuticals containing H, C, N, O, and S.

机构信息

Physics Department, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece.

Center for Interdisciplinary Research & Innovation, Aristotle University of Thessaloniki, Greece.

出版信息

Nanoscale. 2024 Oct 3;16(38):18014-18026. doi: 10.1039/d4nr00929k.

Abstract

In this study a cutting-edge approach to producing accurate and computationally efficient interatomic potentials using machine learning algorithms is presented. Specifically, the study focuses on the application of Allegro, a novel machine learning algorithm, running on high-performance GPUs for training potentials. The choice of training parameters plays a pivotal role in the quality of the potential functions. To enable this methodology, the "Solvated Protein Fragments" dataset, containing nearly 2.7 million Density Functional Theory (DFT) calculations for many-body intermolecular interactions involving protein fragments and water molecules, encompassing H, C, N, O, and S elements, is considered as the training dataset. The project optimizes computational efficiency by reducing the initial dataset size according to the intended application. To assess the efficacy of the approach, the sildenafil citrate, iso-sildenafil, aspirin, ibuprofen, mebendazole and urea, representing all five relevant elements, serve as the test bed. The results of the Allegro-trained potentials demonstrate outstanding performance, benefiting from the combination of an appropriate training dataset and parameter selection. This notably enhanced computational efficiency when compared to the computationally intensive DFT method aided by GPU acceleration. Validation of the produced interatomic potentials is achieved through Allegro's own evaluation mechanism, yielding exceptional accuracy. Further verification is carried out through LAMMPS molecular dynamics simulations. Structural optimization by energy minimization and NPT Molecular Dynamics simulations are performed for each potential, assessing relaxation processes and energy reduction. Additional structures, including urea, ammonia, uracil, oxalic acid, and acetic acid, are tested, highlighting the potential's versatility in describing systems containing the aforementioned elements. Visualization of the results confirms the scientific accuracy of each structure's relaxation. The findings of this study demonstrate strong scaling and the potential for applications in pharmaceutical research, allowing the exploration of larger molecular structures not previously amenable to computational analysis at this level of accuracy The success of the machine learning approach underscores its potential to revolutionize computational solid-state physics.

摘要

在这项研究中,提出了一种使用机器学习算法生成准确且计算高效的原子间势的前沿方法。具体而言,该研究侧重于使用新型机器学习算法 Allegro 在高性能 GPU 上进行势训练。训练参数的选择对势函数的质量起着关键作用。为了实现这种方法,考虑了包含近 270 万个密度泛函理论(DFT)计算的“水合蛋白片段”数据集,这些计算涉及蛋白质片段和水分子之间的多体分子相互作用,涵盖 H、C、N、O 和 S 元素,作为训练数据集。该项目通过根据预期应用缩小初始数据集大小来优化计算效率。为了评估该方法的效果,将西地那非枸橼酸盐、异西地那非、阿司匹林、布洛芬、甲苯咪唑和尿素作为测试床,这些物质代表了所有五个相关元素。Allegro 训练的势的结果表明,由于选择了适当的训练数据集和参数,该方法具有出色的性能。与 GPU 加速辅助的计算密集型 DFT 方法相比,这显著提高了计算效率。通过 Allegro 自身的评估机制实现了对生成的原子间势的验证,得到了出色的准确性。通过 LAMMPS 分子动力学模拟进行了进一步验证。对每个势进行能量最小化和 NPT 分子动力学模拟的结构优化,评估弛豫过程和能量降低。对包括尿素、氨、尿嘧啶、草酸和乙酸在内的其他结构进行了测试,突出了该势在描述包含上述元素的系统方面的多功能性。结果的可视化确认了每个结构弛豫的科学准确性。这项研究的结果表明具有强大的可扩展性和在药物研究中的应用潜力,允许探索以前在这种精度水平下无法进行计算分析的更大分子结构。机器学习方法的成功突出了其在计算固态物理中具有变革性的潜力。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验