• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过进化优化实现自动分子碎片化

Automatic molecular fragmentation by evolutionary optimisation.

作者信息

Yu Fiona C Y, Vallejo Jorge L Gálvez, Barca Giuseppe M J

机构信息

School of Computing, Australian National University, Canberra, 2601, ACT, Australia.

School of Computing and Information Technology, The University of Melbourne, Melbourne, 3052, VIC, Australia.

出版信息

J Cheminform. 2024 Aug 19;16(1):102. doi: 10.1186/s13321-024-00896-z.

DOI:10.1186/s13321-024-00896-z
PMID:39160576
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11331744/
Abstract

Molecular fragmentation is an effective suite of approaches to reduce the formal computational complexity of quantum chemistry calculations while enhancing their algorithmic parallelisability. However, the practical applicability of fragmentation techniques remains hindered by a dearth of automation and effective metrics to assess the quality of a fragmentation scheme. In this article, we present the Quick Fragmentation via Automated Genetic Search (QFRAGS), a novel automated fragmentation algorithm that uses a genetic optimisation procedure to generate molecular fragments that yield low energy errors when adopted in Many Body Expansions (MBEs). Benchmark testing of QFRAGS on protein systems with less than 500 atoms, using two-body (MBE2) and three-body (MBE3) MBE calculations at the HF/6-31G* level, reveals mean absolute energy errors (MAEE) of 20.6 and 2.2 kJ  , respectively. For larger protein systems exceeding 500 atoms, MAEEs are 181.5 kJ  for MBE2 and 24.3 kJ  for MBE3. Furthermore, when compared to three manual fragmentation schemes on a 40-protein dataset, using both MBE and Fragment Molecular Orbital techniques, QFRAGS achieves comparable or often lower MAEEs. When applied to a 10-lipoglycan/glycolipid dataset, MAEs of 7.9 and 0.3 kJ  were observed at the MBE2 and MBE3 levels, respectively.Scientific Contribution This Article presents the Quick Fragmentation via Automated Genetic Search (QFRAGS), an innovative molecular fragmentation algorithm that significantly improves upon existing molecular fragmentation approaches by specifically addressing their lack of automation and effective fragmentation quality metrics. With an evolutionary optimisation strategy, QFRAGS actively pursues high quality fragments, generating fragmentation schemes that exhibit minimal energy errors on systems with hundreds to thousands of atoms. The advent of QFRAGS represents a significant advancement in molecular fragmentation, greatly improving the accessibility and computational feasibility of accurate quantum chemistry calculations.

摘要

分子碎片化是一套有效的方法,可降低量子化学计算的形式计算复杂度,同时提高其算法并行性。然而,碎片化技术的实际适用性仍然受到自动化程度不足以及缺乏评估碎片化方案质量的有效指标的阻碍。在本文中,我们提出了通过自动遗传搜索的快速碎片化方法(QFRAGS),这是一种新颖的自动碎片化算法,它使用遗传优化程序来生成分子片段,这些片段在多体展开(MBE)中使用时产生的能量误差较小。在HF/6-31G*水平下,使用两体(MBE2)和三体(MBE3)MBE计算对原子数少于500的蛋白质系统进行QFRAGS基准测试,结果显示平均绝对能量误差(MAEE)分别为20.6和2.2 kJ 。对于超过500个原子的较大蛋白质系统,MBE2的MAEE为181.5 kJ ,MBE3的MAEE为24.3 kJ 。此外,与40个蛋白质数据集上的三种手动碎片化方案相比,使用MBE和片段分子轨道技术,QFRAGS实现了相当或更低的MAEE。当应用于10个脂多糖/糖脂数据集时,在MBE2和MBE3水平下观察到的平均绝对误差(MAE)分别为7.9和0.3 kJ 。科学贡献 本文提出了通过自动遗传搜索的快速碎片化方法(QFRAGS),这是一种创新的分子碎片化算法,通过专门解决现有分子碎片化方法缺乏自动化和有效碎片化质量指标的问题,对现有方法进行了显著改进。通过进化优化策略,QFRAGS积极追求高质量片段,生成在具有数百到数千个原子的系统上能量误差最小的碎片化方案。QFRAGS的出现代表了分子碎片化的重大进展,大大提高了精确量子化学计算的可及性和计算可行性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/570294475096/13321_2024_896_Fig20_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/b225743a5dd9/13321_2024_896_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/61d5e291d26b/13321_2024_896_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/da0d419140e2/13321_2024_896_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/b9b164b952d8/13321_2024_896_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/cf6faf0f8145/13321_2024_896_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/08c0d30bd315/13321_2024_896_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/85e7c21d7fd0/13321_2024_896_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/cad7ecdf3730/13321_2024_896_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/fe2926566a3c/13321_2024_896_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/7afe64fe94d1/13321_2024_896_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/7a12e50125f0/13321_2024_896_Figb_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/85e558d51493/13321_2024_896_Figc_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/d154b784cf80/13321_2024_896_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/317f4c58ea3a/13321_2024_896_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/6ff8987be2d5/13321_2024_896_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/4aaf5c394db6/13321_2024_896_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/fdf046afbf8e/13321_2024_896_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/1679136b3c0a/13321_2024_896_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/1748c3be25d6/13321_2024_896_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/faabe1f95ba1/13321_2024_896_Fig17_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/97be7afe102c/13321_2024_896_Fig18_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/272763caf910/13321_2024_896_Fig19_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/570294475096/13321_2024_896_Fig20_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/b225743a5dd9/13321_2024_896_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/61d5e291d26b/13321_2024_896_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/da0d419140e2/13321_2024_896_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/b9b164b952d8/13321_2024_896_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/cf6faf0f8145/13321_2024_896_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/08c0d30bd315/13321_2024_896_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/85e7c21d7fd0/13321_2024_896_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/cad7ecdf3730/13321_2024_896_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/fe2926566a3c/13321_2024_896_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/7afe64fe94d1/13321_2024_896_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/7a12e50125f0/13321_2024_896_Figb_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/85e558d51493/13321_2024_896_Figc_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/d154b784cf80/13321_2024_896_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/317f4c58ea3a/13321_2024_896_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/6ff8987be2d5/13321_2024_896_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/4aaf5c394db6/13321_2024_896_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/fdf046afbf8e/13321_2024_896_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/1679136b3c0a/13321_2024_896_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/1748c3be25d6/13321_2024_896_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/faabe1f95ba1/13321_2024_896_Fig17_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/97be7afe102c/13321_2024_896_Fig18_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/272763caf910/13321_2024_896_Fig19_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bf/11331744/570294475096/13321_2024_896_Fig20_HTML.jpg

相似文献

1
Automatic molecular fragmentation by evolutionary optimisation.通过进化优化实现自动分子碎片化
J Cheminform. 2024 Aug 19;16(1):102. doi: 10.1186/s13321-024-00896-z.
2
Energy-Screened Many-Body Expansion: A Practical Yet Accurate Fragmentation Method for Quantum Chemistry.能量筛选多体展开:量子化学的实用且精确的碎片方法。
J Chem Theory Comput. 2020 Jan 14;16(1):475-487. doi: 10.1021/acs.jctc.9b01095. Epub 2019 Dec 10.
3
Accurate quantum-chemical fragmentation calculations for ion-water clusters with the density-based many-body expansion.基于密度泛函多体展开的离子-水分子团簇的精确量子化学碎裂计算。
Phys Chem Chem Phys. 2022 Dec 21;25(1):736-748. doi: 10.1039/d2cp04539g.
4
Understanding the many-body expansion for large systems. II. Accuracy considerations.理解大系统的多体展开。II. 精度考量。
J Chem Phys. 2016 Apr 28;144(16):164105. doi: 10.1063/1.4947087.
5
Linear-Scaling Systematic Molecular Fragmentation Approach for Perturbation Theory and Coupled-Cluster Methods.线性标度系统分子碎片方法用于微扰理论和耦合簇方法。
J Chem Theory Comput. 2022 Sep 13;18(9):5349-5359. doi: 10.1021/acs.jctc.2c00587. Epub 2022 Aug 16.
6
How accurate can a force field become? A polarizable multipole model combined with fragment-wise quantum-mechanical calculations.力场能有多精确?一种结合片段级量子力学计算的可极化多极模型。
J Phys Chem A. 2009 Jan 22;113(3):617-27. doi: 10.1021/jp8073514.
7
A unified and flexible formulation of molecular fragmentation schemes.一种统一而灵活的分子断裂方案公式化表述。
J Chem Phys. 2021 Oct 28;155(16):164105. doi: 10.1063/5.0059598.
8
Graph-Theory-Based Molecular Fragmentation for Efficient and Accurate Potential Surface Calculations in Multiple Dimensions.基于图论的分子碎片化方法用于高效准确的多维势能面计算
J Chem Theory Comput. 2021 Nov 9;17(11):6671-6690. doi: 10.1021/acs.jctc.1c00065. Epub 2021 Oct 8.
9
Fragment-based models for dissociation of strong acids in water: Electrostatic embedding minimizes the dependence on the fragmentation schemes.基于片段的强酸在水中离解模型:静电嵌入可最大程度减少对片段划分方案的依赖。
J Chem Phys. 2023 Sep 28;159(12). doi: 10.1063/5.0164089.
10
Graph Theoretic Molecular Fragmentation for Multidimensional Potential Energy Surfaces Yield an Adaptive and General Transfer Machine Learning Protocol.图论分子碎裂方法用于多维势能面,可得到自适应的通用迁移机器学习协议。
J Chem Theory Comput. 2022 Sep 13;18(9):5125-5144. doi: 10.1021/acs.jctc.1c01241. Epub 2022 Aug 22.

本文引用的文献

1
High-Performance Multi-GPU Analytic RI-MP2 Energy Gradients.高性能多图形处理器分析性RI-MP2能量梯度
J Chem Theory Comput. 2024 Mar 26;20(6):2505-2519. doi: 10.1021/acs.jctc.3c01424. Epub 2024 Mar 8.
2
Toward an extreme-scale electronic structure system.迈向极大规模电子结构系统。
J Chem Phys. 2023 Jul 28;159(4). doi: 10.1063/5.0156399.
3
Strategies for the Development of Conjugated Polymer Molecular Dynamics Force Fields Validated with Neutron and X-ray Scattering.通过中子和X射线散射验证的共轭聚合物分子动力学力场的开发策略
ACS Polym Au. 2021 Nov 1;1(3):134-152. doi: 10.1021/acspolymersau.1c00027. eCollection 2021 Dec 8.
4
Degradation of pesticides diazinon and diazoxon by phosphotriesterase: insight into divergent mechanisms from QM/MM and MD simulations.利用量子力学/分子力学和分子动力学模拟研究磷酸三酯酶对杀虫剂二嗪农和二嗪酮的降解:不同机制的深入了解。
Phys Chem Chem Phys. 2022 Jan 4;24(2):687-696. doi: 10.1039/d1cp05034f.
5
Accelerating the Convergence of Self-Consistent Field Calculations Using the Many-Body Expansion.利用多体展开加速自洽场计算的收敛
J Chem Theory Comput. 2022 Jan 11;18(1):179-191. doi: 10.1021/acs.jctc.1c00765. Epub 2021 Dec 9.
6
Faster Self-Consistent Field (SCF) Calculations on GPU Clusters.在GPU集群上更快的自洽场(SCF)计算
J Chem Theory Comput. 2021 Dec 14;17(12):7486-7503. doi: 10.1021/acs.jctc.1c00720. Epub 2021 Nov 15.
7
Density-Based Many-Body Expansion as an Efficient and Accurate Quantum-Chemical Fragmentation Method: Application to Water Clusters.基于密度的多体展开作为一种高效且准确的量子化学碎片化方法:应用于水团簇
J Chem Theory Comput. 2021 Jul 13;17(7):4144-4156. doi: 10.1021/acs.jctc.1c00340. Epub 2021 Jul 1.
8
A look inside the black box: Using graph-theoretical descriptors to interpret a Continuous-Filter Convolutional Neural Network (CF-CNN) trained on the global and local minimum energy structures of neutral water clusters.深入黑箱:使用图论描述符解读在中性水团簇的全局和局部最低能量结构上训练的连续滤波器卷积神经网络(CF-CNN)。
J Chem Phys. 2020 Jul 14;153(2):024302. doi: 10.1063/5.0009933.
9
Recent developments in the general atomic and molecular electronic structure system.通用原子和分子电子结构系统的最新进展。
J Chem Phys. 2020 Apr 21;152(15):154102. doi: 10.1063/5.0005188.
10
Fantasy versus reality in fragment-based quantum chemistry.基于片段的量子化学中的幻想与现实。
J Chem Phys. 2019 Nov 7;151(17):170901. doi: 10.1063/1.5126216.