间隙-Δ能量，一种键能状态的新指标，有助于预测分子毒性。

Gap-Δenergy, a New Metric of the Bond Energy State, Assisting to Predict Molecular Toxicity.

作者信息

Zhang Senpeng, Zhao Dongyu, Cui Qinghua

机构信息

Department of Biomedical Informatics, State Key Laboratory of Vascular Homeostasis and Remodeling, School of Basic Medical Sciences, Peking University, 38 Xueyuan Rd, Beijing 100191, People's Republic of China.

出版信息

ACS Omega. 2024 Apr 12;9(16):17839-17847. doi: 10.1021/acsomega.3c07682. eCollection 2024 Apr 23.

DOI:10.1021/acsomega.3c07682

PMID:38680329

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11044234/

Abstract

Molecular toxicity is a critical feature of drug development. It is thus very important to develop computational models to evaluate the toxicity of small molecules. The accuracy of toxicity prediction largely depends on the quality of molecular representation; however, current methods for this purpose do not address this issue well. Here, we introduce a new metric, gap-Δenergy, which is designed to quantify the intermolecular bond energy difference with atom distance. We next find significant variations in the gap-Δenergy distribution among different types of molecules. Moreover, we show that this metric is able to distinguish the toxic small molecules. We collected data sets of toxic and exogenous small molecules and presented a novel index, namely, global toxicity, to evaluate the overall toxicity of molecules. Based on molecular descriptors and the proposed gap-Δenergy metric, we further constructed machine learning models that were trained with 7816 small molecules. The XGBoost-based model achieved the best performance with an AUC score of 0.965 and an F1 score of 0.849 on the test set (1954 small molecules), which outperformed the model that did not use gap-Δenergy features, with a sensitivity score increase of 3.2%.

摘要

分子毒性是药物研发的一个关键特征。因此，开发计算模型来评估小分子的毒性非常重要。毒性预测的准确性在很大程度上取决于分子表示的质量；然而，目前用于此目的的方法并不能很好地解决这个问题。在这里，我们引入了一种新的指标，即间隙-Δ能量，它旨在量化分子间键能随原子距离的差异。接下来，我们发现不同类型分子之间的间隙-Δ能量分布存在显著差异。此外，我们表明该指标能够区分有毒的小分子。我们收集了有毒和外源性小分子的数据集，并提出了一个新的指标，即全局毒性，以评估分子的整体毒性。基于分子描述符和提出的间隙-Δ能量指标，我们进一步构建了机器学习模型，该模型使用7816个小分子进行训练。基于XGBoost的模型在测试集（1954个小分子）上取得了最佳性能，AUC分数为0.965，F1分数为0.849，优于未使用间隙-Δ能量特征的模型，灵敏度分数提高了3.2%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e4f/11044234/e09a70e6abe3/ao3c07682_0001.jpg

相似文献

Gap-Δenergy, a New Metric of the Bond Energy State, Assisting to Predict Molecular Toxicity.间隙-Δ能量，一种键能状态的新指标，有助于预测分子毒性。

ACS Omega. 2024 Apr 12;9(16):17839-17847. doi: 10.1021/acsomega.3c07682. eCollection 2024 Apr 23.

ADMET Evaluation in Drug Discovery. Part 17: Development of Quantitative and Qualitative Prediction Models for Chemical-Induced Respiratory Toxicity.药物研发中的ADMET评估。第17部分：化学诱导呼吸毒性的定量和定性预测模型的开发。

Mol Pharm. 2017 Jul 3;14(7):2407-2421. doi: 10.1021/acs.molpharmaceut.7b00317. Epub 2017 Jun 21.

General Approach to Estimate Error Bars for Quantitative Structure-Activity Relationship Predictions of Molecular Activity.定量构效关系预测分子活性的误差估计的一般方法。

J Chem Inf Model. 2018 Aug 27;58(8):1561-1575. doi: 10.1021/acs.jcim.8b00114. Epub 2018 Jul 17.

Analysis and Comparison of Vector Space and Metric Space Representations in QSAR Modeling.QSAR 建模中向量空间和度量空间表示的分析与比较。

Molecules. 2019 Apr 30;24(9):1698. doi: 10.3390/molecules24091698.

Preoperative prediction of vessel invasion in locally advanced gastric cancer based on computed tomography radiomics and machine learning.基于计算机断层扫描影像组学和机器学习的局部进展期胃癌血管侵犯术前预测

Oncol Lett. 2023 May 22;26(1):293. doi: 10.3892/ol.2023.13879. eCollection 2023 Jul.

Critical assessment of QSAR models of environmental toxicity against Tetrahymena pyriformis: focusing on applicability domain and overfitting by variable selection.针对梨形四膜虫的环境毒性定量构效关系（QSAR）模型的批判性评估：聚焦适用域及变量选择导致的过拟合问题

J Chem Inf Model. 2008 Sep;48(9):1733-46. doi: 10.1021/ci800151m. Epub 2008 Aug 26.

Predicting the impacts of mutations on protein-ligand binding affinity based on molecular dynamics simulations and machine learning methods.基于分子动力学模拟和机器学习方法预测突变对蛋白质-配体结合亲和力的影响。

Comput Struct Biotechnol J. 2020 Feb 20;18:439-454. doi: 10.1016/j.csbj.2020.02.007. eCollection 2020.

A big data approach to the ultra-fast prediction of DFT-calculated bond energies.一种大数据方法，可实现对 DFT 计算键能的超快速预测。

J Cheminform. 2013 Jul 12;5:34. doi: 10.1186/1758-2946-5-34. eCollection 2013.

Machine Learning to Predict Mortality and Critical Events in a Cohort of Patients With COVID-19 in New York City: Model Development and Validation.机器学习预测纽约市新冠肺炎患者队列中的死亡率和危急事件：模型开发与验证

J Med Internet Res. 2020 Nov 6;22(11):e24018. doi: 10.2196/24018.

J Chem Inf Model. 2019 Jan 28;59(1):181-189. doi: 10.1021/acs.jcim.8b00597. Epub 2018 Nov 19.

本文引用的文献

In-Silico Mining of the Toxins Database (T3DB) towards Hunting Prospective Candidates as ABCB1 Inhibitors: Integrated Molecular Docking and Lipid Bilayer-Enhanced Molecular Dynamics Study.针对寻找作为ABCB1抑制剂的潜在候选物对毒素数据库（T3DB）进行计算机模拟挖掘：综合分子对接和脂质双层增强分子动力学研究

Pharmaceuticals (Basel). 2023 Jul 18;16(7):1019. doi: 10.3390/ph16071019.

Machine Learning Toxicity Prediction: Latest Advances by Toxicity End Point.机器学习毒性预测：按毒性终点划分的最新进展

ACS Omega. 2022 Dec 13;7(51):47536-47546. doi: 10.1021/acsomega.2c05693. eCollection 2022 Dec 27.

Predicting Dose-Range Chemical Toxicity using Novel Hybrid Deep Machine-Learning Method.使用新型混合深度机器学习方法预测剂量范围化学毒性

Toxics. 2022 Nov 18;10(11):706. doi: 10.3390/toxics10110706.

HMDB 5.0: the Human Metabolome Database for 2022.HMDB 5.0：2022 年人类代谢组数据库。

Nucleic Acids Res. 2022 Jan 7;50(D1):D622-D631. doi: 10.1093/nar/gkab1062.

A review on machine learning approaches and trends in drug discovery.关于药物发现中机器学习方法与趋势的综述。

Comput Struct Biotechnol J. 2021 Aug 12;19:4538-4558. doi: 10.1016/j.csbj.2021.08.011. eCollection 2021.

Can preclinical drug development help to predict adverse events in clinical trials?临床前药物研发能否帮助预测临床试验中的不良事件？

Drug Discov Today. 2022 Jan;27(1):257-268. doi: 10.1016/j.drudis.2021.08.010. Epub 2021 Aug 29.

Prediction of Molecular Properties Using Molecular Topographic Map.利用分子地形图谱预测分子性质。

Molecules. 2021 Jul 24;26(15):4475. doi: 10.3390/molecules26154475.

Prediction of Drug-Induced Liver Toxicity Using SVM and Optimal Descriptor Sets.基于 SVM 和最优描述符集预测药物性肝毒性。

Int J Mol Sci. 2021 Jul 28;22(15):8073. doi: 10.3390/ijms22158073.

Bexagliflozin for type 2 diabetes: an overview of the data.比格列净治疗 2 型糖尿病：数据概述。

Expert Opin Pharmacother. 2021 Nov;22(16):2095-2103. doi: 10.1080/14656566.2021.1959915. Epub 2021 Jul 29.

Algebraic graph-assisted bidirectional transformers for molecular property prediction.基于代数图辅助的双向转换器在分子性质预测中的应用。

Nat Commun. 2021 Jun 10;12(1):3521. doi: 10.1038/s41467-021-23720-w.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验