使用传统机器学习和先进深度学习技术进行人醚-à-戈蛋白相关基因（hERG）毒性预测。

hERG-toxicity prediction using traditional machine learning and advanced deep learning techniques.

作者信息

Ylipää Erik, Chavan Swapnil, Bånkestad Maria, Broberg Johan, Glinghammar Björn, Norinder Ulf, Cotgreave Ian

机构信息

Computer Systems Unit, Research Institutes of Sweden RISE, Kista 164 40, Sweden.

Unit of Chemical and Pharmaceutical Toxicology, Research Institutes of Sweden RISE, Södertalje 151 36, Sweden.

出版信息

Curr Res Toxicol. 2023 Sep 1;5:100121. doi: 10.1016/j.crtox.2023.100121. eCollection 2023.

DOI:10.1016/j.crtox.2023.100121

PMID:37701072

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10493507/

Abstract

The rise of artificial intelligence (AI) based algorithms has gained a lot of interest in the pharmaceutical development field. Our study demonstrates utilization of traditional machine learning techniques such as random forest (RF), support-vector machine (SVM), extreme gradient boosting (XGBoost), deep neural network (DNN) as well as advanced deep learning techniques like gated recurrent unit-based DNN (GRU-DNN) and graph neural network (GNN), towards predicting human ether-á-go-go related gene (hERG) derived toxicity. Using the largest hERG dataset derived to date, we have utilized 203,853 and 87,366 compounds for training and testing the models, respectively. The results show that GNN, SVM, XGBoost, DNN, RF, and GRU-DNN all performed well, with validation set AUC ROC scores equals 0.96, 0.95, 0.95, 0.94, 0.94 and 0.94, respectively. The GNN was found to be the top performing model based on predictive power and generalizability. The GNN technique is free of any feature engineering steps while having a minimal human intervention. The GNN approach may serve as a basis for comprehensive automation in predictive toxicology. We believe that the models presented here may serve as a promising tool, both for academic institutes as well as pharmaceutical industries, in predicting hERG-liability in new molecular structures.

摘要

基于人工智能（AI）的算法的兴起在药物研发领域引起了广泛关注。我们的研究展示了传统机器学习技术的应用，如随机森林（RF）、支持向量机（SVM）、极端梯度提升（XGBoost）、深度神经网络（DNN），以及先进的深度学习技术，如基于门控循环单元的深度神经网络（GRU-DNN）和图神经网络（GNN），用于预测人醚 - 去极化相关基因（hERG）衍生的毒性。使用迄今为止获得的最大的hERG数据集，我们分别利用203,853种和87,366种化合物来训练和测试模型。结果表明，GNN、SVM、XGBoost、DNN、RF和GRU-DNN均表现良好，验证集AUC ROC分数分别为0.96、0.95、0.95、0.94、0.94和0.94。基于预测能力和泛化能力，GNN被发现是表现最佳的模型。GNN技术无需任何特征工程步骤，且人工干预最少。GNN方法可作为预测毒理学全面自动化的基础。我们相信，本文提出的模型可能成为学术机构和制药行业预测新分子结构中hERG毒性的有前景的工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0108/10493507/32044df43c20/ga1.jpg

相似文献

hERG-toxicity prediction using traditional machine learning and advanced deep learning techniques.使用传统机器学习和先进深度学习技术进行人醚-à-戈蛋白相关基因（hERG）毒性预测。

Curr Res Toxicol. 2023 Sep 1;5:100121. doi: 10.1016/j.crtox.2023.100121. eCollection 2023.

Do we need different machine learning algorithms for QSAR modeling? A comprehensive assessment of 16 machine learning algorithms on 14 QSAR data sets.我们是否需要不同的机器学习算法来进行定量构效关系建模？对 16 种机器学习算法在 14 个定量构效关系数据集上的综合评估。

Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa321.

Deep Learning-Based Prediction of Drug-Induced Cardiotoxicity.基于深度学习的药物性心脏毒性预测。

J Chem Inf Model. 2019 Mar 25;59(3):1073-1084. doi: 10.1021/acs.jcim.8b00769. Epub 2019 Feb 15.

Developing and comparing deep learning and machine learning algorithms for osteoporosis risk prediction.开发并比较用于骨质疏松症风险预测的深度学习和机器学习算法。

Front Artif Intell. 2024 Jun 11;7:1355287. doi: 10.3389/frai.2024.1355287. eCollection 2024.

Deep Learning and Machine Learning with Grid Search to Predict Later Occurrence of Breast Cancer Metastasis Using Clinical Data.利用临床数据，通过深度学习和带网格搜索的机器学习预测乳腺癌转移的后期发生情况。

J Clin Med. 2022 Sep 29;11(19):5772. doi: 10.3390/jcm11195772.

Could graph neural networks learn better molecular representation for drug discovery? A comparison study of descriptor-based and graph-based models.图神经网络能否为药物发现学习更好的分子表示？基于描述符和基于图的模型的比较研究。

J Cheminform. 2021 Feb 17;13(1):12. doi: 10.1186/s13321-020-00479-8.

Prediction of emergency department revisits among child and youth mental health outpatients using deep learning techniques.使用深度学习技术预测儿童和青少年心理健康门诊患者的急诊科复诊情况。

BMC Med Inform Decis Mak. 2024 Feb 8;24(1):42. doi: 10.1186/s12911-024-02450-1.

Critical Assessment of Artificial Intelligence Methods for Prediction of hERG Channel Inhibition in the "Big Data" Era.人工智能方法在“大数据”时代预测 hERG 通道抑制的批判性评估。

J Chem Inf Model. 2020 Dec 28;60(12):6007-6019. doi: 10.1021/acs.jcim.0c00884. Epub 2020 Dec 1.

Machine learning and deep learning methods that use omics data for metastasis prediction.利用组学数据进行转移预测的机器学习和深度学习方法。

Comput Struct Biotechnol J. 2021 Sep 4;19:5008-5018. doi: 10.1016/j.csbj.2021.09.001. eCollection 2021.

Predicting post-stroke pneumonia using deep neural network approaches.使用深度神经网络方法预测卒中后肺炎。

Int J Med Inform. 2019 Dec;132:103986. doi: 10.1016/j.ijmedinf.2019.103986. Epub 2019 Oct 1.

引用本文的文献

HERGAI: an artificial intelligence tool for structure-based prediction of hERG inhibitors.HERGAI：一种基于结构预测hERG抑制剂的人工智能工具。

J Cheminform. 2025 Jul 24;17(1):110. doi: 10.1186/s13321-025-01063-8.

hERG toxicity prediction in early drug discovery using extreme gradient boosting and isometric stratified ensemble mapping.使用极端梯度提升和等距分层集成映射在早期药物发现中预测人乙醚-a-去极化相关基因（hERG）毒性

Sci Rep. 2025 May 4;15(1):15585. doi: 10.1038/s41598-025-99766-3.

GraphDeep-hERG: Graph Neural Network PharmacoAnalytics for Assessing hERG-Related Cardiotoxicity.GraphDeep-hERG：用于评估与hERG相关心脏毒性的图神经网络药物分析

Pharm Res. 2025 Apr;42(4):579-591. doi: 10.1007/s11095-025-03848-w. Epub 2025 Mar 26.

hERGAT: predicting hERG blockers using graph attention mechanism through atom- and molecule-level interaction analyses.hERGAT：通过原子和分子水平相互作用分析，利用图注意力机制预测hERG阻滞剂。

J Cheminform. 2025 Jan 28;17(1):11. doi: 10.1186/s13321-025-00957-x.

本文引用的文献

CardioTox net: a robust predictor for hERG channel blockade based on deep learning meta-feature ensembles.心脏毒性网络：基于深度学习元特征集成的hERG通道阻断的强大预测器。

J Cheminform. 2021 Aug 16;13(1):60. doi: 10.1186/s13321-021-00541-z.

J Chem Inf Model. 2020 Dec 28;60(12):6007-6019. doi: 10.1021/acs.jcim.0c00884. Epub 2020 Dec 1.

hERG-Att: Self-attention-based deep neural network for predicting hERG blockers.hERG-Att：用于预测hERG阻滞剂的基于自注意力机制的深度神经网络。

Comput Biol Chem. 2020 May 19;87:107286. doi: 10.1016/j.compbiolchem.2020.107286.

The Study on the hERG Blocker Prediction Using Chemical Fingerprint Analysis.基于化学指纹分析的 hERG 阻滞剂预测研究。

Molecules. 2020 Jun 4;25(11):2615. doi: 10.3390/molecules25112615.

Capsule Networks Showed Excellent Performance in the Classification of hERG Blockers/Nonblockers.胶囊网络在人乙醚-a-去极化相关基因（hERG）阻滞剂/非阻滞剂分类中表现出优异性能。

Front Pharmacol. 2020 Jan 28;10:1631. doi: 10.3389/fphar.2019.01631. eCollection 2019.

DeepHIT: a deep learning framework for prediction of hERG-induced cardiotoxicity.DeepHIT：一种用于预测 hERG 诱导性心脏毒性的深度学习框架。

Bioinformatics. 2020 May 1;36(10):3049-3055. doi: 10.1093/bioinformatics/btaa075.

Support Vector Machine model for hERG inhibitory activities based on the integrated hERG database using descriptor selection by NSGA-II.基于集成 hERG 数据库的 NSGA-II 描述符选择的 hERG 抑制活性支持向量机模型。

Sci Rep. 2019 Aug 21;9(1):12220. doi: 10.1038/s41598-019-47536-3.

Prediction of hERG K+ channel blockage using deep neural networks.使用深度神经网络预测 hERG K+ 通道阻断。

Chem Biol Drug Des. 2019 Sep;94(5):1973-1985. doi: 10.1111/cbdd.13600. Epub 2019 Sep 6.

Deep Learning-Based Prediction of Drug-Induced Cardiotoxicity.基于深度学习的药物性心脏毒性预测。

J Chem Inf Model. 2019 Mar 25;59(3):1073-1084. doi: 10.1021/acs.jcim.8b00769. Epub 2019 Feb 15.

Construction of an integrated database for hERG blocking small molecules.构建 hERG 阻断小分子的综合数据库。

PLoS One. 2018 Jul 6;13(7):e0199348. doi: 10.1371/journal.pone.0199348. eCollection 2018.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用传统机器学习和先进深度学习技术进行人醚-à-戈蛋白相关基因（hERG）毒性预测。

hERG-toxicity prediction using traditional machine learning and advanced deep learning techniques.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献