Suppr
超能文献

人工智能方法在“大数据”时代预测 hERG 通道抑制的批判性评估。

Critical Assessment of Artificial Intelligence Methods for Prediction of hERG Channel Inhibition in the "Big Data" Era.

机构信息

National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Drive, Rockville, Maryland 20850, United States.

出版信息

J Chem Inf Model. 2020 Dec 28;60(12):6007-6019. doi: 10.1021/acs.jcim.0c00884. Epub 2020 Dec 1.

DOI:10.1021/acs.jcim.0c00884

PMID:33259212

Abstract

The rise of novel artificial intelligence (AI) methods necessitates their benchmarking against classical machine learning for a typical drug-discovery project. Inhibition of the potassium ion channel, whose alpha subunit is encoded by the human -related gene (hERG), leads to a prolonged QT interval of the cardiac action potential and is a significant safety pharmacology target for the development of new medicines. Several computational approaches have been employed to develop prediction models for the assessment of hERG liabilities of small molecules including recent work using deep learning methods. Here, we perform a comprehensive comparison of hERG effect prediction models based on classical approaches (random forests and gradient boosting) and modern AI methods [deep neural networks (DNNs) and recurrent neural networks (RNNs)]. The training set (∼9000 compounds) was compiled by integrating the hERG bioactivity data from the ChEMBL database with experimental data generated from an , high-throughput thallium flux assay. We utilized different molecular descriptors including the latent descriptors, which are real-value continuous vectors derived from chemical autoencoders trained on a large chemical space (>1.5 million compounds). The models were prospectively validated on ∼840 compounds screened in the same thallium flux assay. The best results were obtained with the XGBoost method and RDKit descriptors. The comparison of models based only on latent descriptors revealed that the DNNs performed significantly better than the classical methods. The RNNs that operate on SMILES provided the highest model sensitivity. The best models were merged into a consensus model that offered superior performance compared to reference models from academic and commercial domains. Furthermore, we shed light on the potential of AI methods to exploit the big data in chemistry and generate novel chemical representations useful in predictive modeling and tailoring a new chemical space.

摘要

新型人工智能 (AI) 方法的兴起需要将其与经典机器学习方法进行基准测试，以用于典型的药物发现项目。抑制钾离子通道，其 alpha 亚基由人类相关基因 (hERG) 编码，会导致心脏动作电位的 QT 间期延长，是开发新药的重要安全药理学靶标。已经采用了几种计算方法来开发用于评估小分子的 hERG 负债的预测模型，包括最近使用深度学习方法的工作。在这里，我们对基于经典方法（随机森林和梯度提升）和现代 AI 方法（深度神经网络 (DNN) 和递归神经网络 (RNN)）的 hERG 效应预测模型进行了全面比较。训练集（约 9000 种化合物）通过将来自 ChEMBL 数据库的 hERG 生物活性数据与通过高通量铊通量测定法生成的实验数据相结合来编译。我们利用了不同的分子描述符，包括潜在描述符，这是从在 >150 万种化合物的大型化学空间上训练的化学自动编码器中得出的实值连续向量。模型在相同的铊通量测定法中筛选出的约 840 种化合物上进行了前瞻性验证。使用 XGBoost 方法和 RDKit 描述符获得了最佳结果。仅基于潜在描述符的模型比较表明，DNN 比经典方法表现更好。在 SMILES 上运行的 RNN 提供了最高的模型灵敏度。将最佳模型合并到共识模型中，与学术和商业领域的参考模型相比，该模型具有卓越的性能。此外，我们还探讨了 AI 方法在利用化学大数据并生成有用的预测建模和定制新化学空间的新型化学表示形式方面的潜力。

相似文献

Critical Assessment of Artificial Intelligence Methods for Prediction of hERG Channel Inhibition in the "Big Data" Era.

J Chem Inf Model. 2020 Dec 28;60(12):6007-6019. doi: 10.1021/acs.jcim.0c00884. Epub 2020 Dec 1.

A pharmacologically validated, high-capacity, functional thallium flux assay for the human Ether-à-go-go related gene potassium channel.

Assay Drug Dev Technol. 2010 Dec;8(6):714-26. doi: 10.1089/adt.2010.0351.

A comprehensive support vector machine binary hERG classification model based on extensive but biased end point hERG data sets.

Chem Res Toxicol. 2011 Jun 20;24(6):934-49. doi: 10.1021/tx200099j. Epub 2011 May 6.

The Catch-22 of Predicting hERG Blockade Using Publicly Accessible Bioactivity Data.

J Chem Inf Model. 2018 Jun 25;58(6):1224-1233. doi: 10.1021/acs.jcim.8b00150. Epub 2018 May 30.

Indexing molecules for their hERG liability.

Eur J Med Chem. 2013 Jul;65:304-14. doi: 10.1016/j.ejmech.2013.04.059. Epub 2013 May 13.

High-Throughput Chemical Screening and Structure-Based Models to Predict hERG Inhibition.

Biology (Basel). 2022 Jan 28;11(2):209. doi: 10.3390/biology11020209.

Construction of an integrated database for hERG blocking small molecules.

PLoS One. 2018 Jul 6;13(7):e0199348. doi: 10.1371/journal.pone.0199348. eCollection 2018.

hERG classification model based on a combination of support vector machine method and GRIND descriptors.

Mol Pharm. 2008 Jan-Feb;5(1):117-27. doi: 10.1021/mp700124e. Epub 2008 Jan 16.

Mol Divers. 2009 Aug;13(3):321-36. doi: 10.1007/s11030-009-9117-0. Epub 2009 Feb 14.

Modeling of the hERG K+ Channel Blockage Using Online Chemical Database and Modeling Environment (OCHEM).

Mol Inform. 2017 Dec;36(12). doi: 10.1002/minf.201700074. Epub 2017 Aug 30.

引用本文的文献

Mixture of experts for multitask learning in cardiotoxicity assessment.

J Cheminform. 2025 Aug 29;17(1):135. doi: 10.1186/s13321-025-01072-7.

HERGAI: an artificial intelligence tool for structure-based prediction of hERG inhibitors.

J Cheminform. 2025 Jul 24;17(1):110. doi: 10.1186/s13321-025-01063-8.

hERG toxicity prediction in early drug discovery using extreme gradient boosting and isometric stratified ensemble mapping.

Sci Rep. 2025 May 4;15(1):15585. doi: 10.1038/s41598-025-99766-3.

GraphDeep-hERG: Graph Neural Network PharmacoAnalytics for Assessing hERG-Related Cardiotoxicity.

Pharm Res. 2025 Apr;42(4):579-591. doi: 10.1007/s11095-025-03848-w. Epub 2025 Mar 26.

CardioGenAI: a machine learning-based framework for re-engineering drugs for reduced hERG liability.

J Cheminform. 2025 Mar 5;17(1):30. doi: 10.1186/s13321-025-00976-8.

One size does not fit all: revising traditional paradigms for assessing accuracy of QSAR models used for virtual screening.

J Cheminform. 2025 Jan 16;17(1):7. doi: 10.1186/s13321-025-00948-y.

AttenhERG: a reliable and interpretable graph neural network framework for predicting hERG channel blockers.

J Cheminform. 2024 Dec 23;16(1):143. doi: 10.1186/s13321-024-00940-y.

Approaching Pharmacological Space: Events and Components.

Methods Mol Biol. 2025;2834:151-169. doi: 10.1007/978-1-0716-4003-6_7.

Reducing overconfident errors in molecular property classification using Posterior Network.

Patterns (N Y). 2024 May 8;5(6):100991. doi: 10.1016/j.patter.2024.100991. eCollection 2024 Jun 14.

Enhancing hERG Risk Assessment with Interpretable Classificatory and Regression Models.

Chem Res Toxicol. 2024 Jun 17;37(6):910-922. doi: 10.1021/acs.chemrestox.3c00400. Epub 2024 May 23.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

人工智能方法在“大数据”时代预测 hERG 通道抑制的批判性评估。

Critical Assessment of Artificial Intelligence Methods for Prediction of hERG Channel Inhibition in the "Big Data" Era.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译