利用深度置信网络在药物发现中区分药物样/非药物样小分子。

Distinguishing drug/non-drug-like small molecules in drug discovery using deep belief network.

机构信息

Laboratory of Systems Biology and Bioinformatics (LBB), Department of Bioinformatics, Kish International Campus, University of Tehran, Kish Island, Iran.

Laboratory of Systems Biology and Bioinformatics (LBB), Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran.

出版信息

Mol Divers. 2021 May;25(2):827-838. doi: 10.1007/s11030-020-10065-7. Epub 2020 Mar 19.

DOI:10.1007/s11030-020-10065-7

PMID:32193758

Abstract

The advent of computational methods for efficient prediction of the druglikeness of small molecules and their ever-burgeoning applications in the fields of medicinal chemistry and drug industries have been a profound scientific development, since only a few amounts of the small molecule libraries were identified as approvable drugs. In this study, a deep belief network was utilized to construct a druglikeness classification model. For this purpose, small molecules and approved drugs from the ZINC database were selected for the unsupervised pre-training step and supervised training step. Various binary fingerprints such as Macc 166 bit, PubChem 881 bit, and Morgan 2048 bit as data features were investigated. The report revealed that using an unsupervised pre-training phase can lead to a good performance model and generalizability capability. Accuracy, precision, and recall of the model for Macc features were 97%, 96%, and 99%, respectively. For more consideration about the generalizability of the model, the external data by expression and investigational drugs in drug banks as drug data and randomly selected data from the ZINC database as non-drug were created. The results confirmed the good performance and generalizability capability of the model. Also, the outcomes depicted that a large proportion of misclassified non-drug small molecules ascertain the bioavailability conditions and could be investigated as a drug in the future. Furthermore, our model attempted to tap potential opportunities as a drug filter in drug discovery.

摘要

小分子药物类药性的高效预测计算方法的出现及其在药物化学和制药行业的不断涌现的应用是一个深远的科学发展，因为只有少数小分子库被确定为可批准药物。在这项研究中，使用深度置信网络来构建药物分类模型。为此，选择了来自 ZINC 数据库的小分子和批准药物进行无监督预训练步骤和监督训练步骤。研究考察了各种二进制指纹，如 Macc 166 位、PubChem 881 位和 Morgan 2048 位作为数据特征。报告显示，使用无监督预训练阶段可以得到性能良好的模型和泛化能力。对于 Macc 特征，模型的准确性、精确性和召回率分别为 97%、96%和 99%。为了更好地考虑模型的泛化能力，我们创建了药物数据库中的表达和研究药物的外部数据作为药物数据，以及从 ZINC 数据库中随机选择的数据作为非药物数据。结果证实了该模型的良好性能和泛化能力。此外，结果表明，大量被错误分类的非药物小分子确定了生物利用度条件，将来可能会被作为药物进行研究。此外，我们的模型试图挖掘药物发现中作为药物筛选器的潜在机会。

相似文献

Distinguishing drug/non-drug-like small molecules in drug discovery using deep belief network.利用深度置信网络在药物发现中区分药物样/非药物样小分子。

Mol Divers. 2021 May;25(2):827-838. doi: 10.1007/s11030-020-10065-7. Epub 2020 Mar 19.

Improved Deep Learning Based Method for Molecular Similarity Searching Using Stack of Deep Belief Networks.基于深度置信网络堆叠的改进深度学习分子相似性搜索方法。

Molecules. 2020 Dec 29;26(1):128. doi: 10.3390/molecules26010128.

Toxic Colors: The Use of Deep Learning for Predicting Toxicity of Compounds Merely from Their Graphic Images.有毒颜色：仅从化合物的图形图像预测其毒性的深度学习应用。

J Chem Inf Model. 2018 Aug 27;58(8):1533-1543. doi: 10.1021/acs.jcim.8b00338. Epub 2018 Aug 15.

Enhancing Retrosynthetic Reaction Prediction with Deep Learning Using Multiscale Reaction Classification.利用多尺度反应分类增强深度学习的逆合成反应预测

J Chem Inf Model. 2019 Feb 25;59(2):673-688. doi: 10.1021/acs.jcim.8b00801. Epub 2019 Feb 1.

Design of Novel Drug-like Molecules Using Informatics Rich Secondary Metabolites Analysis of Indian Medicinal and Aromatic Plants.利用印度药用和芳香植物信息丰富的次生代谢产物分析设计新型类药物分子。

Comb Chem High Throughput Screen. 2020;23(10):1113-1131. doi: 10.2174/1386207323666200606211342.

DrugMetric: quantitative drug-likeness scoring based on chemical space distance.DrugMetric：基于化学空间距离的定量类药性评分。

Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae321.

De Novo Molecule Design by Translating from Reduced Graphs to SMILES.从头设计分子：从简化图到 SMILES 的转换。

J Chem Inf Model. 2019 Mar 25;59(3):1136-1146. doi: 10.1021/acs.jcim.8b00626. Epub 2018 Dec 21.

SWnet: a deep learning model for drug response prediction from cancer genomic signatures and compound chemical structures.SWnet：一种基于癌症基因组特征和化合物化学结构预测药物反应的深度学习模型。

BMC Bioinformatics. 2021 Sep 10;22(1):434. doi: 10.1186/s12859-021-04352-9.

Data Integration Using Advances in Machine Learning in Drug Discovery and Molecular Biology.利用机器学习进展进行药物发现和分子生物学中的数据整合

Methods Mol Biol. 2021;2190:167-184. doi: 10.1007/978-1-0716-0826-5_7.

A Deep Learning-Based Chemical System for QSAR Prediction.基于深度学习的定量构效关系预测化学系统。

IEEE J Biomed Health Inform. 2020 Oct;24(10):3020-3028. doi: 10.1109/JBHI.2020.2977009. Epub 2020 Feb 28.

引用本文的文献

Probing structural requirements for thiazole-based mimetics of sunitinib as potent VEGFR-2 inhibitors.探究基于噻唑的舒尼替尼模拟物作为强效VEGFR-2抑制剂的结构要求。

RSC Med Chem. 2025 Jan 22. doi: 10.1039/d4md00754a.

Drug Discovery in the Age of Artificial Intelligence: Transformative Target-Based Approaches.人工智能时代的药物发现：变革性的基于靶标的方法。

Int J Mol Sci. 2024 Nov 14;25(22):12233. doi: 10.3390/ijms252212233.

Current Trends and Challenges in Drug-Likeness Prediction: Are They Generalizable and Interpretable?药物相似性预测的当前趋势与挑战：它们具有通用性和可解释性吗？

Health Data Sci. 2023 Nov 10;3:0098. doi: 10.34133/hds.0098. eCollection 2023.

MolFilterGAN: a progressively augmented generative adversarial network for triaging AI-designed molecules.MolFilterGAN：一种用于筛选人工智能设计分子的渐进增强生成对抗网络。

J Cheminform. 2023 Apr 8;15(1):42. doi: 10.1186/s13321-023-00711-1.

miDruglikeness: Subdivisional Drug-Likeness Prediction Models Using Active Ensemble Learning Strategies.miDruglikeness：基于主动集成学习策略的细分药物相似性预测模型。

Biomolecules. 2022 Dec 23;13(1):29. doi: 10.3390/biom13010029.

A fuzzy logic-based computational method for the repurposing of drugs against COVID-19.一种基于模糊逻辑的用于新冠病毒药物再利用的计算方法。

Bioimpacts. 2022;12(4):315-324. doi: 10.34172/bi.2021.40. Epub 2021 Aug 10.

Drug-likeness scoring based on unsupervised learning.基于无监督学习的类药性质评分

Chem Sci. 2021 Dec 14;13(2):554-565. doi: 10.1039/d1sc05248a. eCollection 2022 Jan 5.

本文引用的文献

ZINC 15--Ligand Discovery for Everyone.锌15——面向大众的配体发现平台。

J Chem Inf Model. 2015 Nov 23;55(11):2324-37. doi: 10.1021/acs.jcim.5b00559. Epub 2015 Nov 9.

Drug/nondrug classification using Support Vector Machines with various feature selection strategies.使用支持向量机及各种特征选择策略进行药物/非药物分类。

Comput Methods Programs Biomed. 2014 Nov;117(2):51-60. doi: 10.1016/j.cmpb.2014.08.009. Epub 2014 Sep 6.

Molecular fingerprint similarity search in virtual screening.虚拟筛选中的分子指纹相似性搜索。

Methods. 2015 Jan;71:58-63. doi: 10.1016/j.ymeth.2014.08.005. Epub 2014 Aug 15.

DrugLogit: logistic discrimination between drugs and nondrugs including disease-specificity by assigning probabilities based on molecular properties.DrugLogit：根据分子性质分配概率，对药物和非药物进行逻辑判别，包括基于疾病特异性的判别。

J Chem Inf Model. 2012 Aug 27;52(8):2165-80. doi: 10.1021/ci200587h. Epub 2012 Aug 7.

Drug-likeness analysis of traditional Chinese medicines: prediction of drug-likeness using machine learning approaches.中药类药性分析：基于机器学习方法的类药性预测。

Mol Pharm. 2012 Oct 1;9(10):2875-86. doi: 10.1021/mp300198d. Epub 2012 Sep 20.

A large descriptor set and a probabilistic kernel-based classifier significantly improve druglikeness classification.一个大型描述符集和一个基于概率核的分类器显著提高了类药物性分类。

J Chem Inf Model. 2007 Sep-Oct;47(5):1776-86. doi: 10.1021/ci700107y. Epub 2007 Aug 25.

A fast learning algorithm for deep belief nets.一种用于深度信念网络的快速学习算法。

Neural Comput. 2006 Jul;18(7):1527-54. doi: 10.1162/neco.2006.18.7.1527.

Comparison of support vector machine and artificial neural network systems for drug/nondrug classification.支持向量机与人工神经网络系统用于药物/非药物分类的比较。

J Chem Inf Comput Sci. 2003 Nov-Dec;43(6):1882-9. doi: 10.1021/ci0341161.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

利用深度置信网络在药物发现中区分药物样/非药物样小分子。

Distinguishing drug/non-drug-like small molecules in drug discovery using deep belief network.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献