MOST：基于最相似配体的靶点预测方法。

MOST: most-similar ligand based approach to target prediction.

作者信息

Huang Tao, Mi Hong, Lin Cheng-Yuan, Zhao Ling, Zhong Linda L D, Liu Feng-Bin, Zhang Ge, Lu Ai-Ping, Bian Zhao-Xiang

机构信息

Lab of Brain and Gut Research, School of Chinese Medicine, Hong Kong Baptist University, 7 Baptist University Road, Hong Kong, People's Republic of China.

Department of Gastroenterology, the First Affiliated Hospital of Guangzhou University of Chinese Medicine, Guangzhou, 510405, People's Republic of China.

出版信息

BMC Bioinformatics. 2017 Mar 11;18(1):165. doi: 10.1186/s12859-017-1586-z.

DOI:10.1186/s12859-017-1586-z

PMID:28284192

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5346209/

Abstract

BACKGROUND

Many computational approaches have been used for target prediction, including machine learning, reverse docking, bioactivity spectra analysis, and chemical similarity searching. Recent studies have suggested that chemical similarity searching may be driven by the most-similar ligand. However, the extent of bioactivity of most-similar ligands has been oversimplified or even neglected in these studies, and this has impaired the prediction power.

RESULTS

Here we propose the MOst-Similar ligand-based Target inference approach, namely MOST, which uses fingerprint similarity and explicit bioactivity of the most-similar ligands to predict targets of the query compound. Performance of MOST was evaluated by using combinations of different fingerprint schemes, machine learning methods, and bioactivity representations. In sevenfold cross-validation with a benchmark Ki dataset from CHEMBL release 19 containing 61,937 bioactivity data of 173 human targets, MOST achieved high average prediction accuracy (0.95 for pKi ≥ 5, and 0.87 for pKi ≥ 6). Morgan fingerprint was shown to be slightly better than FP2. Logistic Regression and Random Forest methods performed better than Naïve Bayes. In a temporal validation, the Ki dataset from CHEMBL19 were used to train models and predict the bioactivity of newly deposited ligands in CHEMBL20. MOST also performed well with high accuracy (0.90 for pKi ≥ 5, and 0.76 for pKi ≥ 6), when Logistic Regression and Morgan fingerprint were employed. Furthermore, the p values associated with explicit bioactivity were found be a robust index for removing false positive predictions. Implicit bioactivity did not offer this capability. Finally, p values generated with Logistic Regression, Morgan fingerprint and explicit activity were integrated with a false discovery rate (FDR) control procedure to reduce false positives in multiple-target prediction scenario, and the success of this strategy it was demonstrated with a case of fluanisone. In the case of aloe-emodin's laxative effect, MOST predicted that acetylcholinesterase was the mechanism-of-action target; in vivo studies validated this prediction.

CONCLUSIONS

Using the MOST approach can result in highly accurate and robust target prediction. Integrated with a FDR control procedure, MOST provides a reliable framework for multiple-target inference. It has prospective applications in drug repurposing and mechanism-of-action target prediction.

摘要

背景

许多计算方法已被用于靶点预测，包括机器学习、反向对接、生物活性谱分析和化学相似性搜索。最近的研究表明，化学相似性搜索可能由最相似的配体驱动。然而，在这些研究中，最相似配体的生物活性程度被过度简化甚至被忽视，这削弱了预测能力。

结果

在此，我们提出了基于最相似配体的靶点推断方法，即MOST，它利用指纹相似性和最相似配体的明确生物活性来预测查询化合物的靶点。通过使用不同指纹方案、机器学习方法和生物活性表示的组合来评估MOST的性能。在对来自CHEMBL版本19的包含173个人类靶点的61937个生物活性数据的基准Ki数据集进行七折交叉验证时，MOST实现了较高的平均预测准确率（对于pKi≥5为0.95，对于pKi≥6为0.87）。结果表明，摩根指纹略优于FP2。逻辑回归和随机森林方法的表现优于朴素贝叶斯。在一次时间验证中，使用CHEMBL19的Ki数据集训练模型并预测CHEMBL20中新存入配体的生物活性。当采用逻辑回归和摩根指纹时，MOST也表现良好，准确率较高（对于pKi≥5为0.90，对于pKi≥6为0.76）。此外，发现与明确生物活性相关的p值是去除假阳性预测的可靠指标。隐式生物活性不具备此能力。最后，将通过逻辑回归、摩根指纹和明确活性生成的p值与错误发现率（FDR）控制程序相结合，以减少多靶点预测场景中的假阳性，并且以氟胺酮为例证明了该策略的成功。在芦荟大黄素的通便作用案例中，MOST预测乙酰胆碱酯酶是其作用机制靶点；体内研究验证了这一预测。

结论

使用MOST方法可实现高度准确和稳健的靶点预测。与FDR控制程序相结合，MOST为多靶点推断提供了一个可靠的框架。它在药物再利用和作用机制靶点预测方面具有潜在应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8e/5346209/f63e27270229/12859_2017_1586_Fig1_HTML.jpg

相似文献

BMC Bioinformatics. 2017 Mar 11;18(1):165. doi: 10.1186/s12859-017-1586-z.

Training based on ligand efficiency improves prediction of bioactivities of ligands and drug target proteins in a machine learning approach.基于配体效率的训练可以提高机器学习方法中配体和药物靶标蛋白生物活性预测的准确性。

J Chem Inf Model. 2013 Oct 28;53(10):2525-37. doi: 10.1021/ci400240u. Epub 2013 Sep 24.

Prediction of selective estrogen receptor beta agonist using open data and machine learning approach.利用开放数据和机器学习方法预测选择性雌激素受体β激动剂

Drug Des Devel Ther. 2016 Jul 18;10:2323-31. doi: 10.2147/DDDT.S110603. eCollection 2016.

Employing Molecular Conformations for Ligand-Based Virtual Screening with Equivariant Graph Neural Network and Deep Multiple Instance Learning.利用基于分子构象的等价图神经网络和深度多重实例学习进行配体虚拟筛选。

Molecules. 2023 Aug 9;28(16):5982. doi: 10.3390/molecules28165982.

The Development of Target-Specific Machine Learning Models as Scoring Functions for Docking-Based Target Prediction.基于对接的靶标预测中目标特异性机器学习模型作为评分函数的发展。

J Chem Inf Model. 2019 Mar 25;59(3):1238-1252. doi: 10.1021/acs.jcim.8b00773. Epub 2019 Mar 18.

WDL-RF: predicting bioactivities of ligand molecules acting with G protein-coupled receptors by combining weighted deep learning and random forest.WDL-RF：通过结合加权深度学习和随机森林预测与 G 蛋白偶联受体相互作用的配体分子的生物活性。

Bioinformatics. 2018 Jul 1;34(13):2271-2282. doi: 10.1093/bioinformatics/bty070.

Drug repositioning of herbal compounds via a machine-learning approach.基于机器学习的中草药化合物的药物再定位。

BMC Bioinformatics. 2019 May 29;20(Suppl 10):247. doi: 10.1186/s12859-019-2811-8.

DStruBTarget: Integrating Binding Affinity with Structure Similarity for Ligand-Binding Protein Prediction.DStruBTarget：将结合亲和力与结构相似性相结合进行配体结合蛋白预测。

J Chem Inf Model. 2020 Jan 27;60(1):400-409. doi: 10.1021/acs.jcim.9b00717. Epub 2019 Dec 27.

Molecular interaction fingerprint approaches for GPCR drug discovery.用于G蛋白偶联受体（GPCR）药物发现的分子相互作用指纹方法。

Curr Opin Pharmacol. 2016 Oct;30:59-68. doi: 10.1016/j.coph.2016.07.007. Epub 2016 Jul 29.

Recent Advances in the Machine Learning-Based Drug-Target Interaction Prediction.基于机器学习的药物-靶标相互作用预测的最新进展。

Curr Drug Metab. 2019;20(3):194-202. doi: 10.2174/1389200219666180821094047.

引用本文的文献

Optimizing drug design by merging generative AI with a physics-based active learning framework.通过将生成式人工智能与基于物理学的主动学习框架相结合来优化药物设计。

Commun Chem. 2025 Aug 8;8(1):238. doi: 10.1038/s42004-025-01635-7.

Fifteen years of ChEMBL and its role in cheminformatics and drug discovery.ChEMBL的十五年及其在化学信息学和药物发现中的作用。

J Cheminform. 2025 Mar 10;17(1):32. doi: 10.1186/s13321-025-00963-z.

Step Forward Cross Validation for Bioactivity Prediction: Out of Distribution Validation in Drug Discovery.用于生物活性预测的向前交叉验证：药物发现中的分布外验证

bioRxiv. 2024 Jul 4:2024.07.02.601740. doi: 10.1101/2024.07.02.601740.

Prediction of compound-target interaction using several artificial intelligence algorithms and comparison with a consensus-based strategy.使用多种人工智能算法预测化合物-靶点相互作用并与基于共识的策略进行比较。

J Cheminform. 2024 Mar 7;16(1):27. doi: 10.1186/s13321-024-00816-1.

Using Generative Modeling to Endow with Potency Initially Inert Compounds with Good Bioavailability and Low Toxicity.利用生成式建模赋予具有良好生物利用度和低毒性的初始惰性化合物效力。

J Chem Inf Model. 2024 Feb 12;64(3):590-596. doi: 10.1021/acs.jcim.3c01777. Epub 2024 Jan 23.

MIFNN: Molecular Information Feature Extraction and Fusion Deep Neural Network for Screening Potential Drugs.MIFNN：用于筛选潜在药物的分子信息特征提取与融合深度神经网络

Curr Issues Mol Biol. 2022 Nov 13;44(11):5638-5654. doi: 10.3390/cimb44110382.

De Novo Prediction of Drug Targets and Candidates by Chemical Similarity-Guided Network-Based Inference.基于化学相似性引导的网络推理的从头预测药物靶点和候选物。

Int J Mol Sci. 2022 Aug 26;23(17):9666. doi: 10.3390/ijms23179666.

Bioactivity Comparison across Multiple Machine Learning Algorithms Using over 5000 Datasets for Drug Discovery.利用 5000 多个数据集进行药物发现的多种机器学习算法的生物活性比较。

Mol Pharm. 2021 Jan 4;18(1):403-415. doi: 10.1021/acs.molpharmaceut.0c01013. Epub 2020 Dec 16.

A platform for target prediction of phenotypic screening hit molecules.一个用于表型筛选命中分子的靶标预测平台。

J Mol Graph Model. 2020 Mar;95:107485. doi: 10.1016/j.jmgm.2019.107485. Epub 2019 Oct 24.

Computational/in silico methods in drug target and lead prediction.计算/计算方法在药物靶点和先导化合物预测中的应用。

Brief Bioinform. 2020 Sep 25;21(5):1663-1675. doi: 10.1093/bib/bbz103.

本文引用的文献

Global Mapping of Traditional Chinese Medicine into Bioactivity Space and Pathways Annotation Improves Mechanistic Understanding and Discovers Relationships between Therapeutic Action (Sub)classes.将中药全局映射到生物活性空间并进行通路注释可提高对作用机制的理解，并发现治疗作用（亚）类之间的关系。

Evid Based Complement Alternat Med. 2016;2016:2106465. doi: 10.1155/2016/2106465. Epub 2016 Feb 18.

Target prediction utilising negative bioactivity data covering large chemical space.利用涵盖大化学空间的负生物活性数据进行靶点预测。

J Cheminform. 2015 Oct 24;7:51. doi: 10.1186/s13321-015-0098-y. eCollection 2015.

Large-scale chemical similarity networks for target profiling of compounds identified in cell-based chemical screens.用于基于细胞的化学筛选中鉴定出的化合物的靶点分析的大规模化学相似性网络。

PLoS Comput Biol. 2015 Mar 31;11(3):e1004153. doi: 10.1371/journal.pcbi.1004153. eCollection 2015 Mar.

Tools for in silico target fishing.用于计算机虚拟靶点筛选的工具。

Methods. 2015 Jan;71:98-103. doi: 10.1016/j.ymeth.2014.09.006. Epub 2014 Sep 30.

SwissTargetPrediction: a web server for target prediction of bioactive small molecules.瑞士靶点预测：一个用于生物活性小分子靶点预测的网络服务器。

Nucleic Acids Res. 2014 Jul;42(Web Server issue):W32-8. doi: 10.1093/nar/gku293. Epub 2014 May 3.

Synthesis and multitarget biological profiling of a novel family of rhein derivatives as disease-modifying anti-Alzheimer agents.新型大黄素衍生物的合成及多靶点生物活性评价作为治疗阿尔茨海默病的药物。

J Med Chem. 2014 Mar 27;57(6):2549-67. doi: 10.1021/jm401824w. Epub 2014 Mar 10.

Acetylshikonin, a Novel AChE Inhibitor, Inhibits Apoptosis via Upregulation of Heme Oxygenase-1 Expression in SH-SY5Y Cells.乙酰紫草素，一种新型的乙酰胆碱酯酶抑制剂，通过上调 SH-SY5Y 细胞血红素加氧酶-1 的表达抑制细胞凋亡。

Evid Based Complement Alternat Med. 2013;2013:937370. doi: 10.1155/2013/937370. Epub 2013 Nov 5.

Acetylcholinesterase inhibitors: pharmacology and toxicology.乙酰胆碱酯酶抑制剂：药理学和毒理学。

Curr Neuropharmacol. 2013 May;11(3):315-35. doi: 10.2174/1570159X11311030006.

J Med Chem. 2014 Apr 24;57(8):3186-204. doi: 10.1021/jm401411z. Epub 2013 Nov 11.

Shaping the interaction landscape of bioactive molecules.塑造生物活性分子的相互作用景观。

Bioinformatics. 2013 Dec 1;29(23):3073-9. doi: 10.1093/bioinformatics/btt540. Epub 2013 Sep 17.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

MOST：基于最相似配体的靶点预测方法。

MOST: most-similar ligand based approach to target prediction.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献