Suppr超能文献

开发一种机器学习模型以识别蛋白质-蛋白质相互作用热点,从而促进药物发现。

Developing a machine learning model to identify protein-protein interaction hotspots to facilitate drug discovery.

作者信息

Nandakumar Rohit, Dinu Valentin

机构信息

Program of Biomedical Informatics, College of Health Solutions, Arizona State University, Tempe, AZ, USA.

出版信息

PeerJ. 2020 Dec 7;8:e10381. doi: 10.7717/peerj.10381. eCollection 2020.

Abstract

Throughout the history of drug discovery, an enzymatic-based approach for identifying new drug molecules has been primarily utilized. Recently, protein-protein interfaces that can be disrupted to identify small molecules that could be viable targets for certain diseases, such as cancer and the human immunodeficiency virus, have been identified. Existing studies computationally identify hotspots on these interfaces, with most models attaining accuracies of ~70%. Many studies do not effectively integrate information relating to amino acid chains and other structural information relating to the complex. Herein, (1) a machine learning model has been created and (2) its ability to integrate multiple features, such as those associated with amino-acid chains, has been evaluated to enhance the ability to predict protein-protein interface hotspots. Virtual drug screening analysis of a set of hotspots determined on the EphB2-ephrinB2 complex has also been performed. The predictive capabilities of this model offer an AUROC of 0.842, sensitivity/recall of 0.833, and specificity of 0.850. Virtual screening of a set of hotspots identified by the machine learning model developed in this study has identified potential medications to treat diseases caused by the overexpression of the EphB2-ephrinB2 complex, including prostate, gastric, colorectal and melanoma cancers which are linked to EphB2 mutations. The efficacy of this model has been demonstrated through its successful ability to predict drug-disease associations previously identified in literature, including cimetidine, idarubicin, pralatrexate for these conditions. In addition, nadolol, a beta blocker, has also been identified in this study to bind to the EphB2-ephrinB2 complex, and the possibility of this drug treating multiple cancers is still relatively unexplored.

摘要

在药物研发的历史进程中,基于酶的方法一直是识别新药分子的主要手段。最近,人们发现了一些蛋白质-蛋白质界面,通过破坏这些界面可以识别出可能成为某些疾病(如癌症和人类免疫缺陷病毒)可行靶点的小分子。现有研究通过计算识别这些界面上的热点区域,大多数模型的准确率约为70%。许多研究未能有效地整合与氨基酸链相关的信息以及与复合物相关的其他结构信息。在此,(1)创建了一个机器学习模型,(2)评估了其整合多种特征(如与氨基酸链相关的特征)的能力,以增强预测蛋白质-蛋白质界面热点的能力。还对在EphB2-ephrinB2复合物上确定的一组热点进行了虚拟药物筛选分析。该模型的预测能力提供了0.842的曲线下面积(AUROC)、0.833的灵敏度/召回率和0.850的特异性。对本研究开发的机器学习模型识别出的一组热点进行虚拟筛选,发现了可治疗由EphB2-ephrinB2复合物过表达引起的疾病的潜在药物,包括与EphB2突变相关的前列腺癌、胃癌、结直肠癌和黑色素瘤。该模型的有效性已通过其成功预测文献中先前确定的药物-疾病关联得到证明,包括西咪替丁、伊达比星、普拉曲沙用于这些病症。此外,在本研究中还发现β受体阻滞剂纳多洛尔可与EphB2-ephrinB2复合物结合,而这种药物治疗多种癌症的可能性仍相对未被探索。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7131/7727375/98a9099f990f/peerj-08-10381-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验