Suppr超能文献

螺旋编码器:一种专门为A类G蛋白偶联受体设计的复合蛋白相互作用预测模型。

Helix encoder: a compound-protein interaction prediction model specifically designed for class A GPCRs.

作者信息

Yamane Haruki, Ishida Takashi

机构信息

Department of Computer Science, School of Computing, Tokyo Institute of Technology, Tokyo, Japan.

出版信息

Front Bioinform. 2023 May 26;3:1193025. doi: 10.3389/fbinf.2023.1193025. eCollection 2023.

Abstract

Class A G protein-coupled receptors (GPCRs) represent the largest class of GPCRs. They are essential targets of drug discovery and thus various computational approaches have been applied to predict their ligands. However, there are a large number of orphan receptors in class A GPCRs and it is difficult to use a general protein-specific supervised prediction scheme. Therefore, the compound-protein interaction (CPI) prediction approach has been considered one of the most suitable for class A GPCRs. However, the accuracy of CPI prediction is still insufficient. The current CPI prediction model generally employs the whole protein sequence as the input because it is difficult to identify the important regions in general proteins. In contrast, it is well-known that only a few transmembrane helices of class A GPCRs play a critical role in ligand binding. Therefore, using such domain knowledge, the CPI prediction performance could be improved by developing an encoding method that is specifically designed for this family. In this study, we developed a protein sequence encoder called the Helix encoder, which takes only a protein sequence of transmembrane regions of class A GPCRs as input. The performance evaluation showed that the proposed model achieved a higher prediction accuracy compared to a prediction model using the entire protein sequence. Additionally, our analysis indicated that several extracellular loops are also important for the prediction as mentioned in several biological researches.

摘要

A类G蛋白偶联受体(GPCRs)是GPCRs中最大的一类。它们是药物研发的重要靶点,因此已应用各种计算方法来预测其配体。然而,A类GPCRs中有大量孤儿受体,难以使用通用的蛋白质特异性监督预测方案。因此,化合物 - 蛋白质相互作用(CPI)预测方法被认为是最适合A类GPCRs的方法之一。然而,CPI预测的准确性仍然不足。当前的CPI预测模型通常采用整个蛋白质序列作为输入,因为难以在一般蛋白质中识别重要区域。相比之下,众所周知,A类GPCRs中只有少数跨膜螺旋在配体结合中起关键作用。因此,利用此类领域知识,通过开发专门为此家族设计的编码方法,可以提高CPI预测性能。在本研究中,我们开发了一种称为螺旋编码器的蛋白质序列编码器,它仅将A类GPCRs跨膜区域的蛋白质序列作为输入。性能评估表明,与使用整个蛋白质序列的预测模型相比,所提出的模型实现了更高的预测准确性。此外,我们的分析表明,如几项生物学研究所提到的,几个细胞外环对预测也很重要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbd0/10250622/3c75d58c689a/fbinf-03-1193025-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验