• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ContactPFP:利用预测的接触信息进行蛋白质功能预测。

ContactPFP: Protein function prediction using predicted contact information.

作者信息

Kagaya Yuki, Flannery Sean T, Jain Aashish, Kihara Daisuke

机构信息

Department of Biological Sciences, Purdue University, West Lafayette, IN, US.

Department of Computer Science, Purdue University, West Lafayette, IN, US.

出版信息

Front Bioinform. 2022 Jun;2. doi: 10.3389/fbinf.2022.896295. Epub 2022 Jun 2.

DOI:10.3389/fbinf.2022.896295
PMID:35875419
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9302406/
Abstract

Computational function prediction is one of the most important problems in bioinformatics as elucidating the function of genes is a central task in molecular biology and genomics. Most of the existing function prediction methods use protein sequences as the primary source of input information because the sequence is the most available information for query proteins. There are attempts to consider other attributes of query proteins. Among these attributes, the three-dimensional (3D) structure of proteins is known to be very useful in identifying the evolutionary relationship of proteins, from which functional similarity can be inferred. Here, we report a novel protein function prediction method, ContactPFP, which uses predicted residue-residue contact maps as input structural features of query proteins. Although 3D structure information is known to be useful, it has not been routinely used in function prediction because the 3D structure is not experimentally determined for many proteins. In ContactPFP, we overcome this limitation by using residue-residue contact prediction, which has become increasingly accurate due to rapid development in the protein structure prediction field. ContactPFP takes a query protein sequence as input and uses predicted residue-residue contact as a proxy for the 3D protein structure. To characterize how predicted contacts contribute to function prediction accuracy, we compared the performance of ContactPFP with several well-established sequence-based function prediction methods. The comparative study revealed the advantages and weaknesses of ContactPFP compared to contemporary sequence-based methods. There were many cases where it showed higher prediction accuracy. We examined factors that affected the accuracy of ContactPFP using several illustrative cases that highlight the strength of our method.

摘要

计算功能预测是生物信息学中最重要的问题之一,因为阐明基因功能是分子生物学和基因组学的核心任务。大多数现有的功能预测方法将蛋白质序列作为主要的输入信息来源,因为序列是查询蛋白质最容易获得的信息。也有人尝试考虑查询蛋白质的其他属性。在这些属性中,蛋白质的三维(3D)结构在识别蛋白质的进化关系方面非常有用,从中可以推断出功能相似性。在这里,我们报告了一种新的蛋白质功能预测方法ContactPFP,它使用预测的残基-残基接触图作为查询蛋白质的输入结构特征。虽然已知3D结构信息很有用,但它尚未在功能预测中常规使用,因为许多蛋白质的3D结构不是通过实验确定的。在ContactPFP中,我们通过使用残基-残基接触预测克服了这一限制,由于蛋白质结构预测领域的快速发展,残基-残基接触预测变得越来越准确。ContactPFP以查询蛋白质序列作为输入,并使用预测的残基-残基接触作为3D蛋白质结构的替代。为了表征预测的接触如何有助于功能预测准确性,我们将ContactPFP的性能与几种成熟的基于序列的功能预测方法进行了比较。比较研究揭示了ContactPFP与当代基于序列的方法相比的优缺点。在许多情况下,它显示出更高的预测准确性。我们使用几个说明性案例来突出我们方法的优势,研究了影响ContactPFP准确性的因素。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ee5/9580906/5c69320696a6/fbinf-02-896295-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ee5/9580906/9aab0a5d456c/fbinf-02-896295-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ee5/9580906/8b4a964c29c7/fbinf-02-896295-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ee5/9580906/548a87a8cf81/fbinf-02-896295-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ee5/9580906/c845ecff2143/fbinf-02-896295-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ee5/9580906/410ca85d1062/fbinf-02-896295-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ee5/9580906/1bdacf13fd80/fbinf-02-896295-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ee5/9580906/58c124e1ccac/fbinf-02-896295-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ee5/9580906/18d2b6a6101a/fbinf-02-896295-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ee5/9580906/5c69320696a6/fbinf-02-896295-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ee5/9580906/9aab0a5d456c/fbinf-02-896295-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ee5/9580906/8b4a964c29c7/fbinf-02-896295-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ee5/9580906/548a87a8cf81/fbinf-02-896295-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ee5/9580906/c845ecff2143/fbinf-02-896295-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ee5/9580906/410ca85d1062/fbinf-02-896295-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ee5/9580906/1bdacf13fd80/fbinf-02-896295-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ee5/9580906/58c124e1ccac/fbinf-02-896295-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ee5/9580906/18d2b6a6101a/fbinf-02-896295-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ee5/9580906/5c69320696a6/fbinf-02-896295-g009.jpg

相似文献

1
ContactPFP: Protein function prediction using predicted contact information.ContactPFP:利用预测的接触信息进行蛋白质功能预测。
Front Bioinform. 2022 Jun;2. doi: 10.3389/fbinf.2022.896295. Epub 2022 Jun 2.
2
3
Improving AlphaFold Predicted Contacts for Alpha-Helical Transmembrane Proteins Using Structural Features.利用结构特征改进针对 α-螺旋跨膜蛋白的 AlphaFold 预测接触。
Int J Mol Sci. 2024 May 11;25(10):5247. doi: 10.3390/ijms25105247.
4
Identification of residue pairing in interacting β-strands from a predicted residue contact map.从预测的残基接触图中鉴定相互作用的β-折叠中的残基对。
BMC Bioinformatics. 2018 Apr 19;19(1):146. doi: 10.1186/s12859-018-2150-1.
5
Ab initio and template-based prediction of multi-class distance maps by two-dimensional recursive neural networks.基于二维递归神经网络的多类别距离图的从头预测和基于模板的预测。
BMC Struct Biol. 2009 Jan 30;9:5. doi: 10.1186/1472-6807-9-5.
6
R2C: improving ab initio residue contact map prediction using dynamic fusion strategy and Gaussian noise filter.R2C:使用动态融合策略和高斯噪声滤波器改进从头开始的残基接触图预测。
Bioinformatics. 2016 Aug 15;32(16):2435-43. doi: 10.1093/bioinformatics/btw181. Epub 2016 Apr 10.
7
Phylo-PFP: improved automated protein function prediction using phylogenetic distance of distantly related sequences.Phylo-PFP:利用远缘序列的系统发育距离改进自动化蛋白质功能预测。
Bioinformatics. 2019 Mar 1;35(5):753-759. doi: 10.1093/bioinformatics/bty704.
8
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.基于超深度学习模型的蛋白质接触图从头精确预测
PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan.
9
Predicted residue-residue contacts can help the scoring of 3D models.预测的残基-残基接触可以帮助 3D 模型的评分。
Proteins. 2010 Jun;78(8):1980-91. doi: 10.1002/prot.22714.
10
Using inferred residue contacts to distinguish between correct and incorrect protein models.利用推断的残基接触来区分正确和错误的蛋白质模型。
Bioinformatics. 2008 Jul 15;24(14):1575-82. doi: 10.1093/bioinformatics/btn248. Epub 2008 May 29.

引用本文的文献

1
Lipid Trafficking in Diverse Bacteria.不同细菌中的脂质转运
Acc Chem Res. 2025 Jan 7;58(1):36-46. doi: 10.1021/acs.accounts.4c00540. Epub 2024 Dec 16.
2
Rapid proteome-wide prediction of lipid-interacting proteins through ligand-guided structural genomics.通过配体引导的结构基因组学对脂质相互作用蛋白进行全蛋白质组快速预测。
bioRxiv. 2024 Jan 30:2024.01.26.577452. doi: 10.1101/2024.01.26.577452.
3
Domain-PFP allows protein function prediction using function-aware domain embedding representations.域-PFP 使用感知功能的域嵌入表示来进行蛋白质功能预测。

本文引用的文献

1
Real-time structure search and structure classification for AlphaFold protein models.实时的 AlphaFold 蛋白质模型结构搜索和结构分类。
Commun Biol. 2022 Apr 5;5(1):316. doi: 10.1038/s42003-022-03261-8.
2
flDPnn: Accurate intrinsic disorder prediction with putative propensities of disorder functions.flDPnn:利用无序功能的假定倾向进行准确的固有无序预测。
Nat Commun. 2021 Jul 21;12(1):4438. doi: 10.1038/s41467-021-24773-7.
3
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Commun Biol. 2023 Oct 31;6(1):1103. doi: 10.1038/s42003-023-05476-9.
4
Cell phenotypes can be predicted from propensities of protein conformations.细胞表型可以从蛋白质构象的倾向性来预测。
Curr Opin Struct Biol. 2023 Dec;83:102722. doi: 10.1016/j.sbi.2023.102722. Epub 2023 Oct 21.
5
Domain-PFP: Protein Function Prediction Using Function-Aware Domain Embedding Representations.领域-蛋白质功能预测:使用功能感知领域嵌入表示进行蛋白质功能预测。
bioRxiv. 2023 Aug 24:2023.08.23.554486. doi: 10.1101/2023.08.23.554486.
6
TEMPROT: protein function annotation using transformers embeddings and homology search.TEMPROT:使用转换器嵌入和同源搜索进行蛋白质功能注释。
BMC Bioinformatics. 2023 Jun 8;24(1):242. doi: 10.1186/s12859-023-05375-0.
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
4
DeepGraphGO: graph neural network for large-scale, multispecies protein function prediction.DeepGraphGO:用于大规模多物种蛋白质功能预测的图神经网络。
Bioinformatics. 2021 Jul 12;37(Suppl_1):i262-i271. doi: 10.1093/bioinformatics/btab270.
5
Structure-based protein function prediction using graph convolutional networks.基于结构的蛋白质功能预测使用图卷积网络。
Nat Commun. 2021 May 26;12(1):3168. doi: 10.1038/s41467-021-23303-9.
6
Analyzing effect of quadruple multiple sequence alignments on deep learning based protein inter-residue distance prediction.分析四重序列比对对基于深度学习的蛋白质残基间距离预测的影响。
Sci Rep. 2021 Apr 7;11(1):7574. doi: 10.1038/s41598-021-87204-z.
7
Protein contact map refinement for improving structure prediction using generative adversarial networks.利用生成对抗网络进行蛋白质接触图优化以改进结构预测
Bioinformatics. 2021 Oct 11;37(19):3168-3174. doi: 10.1093/bioinformatics/btab220.
8
UniProt: the universal protein knowledgebase in 2021.UniProt:2021 年的通用蛋白质知识库。
Nucleic Acids Res. 2021 Jan 8;49(D1):D480-D489. doi: 10.1093/nar/gkaa1100.
9
CATH functional families predict functional sites in proteins.CATH 功能家族可预测蛋白质中的功能位点。
Bioinformatics. 2021 May 23;37(8):1099-1106. doi: 10.1093/bioinformatics/btaa937.
10
Pfam: The protein families database in 2021.Pfam:2021 年的蛋白质家族数据库。
Nucleic Acids Res. 2021 Jan 8;49(D1):D412-D419. doi: 10.1093/nar/gkaa913.