• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PFP-WGAN:通过生成对抗网络发现与基因本体论术语相关性进行蛋白质功能预测。

PFP-WGAN: Protein function prediction by discovering Gene Ontology term correlations with generative adversarial networks.

机构信息

Department of Computer Engineering, Sharif University of Technology, Tehran, Iran.

Department of Mechanical Engineering, University of California Berkeley, Berkeley, California, United States of America.

出版信息

PLoS One. 2021 Feb 25;16(2):e0244430. doi: 10.1371/journal.pone.0244430. eCollection 2021.

DOI:10.1371/journal.pone.0244430
PMID:33630862
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7906332/
Abstract

Understanding the functionality of proteins has emerged as a critical problem in recent years due to significant roles of these macro-molecules in biological mechanisms. However, in-laboratory techniques for protein function prediction are not as efficient as methods developed and processed for protein sequencing. While more than 70 million protein sequences are available today, only the functionality of around one percent of them are known. These facts have encouraged researchers to develop computational methods to infer protein functionalities from their sequences. Gene Ontology is the most well-known database for protein functions which has a hierarchical structure, where deeper terms are more determinative and specific. However, the lack of experimentally approved annotations for these specific terms limits the performance of computational methods applied on them. In this work, we propose a method to improve protein function prediction using their sequences by deeply extracting relationships between Gene Ontology terms. To this end, we construct a conditional generative adversarial network which helps to effectively discover and incorporate term correlations in the annotation process. In addition to the baseline algorithms, we compare our method with two recently proposed deep techniques that attempt to utilize Gene Ontology term correlations. Our results confirm the superiority of the proposed method compared to the previous works. Moreover, we demonstrate how our model can effectively help to assign more specific terms to sequences.

摘要

近年来,由于这些大分子在生物机制中的重要作用,理解蛋白质的功能已成为一个关键问题。然而,蛋白质功能预测的实验室技术并不像蛋白质测序那样高效。尽管目前有超过 7000 万个蛋白质序列,但已知的只有大约百分之一的蛋白质具有功能。这些事实促使研究人员开发计算方法,从蛋白质序列中推断其功能。

GO 是最著名的蛋白质功能数据库,具有层次结构,较深的术语更具决定性和特异性。然而,由于缺乏针对这些特定术语的实验验证注释,限制了应用于这些术语的计算方法的性能。在这项工作中,我们提出了一种利用蛋白质序列进行功能预测的方法,通过深入挖掘基因本体术语之间的关系来实现。为此,我们构建了一个条件生成对抗网络,以帮助在注释过程中有效地发现和合并术语相关性。

除了基准算法之外,我们还将我们的方法与两种最近提出的试图利用基因本体论术语相关性的深度技术进行了比较。我们的结果证实了与以前的工作相比,所提出的方法具有优越性。此外,我们还证明了我们的模型如何有效地帮助为序列分配更具体的术语。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcf/7906332/a27a4b6526a7/pone.0244430.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcf/7906332/27fe2941a0e4/pone.0244430.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcf/7906332/067d32885e83/pone.0244430.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcf/7906332/27b3c1410cd4/pone.0244430.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcf/7906332/66ca28c9739f/pone.0244430.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcf/7906332/e15ca025cc6e/pone.0244430.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcf/7906332/a27a4b6526a7/pone.0244430.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcf/7906332/27fe2941a0e4/pone.0244430.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcf/7906332/067d32885e83/pone.0244430.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcf/7906332/27b3c1410cd4/pone.0244430.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcf/7906332/66ca28c9739f/pone.0244430.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcf/7906332/e15ca025cc6e/pone.0244430.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcf/7906332/a27a4b6526a7/pone.0244430.g006.jpg

相似文献

1
PFP-WGAN: Protein function prediction by discovering Gene Ontology term correlations with generative adversarial networks.PFP-WGAN:通过生成对抗网络发现与基因本体论术语相关性进行蛋白质功能预测。
PLoS One. 2021 Feb 25;16(2):e0244430. doi: 10.1371/journal.pone.0244430. eCollection 2021.
2
PFP/ESG: automated protein function prediction servers enhanced with Gene Ontology visualization tool.PFP/ESG:通过基因本体可视化工具增强的自动化蛋白质功能预测服务器。
Bioinformatics. 2015 Jan 15;31(2):271-2. doi: 10.1093/bioinformatics/btu646. Epub 2014 Oct 1.
3
PFP: Automated prediction of gene ontology functional annotations with confidence scores using protein sequence data.PFP:利用蛋白质序列数据自动预测具有置信度分数的基因本体功能注释。
Proteins. 2009 Feb 15;74(3):566-82. doi: 10.1002/prot.22172.
4
GOLabeler: improving sequence-based large-scale protein function prediction by learning to rank.GOLabeler:通过学习排序提高基于序列的大规模蛋白质功能预测。
Bioinformatics. 2018 Jul 15;34(14):2465-2473. doi: 10.1093/bioinformatics/bty130.
5
Exploiting ontology graph for predicting sparsely annotated gene function.利用本体图预测注释稀疏的基因功能。
Bioinformatics. 2015 Jun 15;31(12):i357-64. doi: 10.1093/bioinformatics/btv260.
6
Enhanced automated function prediction using distantly related sequences and contextual association by PFP.通过PFP使用远缘相关序列和上下文关联增强自动功能预测。
Protein Sci. 2006 Jun;15(6):1550-6. doi: 10.1110/ps.062153506. Epub 2006 May 2.
7
8
Integration of molecular network data reconstructs Gene Ontology.分子网络数据的整合重建了基因本体论。
Bioinformatics. 2014 Sep 1;30(17):i594-600. doi: 10.1093/bioinformatics/btu470.
9
DeepGO: predicting protein functions from sequence and interactions using a deep ontology-aware classifier.DeepGO:使用深度本体感知分类器从序列和相互作用预测蛋白质功能。
Bioinformatics. 2018 Feb 15;34(4):660-668. doi: 10.1093/bioinformatics/btx624.
10
DeepSS2GO: protein function prediction from secondary structure.DeepSS2GO:基于二级结构的蛋白质功能预测
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae196.

引用本文的文献

1
Generative models for protein sequence modeling: recent advances and future directions.蛋白质序列建模的生成模型:最新进展和未来方向。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad358.
2
Deep learning on graphs for multi-omics classification of COPD.基于图的深度学习在 COPD 多组学分类中的应用。
PLoS One. 2023 Apr 21;18(4):e0284563. doi: 10.1371/journal.pone.0284563. eCollection 2023.
3
An Augmented High-Dimensional Graphical Lasso Method to Incorporate Prior Biological Knowledge for Global Network Learning.一种用于整合先验生物学知识以进行全局网络学习的增强型高维图形套索方法。

本文引用的文献

1
Synthetic promoter design in Escherichia coli based on a deep generative network.基于深度生成网络的大肠杆菌合成启动子设计
Nucleic Acids Res. 2020 Jul 9;48(12):6403-6412. doi: 10.1093/nar/gkaa325.
2
GO2Vec: transforming GO terms and proteins to vector representations via graph embeddings.GO2Vec:通过图嵌入将 GO 术语和蛋白质转换为向量表示。
BMC Genomics. 2019 Dec 24;20(Suppl 9):918. doi: 10.1186/s12864-019-6272-2.
3
DeepFunc: A Deep Learning Framework for Accurate Prediction of Protein Functions from Protein Sequences and Interactions.
Front Genet. 2022 Jan 27;12:760299. doi: 10.3389/fgene.2021.760299. eCollection 2021.
4
Protein function prediction with gene ontology: from traditional to deep learning models.利用基因本体进行蛋白质功能预测:从传统模型到深度学习模型
PeerJ. 2021 Aug 24;9:e12019. doi: 10.7717/peerj.12019. eCollection 2021.
DeepFunc:一种从蛋白质序列和相互作用中准确预测蛋白质功能的深度学习框架。
Proteomics. 2019 Jun;19(12):e1900019. doi: 10.1002/pmic.201900019. Epub 2019 May 27.
4
Improving protein function prediction using protein sequence and GO-term similarities.利用蛋白质序列和 GO 术语相似性提高蛋白质功能预测。
Bioinformatics. 2019 Apr 1;35(7):1116-1124. doi: 10.1093/bioinformatics/bty751.
5
Onto2Vec: joint vector-based representation of biological entities and their ontology-based annotations.Onto2Vec:基于向量的生物实体联合表示及其基于本体论的标注。
Bioinformatics. 2018 Jul 1;34(13):i52-i60. doi: 10.1093/bioinformatics/bty259.
6
Predicting human protein function with multi-task deep neural networks.用多任务深度神经网络预测人类蛋白质功能。
PLoS One. 2018 Jun 11;13(6):e0198216. doi: 10.1371/journal.pone.0198216. eCollection 2018.
7
deepNF: deep network fusion for protein function prediction.深度网络融合的蛋白质功能预测。
Bioinformatics. 2018 Nov 15;34(22):3873-3881. doi: 10.1093/bioinformatics/bty440.
8
MetaGO: Predicting Gene Ontology of Non-homologous Proteins Through Low-Resolution Protein Structure Prediction and Protein-Protein Network Mapping.MetaGO:通过低分辨率蛋白质结构预测和蛋白质-蛋白质网络映射预测非同源蛋白质的基因本体论。
J Mol Biol. 2018 Jul 20;430(15):2256-2265. doi: 10.1016/j.jmb.2018.03.004. Epub 2018 Mar 10.
9
GOLabeler: improving sequence-based large-scale protein function prediction by learning to rank.GOLabeler:通过学习排序提高基于序列的大规模蛋白质功能预测。
Bioinformatics. 2018 Jul 15;34(14):2465-2473. doi: 10.1093/bioinformatics/bty130.
10
UniProt: the universal protein knowledgebase.通用蛋白质知识库:UniProt
Nucleic Acids Res. 2018 Mar 16;46(5):2699. doi: 10.1093/nar/gky092.