Protein function prediction with gene ontology: from traditional to deep learning models.

Authors

Vu Thi Thuy Duong, Jung Jaehee

Affiliation

Department of Information and Communication Engineering, Myongji University, Yongin-si, Gyeonggi-do, South Korea.

Publication information

PeerJ. 2021 Aug 24;9:e12019. doi: 10.7717/peerj.12019. eCollection 2021.

DOI:10.7717/peerj.12019
PMID:34513334
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8395570/
Abstract

Protein function prediction is a crucial part of genome annotation. Prediction methods have recently witnessed rapid development, owing to the emergence of high-throughput sequencing technologies. Among the available databases for identifying protein function terms, Gene Ontology (GO) is an important resource that describes the functional properties of proteins. Researchers are employing various approaches to efficiently predict GO terms. Meanwhile, deep learning, a fast-evolving, data-driven discipline, exhibits impressive potential with respect to assigning GO terms to amino acid sequences. Herein, we reviewed the currently available computational GO annotation methods for proteins, ranging from conventional to deep learning approaches. Further, we selected some suitable predictors from among the reviewed tools and conducted a mini comparison of their performance using a worldwide challenge dataset. Finally, we discussed the remaining major challenges in the field, and emphasized the future directions for protein function prediction with GO.

Figures:
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be0e/8395570/73736839f3b1/peerj-09-12019-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be0e/8395570/06a549d18b63/peerj-09-12019-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be0e/8395570/3d812b12d2e4/peerj-09-12019-g003.jpg

Similar articles

1. Protein function prediction with gene ontology: from traditional to deep learning models.
PeerJ. 2021 Aug 24;9:e12019. doi: 10.7717/peerj.12019. eCollection 2021.
2. FunPredCATH: An ensemble method for predicting protein function using CATH.
Biochim Biophys Acta Proteins Proteom. 2024 Feb 1;1872(2):140985. doi: 10.1016/j.bbapap.2023.140985. Epub 2023 Dec 19.
3. PANDA2: protein function prediction using graph neural networks.
NAR Genom Bioinform. 2022 Feb 2;4(1):lqac004. doi: 10.1093/nargab/lqac004. eCollection 2022 Mar.
5. A bacterial phyla dataset for protein function prediction.
Data Brief. 2019 Dec 18;28:105002. doi: 10.1016/j.dib.2019.105002. eCollection 2020 Feb.
6. Predicting functions of maize proteins using graph convolutional network.
BMC Bioinformatics. 2020 Dec 16;21(Suppl 16):420. doi: 10.1186/s12859-020-03745-6.
7. DeepAdd: Protein function prediction from k-mer embedding and additional features.
Comput Biol Chem. 2020 Dec;89:107379. doi: 10.1016/j.compbiolchem.2020.107379. Epub 2020 Sep 23.
8. DEEPred: Automated Protein Function Prediction with Multi-task Feed-forward Deep Neural Networks.
Sci Rep. 2019 May 14;9(1):7344. doi: 10.1038/s41598-019-43708-3.
9. A deep learning ensemble for function prediction of hypothetical proteins from pathogenic bacterial species.
Comput Biol Chem. 2019 Dec;83:107147. doi: 10.1016/j.compbiolchem.2019.107147. Epub 2019 Oct 19.
10. DeepGO: predicting protein functions from sequence and interactions using a deep ontology-aware classifier.
Bioinformatics. 2018 Feb 15;34(4):660-668. doi: 10.1093/bioinformatics/btx624.

Cited by

1. Whole-genome sequencing and probiotic properties of Lactobacillus helveticus KM7 isolated from the gut of the Chinese honey bee (Apis cerana): A promising exopolysaccharide-producing strain.
BMC Microbiol. 2025 Aug 23;25(1):530. doi: 10.1186/s12866-025-04286-9.
2. GhostBuster: A Deep-Learning-based, Literature-Unbiased Gene Prioritization Tool for Gene Annotation Prediction.
bioRxiv. 2025 Jun 27:2025.06.22.660948. doi: 10.1101/2025.06.22.660948.
3. Prediction and validation of nanowire proteins in G20 using machine learning and feature engineering.
Comput Struct Biotechnol J. 2025 Apr 19;27:1706-1718. doi: 10.1016/j.csbj.2025.04.022. eCollection 2025.
4. Unveiling Prospective Therapeutic Potential of Conserved Hypothetical Plasmodium falciparum Proteins by Using Integrated Proteo Genomic Annotation and In-Silico Therapeutic Discovery Approach.
Protein J. 2025 Apr 11. doi: 10.1007/s10930-025-10265-w.
5. Machine Learning Framework for Conotoxin Class and Molecular Target Prediction.
Toxins (Basel). 2024 Nov 3;16(11):475. doi: 10.3390/toxins16110475.
6. An experimental analysis of graph representation learning for Gene Ontology based protein function prediction.
PeerJ. 2024 Nov 14;12:e18509. doi: 10.7717/peerj.18509. eCollection 2024.
7. An informatic workflow for the enhanced annotation of excretory/secretory proteins of .
Comput Struct Biotechnol J. 2023 Mar 18;21:2696-2704. doi: 10.1016/j.csbj.2023.03.025. eCollection 2023.
8. Computational prediction of disordered binding regions.
Comput Struct Biotechnol J. 2023 Feb 10;21:1487-1497. doi: 10.1016/j.csbj.2023.02.018. eCollection 2023.
9. GOProFormer: A Multi-Modal Transformer Method for Gene Ontology Protein Function Prediction.
Biomolecules. 2022 Nov 18;12(11):1709. doi: 10.3390/biom12111709.
10. Multi-Omics Approaches and Resources for Systems-Level Gene Function Prediction in the Plant Kingdom.
Plants (Basel). 2022 Oct 5;11(19):2614. doi: 10.3390/plants11192614.

References

1. TALE: Transformer-based protein function Annotation with joint sequence-Label Embedding.
Bioinformatics. 2021 Sep 29;37(18):2825-2833. doi: 10.1093/bioinformatics/btab198.
2. PFP-WGAN: Protein function prediction by discovering Gene Ontology term correlations with generative adversarial networks.
PLoS One. 2021 Feb 25;16(2):e0244430. doi: 10.1371/journal.pone.0244430. eCollection 2021.
3. DeepAdd: Protein function prediction from k-mer embedding and additional features.
Comput Biol Chem. 2020 Dec;89:107379. doi: 10.1016/j.compbiolchem.2020.107379. Epub 2020 Sep 23.
4. SDN2GO: An Integrated Deep Learning Model for Protein Function Prediction.
Front Bioeng Biotechnol. 2020 Apr 29;8:391. doi: 10.3389/fbioe.2020.00391. eCollection 2020.
5. A Literature Review of Gene Function Prediction by Modeling Gene Ontology.
Front Genet. 2020 Apr 24;11:400. doi: 10.3389/fgene.2020.00400. eCollection 2020.
6. Integrating multi-network topology for gene function prediction using deep neural networks.
Brief Bioinform. 2021 Mar 22;22(2):2096-2105. doi: 10.1093/bib/bbaa036.
7. A Deep Learning Framework for Gene Ontology Annotations With Sequence- and Network-Based Information.
IEEE/ACM Trans Comput Biol Bioinform. 2021 Nov-Dec;18(6):2208-2217. doi: 10.1109/TCBB.2020.2968882. Epub 2021 Dec 8.
8. The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens.
Genome Biol. 2019 Nov 19;20(1):244. doi: 10.1186/s13059-019-1835-8.
9. Machine learning techniques for protein function prediction.
Proteins. 2020 Mar;88(3):397-413. doi: 10.1002/prot.25832. Epub 2019 Nov 14.
10. DeepGOPlus: improved protein function prediction from sequence.
Bioinformatics. 2020 Jan 15;36(2):422-429. doi: 10.1093/bioinformatics/btz595.