FFPred：一个用于脊椎动物蛋白质组的基于综合特征的功能预测服务器。

FFPred: an integrated feature-based function prediction server for vertebrate proteomes.

作者信息

Lobley A E, Nugent T, Orengo C A, Jones D T

机构信息

Department of Computer Science, University College London, London WC1E 6BT, United Kingdom.

出版信息

Nucleic Acids Res. 2008 Jul 1;36(Web Server issue):W297-302. doi: 10.1093/nar/gkn193. Epub 2008 May 7.

DOI:10.1093/nar/gkn193

PMID:18463141

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2447771/

Abstract

One of the challenges of the post-genomic era is to provide accurate function annotations for large volumes of data resulting from genome sequencing projects. Most function prediction servers utilize methods that transfer existing database annotations between orthologous sequences. In contrast, there are few methods that are independent of homology and can annotate distant and orphan protein sequences. The FFPred server adopts a machine-learning approach to perform function prediction in protein feature space using feature characteristics predicted from amino acid sequence. The features are scanned against a library of support vector machines representing over 300 Gene Ontology (GO) classes and probabilistic confidence scores returned for each annotation term. The GO term library has been modelled on human protein annotations; however, benchmark performance testing showed robust performance across higher eukaryotes. FFPred offers important advantages over traditional function prediction servers in its ability to annotate distant homologues and orphan protein sequences, and achieves greater coverage and classification accuracy than other feature-based prediction servers. A user may upload an amino acid and receive annotation predictions via email. Feature information is provided as easy to interpret graphics displayed on the sequence of interest, allowing for back-interpretation of the associations between features and function classes.

摘要

后基因组时代的挑战之一是为基因组测序项目产生的大量数据提供准确的功能注释。大多数功能预测服务器采用在直系同源序列之间转移现有数据库注释的方法。相比之下，几乎没有独立于同源性且能注释远缘和孤儿蛋白序列的方法。FFPred服务器采用机器学习方法，利用从氨基酸序列预测的特征特性在蛋白质特征空间中进行功能预测。将这些特征与一个代表300多个基因本体（GO）类别的支持向量机库进行比对，并为每个注释术语返回概率置信度得分。GO术语库是基于人类蛋白质注释构建的；然而，基准性能测试表明在高等真核生物中其性能稳健。FFPred在注释远缘同源物和孤儿蛋白序列方面比传统功能预测服务器具有重要优势，并且比其他基于特征的预测服务器实现了更高的覆盖率和分类准确率。用户可以上传氨基酸序列并通过电子邮件接收注释预测。特征信息以易于解释的图形形式显示在感兴趣的序列上，便于反向解读特征与功能类别之间的关联。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0734/2447771/854c4c1059de/gkn193f1.jpg

相似文献

FFPred: an integrated feature-based function prediction server for vertebrate proteomes.FFPred：一个用于脊椎动物蛋白质组的基于综合特征的功能预测服务器。

Nucleic Acids Res. 2008 Jul 1;36(Web Server issue):W297-302. doi: 10.1093/nar/gkn193. Epub 2008 May 7.

FFPred 2.0: improved homology-independent prediction of gene ontology terms for eukaryotic protein sequences.FFPred 2.0：改进了真核蛋白质序列的同源无关基因本体术语预测。

PLoS One. 2013 May 22;8(5):e63754. doi: 10.1371/journal.pone.0063754. Print 2013.

JAFA: a protein function annotation meta-server.JAFA：一个蛋白质功能注释元服务器。

Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W379-81. doi: 10.1093/nar/gkl045.

Bioverse: Functional, structural and contextual annotation of proteins and proteomes.生物宇宙：蛋白质和蛋白质组的功能、结构及情境注释

Nucleic Acids Res. 2003 Jul 1;31(13):3736-7. doi: 10.1093/nar/gkg550.

MESSA: MEta-Server for protein Sequence Analysis.MESSA：蛋白质序列分析元服务器。

BMC Biol. 2012 Oct 2;10:82. doi: 10.1186/1741-7007-10-82.

Using PFP and ESG Protein Function Prediction Web Servers.使用PFP和ESG蛋白质功能预测网络服务器。

Methods Mol Biol. 2017;1611:1-14. doi: 10.1007/978-1-4939-7015-5_1.

A grid environment for high-throughput proteomics.用于高通量蛋白质组学的网格环境。

IEEE Trans Nanobioscience. 2007 Jun;6(2):117-23. doi: 10.1109/tnb.2007.897495.

WILMA-automated annotation of protein sequences.WILMA - 蛋白质序列的自动注释

Bioinformatics. 2004 Jan 1;20(1):127-8. doi: 10.1093/bioinformatics/btg380.

BIOVERSE: enhancements to the framework for structural, functional and contextual modeling of proteins and proteomes.生物宇宙：蛋白质和蛋白质组结构、功能及上下文建模框架的增强功能。

Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W324-5. doi: 10.1093/nar/gki401.

REGANOR: a gene prediction server for prokaryotic genomes and a database of high quality gene predictions for prokaryotes.REGANOR：一个用于原核生物基因组的基因预测服务器以及一个原核生物高质量基因预测数据库。

Appl Bioinformatics. 2006;5(3):193-8. doi: 10.2165/00822942-200605030-00008.

引用本文的文献

The De Novo Emergence of Two Brain Genes in the Human Lineage Appears to be Unsupported.人类谱系中两个大脑基因的从头出现似乎缺乏依据。

J Mol Evol. 2025 Feb;93(1):3-10. doi: 10.1007/s00239-024-10227-3. Epub 2024 Dec 27.

An experimental analysis of graph representation learning for Gene Ontology based protein function prediction.基于基因本体论的蛋白质功能预测的图表示学习的实验分析。

PeerJ. 2024 Nov 14;12:e18509. doi: 10.7717/peerj.18509. eCollection 2024.

Integrating unsupervised language model with triplet neural networks for protein gene ontology prediction.将无监督语言模型与三重态神经网络集成，用于蛋白质基因本体预测。

PLoS Comput Biol. 2022 Dec 22;18(12):e1010793. doi: 10.1371/journal.pcbi.1010793. eCollection 2022 Dec.

Protein function prediction with gene ontology: from traditional to deep learning models.利用基因本体进行蛋白质功能预测：从传统模型到深度学习模型

PeerJ. 2021 Aug 24;9:e12019. doi: 10.7717/peerj.12019. eCollection 2021.

A lncRNA-SWI/SNF complex crosstalk controls transcriptional activation at specific promoter regions.长链非编码 RNA-SWI/SNF 复合物相互作用控制特定启动子区域的转录激活。

Nat Commun. 2020 Feb 18;11(1):936. doi: 10.1038/s41467-020-14623-3.

Predicting human protein function with multi-task deep neural networks.用多任务深度神经网络预测人类蛋白质功能。

PLoS One. 2018 Jun 11;13(6):e0198216. doi: 10.1371/journal.pone.0198216. eCollection 2018.

Assessing the Performances of Protein Function Prediction Algorithms from the Perspectives of Identification Accuracy and False Discovery Rate.从识别准确率和假发现率的角度评估蛋白质功能预测算法的性能。

Int J Mol Sci. 2018 Jan 8;19(1):183. doi: 10.3390/ijms19010183.

Analysis of temporal transcription expression profiles reveal links between protein function and developmental stages of Drosophila melanogaster.对时间转录表达谱的分析揭示了果蝇蛋白质功能与发育阶段之间的联系。

PLoS Comput Biol. 2017 Oct 18;13(10):e1005791. doi: 10.1371/journal.pcbi.1005791. eCollection 2017 Oct.

Primer on the Gene Ontology.基因本体论入门

Methods Mol Biol. 2017;1446:25-37. doi: 10.1007/978-1-4939-3743-1_3.

Substrate specificity characterization for eight putative nudix hydrolases. Evaluation of criteria for substrate identification within the Nudix family.八种假定的Nudix水解酶的底物特异性表征。Nudix家族内底物识别标准的评估。

Proteins. 2016 Dec;84(12):1810-1822. doi: 10.1002/prot.25163. Epub 2016 Oct 1.

本文引用的文献

Inferring function using patterns of native disorder in proteins.利用蛋白质天然无序模式推断功能。

PLoS Comput Biol. 2007 Aug;3(8):e162. doi: 10.1371/journal.pcbi.0030162. Epub 2007 Jul 3.

Classification of conformational stability of protein mutants from 3D pseudo-folding graph representation of protein sequences using support vector machines.利用支持向量机从蛋白质序列的三维伪折叠图表示中对蛋白质突变体的构象稳定性进行分类。

Proteins. 2008 Jan 1;70(1):167-75. doi: 10.1002/prot.21524.

SVM Classifier - a comprehensive java interface for support vector machine classification of microarray data.支持向量机分类器——用于对微阵列数据进行支持向量机分类的全面Java接口。

BMC Bioinformatics. 2006 Dec 12;7 Suppl 4(Suppl 4):S25. doi: 10.1186/1471-2105-7-S4-S25.

Protein-protein interactions more conserved within species than across species.蛋白质与蛋白质之间的相互作用在物种内部比在物种之间更为保守。

PLoS Comput Biol. 2006 Jul 21;2(7):e79. doi: 10.1371/journal.pcbi.0020079. Epub 2006 May 18.

Beyond annotation transfer by homology: novel protein-function prediction methods to assist drug discovery.超越同源性注释转移：助力药物发现的新型蛋白质功能预测方法。

Drug Discov Today. 2005 Nov 1;10(21):1475-82. doi: 10.1016/S1359-6446(05)03621-4.

Predicting enzyme class from protein structure without alignments.无需比对即可从蛋白质结构预测酶的类别。

J Mol Biol. 2005 Jan 7;345(1):187-99. doi: 10.1016/j.jmb.2004.10.024.

The Gene Ontology Annotation (GOA) Database: sharing knowledge in Uniprot with Gene Ontology.基因本体注释（GOA）数据库：在UniProt中与基因本体共享知识。

Nucleic Acids Res. 2004 Jan 1;32(Database issue):D262-6. doi: 10.1093/nar/gkh021.

UniProt: the Universal Protein knowledgebase.通用蛋白质知识库（UniProt）。

Nucleic Acids Res. 2004 Jan 1;32(Database issue):D115-9. doi: 10.1093/nar/gkh131.

How well is enzyme function conserved as a function of pairwise sequence identity?酶功能作为成对序列同一性的函数，其保守程度如何？

J Mol Biol. 2003 Oct 31;333(4):863-82. doi: 10.1016/j.jmb.2003.08.057.

Asymptotic behaviors of support vector machines with Gaussian kernel.具有高斯核的支持向量机的渐近行为

Neural Comput. 2003 Jul;15(7):1667-89. doi: 10.1162/089976603321891855.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

FFPred：一个用于脊椎动物蛋白质组的基于综合特征的功能预测服务器。

FFPred: an integrated feature-based function prediction server for vertebrate proteomes.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献