• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PANNZER:在易出错环境中对未表征蛋白质进行高通量功能注释。

PANNZER: high-throughput functional annotation of uncharacterized proteins in an error-prone environment.

作者信息

Koskinen Patrik, Törönen Petri, Nokso-Koivisto Jussi, Holm Liisa

机构信息

Department of Biosciences, University of Helsinki, 00014 Helsinki, Finland and Institute of Biotechnology, University of Helsinki, 00014 Helsinki, Finland.

Department of Biosciences, University of Helsinki, 00014 Helsinki, Finland and Institute of Biotechnology, University of Helsinki, 00014 Helsinki, Finland Department of Biosciences, University of Helsinki, 00014 Helsinki, Finland and Institute of Biotechnology, University of Helsinki, 00014 Helsinki, Finland.

出版信息

Bioinformatics. 2015 May 15;31(10):1544-52. doi: 10.1093/bioinformatics/btu851. Epub 2015 Jan 8.

DOI:10.1093/bioinformatics/btu851
PMID:25653249
Abstract

MOTIVATION

The last decade has seen a remarkable growth in protein databases. This growth comes at a price: a growing number of submitted protein sequences lack functional annotation. Approximately 32% of sequences submitted to the most comprehensive protein database UniProtKB are labelled as 'Unknown protein' or alike. Also the functionally annotated parts are reported to contain 30-40% of errors. Here, we introduce a high-throughput tool for more reliable functional annotation called Protein ANNotation with Z-score (PANNZER). PANNZER predicts Gene Ontology (GO) classes and free text descriptions about protein functionality. PANNZER uses weighted k-nearest neighbour methods with statistical testing to maximize the reliability of a functional annotation.

RESULTS

Our results in free text description line prediction show that we outperformed all competing methods with a clear margin. In GO prediction we show clear improvement to our older method that performed well in CAFA 2011 challenge.

摘要

动机

在过去十年中,蛋白质数据库显著增长。这种增长是有代价的:提交的蛋白质序列中缺乏功能注释的数量越来越多。提交到最全面的蛋白质数据库UniProtKB的序列中,约32%被标记为“未知蛋白质”或类似名称。此外,据报道,功能注释部分也包含30%-40%的错误。在此,我们推出了一种名为蛋白质Z分数注释(PANNZER)的高通量工具,用于更可靠的功能注释。PANNZER预测基因本体(GO)类别和有关蛋白质功能的自由文本描述。PANNZER使用加权k近邻方法和统计测试,以最大限度地提高功能注释的可靠性。

结果

我们在自由文本描述行预测中的结果表明,我们以明显优势超过了所有竞争方法。在GO预测中,我们相对于在2011年CAFA挑战赛中表现良好的旧方法有了明显改进。

相似文献

1
PANNZER: high-throughput functional annotation of uncharacterized proteins in an error-prone environment.PANNZER:在易出错环境中对未表征蛋白质进行高通量功能注释。
Bioinformatics. 2015 May 15;31(10):1544-52. doi: 10.1093/bioinformatics/btu851. Epub 2015 Jan 8.
2
PANNZER-A practical tool for protein function prediction.PANNZER——一种用于蛋白质功能预测的实用工具。
Protein Sci. 2022 Jan;31(1):118-128. doi: 10.1002/pro.4193. Epub 2021 Oct 14.
3
Protein function prediction using text-based features extracted from the biomedical literature: the CAFA challenge.基于生物医学文献中提取的文本特征进行蛋白质功能预测:CAFA 挑战赛。
BMC Bioinformatics. 2013;14 Suppl 3(Suppl 3):S14. doi: 10.1186/1471-2105-14-S3-S14. Epub 2013 Feb 28.
4
How to inherit statistically validated annotation within BAR+ protein clusters.如何在 BAR+ 蛋白簇中继承经过统计学验证的注释。
BMC Bioinformatics. 2013;14 Suppl 3(Suppl 3):S4. doi: 10.1186/1471-2105-14-S3-S4. Epub 2013 Feb 28.
5
Mutual annotation-based prediction of protein domain functions with Domain2GO.基于互注释的蛋白质结构域功能预测与 Domain2GO。
Protein Sci. 2024 Jun;33(6):e4988. doi: 10.1002/pro.4988.
6
Automatic consistency assurance for literature-based gene ontology annotation.基于文献的基因本体论自动一致性保证。
BMC Bioinformatics. 2021 Nov 25;22(1):565. doi: 10.1186/s12859-021-04479-9.
7
Blinded Testing of Function Annotation for uPE1 Proteins by I-TASSER/COFACTOR Pipeline Using the 2018-2019 Additions to neXtProt and the CAFA3 Challenge.使用 2018-2019 年 neXtProt 的新增内容和 CAFA3 挑战赛,通过 I-TASSER/COFACTOR 管道对 uPE1 蛋白的功能注释进行盲测。
J Proteome Res. 2019 Dec 6;18(12):4154-4166. doi: 10.1021/acs.jproteome.9b00537. Epub 2019 Oct 18.
8
UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches.UniRef聚类:一种用于改进序列相似性搜索的全面且可扩展的替代方法。
Bioinformatics. 2015 Mar 15;31(6):926-32. doi: 10.1093/bioinformatics/btu739. Epub 2014 Nov 13.
9
Exploiting ontology graph for predicting sparsely annotated gene function.利用本体图预测注释稀疏的基因功能。
Bioinformatics. 2015 Jun 15;31(12):i357-64. doi: 10.1093/bioinformatics/btv260.
10
B2G-FAR, a species-centered GO annotation repository.B2G-FAR,一个以物种为中心的 GO 注释知识库。
Bioinformatics. 2011 Apr 1;27(7):919-24. doi: 10.1093/bioinformatics/btr059. Epub 2011 Feb 18.

引用本文的文献

1
Transcriptomic analysis reveals molecular phenological changes during the flower-to-fruit transition in Vanilla planifolia Andrews (Orchidaceae).转录组分析揭示了香草兰(兰科)从花到果实转变过程中的分子物候变化。
BMC Plant Biol. 2025 Apr 5;25(1):437. doi: 10.1186/s12870-025-06476-z.
2
Repeat-induced point mutations driving Parastagonospora nodorum genomic diversity are balanced by selection against non-synonymous mutations.重复诱导的点突变驱动小麦根腐平脐蠕孢基因组多样性,这种多样性通过对非同义突变的选择而得到平衡。
Commun Biol. 2024 Dec 4;7(1):1614. doi: 10.1038/s42003-024-07327-7.
3
Enhancing Gene Co-Expression Network Inference for the Malaria Parasite .
增强疟原虫基因共表达网络推断
Genes (Basel). 2024 May 25;15(6):685. doi: 10.3390/genes15060685.
4
Current status and emerging frontiers in enzyme engineering: An industrial perspective.酶工程的现状与新兴前沿:工业视角
Heliyon. 2024 Jun 7;10(11):e32673. doi: 10.1016/j.heliyon.2024.e32673. eCollection 2024 Jun 15.
5
Characterization of the Mitochondrial Proteome in the Ctenophore Mnemiopsis leidyi Using MitoPredictor.使用 MitoPredictor 对中胚层栉水母 Mnemiopsis leidyi 的线粒体蛋白质组进行特征描述。
Methods Mol Biol. 2024;2757:239-257. doi: 10.1007/978-1-0716-3642-8_10.
6
Comparative gene co-expression networks show enrichment of brassinosteroid and vitamin B processes in a seagrass under simulated ocean warming and extreme climatic events.比较基因共表达网络显示,在模拟海洋变暖和极端气候事件下,一种海草中的油菜素内酯和维生素B相关过程出现富集。
Front Plant Sci. 2024 Jan 26;15:1309956. doi: 10.3389/fpls.2024.1309956. eCollection 2024.
7
Evolutionary genomics of three agricultural pest moths reveals rapid evolution of host adaptation and immune-related genes.三种农业害虫蛾的进化基因组学揭示了宿主适应性和免疫相关基因的快速进化。
Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giad103.
8
Prediction of Thermostability of Enzymes Based on the Amino Acid Index (AAindex) Database and Machine Learning.基于氨基酸指数(AAindex)数据库和机器学习预测酶的热稳定性
Molecules. 2023 Dec 15;28(24):8097. doi: 10.3390/molecules28248097.
9
Osmoprotectants play a major role in the resistance to high levels of salinity stress-insights from a metabolomics and proteomics integrated approach.渗透保护剂在抵抗高盐胁迫中起主要作用——来自代谢组学和蛋白质组学整合方法的见解
Front Plant Sci. 2023 Jun 13;14:1187803. doi: 10.3389/fpls.2023.1187803. eCollection 2023.
10
Targeting Essential Hypothetical Proteins of PAO1 for Mining of Novel Therapeutics: An Approach.靶向 PAO1 的必需假设蛋白以挖掘新型治疗方法:一种方法。
Biomed Res Int. 2023 Apr 11;2023:1787485. doi: 10.1155/2023/1787485. eCollection 2023.