• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CAFA 挑战赛报告称,通过实验筛选,提高了数百个基因的蛋白质功能预测和新的功能注释。

The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens.

机构信息

Veterinary Microbiology and Preventive Medicine, Iowa State University, Ames, IA, USA.

Program in Bioinformatics and Computational Biology, Ames, IA, USA.

出版信息

Genome Biol. 2019 Nov 19;20(1):244. doi: 10.1186/s13059-019-1835-8.

DOI:10.1186/s13059-019-1835-8
PMID:31744546
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6864930/
Abstract

BACKGROUND

The Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function.

RESULTS

Here, we report on the results of the third CAFA challenge, CAFA3, that featured an expanded analysis over the previous CAFA rounds, both in terms of volume of data analyzed and the types of analysis performed. In a novel and major new development, computational predictions and assessment goals drove some of the experimental assays, resulting in new functional annotations for more than 1000 genes. Specifically, we performed experimental whole-genome mutation screening in Candida albicans and Pseudomonas aureginosa genomes, which provided us with genome-wide experimental data for genes associated with biofilm formation and motility. We further performed targeted assays on selected genes in Drosophila melanogaster, which we suspected of being involved in long-term memory.

CONCLUSION

We conclude that while predictions of the molecular function and biological process annotations have slightly improved over time, those of the cellular component have not. Term-centric prediction of experimental annotations remains equally challenging; although the performance of the top methods is significantly better than the expectations set by baseline methods in C. albicans and D. melanogaster, it leaves considerable room and need for improvement. Finally, we report that the CAFA community now involves a broad range of participants with expertise in bioinformatics, biological experimentation, biocuration, and bio-ontologies, working together to improve functional annotation, computational function prediction, and our ability to manage big data in the era of large experimental screens.

摘要

背景

功能注释的关键评估(CAFA)是一个正在进行的、全球性的、由社区驱动的努力,旨在评估和改进蛋白质功能的计算注释。

结果

在这里,我们报告了第三次 CAFA 挑战(CAFA3)的结果,与之前的 CAFA 轮次相比,无论是在分析数据的数量还是执行的分析类型方面,都进行了扩展分析。在一个新颖且重大的新发展中,计算预测和评估目标推动了一些实验测定,为 1000 多个基因提供了新的功能注释。具体来说,我们在白色念珠菌和铜绿假单胞菌基因组中进行了全基因组突变筛选实验,为与生物膜形成和运动相关的基因提供了全基因组实验数据。我们还在果蝇中对选定的基因进行了靶向测定,我们怀疑这些基因与长期记忆有关。

结论

我们得出的结论是,尽管分子功能和生物学过程注释的预测随着时间的推移略有改善,但细胞成分的预测并没有改善。实验注释的基于术语的预测仍然具有挑战性;尽管在白色念珠菌和果蝇中,顶级方法的性能明显优于基线方法设定的预期,但仍有相当大的改进空间和需求。最后,我们报告说,CAFA 社区现在涉及到广泛的参与者,他们在生物信息学、生物实验、生物注释和生物本体方面具有专业知识,共同努力提高功能注释、计算功能预测以及我们在大型实验筛选时代管理大数据的能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/6a19490ca7fb/13059_2019_1835_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/b65cb2d09470/13059_2019_1835_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/f838bd9e1f4a/13059_2019_1835_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/be60901a6897/13059_2019_1835_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/8155cb3cf2c8/13059_2019_1835_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/7af112ed0b56/13059_2019_1835_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/ddb02255ebfe/13059_2019_1835_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/b5a5948a10b2/13059_2019_1835_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/e4144cd0dce0/13059_2019_1835_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/40e306b70c62/13059_2019_1835_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/da1000eb1734/13059_2019_1835_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/25f4dcbf9dd5/13059_2019_1835_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/597a5b5fa1ca/13059_2019_1835_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/19a0b371e48c/13059_2019_1835_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/8710a2738590/13059_2019_1835_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/6a19490ca7fb/13059_2019_1835_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/b65cb2d09470/13059_2019_1835_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/f838bd9e1f4a/13059_2019_1835_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/be60901a6897/13059_2019_1835_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/8155cb3cf2c8/13059_2019_1835_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/7af112ed0b56/13059_2019_1835_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/ddb02255ebfe/13059_2019_1835_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/b5a5948a10b2/13059_2019_1835_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/e4144cd0dce0/13059_2019_1835_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/40e306b70c62/13059_2019_1835_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/da1000eb1734/13059_2019_1835_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/25f4dcbf9dd5/13059_2019_1835_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/597a5b5fa1ca/13059_2019_1835_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/19a0b371e48c/13059_2019_1835_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/8710a2738590/13059_2019_1835_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d39/6864930/6a19490ca7fb/13059_2019_1835_Fig15_HTML.jpg

相似文献

1
The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens.CAFA 挑战赛报告称,通过实验筛选,提高了数百个基因的蛋白质功能预测和新的功能注释。
Genome Biol. 2019 Nov 19;20(1):244. doi: 10.1186/s13059-019-1835-8.
2
New Long-Term Memory Genes Revealed by Assessing Computational Function Prediction Methods.通过评估计算功能预测方法揭示新的长时记忆基因。
G3 (Bethesda). 2019 Jan 9;9(1):251-267. doi: 10.1534/g3.118.200867.
3
4
Improved gene ontology annotation for biofilm formation, filamentous growth, and phenotypic switching in Candida albicans.白色念珠菌生物膜形成、丝状生长和表型转换的基因本体注释改进
Eukaryot Cell. 2013 Jan;12(1):101-8. doi: 10.1128/EC.00238-12. Epub 2012 Nov 9.
5
Protein function prediction using text-based features extracted from the biomedical literature: the CAFA challenge.基于生物医学文献中提取的文本特征进行蛋白质功能预测:CAFA 挑战赛。
BMC Bioinformatics. 2013;14 Suppl 3(Suppl 3):S14. doi: 10.1186/1471-2105-14-S3-S14. Epub 2013 Feb 28.
6
A domain-centric solution to functional genomics via dcGO Predictor.通过 dcGO Predictor 实现功能基因组学的以域为中心的解决方案。
BMC Bioinformatics. 2013;14 Suppl 3(Suppl 3):S9. doi: 10.1186/1471-2105-14-S3-S9. Epub 2013 Feb 28.
7
Assigning protein function from domain-function associations using DomFun.基于域-功能关联来分配蛋白质功能,使用 DomFun。
BMC Bioinformatics. 2022 Jan 15;23(1):43. doi: 10.1186/s12859-022-04565-6.
8
Blinded Testing of Function Annotation for uPE1 Proteins by I-TASSER/COFACTOR Pipeline Using the 2018-2019 Additions to neXtProt and the CAFA3 Challenge.使用 2018-2019 年 neXtProt 的新增内容和 CAFA3 挑战赛,通过 I-TASSER/COFACTOR 管道对 uPE1 蛋白的功能注释进行盲测。
J Proteome Res. 2019 Dec 6;18(12):4154-4166. doi: 10.1021/acs.jproteome.9b00537. Epub 2019 Oct 18.
9
Mutual annotation-based prediction of protein domain functions with Domain2GO.基于互注释的蛋白质结构域功能预测与 Domain2GO。
Protein Sci. 2024 Jun;33(6):e4988. doi: 10.1002/pro.4988.
10
An expanded evaluation of protein function prediction methods shows an improvement in accuracy.对蛋白质功能预测方法的扩展评估显示准确性有所提高。
Genome Biol. 2016 Sep 7;17(1):184. doi: 10.1186/s13059-016-1037-6.

引用本文的文献

1
DIAMOND2GO: rapid Gene Ontology assignment and enrichment detection for functional genomics.DIAMOND2GO:用于功能基因组学的快速基因本体分配与富集检测
Front Bioinform. 2025 Aug 15;5:1634042. doi: 10.3389/fbinf.2025.1634042. eCollection 2025.
2
LocPro: A deep learning-based prediction of protein subcellular localization for promoting multi-directional pharmaceutical research.LocPro:基于深度学习的蛋白质亚细胞定位预测,以促进多方向药物研究。
J Pharm Anal. 2025 Aug;15(8):101255. doi: 10.1016/j.jpha.2025.101255. Epub 2025 Mar 5.
3
Protein functional site annotation using local structure embeddings.

本文引用的文献

1
NetGO: improving large-scale protein function prediction with massive network information.NetGO:利用大规模网络信息提高大规模蛋白质功能预测。
Nucleic Acids Res. 2019 Jul 2;47(W1):W379-W387. doi: 10.1093/nar/gkz388.
2
INGA 2.0: improving protein function prediction for the dark proteome.INGA 2.0:改进黑暗蛋白质组中蛋白质功能的预测。
Nucleic Acids Res. 2019 Jul 2;47(W1):W373-W378. doi: 10.1093/nar/gkz375.
3
New Long-Term Memory Genes Revealed by Assessing Computational Function Prediction Methods.通过评估计算功能预测方法揭示新的长时记忆基因。
利用局部结构嵌入进行蛋白质功能位点注释。
Proc Natl Acad Sci U S A. 2025 Aug 26;122(34):e2513219122. doi: 10.1073/pnas.2513219122. Epub 2025 Aug 20.
4
MKFGO: integrating multi-source knowledge fusion with pretrained language model for high-accuracy protein function prediction.MKFGO:将多源知识融合与预训练语言模型相结合用于高精度蛋白质功能预测
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf420.
5
FANTASIA leverages language models to decode the functional dark proteome across the animal tree of life.FANTASIA利用语言模型来解码整个动物生命树中的功能性暗蛋白质组。
Commun Biol. 2025 Aug 14;8(1):1227. doi: 10.1038/s42003-025-08651-2.
6
ProT-GFDM: A generative fractional diffusion model for protein generation.ProT-GFDM:一种用于蛋白质生成的生成式分数扩散模型。
Comput Struct Biotechnol J. 2025 Aug 5;27:3464-3480. doi: 10.1016/j.csbj.2025.07.045. eCollection 2025.
7
Bag-of-words is competitive with sum-of-embeddings language-inspired representations on protein inference.词袋模型在蛋白质推理方面与基于语言启发的词嵌入求和表示法具有竞争力。
PLoS One. 2025 Aug 6;20(8):e0325531. doi: 10.1371/journal.pone.0325531. eCollection 2025.
8
GOAnnotator: accurate protein function annotation using automatically retrieved literature.GO注释器:利用自动检索的文献进行准确的蛋白质功能注释。
Bioinformatics. 2025 Jul 1;41(Supplement_1):i410-i419. doi: 10.1093/bioinformatics/btaf199.
9
POSA-GO: Fusion of Hierarchical Gene Ontology and Protein Language Models for Protein Function Prediction.POSA-GO:用于蛋白质功能预测的分层基因本体与蛋白质语言模型融合
Int J Mol Sci. 2025 Jul 1;26(13):6362. doi: 10.3390/ijms26136362.
10
ProtGO: universal protein function prediction utilizing multi-modal gene ontology knowledge.ProtGO:利用多模态基因本体知识进行通用蛋白质功能预测
Bioinformatics. 2025 Jul 1;41(7). doi: 10.1093/bioinformatics/btaf390.
G3 (Bethesda). 2019 Jan 9;9(1):251-267. doi: 10.1534/g3.118.200867.
4
Large-scale protein function prediction using heterogeneous ensembles.使用异构集成进行大规模蛋白质功能预测。
F1000Res. 2018 Sep 28;7. doi: 10.12688/f1000research.16415.1. eCollection 2018.
5
Enumerating consistent sub-graphs of directed acyclic graphs: an insight into biomedical ontologies.枚举有向无环图的一致子图:生物医学本体论的一个见解。
Bioinformatics. 2018 Jul 1;34(13):i313-i322. doi: 10.1093/bioinformatics/bty268.
6
Predicting human protein function with multi-task deep neural networks.用多任务深度神经网络预测人类蛋白质功能。
PLoS One. 2018 Jun 11;13(6):e0198216. doi: 10.1371/journal.pone.0198216. eCollection 2018.
7
GOLabeler: improving sequence-based large-scale protein function prediction by learning to rank.GOLabeler:通过学习排序提高基于序列的大规模蛋白质功能预测。
Bioinformatics. 2018 Jul 15;34(14):2465-2473. doi: 10.1093/bioinformatics/bty130.
8
Unsupervised Extraction of Stable Expression Signatures from Public Compendia with an Ensemble of Neural Networks.无监督提取公共文库中稳定表达特征的神经网络集成方法。
Cell Syst. 2017 Jul 26;5(1):63-71.e6. doi: 10.1016/j.cels.2017.06.003. Epub 2017 Jul 12.
9
UniProt: the universal protein knowledgebase.通用蛋白质知识库:UniProt
Nucleic Acids Res. 2017 Jan 4;45(D1):D158-D169. doi: 10.1093/nar/gkw1099. Epub 2016 Nov 29.
10
Community-Wide Evaluation of Computational Function Prediction.计算功能预测的全社区评估
Methods Mol Biol. 2017;1446:133-146. doi: 10.1007/978-1-4939-3743-1_10.