• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

黑腹果蝇全基因组蛋白质功能分类评估

Assessment of genome-wide protein function classification for Drosophila melanogaster.

作者信息

Mi Huaiyu, Vandergriff Jody, Campbell Michael, Narechania Apurva, Majoros William, Lewis Suzanna, Thomas Paul D, Ashburner Michael

机构信息

Protein Informatics, Celera Genomics, Foster City, California 94404, USA.

出版信息

Genome Res. 2003 Sep;13(9):2118-28. doi: 10.1101/gr.771603.

DOI:10.1101/gr.771603
PMID:12952880
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC403707/
Abstract

The functional classification of genes on a genome-wide scale is now in its infancy, and we make a first attempt to assess existing methods and identify sources of error. To this end, we compared two independent efforts for associating proteins with functions, one implemented by FlyBase and the other by PANTHER at Celera Genomics. Both methods make inferences based on sequence similarity and the available experimental evidence. However, they differ considerably in methodology and process. Overall, assuming that the systematic error across the two methods is relatively small, we find the protein-to-function association error rate of both the FlyBase and PANTHER methods to be <2%. The primary source of error for both methods appears to be simple human error. Although homology-based inference can certainly cause errors in annotation, our analysis indicates that the frequency of such errors is relatively small compared with the number of correct inferences. Moreover, these homology errors can be minimized by careful tree-based inference, such as that implemented in PANTHER. Often, functional associations are made by one method and not the other, indicating that one of the greatest challenges lies in improving the completeness of available ontology associations.

摘要

全基因组范围内基因的功能分类目前尚处于起步阶段,我们首次尝试评估现有方法并识别错误来源。为此,我们比较了两项将蛋白质与功能相关联的独立工作,一项由FlyBase实施,另一项由赛雷拉基因组公司的PANTHER实施。两种方法均基于序列相似性和现有的实验证据进行推断。然而,它们在方法和过程上有很大差异。总体而言,假设两种方法之间的系统误差相对较小,我们发现FlyBase和PANTHER方法的蛋白质与功能关联错误率均<2%。两种方法的主要错误来源似乎都是简单的人为错误。虽然基于同源性的推断肯定会导致注释错误,但我们的分析表明,与正确推断的数量相比,此类错误的频率相对较小。此外,通过仔细的基于树的推断,如PANTHER中实施的推断,可以将这些同源性错误降至最低。通常,功能关联是通过一种方法而非另一种方法进行的,这表明最大的挑战之一在于提高可用本体关联的完整性。

相似文献

1
Assessment of genome-wide protein function classification for Drosophila melanogaster.黑腹果蝇全基因组蛋白质功能分类评估
Genome Res. 2003 Sep;13(9):2118-28. doi: 10.1101/gr.771603.
2
Drosophila melanogaster: a case study of a model genomic sequence and its consequences.黑腹果蝇:一个关于模型基因组序列及其影响的案例研究。
Genome Res. 2005 Dec;15(12):1661-7. doi: 10.1101/gr.3726705.
3
FlyBase: genes and gene models.果蝇数据库:基因与基因模型。
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D390-5. doi: 10.1093/nar/gki046.
4
Towards comprehensive annotation of Drosophila melanogaster enzymes in FlyBase.致力于在 FlyBase 中全面注释黑腹果蝇的酶。
Database (Oxford). 2019 Jan 1;2019:bay144. doi: 10.1093/database/bay144.
5
The FlyBase database of the Drosophila genome projects and community literature.果蝇基因组计划和社区文献的FlyBase数据库。
Nucleic Acids Res. 2002 Jan 1;30(1):106-8. doi: 10.1093/nar/30.1.106.
6
Detection of orphan domains in Drosophila using "hydrophobic cluster analysis".利用“疏水簇分析”检测果蝇中的孤儿结构域
Biochimie. 2015 Dec;119:244-53. doi: 10.1016/j.biochi.2015.02.019. Epub 2015 Feb 28.
7
The Drosophila melanogaster PeptideAtlas facilitates the use of peptide data for improved fly proteomics and genome annotation.果蝇肽图谱有助于利用肽数据改进果蝇蛋白质组学和基因组注释。
BMC Bioinformatics. 2009 Feb 11;10:59. doi: 10.1186/1471-2105-10-59.
8
Comparative genome and proteome analysis of Anopheles gambiae and Drosophila melanogaster.冈比亚按蚊和黑腹果蝇的比较基因组与蛋白质组分析。
Science. 2002 Oct 4;298(5591):149-59. doi: 10.1126/science.1077061.
9
In silico identification of new secretory peptide genes in Drosophila melanogaster.在计算机上对黑腹果蝇中新的分泌肽基因进行鉴定。
Mol Cell Proteomics. 2006 Mar;5(3):510-22. doi: 10.1074/mcp.M400114-MCP200. Epub 2005 Nov 16.
10
Annotation of the Drosophila melanogaster euchromatic genome: a systematic review.黑腹果蝇常染色体基因组注释:一项系统综述。
Genome Biol. 2002;3(12):RESEARCH0083. doi: 10.1186/gb-2002-3-12-research0083. Epub 2002 Dec 31.

引用本文的文献

1
Entabolons: How Metabolites Modify the Biochemical Function of Proteins and Cause the Correlated Behavior of Proteins in Pathways.代谢物组:代谢物如何改变蛋白质的生化功能并导致蛋白质在代谢途径中的相关行为。
J Chem Inf Model. 2025 Jun 9;65(11):5785-5800. doi: 10.1021/acs.jcim.5c00462. Epub 2025 May 16.
2
PANTHER: Making genome-scale phylogenetics accessible to all.PANTHER:让所有人大开眼界的基因组系统发生学。
Protein Sci. 2022 Jan;31(1):8-22. doi: 10.1002/pro.4218. Epub 2021 Nov 25.
3
Large-scale gene function analysis with the PANTHER classification system.大规模基因功能分析与 PANTHER 分类系统。
Nat Protoc. 2013 Aug;8(8):1551-66. doi: 10.1038/nprot.2013.092. Epub 2013 Jul 18.
4
A threading-based method for the prediction of DNA-binding proteins with application to the human genome.基于串联的方法预测 DNA 结合蛋白及其在人类基因组中的应用。
PLoS Comput Biol. 2009 Nov;5(11):e1000567. doi: 10.1371/journal.pcbi.1000567. Epub 2009 Nov 13.
5
FINDSITE: a combined evolution/structure-based approach to protein function prediction.FINDSITE:一种基于进化与结构相结合的蛋白质功能预测方法。
Brief Bioinform. 2009 Jul;10(4):378-91. doi: 10.1093/bib/bbp017. Epub 2009 Mar 26.
6
A Drosophila systems model of pentylenetetrazole induced locomotor plasticity responsive to antiepileptic drugs.一种对戊四氮诱导的运动可塑性有反应且对抗癫痫药物敏感的果蝇系统模型。
BMC Syst Biol. 2009 Jan 21;3:11. doi: 10.1186/1752-0509-3-11.
7
Expression profiles of urbilaterian genes uniquely shared between honey bee and vertebrates.蜜蜂和脊椎动物之间独特共享的原口动物基因的表达谱。
BMC Genomics. 2009 Jan 12;10:17. doi: 10.1186/1471-2164-10-17.
8
Framework for a protein ontology.蛋白质本体论框架。
BMC Bioinformatics. 2007 Nov 27;8 Suppl 9(Suppl 9):S1. doi: 10.1186/1471-2105-8-S9-S1.
9
Genomic and functional studies of Drosophila sex hierarchy regulated gene expression in adult head and nervous system tissues.果蝇性别等级制度在成年头部和神经系统组织中调控基因表达的基因组学和功能研究。
PLoS Genet. 2007 Nov;3(11):e216. doi: 10.1371/journal.pgen.0030216.
10
Prediction of gene expression in embryonic structures of Drosophila melanogaster.黑腹果蝇胚胎结构中基因表达的预测
PLoS Comput Biol. 2007 Jul;3(7):e144. doi: 10.1371/journal.pcbi.0030144.

本文引用的文献

1
PANTHER: a library of protein families and subfamilies indexed by function.PANTHER:一个按功能索引的蛋白质家族和亚家族库。
Genome Res. 2003 Sep;13(9):2129-41. doi: 10.1101/gr.772403.
2
PANTHER: a browsable database of gene products organized by biological function, using curated protein family and subfamily classification.PANTHER:一个可浏览的基因产物数据库,根据生物学功能进行组织,采用经过整理的蛋白质家族和亚家族分类。
Nucleic Acids Res. 2003 Jan 1;31(1):334-41. doi: 10.1093/nar/gkg115.
3
Initial sequencing and comparative analysis of the mouse genome.小鼠基因组的初步测序与比较分析。
Nature. 2002 Dec 5;420(6915):520-62. doi: 10.1038/nature01262.
4
Whole-genome shotgun assembly and analysis of the genome of Fugu rubripes.红鳍东方鲀全基因组鸟枪法测序组装与基因组分析
Science. 2002 Aug 23;297(5585):1301-10. doi: 10.1126/science.1072104. Epub 2002 Jul 25.
5
A comparison of whole-genome shotgun-derived mouse chromosome 16 and the human genome.基于全基因组鸟枪法测序得到的小鼠16号染色体与人类基因组的比较。
Science. 2002 May 31;296(5573):1661-71. doi: 10.1126/science.1069193.
6
The genome sequence of Schizosaccharomyces pombe.粟酒裂殖酵母的基因组序列。
Nature. 2002 Feb 21;415(6874):871-80. doi: 10.1038/nature724.
7
The FlyBase database of the Drosophila genome projects and community literature.果蝇基因组计划和社区文献的FlyBase数据库。
Nucleic Acids Res. 2002 Jan 1;30(1):106-8. doi: 10.1093/nar/30.1.106.
8
Initial sequencing and analysis of the human genome.人类基因组的初步测序与分析。
Nature. 2001 Feb 15;409(6822):860-921. doi: 10.1038/35057062.
9
The sequence of the human genome.人类基因组序列。
Science. 2001 Feb 16;291(5507):1304-51. doi: 10.1126/science.1058040.
10
Analysis of the genome sequence of the flowering plant Arabidopsis thaliana.开花植物拟南芥的基因组序列分析。
Nature. 2000 Dec 14;408(6814):796-815. doi: 10.1038/35048692.