• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用基因本体论注释评估直系同源物和旁系同源物之间的功能相似性:简短报告。

On the Use of Gene Ontology Annotations to Assess Functional Similarity among Orthologs and Paralogs: A Short Report.

机构信息

Division of Bioinformatics, Department of Preventive Medicine, University of Southern California, Los Angeles, California, United States of America.

出版信息

PLoS Comput Biol. 2012;8(2):e1002386. doi: 10.1371/journal.pcbi.1002386. Epub 2012 Feb 16.

DOI:10.1371/journal.pcbi.1002386
PMID:22359495
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3280971/
Abstract

A recent paper (Nehrt et al., PLoS Comput. Biol. 7:e1002073, 2011) has proposed a metric for the "functional similarity" between two genes that uses only the Gene Ontology (GO) annotations directly derived from published experimental results. Applying this metric, the authors concluded that paralogous genes within the mouse genome or the human genome are more functionally similar on average than orthologous genes between these genomes, an unexpected result with broad implications if true. We suggest, based on both theoretical and empirical considerations, that this proposed metric should not be interpreted as a functional similarity, and therefore cannot be used to support any conclusions about the "ortholog conjecture" (or, more properly, the "ortholog functional conservation hypothesis"). First, we reexamine the case studies presented by Nehrt et al. as examples of orthologs with divergent functions, and come to a very different conclusion: they actually exemplify how GO annotations for orthologous genes provide complementary information about conserved biological functions. We then show that there is a global ascertainment bias in the experiment-based GO annotations for human and mouse genes: particular types of experiments tend to be performed in different model organisms. We conclude that the reported statistical differences in annotations between pairs of orthologous genes do not reflect differences in biological function, but rather complementarity in experimental approaches. Our results underscore two general considerations for researchers proposing novel types of analysis based on the GO: 1) that GO annotations are often incomplete, potentially in a biased manner, and subject to an "open world assumption" (absence of an annotation does not imply absence of a function), and 2) that conclusions drawn from a novel, large-scale GO analysis should whenever possible be supported by careful, in-depth examination of examples, to help ensure the conclusions have a justifiable biological basis.

摘要

最近的一篇论文(Nehrt 等人,《公共科学图书馆计算生物学》7:e1002073,2011 年)提出了一种仅使用直接从已发表的实验结果中得出的基因本体论(GO)注释来衡量两个基因之间“功能相似性”的方法。应用该方法,作者得出结论,与这些基因组之间的同源基因相比,小鼠基因组或人类基因组中的旁系同源基因平均更具有功能相似性,如果这一结果真实,这将是一个具有广泛影响的意外结果。我们基于理论和经验的考虑,提出该方法不应被解释为功能相似性,因此不能用于支持任何关于“同源假说”(或者更准确地说,“同源功能保守假说”)的结论。首先,我们重新审视了 Nehrt 等人提出的作为功能差异的同源基因的案例研究,并得出了截然不同的结论:它们实际上例证了 GO 注释如何为同源基因提供有关保守生物功能的补充信息。然后,我们表明,基于实验的人类和小鼠基因的 GO 注释存在全局确证偏差:特定类型的实验往往在不同的模型生物中进行。我们得出的结论是,报告的同源基因对注释之间的统计差异并不反映生物学功能的差异,而是实验方法的互补性。我们的研究结果强调了基于 GO 提出新类型分析的研究人员应考虑的两个一般性问题:1)GO 注释通常是不完整的,可能具有偏向性,并受“开放世界假设”(没有注释并不意味着没有功能)的影响;2)从新颖的大规模 GO 分析中得出的结论应尽可能通过仔细、深入地检查示例来支持,以帮助确保结论具有合理的生物学基础。

相似文献

1
On the Use of Gene Ontology Annotations to Assess Functional Similarity among Orthologs and Paralogs: A Short Report.利用基因本体论注释评估直系同源物和旁系同源物之间的功能相似性:简短报告。
PLoS Comput Biol. 2012;8(2):e1002386. doi: 10.1371/journal.pcbi.1002386. Epub 2012 Feb 16.
2
The ortholog conjecture is untestable by the current gene ontology but is supported by RNA sequencing data.直系同源假说无法通过当前的基因本体论进行检验,但得到了 RNA 测序数据的支持。
PLoS Comput Biol. 2012;8(11):e1002784. doi: 10.1371/journal.pcbi.1002784. Epub 2012 Nov 29.
3
GO functional similarity clustering depends on similarity measure, clustering method, and annotation completeness.GO 功能相似性聚类取决于相似性度量、聚类方法和注释完整性。
BMC Bioinformatics. 2019 Mar 27;20(1):155. doi: 10.1186/s12859-019-2752-2.
4
Interspecies gene function prediction using semantic similarity.基于语义相似性的跨物种基因功能预测
BMC Syst Biol. 2016 Dec 23;10(Suppl 4):121. doi: 10.1186/s12918-016-0361-5.
5
A procedure for assessing GO annotation consistency.一种评估基因本体(GO)注释一致性的程序。
Bioinformatics. 2005 Jun;21 Suppl 1:i136-43. doi: 10.1093/bioinformatics/bti1019.
6
Gene family level comparative analysis of gene expression in mammals validates the ortholog conjecture.哺乳动物基因表达的基因家族水平比较分析验证了直系同源物猜想。
Genome Biol Evol. 2014 Apr;6(4):754-62. doi: 10.1093/gbe/evu051.
7
Automatic clustering of orthologs and in-paralogs from pairwise species comparisons.通过成对物种比较对直系同源基因和旁系同源基因进行自动聚类。
J Mol Biol. 2001 Dec 14;314(5):1041-52. doi: 10.1006/jmbi.2000.5197.
8
DODO: an efficient orthologous genes assignment tool based on domain architectures. Domain based ortholog detection.DODO:一种基于结构域的高效的直系同源基因分配工具。基于结构域的直系同源检测。
BMC Bioinformatics. 2010 Oct 15;11 Suppl 7(Suppl 7):S6. doi: 10.1186/1471-2105-11-S7-S6.
9
GASS: genome structural annotation for Eukaryotes based on species similarity.GASS:基于物种相似性的真核生物基因组结构注释
BMC Genomics. 2015 Mar 4;16(1):150. doi: 10.1186/s12864-015-1353-3.
10
Measuring semantic similarities by combining gene ontology annotations and gene co-function networks.通过结合基因本体注释和基因共功能网络来测量语义相似性。
BMC Bioinformatics. 2015 Feb 14;16:44. doi: 10.1186/s12859-015-0474-7.

引用本文的文献

1
Structural Changes in Gene Ontology Reveal Modular and Complex Representations of Biological Function.基因本体论中的结构变化揭示了生物功能的模块化和复杂表征。
Mol Biol Evol. 2025 Jun 4;42(6). doi: 10.1093/molbev/msaf148.
2
A longitudinal analysis of function annotations of the human proteome reveals consistently high biases.对人类蛋白质组功能注释的纵向分析揭示了始终存在的高度偏差。
Database (Oxford). 2025 May 7;2025. doi: 10.1093/database/baaf036.
3
Model Organisms in Aging Research: Evolution of Database Annotation and Ortholog Discovery.衰老研究中的模式生物:数据库注释与直系同源物发现的演变
Genes (Basel). 2024 Dec 25;16(1):8. doi: 10.3390/genes16010008.
4
A Bayesian hierarchical hidden Markov model for clustering and gene selection: Application to kidney cancer gene expression data.一种用于聚类和基因选择的贝叶斯分层隐马尔可夫模型:在肾癌基因表达数据中的应用。
Biom J. 2024 Jun;66(4):e2300173. doi: 10.1002/bimj.202300173.
5
FAS: assessing the similarity between proteins using multi-layered feature architectures.FAS:使用多层特征架构评估蛋白质之间的相似性。
Bioinformatics. 2023 May 4;39(5). doi: 10.1093/bioinformatics/btad226.
6
Defining the extent of gene function using ROC curvature.使用 ROC 曲率定义基因功能的范围。
Bioinformatics. 2022 Dec 13;38(24):5390-5397. doi: 10.1093/bioinformatics/btac692.
7
Non-synonymous to synonymous substitutions suggest that orthologs tend to keep their functions, while paralogs are a source of functional novelty.非同义替换与同义替换表明,直系同源物倾向于保持其功能,而旁系同源物是功能新颖性的来源。
PeerJ. 2022 Aug 31;10:e13843. doi: 10.7717/peerj.13843. eCollection 2022.
8
Predicting functions of maize proteins using graph convolutional network.利用图卷积网络预测玉米蛋白的功能。
BMC Bioinformatics. 2020 Dec 16;21(Suppl 16):420. doi: 10.1186/s12859-020-03745-6.
9
Recurrent sequence evolution after independent gene duplication.独立基因重复后的序列反复进化。
BMC Evol Biol. 2020 Aug 8;20(1):98. doi: 10.1186/s12862-020-01660-1.
10
The ortholog conjecture revisited: the value of orthologs and paralogs in function prediction.重新审视直系同源推断假说:直系同源物和旁系同源物在功能预测中的价值。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i219-i226. doi: 10.1093/bioinformatics/btaa468.

本文引用的文献

1
The Mineralocorticoid Receptor-How to Get Away with Promiscuity: Evolution of Hormone-Receptor Complexity by Molecular Exploitation. Science 312: 97-101, 2006.盐皮质激素受体——如何在滥交中全身而退:通过分子利用实现激素受体复杂性的进化。《科学》312卷:97 - 101页,2006年。
J Am Soc Nephrol. 2006 Jul;17(7):1759-1764. doi: 10.1681/01.asn.0000926836.46869.e5.
2
Testing the ortholog conjecture with comparative functional genomic data from mammals.利用来自哺乳动物的比较功能基因组数据检验直系同源假说。
PLoS Comput Biol. 2011 Jun;7(6):e1002073. doi: 10.1371/journal.pcbi.1002073. Epub 2011 Jun 9.
3
When orthologs diverge between human and mouse.当人类和老鼠的同源基因发生分歧时。
Brief Bioinform. 2011 Sep;12(5):436-41. doi: 10.1093/bib/bbr031. Epub 2011 Jun 15.
4
The Mouse Genome Database (MGD): premier model organism resource for mammalian genomics and genetics.小鼠基因组数据库(MGD):哺乳动物基因组学和遗传学的首要模式生物资源。
Nucleic Acids Res. 2011 Jan;39(Database issue):D842-8. doi: 10.1093/nar/gkq1008. Epub 2010 Nov 3.
5
Protein evolution by molecular tinkering: diversification of the nuclear receptor superfamily from a ligand-dependent ancestor.分子杂合进化:核受体超家族从配体依赖的祖先多样化而来。
PLoS Biol. 2010 Oct 5;8(10):e1000497. doi: 10.1371/journal.pbio.1000497.
6
The MAP kinase signaling cascades: a system of hundreds of components regulates a diverse array of physiological functions.丝裂原活化蛋白激酶信号级联反应:一个由数百个成分组成的系统调节着各种各样的生理功能。
Methods Mol Biol. 2010;661:3-38. doi: 10.1007/978-1-60761-795-2_1.
7
The Gene Ontology in 2010: extensions and refinements.2010 年的基因本体论:扩展和改进。
Nucleic Acids Res. 2010 Jan;38(Database issue):D331-5. doi: 10.1093/nar/gkp1018. Epub 2009 Nov 17.
8
How confident can we be that orthologs are similar, but paralogs differ?我们对直系同源物相似而旁系同源物不同这一点能有多大的信心呢?
Trends Genet. 2009 May;25(5):210-6. doi: 10.1016/j.tig.2009.03.004. Epub 2009 Apr 14.
9
The GOA database in 2009--an integrated Gene Ontology Annotation resource.2009年的基因本体注释(GOA)数据库——一个整合的基因本体注释资源。
Nucleic Acids Res. 2009 Jan;37(Database issue):D396-403. doi: 10.1093/nar/gkn803. Epub 2008 Oct 27.
10
Gene Ontology annotations: what they mean and where they come from.基因本体论注释:它们的含义及来源
BMC Bioinformatics. 2008 Apr 29;9 Suppl 5(Suppl 5):S2. doi: 10.1186/1471-2105-9-S5-S2.