• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从直系同源基因对推断层次同源物组。

Inferring hierarchical orthologous groups from orthologous gene pairs.

机构信息

Department of Computer Science, ETH Zurich, Zurich, Switzerland.

出版信息

PLoS One. 2013;8(1):e53786. doi: 10.1371/journal.pone.0053786. Epub 2013 Jan 14.

DOI:10.1371/journal.pone.0053786
PMID:23342000
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3544860/
Abstract

Hierarchical orthologous groups are defined as sets of genes that have descended from a single common ancestor within a taxonomic range of interest. Identifying such groups is useful in a wide range of contexts, including inference of gene function, study of gene evolution dynamics and comparative genomics. Hierarchical orthologous groups can be derived from reconciled gene/species trees but, this being a computationally costly procedure, many phylogenomic databases work on the basis of pairwise gene comparisons instead ("graph-based" approach). To our knowledge, there is only one published algorithm for graph-based hierarchical group inference, but both its theoretical justification and performance in practice are as of yet largely uncharacterised. We establish a formal correspondence between the orthology graph and hierarchical orthologous groups. Based on that, we devise GETHOGs ("Graph-based Efficient Technique for Hierarchical Orthologous Groups"), a novel algorithm to infer hierarchical groups directly from the orthology graph, thus without needing gene tree inference nor gene/species tree reconciliation. GETHOGs is shown to correctly reconstruct hierarchical orthologous groups when applied to perfect input, and several extensions with stringency parameters are provided to deal with imperfect input data. We demonstrate its competitiveness using both simulated and empirical data. GETHOGs is implemented as a part of the freely-available OMA standalone package (http://omabrowser.org/standalone). Furthermore, hierarchical groups inferred by GETHOGs ("OMA HOGs") on >1,000 genomes can be interactively queried via the OMA browser (http://omabrowser.org).

摘要

层次同源群被定义为在感兴趣的分类范围内从单个共同祖先中衍生出来的一组基因。在广泛的上下文中,识别这样的群体是有用的,包括推断基因功能、研究基因进化动态和比较基因组学。层次同源群可以从调和的基因/物种树中推导出来,但这是一个计算成本很高的过程,许多基因组学数据库基于基因对的比较工作(“基于图的”方法)。据我们所知,目前只有一种用于基于图的层次分组推断的算法,但它的理论依据和实际性能在很大程度上尚未得到充分描述。我们在同源关系图和层次同源群之间建立了正式的对应关系。在此基础上,我们设计了 GETHOGs(“基于图的高效层次同源群推断技术”),这是一种直接从同源关系图推断层次分组的新算法,因此不需要基因树推断或基因/物种树调和。GETHOGs 在应用于完美输入时被证明可以正确重建层次同源群,并且提供了几个带有严格参数的扩展来处理不完美的输入数据。我们使用模拟和真实数据证明了它的竞争力。GETHOGs 作为免费提供的 OMA 独立包(http://omabrowser.org/standalone)的一部分实现。此外,GETHOGs 推断的层次组(“OMA HOGs”)可以通过 OMA 浏览器(http://omabrowser.org)在 >1000 个基因组上进行交互式查询。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e63/3544860/299691d8efb1/pone.0053786.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e63/3544860/ddfdb539b17c/pone.0053786.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e63/3544860/7370c013b2c1/pone.0053786.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e63/3544860/2956a2bb4136/pone.0053786.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e63/3544860/9849d3153719/pone.0053786.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e63/3544860/299691d8efb1/pone.0053786.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e63/3544860/ddfdb539b17c/pone.0053786.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e63/3544860/7370c013b2c1/pone.0053786.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e63/3544860/2956a2bb4136/pone.0053786.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e63/3544860/9849d3153719/pone.0053786.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e63/3544860/299691d8efb1/pone.0053786.g005.jpg

相似文献

1
Inferring hierarchical orthologous groups from orthologous gene pairs.从直系同源基因对推断层次同源物组。
PLoS One. 2013;8(1):e53786. doi: 10.1371/journal.pone.0053786. Epub 2013 Jan 14.
2
Orthologous Matrix (OMA) algorithm 2.0: more robust to asymmetric evolutionary rates and more scalable hierarchical orthologous group inference.直系同源矩阵(OMA)算法2.0:对不对称进化速率更具鲁棒性,且在分层直系同源组推断方面更具扩展性。
Bioinformatics. 2017 Jul 15;33(14):i75-i82. doi: 10.1093/bioinformatics/btx229.
3
Identifying orthologs with OMA: A primer.使用OMA鉴定直系同源基因:入门指南。
F1000Res. 2020 Jan 17;9:27. doi: 10.12688/f1000research.21508.1. eCollection 2020.
4
Inferring Orthology and Paralogy.推断直系同源和旁系同源关系。
Methods Mol Biol. 2019;1910:149-175. doi: 10.1007/978-1-4939-9074-0_5.
5
OMA 2011: orthology inference among 1000 complete genomes.OMA 2011:1000个完整基因组间的直系同源推断
Nucleic Acids Res. 2011 Jan;39(Database issue):D289-94. doi: 10.1093/nar/gkq1238. Epub 2010 Nov 27.
6
OMA orthology in 2024: improved prokaryote coverage, ancestral and extant GO enrichment, a revamped synteny viewer and more in the OMA Ecosystem.2024 年的 OMA 同源物:改进的原核生物覆盖范围、祖先和现存 GO 富集、重新设计的同线性视图以及更多的 OMA 生态系统。
Nucleic Acids Res. 2024 Jan 5;52(D1):D513-D521. doi: 10.1093/nar/gkad1020.
7
Computational methods for Gene Orthology inference.基因直系同源推断的计算方法。
Brief Bioinform. 2011 Sep;12(5):379-91. doi: 10.1093/bib/bbr030. Epub 2011 Jun 19.
8
OMA Browser--exploring orthologous relations across 352 complete genomes.OMA浏览器——探索352个完整基因组间的直系同源关系。
Bioinformatics. 2007 Aug 15;23(16):2180-2. doi: 10.1093/bioinformatics/btm295. Epub 2007 Jun 1.
9
How to build phylogenetic species trees with OMA.如何使用OMA构建系统发育物种树。
F1000Res. 2020 Jun 4;9:511. doi: 10.12688/f1000research.23790.2. eCollection 2020.
10
Orthology prediction at scalable resolution by phylogenetic tree analysis.通过系统发育树分析实现可扩展分辨率下的直系同源预测。
BMC Bioinformatics. 2007 Mar 8;8:83. doi: 10.1186/1471-2105-8-83.

引用本文的文献

1
Orthology inference at scale with FastOMA.使用FastOMA进行大规模直系同源推断。
Nat Methods. 2025 Feb;22(2):269-272. doi: 10.1038/s41592-024-02552-8. Epub 2025 Jan 3.
2
New developments for the Quest for Orthologs benchmark service.直系同源物搜索基准服务的新进展。
NAR Genom Bioinform. 2024 Dec 11;6(4):lqae167. doi: 10.1093/nargab/lqae167. eCollection 2024 Dec.
3
Inferring Interaction Networks from Transcriptomic Data: Methods and Applications.从转录组数据推断相互作用网络:方法与应用。

本文引用的文献

1
Inferring orthology and paralogy.推断直系同源和旁系同源关系。
Methods Mol Biol. 2012;855:259-79. doi: 10.1007/978-1-61779-582-4_9.
2
ALF--a simulation framework for genome evolution.ALF--一个用于基因组进化的模拟框架。
Mol Biol Evol. 2012 Apr;29(4):1115-23. doi: 10.1093/molbev/msr268. Epub 2011 Dec 8.
3
eggNOG v3.0: orthologous groups covering 1133 organisms at 41 different taxonomic ranges.eggNOG v3.0:涵盖了 41 个不同分类范围的 1133 个生物体的直系同源物组。
Methods Mol Biol. 2024;2812:11-37. doi: 10.1007/978-1-0716-3886-6_2.
4
Protein-Coding Gene Families in Prokaryote Genome Comparisons.原核生物基因组比较中的蛋白质编码基因家族。
Methods Mol Biol. 2024;2802:33-55. doi: 10.1007/978-1-0716-3838-5_2.
5
Quality assessment of gene repertoire annotations with OMArk.使用OMArk对基因库注释进行质量评估。
Nat Biotechnol. 2025 Jan;43(1):124-133. doi: 10.1038/s41587-024-02147-w. Epub 2024 Feb 21.
6
Informatic challenges and advances in illuminating the druggable proteome.阐明可成药蛋白质组中的信息学挑战与进展。
Drug Discov Today. 2024 Mar;29(3):103894. doi: 10.1016/j.drudis.2024.103894. Epub 2024 Jan 22.
7
De novo genome assembly and comparative genomics for the colonial ascidian Botrylloides violaceus.从头组装和比较基因组学分析殖民地海鞘 Botrylloides violaceus。
G3 (Bethesda). 2023 Sep 30;13(10). doi: 10.1093/g3journal/jkad181.
8
Nano-Strainer: A workflow for the identification of single-copy nuclear loci for plant systematic studies, using target capture kits and Oxford Nanopore long reads.纳米过滤器:一种使用靶向捕获试剂盒和牛津纳米孔长读长进行植物系统研究中单拷贝核基因座鉴定的工作流程。
Ecol Evol. 2023 Jul 18;13(7):e10190. doi: 10.1002/ece3.10190. eCollection 2023 Jul.
9
Insights into the Genomics of Clownfish Adaptive Radiation: The Genomic Substrate of the Diversification.小丑鱼适应性辐射的基因组学研究进展:多样化的基因组基础。
Genome Biol Evol. 2023 Jul 3;15(7). doi: 10.1093/gbe/evad088.
10
Taxonomically Restricted Genes in Bacillus may Form Clusters of Homologs and Can be Traced to a Large Reservoir of Noncoding Sequences.芽孢杆菌中分类受限的基因可能形成同源物簇,并可追溯到大量非编码序列的文库。
Genome Biol Evol. 2023 Mar 3;15(3). doi: 10.1093/gbe/evad023.
Nucleic Acids Res. 2012 Jan;40(Database issue):D284-9. doi: 10.1093/nar/gkr1060. Epub 2011 Nov 16.
4
Orthology prediction methods: a quality assessment using curated protein families.同源基因预测方法:基于已验证蛋白质家族的质量评估
Bioessays. 2011 Oct;33(10):769-80. doi: 10.1002/bies.201100062. Epub 2011 Aug 19.
5
Conceptual framework and pilot study to benchmark phylogenomic databases based on reference gene trees.基于参考基因树对系统基因组数据库进行基准测试的概念框架和初步研究。
Brief Bioinform. 2011 Sep;12(5):423-35. doi: 10.1093/bib/bbr034. Epub 2011 Jul 7.
6
Computational methods for Gene Orthology inference.基因直系同源推断的计算方法。
Brief Bioinform. 2011 Sep;12(5):379-91. doi: 10.1093/bib/bbr030. Epub 2011 Jun 19.
7
Representative proteomes: a stable, scalable and unbiased proteome set for sequence analysis and functional annotation.代表性蛋白质组:用于序列分析和功能注释的稳定、可扩展且无偏的蛋白质组集。
PLoS One. 2011 Apr 27;6(4):e18910. doi: 10.1371/journal.pone.0018910.
8
Ultra-fast sequence clustering from similarity networks with SiLiX.使用 SiLiX 从相似度网络中进行超快速序列聚类。
BMC Bioinformatics. 2011 Apr 22;12:116. doi: 10.1186/1471-2105-12-116.
9
OrthoInspector: comprehensive orthology analysis and visual exploration.OrthoInspector:全面的直系同源分析和可视化探索。
BMC Bioinformatics. 2011 Jan 10;12:11. doi: 10.1186/1471-2105-12-11.
10
OMA 2011: orthology inference among 1000 complete genomes.OMA 2011:1000个完整基因组间的直系同源推断
Nucleic Acids Res. 2011 Jan;39(Database issue):D289-94. doi: 10.1093/nar/gkq1238. Epub 2010 Nov 27.