• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于保守概况构建的基因组树。

Genome trees from conservation profiles.

作者信息

Tekaia Fredj, Yeramian Edouard

机构信息

Unité de Génétique Moléculaire des Levures (URA 2171 CNRS and UFR927 Univ. P.M. Curie), Institut Pasteur, Paris, France.

出版信息

PLoS Comput Biol. 2005 Dec;1(7):e75. doi: 10.1371/journal.pcbi.0010075. Epub 2005 Dec 16.

DOI:10.1371/journal.pcbi.0010075
PMID:16362074
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1314884/
Abstract

The concept of the genome tree depends on the potential evolutionary significance in the clustering of species according to similarities in the gene content of their genomes. In this respect, genome trees have often been identified with species trees. With the rapid expansion of genome sequence data it becomes of increasing importance to develop accurate methods for grasping global trends for the phylogenetic signals that mutually link the various genomes. We therefore derive here the methodological concept of genome trees based on protein conservation profiles in multiple species. The basic idea in this derivation is that the multi-component "presence-absence" protein conservation profiles permit tracking of common evolutionary histories of genes across multiple genomes. We show that a significant reduction in informational redundancy is achieved by considering only the subset of distinct conservation profiles. Beyond these basic ideas, we point out various pitfalls and limitations associated with the data handling, paving the way for further improvements. As an illustration for the methods, we analyze a genome tree based on the above principles, along with a series of other trees derived from the same data and based on pair-wise comparisons (ancestral duplication-conservation and shared orthologs). In all trees we observe a sharp discrimination between the three primary domains of life: Bacteria, Archaea, and Eukarya. The new genome tree, based on conservation profiles, displays a significant correspondence with classically recognized taxonomical groupings, along with a series of departures from such conventional clusterings.

摘要

基因组树的概念取决于根据物种基因组中基因含量的相似性对物种进行聚类的潜在进化意义。在这方面,基因组树常常与物种树等同起来。随着基因组序列数据的迅速扩展,开发准确的方法以把握相互关联的各种基因组的系统发育信号的全球趋势变得越来越重要。因此,我们在此基于多个物种的蛋白质保守概况推导基因组树的方法概念。这种推导的基本思想是,多组分的“存在 - 缺失”蛋白质保守概况允许追踪多个基因组中基因的共同进化历史。我们表明,仅考虑不同保守概况的子集就能显著减少信息冗余。除了这些基本思想,我们指出了与数据处理相关的各种陷阱和局限性,为进一步改进铺平道路。作为该方法的一个示例,我们基于上述原则分析了一个基因组树,以及一系列从相同数据并基于成对比较(祖先复制 - 保守和共享直系同源物)得出的其他树。在所有的树中,我们都观察到生命的三个主要域(细菌、古菌和真核生物)之间有明显的区分。基于保守概况的新基因组树与经典认可的分类分组有显著的对应关系,同时也与这些传统聚类有一系列的偏离。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acb8/1323463/d8548e15b169/pcbi.0010075.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acb8/1323463/e21ca93afcdc/pcbi.0010075.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acb8/1323463/34c91739445b/pcbi.0010075.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acb8/1323463/de6bbca4947c/pcbi.0010075.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acb8/1323463/9c7783faf8c7/pcbi.0010075.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acb8/1323463/8bb137e12acf/pcbi.0010075.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acb8/1323463/0decbe098737/pcbi.0010075.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acb8/1323463/d8548e15b169/pcbi.0010075.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acb8/1323463/e21ca93afcdc/pcbi.0010075.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acb8/1323463/34c91739445b/pcbi.0010075.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acb8/1323463/de6bbca4947c/pcbi.0010075.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acb8/1323463/9c7783faf8c7/pcbi.0010075.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acb8/1323463/8bb137e12acf/pcbi.0010075.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acb8/1323463/0decbe098737/pcbi.0010075.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acb8/1323463/d8548e15b169/pcbi.0010075.g007.jpg

相似文献

1
Genome trees from conservation profiles.基于保守概况构建的基因组树。
PLoS Comput Biol. 2005 Dec;1(7):e75. doi: 10.1371/journal.pcbi.0010075. Epub 2005 Dec 16.
2
The genomic tree as revealed from whole proteome comparisons.从全蛋白质组比较中揭示的基因组树。
Genome Res. 1999 Jun;9(6):550-7.
3
On the quality of tree-based protein classification.论基于树的蛋白质分类的质量。
Bioinformatics. 2005 May 1;21(9):1876-90. doi: 10.1093/bioinformatics/bti244. Epub 2005 Jan 12.
4
A configuration space of homologous proteins conserving mutual information and allowing a phylogeny inference based on pair-wise Z-score probabilities.同源蛋白质的一种构象空间,其保留互信息并允许基于成对Z分数概率进行系统发育推断。
BMC Bioinformatics. 2005 Mar 10;6:49. doi: 10.1186/1471-2105-6-49.
5
Universal trees based on large combined protein sequence data sets.基于大型综合蛋白质序列数据集的通用树。
Nat Genet. 2001 Jul;28(3):281-5. doi: 10.1038/90129.
6
Analysis of phenetic trees based on metabolic capabilites across the three domains of life.基于生命三个域的代谢能力对表型树的分析。
J Mol Biol. 2004 Jul 9;340(3):491-512. doi: 10.1016/j.jmb.2004.04.059.
7
From phylogenetics to phylogenomics: the evolutionary relationships of insect endosymbiotic gamma-Proteobacteria as a test case.从系统发育学到系统基因组学:以昆虫内共生γ-变形菌的进化关系为例
Syst Biol. 2007 Feb;56(1):1-16. doi: 10.1080/10635150601109759.
8
Harvesting evolutionary signals in a forest of prokaryotic gene trees.在原核基因树森林中提取进化信号。
Mol Biol Evol. 2011 Apr;28(4):1393-405. doi: 10.1093/molbev/msq323. Epub 2010 Dec 20.
9
Maximum likelihood estimates of species trees: how accuracy of phylogenetic inference depends upon the divergence history and sampling design.最大似然估计物种树:系统发育推断的准确性如何取决于分歧历史和采样设计。
Syst Biol. 2009 Oct;58(5):501-8. doi: 10.1093/sysbio/syp045. Epub 2009 Aug 20.
10
Reconciling gene and genome duplication events: using multiple nuclear gene families to infer the phylogeny of the aquatic plant family Pontederiaceae.协调基因和基因组复制事件:利用多个核基因家族推断水生植物家族水鳖科的系统发育。
Mol Biol Evol. 2011 Nov;28(11):3009-18. doi: 10.1093/molbev/msr119. Epub 2011 Jun 1.

引用本文的文献

1
Genome Data Exploration Using Correspondence Analysis.使用对应分析进行基因组数据探索。
Bioinform Biol Insights. 2016 Jun 7;10:59-72. doi: 10.4137/BBI.S39614. eCollection 2016.
2
Inferring Orthologs: Open Questions and Perspectives.推断直系同源基因:未解决的问题与展望
Genomics Insights. 2016 Feb 25;9:17-28. doi: 10.4137/GEI.S37925. eCollection 2016.
3
Origin of eukaryotes from within archaea, archaeal eukaryome and bursts of gene gain: eukaryogenesis just made easier?真核生物起源于古菌内部、古菌真核基因组与基因获得爆发:真核生物起源是否变得更简单了?

本文引用的文献

1
Hemiascomycetous yeasts at the forefront of comparative genomics.处于比较基因组学前沿的半子囊菌酵母。
Curr Opin Genet Dev. 2005 Dec;15(6):614-20. doi: 10.1016/j.gde.2005.09.005. Epub 2005 Sep 26.
2
Genome trees and the nature of genome evolution.基因组树与基因组进化的本质。
Annu Rev Microbiol. 2005;59:191-209. doi: 10.1146/annurev.micro.59.030804.121233.
3
Comparative analyses of fundamental differences in membrane transport capabilities in prokaryotes and eukaryotes.原核生物和真核生物膜转运能力基本差异的比较分析。
Philos Trans R Soc Lond B Biol Sci. 2015 Sep 26;370(1678):20140333. doi: 10.1098/rstb.2014.0333.
4
Dissemination of cephalosporin resistance genes between Escherichia coli strains from farm animals and humans by specific plasmid lineages.特定质粒谱系在农场动物和人类来源的大肠杆菌菌株之间传播头孢菌素抗性基因。
PLoS Genet. 2014 Dec 18;10(12):e1004776. doi: 10.1371/journal.pgen.1004776. eCollection 2014 Dec.
5
Plasmid flux in Escherichia coli ST131 sublineages, analyzed by plasmid constellation network (PLACNET), a new method for plasmid reconstruction from whole genome sequences.通过质粒星座网络(PLACNET,一种从全基因组序列重建质粒的新方法)分析大肠杆菌ST131亚系中的质粒通量。
PLoS Genet. 2014 Dec 18;10(12):e1004766. doi: 10.1371/journal.pgen.1004766. eCollection 2014 Dec.
6
The dispersed archaeal eukaryome and the complex archaeal ancestor of eukaryotes.分散的古菌真核生物组和真核生物的复杂古菌祖先。
Cold Spring Harb Perspect Biol. 2014 Apr 1;6(4):a016188. doi: 10.1101/cshperspect.a016188.
7
Hierarchical partitioning of metazoan protein conservation profiles provides new functional insights.后生动物蛋白质保守性图谱的层次划分提供了新的功能见解。
PLoS One. 2014 Mar 4;9(3):e90282. doi: 10.1371/journal.pone.0090282. eCollection 2014.
8
Detection of genomic idiosyncrasies using fuzzy phylogenetic profiles.使用模糊系统发育轮廓检测基因组特征。
PLoS One. 2013;8(1):e52854. doi: 10.1371/journal.pone.0052854. Epub 2013 Jan 14.
9
Meta-analysis of general bacterial subclades in whole-genome phylogenies using tree topology profiling.基于树拓扑特征分析的全基因组系统发育中常见细菌亚群的荟萃分析。
Evol Bioinform Online. 2012;8:489-525. doi: 10.4137/EBO.S9642. Epub 2012 Aug 7.
10
Standard operating procedure for computing pangenome trees.计算泛基因组树的标准操作程序。
Stand Genomic Sci. 2010 Jan 28;2(1):135-41. doi: 10.4056/sigs.38923.
PLoS Comput Biol. 2005 Aug;1(3):e27. doi: 10.1371/journal.pcbi.0010027. Epub 2005 Aug 19.
4
The net of life: reconstructing the microbial phylogenetic network.生命之网:重建微生物系统发育网络
Genome Res. 2005 Jul;15(7):954-9. doi: 10.1101/gr.3666505. Epub 2005 Jun 17.
5
Do orthologous gene phylogenies really support tree-thinking?直系同源基因系统发育树真的支持树形思维吗?
BMC Evol Biol. 2005 May 24;5:33. doi: 10.1186/1471-2148-5-33.
6
Phylogenomics and the reconstruction of the tree of life.系统发育基因组学与生命之树的重建
Nat Rev Genet. 2005 May;6(5):361-75. doi: 10.1038/nrg1603.
7
Measuring genome conservation across taxa: divided strains and united kingdoms.跨分类群测量基因组保守性:分裂的菌株与联合的王国
Nucleic Acids Res. 2005 Jan 28;33(2):616-21. doi: 10.1093/nar/gki181. Print 2005.
8
Phylogeny determined by protein domain content.由蛋白质结构域含量确定的系统发育。
Proc Natl Acad Sci U S A. 2005 Jan 11;102(2):373-8. doi: 10.1073/pnas.0408810102. Epub 2005 Jan 3.
9
Continued colonization of the human genome by mitochondrial DNA.线粒体DNA对人类基因组的持续定殖。
PLoS Biol. 2004 Sep;2(9):E273. doi: 10.1371/journal.pbio.0020273. Epub 2004 Sep 7.
10
The ring of life provides evidence for a genome fusion origin of eukaryotes.生命之环为真核生物的基因组融合起源提供了证据。
Nature. 2004 Sep 9;431(7005):152-5. doi: 10.1038/nature02848.