• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

重建生命三域中高度异质的基因含量进化。

Reconstruction of highly heterogeneous gene-content evolution across the three domains of life.

作者信息

Iwasaki Wataru, Takagi Toshihisa

机构信息

Department of Computational Biology, Graduate School of Frontier Sciences, University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8568, Japan.

出版信息

Bioinformatics. 2007 Jul 1;23(13):i230-9. doi: 10.1093/bioinformatics/btm165.

DOI:10.1093/bioinformatics/btm165
PMID:17646301
Abstract

MOTIVATION

Reconstruction of gene-content evolutionary history is fundamental in studying the evolution of genomes and biological systems. To reconstruct plausible evolutionary history, rates of gene gain/loss should be estimated by considering the high level of heterogeneity: e.g. genome duplication and parasitization, respectively, result in high rates of gene gain and loss. Gene-content evolution reconstruction methods that consider this heterogeneity and that are both effective in estimating the rates of gene gain and loss and sufficiently efficient to analyze abundant genomic data had not been developed.

RESULTS

An effective and efficient method for reconstructing heterogeneous gene-content evolution was developed. This method comprises analytically integrable modeling of gene-content evolution, analytical formulation of expectation-maximization and efficient calculation of marginal likelihood using an inside-outside-like algorithm. Simulation tests on the scale of hundreds of genomes showed that both the gene gain/loss rates and evolutionary history were effectively estimated within a few days of computational time. Subsequently, this algorithm was applied to an actual data set of nearly 200 genomes to reconstruct the heterogeneous gene-content evolution across the three domains of life. The reconstructed history, which contained several features consistent with biological observations, showed that the trends of gene-content evolution were not only drastically different between prokaryotes and eukaryotes, but were highly variable within each form of life. The results suggest that heterogeneity should be considered in studies of the evolution of gene content, genomes and biological systems.

AVAILABILITY

An R script that implements the algorithm is available upon request.

摘要

动机

基因含量进化历史的重建是研究基因组和生物系统进化的基础。为了重建合理的进化历史,应通过考虑高度的异质性来估计基因获得/丢失的速率:例如,基因组复制和寄生作用分别导致高基因获得率和高基因丢失率。尚未开发出考虑这种异质性且在估计基因获得和丢失速率方面有效且足够高效以分析大量基因组数据的基因含量进化重建方法。

结果

开发了一种有效且高效的重建异质基因含量进化的方法。该方法包括基因含量进化的解析可积建模、期望最大化的解析公式以及使用类似内外算法的边际似然的高效计算。在数百个基因组规模上的模拟测试表明,在几天的计算时间内就能有效地估计基因获得/丢失速率和进化历史。随后,该算法被应用于近200个基因组的实际数据集,以重建生命三个域的异质基因含量进化。重建的历史包含了几个与生物学观察一致的特征,表明基因含量进化的趋势不仅在原核生物和真核生物之间有很大差异,而且在每种生命形式内部也高度可变。结果表明,在基因含量、基因组和生物系统进化的研究中应考虑异质性。

可用性

可根据要求提供实现该算法的R脚本。

相似文献

1
Reconstruction of highly heterogeneous gene-content evolution across the three domains of life.重建生命三域中高度异质的基因含量进化。
Bioinformatics. 2007 Jul 1;23(13):i230-9. doi: 10.1093/bioinformatics/btm165.
2
A simulation test bed for hypotheses of genome evolution.用于基因组进化假说的模拟试验台。
Bioinformatics. 2007 Apr 1;23(7):825-31. doi: 10.1093/bioinformatics/btm024. Epub 2007 Jan 31.
3
[Homology and evolution of gene orders: simulation and reconstruction of the evolutionary process].[基因顺序的同源性与进化:进化过程的模拟与重建]
Genetika. 1997 Jan;33(1):31-9.
4
The evolutionary forest algorithm.进化森林算法
Bioinformatics. 2007 Aug 1;23(15):1962-8. doi: 10.1093/bioinformatics/btm264. Epub 2007 May 22.
5
Maximum likelihood of phylogenetic networks.系统发育网络的最大似然法
Bioinformatics. 2006 Nov 1;22(21):2604-11. doi: 10.1093/bioinformatics/btl452. Epub 2006 Aug 23.
6
Ancestral animal genomes reconstruction.祖先动物基因组重建。
Curr Opin Immunol. 2007 Oct;19(5):542-6. doi: 10.1016/j.coi.2007.06.009. Epub 2007 Aug 15.
7
Automatic genome-wide reconstruction of phylogenetic gene trees.系统发育基因树的全基因组自动重建
Bioinformatics. 2007 Jul 1;23(13):i549-58. doi: 10.1093/bioinformatics/btm193.
8
Genomic analyses and the origin of the eukaryotes.基因组分析与真核生物的起源
Chem Biodivers. 2007 Nov;4(11):2631-8. doi: 10.1002/cbdv.200790215.
9
Innovation from reduction: gene loss, domain loss and sequence divergence in genome evolution.简约带来的创新:基因组进化中的基因丢失、结构域丢失和序列分歧
Appl Bioinformatics. 2003;2(1):13-34.
10
Inferring horizontal transfers in the presence of rearrangements by the minimum evolution criterion.在存在重排的情况下,依据最小进化标准推断水平转移。
Bioinformatics. 2008 Mar 15;24(6):826-32. doi: 10.1093/bioinformatics/btn024. Epub 2008 Jan 18.

引用本文的文献

1
Mirage: estimation of ancestral gene-copy numbers by considering different evolutionary patterns among gene families.Mirage:通过考虑基因家族间不同进化模式来估计祖先基因拷贝数
Bioinform Adv. 2021 Jul 30;1(1):vbab014. doi: 10.1093/bioadv/vbab014. eCollection 2021.
2
Machine learning enables prediction of metabolic system evolution in bacteria.机器学习可用于预测细菌代谢系统的进化。
Sci Adv. 2023 Jan 13;9(2):eadc9130. doi: 10.1126/sciadv.adc9130. Epub 2023 Jan 11.
3
Dynamic evolution of great ape Y chromosomes.巨猿 Y 染色体的动态进化。
Proc Natl Acad Sci U S A. 2020 Oct 20;117(42):26273-26280. doi: 10.1073/pnas.2001749117. Epub 2020 Oct 5.
4
Ancestral state reconstruction of metabolic pathways across pangenome ensembles.泛基因组集合中代谢途径的祖先状态重建。
Microb Genom. 2020 Nov;6(11). doi: 10.1099/mgen.0.000429.
5
A Fast Likelihood Method to Reconstruct and Visualize Ancestral Scenarios.一种快速的似然方法,用于重建和可视化祖先场景。
Mol Biol Evol. 2019 Sep 1;36(9):2069-2085. doi: 10.1093/molbev/msz131.
6
Metagenomics and Bioinformatics in Microbial Ecology: Current Status and Beyond.微生物生态学中的宏基因组学与生物信息学:现状与展望
Microbes Environ. 2016 Sep 29;31(3):204-12. doi: 10.1264/jsme2.ME16024. Epub 2016 Jul 5.
7
Robustness of birth-death and gain models for inferring evolutionary events.用于推断进化事件的生死模型和增益模型的稳健性。
BMC Genomics. 2014;15 Suppl 6(Suppl 6):S9. doi: 10.1186/1471-2164-15-S6-S9. Epub 2014 Oct 17.
8
Current understanding of the formation and adaptation of metabolic systems based on network theory.基于网络理论对代谢系统形成与适应性的当前理解。
Metabolites. 2012 Jul 12;2(3):429-57. doi: 10.3390/metabo2030429.
9
A Bayesian model for gene family evolution.贝叶斯模型在基因家族进化研究中的应用。
BMC Bioinformatics. 2011 Nov 1;12:426. doi: 10.1186/1471-2105-12-426.
10
On the artefactual parasitic eubacteria clan in conditioned logdet phylogenies: heterotachy and ortholog identification artefacts as explanations.条件 logdet 系统发育中人为寄生真细菌类群:作为解释的异时性和直系同源鉴定人为产物。
BMC Evol Biol. 2010 Nov 9;10:343. doi: 10.1186/1471-2148-10-343.