Inference of gain and loss events from phyletic patterns using stochastic mapping and maximum parsimony--a simulation study.

Affiliation

Department of Cell Research and Immunology, Tel Aviv University, Tel Aviv, Israel.

Publication information

Genome Biol Evol. 2011;3:1265-75. doi: 10.1093/gbe/evr101. Epub 2011 Oct 4.

DOI: 10.1093/gbe/evr101
PMID: 21971516
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3215202/
Abstract

Bacterial evolution is characterized by frequent gain and loss events of gene families. These events can be inferred from phyletic pattern data, a compact representation of gene family repertoire across multiple genomes. The maximum parsimony paradigm is a classical and prevalent approach for the detection of gene family gains and losses mapped on specific branches. We and others have previously developed probabilistic models that aim to account for the gain and loss stochastic dynamics. These models are a critical component of a methodology termed stochastic mapping, in which probabilities and expectations of gain and loss events are estimated for each branch of an underlying phylogenetic tree. In this work, we present a phyletic pattern simulator in which the gain and loss dynamics are assumed to follow a continuous-time Markov chain along the tree. Various models and options are implemented to make the simulation software useful for a large number of studies in which binary (presence/absence) data are analyzed. Using this simulation software, we compared the ability of the maximum parsimony and the stochastic mapping approaches to accurately detect gain and loss events along the tree. Our simulations cover a large array of evolutionary scenarios in terms of the propensities for gene family gains and losses and the variability of these propensities among gene families. Although both methods obtain relatively low false positive rates in all simulation schemes, stochastic mapping outperforms maximum parsimony in terms of true positive rates. We further studied the factors that influence the performance of both methods. We find, for example, that the accuracy of maximum parsimony inference is substantially reduced when the goal is to map gain and loss events along internal branches of the phylogenetic tree. Furthermore, the accuracy of stochastic mapping is reduced with smaller data sets (limited number of gene families) due to unreliable estimation of branch lengths. Our simulator and simulation results are additionally relevant for the analysis of other types of binary-coded data, such as the existence of homologous restriction sites, gaps, and introns, to name a few. Both the simulation software and the inference methodology are freely available at a user-friendly server: http://gloome.tau.ac.il/.
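As a rough illustration of what simulating a phyletic pattern under a continuous-time Markov chain can look like, the Python sketch below evolves one gene family along a small tree and records the true gain and loss events on each branch. It is not the GLOOME simulator: the toy tree, the function names, the single constant gain and loss rates, and the choice to draw the root state from the stationary distribution are all assumptions made for this example, whereas the abstract describes several models and rate variability among gene families.

```python
import random

# Hypothetical toy tree (not from the paper): child -> (parent, branch length).
TOY_TREE = {
    "n1": ("root", 0.1),
    "D": ("root", 0.8),
    "n2": ("n1", 0.2),
    "C": ("n1", 0.5),
    "A": ("n2", 0.3),
    "B": ("n2", 0.3),
}


def simulate_branch(state, length, gain, loss, rng):
    """Run the two-state chain along one branch; return (end state, events)."""
    events = []
    elapsed = 0.0
    while True:
        rate = gain if state == 0 else loss  # rate of leaving the current state
        if rate <= 0:
            break  # absorbing state: no further events are possible
        elapsed += rng.expovariate(rate)  # exponential waiting time
        if elapsed >= length:
            break  # the next event would fall beyond the end of the branch
        events.append("gain" if state == 0 else "loss")
        state = 1 - state
    return state, events


def simulate_family(tree, gain, loss, rng, root="root"):
    """Simulate one gene family; return node states and per-branch events."""
    children = {}
    for child, (parent, length) in tree.items():
        children.setdefault(parent, []).append((child, length))
    # Assumption: the root state follows the stationary distribution,
    # where presence has probability gain / (gain + loss).
    p_present = gain / (gain + loss)
    states = {root: 1 if rng.random() < p_present else 0}
    events = {}
    stack = [root]
    while stack:  # visit parents before their children
        node = stack.pop()
        for child, length in children.get(node, []):
            states[child], events[child] = simulate_branch(
                states[node], length, gain, loss, rng
            )
            stack.append(child)
    return states, events


def phyletic_pattern(tree, states):
    """Keep only leaf states: the presence/absence pattern across genomes."""
    parents = {parent for parent, _ in tree.values()}
    return {leaf: states[leaf] for leaf in sorted(tree) if leaf not in parents}


if __name__ == "__main__":
    rng = random.Random(0)
    states, events = simulate_family(TOY_TREE, gain=0.5, loss=1.0, rng=rng)
    print(phyletic_pattern(TOY_TREE, states))
    print({branch: ev for branch, ev in events.items() if ev})
```

Because the simulation keeps the true events for every branch, inferred events from maximum parsimony or stochastic mapping could be scored against them as true or false positives, which is the kind of comparison the abstract reports.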
