• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

常规的生物序列模拟会导致对多基因座系统发生分析的有偏评估。

Conventional simulation of biological sequences leads to a biased assessment of multi-Loci phylogenetic analysis.

机构信息

Department of Genetics, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil.

出版信息

Evol Bioinform Online. 2013 Aug 13;9:317-25. doi: 10.4137/EBO.S12483. eCollection 2013.

DOI:10.4137/EBO.S12483
PMID:23997573
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3748088/
Abstract

Phylogenetic analysis based on multi-loci data sets is performed by means of supermatrix (SM) or supertree (ST) approaches. Recently, methods that rely on species tree (SppT) inference by the multi-species coalescence have also been implemented to tackle this problem. Generally, the relative performance of these three major strategies has been calculated using simulation of biological sequences. However, sequence simulation may not entirely replicate the complexity of the evolutionary process. Thus, issues regarding the usefulness of in silico sequences in studying the performance of phylogenetic methods have been raised. Here, we used both classical simulation and empirical data to investigate the relative performance of ST, SM, and the SppT methods. SM analyses performed better than the ST and SppTs in simulations, but not in empirical analyses where some ST methods significantly outperformed the others. Additionally, SM was the only method that was robust under evolutionary model violations in simulations. These results show that conventional biological sequence simulation cannot adequately resolve which method is most efficient to recover the SppT. In such simulations, the SM approach recovers the established phylogeny in most instances, whereas the performance of the ST and SppT methods is downgraded in simpler cases. When compared, the analyses based on empirical and simulated sequences yielded largely inconsistent results, with the latter showing a bias towards a seemingly superiority of SM approaches.

摘要

基于多基因数据集的系统发育分析是通过超矩阵(SM)或超树(ST)方法来实现的。最近,还实施了依赖多物种合并的种系树(SppT)推断的方法来解决这个问题。一般来说,这些三种主要策略的相对性能是通过生物序列的模拟来计算的。然而,序列模拟可能不完全复制进化过程的复杂性。因此,关于在研究系统发育方法性能时使用计算机序列的有用性的问题已经被提出。在这里,我们使用经典模拟和实证数据来研究 ST、SM 和 SppT 方法的相对性能。SM 分析在模拟中比 ST 和 SppT 表现更好,但在实证分析中,一些 ST 方法的表现明显优于其他方法。此外,SM 是唯一一种在模拟中对进化模型违反具有稳健性的方法。这些结果表明,传统的生物序列模拟不能充分解决哪种方法最有效地恢复 SppT 的问题。在这种模拟中,SM 方法在大多数情况下恢复了已建立的系统发育,而 ST 和 SppT 方法的性能在更简单的情况下会降低。相比之下,基于实证和模拟序列的分析产生了很大不一致的结果,后者似乎偏向于 SM 方法的优势。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f18/3748088/7e59812293e9/ebo-9-2013-317f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f18/3748088/312de67a6794/ebo-9-2013-317f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f18/3748088/edb2e7a505ad/ebo-9-2013-317f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f18/3748088/f7b124fcc8cc/ebo-9-2013-317f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f18/3748088/7e59812293e9/ebo-9-2013-317f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f18/3748088/312de67a6794/ebo-9-2013-317f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f18/3748088/edb2e7a505ad/ebo-9-2013-317f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f18/3748088/f7b124fcc8cc/ebo-9-2013-317f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f18/3748088/7e59812293e9/ebo-9-2013-317f4.jpg

相似文献

1
Conventional simulation of biological sequences leads to a biased assessment of multi-Loci phylogenetic analysis.常规的生物序列模拟会导致对多基因座系统发生分析的有偏评估。
Evol Bioinform Online. 2013 Aug 13;9:317-25. doi: 10.4137/EBO.S12483. eCollection 2013.
2
The gene tree delusion.基因树错觉
Mol Phylogenet Evol. 2016 Jan;94(Pt A):1-33. doi: 10.1016/j.ympev.2015.07.018. Epub 2015 Jul 31.
3
A likelihood look at the supermatrix-supertree controversy.对超级矩阵-超级树之争的似然性审视。
Gene. 2009 Jul 15;441(1-2):119-25. doi: 10.1016/j.gene.2008.04.002. Epub 2008 Apr 10.
4
Partitioned coalescence support reveals biases in species-tree methods and detects gene trees that determine phylogenomic conflicts.分区凝聚支持揭示了种系树方法中的偏差,并检测到决定系统发育冲突的基因树。
Mol Phylogenet Evol. 2019 Oct;139:106539. doi: 10.1016/j.ympev.2019.106539. Epub 2019 Jun 18.
5
Phylogenetic analysis at deep timescales: unreliable gene trees, bypassed hidden support, and the coalescence/concatalescence conundrum.深度时间尺度下的系统发育分析:不可靠的基因树、被忽视的隐藏支持以及合并/串联难题
Mol Phylogenet Evol. 2014 Nov;80:231-66. doi: 10.1016/j.ympev.2014.08.013. Epub 2014 Aug 22.
6
Assessing approaches for inferring species trees from multi-copy genes.评估从多拷贝基因推断物种树的方法。
Syst Biol. 2015 Mar;64(2):325-39. doi: 10.1093/sysbio/syu128. Epub 2014 Dec 23.
7
Computational Performance and Statistical Accuracy of *BEAST and Comparisons with Other Methods.BEAST的计算性能和统计准确性以及与其他方法的比较。
Syst Biol. 2016 May;65(3):381-96. doi: 10.1093/sysbio/syv118. Epub 2016 Jan 28.
8
A performance study of the impact of recombination on species tree analysis.关于重组对物种树分析影响的性能研究。
BMC Genomics. 2016 Nov 11;17(Suppl 10):785. doi: 10.1186/s12864-016-3104-5.
9
SuperFine: fast and accurate supertree estimation.SuperFine:快速准确的超级树估计。
Syst Biol. 2012 Mar;61(2):214-27. doi: 10.1093/sysbio/syr092. Epub 2011 Sep 20.
10
Impact of deep coalescence and recombination on the estimation of phylogenetic relationships among species using AFLP markers.深度合并和重组对使用AFLP标记估计物种间系统发育关系的影响。
Mol Phylogenet Evol. 2014 Jul;76:102-9. doi: 10.1016/j.ympev.2014.03.001. Epub 2014 Mar 13.

本文引用的文献

1
Species tree inference by the STAR method and its generalizations.通过STAR方法及其推广进行物种树推断。
J Comput Biol. 2013 Jan;20(1):50-61. doi: 10.1089/cmb.2012.0101.
2
A genomic approach to examine the complex evolution of laurasiatherian mammals.运用基因组方法研究劳亚兽类哺乳动物的复杂进化。
PLoS One. 2011;6(12):e28199. doi: 10.1371/journal.pone.0028199. Epub 2011 Dec 2.
3
Estimating species trees using approximate Bayesian computation.使用近似贝叶斯计算估计物种树。
Mol Phylogenet Evol. 2011 May;59(2):354-63. doi: 10.1016/j.ympev.2011.02.019. Epub 2011 Mar 21.
4
Comparing two Bayesian methods for gene tree/species tree reconstruction: simulations with incomplete lineage sorting and horizontal gene transfer.比较两种用于基因树/种系树重建的贝叶斯方法:不完全谱系分选和水平基因转移的模拟。
Syst Biol. 2011 May;60(3):261-75. doi: 10.1093/sysbio/syr003. Epub 2011 Mar 2.
5
Accuracy of phylogeny reconstruction methods combining overlapping gene data sets.结合重叠基因数据集的系统发育重建方法的准确性。
Algorithms Mol Biol. 2010 Dec 6;5:37. doi: 10.1186/1748-7188-5-37.
6
The accuracy of species tree estimation under simulation: a comparison of methods.基于模拟的物种树估计精度:方法比较。
Syst Biol. 2011 Mar;60(2):126-37. doi: 10.1093/sysbio/syq073. Epub 2010 Nov 18.
7
Mammalian evolution may not be strictly bifurcating.哺乳动物的进化可能不是严格二分的。
Mol Biol Evol. 2010 Dec;27(12):2804-16. doi: 10.1093/molbev/msq166. Epub 2010 Jun 29.
8
Estimating species phylogenies using coalescence times among sequences.使用序列间的合并时间估计物种系统发育。
Syst Biol. 2009 Oct;58(5):468-77. doi: 10.1093/sysbio/syp031. Epub 2009 Jul 16.
9
Phybase: an R package for species tree analysis.Phybase:用于物种树分析的 R 包。
Bioinformatics. 2010 Apr 1;26(7):962-3. doi: 10.1093/bioinformatics/btq062. Epub 2010 Feb 15.
10
A simulation study comparing supertree and combined analysis methods using SMIDGen.一项使用SMIDGen比较超树和联合分析方法的模拟研究。
Algorithms Mol Biol. 2010 Jan 4;5:8. doi: 10.1186/1748-7188-5-8.