Distinguishing terminal monophyletic groups from reticulate taxa: performance of phenetic, tree-based, and network procedures

Reeves Patrick A, Richards Christopher M

United States Department of Agriculture, Agricultural Research Service, National Center for Genetic Resources Preservation, 1111 South Mason Street, Fort Collins, Colorado 80521, USA.

Syst Biol. 2007 Apr;56(2):302-20. doi: 10.1080/10635150701324225.

Hybridization is a well-documented, natural phenomenon that is common at low taxonomic levels in the higher plants and other groups. In spite of the obvious potential for gene flow via hybridization to cause reticulation in an evolutionary tree, analytical methods based on a strictly bifurcating model of evolution have frequently been applied to data sets containing taxa known to hybridize in nature. Using simulated data, we evaluated the relative performance of phenetic, tree-based, and network approaches for distinguishing between taxa with known reticulate history and taxa that were true terminal monophyletic groups. In all methods examined, type I error (the erroneous rejection of the null hypothesis that a taxon of interest is not monophyletic) was likely during the early stages of introgressive hybridization. We used the gradual erosion of type I error with continued gene flow as a metric for assessing relative performance. Bifurcating tree-based methods performed poorly, with highly supported, incorrect topologies appearing during some phases of the simulation. Based on our model, we estimate that many thousands of gene flow events may be required in natural systems before reticulate taxa will be reliably detected using tree-based methods of phylogeny reconstruction. We conclude that the use of standard bifurcating tree-based methods to identify terminal monophyletic groups for the purposes of defining or delimiting phylogenetic species, or for prioritizing populations for conservation purposes, is difficult to justify when gene flow between sampled taxa is possible. As an alternative, we explored the use of two network methods. Minimum spanning networks performed worse than most tree-based methods and did not yield topologies that were easily interpretable as phylogenies. The performance of NeighborNet was comparable to parsimony bootstrap analysis. 
NeighborNet and reverse successive weighting were capable of identifying an ephemeral signature of reticulate evolution during the early stages of introgression by revealing conflicting phylogenetic signal. However, when gene flow was topologically complex, the conflicting phylogenetic signal revealed by these methods resulted in a high probability of type II error (inferring that a monophyletic taxon has a reticulate history). Lastly, we present a novel application of an existing nonparametric clustering procedure that, when used against a density landscape derived from principal coordinate data, showed superior performance to the tree-based and network procedures tested.
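
The question tested throughout the abstract is whether the samples of a taxon form a terminal monophyletic group on an inferred tree. The abstract does not say how this was scored, so the following is only a minimal sketch of one way to check it. It assumes a rooted tree written as nested tuples with string leaf labels; the function names, the tree encoding, and the sample labels (A1, B1, and so on) are hypothetical and are not taken from the paper.

```python
def leaves(node):
    """Return the set of leaf labels below a node.

    Assumed encoding: a leaf is a str, an internal node is a tuple of children.
    """
    if isinstance(node, str):
        return {node}
    found = set()
    for child in node:
        found |= leaves(child)
    return found


def is_monophyletic(tree, group):
    """Return True if some clade of the rooted tree holds exactly the leaves in `group`."""
    group = set(group)
    if leaves(tree) == group:
        return True
    if isinstance(tree, str):
        return False
    return any(is_monophyletic(child, group) for child in tree)


# Hypothetical tree in which the three samples of taxon A form one clade.
tree_before = ((("A1", "A2"), "A3"), (("B1", "B2"), "C1"))

# Hypothetical tree in which sample A3 groups with B1, as conflicting
# signal from gene flow might produce.
tree_after = ((("A1", "A2"), ("A3", "B1")), ("B2", "C1"))

taxon_a = {"A1", "A2", "A3"}
print(is_monophyletic(tree_before, taxon_a))  # True
print(is_monophyletic(tree_after, taxon_a))   # False
```

A check like this reports only whether the group is a clade on one tree. It says nothing about support values, which matters here because the abstract notes that tree-based methods sometimes returned highly supported but incorrect topologies.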

Similar articles

Distinguishing terminal monophyletic groups from reticulate taxa: performance of phenetic, tree-based, and network procedures.

Syst Biol. 2007 Apr;56(2):302-20. doi: 10.1080/10635150701324225.

[Foundations of the new phylogenetics].

Zh Obshch Biol. 2004 Jul-Aug;65(4):334-66.

Application of phylogenetic networks in evolutionary studies.

Mol Biol Evol. 2006 Feb;23(2):254-67. doi: 10.1093/molbev/msj030. Epub 2005 Oct 12.

Does gene flow destroy phylogenetic signal? The performance of three methods for estimating species phylogenies in the presence of gene flow.

Mol Phylogenet Evol. 2008 Dec;49(3):832-42. doi: 10.1016/j.ympev.2008.09.008. Epub 2008 Sep 21.

When being "most likely" is not enough: examining the performance of three uses of the parametric bootstrap in phylogenetics.

J Mol Evol. 2003 Feb;56(2):198-222. doi: 10.1007/s00239-002-2394-1.

Inferring phylogenetic networks by the maximum parsimony criterion: a case study.

Mol Biol Evol. 2007 Jan;24(1):324-37. doi: 10.1093/molbev/msl163. Epub 2006 Oct 26.

Detecting hybrid speciation in the presence of incomplete lineage sorting using gene tree incongruence: a model.

Theor Popul Biol. 2009 Feb;75(1):35-45. doi: 10.1016/j.tpb.2008.10.004. Epub 2008 Nov 5.

Reticulate evolution and incomplete lineage sorting among the ponderosa pines.

Mol Phylogenet Evol. 2009 Aug;52(2):498-511. doi: 10.1016/j.ympev.2009.02.011. Epub 2009 Feb 26.

SuperTRI: A new approach based on branch support analyses of multiple independent data sets for assessing reliability of phylogenetic inferences.

C R Biol. 2009 Sep;332(9):832-47. doi: 10.1016/j.crvi.2009.05.001. Epub 2009 Jun 18.

Towards building the tree of life: a simulation study for all angiosperm genera.

Syst Biol. 2005 Apr;54(2):183-96. doi: 10.1080/10635150590923254.

Cited by

Coalescent-Based Species Delimitation in Herbaceous Bamboos (Bambusoideae, Olyreae) from Eastern Brazil: Implications for Taxonomy and Conservation in a Group with Weak Morphological Divergence Coupled with Low Genetic Diversity.

Plants (Basel). 2022 Dec 26;12(1):107. doi: 10.3390/plants12010107.

A genetic legacy of introgression confounds phylogeny and biogeography in oaks.

Proc Biol Sci. 2017 May 17;284(1854). doi: 10.1098/rspb.2017.0300.

Evaluating multiple criteria for species delimitation: an empirical example using Hawaiian palms (Arecaceae: Pritchardia).

BMC Evol Biol. 2012 Feb 22;12:23. doi: 10.1186/1471-2148-12-23.

Responses to historical climate change identify contemporary threats to diversity in Dodecatheon.

Proc Natl Acad Sci U S A. 2011 Apr 5;108(14):5655-60. doi: 10.1073/pnas.1012302108. Epub 2011 Mar 14.

Natural hybridization generates mammalian lineage with species characteristics.

Proc Natl Acad Sci U S A. 2010 Jun 22;107(25):11447-52. doi: 10.1073/pnas.1000133107. Epub 2010 Jun 2.

Accurate inference of subtle population structure (and other genetic discontinuities) using principal coordinates.

PLoS One. 2009;4(1):e4269. doi: 10.1371/journal.pone.0004269. Epub 2009 Jan 27.
