Center for Clinical and Translational Science, 89 Beaumont Avenue, Given Courtyard N309, Burlington, VT 05405, USA.
Brief Bioinform. 2012 Jan;13(1):122-34. doi: 10.1093/bib/bbr014. Epub 2011 Mar 23.
Over the past two decades, there has been a long-standing debate about the impact of taxon sampling on phylogenetic inference. Studies have been based on both real and simulated data sets, within actual and theoretical contexts, and using different inference methods, to study the impact of taxon sampling. In some cases, conflicting conclusions have been drawn for the same data set. The main questions explored in studies to date have been about the effects of using sparse data, adding new taxa, including more characters from genome sequences and using different (or concatenated) locus regions. These questions can be reduced to more fundamental ones about the assessment of data quality and the design guidelines of taxon sampling in phylogenetic inference experiments. This review summarizes progress to date in understanding the impact of taxon sampling on the accuracy of phylogenetic analysis.
在过去的二十年中,关于分类群采样对系统发育推断的影响一直存在着长期的争论。这些研究基于真实和模拟数据集,在实际和理论背景下,并使用不同的推断方法,来研究分类群采样的影响。在某些情况下,对于相同的数据集得出了相互矛盾的结论。迄今为止的研究中主要探讨的问题是使用稀疏数据、增加新分类群、包括更多来自基因组序列的特征以及使用不同(或串联)的基因座区域的影响。这些问题可以归结为更基本的关于数据质量评估和系统发育推断实验中分类群采样设计准则的问题。本综述总结了目前对于理解分类群采样对系统发育分析准确性影响的进展。