Pirie Michael D, Humphreys Aelys M, Galley Chloe, Barker Nigel P, Verboom G Anthony, Orlovich David, Draffin Suzy J, Lloyd Kelvin, Baeza C Marcelo, Negritto Maria, Ruiz Eduardo, Sanchez J Hugo Cota, Reimer Elizabeth, Linder H Peter
Institute for Systematic Botany, University of Zurich, Switzerland.
Mol Phylogenet Evol. 2008 Sep;48(3):1106-19. doi: 10.1016/j.ympev.2008.05.030. Epub 2008 Jul 2.
Phylogeny reconstruction is challenging when branch lengths vary and when different genetic loci show conflicting signals. The number of DNA sequence characters required to obtain robust support for all the nodes in a phylogeny becomes greater with denser taxon sampling. We test the usefulness of an approach mixing densely sampled, variable non-coding sequences (trnL-F; rpl16; atpB-rbcL; ITS) with sparsely sampled, more conservative protein coding and ribosomal sequences (matK; ndhF; rbcL; 26S), for the grass subfamily Danthonioideae. Previous phylogenetic studies of Danthonioideae revealed extensive generic paraphyly, but were often impeded by insufficient character and taxon sampling and apparent inter-gene conflict. Our variably-sampled supermatrix approach allowed us to represent 79% of the species with up to c. 9900 base pairs for taxa representing the major clades. A 'taxon duplication' approach for taxa with conflicting phylogenetic signals allowed us to combine the data whilst representing the differences between chloroplast and nuclear encoded gene trees. This approach efficiently improves resolution and support whilst maximising representation of taxa and their sometimes composite evolutionary histories, resulting in a phylogeny of the Danthonioideae that will be useful both for a wide range of evolutionary studies and to inform forthcoming realignment of generic delimitations in the subfamily.
当分支长度不同以及不同基因座显示出相互冲突的信号时,系统发育重建具有挑战性。随着分类群抽样密度的增加,为系统发育中的所有节点获得有力支持所需的DNA序列特征数量会变得更多。我们测试了一种方法的有效性,该方法将密集抽样的可变非编码序列(trnL-F;rpl16;atpB-rbcL;ITS)与稀疏抽样的、更保守的蛋白质编码和核糖体序列(matK;ndhF;rbcL;26S)混合,用于草亚科Danthonioideae。先前对Danthonioideae的系统发育研究揭示了广泛的属并系现象,但常常受到特征和分类群抽样不足以及明显的基因间冲突的阻碍。我们的可变抽样超级矩阵方法使我们能够用代表主要分支的分类群中多达约9900个碱基对来代表79%的物种。对于具有相互冲突的系统发育信号的分类群,一种“分类群重复”方法使我们能够在代表叶绿体和核编码基因树之间差异的同时合并数据。这种方法有效地提高了分辨率和支持度,同时最大限度地体现了分类群及其有时复杂的进化历史,从而得出一个Danthonioideae的系统发育树,这对于广泛的进化研究以及为该亚科即将进行的属界限重新划分提供参考都将是有用的。