Petersen Gitte, Seberg Ole, Davis Jerrold I
Botanical Garden, Natural History Museum of Denmark, University of Copenhagen, Sølvgade 83 Opg. S, DK-1307 Copenhagen, Denmark.
L. H. Bailey Hortorium and Department of Plant Biology, Cornell University, Ithaca, New York 14853, USA.
Cladistics. 2013 Jun;29(3):274-295. doi: 10.1111/j.1096-0031.2012.00427.x. Epub 2012 Oct 3.
A phylogenetic analysis of the monocot order Liliales was performed using sequence data from three mitochondrial (atp1, cob, nad5) and two plastid genes (rbcL, ndhF). The complete data matrix includes 46 terminals representing all 10 families currently included in Liliales. The two major partitions, mitochondrial and plastid data, were congruent, and parsimony analysis resulted in 50 equally parsimonious trees and a well resolved consensus tree confirming monophyly of all families. Mitochondrial genes are known to include RNA edited sites, and in some cases unprocessed genes are replaced by retro-processed gene copies, that is processed paralogs. To test the effects on phylogeny reconstruction of predicted edited sites and potentially unintentionally sampled processed paralogs, a number of analyses were performed using subsets of the complete data matrix. In general, predicted edited sites were more homoplasious than the other characters and increased incongruence among most data partitions. The predicted edited sites have a non-random phylogenetic signal in conflict with the signal of the non-edited sites. The potentially misleading signal was caused partially by the apparent presence of processed paralogs in Galanthus (Amaryllidaceae), part of the outgroup, but also by a deviating evolutionary pattern of predicted edited sites in Liliaceae compared with the remainder of the Liliales. Despite the problems that processed paralogs may cause, we argue that they should not a priori be excluded from phylogenetic analysis.
利用来自三个线粒体基因(atp1、cob、nad5)和两个质体基因(rbcL、ndhF)的序列数据,对单子叶植物百合目进行了系统发育分析。完整的数据矩阵包括46个终端,代表了目前百合目中所有10个科。线粒体数据和质体数据这两个主要分区是一致的,简约分析得到了50棵同等简约的树和一棵解析良好的共识树,证实了所有科的单系性。已知线粒体基因包含RNA编辑位点,在某些情况下,未加工的基因会被反转录加工的基因拷贝所取代,即加工过的旁系同源基因。为了测试预测的编辑位点和潜在的无意中采样的加工旁系同源基因对系统发育重建的影响,使用完整数据矩阵的子集进行了一些分析。一般来说,预测的编辑位点比其他特征更具同塑性,并且增加了大多数数据分区之间的不一致性。预测的编辑位点具有与未编辑位点的信号相冲突的非随机系统发育信号。这种潜在的误导性信号部分是由外类群一部分的雪花莲属(石蒜科)中明显存在的加工旁系同源基因引起的,但也由百合科中预测的编辑位点与百合目其他科相比偏离的进化模式引起的。尽管加工旁系同源基因可能会带来问题,但我们认为它们不应在系统发育分析中被先验排除。