Sampedro Javier, Lee Yi, Carey Robert E, dePamphilis Claude, Cosgrove Daniel J
Department of Biology, 208 Mueller Lab, Pennsylvania State University, University Park, PA 16802, USA.
Plant J. 2005 Nov;44(3):409-19. doi: 10.1111/j.1365-313X.2005.02540.x.
Polyploidy events have played an important role in the evolution of angiosperm genomes. Here, we demonstrate how genomic histories can increase phylogenetic resolution in a gene family, specifically the expansin superfamily of cell wall proteins. There are 36 expansins in Arabidopsis and 58 in rice. Traditional sequence-based phylogenetic trees yield poor resolution below the family level. To improve upon these analyses, we searched for gene colinearity (microsynteny) between Arabidopsis and rice genomic segments containing expansin genes. Multiple rounds of genome duplication and extensive gene loss have obscured synteny. However, by simultaneously aligning groups of up to 10 potentially orthologous segments from the two species, we traced the history of 49 out of 63 expansin-containing segments back to the ancestor of monocots and eudicots. Our results indicate that this ancestor had 15-17 expansin genes, each ancestral to an extant clade. Some clades have strikingly different growth patterns in the rice and Arabidopsis lineages, with more than half of all rice expansins arising from two ancestral genes. Segmental duplications, most of them part of polyploidy events, account for 12 out of 21 new expansin genes in Arabidopsis and 16 out of 44 in rice. Tandem duplications explain most of the rest. We were also able to estimate a minimum of 28 gene deaths in the Arabidopsis lineage and nine in rice. This analysis greatly clarifies expansin evolution since the last common ancestor of monocots and eudicots and the method should be broadly applicable to many other gene families.
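The colinearity (microsynteny) search described above can be illustrated with a minimal sketch. This is not the authors' pipeline. It assumes each genomic segment has already been reduced to an ordered list of homology-group labels (the labels `F1`, `EXP`, and so on are hypothetical), and it scores a pair of segments by the longest run of shared labels that appear in the same relative order, in either orientation. Because a subsequence may skip positions, intervening gene losses do not break the match.

```python
from typing import Sequence


def colinear_matches(seg_a: Sequence[str], seg_b: Sequence[str]) -> int:
    """Length of the longest common subsequence of homology-group labels."""
    # Row-by-row dynamic programming; prev[j] holds the best score for the
    # genes of seg_a processed so far against the first j genes of seg_b.
    prev = [0] * (len(seg_b) + 1)
    for a in seg_a:
        curr = [0]
        for j, b in enumerate(seg_b, start=1):
            if a == b:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def microsynteny_score(seg_a: Sequence[str], seg_b: Sequence[str]) -> int:
    """Best colinear match count over both relative orientations."""
    forward = colinear_matches(seg_a, seg_b)
    inverted = colinear_matches(seg_a, list(reversed(seg_b)))
    return max(forward, inverted)


# Hypothetical segments: one expansin (EXP) plus flanking gene families.
ath_segment = ["F1", "F2", "EXP", "F3", "F4", "F5"]
osa_segment = ["F1", "EXP", "F9", "F4", "F5"]          # F2 and F3 lost, F9 gained
osa_inverted = ["F5", "F4", "EXP", "F1"]               # same region, opposite strand

print(microsynteny_score(ath_segment, osa_segment))
print(microsynteny_score(ath_segment, osa_inverted))
```

A real analysis would also need a homology assignment step and a significance threshold for the score; both are outside the scope of this sketch, and the abstract does not specify how the authors handled them.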