Wu Ningning, Zhu Yan, Song Wanlu, Li Yaxuan, Yan Yueming, Hu Yingkao
BMC Plant Biol. 2014 Dec 19;14:373. doi: 10.1186/s12870-014-0373-5.
GRAS proteins belong to a plant transcription factor family that is involved with multifarious roles in plants. Although previous studies of this protein family have been reported for Arabidopsis, rice, Chinese cabbage and other species, investigation of expansion patterns and evolutionary rate on the basis of comparative genomics in different species remains inadequate.
A total of 289 GRAS genes were identified in Arabidopsis, B. distachyon, rice, soybean, S. moellendorffii, and P. patens and were grouped into seven subfamilies, supported by the similarity of their exon-intron patterns and structural motifs. All of tandem duplicated genes were found in group II except one cluster of rice, indicating that tandem duplication greatly promoted the expansion of group II. Furthermore, segment duplications were mainly found in the soybean genome, whereas no single expansion pattern dominated in other plant species indicating that GRAS genes from these five species might be subject to a more complex evolutionary mechanism. Interestingly, branch-site model analyses of positive selection showed that a number of sites were positively selected under foreground branches I and V. These results strongly indicated that these groups were experiencing higher positive selection pressure. Meanwhile, the site-specific model revealed that the GRAS genes were under strong positive selection in P. patens. DIVERGE v2.0 was used to detect critical amino acid sites, and the results showed that the shifted evolutionary rate was mainly attributed to the functional divergence between the GRAS genes in the two groups. In addition, the results also demonstrated the expression divergence of the GRAS duplicated genes in the evolution. In short, the results above provide a solid foundation for further functional dissection of the GRAS gene superfamily.
In this work, differential expression, evolutionary rate, and expansion patterns of the GRAS gene family in the six species were predicted. Especially, tandem duplication events played an important role in expansion of group II. Together, these results contribute to further functional analysis and the molecular evolution of the GRAS gene superfamily.
GRAS蛋白属于植物转录因子家族,在植物中发挥着多种作用。尽管此前已针对拟南芥、水稻、大白菜及其他物种对该蛋白家族展开了研究,但基于不同物种比较基因组学对其扩张模式和进化速率的研究仍显不足。
在拟南芥、二穗短柄草、水稻、大豆、小立碗藓和小立碗藓中总共鉴定出289个GRAS基因,并根据其外显子-内含子模式和结构基序的相似性将它们分为七个亚家族。除了水稻的一个簇外,所有串联重复基因均在第二组中被发现,这表明串联重复极大地促进了第二组的扩张。此外,片段重复主要存在于大豆基因组中,而在其他植物物种中没有单一的扩张模式占主导地位,这表明来自这五个物种的GRAS基因可能经历了更复杂的进化机制。有趣的是,正选择的分支位点模型分析表明,在前景分支I和V下有许多位点受到正选择。这些结果有力地表明,这些组正经历着更高的正选择压力。同时,位点特异性模型显示GRAS基因在小立碗藓中受到强烈的正选择。使用DIVERGE v2.0检测关键氨基酸位点,结果表明进化速率的变化主要归因于两组GRAS基因之间的功能差异。此外,结果还证明了GRAS重复基因在进化过程中的表达差异。简而言之,上述结果为进一步对GRAS基因超家族进行功能剖析奠定了坚实基础。
在这项工作中,预测了六个物种中GRAS基因家族的差异表达、进化速率和扩张模式。特别是,串联重复事件在第二组的扩张中发挥了重要作用。总之,这些结果有助于对GRAS基因超家族进行进一步的功能分析和分子进化研究。