Krishnamurthy Panneerselvam, Kim Jin A, Jeong Mi-Jeong, Kang Chang Ho, Lee Soo In
Department of Agricultural Biotechnology, National Academy of Agricultural Science (NAAS), Jeonju, 560-500, Korea.
Division of Applied Life Science and PMBBRC, Gyeongsang National University, Jinju, 660-701, Korea.
Mol Genet Genomics. 2015 Dec;290(6):2279-95. doi: 10.1007/s00438-015-1080-0. Epub 2015 Jun 30.
RNA-binding glycine-rich (RBG) proteins play diverse roles in plant growth, development, protection and genome organization. An overly broad definition for class IV glycine-rich proteins (GRPs), namely RNA-binding activity and a glycine-rich C-terminus, has resulted in many distantly related and/or non-related proteins being grouped into this class of RBGs. This definition has hampered the study of RBG evolution. In this study, we used a comparative genomic approach consisting of ortholog, homolog, synteny and phylogenetic analyses to legitimately exclude all distantly/non-related proteins from class IV GRPs and to identify 15, 22, 12 and 18 RBG proteins in Arabidopsis, Chinese cabbage, rice and maize genomes, respectively. All identified RBGs could be divided into three subclasses, namely RBGA, RBGB and RBGD, which may be derived from a common ancestor. We assigned RBGs excluded from class IV GRPs to a separate RBG superfamily. RBGs have evolved and diversified in different species via different mechanisms; segmental duplication and recombination have had major effects, with tandem duplication, intron addition/deletion and domain recombination/deletion playing minor roles. Loss and retention of duplicated RBGs after polyploidization has been species and subclass specific. For example, following recent whole-genome duplication and triplication in maize and Chinese cabbage, respectively, most duplicated copies of RBGA have been lost in maize while RBGD duplicates have been retained; in Chinese cabbage, in contrast, RBGA duplicates have been retained while RBGD duplicates have been lost. Our findings reveal fundamental information and shed new light on the structural characteristics and evolutionary dynamics of RBGs.
富含甘氨酸的RNA结合(RBG)蛋白在植物生长、发育、保护及基因组组织中发挥着多种作用。对IV类富含甘氨酸蛋白(GRP)的定义过于宽泛,即RNA结合活性和富含甘氨酸的C末端,导致许多远缘相关和/或不相关的蛋白被归入此类RBG。这一定义阻碍了RBG进化的研究。在本研究中,我们采用了一种比较基因组学方法,包括直系同源、同源、共线性和系统发育分析,以合理地将所有远缘/不相关蛋白从IV类GRP中排除,并分别在拟南芥、大白菜、水稻和玉米基因组中鉴定出15、22、12和18个RBG蛋白。所有鉴定出的RBG可分为三个亚类,即RBGA、RBGB和RBGD,它们可能起源于一个共同祖先。我们将从IV类GRP中排除的RBG归入一个单独的RBG超家族。RBG在不同物种中通过不同机制进化和多样化;片段重复和重组起主要作用,串联重复、内含子添加/缺失和结构域重组/缺失起次要作用。多倍体化后重复RBG的丢失和保留具有物种和亚类特异性。例如,在玉米和大白菜分别经历近期的全基因组复制和三倍化后,RBGA的大多数重复拷贝在玉米中丢失,而RBGD的重复拷贝被保留;相反,在大白菜中,RBGA的重复拷贝被保留,而RBGD的重复拷贝丢失。我们的研究结果揭示了RBG的基本信息,并为其结构特征和进化动态提供了新的线索。