Jiménez-López Domingo, Bravo Jaime, Guzmán Plinio
Departamento de Ingeniería Genética, Centro de Investigación y de Estudios Avanzados, Unidad Irapuato, Apartado Postal 629, Irapuato, Gto., 36821, Mexico.
Present address: Department of Microbiology and Immunology, University of Texas Medical Branch, Galveston, TX, 77555, USA.
BMC Evol Biol. 2015 Sep 16;15:195. doi: 10.1186/s12862-015-0475-1.
Poly(A)-binding proteins (PABPs) are evolutionarily conserved proteins that have important functions in the regulation of translation and the control of mRNA stability in eukaryotes. Most PABPs encode a C-terminal domain known as the MLLE domain (previously PABC or CTC), which can mediate protein interactions. In earlier work we identified and predicted that four classes of MLLE-interacting proteins were present in Arabidopsis thaliana, which we named CID A, B, C, and D. These proteins encode transcription-activating domains (CID A), the Lsm and LsmAD domains of ataxin-2 (CID B), the CUE and small MutS-related domains (CID C), and two RNA recognition domains (CID D). We recently found that a novel class that lacks the LsmAD domain is present in CID B proteins.
We extended our analysis to other classes of CIDs present in the viridiplantae. We found that novel variants also evolved in classes CID A and CID C. A specific transcription factor domain is present in a distinct lineage in class A, and a variant that lacks at least two distinct domains was also identified in a divergent lineage in class C. We did not detect any variants in Class D CIDs. This class often consists of four to six highly conserved RNA-binding proteins, which suggests that major redundancy is present in this class.
CIDs are likely to operate as components of posttranscriptional regulatory assemblies. The evident diversification of CIDs may be neutral or may be important for plant adaptation to the environment and for acquisition of specific traits during evolution. The fact that CIDs subclasses are maintained in early lineages suggest that a presumed interference between duplicates was resolved, and a defined function for each subclass was achieved.
聚腺苷酸结合蛋白(PABP)是进化上保守的蛋白质,在真核生物的翻译调控和mRNA稳定性控制中具有重要功能。大多数PABP编码一个称为MLLE结构域(以前称为PABC或CTC)的C末端结构域,该结构域可介导蛋白质相互作用。在早期工作中,我们鉴定并预测拟南芥中存在四类与MLLE相互作用的蛋白质,我们将其命名为CID A、B、C和D。这些蛋白质编码转录激活结构域(CID A)、ataxin-2的Lsm和LsmAD结构域(CID B)、CUE和小MutS相关结构域(CID C)以及两个RNA识别结构域(CID D)。我们最近发现CID B蛋白中存在一类缺乏LsmAD结构域的新类型。
我们将分析扩展到绿藻门中存在的其他类别的CID。我们发现CID A和CID C类中也进化出了新的变体。A类中的一个特定转录因子结构域存在于一个独特的谱系中,并且在C类的一个不同谱系中也鉴定出了一个至少缺乏两个不同结构域的变体。我们在D类CID中未检测到任何变体。这类通常由四到六个高度保守的RNA结合蛋白组成,这表明该类中存在主要的冗余。
CID可能作为转录后调控组件的组成部分发挥作用。CID明显的多样化可能是中性的,也可能对植物适应环境以及在进化过程中获得特定性状很重要。CID亚类在早期谱系中得以保留这一事实表明,假定的重复基因之间的干扰得到了解决,并且每个亚类都实现了明确的功能。