Institute of Genetics and Biotechnology, Faculty of Biology, University of Warsaw, Warsaw, 02-106, Poland.
Institute of Biochemistry and Biophysics, Polish Academy of Sciences, Warsaw, 02-106, Poland.
RNA. 2020 Sep;26(9):1268-1282. doi: 10.1261/rna.074880.120. Epub 2020 May 28.
PPR proteins are a diverse family of RNA binding factors found in all Eukaryotic lineages. They perform multiple functions in the expression of organellar genes, mostly on the post-transcriptional level. PPR proteins are also significant determinants of evolutionary nucleo-organellar compatibility. Plant PPR proteins recognize their RNA substrates using a simple modular code. No target sequences recognized by animal or yeast PPR proteins were identified prior to the present study, making it impossible to assess whether this plant PPR code is conserved in other organisms. Dmr1p (Ccm1p, Ygr150cp) is a PPR protein essential for mitochondrial gene expression and involved in the stability of 15S ribosomal RNA. We demonstrate that in vitro Dmr1p specifically binds a motif composed of multiple AUA repeats occurring twice in the 15S rRNA sequence as the minimal 14 nt (AUA)AU or longer (AUA) variant. Short RNA fragments containing this motif are protected by Dmr1p from exoribonucleolytic activity in vitro. Presence of the identified motif in mtDNA of different yeast species correlates with the compatibility between their Dmr1p orthologs and mtDNA. RNA recognition by Dmr1p is likely based on a rudimentary form of a PPR code specifying U at every third position, and depends on other factors, like RNA structure.
PPR 蛋白是一类广泛存在于所有真核生物中的 RNA 结合因子。它们在细胞器基因的表达中发挥多种功能,主要在转录后水平发挥作用。PPR 蛋白也是核质体进化相容性的重要决定因素。植物 PPR 蛋白使用简单的模块化代码来识别其 RNA 底物。在本研究之前,没有鉴定出动物或酵母 PPR 蛋白识别的靶序列,因此无法评估这种植物 PPR 代码是否在其他生物体中保守。Dmr1p(Ccm1p、Ygr150cp)是一种线粒体基因表达所必需的 PPR 蛋白,参与 15S 核糖体 RNA 的稳定性。我们证明,Dmr1p 在体外特异性结合由 15S rRNA 序列中两次出现的多个 AUA 重复组成的基序,其最小长度为 14nt(AUA)AU 或更长(AUA)变体。含有该基序的短 RNA 片段在体外受到 Dmr1p 的保护,免受外切核酸酶的活性。不同酵母物种 mtDNA 中存在的鉴定基序与其 Dmr1p 同源物和 mtDNA 之间的兼容性相关。Dmr1p 对 RNA 的识别可能基于一种简单的 PPR 代码形式,指定每个第三位的 U,并取决于其他因素,如 RNA 结构。