Hazle Thomas, Bonen Linda
Biology Department, University of Ottawa, Ottawa, Canada.
Mol Biol Evol. 2007 May;24(5):1101-12. doi: 10.1093/molbev/msm030. Epub 2007 Feb 13.
We examined the nucleotide sequences preceding 23 mitochondrial protein-coding genes held in common by maize, rice, wheat, sugar beet, tobacco, Arabidopsis, and Brassica to look for features related to translation initiation and to assess the degree of conservation in mitochondrial mRNA leaders among these plants. We observed broad variation in sequence similarity as illustrated by dot plot analysis, ranging from a level rivaling that of coding sequences to complete absence of homology due to lineage-specific DNA rearrangements. Genes encoding ATP synthase subunits predominated in the latter category, whereas ones encoding cytochrome c biogenesis proteins and NADH dehydrogenase subunits were primarily of the highly conserved type. Within the region immediately preceding initiation codons, in most cases we did not observe motifs consistent with a bacterial-type Shine-Dalgarno interaction to assist in ribosome binding, nor was any other consensus sequence evident. In fact, indels in the form of tandem repeats were seen among homologues from different plants. We did, however, observe a bias for high adenosine and low cytosine in the proximal approximately 30 nt compared with further upstream. Duplicates of some sequences in our data set were found to be associated with more than one gene within a genome. Indeed, 3 such families of upstream cassettes were identified, and they exhibit a lineage-specific distribution among plants. Moreover, the presence of related sequences at genomic sites distant from known genes raises the possibility of future recruitment as regulatory elements. Our observations point to a dynamic nature in the makeup of the 5' leaders of plant mitochondrial mRNAs and an apparent plasticity in translational control elements.
我们研究了玉米、水稻、小麦、甜菜、烟草、拟南芥和油菜共有的23个线粒体蛋白质编码基因之前的核苷酸序列,以寻找与翻译起始相关的特征,并评估这些植物线粒体mRNA前导序列中的保守程度。如点阵分析所示,我们观察到序列相似性存在广泛差异,范围从与编码序列相当的水平到由于谱系特异性DNA重排而完全没有同源性。编码ATP合酶亚基的基因在后一类中占主导地位,而编码细胞色素c生物合成蛋白和NADH脱氢酶亚基的基因主要是高度保守型。在起始密码子之前紧邻的区域内,在大多数情况下,我们没有观察到与细菌型Shine-Dalgarno相互作用一致的基序来协助核糖体结合,也没有明显的其他共有序列。事实上,在来自不同植物的同源物中发现了串联重复形式的插入缺失。然而,与更上游相比,我们确实观察到在近端约30个核苷酸中腺苷含量高而胞嘧啶含量低的偏向性。我们数据集中某些序列的重复片段被发现与基因组内的多个基因相关。确实,鉴定出了3个这样的上游盒式结构家族,它们在植物中呈现谱系特异性分布。此外,在远离已知基因的基因组位点存在相关序列增加了未来作为调控元件被招募的可能性。我们的观察结果表明植物线粒体mRNA 5'前导序列的组成具有动态性质,并且翻译控制元件具有明显的可塑性。