Li Xiu-Qing
Potato Research Centre, Agriculture and Agri-Food Canada, Fredericton, New Brunswick, Canada.
PLoS One. 2014 Jun 18;9(6):e99928. doi: 10.1371/journal.pone.0099928. eCollection 2014.
The precursor messenger RNA (pre-mRNA) three-prime cleaved-off region (3'COR) and the mRNA three-prime untranslated region (3'UTR) play critical roles in regulating gene expression. The differences in base composition between these regions and the corresponding genomes are still largely uncharacterized in animals and plants. In this study, the base compositions of non-redundant 3'CORs and 3'UTRs were compared with the corresponding whole genomes of eleven animals, four dicotyledonous plants, and three monocotyledonous (cereal) plants. Among the four bases (A, C, G, and U for adenine, cytosine, guanine, and uracil, respectively), U (which corresponds to T, for thymine, in DNA) was the most frequent, A the second most frequent, G the third most frequent, and C the least frequent in most of the species in both the 3'COR and 3'UTR regions. In comparison with the whole genomes, in both regions the U content was usually the most overrepresented (particularly in the monocotyledonous plants), and the C content was the most underrepresented. The order obtained for the species groups, when ranked from high to low according to the U contents in the 3'COR and 3'UTR was as follows: dicotyledonous plants, monocotyledonous plants, non-mammal animals, and mammals. In contrast, the genomic T content was highest in dicotyledonous plants, lowest in monocotyledonous plants, and intermediate in animals. These results suggest the following: 1) there is a mechanism operating in both animals and plants which is biased toward U and against C in the 3'COR and 3'UTR; 2) the 3'UTR and 3'COR, as functional units, minimized the difference between dicotyledonous and monocotyledonous plants, while the dicotyledonous and monocotyledonous genomes evolved into two extreme groups in terms of base composition.
前体信使核糖核酸(前体mRNA)3′端裂解区(3′COR)和mRNA 3′端非翻译区(3′UTR)在调节基因表达中发挥着关键作用。在动物和植物中,这些区域与相应基因组之间的碱基组成差异在很大程度上仍未得到充分表征。在本研究中,将非冗余3′COR和3′UTR的碱基组成与11种动物、4种双子叶植物和3种单子叶(谷类)植物的相应全基因组进行了比较。在四种碱基(分别为腺嘌呤、胞嘧啶、鸟嘌呤和尿嘧啶的A、C、G和U)中,U(在DNA中对应于胸腺嘧啶T)在3′COR和3′UTR区域的大多数物种中是最常见的,A是第二常见的,G是第三常见的,C是最不常见的。与全基因组相比,在这两个区域中,U含量通常是最富集的(特别是在单子叶植物中),而C含量是最缺乏的。根据3′COR和3′UTR中U含量从高到低对物种组进行排序,得到的顺序如下:双子叶植物、单子叶植物、非哺乳动物和哺乳动物。相比之下,基因组T含量在双子叶植物中最高,在单子叶植物中最低,在动物中处于中间水平。这些结果表明:1)在动物和植物中都存在一种机制,该机制在3′COR和3′UTR中偏向于U而不利于C;2)3′UTR和3′COR作为功能单元,使双子叶植物和单子叶植物之间的差异最小化,而双子叶植物和单子叶植物的基因组在碱基组成方面进化成了两个极端的组。