Beaudoing E, Freier S, Wyatt J R, Claverie J M, Gautheret D
Structural and Genetic Information Laboratory, Marseille, France.
Genome Res. 2000 Jul;10(7):1001-10. doi: 10.1101/gr.10.7.1001.
The formation of mature mRNAs in vertebrates involves the cleavage and polyadenylation of the pre-mRNA, 10-30 nt downstream of an AAUAAA or AUUAAA signal sequence. The extensive cDNA data now available shows that these hexamers are not strictly conserved. In order to identify variant polyadenylation signals on a large scale, we compared over 8700 human 3' untranslated sequences to 157,775 polyadenylated expressed sequence tags (ESTs), used as markers of actual mRNA 3' ends. About 5600 EST-supported putative mRNA 3' ends were collected and analyzed for significant hexameric sequences. Known polyadenylation signals were found in only 73% of the 3' fragments. Ten single-base variants of the AAUAAA sequence were identified with a highly significant occurrence rate, potentially representing 14.9% of the actual polyadenylation signals. Of the mRNAs, 28.6% displayed two or more polyadenylation sites. In these mRNAs, the poly(A) sites proximal to the coding sequence tend to use variant signals more often, while the 3'-most site tends to use a canonical signal. The average number of ESTs associated with each signal type suggests that variant signals (including the common AUUAAA) are processed less efficiently than the canonical signal and could therefore be selected for regulatory purposes. However, the position of the site in the untranslated region may also play a role in polyadenylation rate.
脊椎动物中成熟mRNA的形成涉及前体mRNA在AAUAAA或AUUAAA信号序列下游10 - 30个核苷酸处的切割和聚腺苷酸化。目前可得的大量cDNA数据表明,这些六聚体并非严格保守。为了大规模鉴定可变聚腺苷酸化信号,我们将8700多条人类3'非翻译序列与157,775条聚腺苷酸化表达序列标签(EST)进行了比较,这些EST用作实际mRNA 3'末端的标记。收集了约5600个由EST支持的推定mRNA 的3'末端,并分析其中显著的六聚体序列。在3'片段中,仅73%发现了已知的聚腺苷酸化信号。鉴定出AAUAAA序列的10个单碱基变体,其出现率具有高度显著性,可能占实际聚腺苷酸化信号的14.9%。在这些mRNA中,28.6%显示出两个或更多的聚腺苷酸化位点。在这些mRNA中,靠近编码序列的聚(A)位点往往更频繁地使用可变信号,而最靠近3'端的位点倾向于使用典型信号。与每种信号类型相关的EST平均数量表明,可变信号(包括常见的AUUAAA)的加工效率低于典型信号,因此可能出于调控目的而被选择。然而,该位点在非翻译区的位置也可能在聚腺苷酸化速率中发挥作用。