Chung Betty Y W, Simons Cas, Firth Andrew E, Brown Chris M, Hellens Roger P
Biochemistry Department, University of Otago, Dunedin, New Zealand.
BMC Genomics. 2006 May 19;7:120. doi: 10.1186/1471-2164-7-120.
Most introns in gene transcripts lie within the coding sequences (CDSs), but a small yet significant fraction reside within the untranslated regions (5'UTRs and 3'UTRs) of expressed sequences. Aligning the whole genome of the model plant Arabidopsis thaliana against its expressed sequence tags (ESTs) identified introns in both coding and non-coding regions of the genome.
A bioinformatic analysis revealed four features: (1) the density of introns in 5'UTRs is similar to that in CDSs but much higher than that in 3'UTRs; (2) 5'UTR introns are preferentially located close to the initiating ATG codon; (3) introns in 5'UTRs are, on average, longer than introns in CDSs and 3'UTRs; and (4) 5'UTR introns differ in nucleotide composition from CDS and 3'UTR introns. Furthermore, we show that the 5'UTR intron of the A. thaliana EF1alpha-A3 gene affects gene expression and that the size of this intron influences the level of expression.
Introns within the 5'UTR show specific features that distinguish them from introns within the coding sequence and the 3'UTR. In the EF1alpha-A3 gene, the presence of a long intron in the 5'UTR is sufficient to enhance gene expression in plants in a size-dependent manner.
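The per-region comparisons of intron density and intron length described above can be illustrated with a minimal sketch. This is not the authors' pipeline: the `Intron` record, the `summarize` function, the region labels, and the definition of density as introns per kilobase of spliced (exonic) sequence in each region are all assumptions made for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Intron:
    """Hypothetical record for one annotated intron."""
    region: str  # "5UTR", "CDS", or "3UTR" (labels are an assumption)
    start: int   # 0-based, inclusive genomic coordinate
    end: int     # 0-based, exclusive genomic coordinate

    @property
    def length(self) -> int:
        return self.end - self.start


def summarize(introns, region_exon_bp):
    """Return {region: (introns per kb of exonic sequence, mean intron length)}.

    region_exon_bp maps each region label to the total spliced sequence
    length (in bp) surveyed for that region. This density definition is an
    assumption, not taken from the paper.
    """
    lengths = defaultdict(list)
    for intron in introns:
        lengths[intron.region].append(intron.length)

    summary = {}
    for region, exon_bp in region_exon_bp.items():
        if exon_bp <= 0:
            raise ValueError(f"exonic length for {region} must be positive")
        region_lengths = lengths.get(region, [])
        density = len(region_lengths) / (exon_bp / 1000.0)
        mean_length = (
            sum(region_lengths) / len(region_lengths) if region_lengths else 0.0
        )
        summary[region] = (density, mean_length)
    return summary
```

Given a list of `Intron` records parsed from a genome-to-EST alignment and the total spliced length of each region, `summarize` returns one `(density, mean_length)` pair per region, which is enough to compare 5'UTR, CDS, and 3'UTR introns side by side.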