Meyer Irmtraud M, Miklós István
European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK.
Nucleic Acids Res. 2005 Nov 7;33(19):6338-48. doi: 10.1093/nar/gki923. Print 2005.
Owing to the degeneracy of the genetic code, protein-coding regions of mRNA sequences can harbour more than only amino acid information. We search the mRNA sequences of 11 human protein-coding genes for evolutionarily conserved secondary structure elements using RNA-Decoder, a comparative secondary structure prediction program that is capable of explicitly taking the known protein-coding context of the mRNA sequences into account. We detect well-defined, conserved RNA secondary structure elements in the coding regions of the mRNA sequences and show that base-paired codons strongly correlate with sparse codons. We also investigate the role of repetitive elements in the formation of secondary structure and explain the use of alternate start codons in the caveolin-1 gene by a conserved secondary structure element overlapping the nominal start codon. We discuss the functional roles of our novel findings in regulating the gene expression on mRNA level. We also investigate the role of secondary structure on the correct splicing of the human CFTR gene. We study the wild-type version of the pre-mRNA as well as 29 variants with synonymous mutations in exon 12. By comparing our predicted secondary structures to the experimentally determined splicing efficiencies, we find with weak statistical significance that pre-mRNAs with high-splicing efficiencies have different predicted secondary structures than pre-mRNAs with low-splicing efficiencies.
由于遗传密码的简并性,mRNA序列的蛋白质编码区域可能蕴含不止氨基酸信息。我们使用RNA-Decoder在11个人类蛋白质编码基因的mRNA序列中搜索进化保守的二级结构元件,RNA-Decoder是一个比较二级结构预测程序,能够明确考虑mRNA序列已知的蛋白质编码背景。我们在mRNA序列的编码区域检测到明确的、保守的RNA二级结构元件,并表明碱基配对密码子与稀有密码子强烈相关。我们还研究了重复元件在二级结构形成中的作用,并通过与标称起始密码子重叠的保守二级结构元件解释了小窝蛋白-1基因中交替起始密码子的使用。我们讨论了我们新发现的在mRNA水平调控基因表达中的功能作用。我们还研究了二级结构对人类CFTR基因正确剪接的作用。我们研究了前体mRNA的野生型版本以及外显子12中具有同义突变的29个变体。通过将我们预测的二级结构与实验确定的剪接效率进行比较,我们发现剪接效率高的前体mRNA与剪接效率低的前体mRNA具有不同的预测二级结构,不过统计显著性较弱。