Lithwick Gila, Margalit Hanah
Department of Molecular Genetics and Biotechnology, Faculty of Medicine, The Hebrew University, Jerusalem 91120, Israel.
Genome Res. 2003 Dec;13(12):2665-73. doi: 10.1101/gr.1485203.
Protein expression in the cell is affected by various sequence-dependent features. Several such sequence-dependent features have been individually studied, yet they have not been compared quantitatively in terms of their relative influence on protein expression, and a hierarchy of these elements has not been determined. Here we present a quantitative analysis examining sequence-dependent features involved in prokaryotic translation, namely, the base-pairing potential between the mRNA Shine-Dalgarno sequence and the ribosomal RNA, codon bias, and the identity of the stop codon. We analyzed these features both at intra- and intergenomic levels using the Escherichia coli and Haemophilus influenzae genomes. Within each genome, we examined the relationship between each feature and protein expression levels determined by 2D-gel analyses. At the intergenomic level, comparative genomic principles were applied to study the relative preservation of the different sequence-dependent properties between orthologs. From these analyses, we determined that biased codon usage is the property that is most highly associated with protein expression and that is most conserved. The identity of the stop codon and the base-pairing potential of the mRNA Shine-Dalgarno sequence and the rRNA seem to have less of an effect on protein expression.
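As a rough illustration of two of the sequence-dependent features named in the abstract (stop codon identity and codon usage), the following Python sketch extracts both from a single coding sequence. It is a toy example under stated assumptions, not the authors' method: the abstract does not say which codon bias measure was used, so plain within-gene codon frequencies stand in for it here, and the function names `split_codons`, `stop_codon`, and `codon_frequencies` are hypothetical.

```python
from collections import Counter

# The three stop codons of the standard genetic code (DNA alphabet).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def split_codons(cds):
    """Split an in-frame coding sequence into a list of codons."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return [cds[i:i + 3] for i in range(0, len(cds), 3)]


def stop_codon(cds):
    """Return the terminal stop codon, or None if the sequence lacks one."""
    last = split_codons(cds)[-1]
    return last if last in STOP_CODONS else None


def codon_frequencies(cds):
    """Return the fraction of each sense codon within one gene.

    Assumption: this simple frequency table is only a stand-in for a
    codon bias score; it is not the measure used in the paper.
    """
    codons = [c for c in split_codons(cds) if c not in STOP_CODONS]
    counts = Counter(codons)
    total = sum(counts.values())
    return {codon: n / total for codon, n in counts.items()}


if __name__ == "__main__":
    example = "ATGAAAAAAGAATAA"  # made-up sequence: ATG AAA AAA GAA TAA
    print(stop_codon(example))
    print(codon_frequencies(example))
```

Per-gene values like these could then be compared against measured expression levels, which is the kind of intragenomic relationship the abstract describes; the statistical procedure for that comparison is not given in the abstract and is left out here.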