Horstick Eric J, Jordan Diana C, Bergeron Sadie A, Tabor Kathryn M, Serpe Mihaela, Feldman Benjamin, Burgess Harold A
Program in Genomics of Differentiation, Eunice Kennedy Shriver National Institute of Child Health and Human Development, Bethesda, MD 20892, USA.
Program in Cellular Regulation and Metabolism, Eunice Kennedy Shriver National Institute of Child Health and Human Development, Bethesda, MD 20892, USA.
Nucleic Acids Res. 2015 Apr 20;43(7):e48. doi: 10.1093/nar/gkv035. Epub 2015 Jan 27.
Many genetic manipulations are limited by difficulty in obtaining adequate levels of protein expression. Bioinformatic and experimental studies have identified nucleotide sequence features that may increase expression, however it is difficult to assess the relative influence of these features. Zebrafish embryos are rapidly injected with calibrated doses of mRNA, enabling the effects of multiple sequence changes to be compared in vivo. Using RNAseq and microarray data, we identified a set of genes that are highly expressed in zebrafish embryos and systematically analyzed for enrichment of sequence features correlated with levels of protein expression. We then tested enriched features by embryo microinjection and functional tests of multiple protein reporters. Codon selection, releasing factor recognition sequence and specific introns and 3' untranslated regions each increased protein expression between 1.5- and 3-fold. These results suggested principles for increasing protein yield in zebrafish through biomolecular engineering. We implemented these principles for rational gene design in software for codon selection (CodonZ) and plasmid vectors incorporating the most active non-coding elements. Rational gene design thus significantly boosts expression in zebrafish, and a similar approach will likely elevate expression in other animal models.
许多基因操作受到难以获得足够水平蛋白质表达的限制。生物信息学和实验研究已经确定了可能增加表达的核苷酸序列特征,然而,很难评估这些特征的相对影响。将校准剂量的mRNA快速注射到斑马鱼胚胎中,能够在体内比较多个序列变化的影响。利用RNA测序和微阵列数据,我们鉴定了一组在斑马鱼胚胎中高表达的基因,并系统分析了与蛋白质表达水平相关的序列特征的富集情况。然后,我们通过胚胎显微注射和多种蛋白质报告基因的功能测试对富集的特征进行了测试。密码子选择、释放因子识别序列以及特定的内含子和3'非翻译区均可使蛋白质表达提高1.5至3倍。这些结果提出了通过生物分子工程提高斑马鱼蛋白质产量的原则。我们将这些原则应用于密码子选择软件(CodonZ)中的合理基因设计以及包含最活跃非编码元件的质粒载体中。合理的基因设计因此显著提高了斑马鱼中的表达,类似的方法可能也会提高其他动物模型中的表达。