Noderer William L, Flockhart Ross J, Bhaduri Aparna, Diaz de Arce Alexander J, Zhang Jiajing, Khavari Paul A, Wang Clifford L
Department of Chemical Engineering, Stanford University, Stanford, CA, USA.
The Program in Epithelial Biology, Stanford University School of Medicine, Stanford, CA, USA.
Mol Syst Biol. 2014 Aug 28;10(8):748. doi: 10.15252/msb.20145136.
An approach combining fluorescence-activated cell sorting and high-throughput DNA sequencing (FACS-seq) was employed to determine the efficiency of start codon recognition for all possible translation initiation sites (TIS) utilizing AUG start codons. Using FACS-seq, we measured translation from a genetic reporter library representing all 65,536 possible TIS sequences spanning the -6 to +5 positions. We found that the motif RYMRMVAUGGC enhanced start codon recognition and translation efficiency. However, dinucleotide interactions, which cannot be conveyed by a single motif, were also important for modeling TIS efficiency. Our dataset combined with modeling allowed us to predict genome-wide translation initiation efficiency for all mRNA transcripts. Additionally, we screened somatic TIS mutations associated with tumorigenesis to identify candidate driver mutations consistent with known tumor expression patterns. Finally, we implemented a quantitative leaky scanning model to predict alternative initiation sites that produce truncated protein isoforms and compared predictions with ribosome footprint profiling data. The comprehensive analysis of the TIS sequence space enables quantitative predictions of translation initiation based on genome sequence.
采用一种结合荧光激活细胞分选和高通量DNA测序的方法(FACS-seq)来确定利用AUG起始密码子的所有可能翻译起始位点(TIS)的起始密码子识别效率。使用FACS-seq,我们测量了来自一个遗传报告文库的翻译,该文库代表了跨越-6至+5位置的所有65536种可能的TIS序列。我们发现基序RYMRMVAUGGC增强了起始密码子识别和翻译效率。然而,单基序无法体现的二核苷酸相互作用对于模拟TIS效率也很重要。我们的数据集与模型相结合,使我们能够预测所有mRNA转录本的全基因组翻译起始效率。此外,我们筛选了与肿瘤发生相关的体细胞TIS突变,以识别与已知肿瘤表达模式一致的候选驱动突变。最后,我们实施了一个定量漏扫描模型来预测产生截短蛋白异构体的替代起始位点,并将预测结果与核糖体足迹分析数据进行比较。对TIS序列空间的全面分析能够基于基因组序列对翻译起始进行定量预测。