Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, USA.
Computational Biology Department, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, USA.
Genome Res. 2018 Feb;28(2):214-222. doi: 10.1101/gr.221507.117. Epub 2017 Dec 18.
Upstream open reading frames (uORFs), located in transcript leaders (5' UTRs), are potent -acting regulators of translation and mRNA turnover. Recent genome-wide ribosome profiling studies suggest that thousands of uORFs initiate with non-AUG start codons. Although intriguing, these non-AUG uORF predictions have been made without statistical control or validation; thus, the importance of these elements remains to be demonstrated. To address this, we took a comparative genomics approach to study AUG and non-AUG uORFs. We mapped transcription leaders in multiple yeast species and applied a novel machine learning algorithm (uORF-seqr) to ribosome profiling data to identify statistically significant uORFs. We found that AUG and non-AUG uORFs are both frequently found in yeasts. Although most non-AUG uORFs are found in only one species, hundreds have either conserved sequence or position within uORFs initiating with UUG are particularly common and are shared between species at rates similar to that of AUG uORFs. However, non-AUG uORFs are translated less efficiently than AUG-uORFs and are less subject to removal via alternative transcription initiation under normal growth conditions. These results suggest that a subset of non-AUG uORFs may play important roles in regulating gene expression.
上游开放阅读框 (uORFs) 位于转录本的前导区(5'UTR),是一种强大的翻译和 mRNA 周转调控因子。最近的全基因组核糖体分析研究表明,数千个 uORFs 以非 AUG 起始密码子开始。尽管这些非 AUG uORF 的预测很有趣,但它们是在没有统计学控制或验证的情况下做出的;因此,这些元件的重要性仍有待证明。为了解决这个问题,我们采用了比较基因组学的方法来研究 AUG 和非 AUG uORFs。我们在多个酵母物种中绘制了转录本前导区,并应用了一种新的机器学习算法 (uORF-seqr) 对核糖体分析数据进行分析,以识别具有统计学意义的 uORFs。我们发现,AUG 和非 AUG uORFs 在酵母中都经常被发现。虽然大多数非 AUG uORFs 仅在一个物种中被发现,但数百个非 AUG uORFs 具有保守的序列或位置,以 UUG 起始的 uORFs 尤其常见,并且在物种间的共享率与 AUG uORFs 相似。然而,非 AUG uORFs 的翻译效率低于 AUG-uORFs,并且在正常生长条件下,通过替代转录起始去除的可能性也较小。这些结果表明,一部分非 AUG uORFs 可能在调节基因表达中发挥重要作用。