Molecular and Cellular Biology, University of Arizona, Tucson, AZ.
Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ.
Mol Biol Evol. 2020 Jun 1;37(6):1761-1774. doi: 10.1093/molbev/msaa046.
De novo protein-coding innovations sometimes emerge from ancestrally noncoding DNA, despite the expectation that translating random sequences is overwhelmingly likely to be deleterious. The "preadapting selection" hypothesis claims that emergence is facilitated by prior, low-level translation of noncoding sequences via molecular errors. It predicts that selection on polypeptides translated only in error is strong enough to matter and is strongest when erroneous expression is high. To test this hypothesis, we examined noncoding sequences located downstream of stop codons (i.e., those potentially translated by readthrough errors) in Saccharomyces cerevisiae genes. We identified a class of "fragile" proteins under strong selection to reduce readthrough, which are unlikely substrates for co-option. Among the remainder, sequences showing evidence of readthrough translation, as assessed by ribosome profiling, encoded C-terminal extensions with higher intrinsic structural disorder, supporting the preadapting selection hypothesis. The cryptic sequences beyond the stop codon, rather than spillover effects from the regular C-termini, are primarily responsible for the higher disorder. Results are robust to controlling for the fact that stronger selection also reduces the length of C-terminal extensions. These findings indicate that selection acts on 3' UTRs in Saccharomyces cerevisiae to purge potentially deleterious variants of cryptic polypeptides, acting more strongly in genes that experience more readthrough errors.
新的蛋白质编码创新有时会从祖先非编码 DNA 中出现,尽管人们普遍认为翻译随机序列极有可能是有害的。“预适应选择”假说声称,通过分子错误,先前低水平的非编码序列的翻译促进了出现。它预测,仅因错误而翻译的多肽的选择足够强而重要,并且在错误表达较高时最强。为了检验这一假说,我们研究了酿酒酵母基因中终止密码子下游的非编码序列(即,那些可能通过通读错误翻译的序列)。我们鉴定了一类在强烈选择下减少通读的“脆弱”蛋白,它们不太可能被共选择。在其余的序列中,核糖体图谱评估显示存在通读翻译的证据,其编码的 C 末端延伸具有更高的内在结构无序性,支持预适应选择假说。终止密码子后面的隐匿序列,而不是来自常规 C 末端的溢出效应,主要负责更高的无序性。结果对于控制更强的选择也会降低 C 末端延伸的长度这一事实是稳健的。这些发现表明,在酿酒酵母中,选择作用于 3'UTR 以清除隐匿多肽的潜在有害变体,在经历更多通读错误的基因中作用更强。