Incipient de novo genes can evolve from frozen accidents that escaped rapid transcript turnover.

Affiliations

Institute for Evolution and Biodiversity, University of Münster, Münster, Germany.

Max-Planck Institute for Evolutionary Biology, Plön, Germany.

Publication information

Nat Ecol Evol. 2018 Oct;2(10):1626-1632. doi: 10.1038/s41559-018-0639-7. Epub 2018 Sep 10.

Abstract

A recent surge of studies have suggested that many novel genes arise de novo from previously noncoding DNA and not by duplication. However, most studies concentrated on longer evolutionary time scales and rarely considered protein structural properties. Therefore, it remains unclear how these properties are shaped by evolution, depend on genetic mechanisms and influence gene survival. Here we compare open reading frames (ORFs) from high coverage transcriptomes from mouse and another four mammals covering 160 million years of evolution. We find that novel ORFs pervasively emerge from noncoding regions but are rapidly lost again, while relatively fewer arise from the divergence of coding sequences but are retained much longer. We also find that a subset (14%) of the mouse-specific ORFs bind ribosomes and are potentially translated, showing that such ORFs can be the starting points of gene emergence. Surprisingly, disorder and other protein properties of young ORFs hardly change with gene age in short time frames. Only length and nucleotide composition change significantly. Thus, some transcribed de novo genes resemble 'frozen accidents' of randomly emerged ORFs that survived initial purging. This perspective complies with very recent studies indicating that some neutrally evolving transcripts containing random protein sequences may be translated and be viable starting points of de novo gene emergence.
