Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America; Department of Biology, University of Maryland at College Park, College Park, Maryland, United States of America.
PLoS Genet. 2013;9(10):e1003860. doi: 10.1371/journal.pgen.1003860. Epub 2013 Oct 17.
How non-coding DNA gives rise to new protein-coding genes (de novo genes) is not well understood. Recent work has revealed the origins and functions of a few de novo genes, but common principles governing the evolution or biological roles of these genes are unknown. To better define these principles, we performed a parallel analysis of the evolution and function of six putatively protein-coding de novo genes described in Drosophila melanogaster. Reconstruction of the transcriptional history of these genes shows that two of them emerged from novel long non-coding RNAs that arose at least 5 million years (MY) before an open reading frame evolved. In contrast, the four other de novo genes evolved a translated open reading frame and transcription within the same evolutionary interval, suggesting that nascent open reading frames (proto-ORFs), while not required, can contribute to the emergence of a new de novo gene. However, none of the genes arose from proto-ORFs that existed long before expression evolved. Sequence and structural evolution of de novo genes was rapid compared with that of nearby genes, and the structural complexity of de novo genes steadily increased over evolutionary time. Although these genes are transcribed at a higher level in males than in females and are most strongly expressed in testes, RNAi experiments show that most of them are essential in both sexes during metamorphosis. This lethality suggests that protein-coding de novo genes in Drosophila quickly become functionally important.