Moyers Bryan A, Zhang Jianzhi
Department of Computational Medicine and Bioinformatics, University of Michigan.
Department of Ecology and Evolutionary Biology, University of Michigan
Mol Biol Evol. 2016 May;33(5):1245-56. doi: 10.1093/molbev/msw008. Epub 2016 Jan 11.
The source of genetic novelty is an area of wide interest and intense investigation. Although gene duplication is conventionally thought to dominate the production of new genes, this view was recently challenged by a proposal of widespread de novo gene origination in eukaryotic evolution. Specifically, distributions of various gene properties such as coding sequence length, expression level, codon usage, and probability of being subject to purifying selection among groups of genes with different estimated ages were reported to support a model in which new protein-coding proto-genes arise from noncoding DNA and gradually integrate into cellular networks. Here we show that the genomic patterns asserted to support widespread de novo gene origination are largely attributable to biases in gene age estimation by phylostratigraphy, because such patterns are also observed in phylostratigraphic analysis of simulated genes bearing identical ages. Furthermore, there is no evidence of purifying selection on very young de novo genes previously claimed to show such signals. Together, these findings are consistent with the prevailing view that de novo gene birth is a relatively minor contributor to new genes in genome evolution. They also illustrate the danger of using phylostratigraphy in the study of new gene origination without considering its inherent bias.
遗传新奇性的来源是一个备受广泛关注且深入研究的领域。虽然传统观点认为基因复制主导新基因的产生,但最近这一观点受到了真核生物进化中广泛存在从头起源基因这一观点的挑战。具体而言,据报道,不同估计年龄的基因群体在编码序列长度、表达水平、密码子使用以及受到纯化选择的概率等各种基因特性的分布情况,支持了一种模型,即新的蛋白质编码原基因从非编码DNA中产生,并逐渐融入细胞网络。在这里,我们表明,那些据称支持广泛从头起源基因的基因组模式,很大程度上归因于系统发育地层学在估计基因年龄时的偏差,因为在对具有相同年龄的模拟基因进行系统发育地层分析时也观察到了此类模式。此外,对于先前声称显示出此类信号的非常年轻的从头起源基因,没有证据表明存在纯化选择。总之,这些发现与普遍观点一致,即从头起源基因在基因组进化中对新基因的贡献相对较小。它们还说明了在研究新基因起源时使用系统发育地层学而不考虑其固有偏差的危险性。