Schmitz Jonathan F, Bornberg-Bauer Erich
Institute for Evolution and Biodiversity, University of Muenster, Muenster, Germany.
F1000Res. 2017 Jan 19;6:57. doi: 10.12688/f1000research.10079.1. eCollection 2017.
Over the last few years, there has been an increasing amount of evidence for the emergence of protein-coding genes, i.e. out of non-coding DNA. Here, we review the current literature and summarize the state of the field. We focus specifically on open questions and challenges in the study of protein-coding genes such as the identification and verification of -emerged genes. The greatest obstacle to date is the lack of high-quality genomic data with very short divergence times which could help precisely pin down the location of origin of a gene. We conclude that, while there is plenty of evidence from a genetics perspective, there is a lack of functional studies of bona fide genes and almost no knowledge about protein structures and how they come about during the emergence of protein-coding genes. We suggest that future studies should concentrate on the functional and structural characterization of protein-coding genes as well as the detailed study of the emergence of functional protein-coding genes.
在过去几年中,越来越多的证据表明蛋白质编码基因正在从非编码DNA中产生。在此,我们回顾当前的文献并总结该领域的现状。我们特别关注蛋白质编码基因研究中的开放性问题和挑战,例如新出现基因的识别和验证。迄今为止最大的障碍是缺乏高质量的基因组数据,这些数据的分歧时间非常短,有助于精确确定基因的起源位置。我们得出结论,虽然从遗传学角度有大量证据,但缺乏对真正基因的功能研究,并且几乎不了解蛋白质结构以及它们在蛋白质编码基因出现过程中是如何形成的。我们建议未来的研究应集中在蛋白质编码基因的功能和结构表征以及功能性蛋白质编码基因出现的详细研究上。