Dept. of Evolutionary Biology and the Institute of Evolution, University of Haifa, Haifa, Israel.
Dept. of Molecular, Cell and Developmental Biology; University of California Los Angeles, Los Angeles, CA 90095, United States of America.
PLoS One. 2018 Nov 1;13(11):e0204322. doi: 10.1371/journal.pone.0204322. eCollection 2018.
Pseudogenes are non-functional sequences in the genome with homologous sequences that are functional (i.e. genes). They are abundant in eukaryotes where they have been extensively investigated, while in prokaryotes they are significantly scarcer and less well studied. Here we conduct a comprehensive analysis of the evolution of orthologs of Mycobacterium leprae pseudogenes in prokaryotes. The leprosy pathogen M. leprae is of particular interest since it contains an unusually large number of pseudogenes, comprising approximately 40% of its entire genome. The analysis is conducted in both broad and narrow phylogenetic ranges.
We have developed an informatics-based approach to characterize the evolution of pseudogenes. This approach combines tools from phylogenomics, genomics, and transcriptomics. The results we obtain are used to assess the contributions of two mechanisms for pseudogene formation: failed horizontal gene transfer events and disruption of native genes.
We conclude that, although it was reported that in most bacteria the former is most likely responsible for the majority of pseudogenization events, in mycobacteria, and in particular in M. leprae with its exceptionally high pseudogene numbers, the latter predominates. We believe that our study sheds new light on the evolution of pseudogenes in bacteria, by utilizing new methodologies that are applied to the unusually abundant M. leprae pseudogenes and their orthologs.
假基因是基因组中具有同源序列但无功能的序列(即基因)。它们在真核生物中大量存在,并得到了广泛研究,而在原核生物中则相对较少,研究也较少。在这里,我们对麻风分枝杆菌假基因在原核生物中的同源物的进化进行了全面分析。麻风分枝杆菌是一种特别有趣的病原体,因为它含有异常数量的假基因,约占其整个基因组的 40%。该分析在广泛和狭义的系统发育范围内进行。
我们开发了一种基于信息学的方法来描述假基因的进化。该方法结合了系统基因组学、基因组学和转录组学的工具。我们得到的结果用于评估两种假基因形成机制的贡献:失败的水平基因转移事件和原生基因的破坏。
我们的结论是,尽管有报道称在大多数细菌中,前者最有可能导致大多数假基因化事件,但在分枝杆菌中,特别是在麻风分枝杆菌中,其异常高的假基因数量,后者占主导地位。我们相信,我们的研究通过利用新的方法来研究异常丰富的麻风分枝杆菌假基因及其同源物,为细菌假基因的进化提供了新的见解。