National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894, U.S.A.
Biochem Soc Trans. 2019 Feb 28;47(1):389-398. doi: 10.1042/BST20180560. Epub 2019 Feb 1.
A substantial fraction of archaeal genes, from ∼30% to as much as 80%, encode 'hypothetical' proteins, or genomic 'dark matter'. Archaeal genomes typically contain a higher fraction of dark matter than bacterial genomes, primarily because isolation and cultivation of most archaea in the laboratory and, accordingly, experimental characterization of archaeal genes are difficult. Here, we present quantitative characteristics of the archaeal genomic dark matter and discuss comparative genomic approaches for functional prediction of 'hypothetical' proteins. We propose a list of top-priority candidates for experimental characterization: proteins with a broad distribution among archaea and proteins characteristic of poorly studied major archaeal groups such as Thaumarchaea, DPANN (Diapherotrites, Parvarchaeota, Aenigmarchaeota, Nanoarchaeota and Nanohaloarchaeota) and Asgard.