Department of Earth and Planetary Science, University of California, Berkeley, CA, USA.
Innovative Genomics Institute, University of California, Berkeley, CA, USA.
BMC Biol. 2022 Jul 5;20(1):154. doi: 10.1186/s12915-022-01348-6.
Archaea play fundamental roles in the environment, for example by methane production and consumption, ammonia oxidation, protein degradation, carbon compound turnover, and sulfur compound transformations. Recent genomic analyses have profoundly reshaped our understanding of the distribution and functionalities of Archaea and their roles in eukaryotic evolution.
Here, 1179 representative genomes were selected from 3197 archaeal genomes. The representative genomes clustered based on the content of 10,866 newly defined archaeal protein families (that will serve as a community resource) recapitulates archaeal phylogeny. We identified the co-occurring proteins that distinguish the major lineages. Those with metabolic roles were consistent with experimental data. However, two families specific to Asgard were determined to be new eukaryotic signature proteins. Overall, the blocks of lineage-specific families are dominated by proteins that lack functional predictions.
Given that these hypothetical proteins are near ubiquitous within major archaeal groups, we propose that they were important in the origin of most of the major archaeal lineages. Interestingly, although there were clearly phylum-specific co-occurring proteins, no such blocks of protein families were shared across superphyla, suggesting a burst-like origin of new lineages early in archaeal evolution.
古菌在环境中发挥着基本作用,例如产生和消耗甲烷、氨氧化、蛋白质降解、碳化合物转化和硫化合物转化。最近的基因组分析深刻地改变了我们对古菌的分布和功能及其在真核生物进化中的作用的理解。
从 3197 个古菌基因组中选择了 1179 个代表性基因组。基于 10866 个新定义的古菌蛋白家族的内容聚类的代表性基因组再现了古菌的系统发育。我们确定了区分主要谱系的共同出现的蛋白质。那些具有代谢作用的蛋白质与实验数据一致。然而,确定两个特定于 Asgard 的家族是新的真核生物特征蛋白。总体而言,谱系特异性家族的块主要由缺乏功能预测的蛋白质组成。
鉴于这些假设的蛋白质在主要古菌群中普遍存在,我们提出它们在大多数主要古菌谱系的起源中很重要。有趣的是,尽管存在明显的门特异性共同出现的蛋白质,但没有这样的超门共享的蛋白质家族块,这表明在古菌进化的早期,新谱系的起源是爆发式的。