Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, Jiangsu, China.
Department of Bioinformatics, School of Medical Informatics and Engineering, Xuzhou Medical University, Xuzhou, Jiangsu, China.
J Biomol Struct Dyn. 2021 Nov;39(18):7080-7092. doi: 10.1080/07391102.2020.1808531. Epub 2020 Aug 21.
Archaea are a group of primary life forms on Earth and could thrive in many unique environments. Their successful colonization of extreme niches requires corresponding adaptations at proteogenomic level in order to maintain stable cellular structures and active physiological functions. Although some studies have already investigated the extremophilic lifestyles of archaeal species based on genomic features and protein structures, there is a lack of comparative proteogenomic analysis in a large scale. In this study, we explored 686 high-quality archaeal genomes (proteomes) sourced from the Pathosystems Resource Integration Center (PATRIC) database. General patterns of genomic features such as genome size, coding capacity (coding genes and non-coding regions), and G + C contents were re-confirmed. Protein domain distribution patterns were then identified across archaeal species. Domains with unknown functions (DUFs) and mini proteins were investigated in terms of their distributions due to their importance in archaeal physiological functions. In addition, physicochemical properties of protein sequences, such as stability, hydrophobicity, isoelectric point, aromaticity and amino acid compositions in corresponding archaeal groups were compared. Unique features associated with extremophilic lifestyles were observed, which suggested that evolutionary adaptations to different extreme environments had intrinsic impacts on archaeal protein features. Taken together, this systematic study facilitates a better understanding of the mechanisms behind the extremophilic lifestyles of archaeal species, which will further contribute to the evolutionary explorations of archaeal adaptations both experimentally and theoretically in the future studies.Communicated by Ramaswamy H. Sarma.
古菌是地球上主要的生命形式之一,可以在许多独特的环境中茁壮成长。它们在极端小生境中的成功殖民化需要在蛋白质组学水平上进行相应的适应,以维持稳定的细胞结构和活跃的生理功能。尽管一些研究已经根据基因组特征和蛋白质结构研究了古菌物种的极端生境生活方式,但在大规模范围内缺乏比较蛋白质组学分析。在这项研究中,我们探索了来自 Pathosystems Resource Integration Center(PATRIC)数据库的 686 个高质量古菌基因组(蛋白质组)。重新确认了基因组特征的一般模式,如基因组大小、编码能力(编码基因和非编码区)和 G+C 含量。然后在古菌物种中确定了蛋白质结构域的分布模式。由于它们在古菌生理功能中的重要性,研究了具有未知功能(DUFs)和小型蛋白质的结构域的分布。此外,还比较了蛋白质序列的物理化学性质,如稳定性、疏水性、等电点、芳香性和相应古菌组中的氨基酸组成。观察到与极端生境生活方式相关的独特特征,这表明对不同极端环境的进化适应对内生影响了古菌蛋白的特征。总之,这项系统研究有助于更好地理解古菌物种极端生境生活方式的机制,这将有助于未来在实验和理论上进一步探索古菌的适应性。由 Ramaswamy H. Sarma 交流。