Axen Seth D, Erbilgin Onur, Kerfeld Cheryl A
DOE Joint Genome Institute, Walnut Creek, California, United States of America.
Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, California, United States of America.
PLoS Comput Biol. 2014 Oct 23;10(10):e1003898. doi: 10.1371/journal.pcbi.1003898. eCollection 2014 Oct.
Bacterial microcompartments (BMCs) are proteinaceous organelles involved in both autotrophic and heterotrophic metabolism. All BMCs share homologous shell proteins but differ in their complement of enzymes; these are typically encoded adjacent to shell protein genes in genetic loci, or operons. To enable the identification and prediction of functional (sub)types of BMCs, we developed LoClass, an algorithm that finds putative BMC loci and inventories, weights, and compares their constituent pfam domains to construct a locus similarity network and predict locus (sub)types. In addition to using LoClass to analyze sequences in the Non-redundant Protein Database, we compared predicted BMC loci found in seven candidate bacterial phyla (six from single-cell genomic studies) to the LoClass taxonomy. Together, these analyses resulted in the identification of 23 different types of BMCs encoded in 30 distinct locus (sub)types found in 23 bacterial phyla. These include the two carboxysome types and a divergent set of metabolosomes, BMCs that share a common catalytic core and process distinct substrates via specific signature enzymes. Furthermore, many Candidate BMCs were found that lack one or more core metabolosome components, including one that is predicted to represent an entirely new paradigm for BMC-associated metabolism, joining the carboxysome and metabolosome. By placing these results in a phylogenetic context, we provide a framework for understanding the horizontal transfer of these loci, a starting point for studies aimed at understanding the evolution of BMCs. This comprehensive taxonomy of BMC loci, based on their constituent protein domains, foregrounds the functional diversity of BMCs and provides a reference for interpreting the role of BMC gene clusters encoded in isolate, single cell, and metagenomic data. Many loci encode ancillary functions such as transporters or genes for cofactor assembly; this expanded vocabulary of BMC-related functions should be useful for design of genetic modules for introducing BMCs in bioengineering applications.
细菌微区室(BMCs)是参与自养和异养代谢的蛋白质细胞器。所有的BMCs都有同源的外壳蛋白,但酶的组成不同;这些酶通常在基因位点或操纵子中与外壳蛋白基因相邻编码。为了能够识别和预测BMCs的功能(亚)类型,我们开发了LoClass算法,该算法可以找到假定的BMC位点,并对其组成的pfam结构域进行清点、加权和比较,以构建一个位点相似性网络并预测位点(亚)类型。除了使用LoClass分析非冗余蛋白质数据库中的序列外,我们还将在七个候选细菌门(六个来自单细胞基因组研究)中发现的预测BMC位点与LoClass分类法进行了比较。这些分析共同导致了在23个细菌门中发现的30种不同的位点(亚)类型中编码的23种不同类型的BMCs的识别。其中包括两种羧酶体类型和一组不同的代谢体,这些BMCs共享一个共同的催化核心,并通过特定的标志性酶处理不同的底物。此外,还发现了许多缺少一种或多种核心代谢体成分的候选BMCs,其中一种预计代表了与BMC相关代谢的全新模式,加入了羧酶体和代谢体。通过将这些结果置于系统发育背景下,我们提供了一个理解这些位点水平转移的框架,这是旨在理解BMCs进化的研究的起点。这种基于其组成蛋白质结构域的BMC位点综合分类法突出了BMCs的功能多样性,并为解释分离株、单细胞和宏基因组数据中编码的BMC基因簇的作用提供了参考。许多位点编码辅助功能,如转运蛋白或辅因子组装基因;这种扩展的与BMC相关的功能词汇对于在生物工程应用中引入BMCs的遗传模块设计应该是有用的。