College of Life Sciences, Zhejiang University, Hangzhou 310058, PR China.
BMC Bioinformatics. 2012 Jun 25;13 Suppl 10(Suppl 10):S8. doi: 10.1186/1471-2105-13-S10-S8.
Streptomycetes are filamentous soil-dwelling bacteria. They are best known as the producers of a great variety of natural products such as antibiotics, antifungals, antiparasitics, and anticancer agents and the decomposers of organic substances for carbon recycling. They are also model organisms for the studies of gene regulatory networks, morphological differentiation, and stress response. The availability of sets of genomes from closely related Streptomyces strains makes it possible to assess the mechanisms underlying genome plasticity and systems adaptation.
We present the results of a comprehensive analysis of the genomes of five Streptomyces species with distinct phenotypes. These streptomycetes have a pan-genome comprised of 17,362 orthologous families which includes 3,096 components in the core genome, 5,066 components in the dispensable genome, and 9,200 components that are uniquely present in only one species. The core genome makes up about 33%-45% of each genome repertoire. It contains important genes for Streptomyces biology including those involved in gene regulation, secretion, secondary metabolism and morphological differentiation. Abundant duplicate genes have been identified, with 4%-11% of the whole genomes composed of lineage-specific expansions (LSEs), suggesting that frequent gene duplication or lateral gene transfer events play a role in shaping the genome diversification within this genus. Two patterns of expansion, single gene expansion and chromosome block expansion are observed, representing different scales of duplication.
Our results provide a catalog of genome components and their potential functional roles in gene regulatory networks and metabolic networks. The core genome components reveal the minimum requirement for streptomycetes to sustain a successful lifecycle in the soil environment, reflecting the effects of both genome evolution and environmental stress acting upon the expressed phenotypes. A better understanding of the LSE gene families will, on the other hand, bring a wealth of new insights into the mechanisms underlying strain-specific phenotypes, such as the production of novel antibiotics, pathogenesis, and adaptive response to environmental challenges.
链霉菌是一种丝状土壤细菌。它们以生产各种天然产物而闻名,如抗生素、抗真菌药、抗寄生虫药和抗癌药,以及有机物质的分解者,用于碳循环。它们也是研究基因调控网络、形态分化和应激反应的模式生物。具有密切相关链霉菌菌株基因组集的可用性使得评估基因组可塑性和系统适应的机制成为可能。
我们介绍了对具有不同表型的五种链霉菌物种的基因组进行全面分析的结果。这些链霉菌具有一个由 17362 个直系同源家族组成的泛基因组,其中包括核心基因组中的 3096 个组成部分、可分配基因组中的 5066 个组成部分以及仅存在于一种物种中的 9200 个组成部分。核心基因组约占每个基因组库的 33%-45%。它包含与链霉菌生物学相关的重要基因,包括参与基因调控、分泌、次生代谢和形态分化的基因。大量重复基因已经被鉴定出来,整个基因组的 4%-11%由谱系特异性扩张(LSE)组成,这表明频繁的基因复制或横向基因转移事件在该属的基因组多样化中起作用。观察到两种扩张模式,即单个基因扩张和染色体块扩张,代表不同的复制规模。
我们的研究结果提供了基因组组成及其在基因调控网络和代谢网络中的潜在功能作用的目录。核心基因组组成揭示了链霉菌在土壤环境中维持成功生命周期的最低要求,反映了基因组进化和环境压力对表现型的影响。另一方面,更好地了解 LSE 基因家族将为菌株特异性表型(如新型抗生素的产生、发病机制和对环境挑战的适应性反应)的机制提供丰富的新见解。