Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea.
KAIST Institute for the BioCentury, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea.
Nucleic Acids Res. 2019 Jul 9;47(12):6114-6129. doi: 10.1093/nar/gkz471.
Determining transcriptional and translational regulatory elements in GC-rich Streptomyces genomes is essential to elucidating the complex regulatory networks that govern secondary metabolite biosynthetic gene cluster (BGC) expression. However, information about such regulatory elements has been limited for Streptomyces genomes. To address this limitation, a high-quality genome sequence of β-lactam antibiotic-producing Streptomyces clavuligerus ATCC 27 064 is completed, which contains 7163 newly annotated genes. This provides a fundamental reference genome sequence to integrate multiple genome-scale data types, including dRNA-Seq, RNA-Seq and ribosome profiling. Data integration results in the precise determination of 2659 transcription start sites which reveal transcriptional and translational regulatory elements, including -10 and -35 promoter components specific to sigma (σ) factors, and 5'-untranslated region as a determinant for translation efficiency regulation. Particularly, sequence analysis of a wide diversity of the -35 components enables us to predict potential σ-factor regulons, along with various spacer lengths between the -10 and -35 elements. At last, the primary transcriptome landscape of the β-lactam biosynthetic pathway is analyzed, suggesting temporal changes in metabolism for the synthesis of secondary metabolites driven by transcriptional regulation. This comprehensive genetic information provides a versatile genetic resource for rational engineering of secondary metabolite BGCs in Streptomyces.
确定富含 GC 的链霉菌基因组中的转录和翻译调控元件对于阐明调控次级代谢物生物合成基因簇 (BGC) 表达的复杂调控网络至关重要。然而,关于这些调控元件的信息在链霉菌基因组中一直很有限。为了解决这个限制,完成了生产β-内酰胺抗生素的链霉菌 clavuligerus ATCC 27064 的高质量基因组序列,其中包含 7163 个新注释的基因。这为整合多种基因组规模的数据类型提供了一个基本的参考基因组序列,包括 dRNA-Seq、RNA-Seq 和核糖体分析。数据集成精确地确定了 2659 个转录起始位点,揭示了转录和翻译调控元件,包括特定于 σ (σ) 因子的 -10 和 -35 启动子组件,以及作为翻译效率调节决定因素的 5'-非翻译区。特别是,-35 组件的广泛多样性的序列分析使我们能够预测潜在的 σ 因子调控子,以及 -10 和 -35 元件之间的各种间隔长度。最后,分析了β-内酰胺生物合成途径的初级转录组图谱,表明代谢的时间变化是由转录调控驱动的次级代谢物的合成。这种综合的遗传信息为在链霉菌中进行次级代谢物 BGC 的合理工程提供了一种多功能的遗传资源。