Department of Biological Sciences and KI for the BioCentury, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Republic of Korea.
Department of Bioengineering, University of California San Diego, La Jolla, CA, 92093, USA.
Sci Data. 2020 Dec 15;7(1):436. doi: 10.1038/s41597-020-00775-w.
Streptomyces species are gram-positive bacteria with GC-rich linear genomes and they serve as dominant reservoirs for producing clinically and industrially important secondary metabolites. Genome mining of Streptomyces revealed that each Streptomyces species typically encodes 20-50 secondary metabolite biosynthetic gene clusters (smBGCs), emphasizing their potential for novel compound discovery. Unfortunately, most of smBGCs are uncharacterized in terms of their products and regulation since they are silent under laboratory culture conditions. To translate the genomic potential of Streptomyces to practical applications, it is essential to understand the complex regulation of smBGC expression and to identify the underlying regulatory elements. To progress towards these goals, we applied two Next-Generation Sequencing methods, dRNA-Seq and Term-Seq, to industrially relevant Streptomyces species to reveal the 5´ and 3´ boundaries of RNA transcripts on a genome scale. This data provides a fundamental resource to aid our understanding of Streptomyces' regulation of smBGC expression and to enhance their potential for secondary metabolite synthesis.
链霉菌是革兰氏阳性细菌,具有 GC 含量丰富的线性基因组,它们是产生具有临床和工业重要性的次级代谢产物的主要储库。链霉菌的基因组挖掘表明,每个链霉菌物种通常编码 20-50 个次级代谢生物合成基因簇(smBGC),强调了它们在发现新化合物方面的潜力。不幸的是,由于它们在实验室培养条件下处于沉默状态,大多数 smBGC 在其产物和调控方面都没有得到描述。为了将链霉菌的基因组潜力转化为实际应用,了解 smBGC 表达的复杂调控并确定潜在的调控元件是至关重要的。为了朝着这些目标前进,我们应用了两种下一代测序方法,dRNA-Seq 和 Term-Seq,来研究工业相关的链霉菌物种,以在基因组范围内揭示 RNA 转录物的 5´和 3´边界。这些数据提供了一个基本资源,有助于我们理解链霉菌对 smBGC 表达的调控,并增强它们合成次级代谢产物的潜力。