Kang Hahk-Soo, Brady Sean F
Laboratory of Genetically Encoded Small Molecules, The Rockefeller University, Howard Hughes Medical Institute , 1230 York Avenue, New York, New York 10065, United States.
J Am Chem Soc. 2014 Dec 31;136(52):18111-9. doi: 10.1021/ja510606j. Epub 2014 Dec 18.
Sequence-guided mining of metagenomic libraries provides a means of recovering specific natural product gene clusters of interest from the environment. In this study, we use ketosynthase gene (KS) PCR amplicon sequences (sequence tags) to explore the structural and biosynthetic diversities of pentangular polyphenols (PP). In phylogenetic analyses, eDNA-derived sequence tags often fall between closely related clades that are associated with gene clusters known to encode distinct chemotypes. We show that these common "intermediate" sequence tags are useful for guiding the discovery of not only novel bioactive metabolites but also collections of closely related gene clusters that can provide new insights into the evolution of natural product structural diversity. Gene clusters corresponding to two eDNA-derived KSβ sequence tags that reside between well-defined KSβ clades associated with the biosynthesis of (C24)-pradimicin and (C26)-xantholipin type metabolites were recovered from archived soil eDNA libraries. Heterologous expression of these gene clusters in Streptomyces albus led to the isolation of three new PPs (compounds 1-3). Calixanthomycin A (1) shows potent antiproliferative activity against HCT-116 cells, whereas arenimycins C (2) and D (3) display potent antibacterial activity. By comparing genotypes and chemotypes across all known PP gene clusters, we define four PP subfamilies, and also observe that the horizontal transfer of PP tailoring genes has likely been restricted to gene clusters that encode closely related chemical structures, suggesting that only a fraction of the "natural product-like" chemical space that can theoretically be encoded by these secondary metabolite tailoring genes has likely been sampled naturally.
基于序列引导的宏基因组文库挖掘为从环境中回收感兴趣的特定天然产物基因簇提供了一种方法。在本研究中,我们使用酮合成酶基因(KS)PCR扩增子序列(序列标签)来探索五角多酚(PP)的结构和生物合成多样性。在系统发育分析中,源自环境DNA的序列标签常常落在与已知编码不同化学型的基因簇相关的密切相关分支之间。我们表明,这些常见的“中间”序列标签不仅有助于发现新型生物活性代谢物,还能用于指导发现密切相关的基因簇集合,从而为天然产物结构多样性的进化提供新见解。从存档的土壤环境DNA文库中回收了与两个源自环境DNA的KSβ序列标签相对应的基因簇,这两个标签位于与(C24)-普拉地霉素和(C26)-黄脂素型代谢物生物合成相关的明确KSβ分支之间。这些基因簇在白色链霉菌中的异源表达导致分离出三种新的PP(化合物1-3)。杯状黄霉素A(1)对HCT-116细胞显示出强大的抗增殖活性,而沙雷霉素C(2)和D(3)则具有强大的抗菌活性。通过比较所有已知PP基因簇的基因型和化学型,我们定义了四个PP亚家族,并且还观察到PP修饰基因的水平转移可能仅限于编码密切相关化学结构的基因簇,这表明这些次生代谢物修饰基因理论上可以编码的“类天然产物”化学空间中只有一小部分可能已被自然采样。