Du Minghui, Ren Yuxiang, Zhang Yang, Li Wenwen, Yang Hongtao, Chu Huiying, Zhao Yongshan
School of Life Science and Bio-Pharmaceutics, Shenyang Pharmaceutical University, Shenyang, 110016, China.
State Key Laboratory of Molecular Reaction Dynamics, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian, 116000, China.
Interdiscip Sci. 2025 Mar;17(1):27-41. doi: 10.1007/s12539-024-00656-5. Epub 2024 Sep 30.
The sluggish pace of new antibacterial drug development reflects a vulnerability in the face of the current severe threat posed by bacterial resistance. Microbial natural products (NPs), as a reservoir of immense chemical potential, have emerged as the most promising avenue for the discovery of next generation antibacterial agent. Directly accessing the antibacterial activity of potential products derived from biosynthetic gene clusters (BGCs) would significantly expedite the process. To tackle this issue, we propose a CSEL-BGC framework that integrates machine learning (ML) techniques. This framework involves the development of a novel cascade-stacking ensemble learning (CSEL) model and the establishment of a groundbreaking model evaluation system. Based on this framework, we predict 6,666 BGCs with antibacterial activity from 3,468 complete bacterial genomes and elucidate a biosynthetic evolutionary landscape to reveal their antibacterial potential. This provides crucial insights for interpretating the synthesis and secretion mechanisms of unknown NPs.
新型抗菌药物研发的缓慢步伐反映出面对当前细菌耐药性构成的严重威胁时的脆弱性。微生物天然产物(NPs)作为巨大化学潜力的宝库,已成为发现下一代抗菌剂最有前景的途径。直接获取源自生物合成基因簇(BGCs)的潜在产物的抗菌活性将显著加快这一进程。为解决这个问题,我们提出了一个整合机器学习(ML)技术的CSEL - BGC框架。该框架涉及开发一种新型的级联堆叠集成学习(CSEL)模型以及建立一个开创性的模型评估系统。基于此框架,我们从3468个完整细菌基因组中预测出6666个具有抗菌活性的BGCs,并阐明生物合成进化景观以揭示它们的抗菌潜力。这为解释未知NPs的合成和分泌机制提供了关键见解。