Terlouw Barbara R, Biermann Friederike, Vromans Sophie P J M, Zamani Elham, Helfrich Eric J N, Medema Marnix H
Bioinformatics Group, Wageningen University, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands.
Institute for Molecular Bio Science, Goethe University Frankfurt, Max-von-Laue Strasse 9, 60438, Frankfurt am Main, Germany.
J Cheminform. 2024 Sep 3;16(1):106. doi: 10.1186/s13321-024-00898-x.
Natural products are molecules that fulfil a range of important ecological functions. Many natural products have been exploited for pharmaceutical and agricultural applications. In contrast to many other specialised metabolites, the products of modular nonribosomal peptide synthetase (NRPS) and polyketide synthase (PKS) systems can often (partially) be predicted from the DNA sequence of the biosynthetic gene clusters. This is because the biosynthetic pathways of NRPS and PKS systems adhere to consistent rulesets. These universal biosynthetic rules can be leveraged to generate biosynthetic models of biosynthetic pathways. While these principles have been largely deciphered, software that leverages these rules to automatically generate visualisations of biosynthetic models has not yet been developed. To enable high-quality automated visualisations of natural product biosynthetic pathways, we developed RAIChU (Reaction Analysis through Illustrating Chemical Units), which produces depictions of biosynthetic transformations of PKS, NRPS, and hybrid PKS/NRPS systems from predicted or experimentally verified module architectures and domain substrate specificities. RAIChU also boasts a library of functions to perform and visualise reactions and pathways whose specifics (e.g., regioselectivity, stereoselectivity) are still difficult to predict, including terpenes, ribosomally synthesised and posttranslationally modified peptides and alkaloids. Additionally, RAIChU includes 34 prevalent tailoring reactions to enable the visualisation of biosynthetic pathways of fully maturated natural products. RAIChU can be integrated into Python pipelines, allowing users to upload and edit results from antiSMASH, a widely used BGC detection and annotation tool, or to build biosynthetic PKS/NRPS systems from scratch. RAIChU's cluster drawing correctness (100%) and drawing readability (97.66%) were validated on 5000 randomly generated PKS/NRPS systems, and on the MIBiG database. The automated visualisation of these pathways accelerates the generation of biosynthetic models, facilitates the analysis of large (meta-) genomic datasets and reduces human error. RAIChU is available at https://github.com/BTheDragonMaster/RAIChU and https://pypi.org/project/raichu .Scientific contributionRAIChU is the first software package capable of automating high-quality visualisations of natural product biosynthetic pathways. By leveraging universal biosynthetic rules, RAIChU enables the depiction of complex biosynthetic transformations for PKS, NRPS, ribosomally synthesised and posttranslationally modified peptide (RiPP), terpene and alkaloid systems, enhancing predictive and analytical capabilities. This innovation not only streamlines the creation of biosynthetic models, making the analysis of large genomic datasets more efficient and accurate, but also bridges a crucial gap in predicting and visualising the complexities of natural product biosynthesis.
天然产物是发挥一系列重要生态功能的分子。许多天然产物已被用于制药和农业应用。与许多其他特殊代谢产物不同,模块化非核糖体肽合成酶(NRPS)和聚酮化合物合成酶(PKS)系统的产物通常可以(部分)从生物合成基因簇的DNA序列中预测出来。这是因为NRPS和PKS系统的生物合成途径遵循一致的规则集。这些通用的生物合成规则可用于生成生物合成途径的生物合成模型。虽然这些原理已基本被破译,但尚未开发出利用这些规则自动生成生物合成模型可视化的软件。为了实现天然产物生物合成途径的高质量自动可视化,我们开发了RAIChU(通过化学单元说明进行反应分析),它可以根据预测的或经过实验验证的模块结构和结构域底物特异性,生成PKS、NRPS和混合PKS/NRPS系统生物合成转化的图示。RAIChU还拥有一个功能库,用于执行和可视化其细节(例如区域选择性、立体选择性)仍然难以预测的反应和途径,包括萜类化合物、核糖体合成及翻译后修饰的肽和生物碱。此外,RAIChU包括34种常见的修饰反应,以实现完全成熟的天然产物生物合成途径的可视化。RAIChU可以集成到Python管道中,允许用户上传和编辑来自antiSMASH(一种广泛使用的BGC检测和注释工具)的结果,或从头构建生物合成PKS/NRPS系统。RAIChU的簇图正确性(100%)和图的可读性(97.66%)在5000个随机生成的PKS/NRPS系统以及MIBiG数据库上得到了验证。这些途径的自动可视化加速了生物合成模型的生成,便于分析大型(元)基因组数据集并减少人为错误。RAIChU可在https://github.com/BTheDragonMaster/RAIChU和https://pypi.org/project/raichu获取。科学贡献RAIChU是第一个能够自动实现天然产物生物合成途径高质量可视化的软件包。通过利用通用的生物合成规则,RAIChU能够描绘PKS、NRPS、核糖体合成及翻译后修饰的肽(RiPP)、萜类化合物和生物碱系统的复杂生物合成转化,增强预测和分析能力。这一创新不仅简化了生物合成模型的创建,使大型基因组数据集的分析更高效、准确,还弥合了预测和可视化天然产物生物合成复杂性方面的关键差距。