Chen Nianhang, Jiang Zhennan, Xie Zhekai, Zhou Su, Zeng Tao, Jiang Siqi, Zheng Ying, Yuan Yuan, Wu Ruibo
School of Pharmaceutical Sciences, Sun Yat-sen University, Guangzhou 510006, China.
State Key Laboratory for Quality Ensurance and Sustainable Use of Dao-di Herbs, Experimental Research Center, China Academy of Chinese Medical Sciences, Beijing 100700, China.
ACS Synth Biol. 2025 Jun 20;14(6):2064-2080. doi: 10.1021/acssynbio.4c00886. Epub 2025 May 16.
The GT-B type glycosyltransferases play a crucial post-modification role in synthesizing natural products, such as triterpenoid and steroidal saponins, renowned for their diverse pharmacological activities. Despite phylogenetic analysis aiding in enzyme family classification, distinguishing substrate specificity between triterpenoid and steroidal saponins, with their highly similar cyclic scaffolds, remains a formidable challenge. Our studies unveil the potential transport tunnels for the glycosyl donor and acceptor in PpUGT73CR1, by molecular dynamics simulations. This revelation leads to a plausible substrate transport mechanism, highlighting the regulatory role of the N-terminal domain (NTD) in glycosyl acceptor binding and transport. Inspired by these structural and mechanistic insights, we further analyze the binding pockets of 44 plant-derived UGTs known to glycosylate triterpenes and sterols. Notably, sterol UGTs are found to harbor aromatic and hydrophobic residues with polar residues typically present at the bottom of the active pocket. Drawing inspiration from the substrate binding and product release mechanism revealed through structure-based molecular modeling, we devised a fast sequence-based method for classifying UGTs using the pre-trained ESM2 protein model. This method involved extracting the NTD features of UGTs and performing PCA clustering analysis, enabling accurate identification of enzyme function, and even differentiation of substrate specificity/promiscuity between structurally similar triterpenoid and steroidal substrates, which is further validated by experiments. This work not only deepens our understanding of substrate binding mechanisms but also provides an effective computational protocol for predicting the catalytic function of unknown UGTs.
GT-B型糖基转移酶在天然产物的合成中发挥着关键的后修饰作用,这些天然产物如三萜类和甾体皂苷,以其多样的药理活性而闻名。尽管系统发育分析有助于酶家族的分类,但区分三萜类和甾体皂苷之间的底物特异性仍然是一项艰巨的挑战,因为它们具有高度相似的环状支架。我们的研究通过分子动力学模拟揭示了PpUGT73CR1中糖基供体和受体的潜在转运通道。这一发现引出了一种合理的底物转运机制,突出了N端结构域(NTD)在糖基受体结合和转运中的调节作用。受这些结构和机制见解的启发,我们进一步分析了44种已知可糖基化三萜和甾醇的植物源UGT的结合口袋。值得注意的是,发现甾醇UGT含有芳香族和疏水残基,极性残基通常位于活性口袋的底部。从基于结构的分子建模揭示的底物结合和产物释放机制中获得灵感,我们设计了一种基于快速序列的方法,使用预训练的ESM2蛋白质模型对UGT进行分类。该方法包括提取UGT的NTD特征并进行主成分分析聚类,能够准确识别酶的功能,甚至区分结构相似的三萜类和甾体底物之间的底物特异性/混杂性,这在实验中得到了进一步验证。这项工作不仅加深了我们对底物结合机制的理解,还为预测未知UGT的催化功能提供了一种有效的计算方案。