Xie Hanqing, Liu Kaifeng, Li Zhengqiang, Wang Zhi, Wang Chunyu, Li Fengxi, Han Weiwei, Wang Lei
Key Laboratory of Molecular Enzymology and Engineering of Ministry of Education, School of Life Sciences, Jilin University, Changchun 130023, P. R. China.
State Key Laboratory of Supramolecular Structure and Materials, Jilin University, Changchun 130023, P. R. China.
JACS Au. 2024 Nov 21;4(12):4957-4967. doi: 10.1021/jacsau.4c01045. eCollection 2024 Dec 23.
In this study, we developed a machine-learning-aided protein design strategy for engineering hemoglobin (VHb) as carbene transferase. A Natural Language Processing (NLP) model was used for the first time to construct an algorithm (EESP, enzyme enantioselectivity score predictor) and predict the enantioselectivity of VHb. We identified critical amino acid residue sites by molecular docking and established a simplified mutation library by site-saturated mutagenesis. Based on the simplified mutant library, the trianed EESP scored 160,000 virtual mutants, and 15 predicted high-score mutants were chosen for experimental validation. Among these mutants, VHb-WK (Y29W/P54K) demonstrated the highest diastereoselectivity and enantioselectivity of carbene transferase for the olefin cyclopropanation in aqueous conditions. Subsequently, molecular dynamics simulations were performed to explore the interaction between protein and substrates, finding that the high enantioselectivity of VHb-WK stems from the interactions of R47, Q53, and K84, which narrows the entrance of the enzyme's pocket, favoring the restriction of the formation of reaction intermediates. Integrating the NLP model and enzyme modification offers significant advantages by reducing economic costs and workloads associated with the protein engineering process.
在本研究中,我们开发了一种机器学习辅助的蛋白质设计策略,用于将血红蛋白(VHb)工程改造为卡宾转移酶。首次使用自然语言处理(NLP)模型构建了一种算法(EESP,酶对映选择性评分预测器)并预测VHb的对映选择性。我们通过分子对接确定了关键氨基酸残基位点,并通过位点饱和诱变建立了简化的突变文库。基于简化的突变文库,经过训练的EESP对160,000个虚拟突变体进行了评分,并选择了15个预测的高分突变体进行实验验证。在这些突变体中,VHb-WK(Y29W/P54K)在水相条件下对烯烃环丙烷化反应表现出卡宾转移酶最高的非对映选择性和对映选择性。随后,进行了分子动力学模拟以探索蛋白质与底物之间的相互作用,发现VHb-WK的高对映选择性源于R47、Q53和K84之间的相互作用,这缩小了酶口袋的入口,有利于限制反应中间体的形成。将NLP模型与酶修饰相结合,通过降低与蛋白质工程过程相关的经济成本和工作量,具有显著优势。