Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, India.
PSN College of Engineering and Technology, Melathediyoor, Tirunelveli, Tamil Nadu, India.
J Mol Biol. 2022 Jun 15;434(11):167526. doi: 10.1016/j.jmb.2022.167526. Epub 2022 Mar 5.
Protein-carbohydrate interactions play an important role in several biological processes. The mutation of amino acid residues in carbohydrate-binding proteins may alter the binding affinity, affect the functions and lead to diseases. Elucidating the factors influencing the binding affinity change (ΔΔG) of protein-carbohydrate complexes upon mutation is a challenging task. In this work, we have collected the experimental data for the binding affinity change of 318 unique mutants and related with sequence and structural features of amino acid residues at the mutant sites. We found that accessible surface area, secondary structure, mutation preference, conservation score, hydrophobicity and contact energies are important to understand the binding affinity change upon mutation. We have developed multiple regression equations for predicting the binding affinity change upon mutation and our method showed an average correlation of 0.74 and a mean absolute error of 0.70 kcal/mol between experimental and predicted ΔΔG on a 10-fold cross-validation. Further, we have validated our method using an independent test data set of 124 (62 unique) mutations, which showed a correlation and MAE of 0.79 and 0.56 kcal/mol, respectively. We have developed a web server PCA-MutPred, Protein-CArbohydrate complex Mutation affinity Predictor, for predicting the change in binding affinity of protein-carbohydrate complexes and it is freely accessible at https://web.iitm.ac.in/bioinfo2/pcamutpred. We suggest that the method could be a useful resource for designing protein-carbohydrate complexes with desired affinities.
蛋白质-碳水化合物相互作用在许多生物过程中起着重要作用。碳水化合物结合蛋白中氨基酸残基的突变可能会改变结合亲和力、影响功能并导致疾病。阐明影响蛋白质-碳水化合物复合物突变时结合亲和力变化(ΔΔG)的因素是一项具有挑战性的任务。在这项工作中,我们收集了 318 个独特突变体的实验数据,并与突变部位氨基酸残基的序列和结构特征相关联。我们发现,可及表面积、二级结构、突变偏好、保守评分、疏水性和接触能对于理解突变时的结合亲和力变化很重要。我们已经开发了用于预测突变时结合亲和力变化的多元回归方程,我们的方法在 10 倍交叉验证中显示出实验和预测的 ΔΔG 之间的平均相关性为 0.74,平均绝对误差为 0.70 kcal/mol。此外,我们使用 124 个(62 个独特)突变的独立测试数据集验证了我们的方法,其相关性和 MAE 分别为 0.79 和 0.56 kcal/mol。我们开发了一个名为 PCA-MutPred 的网络服务器,用于预测蛋白质-碳水化合物复合物结合亲和力的变化,网址为 https://web.iitm.ac.in/bioinfo2/pcamutpred。我们建议该方法可以成为设计具有所需亲和力的蛋白质-碳水化合物复合物的有用资源。