Harini K, Sekijima M, Gromiha M Michael
Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, 600036 Tamil Nadu, India.
Department of Computer Science, Tokyo Institute of Technology, Yokohama 226-8501, Japan.
J Chem Inf Model. 2025 Feb 10;65(3):1605-1614. doi: 10.1021/acs.jcim.4c01452. Epub 2025 Jan 23.
Interactions between proteins and RNAs are essential for the proper functioning of cells, and mutations in these molecules may lead to diseases. These protein mutations alter the strength of interactions between the protein and RNA, generally described as binding affinity (Δ). Hence, the affinity change upon mutation (ΔΔ) is an important parameter for understanding the effect of mutations in protein-RNA complexes. In this work, we developed a machine-learning model to predict ΔΔ values upon mutations in protein-RNA complexes. We collected experimentally determined ΔΔ values of 710 mutations in 134 protein-RNA complexes. Diverse sequence and structural features were generated from both wild-type and modeled mutant complexes, which include conservation scores, residue-based, network-based, and interface features. Further, we developed a support vector regressor model with a correlation of 0.75 and a mean absolute error of 0.84 kcal/mol in the jack-knife test. We observed that the performance of the model is dictated by structural features, such as contact potentials, atom contacts in the interface of protein-RNA complexes, and the solvent accessibility of the mutated residue. We also developed a Web server, PRA-MutPred, predicting the protein-RNA binding affinity change upon mutation, which is available in the link https://web.iitm.ac.in/bioinfo2/pramutpred/.
蛋白质与RNA之间的相互作用对于细胞的正常功能至关重要,这些分子中的突变可能导致疾病。这些蛋白质突变会改变蛋白质与RNA之间相互作用的强度,通常将其描述为结合亲和力(Δ)。因此,突变后的亲和力变化(ΔΔ)是理解蛋白质-RNA复合物中突变影响的一个重要参数。在这项工作中,我们开发了一种机器学习模型来预测蛋白质-RNA复合物突变后的ΔΔ值。我们收集了134个蛋白质-RNA复合物中710个突变的实验测定的ΔΔ值。从野生型和建模的突变复合物中生成了多种序列和结构特征,包括保守性得分、基于残基的、基于网络的和界面特征。此外,我们开发了一种支持向量回归模型,在留一法测试中相关性为0.75,平均绝对误差为0.84千卡/摩尔。我们观察到该模型的性能由结构特征决定,如接触势、蛋白质-RNA复合物界面中的原子接触以及突变残基的溶剂可及性。我们还开发了一个网络服务器PRA-MutPred,用于预测突变后蛋白质-RNA的结合亲和力变化,可通过链接https://web.iitm.ac.in/bioinfo2/pramutpred/获取。