Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, P. R. China.
PLoS Comput Biol. 2021 Apr 19;17(4):e1008951. doi: 10.1371/journal.pcbi.1008951. eCollection 2021 Apr.
The binding affinities of protein-nucleic acid interactions could be altered due to missense mutations occurring in DNA- or RNA-binding proteins, therefore resulting in various diseases. Unfortunately, a systematic comparison and prediction of the effects of mutations on protein-DNA and protein-RNA interactions (these two mutation classes are termed MPDs and MPRs, respectively) is still lacking. Here, we demonstrated that these two classes of mutations could generate similar or different tendencies for binding free energy changes in terms of the properties of mutated residues. We then developed regression algorithms separately for MPDs and MPRs by introducing novel geometric partition-based energy features and interface-based structural features. Through feature selection and ensemble learning, similar computational frameworks that integrated energy- and nonenergy-based models were established to estimate the binding affinity changes resulting from MPDs and MPRs, but the selected features for the final models were different and therefore reflected the specificity of these two mutation classes. Furthermore, the proposed methodology was extended to the identification of mutations that significantly decreased the binding affinities. Extensive validations indicated that our algorithm generally performed better than the state-of-the-art methods on both the regression and classification tasks. The webserver and software are freely available at http://liulab.hzau.edu.cn/PEMPNI and https://github.com/hzau-liulab/PEMPNI.
蛋白质-核酸相互作用的结合亲和力可能由于 DNA 或 RNA 结合蛋白中的错义突变而改变,从而导致各种疾病。不幸的是,对于突变对蛋白质-DNA 和蛋白质-RNA 相互作用的影响(这两类突变分别称为 MPD 和 MPR),仍然缺乏系统的比较和预测。在这里,我们证明了这些两类突变可能会根据突变残基的性质产生相似或不同的结合自由能变化趋势。然后,我们通过引入新的基于几何分区的能量特征和基于界面的结构特征,分别为 MPD 和 MPR 开发了回归算法。通过特征选择和集成学习,建立了相似的计算框架,将能量和非能量模型集成在一起,以估计 MPD 和 MPR 引起的结合亲和力变化,但最终模型中选择的特征不同,因此反映了这两类突变的特异性。此外,该方法还扩展到识别显著降低结合亲和力的突变。广泛的验证表明,我们的算法在回归和分类任务上的性能通常优于最先进的方法。网络服务器和软件可免费在 http://liulab.hzau.edu.cn/PEMPNI 和 https://github.com/hzau-liulab/PEMPNI 获得。