Department of Mathematics.
Department of Biochemistry and Molecular Biology.
Bioinformatics. 2017 Nov 15;33(22):3549-3557. doi: 10.1093/bioinformatics/btx460.
Site directed mutagenesis is widely used to understand the structure and function of biomolecules. Computational prediction of mutation impacts on protein stability offers a fast, economical and potentially accurate alternative to laboratory mutagenesis. Most existing methods rely on geometric descriptions, this work introduces a topology based approach to provide an entirely new representation of mutation induced protein stability changes that could not be obtained from conventional techniques.
Topology based mutation predictor (T-MP) is introduced to dramatically reduce the geometric complexity and number of degrees of freedom of proteins, while element specific persistent homology is proposed to retain essential biological information. The present approach is found to outperform other existing methods in the predictions of globular protein stability changes upon mutation. A Pearson correlation coefficient of 0.82 with an RMSE of 0.92 kcal/mol is obtained on a test set of 350 mutation samples. For the prediction of membrane protein stability changes upon mutation, the proposed topological approach has a 84% higher Pearson correlation coefficient than the current state-of-the-art empirical methods, achieving a Pearson correlation of 0.57 and an RMSE of 1.09 kcal/mol in a 5-fold cross validation on a set of 223 membrane protein mutation samples.
http://weilab.math.msu.edu/TML/TML-MP/.
Supplementary data are available at Bioinformatics online.
定点突变广泛用于理解生物分子的结构和功能。计算预测突变对蛋白质稳定性的影响提供了一种快速、经济且具有潜在准确性的替代实验室诱变的方法。大多数现有方法依赖于几何描述,本工作引入了一种基于拓扑的方法,为突变诱导的蛋白质稳定性变化提供了一种全新的表示形式,这是传统技术无法获得的。
引入基于拓扑的突变预测器(T-MP)来显著降低蛋白质的几何复杂度和自由度数量,同时提出元素特定的持久同调来保留基本的生物学信息。本方法在预测突变对球状蛋白质稳定性变化的预测中表现优于其他现有方法。在 350 个突变样本的测试集中,得到了 0.82 的 Pearson 相关系数和 0.92kcal/mol 的 RMSE。对于预测突变对膜蛋白稳定性变化的影响,所提出的拓扑方法比当前最先进的经验方法具有 84%更高的 Pearson 相关系数,在 223 个膜蛋白突变样本的 5 倍交叉验证中,实现了 0.57 的 Pearson 相关系数和 1.09kcal/mol 的 RMSE。
http://weilab.math.msu.edu/TML/TML-MP/。
补充数据可在生物信息学在线获得。