Suppr超能文献

评估用于估计错义突变后蛋白质稳定性变化的计算预测器的性能。

Assessing the performance of computational predictors for estimating protein stability changes upon missense mutations.

作者信息

Iqbal Shahid, Li Fuyi, Akutsu Tatsuya, Ascher David B, Webb Geoffrey I, Song Jiangning

机构信息

Computer System Engineering from Ghulam Ishaq Khan Institute of Engineering Sciences and Technology, Pakistan.

Department of Microbiology and Immunology, Peter Doherty Institute for Infection and Immunity, the University of Melbourne, Australia.

出版信息

Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab184.

Abstract

Understanding how a mutation might affect protein stability is of significant importance to protein engineering and for understanding protein evolution genetic diseases. While a number of computational tools have been developed to predict the effect of missense mutations on protein stability protein stability upon mutations, they are known to exhibit large biases imparted in part by the data used to train and evaluate them. Here, we provide a comprehensive overview of predictive tools, which has provided an evolving insight into the importance and relevance of features that can discern the effects of mutations on protein stability. A diverse selection of these freely available tools was benchmarked using a large mutation-level blind dataset of 1342 experimentally characterised mutations across 130 proteins from ThermoMutDB, a second test dataset encompassing 630 experimentally characterised mutations across 39 proteins from iStable2.0 and a third blind test dataset consisting of 268 mutations in 27 proteins from the newly published ProThermDB. The performance of the methods was further evaluated with respect to the site of mutation, type of mutant residue and by ranging the pH and temperature. Additionally, the classification performance was also evaluated by classifying the mutations as stabilizing (∆∆G ≥ 0) or destabilizing (∆∆G < 0). The results reveal that the performance of the predictors is affected by the site of mutation and the type of mutant residue. Further, the results show very low performance for pH values 6-8 and temperature higher than 65 for all predictors except iStable2.0 on the S630 dataset. To illustrate how stability and structure change upon single point mutation, we considered four stabilizing, two destabilizing and two stabilizing mutations from two proteins, namely the toxin protein and bovine liver cytochrome. Overall, the results on S268, S630 and S1342 datasets show that the performance of the integrated predictors is better than the mechanistic or individual machine learning predictors. We expect that this paper will provide useful guidance for the design and development of next-generation bioinformatic tools for predicting protein stability changes upon mutations.

摘要

了解突变如何影响蛋白质稳定性对于蛋白质工程以及理解蛋白质进化和遗传疾病至关重要。虽然已经开发了许多计算工具来预测错义突变对蛋白质稳定性的影响,但众所周知,它们存在很大的偏差,部分原因是用于训练和评估它们的数据。在这里,我们提供了预测工具的全面概述,这为深入了解能够辨别突变对蛋白质稳定性影响的特征的重要性和相关性提供了不断发展的见解。使用来自ThermoMutDB的130种蛋白质的1342个实验表征突变的大型突变水平盲数据集、包含来自iStable2.0的39种蛋白质的630个实验表征突变的第二个测试数据集以及由新发布的ProThermDB的27种蛋白质中的268个突变组成的第三个盲测试数据集,对这些免费可用工具的各种选择进行了基准测试。根据突变位点、突变残基类型以及改变pH值和温度范围,进一步评估了这些方法的性能。此外,还通过将突变分类为稳定(∆∆G≥0)或不稳定(∆∆G<0)来评估分类性能。结果表明,预测器的性能受突变位点和突变残基类型的影响。此外,结果显示,除了iStable2.0在S630数据集上之外,所有预测器在pH值为6 - 8和温度高于65时的性能都非常低。为了说明单点突变时稳定性和结构如何变化,我们考虑了来自两种蛋白质(即毒素蛋白和牛肝细胞色素)的四个稳定突变、两个不稳定突变和两个稳定突变。总体而言,S268、S630和S1342数据集的结果表明,集成预测器的性能优于机械或单个机器学习预测器。我们期望本文将为设计和开发用于预测突变后蛋白质稳定性变化的下一代生物信息学工具提供有用指导。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验