Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, Washington.
The eScience Institute, University of Washington, Seattle, Washington.
Hum Mutat. 2019 Sep;40(9):1495-1506. doi: 10.1002/humu.23838. Epub 2019 Jul 3.
Thermodynamic stability is a fundamental property shared by all proteins. Changes in stability due to mutation are a widespread molecular mechanism in genetic diseases. Methods for the prediction of mutation-induced stability change have typically been developed and evaluated on incomplete and/or biased data sets. As part of the Critical Assessment of Genome Interpretation, we explored the utility of high-throughput variant stability profiling (VSP) assay data as an alternative for the assessment of computational methods and evaluated state-of-the-art predictors against over 7,000 nonsynonymous variants from two proteins. We found that predictions were modestly correlated with actual experimental values. Predictors fared better when evaluated as classifiers of extreme stability effects. While different methods emerging as top performers depending on the metric, it is nontrivial to draw conclusions on their adoption or improvement. Our analyses revealed that only 16% of all variants in VSP assays could be confidently defined as stability-affecting. Furthermore, it is unclear as to what extent VSP abundance scores were reasonable proxies for the stability-related quantities that participating methods were designed to predict. Overall, our observations underscore the need for clearly defined objectives when developing and using both computational and experimental methods in the context of measuring variant impact.
热力学稳定性是所有蛋白质共有的基本特性。由于突变导致的稳定性变化是遗传疾病中广泛存在的分子机制。预测突变诱导的稳定性变化的方法通常是在不完整和/或有偏差的数据集上开发和评估的。作为基因组解读关键评估的一部分,我们探讨了高通量变体稳定性分析(VSP)测定数据作为评估计算方法的替代方法的效用,并针对来自两种蛋白质的超过 7000 个非同义变体评估了最先进的预测器。我们发现预测与实际实验值有一定的相关性。当作为极端稳定性效应的分类器进行评估时,预测器的表现更好。虽然不同的方法根据指标表现出色,但要对它们的采用或改进得出结论并不容易。我们的分析表明,VSP 测定中只有 16%的变体可以被确定为稳定影响。此外,VSP 丰度评分在多大程度上可以作为参与方法旨在预测的与稳定性相关的数量的合理替代物尚不清楚。总体而言,我们的观察结果强调了在制定和使用计算和实验方法来衡量变体影响时,明确目标的必要性。