Department of BioModeling, BioInformatics & BioProcesses, Université Libre de Bruxelles (ULB), CP 165/61, Roosevelt Avenue 50, 1050, Brussels, Belgium.
Interuniversity Institute of Bioinformatics in Brussels, ULB, CP 263, Triumph Bld, 1050, Brussels, Belgium.
Sci Rep. 2018 Mar 14;8(1):4480. doi: 10.1038/s41598-018-22531-2.
The classification of human genetic variants into deleterious and neutral is a challenging issue, whose complexity is rooted in the large variety of biophysical mechanisms that can be responsible for disease conditions. For non-synonymous mutations in structured proteins, one of these is the protein stability change, which can lead to loss of protein structure or function. We developed a stability-driven knowledge-based classifier that uses protein structure, artificial neural networks and solvent accessibility-dependent combinations of statistical potentials to predict whether destabilizing or stabilizing mutations are disease-causing. Our predictor yields a balanced accuracy of 71% in cross validation. As expected, it has a very high positive predictive value of 89%: it predicts with high accuracy the subset of mutations that are deleterious because of stability issues, but is by construction unable of classifying variants that are deleterious for other reasons. Its combination with an evolutionary-based predictor increases the balanced accuracy up to 75%, and allowed predicting more than 1/4 of the variants with 95% positive predictive value. Our method, called SNPMuSiC, can be used with both experimental and modeled structures and compares favorably with other prediction tools on several independent test sets. It constitutes a step towards interpreting variant effects at the molecular scale. SNPMuSiC is freely available at https://soft.dezyme.com/ .
将人类遗传变异体分类为有害和中性是一个具有挑战性的问题,其复杂性源于可以导致疾病状况的多种生物物理机制。对于结构蛋白中的非 synonymous突变,其中之一是蛋白质稳定性变化,这可能导致蛋白质结构或功能丧失。我们开发了一种基于稳定性的知识驱动分类器,该分类器使用蛋白质结构、人工神经网络和溶剂可及性依赖的统计势组合来预测导致稳定或不稳定的突变是否是致病的。我们的预测器在交叉验证中的平衡准确率为 71%。不出所料,它具有非常高的阳性预测值 89%:它可以准确地预测因稳定性问题而有害的突变子集,但由于其结构无法对因其他原因而有害的变体进行分类。它与基于进化的预测器相结合,将平衡准确率提高到 75%,并允许以 95%的阳性预测值预测超过 1/4的变体。我们的方法称为 SNPMuSiC,可与实验和模拟结构一起使用,并且在几个独立的测试集上与其他预测工具相比具有优势。它标志着在分子尺度上解释变体效应迈出了一步。SNPMuSiC 可在 https://soft.dezyme.com/ 上免费获得。