Katsonis Panagiotis, Lichtarge Olivier
Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas.
Department of Biochemistry & Molecular Biology, Baylor College of Medicine, Houston, Texas.
Hum Mutat. 2017 Sep;38(9):1072-1084. doi: 10.1002/humu.23266. Epub 2017 Jun 21.
A major challenge in genome interpretation is to estimate the fitness effect of coding variants of unknown significance (VUS). Labor, limited understanding of protein functions, and lack of assays generally limit direct experimental assessment of VUS, and make robust and accurate computational approaches a necessity. Often, however, algorithms that predict mutational effect disagree among themselves and with experimental data, slowing their adoption for clinical diagnostics. To objectively assess such methods, the Critical Assessment of Genome Interpretation (CAGI) community organizes contests to predict unpublished experimental data, available only to CAGI assessors. We review here the CAGI performance of evolutionary action (EA) predictions of mutational impact. EA models the fitness effect of coding mutations analytically, as a product of the gradient of the fitness landscape times the perturbation size. In practice, these terms are computed from phylogenetic considerations as the functional sensitivity of the mutated site and as the magnitude of amino acid substitution, respectively, and yield the percentage loss of wild-type activity. In five CAGI challenges, EA consistently performed on par or better than sophisticated machine learning approaches. This objective assessment suggests that a simple differential model of evolution can interpret the fitness effect of coding variations, opening diverse clinical applications.
基因组解读中的一个主要挑战是评估意义未明的编码变异(VUS)对适应性的影响。实验工作、对蛋白质功能的有限理解以及缺乏相关检测方法,通常限制了对VUS进行直接实验评估,因此需要强大而准确的计算方法。然而,预测突变效应的算法往往相互之间以及与实验数据存在分歧,这减缓了它们在临床诊断中的应用。为了客观评估此类方法,基因组解读关键评估(CAGI)社区组织竞赛来预测未发表的实验数据,这些数据仅对CAGI评估者可用。我们在此回顾CAGI中进化作用(EA)对突变影响预测的表现。EA通过分析编码突变的适应性效应,将其作为适应度景观梯度与扰动大小的乘积。实际上,这些项分别根据系统发育因素计算为突变位点的功能敏感性和氨基酸取代的幅度,并得出野生型活性的损失百分比。在CAGI的五项挑战中,EA的表现始终与复杂的机器学习方法相当或更优。这一客观评估表明,一个简单的进化差异模型可以解读编码变异的适应性效应,从而开启多种临床应用。