Ambry Genetics, 15 Argonaut, Aliso Viejo, CA, 92656, USA.
Divsion of Genetics and Genomics, University of California, Irvine, CA, 92697, USA.
Sci Rep. 2019 Sep 4;9(1):12752. doi: 10.1038/s41598-019-49224-8.
Many in silico predictors of genetic variant pathogenicity have been previously developed, but there is currently no standard application of these algorithms for variant assessment. Using 4,094 ClinVar-curated missense variants in clinically actionable genes, we evaluated the accuracy and yield of benign and deleterious evidence in 5 in silico meta-predictors, as well as agreement of SIFT and PolyPhen2, and report the derived thresholds for the best performing predictor(s). REVEL and BayesDel outperformed all other meta-predictors (CADD, MetaSVM, Eigen), with higher positive predictive value, comparable negative predictive value, higher yield, and greater overall prediction performance. Agreement of SIFT and PolyPhen2 resulted in slightly higher yield but lower overall prediction performance than REVEL or BayesDel. Our results support the use of gene-level rather than generalized thresholds, when gene-level thresholds can be estimated. Our results also support the use of 2-sided thresholds, which allow for uncertainty, rather than a single, binary cut-point for assigning benign and deleterious evidence. The gene-level 2-sided thresholds we derived for REVEL or BayesDel can be used to assess in silico evidence for missense variants in accordance with current classification guidelines.
许多用于预测遗传变异致病性的计算工具已经被开发出来,但目前还没有这些算法在变异评估中的标准应用。我们使用 4094 个 ClinVar 经过精心筛选的错义变异,这些变异位于具有临床可操作性的基因中,评估了 5 种计算综合预测因子中良性和有害证据的准确性和检出率,以及 SIFT 和 PolyPhen2 的一致性,并报告了最佳预测因子的衍生阈值。REVEL 和 BayesDel 的表现优于所有其他的综合预测因子(CADD、MetaSVM、Eigen),具有更高的阳性预测值、可比的阴性预测值、更高的检出率和更好的整体预测性能。SIFT 和 PolyPhen2 的一致性导致的检出率略高,但整体预测性能低于 REVEL 或 BayesDel。我们的研究结果支持在可以估计基因水平阈值的情况下使用基因水平而不是一般化的阈值。我们的研究结果还支持使用双边阈值,允许存在不确定性,而不是使用单一的二进制截断点来分配良性和有害证据。我们为 REVEL 或 BayesDel 推导的基因水平双边阈值可用于根据当前的分类指南评估错义变异的计算证据。