Suppr超能文献

在全基因组数据上对变异效应预测因子进行校准会掩盖基因间异质性的性能。

Calibration of variant effect predictors on genome-wide data masks heterogeneous performance across genes.

机构信息

Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA.

Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA; Brotman Baty Institute for Precision Medicine, Seattle, WA 98195, USA; Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA 98195, USA.

出版信息

Am J Hum Genet. 2024 Sep 5;111(9):2031-2043. doi: 10.1016/j.ajhg.2024.07.018. Epub 2024 Aug 21.

Abstract

In silico variant effect predictions are available for nearly all missense variants but played a minimal role in clinical variant classification because they were deemed to provide only supporting evidence. Recently, the ClinGen Sequence Variant Interpretation (SVI) Working Group updated recommendations for variant effect prediction use. By analyzing control pathogenic and benign variants across all genes, they were able to compute evidence strength for predictor score intervals with some intervals generating moderate, strong, or even very strong evidence. However, this genome-wide approach could obscure heterogeneous predictor performance in different genes. We quantified the gene-by-gene performance of two top predictors, REVEL and BayesDel, by analyzing control variants in each predictor score interval in 3,668 disease-relevant genes. Approximately 10% of intervals had sufficient control variants for analysis, and ∼70% of these intervals exceeded the maximum number of incorrect predictions implied by the SVI recommendations. These trending discordant intervals arose owing to the divergence of the gene-specific distribution of predictions from the genome-wide distribution, suggesting that gene-specific calibration is needed in many cases. Approximately 22% of ClinVar missense variants of uncertain significance in genes we analyzed (REVEL = 100,629, BayesDel = 71,928) had predictions in trending discordant intervals. Thus, genome-wide calibrations could result in many variants receiving inappropriate evidence strength. To facilitate a review of the SVI's calibrations, we developed a web application enabling visualization of gene-specific predictions and trending concordant and discordant intervals.

摘要

目前几乎所有错义变异都可以进行基于计算的变异效应预测,但在临床变异分类中的作用很小,因为它们被认为只能提供辅助证据。最近,ClinGen 序列变异解释(SVI)工作组更新了对变异效应预测使用的建议。通过分析所有基因中的对照致病性和良性变异,他们能够计算预测器评分区间的证据强度,其中一些区间产生中等、强甚至非常强的证据。然而,这种全基因组方法可能会掩盖不同基因中预测器性能的异质性。我们通过分析每个预测器评分区间中的对照变异,量化了两个顶级预测器(REVEL 和 BayesDel)的基因间性能,在 3668 个与疾病相关的基因中。大约有 10%的区间有足够的对照变异可供分析,其中约 70%的区间超过了 SVI 建议中暗示的最大错误预测数。这些呈上升趋势的不一致区间是由于预测从全基因组分布的基因特异性分布中出现分歧造成的,这表明在许多情况下需要进行基因特异性校准。在我们分析的基因中,大约 22%的 ClinVar 意义未明的错义变异(REVEL=100629,BayesDel=71928)的预测处于上升的不一致区间。因此,全基因组校准可能会导致许多变异获得不适当的证据强度。为了便于对 SVI 的校准进行审查,我们开发了一个网络应用程序,能够可视化基因特异性预测和上升的一致和不一致区间。

相似文献

引用本文的文献

6

本文引用的文献

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验