Calibration of variant effect predictors on genome-wide data masks heterogeneous performance across genes.

Author information

Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA.

Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA; Brotman Baty Institute for Precision Medicine, Seattle, WA 98195, USA; Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA 98195, USA.

Publication information

Am J Hum Genet. 2024 Sep 5;111(9):2031-2043. doi: 10.1016/j.ajhg.2024.07.018. Epub 2024 Aug 21.

DOI:10.1016/j.ajhg.2024.07.018

PMID:39173626

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11393694/

Abstract

In silico variant effect predictions are available for nearly all missense variants but played a minimal role in clinical variant classification because they were deemed to provide only supporting evidence. Recently, the ClinGen Sequence Variant Interpretation (SVI) Working Group updated recommendations for variant effect prediction use. By analyzing control pathogenic and benign variants across all genes, they were able to compute evidence strength for predictor score intervals with some intervals generating moderate, strong, or even very strong evidence. However, this genome-wide approach could obscure heterogeneous predictor performance in different genes. We quantified the gene-by-gene performance of two top predictors, REVEL and BayesDel, by analyzing control variants in each predictor score interval in 3,668 disease-relevant genes. Approximately 10% of intervals had sufficient control variants for analysis, and ∼70% of these intervals exceeded the maximum number of incorrect predictions implied by the SVI recommendations. These trending discordant intervals arose owing to the divergence of the gene-specific distribution of predictions from the genome-wide distribution, suggesting that gene-specific calibration is needed in many cases. Approximately 22% of ClinVar missense variants of uncertain significance in genes we analyzed (REVEL = 100,629, BayesDel = 71,928) had predictions in trending discordant intervals. Thus, genome-wide calibrations could result in many variants receiving inappropriate evidence strength. To facilitate a review of the SVI's calibrations, we developed a web application enabling visualization of gene-specific predictions and trending concordant and discordant intervals.
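The per-gene, per-interval check described in the abstract can be illustrated with a minimal sketch. This is not the authors' procedure: the score intervals, the tolerated error fractions, and the `MIN_CONTROLS` cutoff below are hypothetical placeholders, and the rule "maximum incorrect predictions = floor(n × tolerated fraction)" is a simplifying assumption standing in for whatever bound the SVI evidence strengths actually imply.

```python
from collections import Counter
from math import floor

# Hypothetical score intervals (NOT the published REVEL/BayesDel thresholds):
# (label, low inclusive, high exclusive, predicted class, tolerated error fraction)
INTERVALS = [
    ("benign_supporting", 0.0, 0.3, "benign", 0.10),
    ("indeterminate", 0.3, 0.6, None, None),
    ("pathogenic_supporting", 0.6, 0.8, "pathogenic", 0.10),
    ("pathogenic_moderate", 0.8, 1.01, "pathogenic", 0.05),
]
MIN_CONTROLS = 10  # hypothetical minimum number of control variants per interval


def assign_interval(score):
    """Return (label, predicted class, tolerated error fraction) for a score."""
    for label, low, high, direction, max_frac in INTERVALS:
        if low <= score < high:
            return label, direction, max_frac
    raise ValueError(f"score {score} is outside all intervals")


def flag_intervals(variants):
    """Classify each (gene, interval) pair from control variants.

    variants: iterable of (gene, score, truth), truth in {"pathogenic", "benign"}.
    Returns a dict mapping (gene, interval label) to one of
    "insufficient", "trending_concordant", or "trending_discordant".
    """
    total, wrong, tolerated = Counter(), Counter(), {}
    for gene, score, truth in variants:
        label, direction, max_frac = assign_interval(score)
        if direction is None:  # indeterminate interval carries no evidence
            continue
        key = (gene, label)
        total[key] += 1
        wrong[key] += truth != direction  # control contradicts the prediction
        tolerated[key] = max_frac

    results = {}
    for key, n in total.items():
        if n < MIN_CONTROLS:
            results[key] = "insufficient"
            continue
        max_incorrect = floor(n * tolerated[key])  # assumed bound, see lead-in
        if wrong[key] > max_incorrect:
            results[key] = "trending_discordant"
        else:
            results[key] = "trending_concordant"
    return results
```

Calling `flag_intervals` on a list of `(gene, score, truth)` tuples returns one label per gene and interval. Applied across many genes, the proportion of analyzable intervals labeled `trending_discordant` would play the role of the ∼70% figure reported above, under these assumed thresholds.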
