• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用癌症突变数据为种系错义变异的致病性分类提供信息。

Leveraging cancer mutation data to inform the pathogenicity classification of germline missense variants.

作者信息

Haque Bushra, Cheerie David, Pan Amy, Curtis Meredith, Nalpathamkalam Thomas, Nguyen Jimmy, Salhab Celine, Thiruvahindrapuram Bhooma, Zhang Jade, Couse Madeline, Hartley Taila, Morrow Michelle M, Price E Magda, Walker Susan, Malkin David, Roth Frederick P, Costain Gregory

机构信息

Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, Ontario, Canada.

Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada.

出版信息

PLoS Genet. 2025 Jan 6;21(1):e1011540. doi: 10.1371/journal.pgen.1011540. eCollection 2025 Jan.

DOI:10.1371/journal.pgen.1011540
PMID:39761285
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11737861/
Abstract

Innovative and easy-to-implement strategies are needed to improve the pathogenicity assessment of rare germline missense variants. Somatic cancer driver mutations identified through large-scale tumor sequencing studies often impact genes that are also associated with rare Mendelian disorders. The use of cancer mutation data to aid in the interpretation of germline missense variants, regardless of whether the gene is associated with a hereditary cancer predisposition syndrome or a non-cancer-related developmental disorder, has not been systematically assessed. We extracted putative cancer driver missense mutations from the Cancer Hotspots database and annotated them as germline variants, including presence/absence and classification in ClinVar. We trained two supervised learning models (logistic regression and random forest) to predict variant classifications of germline missense variants in ClinVar using Cancer Hotspot data (training dataset). The performance of each model was evaluated with an independent test dataset generated in part from searching public and private genome-wide sequencing datasets from ~1.5 million individuals. Of the 2,447 cancer mutations, 691 corresponding germline variants had been previously classified in ClinVar: 426 (61.6%) as likely pathogenic/pathogenic, 261 (37.8%) as uncertain significance, and 4 (0.6%) as likely benign/benign. The odds ratio for a likely pathogenic/pathogenic classification in ClinVar was 28.3 (95% confidence interval: 24.2-33.1, p < 0.001), compared with all other germline missense variants in the same 216 genes. Both supervised learning models showed high correlation with pathogenicity assessments in the training dataset. There was high area under precision-recall curve values (0.847 and 0.829) and area under the receiver-operating characteristic curve values (0.821 and 0.774) for logistic regression and random forest models, respectively, when applied to the test dataset. With the use of cancer and germline datasets and supervised learning techniques, our study shows that cancer mutation data can be leveraged to improve the interpretation of germline missense variation potentially causing rare Mendelian disorders.

摘要

需要创新且易于实施的策略来改进对罕见种系错义变异的致病性评估。通过大规模肿瘤测序研究鉴定出的体细胞癌驱动突变通常会影响那些也与罕见孟德尔疾病相关的基因。利用癌症突变数据来辅助解释种系错义变异,无论该基因是否与遗传性癌症易感性综合征或非癌症相关的发育障碍有关,这一点尚未得到系统评估。我们从癌症热点数据库中提取了假定的癌症驱动错义突变,并将它们注释为种系变异,包括在ClinVar中的存在与否及分类情况。我们训练了两个监督学习模型(逻辑回归和随机森林),以使用癌症热点数据(训练数据集)来预测ClinVar中种系错义变异的变异分类。每个模型的性能都通过一个独立测试数据集进行评估,该测试数据集部分是通过搜索来自约150万人的公共和私人全基因组测序数据集生成的。在2447个癌症突变中,有691个相应的种系变异先前已在ClinVar中分类:426个(61.6%)被分类为可能致病/致病,261个(37.8%)意义不明确,4个(0.6%)被分类为可能良性/良性。与同一216个基因中的所有其他种系错义变异相比,ClinVar中可能致病/致病分类的优势比为28.3(95%置信区间:24.2 - 33.1,p < 0.001)。两个监督学习模型在训练数据集中与致病性评估都显示出高度相关性。当应用于测试数据集时,逻辑回归和随机森林模型的精确召回曲线下面积值分别为0.847和0.829,以及受试者工作特征曲线下面积值分别为0.821和0.774。通过使用癌症和种系数据集以及监督学习技术,我们的研究表明,癌症突变数据可用于改进对可能导致罕见孟德尔疾病的种系错义变异的解释。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdd4/11737861/b17a889008b4/pgen.1011540.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdd4/11737861/77d1506b1c1a/pgen.1011540.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdd4/11737861/b473765703f4/pgen.1011540.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdd4/11737861/0c6277bcfa5e/pgen.1011540.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdd4/11737861/b17a889008b4/pgen.1011540.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdd4/11737861/77d1506b1c1a/pgen.1011540.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdd4/11737861/b473765703f4/pgen.1011540.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdd4/11737861/0c6277bcfa5e/pgen.1011540.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdd4/11737861/b17a889008b4/pgen.1011540.g004.jpg

相似文献

1
Leveraging cancer mutation data to inform the pathogenicity classification of germline missense variants.利用癌症突变数据为种系错义变异的致病性分类提供信息。
PLoS Genet. 2025 Jan 6;21(1):e1011540. doi: 10.1371/journal.pgen.1011540. eCollection 2025 Jan.
2
Germline Mutations for Novel Candidate Predisposition Genes in Sporadic Schwannomatosis.散发性许旺细胞瘤中新型候选易感性基因的种系突变。
Clin Orthop Relat Res. 2020 Nov;478(11):2442-2450. doi: 10.1097/CORR.0000000000001239.
3
A comparative medical genomics approach may facilitate the interpretation of rare missense variation.比较医学基因组学方法可能有助于解释罕见的错义变异。
J Med Genet. 2024 Jul 19;61(8):817-821. doi: 10.1136/jmg-2023-109760.
4
Evaluating novel in silico tools for accurate pathogenicity classification in epilepsy-associated genetic missense variants.评估用于癫痫相关基因错义变异准确致病性分类的新型计算机工具。
Epilepsia. 2024 Dec;65(12):3655-3663. doi: 10.1111/epi.18155. Epub 2024 Oct 23.
5
Co-observation of germline pathogenic variants in breast cancer predisposition genes: Results from analysis of the BRIDGES sequencing dataset.乳腺癌易感基因胚系致病性变异的共同观察:BRIDGES 测序数据集分析的结果。
Am J Hum Genet. 2024 Sep 5;111(9):2059-2069. doi: 10.1016/j.ajhg.2024.07.004. Epub 2024 Aug 2.
6
Tumour characteristics provide evidence for germline mismatch repair missense variant pathogenicity.肿瘤特征为胚系错配修复同义突变致病性提供依据。
J Med Genet. 2020 Jan;57(1):62-69. doi: 10.1136/jmedgenet-2019-106096. Epub 2019 Aug 7.
7
ClinVar: updates to support classifications of both germline and somatic variants.ClinVar:更新以支持种系变异和体细胞变异的分类。
Nucleic Acids Res. 2025 Jan 6;53(D1):D1313-D1321. doi: 10.1093/nar/gkae1090.
8
Genotype-first approach to identify associations between CDH1 germline variants and cancer phenotypes: a multicentre study by the European Reference Network on Genetic Tumour Risk Syndromes.基于基因型的方法鉴定 CDH1 种系变异与癌症表型之间的关联:欧洲遗传肿瘤风险综合征参考网络的多中心研究。
Lancet Oncol. 2023 Jan;24(1):91-106. doi: 10.1016/S1470-2045(22)00643-X. Epub 2022 Nov 24.
9
AI-derived comparative assessment of the performance of pathogenicity prediction tools on missense variants of breast cancer genes.基于人工智能的乳腺癌基因错义变异致病性预测工具性能的比较评估。
Hum Genomics. 2024 Sep 11;18(1):99. doi: 10.1186/s40246-024-00667-9.
10
BRCA1/2 germline missense mutations: a systematic review.BRCA1/2 基因种系错义突变:一项系统评价
Eur J Cancer Prev. 2018 May;27(3):279-286. doi: 10.1097/CEJ.0000000000000337.