• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

罕见基因组计划中罕见病诊断变异优先级排序方法的批判性评估

Critical assessment of variant prioritization methods for rare disease diagnosis within the Rare Genomes Project.

作者信息

Stenton Sarah L, O'Leary Melanie, Lemire Gabrielle, VanNoy Grace E, DiTroia Stephanie, Ganesh Vijay S, Groopman Emily, O'Heir Emily, Mangilog Brian, Osei-Owusu Ikeoluwa, Pais Lynn S, Serrano Jillian, Singer-Berk Moriel, Weisburd Ben, Wilson Michael, Austin-Tse Christina, Abdelhakim Marwa, Althagafi Azza, Babbi Giulia, Bellazzi Riccardo, Bovo Samuele, Carta Maria Giulia, Casadio Rita, Coenen Pieter-Jan, De Paoli Federica, Floris Matteo, Gajapathy Manavalan, Hoehndorf Robert, Jacobsen Julius O B, Joseph Thomas, Kamandula Akash, Katsonis Panagiotis, Kint Cyrielle, Lichtarge Olivier, Limongelli Ivan, Lu Yulan, Magni Paolo, Mamidi Tarun Karthik Kumar, Martelli Pier Luigi, Mulargia Marta, Nicora Giovanna, Nykamp Keith, Pejaver Vikas, Peng Yisu, Pham Thi Hong Cam, Podda Maurizio S, Rao Aditya, Rizzo Ettore, Saipradeep Vangala G, Savojardo Castrense, Schols Peter, Shen Yang, Sivadasan Naveen, Smedley Damian, Soru Dorian, Srinivasan Rajgopal, Sun Yuanfei, Sunderam Uma, Tan Wuwei, Tiwari Naina, Wang Xiao, Wang Yaqiong, Williams Amanda, Worthey Elizabeth A, Yin Rujie, You Yuning, Zeiberg Daniel, Zucca Susanna, Bakolitsa Constantina, Brenner Steven E, Fullerton Stephanie M, Radivojac Predrag, Rehm Heidi L, O'Donnell-Luria Anne

机构信息

Division of Genetics and Genomics, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA.

Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.

出版信息

medRxiv. 2023 Aug 4:2023.08.02.23293212. doi: 10.1101/2023.08.02.23293212.

DOI:10.1101/2023.08.02.23293212
PMID:37577678
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10418577/
Abstract

BACKGROUND

A major obstacle faced by rare disease families is obtaining a genetic diagnosis. The average "diagnostic odyssey" lasts over five years, and causal variants are identified in under 50%. The Rare Genomes Project (RGP) is a direct-to-participant research study on the utility of genome sequencing (GS) for diagnosis and gene discovery. Families are consented for sharing of sequence and phenotype data with researchers, allowing development of a Critical Assessment of Genome Interpretation (CAGI) community challenge, placing variant prioritization models head-to-head in a real-life clinical diagnostic setting.

METHODS

Predictors were provided a dataset of phenotype terms and variant calls from GS of 175 RGP individuals (65 families), including 35 solved training set families, with causal variants specified, and 30 test set families (14 solved, 16 unsolved). The challenge tasked teams with identifying the causal variants in as many test set families as possible. Ranked variant predictions were submitted with estimated probability of causal relationship (EPCR) values. Model performance was determined by two metrics, a weighted score based on rank position of true positive causal variants and maximum F-measure, based on precision and recall of causal variants across EPCR thresholds.

RESULTS

Sixteen teams submitted predictions from 52 models, some with manual review incorporated. Top performing teams recalled the causal variants in up to 13 of 14 solved families by prioritizing high quality variant calls that were rare, predicted deleterious, segregating correctly, and consistent with reported phenotype. In unsolved families, newly discovered diagnostic variants were returned to two families following confirmatory RNA sequencing, and two prioritized novel disease gene candidates were entered into Matchmaker Exchange. In one example, RNA sequencing demonstrated aberrant splicing due to a deep intronic indel in , identified in with a frameshift variant, in an unsolved proband with phenotype overlap with asparagine synthetase deficiency.

CONCLUSIONS

By objective assessment of variant predictions, we provide insights into current state-of-the-art algorithms and platforms for genome sequencing analysis for rare disease diagnosis and explore areas for future optimization. Identification of diagnostic variants in unsolved families promotes synergy between researchers with clinical and computational expertise as a means of advancing the field of clinical genome interpretation.

摘要

背景

罕见病家庭面临的一个主要障碍是获得基因诊断。平均“诊断之旅”持续超过五年,且仅不到50%的家庭能确定致病变异。罕见基因组计划(RGP)是一项直接面向参与者的研究,旨在探讨基因组测序(GS)在诊断和基因发现方面的效用。研究征得家庭同意,让研究人员共享序列和表型数据,从而促成了基因组解读关键评估(CAGI)社区挑战赛,使变异优先级排序模型在真实临床诊断环境中进行直接较量。

方法

向预测者提供了来自175名RGP个体(65个家庭)GS的表型术语和变异调用数据集,其中包括35个已解决的训练集家庭(已明确致病变异)以及30个测试集家庭(14个已解决,16个未解决)。挑战赛要求各团队尽可能多地识别测试集家庭中的致病变异。提交的变异预测排名需附带因果关系估计概率(EPCR)值。模型性能由两个指标确定,一个是基于真阳性致病变异排名位置的加权分数,另一个是基于EPCR阈值的致病变异精度和召回率的最大F值。

结果

16个团队提交了来自52个模型的预测结果,有些模型还纳入了人工审核。表现最佳的团队通过优先考虑罕见、预测为有害、正确分离且与报告表型一致的高质量变异调用,在14个已解决家庭中的多达13个家庭中识别出致病变异。在未解决的家庭中,经确认性RNA测序后,两个家庭发现了新的诊断变异,并且两个优先级较高的新型疾病基因候选物被录入匹配者交换平台。在一个例子中,RNA测序显示,在一名与天冬酰胺合成酶缺乏症表型重叠的未解决先证者中,由于在 中一个内含子深处的插入缺失导致异常剪接,该插入缺失与一个移码变异一起被识别出来。

结论

通过对变异预测的客观评估,我们深入了解了当前用于罕见病诊断的基因组测序分析的先进算法和平台,并探索了未来优化的领域。在未解决的家庭中识别诊断变异促进了具有临床和计算专业知识的研究人员之间的协同合作,以此推动临床基因组解读领域的发展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89f0/10418577/77f3caa6dc13/nihpp-2023.08.02.23293212v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89f0/10418577/de8143c37c84/nihpp-2023.08.02.23293212v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89f0/10418577/12fd32221adb/nihpp-2023.08.02.23293212v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89f0/10418577/4957c556c65c/nihpp-2023.08.02.23293212v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89f0/10418577/77f3caa6dc13/nihpp-2023.08.02.23293212v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89f0/10418577/de8143c37c84/nihpp-2023.08.02.23293212v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89f0/10418577/12fd32221adb/nihpp-2023.08.02.23293212v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89f0/10418577/4957c556c65c/nihpp-2023.08.02.23293212v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89f0/10418577/77f3caa6dc13/nihpp-2023.08.02.23293212v1-f0004.jpg

相似文献

1
Critical assessment of variant prioritization methods for rare disease diagnosis within the Rare Genomes Project.罕见基因组计划中罕见病诊断变异优先级排序方法的批判性评估
medRxiv. 2023 Aug 4:2023.08.02.23293212. doi: 10.1101/2023.08.02.23293212.
2
Critical assessment of variant prioritization methods for rare disease diagnosis within the rare genomes project.对罕见基因组项目中罕见病诊断的变异优先级方法的批判性评估。
Hum Genomics. 2024 Apr 29;18(1):44. doi: 10.1186/s40246-024-00604-w.
3
Artificial intelligence enables comprehensive genome interpretation and nomination of candidate diagnoses for rare genetic diseases.人工智能能够全面解读基因组并为罕见遗传病提名候选诊断。
Genome Med. 2021 Oct 14;13(1):153. doi: 10.1186/s13073-021-00965-0.
4
Uncovering recessive alleles in rare Mendelian disorders by genome sequencing of 174 individuals with monoallelic pathogenic variants.通过对174名单等位基因致病变异个体进行基因组测序,发现罕见孟德尔疾病中的隐性等位基因。
Eur J Hum Genet. 2025 Jan;33(1):56-64. doi: 10.1038/s41431-024-01694-9. Epub 2024 Sep 27.
5
Explicable prioritization of genetic variants by integration of rule-based and machine learning algorithms for diagnosis of rare Mendelian disorders.基于规则和机器学习算法的遗传变异可解释优先级排序,用于罕见孟德尔疾病的诊断。
Hum Genomics. 2024 Mar 21;18(1):28. doi: 10.1186/s40246-024-00595-8.
6
Genomic answers for children: Dynamic analyses of >1000 pediatric rare disease genomes.儿童基因组学解答:对 1000 多个儿科罕见病基因组的动态分析。
Genet Med. 2022 Jun;24(6):1336-1348. doi: 10.1016/j.gim.2022.02.007. Epub 2022 Mar 16.
7
CAGI4 SickKids clinical genomes challenge: A pipeline for identifying pathogenic variants.CAGI4 病童临床基因组挑战:一种识别致病变异的流程。
Hum Mutat. 2017 Sep;38(9):1169-1181. doi: 10.1002/humu.23257. Epub 2017 Jun 27.
8
Structural and non-coding variants increase the diagnostic yield of clinical whole genome sequencing for rare diseases.结构变异和非编码变异增加了临床全基因组测序在罕见病诊断中的收益。
Genome Med. 2023 Nov 9;15(1):94. doi: 10.1186/s13073-023-01240-0.
9
eDiVA-Classification and prioritization of pathogenic variants for clinical diagnostics.eDiVA-临床诊断用致病变异体的分类和优先级排序。
Hum Mutat. 2019 Jul;40(7):865-878. doi: 10.1002/humu.23772. Epub 2019 May 21.
10
CAGI SickKids challenges: Assessment of phenotype and variant predictions derived from clinical and genomic data of children with undiagnosed diseases.CAGI SickKids 挑战:评估患有不明原因疾病的儿童的临床和基因组数据中得出的表型和变异预测。
Hum Mutat. 2019 Sep;40(9):1373-1391. doi: 10.1002/humu.23874. Epub 2019 Sep 3.

本文引用的文献

1
CAGI, the Critical Assessment of Genome Interpretation, establishes progress and prospects for computational genetic variant interpretation methods.CAGI,即基因组解读的关键评估,旨在评估计算遗传变异解读方法的进展和前景。
Genome Biol. 2024 Feb 22;25(1):53. doi: 10.1186/s13059-023-03113-6.
2
Advancing Understanding of Inequities in Rare Disease Genomics.推进罕见病基因组学中不公平现象的认识。
Clin Ther. 2023 Aug;45(8):745-753. doi: 10.1016/j.clinthera.2023.06.010. Epub 2023 Jul 28.
3
Calibration of computational tools for missense variant pathogenicity classification and ClinGen recommendations for PP3/BP4 criteria.
计算工具的校准用于错义变异致病性分类和 ClinGen 对 PP3/BP4 标准的建议。
Am J Hum Genet. 2022 Dec 1;109(12):2163-2177. doi: 10.1016/j.ajhg.2022.10.013. Epub 2022 Nov 21.
4
The Gene Curation Coalition: A global effort to harmonize gene-disease evidence resources.基因保存联盟:全球协同统一基因-疾病证据资源
Genet Med. 2022 Aug;24(8):1732-1742. doi: 10.1016/j.gim.2022.04.017. Epub 2022 May 4.
5
Phenotype-driven approaches to enhance variant prioritization and diagnosis of rare disease.基于表型的方法提高罕见病变异的优先级和诊断。
Hum Mutat. 2022 Aug;43(8):1071-1081. doi: 10.1002/humu.24380. Epub 2022 Apr 27.
6
Clinical implementation of RNA sequencing for Mendelian disease diagnostics.RNA 测序在孟德尔疾病诊断中的临床应用。
Genome Med. 2022 Apr 5;14(1):38. doi: 10.1186/s13073-022-01019-9.
7
seqr: A web-based analysis and collaboration tool for rare disease genomics.seqr:一个用于罕见病基因组学的基于网络的分析和协作工具。
Hum Mutat. 2022 Jun;43(6):698-707. doi: 10.1002/humu.24366. Epub 2022 Mar 21.
8
A machine learning approach based on ACMG/AMP guidelines for genomic variant classification and prioritization.基于 ACMG/AMP 指南的基因组变异分类和优先级的机器学习方法。
Sci Rep. 2022 Feb 15;12(1):2517. doi: 10.1038/s41598-022-06547-3.
9
100,000 Genomes Pilot on Rare-Disease Diagnosis in Health Care - Preliminary Report.10 万基因组计划在医疗保健中的罕见病诊断 - 初步报告。
N Engl J Med. 2021 Nov 11;385(20):1868-1880. doi: 10.1056/NEJMoa2035790.
10
Clustered mutations in the GRIK2 kainate receptor subunit gene underlie diverse neurodevelopmental disorders.簇集突变导致 GRIK2 型谷氨酸受体亚基基因异常,从而引发多种神经发育障碍。
Am J Hum Genet. 2021 Sep 2;108(9):1692-1709. doi: 10.1016/j.ajhg.2021.07.007. Epub 2021 Aug 9.