IMPROVE-DD：整合多种表型资源可优化遗传所致发育障碍中的变异评估。

IMPROVE-DD: Integrating multiple phenotype resources optimizes variant evaluation in genetically determined developmental disorders.

机构信息

MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh EH4 2XU, UK.

Wellcome Sanger Institute, Hinxton, Cambridgeshire CB10 1SA, UK.

出版信息

HGG Adv. 2022 Nov 24;4(1):100162. doi: 10.1016/j.xhgg.2022.100162. eCollection 2023 Jan 12.

DOI:10.1016/j.xhgg.2022.100162

PMID:36561149

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9763511/

Abstract

Diagnosing rare developmental disorders using genome-wide sequencing data commonly necessitates review of multiple plausible candidate variants, often using ontologies of categorical clinical terms. We show that Integrating Multiple Phenotype Resources Optimizes Variant Evaluation in Developmental Disorders (IMPROVE-DD) by incorporating additional classes of data commonly available to clinicians and recorded in health records. In doing so, we quantify the distinct contributions of sex, growth, and development in addition to Human Phenotype Ontology (HPO) terms and demonstrate added value from these readily available information sources. We use likelihood ratios for nominal and quantitative data and propose a classifier for HPO terms in this framework. This Bayesian framework results in more robust diagnoses. Using data systematically collected in the Deciphering Developmental Disorders study, we considered 77 genes with pathogenic/likely pathogenic variants in ≥10 individuals. All genes showed at least a satisfactory prediction by receiver operating characteristic when testing on training data (AUC ≥ 0.6), and HPO terms were the best predictor for the majority of genes, though a minority (13/77) of genes were better predicted by other phenotypic data types. Overall, classifiers based upon multiple integrated phenotypic data sources performed better than those based upon any individual source, and importantly, integrated models produced notably fewer false positives. Finally, we show that IMPROVE-DD models with good predictive performance on cross-validation can be constructed from relatively few individuals. This suggests new strategies for candidate gene prioritization and highlights the value of systematic clinical data collection to support diagnostic programs.

摘要

使用全基因组测序数据诊断罕见发育障碍通常需要对多个合理的候选变异进行评估，通常使用分类临床术语的本体论。我们表明，通过纳入临床医生通常可用且记录在健康记录中的其他类别的数据，综合多种表型资源可优化发育障碍中的变异评估（IMPROVE-DD）。通过这样做，我们定量了性别、生长和发育除人类表型本体论（HPO）术语之外的独特贡献，并证明了这些现成信息源的附加值。我们在该框架中使用名义和定量数据的似然比，并提出了 HPO 术语的分类器。这种贝叶斯框架可得出更稳健的诊断结果。使用在解析发育障碍研究中系统收集的数据，我们考虑了 77 个基因，这些基因在≥10 个个体中具有致病性/可能致病性变异。在对训练数据进行测试时，所有基因的接收者操作特性曲线（AUC≥0.6）均至少显示出令人满意的预测，而 HPO 术语是大多数基因的最佳预测指标，尽管少数（13/77）基因被其他表型数据类型更好地预测。总体而言，基于多个综合表型数据源的分类器的性能优于基于任何单个数据源的分类器，重要的是，综合模型产生的假阳性明显减少。最后，我们表明，在交叉验证中具有良好预测性能的 IMPROVE-DD 模型可以由相对较少的个体构建。这表明了候选基因优先级排序的新策略，并突出了系统临床数据收集对支持诊断计划的价值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9344/9763511/2d65640dca17/gr1.jpg

相似文献

IMPROVE-DD: Integrating multiple phenotype resources optimizes variant evaluation in genetically determined developmental disorders.

HGG Adv. 2022 Nov 24;4(1):100162. doi: 10.1016/j.xhgg.2022.100162. eCollection 2023 Jan 12.

HPO2Vec+: Leveraging heterogeneous knowledge resources to enrich node embeddings for the Human Phenotype Ontology.

J Biomed Inform. 2019 Aug;96:103246. doi: 10.1016/j.jbi.2019.103246. Epub 2019 Jun 27.

Am J Hum Genet. 2016 Mar 3;98(3):490-499. doi: 10.1016/j.ajhg.2016.01.008. Epub 2016 Feb 25.

Finding Diagnostically Useful Patterns in Quantitative Phenotypic Data.

Am J Hum Genet. 2019 Nov 7;105(5):933-946. doi: 10.1016/j.ajhg.2019.09.015. Epub 2019 Oct 10.

Creation and evaluation of full-text literature-derived, feature-weighted disease models of genetically determined developmental disorders.

Database (Oxford). 2022 Jun 7;2022. doi: 10.1093/database/baac038.

Expansion of the Human Phenotype Ontology (HPO) knowledge base and resources.

Nucleic Acids Res. 2019 Jan 8;47(D1):D1018-D1027. doi: 10.1093/nar/gky1105.

Increasing phenotypic annotation improves the diagnostic rate of exome sequencing in a rare neuromuscular disorder.

Hum Mutat. 2019 Oct;40(10):1797-1812. doi: 10.1002/humu.23792. Epub 2019 Jun 23.

Curation and expansion of the Human Phenotype Ontology for systemic autoinflammatory diseases improves phenotype-driven disease-matching.

Front Immunol. 2023 Sep 12;14:1215869. doi: 10.3389/fimmu.2023.1215869. eCollection 2023.

DECIPHER: Supporting the interpretation and sharing of rare disease phenotype-linked variant data to advance diagnosis and research.

Hum Mutat. 2022 Jun;43(6):682-697. doi: 10.1002/humu.24340. Epub 2022 Feb 21.

[From symptom to syndrome using modern software support].

Internist (Berl). 2018 Aug;59(8):766-775. doi: 10.1007/s00108-018-0456-8.

引用本文的文献

Improving the care of children with GENetic Rare disease: Observational Cohort study (GenROC)-a study protocol.

BMJ Open. 2024 May 16;14(5):e085237. doi: 10.1136/bmjopen-2024-085237.

Genomic Diagnosis of Rare Pediatric Disease in the United Kingdom and Ireland.

N Engl J Med. 2023 Apr 27;388(17):1559-1571. doi: 10.1056/NEJMoa2209046. Epub 2023 Apr 12.

本文引用的文献

RDmap: a map for exploring rare diseases.

Orphanet J Rare Dis. 2021 Feb 25;16(1):101. doi: 10.1186/s13023-021-01741-4.

The Human Phenotype Ontology in 2021.

Nucleic Acids Res. 2021 Jan 8;49(D1):D1207-D1217. doi: 10.1093/nar/gkaa1043.

DeepPheno: Predicting single gene loss-of-function phenotypes using an ontology-aware hierarchical classifier.

PLoS Comput Biol. 2020 Nov 18;16(11):e1008453. doi: 10.1371/journal.pcbi.1008453. eCollection 2020 Nov.

Interpretable Clinical Genomics with a Likelihood Ratio Paradigm.

Am J Hum Genet. 2020 Sep 3;107(3):403-417. doi: 10.1016/j.ajhg.2020.06.021. Epub 2020 Aug 4.

Genomically Aided Diagnosis of Severe Developmental Disorders.

Annu Rev Genomics Hum Genet. 2020 Aug 31;21:327-349. doi: 10.1146/annurev-genom-120919-082329. Epub 2020 May 18.

HPOAnnotator: improving large-scale prediction of HPO annotations by low-rank approximation with HPO semantic similarities and multiple PPI networks.

BMC Med Genomics. 2019 Dec 23;12(Suppl 10):187. doi: 10.1186/s12920-019-0625-1.

Finding Diagnostically Useful Patterns in Quantitative Phenotypic Data.

Am J Hum Genet. 2019 Nov 7;105(5):933-946. doi: 10.1016/j.ajhg.2019.09.015. Epub 2019 Oct 10.

Encoding Clinical Data with the Human Phenotype Ontology for Computational Differential Diagnostics.

Curr Protoc Hum Genet. 2019 Sep;103(1):e92. doi: 10.1002/cphg.92.

Predicting disease-related phenotypes using an integrated phenotype similarity measurement based on HPO.

BMC Syst Biol. 2019 Apr 5;13(Suppl 2):34. doi: 10.1186/s12918-019-0697-8.

PhenoPro: a novel toolkit for assisting in the diagnosis of Mendelian disease.

Bioinformatics. 2019 Oct 1;35(19):3559-3566. doi: 10.1093/bioinformatics/btz100.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

IMPROVE-DD：整合多种表型资源可优化遗传所致发育障碍中的变异评估。

IMPROVE-DD: Integrating multiple phenotype resources optimizes variant evaluation in genetically determined developmental disorders.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献