CAGI SickKids challenges: Assessment of phenotype and variant predictions derived from clinical and genomic data of children with undiagnosed diseases.

Affiliations

Department of Plant and Microbial Biology, University of California, Berkeley, California.

Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu, Estonia.

Publication information

Hum Mutat. 2019 Sep;40(9):1373-1391. doi: 10.1002/humu.23874. Epub 2019 Sep 3.

DOI:10.1002/humu.23874

PMID:31322791

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7318886/

Abstract

Whole-genome sequencing (WGS) holds great potential as a diagnostic test. However, the majority of patients currently undergoing WGS lack a molecular diagnosis, largely due to the vast number of undiscovered disease genes and our inability to assess the pathogenicity of most genomic variants. The CAGI SickKids challenges attempted to address this knowledge gap by assessing state-of-the-art methods for clinical phenotype prediction from genomes. CAGI4 and CAGI5 participants were provided with WGS data and clinical descriptions of 25 and 24 undiagnosed patients from the SickKids Genome Clinic Project, respectively. Predictors were asked to identify primary and secondary causal variants. In addition, for CAGI5, groups had to match each genome to one of three disorder categories (neurologic, ophthalmologic, and connective), and separately to each patient. The performance of matching genomes to categories was no better than random but two groups performed significantly better than chance in matching genomes to patients. Two of the ten variants proposed by two groups in CAGI4 were deemed to be diagnostic, and several proposed pathogenic variants in CAGI5 are good candidates for phenotype expansion. We discuss implications for improving in silico assessment of genomic variants and identifying new disease genes.
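
The abstract reports that two groups matched genomes to patients "significantly better than chance" but does not say how significance was computed. One common way to test such a claim is a permutation test, sketched below under stated assumptions: each genome is matched to exactly one patient (a one-to-one assignment), and the score is the number of correct matches. The function names `count_correct` and `permutation_p_value` are hypothetical and are not taken from the paper.

```python
import random


def count_correct(assignment, truth):
    """Count genomes assigned to their true patient.

    Both arguments map a genome ID to a patient ID (hypothetical data layout).
    """
    return sum(1 for genome, patient in assignment.items()
               if truth[genome] == patient)


def permutation_p_value(observed_correct, n_patients,
                        n_permutations=10000, seed=0):
    """Estimate P(a random one-to-one matching scores >= observed_correct).

    Assumption: under the null, every one-to-one assignment of genomes
    to patients is equally likely.
    """
    rng = random.Random(seed)
    identity = list(range(n_patients))
    hits = 0
    for _ in range(n_permutations):
        shuffled = identity[:]
        rng.shuffle(shuffled)
        # A correct match is a position left in place by the shuffle.
        correct = sum(1 for i, j in enumerate(shuffled) if i == j)
        if correct >= observed_correct:
            hits += 1
    # Add-one correction so the estimate is never exactly zero.
    return (hits + 1) / (n_permutations + 1)


if __name__ == "__main__":
    # Hypothetical submission: 5 of 24 genomes matched correctly.
    print(permutation_p_value(observed_correct=5, n_patients=24))
```

Under this null model a random matching gets one genome right on average, regardless of cohort size, so even a handful of correct matches among 24 patients can be unlikely by chance.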
