Graham Linck Emma J, Richmond Phillip A, Tarailo-Graovac Maja, Engelke Udo, Kluijtmans Leo A J, Coene Karlien L M, Wevers Ron A, Wasserman Wyeth, van Karnebeek Clara D M, Mostafavi Sara
BC Children's Hospital Research Institute, Centre for Molecular Medicine and Therapeutics, University of British Columbia, Vancouver, Canada.
Departments of Biochemistry, Molecular Biology and Medical Genetics, Cumming School of Medicine, University of Calgary, Calgary, Canada.
NPJ Genom Med. 2020 Jul 2;5:25. doi: 10.1038/s41525-020-0132-5. eCollection 2020.
Many inborn errors of metabolism (IEMs) are amenable to treatment, therefore early diagnosis is imperative. Whole-exome sequencing (WES) variant prioritization coupled with phenotype-guided clinical and bioinformatics expertise is typically used to identify disease-causing variants; however, it can be challenging to identify the causal candidate gene when a large number of rare and potentially pathogenic variants are detected. Here, we present a network-based approach, metPropagate, that uses untargeted metabolomics (UM) data from a single patient and a group of controls to prioritize candidate genes in patients with suspected IEMs. We validate metPropagate on 107 patients with IEMs diagnosed in Miller et al. (2015) and 11 patients with both CNS and metabolic abnormalities. The metPropagate method ranks candidate genes by label propagation, a graph-smoothing algorithm that considers each gene's metabolic perturbation in addition to the network of interactions between neighbors. metPropagate was able to prioritize at least one causative gene in the top 20 percentile of candidate genes for 92% of patients with known IEMs. Applied to patients with suspected neurometabolic disease, metPropagate placed at least one causative gene in the top 20 percentile in 9/11 patients, and ranked the causative gene more highly than Exomiser's phenotype-based ranking in 6/11 patients. Interestingly, ranking by a weighted combination of metPropagate and Exomiser scores resulted in improved prioritization. The results of this study indicate that network-based analysis of UM data can provide an additional mode of evidence to prioritize causal genes in patients with suspected IEMs.
许多先天性代谢缺陷病(IEMs)都可以进行治疗,因此早期诊断至关重要。全外显子测序(WES)变异优先级排序结合表型引导的临床和生物信息学专业知识通常用于识别致病变异;然而,当检测到大量罕见且可能致病的变异时,确定因果候选基因可能具有挑战性。在这里,我们提出了一种基于网络的方法,即metPropagate,它使用来自单个患者和一组对照的非靶向代谢组学(UM)数据对疑似IEMs患者的候选基因进行优先级排序。我们在Miller等人(2015年)诊断的107例IEMs患者和11例患有中枢神经系统和代谢异常的患者中验证了metPropagate。metPropagate方法通过标签传播对候选基因进行排名,标签传播是一种图形平滑算法,除了考虑邻居之间的相互作用网络外,还考虑每个基因的代谢扰动。对于92%的已知IEMs患者,metPropagate能够在候选基因的前20%中对至少一个致病基因进行优先级排序。应用于疑似神经代谢疾病的患者,metPropagate在9/11的患者中至少将一个致病基因排在前20%,并且在6/11的患者中,将致病基因的排名高于基于Exomiser表型的排名。有趣的是,通过metPropagate和Exomiser分数的加权组合进行排名可提高优先级排序。这项研究的结果表明,基于网络的UM数据分析可以为疑似IEMs患者的因果基因优先级排序提供额外的证据模式。