Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology, 4700 KAUST, Thuwal, 23955-6900, Kingdom of Saudi Arabia.
Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), King Abdullah University of Science and Technology, 4700 KAUST, PO Box 2882, Thuwal, 23955-6900, Kingdom of Saudi Arabia.
BMC Bioinformatics. 2019 Feb 6;20(1):65. doi: 10.1186/s12859-019-2633-8.
Prioritization of variants in personal genomic data is a major challenge. Recently, computational methods that rely on comparing phenotype similarity have shown to be useful to identify causative variants. In these methods, pathogenicity prediction is combined with a semantic similarity measure to prioritize not only variants that are likely to be dysfunctional but those that are likely involved in the pathogenesis of a patient's phenotype.
We have developed DeepPVP, a variant prioritization method that combined automated inference with deep neural networks to identify the likely causative variants in whole exome or whole genome sequence data. We demonstrate that DeepPVP performs significantly better than existing methods, including phenotype-based methods that use similar features. DeepPVP is freely available at https://github.com/bio-ontology-research-group/phenomenet-vp .
DeepPVP further improves on existing variant prioritization methods both in terms of speed as well as accuracy.
在个人基因组数据中优先考虑变异是一个主要挑战。最近,依赖于比较表型相似性的计算方法已被证明有助于识别致病变异。在这些方法中,致病性预测与语义相似性度量相结合,不仅优先考虑可能功能失调的变异,还优先考虑可能与患者表型发病机制相关的变异。
我们开发了 DeepPVP,这是一种变体优先级方法,它结合了自动推理和深度神经网络,以识别全外显子或全基因组序列数据中的可能致病变体。我们证明,DeepPVP 的性能明显优于现有的方法,包括使用类似特征的基于表型的方法。DeepPVP 可在 https://github.com/bio-ontology-research-group/phenomenet-vp 上免费获得。
DeepPVP 在速度和准确性方面都进一步改进了现有的变体优先级方法。