University College London, Institute of Child Health, GOSgene team London, U.K ; Department of Biomedicine, Human Genetics, Aarhus University Aarhus, Denmark.
CRBA Centro Ricerca Biomedica Applicata, Azienda Ospedaliero-Universitaria Policlinico S. Orsola - Malpighi Bologna, Italy.
Mol Genet Genomic Med. 2014 Jan;2(1):58-63. doi: 10.1002/mgg3.42. Epub 2013 Oct 11.
The choice of an appropriate variant calling pipeline for exome sequencing data is becoming increasingly more important in translational medicine projects and clinical contexts. Within GOSgene, which facilitates genetic analysis as part of a joint effort of the University College London and the Great Ormond Street Hospital, we aimed to optimize a variant calling pipeline suitable for our clinical context. We implemented the GATK/Queue framework and evaluated the performance of its two callers: the classical UnifiedGenotyper and the new variant discovery tool HaplotypeCaller. We performed an experimental validation of the loss-of-function (LoF) variants called by the two methods using Sequenom technology. UnifiedGenotyper showed a total validation rate of 97.6% for LoF single-nucleotide polymorphisms (SNPs) and 92.0% for insertions or deletions (INDELs), whereas HaplotypeCaller was 91.7% for SNPs and 55.9% for INDELs. We confirm that GATK/Queue is a reliable pipeline in translational medicine and clinical context. We conclude that in our working environment, UnifiedGenotyper is the caller of choice, being an accurate method, with a high validation rate of error-prone calls like LoF variants. We finally highlight the importance of experimental validation, especially for INDELs, as part of a standard pipeline in clinical environments.
在转化医学项目和临床环境中,选择适当的外显子组测序数据分析管道变得越来越重要。在 GOSgene 中,我们旨在优化适合我们临床环境的分析管道,该项目是伦敦大学学院和大奥蒙德街儿童医院共同努力的一部分。我们实现了 GATK/Queue 框架,并评估了其两个调用器的性能:经典的 UnifiedGenotyper 和新的变异发现工具 HaplotypeCaller。我们使用 Sequenom 技术对这两种方法调用的功能丧失(LoF)变异进行了实验验证。UnifiedGenotyper 对 LoF 单核苷酸多态性(SNP)的总验证率为 97.6%,对插入或缺失(INDEL)的总验证率为 92.0%,而 HaplotypeCaller 对 SNP 的验证率为 91.7%,对 INDEL 的验证率为 55.9%。我们确认 GATK/Queue 是转化医学和临床环境中的可靠管道。我们的结论是,在我们的工作环境中,UnifiedGenotyper 是首选的调用器,因为它是一种准确的方法,具有高的易错调用(如 LoF 变体)的验证率。我们最后强调了实验验证的重要性,特别是对于 INDELs,因为这是临床环境中标准管道的一部分。