Suppr超能文献

从FastQ数据到高可信度变异检测:基因组分析工具包最佳实践流程

From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline.

作者信息

Van der Auwera Geraldine A, Carneiro Mauricio O, Hartl Christopher, Poplin Ryan, Del Angel Guillermo, Levy-Moonshine Ami, Jordan Tadeusz, Shakir Khalid, Roazen David, Thibault Joel, Banks Eric, Garimella Kiran V, Altshuler David, Gabriel Stacey, DePristo Mark A

机构信息

Genome Sequencing and Analysis Group, Broad Institute, Cambridge, Massachusetts.

Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom.

出版信息

Curr Protoc Bioinformatics. 2013;43(1110):11.10.1-11.10.33. doi: 10.1002/0471250953.bi1110s43.

Abstract

This unit describes how to use BWA and the Genome Analysis Toolkit (GATK) to map genome sequencing data to a reference and produce high-quality variant calls that can be used in downstream analyses. The complete workflow includes the core NGS data processing steps that are necessary to make the raw data suitable for analysis by the GATK, as well as the key methods involved in variant discovery using the GATK.

摘要

本单元介绍如何使用BWA和基因组分析工具包(GATK)将基因组测序数据比对到参考序列,并生成可用于下游分析的高质量变异位点调用结果。完整的工作流程包括使原始数据适合GATK分析所需的核心NGS数据处理步骤,以及使用GATK进行变异位点发现所涉及的关键方法。

相似文献

1
From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline.
Curr Protoc Bioinformatics. 2013;43(1110):11.10.1-11.10.33. doi: 10.1002/0471250953.bi1110s43.
2
Impact of post-alignment processing in variant discovery from whole exome data.
BMC Bioinformatics. 2016 Oct 3;17(1):403. doi: 10.1186/s12859-016-1279-z.
3
Calling known variants and identifying new variants while rapidly aligning sequence data.
J Dairy Sci. 2019 Apr;102(4):3216-3229. doi: 10.3168/jds.2018-15172. Epub 2019 Feb 14.
5
Fast and accurate DNASeq variant calling workflow composed of LUSH toolkit.
Hum Genomics. 2024 Oct 10;18(1):114. doi: 10.1186/s40246-024-00666-w.
7
Evaluation of an optimized germline exomes pipeline using BWA-MEM2 and Dragen-GATK tools.
PLoS One. 2023 Aug 3;18(8):e0288371. doi: 10.1371/journal.pone.0288371. eCollection 2023.
8
OVarFlow: a resource optimized GATK 4 based Open source Variant calling workFlow.
BMC Bioinformatics. 2021 Aug 13;22(1):402. doi: 10.1186/s12859-021-04317-y.
9
An analytical workflow for accurate variant discovery in highly divergent regions.
BMC Genomics. 2016 Sep 2;17(1):703. doi: 10.1186/s12864-016-3045-z.
10
Towards pan-genome read alignment to improve variation calling.
BMC Genomics. 2018 May 9;19(Suppl 2):87. doi: 10.1186/s12864-018-4465-8.

引用本文的文献

1
Biallelic Variants Are Associated with Isolated Retinitis Pigmentosa.
Int J Mol Sci. 2025 Aug 25;26(17):8244. doi: 10.3390/ijms26178244.
2
Genomic Insights into Tumorigenesis in Newly Diagnosed Multiple Myeloma.
Diagnostics (Basel). 2025 Aug 23;15(17):2130. doi: 10.3390/diagnostics15172130.
4
Novel Grm6 Variant in a no b-wave (nob) Mouse Model: Phenotype Characterization and Gene Therapy.
Invest Ophthalmol Vis Sci. 2025 Sep 2;66(12):20. doi: 10.1167/iovs.66.12.20.
6
Infant Born With Autosomal Recessive Glycogen Storage Disease Type IV due to Complete Maternal Isodisomy of Chromosome 3.
Case Rep Genet. 2025 Aug 27;2025:5577571. doi: 10.1155/crig/5577571. eCollection 2025.
10
Mechanisms of Resistance to PARPi in Pancreatic Ductal Adenocarcinoma.
J Cell Mol Med. 2025 Aug;29(16):e70816. doi: 10.1111/jcmm.70816.

本文引用的文献

1
A framework for variation discovery and genotyping using next-generation DNA sequencing data.
Nat Genet. 2011 May;43(5):491-8. doi: 10.1038/ng.806. Epub 2011 Apr 10.
2
A map of human genome variation from population-scale sequencing.
Nature. 2010 Oct 28;467(7319):1061-73. doi: 10.1038/nature09534.
3
Integrating common and rare genetic variation in diverse human populations.
Nature. 2010 Sep 2;467(7311):52-8. doi: 10.1038/nature09298.
4
The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.
Genome Res. 2010 Sep;20(9):1297-303. doi: 10.1101/gr.107524.110. Epub 2010 Jul 19.
5
Fast and accurate long-read alignment with Burrows-Wheeler transform.
Bioinformatics. 2010 Mar 1;26(5):589-95. doi: 10.1093/bioinformatics/btp698. Epub 2010 Jan 15.
6
The Sequence Alignment/Map format and SAMtools.
Bioinformatics. 2009 Aug 15;25(16):2078-9. doi: 10.1093/bioinformatics/btp352. Epub 2009 Jun 8.
7
An initial map of insertion and deletion (INDEL) variation in the human genome.
Genome Res. 2006 Sep;16(9):1182-90. doi: 10.1101/gr.4565806. Epub 2006 Aug 10.
8
dbSNP: the NCBI database of genetic variation.
Nucleic Acids Res. 2001 Jan 1;29(1):308-11. doi: 10.1093/nar/29.1.308.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验