Accurate and comprehensive sequencing of personal genomes.

Affiliation

Genome Informatics Section, Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA.

Publication information

Genome Res. 2011 Sep;21(9):1498-505. doi: 10.1101/gr.123638.111. Epub 2011 Jul 19.

DOI:10.1101/gr.123638.111

PMID:21771779

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3166834/

Abstract

As whole-genome sequencing becomes commoditized and we begin to sequence and analyze personal genomes for clinical and diagnostic purposes, it is necessary to understand what constitutes a complete sequencing experiment for determining genotypes and detecting single-nucleotide variants. Here, we show that the current recommendation of ∼30× coverage is not adequate to produce genotype calls across a large fraction of the genome with acceptably low error rates. Our results are based on analyses of a clinical sample sequenced on two related Illumina platforms, GAII(x) and HiSeq 2000, to a very high depth (126×). We used these data to establish genotype-calling filters that dramatically increase accuracy. We also empirically determined how the callable portion of the genome varies as a function of the amount of sequence data used. These results help provide a "sequencing guide" for future whole-genome sequencing decisions and metrics by which coverage statistics should be reported.
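
To make the relationship between sequencing depth and the callable fraction of the genome concrete, the following is a minimal sketch under an idealized assumption: per-site read depth follows a Poisson distribution around the mean coverage. This is a textbook simplification, not the empirical procedure used in the study, and the function name `fraction_at_least` and the minimum-depth threshold of 10 reads are hypothetical choices made only for illustration. Real coverage is less uniform than this model assumes, so the figures it produces are best read as an optimistic upper bound.

```python
import math


def fraction_at_least(mean_coverage: float, min_depth: int) -> float:
    """Return P(depth >= min_depth) when depth ~ Poisson(mean_coverage).

    Assumption: reads land uniformly at random, so per-site depth is
    Poisson distributed. Real genomes deviate from this (GC bias,
    repeats, mapping ambiguity).
    """
    # Accumulate P(depth = k) for k = 0 .. min_depth - 1, then take the
    # complement. Each term is built from the previous one to avoid
    # computing large factorials.
    term = math.exp(-mean_coverage)  # P(depth = 0)
    below = 0.0
    for k in range(min_depth):
        below += term
        term *= mean_coverage / (k + 1)
    return 1.0 - below


if __name__ == "__main__":
    # Hypothetical threshold: a site counts as "callable" at >= 10 reads.
    for coverage in (10, 30, 50, 126):
        frac = fraction_at_least(coverage, 10)
        print(f"{coverage:>4}x mean coverage: {frac:.4f} of sites at depth >= 10")
```

Under this model nearly every site clears a modest depth threshold at 30× mean coverage. A gap between such a prediction and what is observed in real data is the kind of shortfall that motivates measuring the callable fraction empirically.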
