人类基因组变异的图谱来自于基于人群的测序。

A map of human genome variation from population-scale sequencing.

出版信息

Nature. 2010 Oct 28;467(7319):1061-73. doi: 10.1038/nature09534.

PMID:20981092

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3042601/

Abstract

The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation as a foundation for investigating the relationship between genotype and phenotype. Here we present results of the pilot phase of the project, designed to develop and compare different strategies for genome-wide sequencing with high-throughput platforms. We undertook three projects: low-coverage whole-genome sequencing of 179 individuals from four populations; high-coverage sequencing of two mother-father-child trios; and exon-targeted sequencing of 697 individuals from seven populations. We describe the location, allele frequency and local haplotype structure of approximately 15 million single nucleotide polymorphisms, 1 million short insertions and deletions, and 20,000 structural variants, most of which were previously undescribed. We show that, because we have catalogued the vast majority of common variation, over 95% of the currently accessible variants found in any individual are present in this data set. On average, each person is found to carry approximately 250 to 300 loss-of-function variants in annotated genes and 50 to 100 variants previously implicated in inherited disorders. We demonstrate how these results can be used to inform association and functional studies. From the two trios, we directly estimate the rate of de novo germline base substitution mutations to be approximately 10(-8) per base pair per generation. We explore the data with regard to signatures of natural selection, and identify a marked reduction of genetic variation in the neighbourhood of genes, due to selection at linked sites. These methods and public data will support the next phase of human genetic research.

摘要

1000 基因组计划旨在深入描述人类基因组序列变异，以此作为研究基因型与表型之间关系的基础。在此，我们呈现该计划先导阶段的研究结果，旨在开发和比较不同的策略，利用高通量平台进行全基因组测序。我们开展了三个项目：对来自四个群体的 179 个人进行低覆盖率全基因组测序；对两个母子-父子三人组进行高覆盖率测序；对来自七个群体的 697 个人进行外显子靶向测序。我们描述了约 1500 万个单核苷酸多态性、100 万个短插入和缺失以及 20000 个结构变异的位置、等位基因频率和局部单倍型结构，其中大多数是以前未描述的。我们表明，由于我们已经编目了绝大多数常见变异，因此目前在任何个体中可获得的变异中，有超过 95%都存在于这个数据集。平均而言，每个人被发现携带大约 250 到 300 个在注释基因中失活的变异，以及 50 到 100 个先前与遗传性疾病有关的变异。我们展示了如何利用这些结果来指导关联和功能研究。从这两个三人组中，我们直接估计新生种系碱基替换突变的发生率约为每个碱基对每代 10(-8)。我们探讨了数据中自然选择的特征，并确定了由于连锁位点的选择，基因周围的遗传变异明显减少。这些方法和公共数据将支持人类遗传研究的下一阶段。

相似文献

A map of human genome variation from population-scale sequencing.

Nature. 2010 Oct 28;467(7319):1061-73. doi: 10.1038/nature09534.

Comprehensive characterization of human genome variation by high coverage whole-genome sequencing of forty four Caucasians.

PLoS One. 2013;8(4):e59494. doi: 10.1371/journal.pone.0059494. Epub 2013 Apr 5.

A global reference for human genetic variation.

Nature. 2015 Oct 1;526(7571):68-74. doi: 10.1038/nature15393.

An integrated map of genetic variation from 1,092 human genomes.

Nature. 2012 Nov 1;491(7422):56-65. doi: 10.1038/nature11632.

Genomics: In search of rare human variants.

Nature. 2010 Oct 28;467(7319):1050-1. doi: 10.1038/4671050a.

The complete genome of an individual by massively parallel DNA sequencing.

Nature. 2008 Apr 17;452(7189):872-6. doi: 10.1038/nature06884.

Sequencing and de novo assembly of 150 genomes from Denmark as a population reference.

Nature. 2017 Aug 3;548(7665):87-91. doi: 10.1038/nature23264. Epub 2017 Jul 26.

Singapore Genome Variation Project: a haplotype map of three Southeast Asian populations.

Genome Res. 2009 Nov;19(11):2154-62. doi: 10.1101/gr.095000.109. Epub 2009 Aug 21.

A haplotype map of the human genome.

Nature. 2005 Oct 27;437(7063):1299-320. doi: 10.1038/nature04226.

Whole-genome sequence variation, population structure and demographic history of the Dutch population.

Nat Genet. 2014 Aug;46(8):818-25. doi: 10.1038/ng.3021. Epub 2014 Jun 29.

引用本文的文献

Augmenting cost-effectiveness in clinical diagnosis using extended whole-exome sequencing: SNVs, SVs, and beyond.

J Hum Genet. 2025 Sep 8. doi: 10.1038/s10038-025-01403-4.

Finding easy regions for short-read variant calling from pangenome data.

Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf103.

Plasma metabolic landscape unveils key regulators of leukemia subtype progression.

Future Sci OA. 2025 Dec;11(1):2527015. doi: 10.1080/20565623.2025.2527015. Epub 2025 Sep 1.

Winner's curse in rare variant analysis: effect size estimation bias depends on effect direction and the association method used.

Front Genet. 2025 Aug 8;16:1416673. doi: 10.3389/fgene.2025.1416673. eCollection 2025.

Constipation and Parkinson disease: A 2-sample bidirectional Mendelian randomization analysis.

Medicine (Baltimore). 2025 Aug 22;104(34):e43240. doi: 10.1097/MD.0000000000043240.

Benchmarking of low coverage sequencing workflows for precision genotyping in eggplant.

BMC Plant Biol. 2025 Aug 25;25(1):1125. doi: 10.1186/s12870-025-07242-x.

Increasing pathogenic germline variant diagnosis rates in precision medicine: current best practices and future opportunities.

Hum Genomics. 2025 Aug 22;19(1):97. doi: 10.1186/s40246-025-00811-z.

Exploring the common genetic basis of metabolic syndrome-related diseases and chronic kidney disease: insights from extensive genome-wide cross-trait analyses.

BioData Min. 2025 Aug 17;18(1):54. doi: 10.1186/s13040-025-00472-7.

Leveraging multimodal neuroimaging and GWAS for identifying modality-level causal pathways to Alzheimer's disease.

Imaging Neurosci (Camb). 2025 May 16;3. doi: 10.1162/imag_a_00580. eCollection 2025.

Finding easy regions for short-read variant calling from pangenome data.

ArXiv. 2025 Aug 8:arXiv:2507.03718v2.

本文引用的文献

MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes.

Genet Epidemiol. 2010 Dec;34(8):816-34. doi: 10.1002/gepi.20533.

Dindel: accurate indel calls from short-read data.

Genome Res. 2011 Jun;21(6):961-73. doi: 10.1101/gr.112326.110. Epub 2010 Oct 27.

Exome sequencing, ANGPTL3 mutations, and familial combined hypolipidemia.

N Engl J Med. 2010 Dec 2;363(23):2220-7. doi: 10.1056/NEJMoa1002926. Epub 2010 Oct 13.

Integrating common and rare genetic variation in diverse human populations.

Nature. 2010 Sep 2;467(7311):52-8. doi: 10.1038/nature09298.

Association of trypanolytic ApoL1 variants with kidney disease in African Americans.

Science. 2010 Aug 13;329(5993):841-5. doi: 10.1126/science.1193032. Epub 2010 Jul 15.

Genotype imputation for genome-wide association studies.

Nat Rev Genet. 2010 Jul;11(7):499-511. doi: 10.1038/nrg2796.

High-throughput sequencing reveals extensive variation in human-specific L1 content in individual human genomes.

Genome Res. 2010 Sep;20(9):1262-70. doi: 10.1101/gr.106419.110. Epub 2010 May 20.

Variants within the immunoregulatory CBLB gene are associated with multiple sclerosis.

Nat Genet. 2010 Jun;42(6):495-7. doi: 10.1038/ng.584. Epub 2010 May 9.

Meta-analysis and imputation refines the association of 15q25 with smoking quantity.

Nat Genet. 2010 May;42(5):436-40. doi: 10.1038/ng.572. Epub 2010 Apr 25.

Genome-wide association study of CNVs in 16,000 cases of eight common diseases and 3,000 shared controls.

Nature. 2010 Apr 1;464(7289):713-20. doi: 10.1038/nature08979.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

人类基因组变异的图谱来自于基于人群的测序。

A map of human genome variation from population-scale sequencing.

出版信息

Nature. 2010 Oct 28;467(7319):1061-73. doi: 10.1038/nature09534.

DOI:10.1038/nature09534

PMID:20981092

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3042601/

Abstract

摘要

人类基因组变异的图谱来自于基于人群的测序。

A map of human genome variation from population-scale sequencing.

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

人类基因组变异的图谱来自于基于人群的测序。

A map of human genome variation from population-scale sequencing.

出版信息