• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

超过 100 个人类基因组的全基因组序列和经过实验相位的单倍型。

The whole genome sequences and experimentally phased haplotypes of over 100 personal genomes.

机构信息

Complete Genomics, Inc., 2071 Stierlin Ct., Mountain View, CA, 94043, USA.

Harvard Personal Genome Project, Harvard Medical School, NRB 238, 77 Avenue Louis Pasteur, Boston, MA, 02115, USA.

出版信息

Gigascience. 2016 Oct 11;5(1):42. doi: 10.1186/s13742-016-0148-z.

DOI:10.1186/s13742-016-0148-z
PMID:27724973
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5057367/
Abstract

BACKGROUND

Since the completion of the Human Genome Project in 2003, it is estimated that more than 200,000 individual whole human genomes have been sequenced. A stunning accomplishment in such a short period of time. However, most of these were sequenced without experimental haplotype data and are therefore missing an important aspect of genome biology. In addition, much of the genomic data is not available to the public and lacks phenotypic information.

FINDINGS

As part of the Personal Genome Project, blood samples from 184 participants were collected and processed using Complete Genomics' Long Fragment Read technology. Here, we present the experimental whole genome haplotyping and sequencing of these samples to an average read coverage depth of 100X. This is approximately three-fold higher than the read coverage applied to most whole human genome assemblies and ensures the highest quality results. Currently, 114 genomes from this dataset are freely available in the GigaDB repository and are associated with rich phenotypic data; the remaining 70 should be added in the near future as they are approved through the PGP data release process. For reproducibility analyses, 20 genomes were sequenced at least twice using independent LFR barcoded libraries. Seven genomes were also sequenced using Complete Genomics' standard non-barcoded library process. In addition, we report 2.6 million high-quality, rare variants not previously identified in the Single Nucleotide Polymorphisms database or the 1000 Genomes Project Phase 3 data.

CONCLUSIONS

These genomes represent a unique source of haplotype and phenotype data for the scientific community and should help to expand our understanding of human genome evolution and function.

摘要

背景

自 2003 年人类基因组计划完成以来,据估计已经有超过 20 万个完整的人类基因组被测序。在如此短的时间内取得了惊人的成就。然而,其中大多数基因组在测序时没有实验单倍型数据,因此缺少了基因组生物学的一个重要方面。此外,大部分基因组数据公众无法获取,且缺乏表型信息。

发现

作为个人基因组计划的一部分,从 184 名参与者身上采集了血液样本,并用 Complete Genomics 的长片段读取技术对其进行了处理。在这里,我们展示了这些样本的实验性全基因组单倍型测序和测序,平均读取覆盖率达到 100X。这大约是大多数全人类基因组组装应用的读取覆盖率的三倍,可确保最高质量的结果。目前,该数据集的 114 个基因组可在 GigaDB 存储库中免费获取,且与丰富的表型数据相关联;其余 70 个基因组应在不久的将来添加,因为它们将通过 PGP 数据发布过程获得批准。为了进行可重复性分析,使用独立的 LFR 条形码文库至少对 20 个基因组进行了两次测序。还使用 Complete Genomics 的标准非条形码文库过程对 7 个基因组进行了测序。此外,我们报告了 260 万个高质量的罕见变体,这些变体以前未在单核苷酸多态性数据库或 1000 基因组计划第三阶段数据中被识别。

结论

这些基因组代表了单倍型和表型数据的独特来源,可供科学界使用,应该有助于扩大我们对人类基因组进化和功能的理解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dd4/5057367/9dd2248e7390/13742_2016_148_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dd4/5057367/f08a92433a3a/13742_2016_148_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dd4/5057367/67ae758e547e/13742_2016_148_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dd4/5057367/a072616744c2/13742_2016_148_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dd4/5057367/f536d11c5bef/13742_2016_148_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dd4/5057367/6c4459e41b00/13742_2016_148_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dd4/5057367/089ba88b5e3c/13742_2016_148_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dd4/5057367/6c139dafa7d4/13742_2016_148_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dd4/5057367/9dd2248e7390/13742_2016_148_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dd4/5057367/f08a92433a3a/13742_2016_148_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dd4/5057367/67ae758e547e/13742_2016_148_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dd4/5057367/a072616744c2/13742_2016_148_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dd4/5057367/f536d11c5bef/13742_2016_148_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dd4/5057367/6c4459e41b00/13742_2016_148_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dd4/5057367/089ba88b5e3c/13742_2016_148_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dd4/5057367/6c139dafa7d4/13742_2016_148_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dd4/5057367/9dd2248e7390/13742_2016_148_Fig8_HTML.jpg

相似文献

1
The whole genome sequences and experimentally phased haplotypes of over 100 personal genomes.超过 100 个人类基因组的全基因组序列和经过实验相位的单倍型。
Gigascience. 2016 Oct 11;5(1):42. doi: 10.1186/s13742-016-0148-z.
2
A Fosmid Pool-Based Next Generation Sequencing Approach to Haplotype-Resolve Whole Genomes.一种基于Fosmid文库的新一代测序方法用于单倍型解析全基因组。
Methods Mol Biol. 2017;1551:223-269. doi: 10.1007/978-1-4939-6750-6_13.
3
Long Fragment Read (LFR) Technology: Cost-Effective, High-Quality Genome-Wide Molecular Haplotyping.长片段读取(LFR)技术:具有成本效益的高质量全基因组分子单倍型分型
Methods Mol Biol. 2017;1551:191-205. doi: 10.1007/978-1-4939-6750-6_11.
4
Haplotyping germline and cancer genomes with high-throughput linked-read sequencing.利用高通量连锁读长测序对种系和癌症基因组进行单倍型分型
Nat Biotechnol. 2016 Mar;34(3):303-11. doi: 10.1038/nbt.3432. Epub 2016 Feb 1.
5
Accurate whole-genome sequencing and haplotyping from 10 to 20 human cells.从 10 到 20 个人类细胞中进行准确的全基因组测序和单倍型分型。
Nature. 2012 Jul 11;487(7406):190-5. doi: 10.1038/nature11236.
6
Accurate Sequencing and Haplotyping from 10 Cells Using Long Fragment Read (LFR) Technology.使用长片段读长(LFR)技术从 10 个细胞中进行准确测序和单体型分析。
Methods Mol Biol. 2023;2590:71-84. doi: 10.1007/978-1-0716-2819-5_5.
7
Dense and accurate whole-chromosome haplotyping of individual genomes.个体基因组的密集且精确的全染色体单倍型分型。
Nat Commun. 2017 Nov 3;8(1):1293. doi: 10.1038/s41467-017-01389-4.
8
Whole genome sequence analysis of BT-474 using complete Genomics' standard and long fragment read technologies.使用Complete Genomics的标准和长片段读取技术对BT-474进行全基因组序列分析。
Gigascience. 2016 Feb 9;5:8. doi: 10.1186/s13742-016-0113-x. eCollection 2016.
9
Haplotype phasing of whole human genomes using bead-based barcode partitioning in a single tube.基于珠基条码分区的单管全人类基因组单体型定相
Nat Biotechnol. 2017 Sep;35(9):852-857. doi: 10.1038/nbt.3897. Epub 2017 Jun 26.
10
Linked read sequencing resolves complex genomic rearrangements in gastric cancer metastases.关联阅读测序解析胃癌转移灶中的复杂基因组重排。
Genome Med. 2017 Jun 19;9(1):57. doi: 10.1186/s13073-017-0447-8.

引用本文的文献

1
Herpes Simplex Virus 1 Infection of Human Brain Organoids and Pancreatic Stem Cell-Islets Drives Organoid-Specific Transcripts Associated with Alzheimer's Disease and Autoimmune Diseases.单纯疱疹病毒1型感染人脑类器官和胰腺干细胞胰岛会驱动与阿尔茨海默病和自身免疫性疾病相关的类器官特异性转录本。
Cells. 2024 Nov 29;13(23):1978. doi: 10.3390/cells13231978.
2
An efficient molecular genetic testing strategy for incontinentia pigmenti based on single-tube long fragment read sequencing.一种基于单管长片段读取测序的色素失禁症高效分子遗传学检测策略。
NPJ Genom Med. 2024 May 29;9(1):32. doi: 10.1038/s41525-024-00421-z.
3
Reliable multiplex generation of pooled induced pluripotent stem cells.

本文引用的文献

1
Deep sequencing of 10,000 human genomes.一万个人类基因组的深度测序。
Proc Natl Acad Sci U S A. 2016 Oct 18;113(42):11901-11906. doi: 10.1073/pnas.1613365113. Epub 2016 Oct 4.
2
New observations on maternal age effect on germline de novo mutations.关于母亲年龄对生殖系新生突变影响的新观察结果。
Nat Commun. 2016 Jan 19;7:10486. doi: 10.1038/ncomms10486.
3
A global reference for human genetic variation.人类遗传变异的全球参考。
可靠的诱导多能干细胞池的多重生成。
Cell Rep Methods. 2023 Sep 25;3(9):100570. doi: 10.1016/j.crmeth.2023.100570. Epub 2023 Aug 31.
4
Modeling of mitochondrial genetic polymorphisms reveals induction of heteroplasmy by pleiotropic disease locus 10398A>G.线粒体遗传多态性建模显示多效性疾病位点 10398A>G 诱导异质性。
Sci Rep. 2023 Jun 27;13(1):10405. doi: 10.1038/s41598-023-37541-y.
5
Analysis of 1276 Haplotype-Resolved Genomes Allows Characterization of Cis- and Trans-Abundant Genes.分析 1276 个单倍型解析基因组,可对顺式和反式丰富基因进行特征分析。
Methods Mol Biol. 2023;2590:237-272. doi: 10.1007/978-1-0716-2819-5_15.
6
Establishing analytical validity of BeadChip array genotype data by comparison to whole-genome sequence and standard benchmark datasets.通过与全基因组序列和标准基准数据集进行比较,确立 BeadChip 芯片基因分型数据的分析有效性。
BMC Med Genomics. 2022 Mar 14;15(1):56. doi: 10.1186/s12920-022-01199-8.
7
Discovery of genomic variation across a generation.跨越一代的基因组变异发现。
Hum Mol Genet. 2021 Oct 1;30(R2):R174-R186. doi: 10.1093/hmg/ddab209.
8
GenomeChronicler: The Personal Genome Project UK Genomic Report Generator Pipeline.基因组编年史家:英国个人基因组计划基因组报告生成管道。
Front Genet. 2020 Sep 24;11:518644. doi: 10.3389/fgene.2020.518644. eCollection 2020.
9
Efficient and unique cobarcoding of second-generation sequencing reads from long DNA molecules enabling cost-effective and accurate sequencing, haplotyping, and de novo assembly.高效且独特的第二代测序读长 DNA 分子 cobarcoding,实现经济高效、准确的测序、单倍型分析和从头组装。
Genome Res. 2019 May;29(5):798-808. doi: 10.1101/gr.245126.118. Epub 2019 Apr 2.
10
Significant abundance of cis configurations of coding variants in diploid human genomes.二倍体人类基因组中编码变异的顺式构型丰度显著。
Nucleic Acids Res. 2019 Apr 8;47(6):2981-2995. doi: 10.1093/nar/gkz031.
Nature. 2015 Oct 1;526(7571):68-74. doi: 10.1038/nature15393.
4
The UK10K project identifies rare variants in health and disease.英国万人基因组计划识别健康与疾病中的罕见变异。
Nature. 2015 Oct 1;526(7571):82-90. doi: 10.1038/nature14962. Epub 2015 Sep 14.
5
Genome-wide patterns and properties of de novo mutations in humans.人类新生突变的全基因组模式与特性
Nat Genet. 2015 Jul;47(7):822-826. doi: 10.1038/ng.3292. Epub 2015 May 18.
6
Detection and phasing of single base de novo mutations in biopsies from human in vitro fertilized embryos by advanced whole-genome sequencing.通过先进的全基因组测序检测和定相人类体外受精胚胎活检中的单碱基新生突变。
Genome Res. 2015 Mar;25(3):426-34. doi: 10.1101/gr.181255.114. Epub 2015 Feb 11.
7
Co-barcoded sequence reads from long DNA fragments: a cost-effective solution for "perfect genome" sequencing.来自长DNA片段的共条形码序列读取:一种用于“完美基因组”测序的经济高效解决方案。
Front Genet. 2015 Jan 14;5:466. doi: 10.3389/fgene.2014.00466. eCollection 2014.
8
Whole-genome sequence variation, population structure and demographic history of the Dutch population.荷兰人群的全基因组序列变异、种群结构和人口历史。
Nat Genet. 2014 Aug;46(8):818-25. doi: 10.1038/ng.3021. Epub 2014 Jun 29.
9
Technology: The $1,000 genome.技术:千美元基因组。
Nature. 2014 Mar 20;507(7492):294-5. doi: 10.1038/507294a.
10
Detection of clinically relevant genetic variants in autism spectrum disorder by whole-genome sequencing.通过全基因组测序检测自闭症谱系障碍中的临床相关遗传变异。
Am J Hum Genet. 2013 Aug 8;93(2):249-63. doi: 10.1016/j.ajhg.2013.06.012. Epub 2013 Jul 11.