Suppr超能文献

通过单分子技术构建单个人类基因组的组装与二倍体结构

Assembly and diploid architecture of an individual human genome via single-molecule technologies.

作者信息

Pendleton Matthew, Sebra Robert, Pang Andy Wing Chun, Ummat Ajay, Franzen Oscar, Rausch Tobias, Stütz Adrian M, Stedman William, Anantharaman Thomas, Hastie Alex, Dai Heng, Fritz Markus Hsi-Yang, Cao Han, Cohain Ariella, Deikus Gintaras, Durrett Russell E, Blanchard Scott C, Altman Roger, Chin Chen-Shan, Guo Yan, Paxinos Ellen E, Korbel Jan O, Darnell Robert B, McCombie W Richard, Kwok Pui-Yan, Mason Christopher E, Schadt Eric E, Bashir Ali

机构信息

Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, USA.

BioNano Genomics, San Diego, California, USA.

出版信息

Nat Methods. 2015 Aug;12(8):780-6. doi: 10.1038/nmeth.3454. Epub 2015 Jun 29.

Abstract

We present the first comprehensive analysis of a diploid human genome that combines single-molecule sequencing with single-molecule genome maps. Our hybrid assembly markedly improves upon the contiguity observed from traditional shotgun sequencing approaches, with scaffold N50 values approaching 30 Mb, and we identified complex structural variants (SVs) missed by other high-throughput approaches. Furthermore, by combining Illumina short-read data with long reads, we phased both single-nucleotide variants and SVs, generating haplotypes with over 99% consistency with previous trio-based studies. Our work shows that it is now possible to integrate single-molecule and high-throughput sequence data to generate de novo assembled genomes that approach reference quality.

摘要

我们展示了对二倍体人类基因组的首次全面分析,该分析将单分子测序与单分子基因组图谱相结合。我们的混合组装显著改善了传统鸟枪法测序方法所观察到的连续性,支架N50值接近30 Mb,并且我们鉴定出了其他高通量方法遗漏的复杂结构变异(SV)。此外,通过将Illumina短读长数据与长读长数据相结合,我们对单核苷酸变异和SV进行了定相,生成了与先前基于三人组的研究一致性超过99%的单倍型。我们的工作表明,现在有可能整合单分子和高通量序列数据以生成接近参考质量的从头组装基因组。

相似文献

1
Assembly and diploid architecture of an individual human genome via single-molecule technologies.
Nat Methods. 2015 Aug;12(8):780-6. doi: 10.1038/nmeth.3454. Epub 2015 Jun 29.
2
De novo assembly and phasing of a Korean human genome.
Nature. 2016 Oct 13;538(7624):243-247. doi: 10.1038/nature20098. Epub 2016 Oct 5.
3
Fully Phased Sequence of a Diploid Human Genome Determined from the DNA of a Single Individual.
G3 (Bethesda). 2020 Sep 2;10(9):2911-2925. doi: 10.1534/g3.119.400995.
4
De novo assembly of a haplotype-resolved human genome.
Nat Biotechnol. 2015 Jun;33(6):617-22. doi: 10.1038/nbt.3200. Epub 2015 May 25.
6
Fully phased human genome assembly without parental data using single-cell strand sequencing and long reads.
Nat Biotechnol. 2021 Mar;39(3):302-308. doi: 10.1038/s41587-020-0719-5. Epub 2020 Dec 7.
7
9
phasebook: haplotype-aware de novo assembly of diploid genomes from long reads.
Genome Biol. 2021 Oct 27;22(1):299. doi: 10.1186/s13059-021-02512-x.
10
Rapid Low-Cost Assembly of the Reference Genome Using Low-Coverage, Long-Read Sequencing.
G3 (Bethesda). 2018 Oct 3;8(10):3143-3154. doi: 10.1534/g3.118.200162.

引用本文的文献

1
A 69.9-kb long inverted repeat increases genome instability in a strain of .
NAR Genom Bioinform. 2025 Jun 26;7(2):lqaf085. doi: 10.1093/nargab/lqaf085. eCollection 2025 Jun.
5
Comparisons of performances of structural variants detection algorithms in solitary or combination strategy.
PLoS One. 2025 Feb 6;20(2):e0314982. doi: 10.1371/journal.pone.0314982. eCollection 2025.
6
Chromosome-level genome assembly of Phortica okadai, a vector of Thelazia callipaeda.
Sci Data. 2024 Dec 18;11(1):1370. doi: 10.1038/s41597-024-04239-3.
7
A nearly gapless, highly contiguous reference genome for a doubled haploid line of , enabling advanced genomic studies.
For Res (Fayettev). 2024 May 13;4:e019. doi: 10.48130/forres-0024-0016. eCollection 2024.
8
Chromosome-scale genome assembly of Astragalus membranaceus using PacBio and Hi-C technologies.
Sci Data. 2024 Oct 2;11(1):1071. doi: 10.1038/s41597-024-03852-6.
9
Fc gamma receptors: Their evolution, genomic architecture, genetic variation, and impact on human disease.
Immunol Rev. 2024 Nov;328(1):65-97. doi: 10.1111/imr.13401. Epub 2024 Sep 30.

本文引用的文献

1
Assembling large genomes with single-molecule sequencing and locality-sensitive hashing.
Nat Biotechnol. 2015 Jun;33(6):623-30. doi: 10.1038/nbt.3238. Epub 2015 May 25.
2
Resolving the complexity of the human genome using single-molecule sequencing.
Nature. 2015 Jan 29;517(7536):608-11. doi: 10.1038/nature13907. Epub 2014 Nov 10.
3
Palindromic GOLGA8 core duplicons promote chromosome 15q13.3 microdeletion and evolutionary instability.
Nat Genet. 2014 Dec;46(12):1293-302. doi: 10.1038/ng.3120. Epub 2014 Oct 19.
4
Resolving complex tandem repeats with long reads.
Bioinformatics. 2014 Dec 15;30(24):3491-8. doi: 10.1093/bioinformatics/btu437. Epub 2014 Jul 15.
5
PBHoney: identifying genomic variants via long-read discordance and interrupted mapping.
BMC Bioinformatics. 2014 Jun 10;15:180. doi: 10.1186/1471-2105-15-180.
6
Whole-genome haplotyping using long reads and statistical methods.
Nat Biotechnol. 2014 Mar;32(3):261-266. doi: 10.1038/nbt.2833. Epub 2014 Feb 23.
7
Integrating human sequence data sets provides a resource of benchmark SNP and indel genotype calls.
Nat Biotechnol. 2014 Mar;32(3):246-51. doi: 10.1038/nbt.2835. Epub 2014 Feb 16.
8
Reconstructing complex regions of genomes using long-read sequencing technology.
Genome Res. 2014 Apr;24(4):688-96. doi: 10.1101/gr.168450.113. Epub 2014 Jan 13.
9
Amplification and thrifty single-molecule sequencing of recurrent somatic structural variations.
Genome Res. 2014 Feb;24(2):318-28. doi: 10.1101/gr.161497.113. Epub 2013 Dec 4.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验