• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人类基因组的染色体规模、单倍型解析组装。

Chromosome-scale, haplotype-resolved assembly of human genomes.

机构信息

Department of Genetics, Harvard Medical School, Boston, MA, USA.

Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA.

出版信息

Nat Biotechnol. 2021 Mar;39(3):309-312. doi: 10.1038/s41587-020-0711-0. Epub 2020 Dec 7.

DOI:10.1038/s41587-020-0711-0
PMID:33288905
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7954703/
Abstract

Haplotype-resolved or phased genome assembly provides a complete picture of genomes and their complex genetic variations. However, current algorithms for phased assembly either do not generate chromosome-scale phasing or require pedigree information, which limits their application. We present a method named diploid assembly (DipAsm) that uses long, accurate reads and long-range conformation data for single individuals to generate a chromosome-scale phased assembly within 1 day. Applied to four public human genomes, PGP1, HG002, NA12878 and HG00733, DipAsm produced haplotype-resolved assemblies with minimum contig length needed to cover 50% of the known genome (NG50) up to 25 Mb and phased ~99.5% of heterozygous sites at 98-99% accuracy, outperforming other approaches in terms of both contiguity and phasing completeness. We demonstrate the importance of chromosome-scale phased assemblies for the discovery of structural variants (SVs), including thousands of new transposon insertions, and of highly polymorphic and medically important regions such as the human leukocyte antigen (HLA) and killer cell immunoglobulin-like receptor (KIR) regions. DipAsm will facilitate high-quality precision medicine and studies of individual haplotype variation and population diversity.

摘要

单倍型解析或相位基因组组装提供了基因组及其复杂遗传变异的完整图景。然而,目前用于相位组装的算法要么不能生成染色体尺度的相位,要么需要系谱信息,这限制了它们的应用。我们提出了一种名为二倍体组装(DipAsm)的方法,该方法使用长的、准确的读取和长程构象数据来对单个个体进行单倍型解析组装,在 1 天内生成染色体尺度的相位组装。将 DipAsm 应用于四个公开的人类基因组 PGP1、HG002、NA12878 和 HG00733,生成的单倍型解析组装具有最小的 contig 长度,可覆盖 50%的已知基因组(NG50),达到 25Mb,并以 98-99%的准确率对~99.5%的杂合位点进行相位,在连续性和相位完整性方面都优于其他方法。我们证明了染色体尺度相位组装对于结构变异(SV)的发现的重要性,包括数千个新的转座子插入,以及高度多态性和医学上重要的区域,如人类白细胞抗原(HLA)和杀伤细胞免疫球蛋白样受体(KIR)区域。DipAsm 将促进高质量的精准医学和个体单倍型变异和人群多样性的研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1fe2/7954703/cf4dd9173053/41587_2020_711_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1fe2/7954703/e23da24c53fc/41587_2020_711_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1fe2/7954703/cf4dd9173053/41587_2020_711_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1fe2/7954703/e23da24c53fc/41587_2020_711_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1fe2/7954703/cf4dd9173053/41587_2020_711_Fig2_HTML.jpg

相似文献

1
Chromosome-scale, haplotype-resolved assembly of human genomes.人类基因组的染色体规模、单倍型解析组装。
Nat Biotechnol. 2021 Mar;39(3):309-312. doi: 10.1038/s41587-020-0711-0. Epub 2020 Dec 7.
2
Large indel detection in region-based phased diploid assemblies from linked-reads.基于连接 reads 的区域分阶段二倍体组装中的大片段插入缺失检测
BMC Genomics. 2025 Mar 18;26(Suppl 2):263. doi: 10.1186/s12864-025-11398-z.
3
Fully phased human genome assembly without parental data using single-cell strand sequencing and long reads.利用单细胞测序和长读长技术进行全相基因组组装,无需父母数据。
Nat Biotechnol. 2021 Mar;39(3):302-308. doi: 10.1038/s41587-020-0719-5. Epub 2020 Dec 7.
4
The haplotype-resolved chromosome pairs of a heterozygous diploid African cassava cultivar reveal novel pan-genome and allele-specific transcriptome features.一个杂合二倍体非洲木薯品种的单体型解析染色体对揭示了新的泛基因组和等位基因特异性转录组特征。
Gigascience. 2022 Mar 24;11. doi: 10.1093/gigascience/giac028.
5
De novo assembly and phasing of a Korean human genome.韩国人类基因组的从头组装和相位。
Nature. 2016 Oct 13;538(7624):243-247. doi: 10.1038/nature20098. Epub 2016 Oct 5.
6
Integrating read-based and population-based phasing for dense and accurate haplotyping of individual genomes.基于读取和基于群体的相位整合,实现个体基因组的密集和精确单倍型分型。
Bioinformatics. 2019 Jul 15;35(14):i242-i248. doi: 10.1093/bioinformatics/btz329.
7
Extended haplotype-phasing of long-read de novo genome assemblies using Hi-C.利用 Hi-C 对长读从头基因组组装进行扩展单倍型相位分析。
Nat Commun. 2021 Apr 28;12(1):1935. doi: 10.1038/s41467-020-20536-y.
8
Assembly of complete diploid-phased chromosomes from draft genome sequences.从草图基因组序列组装完整的二倍体相染色体。
G3 (Bethesda). 2022 Jul 29;12(8). doi: 10.1093/g3journal/jkac143.
9
A high-quality, haplotype-phased genome reconstruction reveals unexpected haplotype diversity in a pearl oyster.高质量、单体型相位基因组重建揭示了珍珠贝中意想不到的单体型多样性。
DNA Res. 2022 Dec 1;29(6). doi: 10.1093/dnares/dsac035.
10
Simultaneous de novo calling and phasing of genetic variants at chromosome-scale using NanoStrand-seq.使用纳米链测序在染色体水平上同时进行遗传变异的从头测序和定相分析。
Cell Discov. 2024 Jul 9;10(1):74. doi: 10.1038/s41421-024-00694-9.

引用本文的文献

1
Chromosome-level haplotype-resolved genome assembly provides insights into the highly heterozygous genome of Italian ryegrass (Lolium multiflorum Lam.).染色体水平单倍型解析的基因组组装为多花黑麦草(Lolium multiflorum Lam.)高度杂合的基因组提供了见解。
Plant Genome. 2025 Sep;18(3):e70079. doi: 10.1002/tpg2.70079.
2
Complex genetic variation in nearly complete human genomes.近乎完整的人类基因组中的复杂遗传变异。
Nature. 2025 Jul 23. doi: 10.1038/s41586-025-09140-6.
3
Toward a Kinh Vietnamese Reference Genome: Constructing a De Novo Genome Assembly Using Long-Read Sequencing and Optical Mapping.

本文引用的文献

1
A haplotype-aware de novo assembly of related individuals using pedigree sequence graph.基于家系序列图的相关个体的单体型感知从头组装。
Bioinformatics. 2020 Apr 15;36(8):2385-2392. doi: 10.1093/bioinformatics/btz942.
2
Accurate circular consensus long-read sequencing improves variant detection and assembly of a human genome.精确的圆形共识长读测序提高了人类基因组变异检测和组装的准确性。
Nat Biotechnol. 2019 Oct;37(10):1155-1162. doi: 10.1038/s41587-019-0217-9. Epub 2019 Aug 12.
3
Multi-platform discovery of haplotype-resolved structural variation in human genomes.
迈向京族越南人参考基因组:利用长读长测序和光学图谱构建从头基因组组装
Genes (Basel). 2025 Apr 29;16(5):536. doi: 10.3390/genes16050536.
4
SVHunter: long-read-based structural variation detection through the transformer model.SVHunter:通过变压器模型进行基于长读长的结构变异检测。
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf203.
5
The haplotype-resolved T2T genome for Bauhinia × blakeana sheds light on the genetic basis of flower heterosis.洋紫荆的单倍型解析T2T基因组揭示了花杂种优势的遗传基础。
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf044.
6
Establishing genome sequencing and assembly for non-model and emerging model organisms: a brief guide.为非模式生物和新兴模式生物建立基因组测序与组装:简要指南
Front Zool. 2025 Apr 17;22(1):7. doi: 10.1186/s12983-025-00561-7.
7
Allelic variation and duplication of the dmrt1 were associated with sex chromosome turnover in three representative Scatophagidae fish species.在三种具有代表性的鲹科鱼类中,dmrt1的等位基因变异和重复与性染色体更替有关。
Commun Biol. 2025 Apr 17;8(1):627. doi: 10.1038/s42003-025-08056-1.
8
A Hitchhiker's Guide to long-read genomic analysis.长读长基因组分析指南
Genome Res. 2025 Apr 14;35(4):545-558. doi: 10.1101/gr.279975.124.
9
Genome assembly resources of genitourinary cancers for chromosomal aberration at the single nucleotide level.用于单核苷酸水平染色体畸变研究的泌尿生殖系统癌症基因组组装资源。
Sci Data. 2025 Apr 1;12(1):550. doi: 10.1038/s41597-025-04801-7.
10
Chromosome-scale genome assembly of Phyllanthus emblica L. 'Yingyu'.余甘子‘英玉’的染色体级基因组组装
DNA Res. 2025 Mar 1;32(2). doi: 10.1093/dnares/dsaf006.
多平台发现人类基因组中单体型分辨率结构变异。
Nat Commun. 2019 Apr 16;10(1):1784. doi: 10.1038/s41467-018-08148-z.
4
An open resource for accurately benchmarking small variant and reference calls.用于准确基准测试小型变体和参考调用的开放资源。
Nat Biotechnol. 2019 May;37(5):561-566. doi: 10.1038/s41587-019-0074-6. Epub 2019 Apr 1.
5
Walking along chromosomes with super-resolution imaging, contact maps, and integrative modeling.用超高分辨率成像、接触图谱和整合建模沿着染色体行走。
PLoS Genet. 2018 Dec 26;14(12):e1007872. doi: 10.1371/journal.pgen.1007872. eCollection 2018 Dec.
6
De novo assembly of haplotype-resolved genomes with trio binning.利用三人分箱法对单倍型解析基因组进行从头组装。
Nat Biotechnol. 2018 Oct 22. doi: 10.1038/nbt.4277.
7
A universal SNP and small-indel variant caller using deep neural networks.使用深度神经网络的通用 SNP 和小插入缺失变体调用器。
Nat Biotechnol. 2018 Nov;36(10):983-987. doi: 10.1038/nbt.4235. Epub 2018 Sep 24.
8
A synthetic-diploid benchmark for accurate variant-calling evaluation.用于准确变异呼叫评估的合成二倍体基准。
Nat Methods. 2018 Aug;15(8):595-597. doi: 10.1038/s41592-018-0054-7. Epub 2018 Jul 16.
9
A graph-based approach to diploid genome assembly.基于图的二倍体基因组组装方法。
Bioinformatics. 2018 Jul 1;34(13):i105-i114. doi: 10.1093/bioinformatics/bty279.
10
Direct determination of diploid genome sequences.二倍体基因组序列的直接测定。
Genome Res. 2017 May;27(5):757-767. doi: 10.1101/gr.214874.116. Epub 2017 Apr 5.