Improved assembly and variant detection of a haploid human genome using single-molecule, high-fidelity long reads.

Affiliations

Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington.

Pacific Biosciences of California, Menlo Park, California.

Publication information

Ann Hum Genet. 2020 Mar;84(2):125-140. doi: 10.1111/ahg.12364. Epub 2019 Nov 11.

DOI: 10.1111/ahg.12364
PMID: 31711268
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7015760/
Abstract

The sequence and assembly of human genomes using long-read sequencing technologies has revolutionized our understanding of structural variation and genome organization. We compared the accuracy, continuity, and gene annotation of genome assemblies generated from either high-fidelity (HiFi) or continuous long-read (CLR) datasets from the same complete hydatidiform mole human genome. We find that the HiFi sequence data assemble an additional 10% of duplicated regions and more accurately represent the structure of tandem repeats, as validated with orthogonal analyses. As a result, an additional 5 Mbp of pericentromeric sequences are recovered in the HiFi assembly, resulting in a 2.5-fold increase in the NG50 within 1 Mbp of the centromere (HiFi 480.6 kbp, CLR 191.5 kbp). Additionally, the HiFi genome assembly was generated in significantly less time with fewer computational resources than the CLR assembly. Although the HiFi assembly has significantly improved continuity and accuracy in many complex regions of the genome, it still falls short of the assembly of centromeric DNA and the largest regions of segmental duplication using existing assemblers. Despite these shortcomings, our results suggest that HiFi may be the most effective standalone technology for de novo assembly of human genomes.
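
The NG50 figures quoted in the abstract can be made concrete with a small sketch. NG50 is commonly defined as the contig length L such that contigs of length L or longer together cover at least half of an assumed genome (or region) size. The function name `ng50` and the toy contig lengths below are illustrative assumptions, not the authors' pipeline; the abstract does not describe how the pericentromeric NG50 was computed. Only the two NG50 values (480.6 kbp and 191.5 kbp) come from the text.

```python
from typing import Iterable, Optional


def ng50(contig_lengths: Iterable[int], genome_size: int) -> Optional[int]:
    """Return the NG50 of an assembly, or None if it covers < 50% of genome_size.

    NG50 is the length L such that contigs of length >= L sum to at least
    half of the assumed genome size (not half of the assembly size, which
    would be N50).
    """
    if genome_size <= 0:
        raise ValueError("genome_size must be positive")
    target = genome_size / 2
    running_total = 0
    # Walk contigs from longest to shortest until half the genome is covered.
    for length in sorted(contig_lengths, reverse=True):
        running_total += length
        if running_total >= target:
            return length
    return None


# Hypothetical toy assembly (lengths in kbp) against a 200 kbp target region.
toy_contigs = [50, 40, 30, 20, 10]
toy_ng50 = ng50(toy_contigs, genome_size=200)

# Fold change between the two NG50 values reported in the abstract.
hifi_ng50_kbp = 480.6
clr_ng50_kbp = 191.5
fold_increase = hifi_ng50_kbp / clr_ng50_kbp

print(f"toy NG50: {toy_ng50} kbp")
print(f"HiFi/CLR NG50 ratio: {fold_increase:.2f}")
```

The ratio of the two reported values rounds to 2.5, which matches the fold increase stated in the abstract. Because NG50 uses a fixed reference size as the denominator, two assemblies of the same region can be compared directly even when their total assembled lengths differ.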

