• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相位图:基于长读长的二倍体基因组单体型感知从头组装

phasebook: haplotype-aware de novo assembly of diploid genomes from long reads.

机构信息

Life Science & Health, Centrum Wiskunde & Informatica, Amsterdam, The Netherlands.

Genome Data Science, Faculty of Technology, Bielefeld University, Bielefeld, Germany.

出版信息

Genome Biol. 2021 Oct 27;22(1):299. doi: 10.1186/s13059-021-02512-x.

DOI:10.1186/s13059-021-02512-x
PMID:34706745
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8549298/
Abstract

Haplotype-aware diploid genome assembly is crucial in genomics, precision medicine, and many other disciplines. Long-read sequencing technologies have greatly improved genome assembly. However, current long-read assemblers are either reference based, so introduce biases, or fail to capture the haplotype diversity of diploid genomes. We present phasebook, a de novo approach for reconstructing the haplotypes of diploid genomes from long reads. phasebook outperforms other approaches in terms of haplotype coverage by large margins, in addition to achieving competitive performance in terms of assembly errors and assembly contiguity.

摘要

单体型感知的二倍体基因组组装在基因组学、精准医学和许多其他领域都至关重要。长读测序技术极大地提高了基因组组装的质量。然而,目前的长读序列组装方法要么基于参考序列,从而引入偏差,要么无法捕获二倍体基因组的单体型多样性。我们提出了 phasebook,这是一种从长读序列中重建二倍体基因组单体型的从头方法。phasebook 在单体型覆盖度方面的表现优于其他方法,同时在组装错误和组装连续性方面也具有竞争力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/198b/8549298/92f69965069d/13059_2021_2512_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/198b/8549298/80e5c3efc3a1/13059_2021_2512_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/198b/8549298/ae35c5178674/13059_2021_2512_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/198b/8549298/92f69965069d/13059_2021_2512_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/198b/8549298/80e5c3efc3a1/13059_2021_2512_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/198b/8549298/ae35c5178674/13059_2021_2512_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/198b/8549298/92f69965069d/13059_2021_2512_Fig3_HTML.jpg

相似文献

1
phasebook: haplotype-aware de novo assembly of diploid genomes from long reads.相位图:基于长读长的二倍体基因组单体型感知从头组装
Genome Biol. 2021 Oct 27;22(1):299. doi: 10.1186/s13059-021-02512-x.
2
De novo diploid genome assembly using long noisy reads.从头组装具有长噪声读长的二倍体基因组。
Nat Commun. 2024 Apr 5;15(1):2964. doi: 10.1038/s41467-024-47349-7.
3
SpLitteR: diploid genome assembly using TELL-Seq linked-reads and assembly graphs.SpLitter:利用 TELL-Seq 连接读取和组装图进行二倍体基因组组装。
PeerJ. 2024 Sep 27;12:e18050. doi: 10.7717/peerj.18050. eCollection 2024.
4
Telomere-to-telomere assembly of diploid chromosomes with Verkko.利用 Verkko 进行二倍体染色体的端粒到端粒组装。
Nat Biotechnol. 2023 Oct;41(10):1474-1482. doi: 10.1038/s41587-023-01662-6. Epub 2023 Feb 16.
5
Graphasing: phasing diploid genome assembly graphs with single-cell strand sequencing.Graphasing:利用单细胞测序进行二倍体基因组组装图谱的相位分析。
Genome Biol. 2024 Oct 10;25(1):265. doi: 10.1186/s13059-024-03409-1.
6
Longshot enables accurate variant calling in diploid genomes from single-molecule long read sequencing.Longshot 可通过单分子长读测序对二倍体基因组进行准确的变异调用。
Nat Commun. 2019 Oct 11;10(1):4660. doi: 10.1038/s41467-019-12493-y.
7
Haplotyping-Assisted Diploid Assembly and Variant Detection with Linked Reads.基于连锁reads 的单体型辅助二倍体组装和变异检测。
Methods Mol Biol. 2023;2590:161-182. doi: 10.1007/978-1-0716-2819-5_11.
8
Semi-automated assembly of high-quality diploid human reference genomes.半自动组装高质量的二倍体人类参考基因组。
Nature. 2022 Nov;611(7936):519-531. doi: 10.1038/s41586-022-05325-5. Epub 2022 Oct 19.
9
Purge Haplotigs: allelic contig reassignment for third-gen diploid genome assemblies.清除单倍型:三代二倍体基因组组装的等位基因 contig 重新分配。
BMC Bioinformatics. 2018 Nov 29;19(1):460. doi: 10.1186/s12859-018-2485-7.
10
gcaPDA: a haplotype-resolved diploid assembler.gcaPDA:一种单倍型解析的二倍体组装器。
BMC Bioinformatics. 2022 Feb 14;23(1):68. doi: 10.1186/s12859-022-04591-4.

引用本文的文献

1
Interlaboratory evaluation of high molecular weight DNA extraction methods for long-read sequencing and structural variant analysis.用于长读长测序和结构变异分析的高分子量DNA提取方法的实验室间评估。
BMC Genomics. 2025 Jul 28;26(1):698. doi: 10.1186/s12864-025-11792-7.
2
Repeat and haplotype aware error correction in nanopore sequencing reads with DeChat.使用DeChat对纳米孔测序读数进行重复和单倍型感知错误校正。
Commun Biol. 2024 Dec 19;7(1):1678. doi: 10.1038/s42003-024-07376-y.
3
DeepHapNet: a haplotype assembly method based on RetNet and deep spectral clustering.

本文引用的文献

1
Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm.使用带有 hifiasm 的相定装配图进行单体型解析从头组装。
Nat Methods. 2021 Feb;18(2):170-175. doi: 10.1038/s41592-020-01056-5. Epub 2021 Feb 1.
2
Scalable long read self-correction and assembly polishing with multiple sequence alignment.可扩展的长读自我纠错和多重序列比对的组装优化。
Sci Rep. 2021 Jan 12;11(1):761. doi: 10.1038/s41598-020-80757-5.
3
Efficient assembly of nanopore reads via highly accurate and intact error correction.通过高度准确和完整的纠错实现纳米孔读取的高效组装。
深度单倍型网络:一种基于RetNet和深度谱聚类的单倍型组装方法。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae656.
4
Generating barcodes for nanopore sequencing data with PRO.使用PRO为纳米孔测序数据生成条形码。
Fundam Res. 2024 Apr 25;4(4):785-794. doi: 10.1016/j.fmre.2024.04.014. eCollection 2024 Jul.
5
Graphasing: phasing diploid genome assembly graphs with single-cell strand sequencing.Graphasing:利用单细胞测序进行二倍体基因组组装图谱的相位分析。
Genome Biol. 2024 Oct 10;25(1):265. doi: 10.1186/s13059-024-03409-1.
6
Chromosome-level subgenome-aware de novo assembly provides insight into genome divergence after hybridization.基于染色体级别的亚基因组感知从头组装揭示了杂交后基因组分化的机制。
Genome Res. 2024 Nov 20;34(11):2133-2146. doi: 10.1101/gr.279364.124.
7
Pangenome Identification and Analysis of Terpene Synthase Gene Family Members in .泛基因组鉴定与萜烯合酶基因家族成员分析
Int J Mol Sci. 2024 Sep 6;25(17):9677. doi: 10.3390/ijms25179677.
8
GCphase: an SNP phasing method using a graph partition and error correction algorithm.GC 相:一种使用图划分和错误纠正算法的 SNP 相位方法。
BMC Bioinformatics. 2024 Aug 19;25(1):267. doi: 10.1186/s12859-024-05901-8.
9
Rockfish: A transformer-based model for accurate 5-methylcytosine prediction from nanopore sequencing.岩鱼:基于转换器的模型,可从纳米孔测序中准确预测 5-甲基胞嘧啶。
Nat Commun. 2024 Jul 3;15(1):5580. doi: 10.1038/s41467-024-49847-0.
10
Scalable telomere-to-telomere assembly for diploid and polyploid genomes with double graph.使用双图进行二倍体和多倍体基因组的可扩展端粒到端粒组装。
Nat Methods. 2024 Jun;21(6):967-970. doi: 10.1038/s41592-024-02269-8. Epub 2024 May 10.
Nat Commun. 2021 Jan 4;12(1):60. doi: 10.1038/s41467-020-20236-7.
4
Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies.Merqury:基因组组装的无参考质量、完整性和相位评估。
Genome Biol. 2020 Sep 14;21(1):245. doi: 10.1186/s13059-020-02134-9.
5
HiCanu: accurate assembly of segmental duplications, satellites, and allelic variants from high-fidelity long reads.HiCanu:从高保真长读段中精确组装片段重复、卫星和等位基因变体。
Genome Res. 2020 Sep;30(9):1291-1305. doi: 10.1101/gr.263566.120. Epub 2020 Aug 14.
6
Nanopore sequencing and the Shasta toolkit enable efficient de novo assembly of eleven human genomes.纳米孔测序和 Shasta 工具包可实现 11 个人类基因组的高效从头组装。
Nat Biotechnol. 2020 Sep;38(9):1044-1053. doi: 10.1038/s41587-020-0503-6. Epub 2020 May 4.
7
Telomere-to-telomere assembly of a complete human X chromosome.端粒到端粒组装完整的人类 X 染色体。
Nature. 2020 Sep;585(7823):79-84. doi: 10.1038/s41586-020-2547-7. Epub 2020 Jul 14.
8
Long-read human genome sequencing and its applications.长读长基因组测序及其应用。
Nat Rev Genet. 2020 Oct;21(10):597-614. doi: 10.1038/s41576-020-0236-x. Epub 2020 Jun 5.
9
Fast and accurate long-read assembly with wtdbg2.使用 wtdbg2 实现快速准确的长读长序列组装。
Nat Methods. 2020 Feb;17(2):155-158. doi: 10.1038/s41592-019-0669-3. Epub 2019 Dec 9.
10
Using Haplotype Information for Conservation Genomics.利用单体型信息进行保护基因组学研究。
Trends Ecol Evol. 2020 Mar;35(3):245-258. doi: 10.1016/j.tree.2019.10.012. Epub 2019 Dec 3.