HaploMerger2: rebuilding both haploid sub-assemblies from high-heterozygosity diploid genome assembly.

Authors

Huang Shengfeng, Kang Mingjing, Xu Anlong

Affiliation

State Key Laboratory of Biocontrol, Guangdong Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510275, People's Republic of China.

Publication

Bioinformatics. 2017 Aug 15;33(16):2577-2579. doi: 10.1093/bioinformatics/btx220.

DOI: 10.1093/bioinformatics/btx220
PMID: 28407147
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5870766/
Abstract

SUMMARY

De novo assembly is a difficult issue for heterozygous diploid genomes. The advent of high-throughput short-read and long-read sequencing technologies provides both new challenges and potential solutions to the issue. Here, we present HaploMerger2 (HM2), an automated pipeline for rebuilding both haploid sub-assemblies from the polymorphic diploid genome assembly. It is designed to work on pre-existing diploid assemblies, which are typically created by using de novo assemblers. HM2 can process any diploid assemblies, but it is especially suitable for diploid assemblies with high heterozygosity (≥3%), which can be difficult for other tools. This pipeline also implements flexible and sensitive assembly error detection, a hierarchical scaffolding procedure and a reliable gap-closing method for haploid sub-assemblies. Using HM2, we demonstrate that two haploid sub-assemblies reconstructed from a real, highly-polymorphic diploid assembly show greatly improved continuity.
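
The summary reports improved continuity but does not name the metric. A common way to summarize assembly continuity is N50: the length L such that sequences of length L or longer cover at least half of the total assembly. The sketch below is only an illustration of that statistic, not part of HM2; the function name `n50` and the scaffold lengths are hypothetical, chosen so that the sub-assembly is about half the size of the diploid assembly.

```python
from typing import Iterable


def n50(lengths: Iterable[int]) -> int:
    """Return N50: the length L such that sequences of length >= L
    together cover at least half of the total assembly length."""
    ordered = sorted(lengths, reverse=True)
    total = sum(ordered)
    if total == 0:
        raise ValueError("need at least one sequence of non-zero length")
    running = 0
    for length in ordered:
        running += length
        # Compare doubled running sum to avoid fractional halves.
        if 2 * running >= total:
            return length
    raise AssertionError("unreachable: loop always returns for total > 0")


# Hypothetical scaffold lengths (arbitrary units), for illustration only.
diploid_assembly = [40, 30, 30, 20, 20, 10, 10, 10, 10, 10, 10]  # total 200
haploid_sub_assembly = [50, 30, 20]                              # total 100

print("diploid assembly N50:", n50(diploid_assembly))
print("haploid sub-assembly N50:", n50(haploid_sub_assembly))
```

With these made-up inputs the fragmented diploid assembly has a lower N50 than the merged sub-assembly, which is the kind of difference a continuity comparison would look for.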

AVAILABILITY AND IMPLEMENTATION

Source code, executables and the testing dataset are freely available at https://github.com/mapleforest/HaploMerger2/releases/.

CONTACT

hshengf2@mail.sysu.edu.cn.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4352/5870766/57b18c8aff51/btx220f1.jpg

Similar articles

1
HaploMerger2: rebuilding both haploid sub-assemblies from high-heterozygosity diploid genome assembly.
Bioinformatics. 2017 Aug 15;33(16):2577-2579. doi: 10.1093/bioinformatics/btx220.
2
Purge Haplotigs: allelic contig reassignment for third-gen diploid genome assemblies.
BMC Bioinformatics. 2018 Nov 29;19(1):460. doi: 10.1186/s12859-018-2485-7.
3
A comparative analysis of methods for de novo assembly of hymenopteran genomes using either haploid or diploid samples.
Sci Rep. 2019 Apr 24;9(1):6480. doi: 10.1038/s41598-019-42795-6.
4
SpLitteR: diploid genome assembly using TELL-Seq linked-reads and assembly graphs.
PeerJ. 2024 Sep 27;12:e18050. doi: 10.7717/peerj.18050. eCollection 2024.
5
A graph-based approach to diploid genome assembly.
Bioinformatics. 2018 Jul 1;34(13):i105-i114. doi: 10.1093/bioinformatics/bty279.
6
Overcoming uncollapsed haplotypes in long-read assemblies of non-model organisms.
BMC Bioinformatics. 2021 Jun 5;22(1):303. doi: 10.1186/s12859-021-04118-3.
7
phasebook: haplotype-aware de novo assembly of diploid genomes from long reads.
Genome Biol. 2021 Oct 27;22(1):299. doi: 10.1186/s13059-021-02512-x.
8
ntLink: A Toolkit for De Novo Genome Assembly Scaffolding and Mapping Using Long Reads.
Curr Protoc. 2023 Apr;3(4):e733. doi: 10.1002/cpz1.733.
9
HaploMerger: reconstructing allelic relationships for polymorphic diploid genome assemblies.
Genome Res. 2012 Aug;22(8):1581-8. doi: 10.1101/gr.133652.111. Epub 2012 May 3.
10
Redundans: an assembly pipeline for highly heterozygous genomes.
Nucleic Acids Res. 2016 Jul 8;44(12):e113. doi: 10.1093/nar/gkw294. Epub 2016 Apr 29.

Cited by

1
The draft genome sequences of the cosmopolitan centric diatom, the genus Skeletonema.
Sci Data. 2025 Aug 5;12(1):1358. doi: 10.1038/s41597-025-05432-8.
2
A reference genome enhances the power to detect signatures of recent anthropogenic impact in genomic data: a lesson learned from a stag beetle system.
BMC Biol. 2025 Jul 9;23(1):205. doi: 10.1186/s12915-025-02307-7.
3
Genomics Research on the Road of Studying Biology and Virulence of Cereal Rust Fungi.
Mol Plant Pathol. 2025 Apr;26(4):e70082. doi: 10.1111/mpp.70082.
4
FHY3/FAR1 transposable elements generate adaptive genetic variation in the Bassia scoparia genome.
Pest Manag Sci. 2025 Apr 1. doi: 10.1002/ps.8798.
5
Multiple Displacement Amplification Facilitates SMRT Sequencing of Microscopic Animals and the Genome of the Gastrotrich Lepidodermella squamata (Dujardin 1841).
Genome Biol Evol. 2024 Dec 4;16(12). doi: 10.1093/gbe/evae254.
6
Collinearity-based Assembly Correction Tool GUI: Software for collinearity-based genome assembly correction.
G3 (Bethesda). 2025 Feb 5;15(2). doi: 10.1093/g3journal/jkae277.
7
Genome Assembly of the Nematode Rhabditoides Inermis From a Complex Microbial Community.
Genome Biol Evol. 2024 Nov 1;16(11). doi: 10.1093/gbe/evae230.
8
Chromosome-level genome assembly and annotation of the skinnycheek lanternfish Benthosema pterotum.
Sci Data. 2024 Oct 30;11(1):1178. doi: 10.1038/s41597-024-04039-9.
9
Chromosome-level genome assembly from a single planthopper Nilaparvata muiri (Hemiptera: Delphacidae).
Sci Data. 2024 Aug 28;11(1):937. doi: 10.1038/s41597-024-03812-0.
10
Pan-genome and phylogenomic analyses highlight Hevea species delineation and rubber trait evolution.
Nat Commun. 2024 Aug 22;15(1):7232. doi: 10.1038/s41467-024-51031-3.

References

1
MECAT: fast mapping, error correction, and de novo assembly for single-molecule sequencing reads.
Nat Methods. 2017 Nov;14(11):1072-1074. doi: 10.1038/nmeth.4432. Epub 2017 Sep 18.
2
Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation.
Genome Res. 2017 May;27(5):722-736. doi: 10.1101/gr.215087.116. Epub 2017 Mar 15.
3
Phased diploid genome assembly with single-molecule real-time sequencing.
Nat Methods. 2016 Dec;13(12):1050-1054. doi: 10.1038/nmeth.4035. Epub 2016 Oct 17.
4
Redundans: an assembly pipeline for highly heterozygous genomes.
Nucleic Acids Res. 2016 Jul 8;44(12):e113. doi: 10.1093/nar/gkw294. Epub 2016 Apr 29.
5
Assembling large genomes with single-molecule sequencing and locality-sensitive hashing.
Nat Biotechnol. 2015 Jun;33(6):623-30. doi: 10.1038/nbt.3238. Epub 2015 May 25.
6
dipSPAdes: Assembler for Highly Polymorphic Diploid Genomes.
J Comput Biol. 2015 Jun;22(6):528-45. doi: 10.1089/cmb.2014.0153. Epub 2015 Mar 3.
7
The genome and transcriptome of the zoonotic hookworm Ancylostoma ceylanicum identify infection-specific gene families.
Nat Genet. 2015 Apr;47(4):416-22. doi: 10.1038/ng.3237. Epub 2015 Mar 2.
8
Decelerated genome evolution in modern vertebrates revealed by analysis of multiple lancelet genomes.
Nat Commun. 2014 Dec 19;5:5896. doi: 10.1038/ncomms6896.
9
Efficient de novo assembly of highly heterozygous genomes from whole-genome shotgun short reads.
Genome Res. 2014 Aug;24(8):1384-95. doi: 10.1101/gr.170720.113. Epub 2014 Apr 22.
10
Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data.
Nat Methods. 2013 Jun;10(6):563-9. doi: 10.1038/nmeth.2474. Epub 2013 May 5.