• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于长读测序的复杂基因组组装。

Complex genome assembly based on long-read sequencing.

机构信息

College of Information and Computer Engineering, Northeast Forestry University, Harbin, 150040, China.

出版信息

Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac305.

DOI:10.1093/bib/bbac305
PMID:35940845
Abstract

High-quality genome chromosome-scale sequences provide an important basis for genomics downstream analysis, especially the construction of haplotype-resolved and complete genomes, which plays a key role in genome annotation, mutation detection, evolutionary analysis, gene function research, comparative genomics and other aspects. However, genome-wide short-read sequencing is difficult to produce a complete genome in the face of a complex genome with high duplication and multiple heterozygosity. The emergence of long-read sequencing technology has greatly improved the integrity of complex genome assembly. We review a variety of computational methods for complex genome assembly and describe in detail the theories, innovations and shortcomings of collapsed, semi-collapsed and uncollapsed assemblers based on long reads. Among the three methods, uncollapsed assembly is the most correct and complete way to represent genomes. In addition, genome assembly is closely related to haplotype reconstruction, that is uncollapsed assembly realizes haplotype reconstruction, and haplotype reconstruction promotes uncollapsed assembly. We hope that gapless, telomere-to-telomere and accurate assembly of complex genomes can be truly routinely achieved using only a simple process or a single tool in the future.

摘要

高质量的基因组染色体级别的序列为基因组学下游分析提供了重要基础,特别是单倍型解析和完整基因组的构建,这对基因组注释、突变检测、进化分析、基因功能研究、比较基因组学等方面都起着关键作用。然而,全基因组短读测序在面对具有高重复和多种杂合性的复杂基因组时,很难产生完整的基因组。长读测序技术的出现极大地提高了复杂基因组组装的完整性。我们综述了多种用于复杂基因组组装的计算方法,并详细描述了基于长读的崩溃、半崩溃和未崩溃组装器的理论、创新和缺点。在这三种方法中,未崩溃组装是最正确和完整的代表基因组的方法。此外,基因组组装与单倍型重构密切相关,即未崩溃组装实现了单倍型重构,而单倍型重构促进了未崩溃组装。我们希望未来仅通过简单的过程或单一工具,就能真正常规地实现复杂基因组的无缺口、端到端和精确组装。

相似文献

1
Complex genome assembly based on long-read sequencing.基于长读测序的复杂基因组组装。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac305.
2
Comparison of Long-Read Methods for Sequencing and Assembly of Lepidopteran Pest Genomes.鳞翅目害虫基因组测序和组装的长读方法比较。
Int J Mol Sci. 2022 Dec 30;24(1):649. doi: 10.3390/ijms24010649.
3
phasebook: haplotype-aware de novo assembly of diploid genomes from long reads.相位图:基于长读长的二倍体基因组单体型感知从头组装
Genome Biol. 2021 Oct 27;22(1):299. doi: 10.1186/s13059-021-02512-x.
4
Overcoming uncollapsed haplotypes in long-read assemblies of non-model organisms.克服非模式生物长读长组装中的未折叠单倍型。
BMC Bioinformatics. 2021 Jun 5;22(1):303. doi: 10.1186/s12859-021-04118-3.
5
Completion of draft bacterial genomes by long-read sequencing of synthetic genomic pools.通过合成基因组文库的长读长测序完成细菌基因组草图
BMC Genomics. 2020 Jul 29;21(1):519. doi: 10.1186/s12864-020-06910-6.
6
Telomere-to-telomere assembly of diploid chromosomes with Verkko.利用 Verkko 进行二倍体染色体的端粒到端粒组装。
Nat Biotechnol. 2023 Oct;41(10):1474-1482. doi: 10.1038/s41587-023-01662-6. Epub 2023 Feb 16.
7
Computational methods for chromosome-scale haplotype reconstruction.染色体级别的单倍型重构的计算方法。
Genome Biol. 2021 Apr 12;22(1):101. doi: 10.1186/s13059-021-02328-9.
8
Recent Advances in Assembly of Complex Plant Genomes.复杂植物基因组组装的最新进展。
Genomics Proteomics Bioinformatics. 2023 Jun;21(3):427-439. doi: 10.1016/j.gpb.2023.04.004. Epub 2023 Apr 25.
9
Single-Molecule Real-Time Sequencing Combined with Optical Mapping Yields Completely Finished Fungal Genome.单分子实时测序结合光学图谱生成完全完成的真菌基因组
mBio. 2015 Aug 18;6(4):e00936-15. doi: 10.1128/mBio.00936-15.
10
A practical assembly guideline for genomes with various levels of heterozygosity.具有不同杂合度基因组的实用组装指南。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad337.

引用本文的文献

1
Next-generation sequencing applications in food science: fundamentals and recent advances.下一代测序技术在食品科学中的应用:基础与最新进展
Front Bioeng Biotechnol. 2025 Aug 20;13:1638957. doi: 10.3389/fbioe.2025.1638957. eCollection 2025.
2
Benchmarking of bioinformatics tools for the hybrid assembly of human and non-human whole-genome sequencing data.用于人类和非人类全基因组测序数据混合组装的生物信息学工具的基准测试。
Comput Struct Biotechnol J. 2025 Jul 13;27:3099-3109. doi: 10.1016/j.csbj.2025.07.020. eCollection 2025.
3
GTasm: a genome assembly method using graph transformers and HiFi reads.
GTasm:一种使用图变换器和高保真读段的基因组组装方法。
Front Genet. 2024 Oct 25;15:1495657. doi: 10.3389/fgene.2024.1495657. eCollection 2024.
4
Thalassemia screening by third-generation sequencing: Pilot study in a Thai population.通过第三代测序进行地中海贫血筛查:泰国人群的试点研究。
Obstet Med. 2024 Jun;17(2):101-107. doi: 10.1177/1753495X231207676. Epub 2023 Oct 26.
5
Technology-enabled great leap in deciphering plant genomes.技术助力植物基因组破译实现巨大飞跃。
Nat Plants. 2024 Apr;10(4):551-566. doi: 10.1038/s41477-024-01655-6. Epub 2024 Mar 20.
6
Understanding blaNDM-1 gene regulation in CRKP infections: toward novel antimicrobial strategies for hospital-acquired pneumonia.了解 blaNDM-1 基因在碳青霉烯类耐药肺炎克雷伯菌感染中的调控:寻找医院获得性肺炎新型抗菌策略。
Mol Med. 2024 Feb 23;30(1):29. doi: 10.1186/s10020-024-00794-y.
7
The haplotype-resolved telomere-to-telomere carnation () genome reveals the correlation between genome architecture and gene expression.单倍型解析的端粒到端粒康乃馨()基因组揭示了基因组结构与基因表达之间的相关性。
Hortic Res. 2023 Nov 27;11(1):uhad244. doi: 10.1093/hr/uhad244. eCollection 2024 Jan.
8
Deciphering the roles of unknown/uncharacterized genes in plant development and stress responses.解析未知/未表征基因在植物发育和胁迫响应中的作用。
Front Plant Sci. 2023 Nov 23;14:1276559. doi: 10.3389/fpls.2023.1276559. eCollection 2023.
9
Recent progress in the genotyping of bovine tuberculosis and its rapid diagnosis nanoparticle-based electrochemical biosensors.牛结核病基因分型及其基于纳米颗粒的电化学生物传感器快速诊断的最新进展。
RSC Adv. 2023 Oct 30;13(45):31795-31810. doi: 10.1039/d3ra05606f. eCollection 2023 Oct 26.
10
Key , , and Genes Involved in the Fatty Acid Synthesis in Flax Identified Based on Genomic and Transcriptomic Data.基于基因组和转录组数据鉴定亚麻中参与脂肪酸合成的关键基因。
Int J Mol Sci. 2023 Oct 4;24(19):14885. doi: 10.3390/ijms241914885.