• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用Inspector基于长读长测序数据评估基因组组装的详细指南。

A detailed guide to assessing genome assembly based on long-read sequencing data using Inspector.

作者信息

Guo Yan, Song Yuwei, Jiang Limin, Chen Yu, Ceccarelli Michele, Gao Min, Chong Zechen

机构信息

Department of Public Health and Sciences, University of Miami, Miami, FL, USA.

Department of Biomedical Informatics and Data Science, Heersink School of Medicine, University of Alabama, Birmingham, AL, USA.

出版信息

Nat Protoc. 2025 Mar 26. doi: 10.1038/s41596-025-01149-5.

DOI:10.1038/s41596-025-01149-5
PMID:40140633
Abstract

Long-read sequencing technologies yield extended DNA sequences capable of spanning intricate, repetitive genome regions, thereby facilitating the generation of more precise and comprehensive genome assemblies. However, assembly errors are inevitable owing to inherent genomic complexity and limitations of sequencing technology and assembly algorithms, making assembly evaluation crucial. The genome assembly evaluation tool Inspector presents several advantages over existing long-read de novo assembly evaluation tools, including (1) both reference-free and reference-guided assembly evaluation; (2) the ability to detect both small- and large-scale structural errors; (3) the option of assembly error correction, which can improve the quality value of the original assembly; and (4) the ability to perform haplotype-resolved assembly evaluation. Inspector can provide not only basic contig and alignment statistics, but also the precise locations and types of the different structural errors. These advantages provide a robust framework for long-read assembly evaluation. In this Protocol, we showcase four procedures to demonstrate the different applications of Inspector for long-read assembly evaluation. Inspector software and additional guides can be found at https://github.com/ChongLab/Inspector_protocol .

摘要

长读长测序技术可产生能够跨越复杂、重复基因组区域的延伸DNA序列,从而有助于生成更精确、更全面的基因组组装。然而,由于基因组固有的复杂性以及测序技术和组装算法的局限性,组装错误不可避免,这使得组装评估至关重要。基因组组装评估工具Inspector与现有的长读长从头组装评估工具相比具有多个优势,包括:(1)无参考和有参考引导的组装评估;(2)检测小规模和大规模结构错误的能力;(3)组装错误校正选项,可提高原始组装的质量值;以及(4)进行单倍型解析组装评估的能力。Inspector不仅可以提供基本的重叠群和比对统计信息,还能提供不同结构错误的精确位置和类型。这些优势为长读长组装评估提供了一个强大的框架。在本方案中,我们展示了四个程序,以演示Inspector在长读长组装评估中的不同应用。Inspector软件和其他指南可在https://github.com/ChongLab/Inspector_protocol上找到。

相似文献

1
A detailed guide to assessing genome assembly based on long-read sequencing data using Inspector.使用Inspector基于长读长测序数据评估基因组组装的详细指南。
Nat Protoc. 2025 Mar 26. doi: 10.1038/s41596-025-01149-5.
2
Accurate long-read de novo assembly evaluation with Inspector.使用 Inspector 进行准确的长读从头组装评估。
Genome Biol. 2021 Nov 14;22(1):312. doi: 10.1186/s13059-021-02527-4.
3
ntLink: A Toolkit for De Novo Genome Assembly Scaffolding and Mapping Using Long Reads.ntLink:一种使用长读长进行从头基因组组装支架和映射的工具包。
Curr Protoc. 2023 Apr;3(4):e733. doi: 10.1002/cpz1.733.
4
Software for pre-processing Illumina next-generation sequencing short read sequences.用于预处理Illumina下一代测序短读序列的软件。
Source Code Biol Med. 2014 May 3;9:8. doi: 10.1186/1751-0473-9-8. eCollection 2014.
5
Evaluating long-read de novo assembly tools for eukaryotic genomes: insights and considerations.评估真核生物基因组的长读长从头组装工具:见解与考虑。
Gigascience. 2022 Dec 28;12. doi: 10.1093/gigascience/giad100. Epub 2023 Nov 24.
6
Comparison of ONT and CCS sequencing technologies on the polyploid genome of a medicinal plant showed that high error rate of ONT reads are not suitable for self-correction.对一种药用植物多倍体基因组上的纳米孔测序(ONT)技术和环形一致序列(CCS)测序技术进行比较后发现,ONT读数的高错误率不适用于自我校正。
Chin Med. 2022 Aug 9;17(1):94. doi: 10.1186/s13020-022-00644-1.
7
Kastor: a reference-based comparative approach for assessment and correction of gene-fragmenting errors in long-read assemblies of small genomes.卡斯特:一种基于参考的比较方法,用于评估和纠正小基因组长读长组装中的基因片段化错误。
BMC Genomics. 2025 Apr 18;26(1):388. doi: 10.1186/s12864-025-11569-y.
8
ntEdit+Sealer: Efficient Targeted Error Resolution and Automated Finishing of Long-Read Genome Assemblies.ntEdit+Sealer:高效靶向纠错与长读长基因组组装自动化封端。
Curr Protoc. 2022 May;2(5):e442. doi: 10.1002/cpz1.442.
9
BlockPolish: accurate polishing of long-read assembly via block divide-and-conquer.BlockPolish:通过块划分与征服实现长读序列组装的精确抛光。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab405.
10
Lerna: transformer architectures for configuring error correction tools for short- and long-read genome sequencing.Lerna:用于配置短读和长读基因组测序错误纠正工具的变压器架构。
BMC Bioinformatics. 2022 Jan 6;23(1):25. doi: 10.1186/s12859-021-04547-0.

本文引用的文献

1
Unveiling microbial diversity: harnessing long-read sequencing technology.揭示微生物多样性:利用长读长测序技术
Nat Methods. 2024 Jun;21(6):954-966. doi: 10.1038/s41592-024-02262-1. Epub 2024 Apr 30.
2
Sequencing and characterizing short tandem repeats in the human genome.对人类基因组中的短串联重复序列进行测序和特征分析。
Nat Rev Genet. 2024 Jul;25(7):460-475. doi: 10.1038/s41576-024-00692-3. Epub 2024 Feb 16.
3
A pangenome reference of 36 Chinese populations.36 个中国人群的泛基因组参考图谱。
Nature. 2023 Jul;619(7968):112-121. doi: 10.1038/s41586-023-06173-7. Epub 2023 Jun 14.
4
Deciphering the exact breakpoints of structural variations using long sequencing reads with DeBreak.使用 DeBreak 对长测序reads 进行分析,以破译结构变异的精确断点。
Nat Commun. 2023 Jan 17;14(1):283. doi: 10.1038/s41467-023-35996-1.
5
Method of the year: long-read sequencing.年度方法:长读长测序。
Nat Methods. 2023 Jan;20(1):6-11. doi: 10.1038/s41592-022-01730-w.
6
Gene Fusion Detection and Characterization in Long-Read Cancer Transcriptome Sequencing Data with FusionSeeker.利用 FusionSeeker 在长读癌症转录组测序数据中检测和描述基因融合。
Cancer Res. 2023 Jan 4;83(1):28-33. doi: 10.1158/0008-5472.CAN-22-1628.
7
The complete sequence of a human genome.人类基因组的完整序列。
Science. 2022 Apr;376(6588):44-53. doi: 10.1126/science.abj6987. Epub 2022 Mar 31.
8
Haplotype-resolved assembly of diploid genomes without parental data.单体型解析组装二倍体基因组,无需父母本数据。
Nat Biotechnol. 2022 Sep;40(9):1332-1335. doi: 10.1038/s41587-022-01261-x. Epub 2022 Mar 24.
9
Accurate long-read de novo assembly evaluation with Inspector.使用 Inspector 进行准确的长读从头组装评估。
Genome Biol. 2021 Nov 14;22(1):312. doi: 10.1186/s13059-021-02527-4.
10
Nanopore sequencing technology, bioinformatics and applications.纳米孔测序技术、生物信息学及其应用。
Nat Biotechnol. 2021 Nov;39(11):1348-1365. doi: 10.1038/s41587-021-01108-x. Epub 2021 Nov 8.