Guo Yan, Song Yuwei, Jiang Limin, Chen Yu, Ceccarelli Michele, Gao Min, Chong Zechen
Department of Public Health and Sciences, University of Miami, Miami, FL, USA.
Department of Biomedical Informatics and Data Science, Heersink School of Medicine, University of Alabama, Birmingham, AL, USA.
Nat Protoc. 2025 Mar 26. doi: 10.1038/s41596-025-01149-5.
Long-read sequencing technologies yield extended DNA sequences capable of spanning intricate, repetitive genome regions, thereby facilitating the generation of more precise and comprehensive genome assemblies. However, assembly errors are inevitable owing to inherent genomic complexity and limitations of sequencing technology and assembly algorithms, making assembly evaluation crucial. The genome assembly evaluation tool Inspector presents several advantages over existing long-read de novo assembly evaluation tools, including (1) both reference-free and reference-guided assembly evaluation; (2) the ability to detect both small- and large-scale structural errors; (3) the option of assembly error correction, which can improve the quality value of the original assembly; and (4) the ability to perform haplotype-resolved assembly evaluation. Inspector can provide not only basic contig and alignment statistics, but also the precise locations and types of the different structural errors. These advantages provide a robust framework for long-read assembly evaluation. In this Protocol, we showcase four procedures to demonstrate the different applications of Inspector for long-read assembly evaluation. Inspector software and additional guides can be found at https://github.com/ChongLab/Inspector_protocol.