Obinu Lia, Trivedi Urmi, Porceddu Andrea
Department of Agricultural Sciences, University of Sassari, Sassari, Sardinia, Italy.
Edinburgh Genomics, The University of Edinburgh, Edinburgh, United Kingdom.
Front Bioinform. 2024 Nov 15;4:1462923. doi: 10.3389/fbinf.2024.1462923. eCollection 2024.
The implementation of Hi-C reads in the genome assembly process allows the ordering of large regions of the genome in scaffolds and the generation of chromosome-level assemblies. Several bioinformatics tools have been developed for genome scaffolding with Hi-C, and each tool has advantages and disadvantages that need to be carefully evaluated before their adoption. We generated two assemblies of obtained from the same raw PacBio HiFi and Oxford Nanopore Technologies data. We scaffolded the assemblies implementing Hi-C reads with the scaffolders 3D-DNA, SALSA2, and YaHS, with the aim of identifying the tool providing the most accurate assembly. The scaffolded assemblies were evaluated according to contiguity, completeness, accuracy, and structural correctness. In our analysis, YaHS proved to be the best-performing bioinformatics tool for scaffolding genome assemblies in .
在基因组组装过程中使用Hi-C读取数据,能够对基因组中的大片段区域进行支架排序,并生成染色体水平的组装结果。已经开发了几种用于利用Hi-C进行基因组支架构建的生物信息学工具,每个工具都有其优缺点,在采用之前需要仔细评估。我们从相同的原始PacBio HiFi和牛津纳米孔技术数据生成了两个组装结果。我们使用支架构建工具3D-DNA、SALSA2和YaHS,通过实施Hi-C读取数据来构建组装结果的支架,目的是确定提供最准确组装的工具。根据连续性、完整性、准确性和结构正确性对构建好支架的组装结果进行评估。在我们的分析中,YaHS被证明是在[具体物种或基因组]中构建基因组组装支架的性能最佳的生物信息学工具。