• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大规模全基因组序列数据结构变异调用器的比较。

Comparison of structural variant callers for massive whole-genome sequence data.

机构信息

Korea Bioinformation Center (KOBIC), Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea.

Aging Convergence Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea.

出版信息

BMC Genomics. 2024 Mar 28;25(1):318. doi: 10.1186/s12864-024-10239-9.

DOI:10.1186/s12864-024-10239-9
PMID:38549092
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10976732/
Abstract

BACKGROUND

Detecting structural variations (SVs) at the population level using next-generation sequencing (NGS) requires substantial computational resources and processing time. Here, we compared the performances of 11 SV callers: Delly, Manta, GridSS, Wham, Sniffles, Lumpy, SvABA, Canvas, CNVnator, MELT, and INSurVeyor. These SV callers have been recently published and have been widely employed for processing massive whole-genome sequencing datasets. We evaluated the accuracy, sequence depth, running time, and memory usage of the SV callers.

RESULTS

Notably, several callers exhibited better calling performance for deletions than for duplications, inversions, and insertions. Among the SV callers, Manta identified deletion SVs with better performance and efficient computing resources, and both Manta and MELT demonstrated relatively good precision regarding calling insertions. We confirmed that the copy number variation callers, Canvas and CNVnator, exhibited better performance in identifying long duplications as they employ the read-depth approach. Finally, we also verified the genotypes inferred from each SV caller using a phased long-read assembly dataset, and Manta showed the highest concordance in terms of the deletions and insertions.

CONCLUSIONS

Our findings provide a comprehensive understanding of the accuracy and computational efficiency of SV callers, thereby facilitating integrative analysis of SV profiles in diverse large-scale genomic datasets.

摘要

背景

使用下一代测序(NGS)在人群水平上检测结构变异(SV)需要大量的计算资源和处理时间。在这里,我们比较了 11 种 SV 调用者的性能:Delly、Manta、GridSS、Wham、Sniffles、Lumpy、SvABA、Canvas、CNVnator、MELT 和 INSurVeyor。这些 SV 调用者最近已经发表,并被广泛用于处理大规模全基因组测序数据集。我们评估了 SV 调用者的准确性、序列深度、运行时间和内存使用情况。

结果

值得注意的是,一些调用者在缺失调用方面的性能优于重复、反转和插入。在 SV 调用者中,Manta 在删除 SV 方面表现出更好的性能和高效的计算资源,Manta 和 MELT 在插入调用方面都表现出相对较好的精度。我们证实,使用读深度方法的拷贝数变异调用者 Canvas 和 CNVnator 在识别长重复方面表现出更好的性能。最后,我们还使用相位长读组装数据集验证了每个 SV 调用者推断的基因型,Manta 在删除和插入方面表现出最高的一致性。

结论

我们的研究结果提供了对 SV 调用者准确性和计算效率的全面了解,从而促进了对不同大规模基因组数据集 SV 谱的综合分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9809/10976732/19b34cde52b7/12864_2024_10239_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9809/10976732/dc61738eb4bf/12864_2024_10239_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9809/10976732/38f0c298b640/12864_2024_10239_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9809/10976732/213a92dbca5d/12864_2024_10239_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9809/10976732/dd82ef218b15/12864_2024_10239_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9809/10976732/ebb5d0ef869b/12864_2024_10239_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9809/10976732/19b34cde52b7/12864_2024_10239_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9809/10976732/dc61738eb4bf/12864_2024_10239_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9809/10976732/38f0c298b640/12864_2024_10239_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9809/10976732/213a92dbca5d/12864_2024_10239_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9809/10976732/dd82ef218b15/12864_2024_10239_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9809/10976732/ebb5d0ef869b/12864_2024_10239_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9809/10976732/19b34cde52b7/12864_2024_10239_Fig6_HTML.jpg

相似文献

1
Comparison of structural variant callers for massive whole-genome sequence data.大规模全基因组序列数据结构变异调用器的比较。
BMC Genomics. 2024 Mar 28;25(1):318. doi: 10.1186/s12864-024-10239-9.
2
Comprehensive evaluation and guidance of structural variation detection tools in chicken whole genome sequence data.鸡全基因组序列数据中结构变异检测工具的综合评估和指导
BMC Genomics. 2024 Oct 16;25(1):970. doi: 10.1186/s12864-024-10875-1.
3
A Comparison of Structural Variant Calling from Short-Read and Nanopore-Based Whole-Genome Sequencing Using Optical Genome Mapping as a Benchmark.基于光学基因组图谱作为基准的短读长和纳米孔全基因组测序的结构变异调用比较。
Genes (Basel). 2024 Jul 16;15(7):925. doi: 10.3390/genes15070925.
4
A large structural variant collection in Holstein cattle and associated database for variant discovery, characterization, and application.荷斯坦牛大型结构变异组库及相关数据库的建立,用于变异的发现、鉴定和应用。
BMC Genomics. 2024 Sep 30;25(1):903. doi: 10.1186/s12864-024-10812-2.
5
Benchmarking long-read aligners and SV callers for structural variation detection in Oxford nanopore sequencing data.基于 Oxford nanopore 测序数据的结构变异检测的长读长比对软件和变异调用软件的基准测试。
Sci Rep. 2024 Mar 14;14(1):6160. doi: 10.1038/s41598-024-56604-2.
6
VISTA: an integrated framework for structural variant discovery.VISTA:一个用于结构变异发现的集成框架。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae462.
7
Benchmarking of structural variant detection in the tetraploid potato genome using linked-read sequencing.利用连锁读取测序对四倍体马铃薯基因组中的结构变异进行基准测试。
Genomics. 2023 Mar;115(2):110568. doi: 10.1016/j.ygeno.2023.110568. Epub 2023 Jan 23.
8
Systematic evaluation of multiple NGS platforms for structural variants detection.系统评估多种用于结构变异检测的 NGS 平台。
J Biol Chem. 2023 Dec;299(12):105436. doi: 10.1016/j.jbc.2023.105436. Epub 2023 Nov 7.
9
Comprehensive evaluation of structural variation detection algorithms for whole genome sequencing.全基因组测序结构变异检测算法的综合评估。
Genome Biol. 2019 Jun 3;20(1):117. doi: 10.1186/s13059-019-1720-5.
10
Expectations and blind spots for structural variation detection from long-read assemblies and short-read genome sequencing technologies.从长读序列组装和短读基因组测序技术中检测结构变异的预期和盲点。
Am J Hum Genet. 2021 May 6;108(5):919-928. doi: 10.1016/j.ajhg.2021.03.014. Epub 2021 Mar 30.

引用本文的文献

1
Visual Inspection of Sequencing Data for Diagnosis: Practical Guide to Structural Variant Analysis Using Integrative Genomics Viewer (IGV).用于诊断的测序数据可视化检查:使用综合基因组学浏览器(IGV)进行结构变异分析实用指南
Methods Mol Biol. 2025;2968:263-279. doi: 10.1007/978-1-0716-4750-9_15.
2
Structural Variants: Mechanisms, Mapping, and Interpretation in Human Genetics.结构变异:人类遗传学中的机制、定位与解读
Genes (Basel). 2025 Jul 29;16(8):905. doi: 10.3390/genes16080905.
3
A Hitchhiker Guide to Structural Variant Calling: A Comprehensive Benchmark Through Different Sequencing Technologies.

本文引用的文献

1
INSurVeyor: improving insertion calling from short read sequencing data.INSurVeyor:从短读测序数据中提高插入片段调用。
Nat Commun. 2023 Jun 5;14(1):3243. doi: 10.1038/s41467-023-38870-2.
2
Introduction of the Korea BioData Station (K-BDS) for sharing biological data.韩国生物数据站(K-BDS)介绍:用于生物数据共享
Genomics Inform. 2023 Mar;21(1):e12. doi: 10.5808/gi.22073. Epub 2023 Mar 31.
3
Long-read mapping to repetitive reference sequences using Winnowmap2.使用Winnowmap2将长读段映射到重复参考序列。
结构变异检测指南:通过不同测序技术的全面基准测试
Biomedicines. 2025 Aug 9;13(8):1949. doi: 10.3390/biomedicines13081949.
4
Comparative analysis of hybrid-SNP microarray and nanopore sequencing for detection of large-sized copy number variants in the human genome.杂交-SNP微阵列与纳米孔测序用于检测人类基因组中大片段拷贝数变异的比较分析
Mol Cytogenet. 2025 Jul 23;18(1):18. doi: 10.1186/s13039-025-00721-8.
5
SVCROWS: A User-Defined Tool for Interpreting Significant Structural Variants in Heterogeneous Datasets.SVCROWS:一种用于解释异构数据集中显著结构变异的用户定义工具。
bioRxiv. 2025 May 19:2025.01.24.634734. doi: 10.1101/2025.01.24.634734.
6
Artificial intelligence in variant calling: a review.变异检测中的人工智能:综述
Front Bioinform. 2025 Apr 23;5:1574359. doi: 10.3389/fbinf.2025.1574359. eCollection 2025.
7
Comparisons of performances of structural variants detection algorithms in solitary or combination strategy.结构变异检测算法在单独或联合策略下的性能比较。
PLoS One. 2025 Feb 6;20(2):e0314982. doi: 10.1371/journal.pone.0314982. eCollection 2025.
8
A large structural variant collection in Holstein cattle and associated database for variant discovery, characterization, and application.荷斯坦牛大型结构变异组库及相关数据库的建立,用于变异的发现、鉴定和应用。
BMC Genomics. 2024 Sep 30;25(1):903. doi: 10.1186/s12864-024-10812-2.
Nat Methods. 2022 Jun;19(6):705-710. doi: 10.1038/s41592-022-01457-8. Epub 2022 Apr 1.
4
The complete sequence of a human genome.人类基因组的完整序列。
Science. 2022 Apr;376(6588):44-53. doi: 10.1126/science.abj6987. Epub 2022 Mar 31.
5
Application of long-read sequencing to the detection of structural variants in human cancer genomes.长读长测序在人类癌症基因组结构变异检测中的应用。
Comput Struct Biotechnol J. 2021 Jul 28;19:4207-4216. doi: 10.1016/j.csbj.2021.07.030. eCollection 2021.
6
GRIDSS2: comprehensive characterisation of somatic structural variation using single breakend variants and structural variant phasing.GRIDSS2:利用单断点变体和结构变异相位进行体细胞结构变异的全面特征描述。
Genome Biol. 2021 Jul 12;22(1):202. doi: 10.1186/s13059-021-02423-x.
7
Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm.使用带有 hifiasm 的相定装配图进行单体型解析从头组装。
Nat Methods. 2021 Feb;18(2):170-175. doi: 10.1038/s41592-020-01056-5. Epub 2021 Feb 1.
8
The structural variation landscape in 492 Atlantic salmon genomes.492 份大西洋鲑基因组中的结构变异景观。
Nat Commun. 2020 Oct 14;11(1):5176. doi: 10.1038/s41467-020-18972-x.
9
Effect of sequence depth and length in long-read assembly of the maize inbred NC358.长读长序列深度和长度对玉米自交系 NC358 组装的影响。
Nat Commun. 2020 May 8;11(1):2288. doi: 10.1038/s41467-020-16037-7.
10
Detection of somatic structural variants from short-read next-generation sequencing data.从短读下一代测序数据中检测体细胞结构变体。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa056.