• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

第三代测序数据结构变异检测管道的全面深入评估。

Comprehensive and deep evaluation of structural variation detection pipelines with third-generation sequencing data.

机构信息

Program in Bioinformatics, Zhongshan School of Medicine, The Fifth Affiliated Hospital, Sun Yat-Sen University, Guangzhou, China.

Key Laboratory of Tropical Disease Control (Sun Yat-Sen University), Ministry of Education, Guangzhou, China.

出版信息

Genome Biol. 2024 Jul 15;25(1):188. doi: 10.1186/s13059-024-03324-5.

DOI:10.1186/s13059-024-03324-5
PMID:39010145
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11247875/
Abstract

BACKGROUND

Structural variation (SV) detection methods using third-generation sequencing data are widely employed, yet accurately detecting SVs remains challenging. Different methods often yield inconsistent results for certain SV types, complicating tool selection and revealing biases in detection.

RESULTS

This study comprehensively evaluates 53 SV detection pipelines using simulated and real data from PacBio (CLR: Continuous Long Read, CCS: Circular Consensus Sequencing) and Nanopore (ONT) platforms. We assess their performance in detecting various sizes and types of SVs, breakpoint biases, and genotyping accuracy with various sequencing depths. Notably, pipelines such as Minimap2-cuteSV2, NGMLR-SVIM, PBMM2-pbsv, Winnowmap-Sniffles2, and Winnowmap-SVision exhibit comparatively higher recall and precision. Our findings also show that combining multiple pipelines with the same aligner, like pbmm2 or winnowmap, can significantly enhance performance. The individual pipelines' detailed ranking and performance metrics can be viewed in a dynamic table: http://pmglab.top/SVPipelinesRanking .

CONCLUSIONS

This study comprehensively characterizes the strengths and weaknesses of numerous pipelines, providing valuable insights that can improve SV detection in third-generation sequencing data and inform SV annotation and function prediction.

摘要

背景

使用第三代测序数据的结构变异(SV)检测方法被广泛应用,但准确检测 SV 仍然具有挑战性。不同的方法通常对某些 SV 类型产生不一致的结果,这使得工具选择变得复杂,并揭示了检测中的偏差。

结果

本研究使用 PacBio(CLR:连续长读,CCS:圆形一致测序)和 Nanopore(ONT)平台的模拟和真实数据,全面评估了 53 个 SV 检测管道。我们评估了它们在检测各种大小和类型的 SV、断点偏差以及具有不同测序深度的基因分型准确性方面的性能。值得注意的是,Minimap2-cuteSV2、NGMLR-SVIM、PBMM2-pbsv、Winnowmap-Sniffles2 和 Winnowmap-SVision 等管道表现出相对较高的召回率和精度。我们的研究结果还表明,使用相同的比对器(如 pbmm2 或 winnowmap)结合多个管道可以显著提高性能。可以在动态表中查看各个管道的详细排名和性能指标:http://pmglab.top/SVPipelinesRanking。

结论

本研究全面描述了众多管道的优缺点,为提高第三代测序数据中的 SV 检测提供了有价值的见解,并为 SV 注释和功能预测提供了信息。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b86/11247875/f60e852ec98a/13059_2024_3324_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b86/11247875/bd16651d21d6/13059_2024_3324_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b86/11247875/ac61377804ee/13059_2024_3324_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b86/11247875/e6a75dded161/13059_2024_3324_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b86/11247875/1fc357bd48db/13059_2024_3324_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b86/11247875/26a934f67548/13059_2024_3324_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b86/11247875/f60e852ec98a/13059_2024_3324_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b86/11247875/bd16651d21d6/13059_2024_3324_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b86/11247875/ac61377804ee/13059_2024_3324_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b86/11247875/e6a75dded161/13059_2024_3324_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b86/11247875/1fc357bd48db/13059_2024_3324_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b86/11247875/26a934f67548/13059_2024_3324_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b86/11247875/f60e852ec98a/13059_2024_3324_Fig6_HTML.jpg

相似文献

1
Comprehensive and deep evaluation of structural variation detection pipelines with third-generation sequencing data.第三代测序数据结构变异检测管道的全面深入评估。
Genome Biol. 2024 Jul 15;25(1):188. doi: 10.1186/s13059-024-03324-5.
2
Comparison of structural variants detected by PacBio-CLR and ONT sequencing in pear.梨中 PacBio-CLR 和 ONT 测序检测到的结构变异比较。
BMC Genomics. 2022 Dec 14;23(1):830. doi: 10.1186/s12864-022-09074-7.
3
Evaluating nanopore sequencing data processing pipelines for structural variation identification.评估纳米孔测序数据处理管道进行结构变异识别。
Genome Biol. 2019 Nov 14;20(1):237. doi: 10.1186/s13059-019-1858-1.
4
Benchmarking long-read aligners and SV callers for structural variation detection in Oxford nanopore sequencing data.基于 Oxford nanopore 测序数据的结构变异检测的长读长比对软件和变异调用软件的基准测试。
Sci Rep. 2024 Mar 14;14(1):6160. doi: 10.1038/s41598-024-56604-2.
5
Detection of mosaic and population-level structural variants with Sniffles2.使用 Sniffles2 检测嵌合体和群体水平的结构变异。
Nat Biotechnol. 2024 Oct;42(10):1571-1580. doi: 10.1038/s41587-023-02024-y. Epub 2024 Jan 2.
6
Comparison of multiple algorithms to reliably detect structural variants in pears.比较多种算法以可靠地检测梨中的结构变异。
BMC Genomics. 2020 Jan 20;21(1):61. doi: 10.1186/s12864-020-6455-x.
7
Initial Analysis of Structural Variation Detections in Cattle Using Long-Read Sequencing Methods.利用长读测序方法对牛的结构变异检测进行初步分析。
Genes (Basel). 2022 May 6;13(5):828. doi: 10.3390/genes13050828.
8
HQAlign: aligning nanopore reads for SV detection using current-level modeling.HQAlign:使用电流水平建模对齐纳米孔读取以进行 SV 检测。
Bioinformatics. 2023 Oct 3;39(10). doi: 10.1093/bioinformatics/btad580.
9
Comprehensive assessment of long-read sequencing platforms and calling algorithms for detection of copy number variation.长读测序平台和拷贝数变异检测调用算法的综合评估。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae441.
10
Robust Benchmark Structural Variant Calls of An Asian Using State-of-the-art Long-read Sequencing Technologies.利用最先进的长读测序技术对亚洲个体进行稳健的基准结构变异调用。
Genomics Proteomics Bioinformatics. 2022 Feb;20(1):192-204. doi: 10.1016/j.gpb.2020.10.006. Epub 2021 Mar 2.

引用本文的文献

1
Hidden in the Genome: The First Italian Family with North Carolina Macular Dystrophy Carrying a Novel and Duplication.隐藏于基因组中:首个携带新型重复突变的患北卡罗来纳黄斑营养不良的意大利家族
Biomedicines. 2025 Aug 5;13(8):1904. doi: 10.3390/biomedicines13081904.
2
Long-Read Sequencing and Structural Variant Detection: Unlocking the Hidden Genome in Rare Genetic Disorders.长读长测序与结构变异检测:揭示罕见遗传病中的隐藏基因组
Diagnostics (Basel). 2025 Jul 17;15(14):1803. doi: 10.3390/diagnostics15141803.
3
Accurate, Scalable Structural Variant Genotyping in Complex Genomes at Population Scales.

本文引用的文献

1
Kled: an ultra-fast and sensitive structural variant detection tool for long-read sequencing data.Kled:一种用于长读测序数据的超快速和敏感的结构变异检测工具。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae049.
2
Detection of mosaic and population-level structural variants with Sniffles2.使用 Sniffles2 检测嵌合体和群体水平的结构变异。
Nat Biotechnol. 2024 Oct;42(10):1571-1580. doi: 10.1038/s41587-023-02024-y. Epub 2024 Jan 2.
3
The complete sequence of a human Y chromosome.人类 Y 染色体的完整序列。
群体规模下复杂基因组中准确、可扩展的结构变异基因分型
Mol Biol Evol. 2025 Jul 30;42(8). doi: 10.1093/molbev/msaf180.
4
Plant graph-based pangenomics: techniques, applications, and challenges.基于植物图谱的泛基因组学:技术、应用与挑战。
aBIOTECH. 2025 Mar 28;6(2):361-376. doi: 10.1007/s42994-025-00206-7. eCollection 2025 Jun.
5
Extrachromosomal circular DNA drives dynamic genome plasticity: emerging roles in disease progression and clinical potential.染色体外环状DNA驱动动态基因组可塑性:在疾病进展中的新作用及临床潜力
Theranostics. 2025 May 25;15(13):6387-6411. doi: 10.7150/thno.111765. eCollection 2025.
6
The nature of complex structural variations in tomatoes.番茄复杂结构变异的本质。
Hortic Res. 2025 Apr 16;12(7):uhaf107. doi: 10.1093/hr/uhaf107. eCollection 2025 Jul.
7
Long-Read Sequencing for the Rapid Response to Infectious Diseases Outbreaks.用于传染病爆发快速响应的长读长测序
Curr Clin Microbiol Rep. 2025;12(1):10. doi: 10.1007/s40588-025-00247-y. Epub 2025 May 15.
8
Systematic benchmarking of tools for structural variation detection using short- and long-read sequencing data in pigs.利用猪的短读长和长读长测序数据对结构变异检测工具进行系统基准测试。
iScience. 2025 Feb 8;28(3):111983. doi: 10.1016/j.isci.2025.111983. eCollection 2025 Mar 21.
9
Benchmarking, detection, and genotyping of structural variants in a population of whole-genome assemblies using the SVGAP pipeline.使用SVGAP流程对全基因组组装群体中的结构变异进行基准测试、检测和基因分型。
bioRxiv. 2025 Feb 8:2025.02.07.637096. doi: 10.1101/2025.02.07.637096.
Nature. 2023 Sep;621(7978):344-354. doi: 10.1038/s41586-023-06457-y. Epub 2023 Aug 23.
4
A draft human pangenome reference.人类泛基因组参考草图。
Nature. 2023 May;617(7960):312-324. doi: 10.1038/s41586-023-05896-x. Epub 2023 May 10.
5
Comprehensive de novo mutation discovery with HiFi long-read sequencing.利用 HiFi 长读测序进行全面的从头突变发现。
Genome Med. 2023 May 8;15(1):34. doi: 10.1186/s13073-023-01183-6.
6
Deciphering the exact breakpoints of structural variations using long sequencing reads with DeBreak.使用 DeBreak 对长测序reads 进行分析,以破译结构变异的精确断点。
Nat Commun. 2023 Jan 17;14(1):283. doi: 10.1038/s41467-023-35996-1.
7
Estimation of intrafamilial DNA contamination in family trio genome sequencing using deviation from Mendelian inheritance.利用孟德尔遗传偏差估计家系三代基因组测序中的家族内 DNA 污染。
Genome Res. 2022 Nov-Dec;32(11-12):2134-2144. doi: 10.1101/gr.276794.122. Epub 2022 Dec 6.
8
Truvari: refined structural variant comparison preserves allelic diversity.特鲁瓦里:精细化结构变异比较保留等位基因多样性。
Genome Biol. 2022 Dec 27;23(1):271. doi: 10.1186/s13059-022-02840-6.
9
Construction of a trio-based structural variation panel utilizing activated T lymphocytes and long-read sequencing technology.利用激活的 T 淋巴细胞和长读测序技术构建三碱基结构变异面板。
Commun Biol. 2022 Sep 20;5(1):991. doi: 10.1038/s42003-022-03953-1.
10
SVision: a deep learning approach to resolve complex structural variants.SVision:一种深度学习方法,用于解决复杂的结构变异。
Nat Methods. 2022 Oct;19(10):1230-1233. doi: 10.1038/s41592-022-01609-w. Epub 2022 Sep 16.