• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ASVBM:通过对多个数据集进行局部联合分析的结构变异基准测试

ASVBM: Structural variant benchmarking with local joint analysis for multiple callsets.

作者信息

Mu Peizheng, Feng Xiangyan, Tong Lanxin, Huang Jie, Zhu Chaoqun, Wang Fei, Quan Wei, Ma Yuanjun, Dong Yucui, Zhu Xiao

机构信息

School of Computer and Control Engineering, Yantai University, Yantai, Shandong 264005, China.

Department of Hematology, Yantai Yuhuangding Hospital Affiliated to Qingdao University, Yantai, Shandong 264009, China.

出版信息

Comput Struct Biotechnol J. 2025 Jun 29;27:2851-2862. doi: 10.1016/j.csbj.2025.06.045. eCollection 2025.

DOI:10.1016/j.csbj.2025.06.045
PMID:40687987
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12271604/
Abstract

Accurate benchmarking of structural variant (SV) detection is essential for advancing the development and application of human whole-genome sequencing (WGS). A fundamental challenge in benchmarking SV detection results is determining whether two SVs represent the same event. Differences in the variation-awareness and strategic implementation of aligners inherently constrain SV detection algorithms that rely on alignment-based approaches. Traditional benchmarking, which primarily focuses on comparing and matching individual variants, makes it difficult to capture the relationships between multiple adjacent variants. We introduced ASVBM, an improved benchmarking framework that introduces the notion of latent positives and leverages a joint analysis and validation strategy based on local variants. This performance improvement arose from the discovery that multiple smaller variants are nearly equivalent to a larger variant. We comprehensively evaluated the performance of six state-of-the-art variant calling pipelines using real WGS datasets. According to multiple matching criteria, ASVBM employs a joint analysis strategy to uncover potential equivalences between the callset and the benchmark set, thereby reducing false mismatches caused by differences in variant representation. ASVBM is available at https://github.com/zhuxiao/asvbm.

摘要

准确的结构变异(SV)检测基准对于推动人类全基因组测序(WGS)的发展和应用至关重要。基准化SV检测结果的一个基本挑战是确定两个SV是否代表同一事件。比对器在变异感知和策略实施方面的差异本质上限制了依赖基于比对方法的SV检测算法。传统的基准测试主要侧重于比较和匹配单个变异,难以捕捉多个相邻变异之间的关系。我们引入了ASVBM,这是一个改进的基准测试框架,引入了潜在阳性的概念,并利用基于局部变异的联合分析和验证策略。这种性能提升源于发现多个较小的变异几乎等同于一个较大的变异。我们使用真实的WGS数据集全面评估了六种最先进的变异调用流程的性能。根据多个匹配标准,ASVBM采用联合分析策略来揭示调用集和基准集之间的潜在等效性,从而减少由变异表示差异导致的错误错配。ASVBM可在https://github.com/zhuxiao/asvbm获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fad/12271604/9b00ee75b8d7/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fad/12271604/95d51a1199d0/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fad/12271604/f5de379cb210/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fad/12271604/cb3110fa0dc0/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fad/12271604/9b00ee75b8d7/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fad/12271604/95d51a1199d0/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fad/12271604/f5de379cb210/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fad/12271604/cb3110fa0dc0/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fad/12271604/9b00ee75b8d7/gr3.jpg

相似文献

1
ASVBM: Structural variant benchmarking with local joint analysis for multiple callsets.ASVBM:通过对多个数据集进行局部联合分析的结构变异基准测试
Comput Struct Biotechnol J. 2025 Jun 29;27:2851-2862. doi: 10.1016/j.csbj.2025.06.045. eCollection 2025.
2
NCBench: providing an open, reproducible, transparent, adaptable, and continuous benchmark approach for DNA-sequencing-based variant calling.NCBench:提供一种开放、可重复、透明、可适应和持续的基于 DNA 测序的变异调用基准方法。
F1000Res. 2024 Sep 12;12:1125. doi: 10.12688/f1000research.140344.1. eCollection 2023.
3
Can a Liquid Biopsy Detect Circulating Tumor DNA With Low-passage Whole-genome Sequencing in Patients With a Sarcoma? A Pilot Evaluation.液体活检能否通过低深度全基因组测序检测肉瘤患者的循环肿瘤DNA?一项初步评估。
Clin Orthop Relat Res. 2025 Jan 1;483(1):39-48. doi: 10.1097/CORR.0000000000003161. Epub 2024 Jun 21.
4
Systemic Inflammatory Response Syndrome全身炎症反应综合征
5
A robust benchmark for detecting low-frequency variants in the HG002 Genome In A Bottle NIST reference material.用于检测基因组在瓶 NIST 参考材料 HG002 中低频变异的强大基准。
bioRxiv. 2024 Dec 5:2024.12.02.625685. doi: 10.1101/2024.12.02.625685.
6
CYTO-SV-ML: A Machine Learning Tool for Cytogenetic Structural Variant Analysis in Somatic Cell Type Using Genome Sequences.CYTO-SV-ML:一种利用基因组序列对体细胞类型进行细胞遗传学结构变异分析的机器学习工具。
Life (Basel). 2025 Jun 9;15(6):929. doi: 10.3390/life15060929.
7
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
8
Rapid, point-of-care antigen tests for diagnosis of SARS-CoV-2 infection.用于 SARS-CoV-2 感染诊断的快速、即时抗原检测。
Cochrane Database Syst Rev. 2022 Jul 22;7(7):CD013705. doi: 10.1002/14651858.CD013705.pub3.
9
The effect of sample site and collection procedure on identification of SARS-CoV-2 infection.样本采集部位和采集程序对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)感染鉴定的影响。
Cochrane Database Syst Rev. 2024 Dec 16;12(12):CD014780. doi: 10.1002/14651858.CD014780.
10
Variation within and between digital pathology and light microscopy for the diagnosis of histopathology slides: blinded crossover comparison study.数字病理学与光学显微镜检查在组织病理学切片诊断中的内部及相互间差异:双盲交叉对比研究
Health Technol Assess. 2025 Jul;29(30):1-75. doi: 10.3310/SPLK4325.

本文引用的文献

1
Jointly benchmarking small and structural variant calls with vcfdist.使用 vcfdist 联合基准小型和结构变异调用。
Genome Biol. 2024 Oct 2;25(1):253. doi: 10.1186/s13059-024-03394-5.
2
VolcanoSV enables accurate and robust structural variant calling in diploid genomes from single-molecule long read sequencing.VolcanoSV 可实现基于单分子长读测序的二倍体基因组中准确稳健的结构变异 calling。
Nat Commun. 2024 Aug 13;15(1):6956. doi: 10.1038/s41467-024-51282-0.
3
Detection of mosaic and population-level structural variants with Sniffles2.
使用 Sniffles2 检测嵌合体和群体水平的结构变异。
Nat Biotechnol. 2024 Oct;42(10):1571-1580. doi: 10.1038/s41587-023-02024-y. Epub 2024 Jan 2.
4
Comparison and benchmark of structural variants detected from long read and long-read assembly.长读与长读组装检测到的结构变异的比较和基准测试。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad188.
5
Variant calling and benchmarking in an era of complete human genome sequences.全基因组序列时代的变异调用和基准测试。
Nat Rev Genet. 2023 Jul;24(7):464-483. doi: 10.1038/s41576-023-00590-0. Epub 2023 Apr 14.
6
Deciphering the exact breakpoints of structural variations using long sequencing reads with DeBreak.使用 DeBreak 对长测序reads 进行分析,以破译结构变异的精确断点。
Nat Commun. 2023 Jan 17;14(1):283. doi: 10.1038/s41467-023-35996-1.
7
Truvari: refined structural variant comparison preserves allelic diversity.特鲁瓦里:精细化结构变异比较保留等位基因多样性。
Genome Biol. 2022 Dec 27;23(1):271. doi: 10.1186/s13059-022-02840-6.
8
igv.js: an embeddable JavaScript implementation of the Integrative Genomics Viewer (IGV).igv.js:一个可嵌入的 JavaScript 实现的综合基因组浏览器(IGV)。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac830.
9
SVDSS: structural variation discovery in hard-to-call genomic regions using sample-specific strings from accurate long reads.SVDSS:使用准确长读段中样本特异性字符串在难以测序的基因组区域发现结构变异。
Nat Methods. 2023 Apr;20(4):550-558. doi: 10.1038/s41592-022-01674-1. Epub 2022 Dec 22.
10
Whole genome sequencing identifies structural variants contributing to hematologic traits in the NHLBI TOPMed program.全基因组测序鉴定出导致 NHLBI TOPMed 计划中血液学特征的结构变异。
Nat Commun. 2022 Dec 8;13(1):7592. doi: 10.1038/s41467-022-35354-7.