• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用 vcfdist 联合基准小型和结构变异调用。

Jointly benchmarking small and structural variant calls with vcfdist.

机构信息

Computer Science and Engineering, University of Michigan, Ann Arbor, Michigan, USA.

National Institute of Standards and Technology, Gaithersburg, Maryland, USA.

出版信息

Genome Biol. 2024 Oct 2;25(1):253. doi: 10.1186/s13059-024-03394-5.

DOI:10.1186/s13059-024-03394-5
PMID:39358801
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11446017/
Abstract

In this work, we extend vcfdist to be the first variant call benchmarking tool to jointly evaluate phased single-nucleotide polymorphisms (SNPs), small insertions/deletions (INDELs), and structural variants (SVs) for the whole genome. First, we find that a joint evaluation of small and structural variants uniformly reduces measured errors for SNPs (- 28.9%), INDELs (- 19.3%), and SVs (- 52.4%) across three datasets. vcfdist also corrects a common flaw in phasing evaluations, reducing measured flip errors by over 50%. Lastly, we show that vcfdist is more accurate than previously published works and on par with the newest approaches while providing improved result interpretability.

摘要

在这项工作中,我们将 vcfdist 扩展为第一个变体调用基准测试工具,以联合评估全基因组的相位单核苷酸多态性(SNP)、小插入/缺失(INDEL)和结构变异(SV)。首先,我们发现,对小变异和结构变异进行联合评估,可统一降低三个数据集上 SNP(-28.9%)、INDEL(-19.3%)和 SV(-52.4%)的测量误差。vcfdist 还纠正了相位评估中的一个常见缺陷,将测量的翻转错误减少了 50%以上。最后,我们表明,vcfdist 比以前的工作更准确,与最新的方法相当,同时提供了更好的结果可解释性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2816/11446017/0a375f09efaf/13059_2024_3394_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2816/11446017/131a6edb4f5c/13059_2024_3394_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2816/11446017/04df6f35ed0c/13059_2024_3394_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2816/11446017/4d4b68a43f2a/13059_2024_3394_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2816/11446017/428a62191048/13059_2024_3394_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2816/11446017/0a375f09efaf/13059_2024_3394_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2816/11446017/131a6edb4f5c/13059_2024_3394_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2816/11446017/04df6f35ed0c/13059_2024_3394_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2816/11446017/4d4b68a43f2a/13059_2024_3394_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2816/11446017/428a62191048/13059_2024_3394_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2816/11446017/0a375f09efaf/13059_2024_3394_Fig5_HTML.jpg

相似文献

1
Jointly benchmarking small and structural variant calls with vcfdist.使用 vcfdist 联合基准小型和结构变异调用。
Genome Biol. 2024 Oct 2;25(1):253. doi: 10.1186/s13059-024-03394-5.
2
vcfdist: accurately benchmarking phased small variant calls in human genomes.vcfdist:准确地对人类基因组中已分型的小型变异进行基准测试。
Nat Commun. 2023 Dec 9;14(1):8149. doi: 10.1038/s41467-023-43876-x.
3
svclassify: a method to establish benchmark structural variant calls.svclassify:一种建立基准结构变异调用的方法。
BMC Genomics. 2016 Jan 16;17:64. doi: 10.1186/s12864-016-2366-2.
4
ONCOLINER: A new solution for monitoring, improving, and harmonizing somatic variant calling across genomic oncology centers.ONCOLINER:一种新的解决方案,用于监测、改进和协调肿瘤基因组学中心的体细胞变异调用。
Cell Genom. 2024 Sep 11;4(9):100639. doi: 10.1016/j.xgen.2024.100639. Epub 2024 Aug 30.
5
SMaSH: a benchmarking toolkit for human genome variant calling.SMaSH:一种用于人类基因组变异检测的基准测试工具包。
Bioinformatics. 2014 Oct;30(19):2787-95. doi: 10.1093/bioinformatics/btu345. Epub 2014 Jun 3.
6
SvABA: genome-wide detection of structural variants and indels by local assembly.SvABA:通过局部组装进行全基因组结构变异和插入缺失的检测。
Genome Res. 2018 Apr;28(4):581-591. doi: 10.1101/gr.221028.117. Epub 2018 Mar 13.
7
An open resource for accurately benchmarking small variant and reference calls.用于准确基准测试小型变体和参考调用的开放资源。
Nat Biotechnol. 2019 May;37(5):561-566. doi: 10.1038/s41587-019-0074-6. Epub 2019 Apr 1.
8
Comparative assessments of indel annotations in healthy and cancer genomes with next-generation sequencing data.基于下一代测序数据的健康和癌症基因组中插入缺失注释的比较评估。
BMC Med Genomics. 2020 Nov 10;13(1):170. doi: 10.1186/s12920-020-00818-6.
9
GCAT|Panel, a comprehensive structural variant haplotype map of the Iberian population from high-coverage whole-genome sequencing.GCAT|Panel,对来自高覆盖全基因组测序的伊比利亚人群进行的全面结构变异单倍型图谱。
Nucleic Acids Res. 2022 Mar 21;50(5):2464-2479. doi: 10.1093/nar/gkac076.
10
A high-quality human reference panel reveals the complexity and distribution of genomic structural variants.高质量的人类参考面板揭示了基因组结构变异的复杂性和分布。
Nat Commun. 2016 Oct 6;7:12989. doi: 10.1038/ncomms12989.

引用本文的文献

1
ASVBM: Structural variant benchmarking with local joint analysis for multiple callsets.ASVBM:通过对多个数据集进行局部联合分析的结构变异基准测试
Comput Struct Biotechnol J. 2025 Jun 29;27:2851-2862. doi: 10.1016/j.csbj.2025.06.045. eCollection 2025.
2
Unraveling the hidden complexity of cancer through long-read sequencing.通过长读长测序揭示癌症隐藏的复杂性。
Genome Res. 2025 Apr 14;35(4):599-620. doi: 10.1101/gr.280041.124.

本文引用的文献

1
vcfdist: accurately benchmarking phased small variant calls in human genomes.vcfdist:准确地对人类基因组中已分型的小型变异进行基准测试。
Nat Commun. 2023 Dec 9;14(1):8149. doi: 10.1038/s41467-023-43876-x.
2
Telomere-to-telomere assembly of diploid chromosomes with Verkko.利用 Verkko 进行二倍体染色体的端粒到端粒组装。
Nat Biotechnol. 2023 Oct;41(10):1474-1482. doi: 10.1038/s41587-023-01662-6. Epub 2023 Feb 16.
3
Optimal gap-affine alignment in O(s) space.最优间隙仿射对齐,时间复杂度为 O(s)。
Bioinformatics. 2023 Feb 3;39(2). doi: 10.1093/bioinformatics/btad074.
4
Truvari: refined structural variant comparison preserves allelic diversity.特鲁瓦里:精细化结构变异比较保留等位基因多样性。
Genome Biol. 2022 Dec 27;23(1):271. doi: 10.1186/s13059-022-02840-6.
5
Semi-automated assembly of high-quality diploid human reference genomes.半自动组装高质量的二倍体人类参考基因组。
Nature. 2022 Nov;611(7936):519-531. doi: 10.1038/s41586-022-05325-5. Epub 2022 Oct 19.
6
PrecisionFDA Truth Challenge V2: Calling variants from short and long reads in difficult-to-map regions.精准FDA真相挑战V2:在难以映射的区域中从短读长和长读长中识别变异体。
Cell Genom. 2022 May 11;2(5). doi: 10.1016/j.xgen.2022.100129. Epub 2022 Apr 27.
7
Haplotype-aware variant calling with PEPPER-Margin-DeepVariant enables high accuracy in nanopore long-reads.使用 PEPPER-Margin-DeepVariant 进行单体型感知变异调用可实现纳米孔长读段的高精度。
Nat Methods. 2021 Nov;18(11):1322-1332. doi: 10.1038/s41592-021-01299-w. Epub 2021 Nov 1.
8
Haplotype-resolved diverse human genomes and integrated analysis of structural variation.单体型解析的多样化人类基因组和结构变异的综合分析。
Science. 2021 Apr 2;372(6537). doi: 10.1126/science.abf7117. Epub 2021 Feb 25.
9
Twelve years of SAMtools and BCFtools.SAMtools 和 BCFtools 十二年。
Gigascience. 2021 Feb 16;10(2). doi: 10.1093/gigascience/giab008.
10
Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm.使用带有 hifiasm 的相定装配图进行单体型解析从头组装。
Nat Methods. 2021 Feb;18(2):170-175. doi: 10.1038/s41592-020-01056-5. Epub 2021 Feb 1.