• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

VISTA:一个用于结构变异发现的集成框架。

VISTA: an integrated framework for structural variant discovery.

机构信息

Department of Computer Science, University of California Los Angeles, 580 Portola Plaza, Los Angeles, CA 90095, United States.

Department of Quantitative and Computational Biology, Dana and David Dornsife College of Letters, Arts and Sciences University of Southern California, 3540 S Figueroa St, Los Angeles, California 90089, United States.

出版信息

Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae462.

DOI:10.1093/bib/bbae462
PMID:39297879
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11411772/
Abstract

Structural variation (SV) refers to insertions, deletions, inversions, and duplications in human genomes. SVs are present in approximately 1.5% of the human genome. Still, this small subset of genetic variation has been implicated in the pathogenesis of psoriasis, Crohn's disease and other autoimmune disorders, autism spectrum and other neurodevelopmental disorders, and schizophrenia. Since identifying structural variants is an important problem in genetics, several specialized computational techniques have been developed to detect structural variants directly from sequencing data. With advances in whole-genome sequencing (WGS) technologies, a plethora of SV detection methods have been developed. However, dissecting SVs from WGS data remains a challenge, with the majority of SV detection methods prone to a high false-positive rate, and no existing method able to precisely detect a full range of SVs present in a sample. Previous studies have shown that none of the existing SV callers can maintain high accuracy across various SV lengths and genomic coverages. Here, we report an integrated structural variant calling framework, Variant Identification and Structural Variant Analysis (VISTA), that leverages the results of individual callers using a novel and robust filtering and merging algorithm. In contrast to existing consensus-based tools which ignore the length and coverage, VISTA overcomes this limitation by executing various combinations of top-performing callers based on variant length and genomic coverage to generate SV events with high accuracy. We evaluated the performance of VISTA on comprehensive gold-standard datasets across varying organisms and coverage. We benchmarked VISTA using the Genome-in-a-Bottle gold standard SV set, haplotype-resolved de novo assemblies from the Human Pangenome Reference Consortium, along with an in-house polymerase chain reaction (PCR)-validated mouse gold standard set. VISTA maintained the highest F1 score among top consensus-based tools measured using a comprehensive gold standard across both mouse and human genomes. VISTA also has an optimized mode, where the calls can be optimized for precision or recall. VISTA-optimized can attain 100% precision and the highest sensitivity among other variant callers. In conclusion, VISTA represents a significant advancement in structural variant calling, offering a robust and accurate framework that outperforms existing consensus-based tools and sets a new standard for SV detection in genomic research.

摘要

结构变异 (SV) 是指人类基因组中的插入、缺失、倒位和重复。SV 约占人类基因组的 1.5%。尽管这一小部分遗传变异与银屑病、克罗恩病和其他自身免疫性疾病、自闭症谱系和其他神经发育障碍以及精神分裂症的发病机制有关。由于鉴定结构变异是遗传学中的一个重要问题,因此已经开发了几种专门的计算技术来直接从测序数据中检测结构变异。随着全基因组测序 (WGS) 技术的进步,已经开发了大量的 SV 检测方法。然而,从 WGS 数据中解析 SV 仍然是一个挑战,大多数 SV 检测方法容易出现高假阳性率,并且没有现有的方法能够精确检测样本中存在的全范围的 SV。以前的研究表明,现有的 SV 调用者都不能在各种 SV 长度和基因组覆盖范围内保持高精度。在这里,我们报告了一种集成的结构变异调用框架,即变体识别和结构变异分析 (VISTA),它利用了各个调用者的结果,使用一种新颖而强大的过滤和合并算法。与忽略长度和覆盖范围的现有基于共识的工具不同,VISTA 通过根据变体长度和基因组覆盖范围执行各种表现最佳的调用者的组合来克服这一限制,从而以高精度生成 SV 事件。我们在不同的生物体和覆盖范围内的综合金标准数据集上评估了 VISTA 的性能。我们使用基因组瓶金标准 SV 集、人类泛基因组参考联盟的单倍型解析从头组装以及内部聚合酶链反应 (PCR) 验证的小鼠金标准集来对 VISTA 进行基准测试。VISTA 在使用综合金标准对小鼠和人类基因组进行测量时,在基于共识的顶级工具中保持了最高的 F1 分数。VISTA 还具有优化模式,可以针对精度或召回率优化调用。VISTA-optimized 可以在其他变体调用者中达到 100%的精度和最高的灵敏度。总之,VISTA 代表了结构变异调用方面的重大进展,提供了一种强大而准确的框架,优于现有的基于共识的工具,并为基因组研究中的 SV 检测设定了新标准。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8dd0/11411772/1ae759ac026a/bbae462f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8dd0/11411772/7df14ac9295a/bbae462f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8dd0/11411772/17a01b933fcf/bbae462f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8dd0/11411772/823a9f097841/bbae462f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8dd0/11411772/2a3901ee132d/bbae462f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8dd0/11411772/19ef8b8fd8c8/bbae462f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8dd0/11411772/4fad711fe228/bbae462f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8dd0/11411772/1ae759ac026a/bbae462f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8dd0/11411772/7df14ac9295a/bbae462f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8dd0/11411772/17a01b933fcf/bbae462f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8dd0/11411772/823a9f097841/bbae462f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8dd0/11411772/2a3901ee132d/bbae462f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8dd0/11411772/19ef8b8fd8c8/bbae462f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8dd0/11411772/4fad711fe228/bbae462f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8dd0/11411772/1ae759ac026a/bbae462f7.jpg

相似文献

1
VISTA: an integrated framework for structural variant discovery.VISTA:一个用于结构变异发现的集成框架。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae462.
2
A comprehensive benchmarking of WGS-based deletion structural variant callers.基于 WGS 的缺失结构变异调用器的综合基准测试。
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac221.
3
Comprehensive evaluation and guidance of structural variation detection tools in chicken whole genome sequence data.鸡全基因组序列数据中结构变异检测工具的综合评估和指导
BMC Genomics. 2024 Oct 16;25(1):970. doi: 10.1186/s12864-024-10875-1.
4
svclassify: a method to establish benchmark structural variant calls.svclassify:一种建立基准结构变异调用的方法。
BMC Genomics. 2016 Jan 16;17:64. doi: 10.1186/s12864-016-2366-2.
5
A Comparison of Structural Variant Calling from Short-Read and Nanopore-Based Whole-Genome Sequencing Using Optical Genome Mapping as a Benchmark.基于光学基因组图谱作为基准的短读长和纳米孔全基因组测序的结构变异调用比较。
Genes (Basel). 2024 Jul 16;15(7):925. doi: 10.3390/genes15070925.
6
VolcanoSV enables accurate and robust structural variant calling in diploid genomes from single-molecule long read sequencing.VolcanoSV 可实现基于单分子长读测序的二倍体基因组中准确稳健的结构变异 calling。
Nat Commun. 2024 Aug 13;15(1):6956. doi: 10.1038/s41467-024-51282-0.
7
Comparison of structural variant callers for massive whole-genome sequence data.大规模全基因组序列数据结构变异调用器的比较。
BMC Genomics. 2024 Mar 28;25(1):318. doi: 10.1186/s12864-024-10239-9.
8
Comprehensive evaluation of structural variation detection algorithms for whole genome sequencing.全基因组测序结构变异检测算法的综合评估。
Genome Biol. 2019 Jun 3;20(1):117. doi: 10.1186/s13059-019-1720-5.
9
Expectations and blind spots for structural variation detection from long-read assemblies and short-read genome sequencing technologies.从长读序列组装和短读基因组测序技术中检测结构变异的预期和盲点。
Am J Hum Genet. 2021 May 6;108(5):919-928. doi: 10.1016/j.ajhg.2021.03.014. Epub 2021 Mar 30.
10
iSVP: an integrated structural variant calling pipeline from high-throughput sequencing data.iSVP:一种基于高通量测序数据的整合结构变异检测流程
BMC Syst Biol. 2013;7 Suppl 6(Suppl 6):S8. doi: 10.1186/1752-0509-7-S6-S8. Epub 2013 Dec 13.

引用本文的文献

1
SV-MeCa: an XGBoost-based meta-caller approach for structural variant calling from short-read data.SV-MeCa:一种基于XGBoost的元调用方法,用于从短读长数据中进行结构变异检测。
BMC Bioinformatics. 2025 Aug 20;26(1):218. doi: 10.1186/s12859-025-06246-6.
2
OctopusV and TentacleSV: a one-stop toolkit for multi-sample, cross-platform structural variant comparison and analysis.章鱼V和触手SV:用于多样本、跨平台结构变异比较与分析的一站式工具包。
bioRxiv. 2025 Mar 28:2025.03.24.645012. doi: 10.1101/2025.03.24.645012.

本文引用的文献

1
A draft human pangenome reference.人类泛基因组参考草图。
Nature. 2023 May;617(7960):312-324. doi: 10.1038/s41586-023-05896-x. Epub 2023 May 10.
2
Jasmine and Iris: population-scale structural variant comparison and analysis.茉莉花和虹膜:人群规模结构变异比较与分析。
Nat Methods. 2023 Mar;20(3):408-417. doi: 10.1038/s41592-022-01753-3. Epub 2023 Jan 19.
3
Truvari: refined structural variant comparison preserves allelic diversity.特鲁瓦里:精细化结构变异比较保留等位基因多样性。
Genome Biol. 2022 Dec 27;23(1):271. doi: 10.1186/s13059-022-02840-6.
4
High-coverage whole-genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios.对扩展的 1000 基因组项目队列进行高覆盖率全基因组测序,包括 602 个三核苷酸重复序列。
Cell. 2022 Sep 1;185(18):3426-3440.e19. doi: 10.1016/j.cell.2022.08.004.
5
A comprehensive benchmarking of WGS-based deletion structural variant callers.基于 WGS 的缺失结构变异调用器的综合基准测试。
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac221.
6
A unified haplotype-based method for accurate and comprehensive variant calling.基于统一单倍型的精确和全面变异calling 方法。
Nat Biotechnol. 2021 Jul;39(7):885-892. doi: 10.1038/s41587-021-00861-3. Epub 2021 Mar 29.
7
Haplotype-resolved diverse human genomes and integrated analysis of structural variation.单体型解析的多样化人类基因组和结构变异的综合分析。
Science. 2021 Apr 2;372(6537). doi: 10.1126/science.abf7117. Epub 2021 Feb 25.
8
PopDel identifies medium-size deletions simultaneously in tens of thousands of genomes.PopDel 可同时在数万个基因组中识别中等大小的缺失。
Nat Commun. 2021 Feb 1;12(1):730. doi: 10.1038/s41467-020-20850-5.
9
Parliament2: Accurate structural variant calling at scale.Parliament2:大规模精确的结构变异调用。
Gigascience. 2020 Dec 21;9(12). doi: 10.1093/gigascience/giaa145.
10
A structural variation reference for medical and population genetics.医学和人群遗传学的结构变异参考
Nature. 2020 May;581(7809):444-451. doi: 10.1038/s41586-020-2287-8. Epub 2020 May 27.