• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

VarBen:用于临床下一代测序生物信息学管道评估的计算机模拟参考数据集生成。

VarBen: Generating in Silico Reference Data Sets for Clinical Next-Generation Sequencing Bioinformatics Pipeline Evaluation.

机构信息

National Center for Clinical Laboratories, Beijing Hospital, National Center of Gerontology, Beijing, People's Republic of China; Graduate School, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing, People's Republic of China.

Key Laboratory of Intelligent Information Processing, Advanced Computer Research Center, Institute of Computing Technology, Chinese Academy of Medical Sciences, Beijing, People's Republic of China.

出版信息

J Mol Diagn. 2021 Mar;23(3):285-299. doi: 10.1016/j.jmoldx.2020.11.010. Epub 2020 Dec 18.

DOI:10.1016/j.jmoldx.2020.11.010
PMID:33346148
Abstract

Next-generation sequencing is increasingly being adopted as a valuable method for the detection of somatic variants in clinical oncology. However, it is still challenging to reach a satisfactory level of robustness and standardization in clinical practice when using the currently available bioinformatics pipelines to detect variants from raw sequencing data. Moreover, appropriate reference data sets are lacking for clinical bioinformatics pipeline development, validation, and proficiency testing. Herein, we developed the Variant Benchmark tool (VarBen), an open-source software for variant simulation to generate customized reference data sets by directly editing the original sequencing reads. VarBen can introduce a variety of variants, including single-nucleotide variants, small insertions and deletions, and large structural variants, into targeted, exome, or whole-genome sequencing data, and can handle sequencing data from both the Illumina and Ion Torrent sequencing platforms. To demonstrate the feasibility and robustness of VarBen, we performed variant simulation on different sequencing data sets and compared the simulated variants with real-world data. The validation study showed that the simulated data are highly comparable to real-world data and that VarBen is a reliable tool for variant simulation. In addition, our collaborative study of somatic variant calling in 20 laboratories emphasizes the need for laboratories to evaluate their bioinformatics pipelines with customized reference data sets. VarBen may help users develop and validate their bioinformatics pipelines using locally generated sequencing data.

摘要

下一代测序技术越来越多地被用作检测临床肿瘤体细胞变异的有价值的方法。然而,当使用当前可用的生物信息学管道从原始测序数据中检测变异时,在临床实践中达到令人满意的稳健性和标准化水平仍然具有挑战性。此外,临床生物信息学管道开发、验证和能力测试缺乏适当的参考数据集。在此,我们开发了 Variant Benchmark 工具(VarBen),这是一种用于变异模拟的开源软件,可以通过直接编辑原始测序reads 生成定制的参考数据集。VarBen 可以将各种变异(包括单核苷酸变异、小插入和缺失以及大结构变异)引入靶向、外显子或全基因组测序数据中,并且可以处理来自 Illumina 和 Ion Torrent 测序平台的测序数据。为了证明 VarBen 的可行性和稳健性,我们在不同的测序数据集上进行了变异模拟,并将模拟的变异与真实世界的数据进行了比较。验证研究表明,模拟数据与真实世界数据高度可比,VarBen 是一种可靠的变异模拟工具。此外,我们在 20 个实验室中进行的体细胞变异calling 的合作研究强调了实验室需要使用定制的参考数据集来评估其生物信息学管道。VarBen 可以帮助用户使用本地生成的测序数据开发和验证其生物信息学管道。

相似文献

1
VarBen: Generating in Silico Reference Data Sets for Clinical Next-Generation Sequencing Bioinformatics Pipeline Evaluation.VarBen:用于临床下一代测序生物信息学管道评估的计算机模拟参考数据集生成。
J Mol Diagn. 2021 Mar;23(3):285-299. doi: 10.1016/j.jmoldx.2020.11.010. Epub 2020 Dec 18.
2
Validation of a Customized Bioinformatics Pipeline for a Clinical Next-Generation Sequencing Test Targeting Solid Tumor-Associated Variants.验证用于靶向实体瘤相关变异的临床下一代测序测试的定制生物信息学管道。
J Mol Diagn. 2018 May;20(3):355-365. doi: 10.1016/j.jmoldx.2018.01.007. Epub 2018 Feb 19.
3
Systematic comparison of germline variant calling pipelines cross multiple next-generation sequencers.跨多种下一代测序仪的种系变异调用管道的系统比较。
Sci Rep. 2019 Jun 27;9(1):9345. doi: 10.1038/s41598-019-45835-3.
4
A Model Study of In Silico Proficiency Testing for Clinical Next-Generation Sequencing.临床下一代测序的计算机模拟能力验证模型研究
Arch Pathol Lab Med. 2016 Oct;140(10):1085-91. doi: 10.5858/arpa.2016-0194-CP. Epub 2016 Jul 7.
5
Enhancing SNV identification in whole-genome sequencing data through the incorporation of known genetic variants into the minimap2 index.通过将已知遗传变异纳入 minimap2 索引来提高全基因组测序数据中 SNV 的识别能力。
BMC Bioinformatics. 2024 Jul 13;25(1):238. doi: 10.1186/s12859-024-05862-y.
6
From Wet-Lab to Variations: Concordance and Speed of Bioinformatics Pipelines for Whole Genome and Whole Exome Sequencing.从湿实验室到变异:全基因组和全外显子组测序的生物信息学流程的一致性和速度
Hum Mutat. 2016 Dec;37(12):1263-1271. doi: 10.1002/humu.23114. Epub 2016 Sep 26.
7
Multi-Institutional FASTQ File Exchange as a Means of Proficiency Testing for Next-Generation Sequencing Bioinformatics and Variant Interpretation.多机构FASTQ文件交换作为下一代测序生物信息学和变异解读能力验证的一种手段
J Mol Diagn. 2016 Jul;18(4):572-9. doi: 10.1016/j.jmoldx.2016.03.002. Epub 2016 May 4.
8
Evaluation of Variant Calling Methods for Bacterial Whole-Genome Sequencing Assays.细菌全基因组测序分析中变异calling 方法的评估。
J Clin Microbiol. 2023 Aug 23;61(8):e0184222. doi: 10.1128/jcm.01842-22. Epub 2023 Jul 10.
9
INDELseek: detection of complex insertions and deletions from next-generation sequencing data.INDELseek:从下一代测序数据中检测复杂插入和缺失
BMC Genomics. 2017 Jan 5;18(1):16. doi: 10.1186/s12864-016-3449-9.
10
insiM: in silico Mutator Software for Bioinformatics Pipeline Validation of Clinical Next-Generation Sequencing Assays.insiM:用于临床下一代测序检测生物信息学管道验证的计算机诱变软件。
J Mol Diagn. 2019 Jan;21(1):19-26. doi: 10.1016/j.jmoldx.2018.08.001. Epub 2018 Sep 29.

引用本文的文献

1
Improving somatic exome sequencing performance by biological replicates.通过生物学重复提高体细胞外显子组测序性能。
BMC Bioinformatics. 2024 Mar 22;25(1):124. doi: 10.1186/s12859-024-05742-5.
2
Variant calling and benchmarking in an era of complete human genome sequences.全基因组序列时代的变异调用和基准测试。
Nat Rev Genet. 2023 Jul;24(7):464-483. doi: 10.1038/s41576-023-00590-0. Epub 2023 Apr 14.
3
Deciphering complex genome rearrangements in C. elegans using short-read whole genome sequencing.利用短读长全基因组测序技术解析秀丽隐杆线虫中的复杂基因组重排。
Sci Rep. 2021 Sep 14;11(1):18258. doi: 10.1038/s41598-021-97764-9.
4
Creation of an Expert Curated Variant List for Clinical Genomic Test Development and Validation: A ClinGen and GeT-RM Collaborative Project.创建临床基因组测试开发和验证的专家策划变异列表:ClinGen 和 GeT-RM 合作项目。
J Mol Diagn. 2021 Nov;23(11):1500-1505. doi: 10.1016/j.jmoldx.2021.07.018. Epub 2021 Aug 9.