• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用贝叶斯因子调整的优势比进行普遍性调用小变异。

Calling small variants using universality with Bayes-factor-adjusted odds ratios.

机构信息

Genetron Health (Beijing) Co. Ltd, Beijing 102208, China.

State Key Laboratory of Medical Molecular Biology, Center for Bioinformatics, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences, Peking Union Medical College, Beijing 100005, China.

出版信息

Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab458.

DOI:10.1093/bib/bbab458
PMID:34791010
Abstract

The application of next-generation sequencing in research and particularly in clinical routine requires highly accurate variant calling. Here we describe UVC, a method for calling small variants of germline or somatic origin. By unifying opposite assumptions with sublation, we discovered the following two empirical laws to improve variant calling: allele fraction at high sequencing depth is inversely proportional to the cubic root of variant-calling error rate, and odds ratios adjusted with Bayes factors can model various sequencing biases. UVC outperformed other variant callers on the GIAB germline truth sets, 192 scenarios of in silico mixtures simulating 192 combinations of tumor/normal sequencing depths and tumor/normal purities, the GIAB somatic truth sets derived from physical mixture, and the SEQC2 somatic reference sets derived from the breast-cancer cell-line HCC1395. UVC achieved 100% concordance with the manual review conducted by multiple independent researchers on a Qiagen 71-gene-panel dataset derived from 16 patients with colon adenoma. UVC outperformed other unique molecular identifier (UMI)-aware variant callers on the datasets used for publishing these variant callers. Performance was measured with sensitivity-specificity trade off for called variants. The improved variant calls generated by UVC from previously published UMI-based sequencing data provided additional insight about DNA damage repair. UVC is open-sourced under the BSD 3-Clause license at https://github.com/genetronhealth/uvc and quay.io/genetronhealth/gcc-6-3-0-uvc-0-6-0-441a694.

摘要

下一代测序在研究中,特别是在临床常规中的应用需要高度准确的变异调用。在这里,我们描述了 UVC,这是一种用于调用种系或体细胞来源的小变体的方法。通过将对立的假设统一起来,并加以扬弃,我们发现了以下两条提高变异调用准确性的经验法则:高测序深度下的等位基因分数与变异调用错误率的立方根成反比,并且经过贝叶斯因子调整的优势比可以模拟各种测序偏差。在 GIAB 种系真实数据集、模拟肿瘤/正常测序深度和肿瘤/正常纯度 192 种组合的 192 种计算机模拟混合物场景、来自物理混合物的 GIAB 体细胞真实数据集以及来自乳腺癌细胞系 HCC1395 的 SEQC2 体细胞参考集中,UVC 的表现优于其他变异调用者。UVC 与多位独立研究人员对来自 16 名结肠腺瘤患者的 Qiagen 71 基因面板数据集进行的手动审查结果完全一致。在用于发布这些变异调用者的数据集上,UVC 优于其他独特分子标识符 (UMI) 感知的变异调用者。性能是通过对调用变异体的敏感性特异性权衡来衡量的。UVC 从先前发表的基于 UMI 的测序数据中生成的改进变异调用提供了有关 DNA 损伤修复的更多见解。UVC 是在 BSD 3 条款许可证下开源的,网址为 https://github.com/genetronhealth/uvc 和 quay.io/genetronhealth/gcc-6-3-0-uvc-0-6-0-441a694。

相似文献

1
Calling small variants using universality with Bayes-factor-adjusted odds ratios.使用贝叶斯因子调整的优势比进行普遍性调用小变异。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab458.
2
Benchmarking UMI-aware and standard variant callers for low frequency ctDNA variant detection.基于 UMIs 的低频 ctDNA 变异检测与标准变异 caller 的基准测试
BMC Genomics. 2024 Sep 3;25(1):827. doi: 10.1186/s12864-024-10737-w.
3
Evaluating the performance of low-frequency variant calling tools for the detection of variants from short-read deep sequencing data.评估低频变异调用工具在检测短读长深度测序数据中的变异方面的性能。
Sci Rep. 2023 Nov 22;13(1):20444. doi: 10.1038/s41598-023-47135-3.
4
RDscan: A New Method for Improving Germline and Somatic Variant Calling Based on Read Depth Distribution.RDscan:一种基于读深度分布的提高种系和体细胞变异calling 的新方法。
J Comput Biol. 2022 Sep;29(9):987-1000. doi: 10.1089/cmb.2021.0269. Epub 2022 Jun 24.
5
tarSVM: Improving the accuracy of variant calls derived from microfluidic PCR-based targeted next generation sequencing using a support vector machine.tarSVM:使用支持向量机提高基于微流控PCR的靶向新一代测序得出的变异检测准确性。
BMC Bioinformatics. 2016 Jun 10;17(1):233. doi: 10.1186/s12859-016-1108-4.
6
Systematic comparison of germline variant calling pipelines cross multiple next-generation sequencers.跨多种下一代测序仪的种系变异调用管道的系统比较。
Sci Rep. 2019 Jun 27;9(1):9345. doi: 10.1038/s41598-019-45835-3.
7
SNooPer: a machine learning-based method for somatic variant identification from low-pass next-generation sequencing.SNooPer:一种基于机器学习从低深度下一代测序中识别体细胞变异的方法。
BMC Genomics. 2016 Nov 14;17(1):912. doi: 10.1186/s12864-016-3281-2.
8
Variant callers for next-generation sequencing data: a comparison study.下一代测序数据的变异调用者:一项比较研究。
PLoS One. 2013 Sep 27;8(9):e75619. doi: 10.1371/journal.pone.0075619. eCollection 2013.
9
Benchmarking variant callers in next-generation and third-generation sequencing analysis.在新一代和第三代测序分析中对变异调用程序进行基准测试。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa148.
10
Somatic and Germline Variant Calling from Next-Generation Sequencing Data.从下一代测序数据中进行体细胞和种系变异调用。
Adv Exp Med Biol. 2022;1361:37-54. doi: 10.1007/978-3-030-91836-1_3.

引用本文的文献

1
Computational methods and data resources for predicting tumor neoantigens.预测肿瘤新抗原的计算方法和数据资源
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf302.
2
Validation of genetic variants from NGS data using deep convolutional neural networks.使用深度卷积神经网络验证 NGS 数据中的遗传变异。
BMC Bioinformatics. 2023 Apr 20;24(1):158. doi: 10.1186/s12859-023-05255-7.