• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 UMIs 的低频 ctDNA 变异检测与标准变异 caller 的基准测试

Benchmarking UMI-aware and standard variant callers for low frequency ctDNA variant detection.

机构信息

Department of Health Data Science, Institute of Population Health, University of Liverpool, Waterhouse Building, Block F, Brownlow Street, Liverpool, L69 3GF, UK.

MRC-University of Glasgow Centre for Virus Research, University of Glasgow, Garscube Campus, 464 Bearsden Road, Glasgow, G61 1QH, UK.

出版信息

BMC Genomics. 2024 Sep 3;25(1):827. doi: 10.1186/s12864-024-10737-w.

DOI:10.1186/s12864-024-10737-w
PMID:39227777
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11370058/
Abstract

BACKGROUND

Circulating tumour DNA (ctDNA) is a subset of cell free DNA (cfDNA) released by tumour cells into the bloodstream. Circulating tumour DNA has shown great potential as a biomarker to inform treatment in cancer patients. Collecting ctDNA is minimally invasive and reflects the entire genetic makeup of a patient's cancer. ctDNA variants in NGS data can be difficult to distinguish from sequencing and PCR artefacts due to low abundance, particularly in the early stages of cancer. Unique Molecular Identifiers (UMIs) are short sequences ligated to the sequencing library before amplification. These sequences are useful for filtering out low frequency artefacts. The utility of ctDNA as a cancer biomarker depends on accurate detection of cancer variants.

RESULTS

In this study, we benchmarked six variant calling tools, including two UMI-aware callers for their ability to call ctDNA variants. The standard variant callers tested included Mutect2, bcftools, LoFreq and FreeBayes. The UMI-aware variant callers benchmarked were UMI-VarCal and UMIErrorCorrect. We used both datasets with known variants spiked in at low frequencies, and datasets containing ctDNA, and generated synthetic UMI sequences for these datasets. Variant callers displayed different preferences for sensitivity and specificity. Mutect2 showed high sensitivity, while returning more privately called variants than any other caller in data without synthetic UMIs - an indicator of false positive variant discovery. In data encoded with synthetic UMIs, UMI-VarCal detected fewer putative false positive variants than all other callers in synthetic datasets. Mutect2 showed a balance between high sensitivity and specificity in data encoded with synthetic UMIs.

CONCLUSIONS

Our results indicate UMI-aware variant callers have potential to improve sensitivity and specificity in calling low frequency ctDNA variants over standard variant calling tools. There is a growing need for further development of UMI-aware variant calling tools if effective early detection methods for cancer using ctDNA samples are to be realised.

摘要

背景

循环肿瘤 DNA(ctDNA)是肿瘤细胞释放到血液中的无细胞 DNA(cfDNA)的一个子集。循环肿瘤 DNA 作为一种生物标志物,具有很大的潜力,可以为癌症患者的治疗提供信息。采集 ctDNA 具有微创性,并且反映了患者癌症的整个遗传构成。由于 ctDNA 变体丰度低,尤其是在癌症的早期阶段,在 NGS 数据中与测序和 PCR 伪影区分开来可能具有挑战性。独特分子标识符(UMI)是在扩增前连接到测序文库的短序列。这些序列对于过滤低频伪影很有用。ctDNA 作为癌症生物标志物的效用取决于对癌症变体的准确检测。

结果

在这项研究中,我们对六种变体调用工具进行了基准测试,包括两种用于检测 ctDNA 变体的 UMI 感知调用者。测试的标准变体调用者包括 Mutect2、bcftools、LoFreq 和 FreeBayes。基准测试的 UMI 感知变体调用者包括 UMI-VarCal 和 UMIErrorCorrect。我们使用了具有已知低频掺入变体的两个数据集和包含 ctDNA 的数据集,并为这些数据集生成了合成的 UMI 序列。变体调用者对灵敏度和特异性有不同的偏好。Mutect2 显示出高灵敏度,同时在没有合成 UMIs 的数据中比任何其他调用者返回更多的私有变体——这是假阳性变体发现的指标。在带有合成 UMIs 的数据中,UMI-VarCal 在合成数据集中检测到的假定假阳性变体比所有其他调用者都少。Mutect2 在带有合成 UMIs 的数据中显示出高灵敏度和特异性之间的平衡。

结论

我们的结果表明,与标准变体调用工具相比,UMI 感知变体调用者有可能提高对低频 ctDNA 变体的灵敏度和特异性。如果要使用 ctDNA 样本实现癌症的有效早期检测方法,则需要进一步开发 UMI 感知变体调用工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e82/11370058/63940d22b27a/12864_2024_10737_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e82/11370058/1dd6572a927c/12864_2024_10737_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e82/11370058/888a4dec4bca/12864_2024_10737_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e82/11370058/43190fc72817/12864_2024_10737_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e82/11370058/ee618b22c750/12864_2024_10737_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e82/11370058/97f7797cfce7/12864_2024_10737_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e82/11370058/63940d22b27a/12864_2024_10737_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e82/11370058/1dd6572a927c/12864_2024_10737_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e82/11370058/888a4dec4bca/12864_2024_10737_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e82/11370058/43190fc72817/12864_2024_10737_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e82/11370058/ee618b22c750/12864_2024_10737_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e82/11370058/97f7797cfce7/12864_2024_10737_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e82/11370058/63940d22b27a/12864_2024_10737_Fig6_HTML.jpg

相似文献

1
Benchmarking UMI-aware and standard variant callers for low frequency ctDNA variant detection.基于 UMIs 的低频 ctDNA 变异检测与标准变异 caller 的基准测试
BMC Genomics. 2024 Sep 3;25(1):827. doi: 10.1186/s12864-024-10737-w.
2
Evaluating the performance of low-frequency variant calling tools for the detection of variants from short-read deep sequencing data.评估低频变异调用工具在检测短读长深度测序数据中的变异方面的性能。
Sci Rep. 2023 Nov 22;13(1):20444. doi: 10.1038/s41598-023-47135-3.
3
UMI-VarCal: a new UMI-based variant caller that efficiently improves low-frequency variant detection in paired-end sequencing NGS libraries.UMI-VarCal:一种基于 UMI 的新型变异 caller,可有效提高配对末端测序 NGS 文库中低频变异的检测能力。
Bioinformatics. 2020 May 1;36(9):2718-2724. doi: 10.1093/bioinformatics/btaa053.
4
UMI-Varcal: A Low-Frequency Variant Caller for UMI-Tagged Paired-End Sequencing Data.UMI-Varcal:一种用于 UMI 标记的双端测序数据的低频变异调用器。
Methods Mol Biol. 2022;2493:235-245. doi: 10.1007/978-1-0716-2293-3_14.
5
SiNVICT: ultra-sensitive detection of single nucleotide variants and indels in circulating tumour DNA.SiNVICT:循环肿瘤 DNA 中单核苷酸变异和插入缺失的超灵敏检测。
Bioinformatics. 2017 Jan 1;33(1):26-34. doi: 10.1093/bioinformatics/btw536. Epub 2016 Aug 16.
6
UMI-Gen: A UMI-based read simulator for variant calling evaluation in paired-end sequencing NGS libraries.UMI-Gen:一种基于单分子唯一分子标识符(UMI)的读段模拟器,用于双端测序NGS文库中的变异检测评估。
Comput Struct Biotechnol J. 2020 Aug 27;18:2270-2280. doi: 10.1016/j.csbj.2020.08.011. eCollection 2020.
7
Benchmarking variant callers in next-generation and third-generation sequencing analysis.在新一代和第三代测序分析中对变异调用程序进行基准测试。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa148.
8
UMIErrorCorrect and UMIAnalyzer: Software for Consensus Read Generation, Error Correction, and Visualization Using Unique Molecular Identifiers.UMIErrorCorrect 和 UMIAnalyzer:使用唯一分子标识符生成共识读、纠错和可视化的软件。
Clin Chem. 2022 Nov 3;68(11):1425-1435. doi: 10.1093/clinchem/hvac136.
9
Benchmarking reveals superiority of deep learning variant callers on bacterial nanopore sequence data.基准测试显示深度学习变异调用程序在细菌纳米孔测序数据上的优越性。
Elife. 2024 Oct 10;13:RP98300. doi: 10.7554/eLife.98300.
10
MAGERI: Computational pipeline for molecular-barcoded targeted resequencing.MAGERI:用于分子条形码靶向重测序的计算流程
PLoS Comput Biol. 2017 May 5;13(5):e1005480. doi: 10.1371/journal.pcbi.1005480. eCollection 2017 May.

引用本文的文献

1
Exome sequencing shows same pattern of clonal tumor mutational burden, intratumor heterogenicity and clonal neoantigen between autologous tumor and Vigil product.外显子组测序显示,自体肿瘤与Vigil产品之间的克隆性肿瘤突变负荷、肿瘤内异质性和克隆性新抗原具有相同模式。
Sci Rep. 2025 Mar 13;15(1):8637. doi: 10.1038/s41598-025-90136-7.
2
Evaluating Bioinformatics Processing of Somatic Variant Detection in cfDNA Using Targeted Sequencing with UMIs.使用靶向测序和 UMIs 评估 cfDNA 中体细胞变异检测的生物信息学处理。
Int J Mol Sci. 2024 Oct 24;25(21):11439. doi: 10.3390/ijms252111439.

本文引用的文献

1
Optimized SMRT-UMI protocol produces highly accurate sequence datasets from diverse populations-Application to HIV-1 quasispecies.优化的单分子实时测序-独特分子标识符(SMRT-UMI)方案可从不同人群中产生高度准确的序列数据集——应用于HIV-1准种
Virus Evol. 2024 Mar 2;10(1):veae019. doi: 10.1093/ve/veae019. eCollection 2024.
2
Detection of circulating tumor-derived material in peripheral blood of pediatric sarcoma patients: A systematic review.小儿肉瘤患者外周血中循环肿瘤衍生物质的检测:一项系统综述。
Transl Oncol. 2023 Aug;34:101690. doi: 10.1016/j.tranon.2023.101690. Epub 2023 May 16.
3
UMIErrorCorrect and UMIAnalyzer: Software for Consensus Read Generation, Error Correction, and Visualization Using Unique Molecular Identifiers.
UMIErrorCorrect 和 UMIAnalyzer:使用唯一分子标识符生成共识读、纠错和可视化的软件。
Clin Chem. 2022 Nov 3;68(11):1425-1435. doi: 10.1093/clinchem/hvac136.
4
The evaluation of Bcftools mpileup and GATK HaplotypeCaller for variant calling in non-human species.Bcftools mpileup 和 GATK HaplotypeCaller 在非人类物种变异调用中的评估。
Sci Rep. 2022 Jul 5;12(1):11331. doi: 10.1038/s41598-022-15563-2.
5
Circulating cell-free DNA for cancer early detection.用于癌症早期检测的循环游离DNA
Innovation (Camb). 2022 May 6;3(4):100259. doi: 10.1016/j.xinn.2022.100259. eCollection 2022 Jul 12.
6
Mitochondrial DNA variation across 56,434 individuals in gnomAD.gnomAD 数据库中 56434 个人的线粒体 DNA 变异
Genome Res. 2022 Mar;32(3):569-582. doi: 10.1101/gr.276013.121. Epub 2022 Jan 24.
7
Multimodal liquid biopsy for early monitoring and outcome prediction of chemotherapy in metastatic breast cancer.用于转移性乳腺癌化疗早期监测和预后预测的多模态液体活检
NPJ Breast Cancer. 2021 Sep 9;7(1):115. doi: 10.1038/s41523-021-00319-4.
8
Towards Novel Non-Invasive Colorectal Cancer Screening Methods: A Comprehensive Review.迈向新型非侵入性结直肠癌筛查方法:全面综述
Cancers (Basel). 2021 Apr 10;13(8):1820. doi: 10.3390/cancers13081820.
9
A unified haplotype-based method for accurate and comprehensive variant calling.基于统一单倍型的精确和全面变异calling 方法。
Nat Biotechnol. 2021 Jul;39(7):885-892. doi: 10.1038/s41587-021-00861-3. Epub 2021 Mar 29.
10
ReSeq simulates realistic Illumina high-throughput sequencing data.ReSeq 模拟真实的 Illumina 高通量测序数据。
Genome Biol. 2021 Feb 19;22(1):67. doi: 10.1186/s13059-021-02265-7.