Department of Molecular Medicine, Aarhus University Hospital, 8200 Aarhus, Denmark.
Department of Clinical Medicine, Aarhus University, 8000 Aarhus, Denmark.
Int J Mol Sci. 2024 Oct 24;25(21):11439. doi: 10.3390/ijms252111439.
Circulating tumor DNA (ctDNA) is a promising cancer biomarker, but accurately detecting tumor mutations in cell-free DNA (cfDNA) is challenging due to their low frequency and sequencing errors. Our study benchmarked Mutect2, VarScan2, shearwater, and DREAMS-vc using deep targeted sequencing of cfDNA with Unique Molecular Identifiers (UMIs) from 111 colorectal cancer patients. Performance was assessed at both the mutation level (distinguish tumor variants from errors) and the sample level (detect if an individual has cancer). Additionally, we investigated the effects of various UMI grouping and consensus strategies. The shearwater-AND variant calling method demonstrated the highest precision in detecting tumor-derived mutations from plasma, and reached the highest ROC-AUC of 0.984 for sample classification in tumor-informed cfDNA analyses. DREAMS-vc exhibited the highest ROC-AUC of 0.808 for sample classification in tumor-agnostic studies. We also found that sequencing depth differences in PBMCs could lead to false positives, particularly with VarScan2 and Mutect2, which was addressed by downsampling to equivalent mean depths. Additionally, network-based UMI grouping methods outperformed those using identical UMIs when all reads were retained. Our findings emphasize that the optimal variant caller depends on the study context-whether focused on mutation or sample classification, and whether conducted under tumor-informed or tumor-agnostic conditions.
循环肿瘤 DNA(ctDNA)是一种很有前途的癌症生物标志物,但由于其频率低和测序错误,准确检测游离 DNA(cfDNA)中的肿瘤突变具有挑战性。我们使用带有独特分子标识符(UMI)的 cfDNA 进行深度靶向测序,对 111 例结直肠癌患者的 Mutect2、VarScan2、shearwater 和 DREAMS-vc 进行了基准测试。性能评估了突变水平(区分肿瘤变体与错误)和样本水平(检测个体是否患有癌症)。此外,我们还研究了各种 UMI 分组和共识策略的影响。shearwater-AND 变异调用方法在从血浆中检测肿瘤衍生突变方面表现出最高的精度,在肿瘤信息 cfDNA 分析中达到了 0.984 的最高 ROC-AUC 用于样本分类。DREAMS-vc 在肿瘤不可知研究中用于样本分类的 ROC-AUC 最高,为 0.808。我们还发现 PBMC 中的测序深度差异可能导致假阳性,特别是对于 VarScan2 和 Mutect2,通过下采样到等效的平均深度可以解决这个问题。此外,当保留所有读取时,基于网络的 UMI 分组方法优于使用相同 UMI 的方法。我们的研究结果强调,最佳变异调用器取决于研究背景——是否关注突变或样本分类,以及是否在肿瘤信息或肿瘤不可知条件下进行。