Medina Tyler D, Bennett Declan, Seoighe Cathal
School of Mathematical and Statistical Sciences, University of Galway, University Road, Galway H91 TK33, Ireland.
Research Ireland Centre for Research Training in Genomics Data Science, Research Ireland, Three Park Place, Hatch Street Upper, Dublin D02 FX65, Ireland.
NAR Genom Bioinform. 2025 Aug 27;7(3):lqaf120. doi: 10.1093/nargab/lqaf120. eCollection 2025 Sep.
Oxidative damage can introduce G>T mutations upon DNA replication. When this damage occurs , sequenced DNA exhibits strand asymmetry, whereby sequence alignment yields G>T mismatches without corresponding C>A mismatches on the complementary strand at a given locus. Strand asymmetry is used to identify potential sequencing artefacts in somatic variant calls in cancer sequencing projects. Consistent with previous studies, we found that the strandedness of this asymmetry is frequently shared across targeted capture regions. However, while some exome sequencing datasets displayed consistent asymmetry relative to the forward reference strand, some surprisingly showed asymmetry relative to the transcription strand. Though oxidation is the principle cause of artefactual G>T mutations, we propose that the asymmetry stems from the use of single-stranded exome capture probes, as we did not find consistent asymmetry in matched whole genome sequencing. We further propose that high levels of asymmetry can be indicative of oxidation artefacts in the reported somatic variant calls of some samples. While most analysed cohorts showed low to moderate asymmetry, in one cohort (testicular germ cell tumour), approximately half of the reported G>T somatic mutations were likely to be oxidative damage artefacts, as indicated by the extent of asymmetry in mismatches and variants.
氧化损伤可在DNA复制时引入G>T突变。当这种损伤发生时,测序得到的DNA会表现出链不对称性,即序列比对会产生G>T错配,而在给定基因座的互补链上没有相应的C>A错配。链不对称性被用于在癌症测序项目的体细胞变异检测中识别潜在的测序假象。与之前的研究一致,我们发现这种不对称性的链特异性在靶向捕获区域中经常是共享的。然而,虽然一些外显子组测序数据集相对于正向参考链显示出一致的不对称性,但有些数据集令人惊讶地显示出相对于转录链的不对称性。尽管氧化是人为造成G>T突变的主要原因,但我们认为这种不对称性源于单链外显子组捕获探针的使用,因为我们在匹配的全基因组测序中没有发现一致的不对称性。我们进一步提出,高水平的不对称性可能表明某些样本报告的体细胞变异检测中存在氧化假象。虽然大多数分析的队列显示出低到中等程度的不对称性,但在一个队列(睾丸生殖细胞肿瘤)中,大约一半报告的G>T体细胞突变可能是氧化损伤假象,这由错配和变异中的不对称程度所表明。