Tüns Alicia Isabell, Hartmann Till, Magin Simon, González Rocío Chamorro, Henssen Anton George, Rahmann Sven, Schramm Alexander, Köster Johannes
Laboratory of Molecular Oncology, West German Cancer Center, Department of Medical Oncology, University Hospital Essen, Essen, Germany.
Algorithms for Reproducible Bioinformatics, Institute of Human Genetics, University Hospital Essen, University of Duisburg-Essen, Essen, Germany.
Front Genet. 2022 May 30;13:867018. doi: 10.3389/fgene.2022.867018. eCollection 2022.
Occurrence of extra-chromosomal circular DNA is a phenomenon frequently observed in tumor cells, and the presence of such DNA has been recognized as a marker of adverse outcome across cancer types. We here describe a computational workflow for identification of DNA circles from long-read sequencing data. The workflow is implemented based on the Snakemake workflow management system. Its key step uses a graph-theoretic approach to identify putative circular fragments validated on simulated reads. We then demonstrate robustness of our approach using nanopore sequencing of selectively enriched circular DNA by highly sensitive and specific recovery of plasmids and the mitochondrial genome, which is the only circular DNA in normal human cells. Finally, we show that the workflow facilitates detection of larger circular DNA fragments containing extrachromosomal copies of the MYCN oncogene and the respective breakpoints, which is a potentially useful application in disease monitoring of several cancer types.
染色体外环状DNA的出现是肿瘤细胞中经常观察到的一种现象,并且这种DNA的存在已被公认为是所有癌症类型不良预后的一个标志物。我们在此描述一种用于从长读长测序数据中鉴定DNA环的计算工作流程。该工作流程基于Snakemake工作流程管理系统实现。其关键步骤使用一种图论方法来鉴定在模拟读长上验证的假定环状片段。然后,我们通过对质粒和线粒体基因组(正常人类细胞中唯一的环状DNA)进行高灵敏度和特异性回收,对选择性富集的环状DNA进行纳米孔测序,证明了我们方法的稳健性。最后,我们表明该工作流程有助于检测包含MYCN癌基因的染色体外拷贝和各自断点的更大环状DNA片段,这在几种癌症类型的疾病监测中是一个潜在有用的应用。