Department of Oncology, The First Affiliated Hospital of USTC, School of Basic Medical Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, China.
Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, China.
Nat Commun. 2024 Oct 25;15(1):9208. doi: 10.1038/s41467-024-53496-8.
Extrachromosomal circular DNA (eccDNA) is crucial in oncogene amplification, gene transcription regulation, and intratumor heterogeneity. While various analysis pipelines and experimental methods have been developed for eccDNA identification, their detection efficiencies have not been systematically assessed. To address this, we evaluate the performance of 7 analysis pipelines using seven simulated datasets, in terms of accuracy, identity, duplication rate, and computational resource consumption. We also compare the eccDNA detection efficiency of 7 experimental methods through twenty-one real sequencing datasets. Here, we show that Circle-Map and Circle_finder (bwa-mem-samblaster) outperform the other short-read pipelines. However, Circle_finder (bwa-mem-samblaster) exhibits notable redundancy in its outcomes. CReSIL is the most effective pipeline for eccDNA detection in long-read sequencing data at depths higher than 10X. Moreover, long-read sequencing-based Circle-Seq shows superior efficiency in detecting copy number-amplified eccDNA over 10 kb in length. These results offer valuable insights for researchers in choosing the suitable methods for eccDNA research.
染色体外环状 DNA(eccDNA)在癌基因扩增、基因转录调控和肿瘤内异质性中起着关键作用。虽然已经开发了各种用于 eccDNA 鉴定的分析管道和实验方法,但它们的检测效率尚未得到系统评估。为了解决这个问题,我们使用七种模拟数据集评估了七种分析管道在准确性、同一性、复制率和计算资源消耗方面的性能。我们还通过二十一个真实测序数据集比较了七种实验方法的 eccDNA 检测效率。在这里,我们表明 Circle-Map 和 Circle_finder(bwa-mem-samblaster)优于其他短读长分析管道。然而,Circle_finder(bwa-mem-samblaster)在其结果中存在显著的冗余。在深度大于 10X 的长读长测序数据中,CReSIL 是 eccDNA 检测最有效的管道。此外,基于长读长测序的 Circle-Seq 在检测长度超过 10kb 的拷贝数扩增的 eccDNA 方面具有更高的效率。这些结果为研究人员选择适合 eccDNA 研究的方法提供了有价值的见解。