Liu Yang, Peng Fan, Wang Siyuan, Jiao Huanmin, Dang Miao, Zhou Kaixiang, Guo Wenjie, Guo Shanshan, Zhang Huanqin, Song Wenjie, Xing Jinliang
State Key Laboratory of Holistic Integrative Management of Gastrointestinal Cancers and Department of Physiology and Pathophysiology, Fourth Military Medical University, Xi'an, China.
Department of Clinical Diagnosis, Tangdu Hospital, Fourth Military Medical University, Xi'an, China.
EMBO Mol Med. 2024 Dec;16(12):3169-3183. doi: 10.1038/s44321-024-00163-6. Epub 2024 Oct 30.
Fragmentomic features of circulating cell free mitochondrial DNA (ccf-mtDNA) including fragmentation profile, 5' end base preference and motif diversity are poorly understood. Here, we generated ccf-mtDNA sequencing data of 1607 plasma samples using capture-based next generation sequencing. We firstly found that fragmentomic features of ccf-mtDNA were remarkably different from those of circulating cell free nuclear DNA. Furthermore, region-specific fragmentomic features of ccf-mtDNA were observed, which was associated with protein binding, base composition and special structure of mitochondrial DNA. When comparing to non-cancer controls, six types of cancer patients exhibited aberrant fragmentomic features. Then, cancer detection models were built based on the fragmentomic features. Both internal and external validation cohorts demonstrated the excellent capacity of our model in distinguishing cancer patients from non-cancer control, with all area under curve higher than 0.9322. The overall accuracy of tissue-of-origin was 89.24% and 87.92% for six cancer types in two validation cohort, respectively. Altogether, our study comprehensively describes cancer-specific fragmentomic features of ccf-mtDNA and provides a proof-of-principle for the ccf-mtDNA fragmentomics-based multi-cancer detection and tissue-of-origin classification.
循环游离线粒体DNA(ccf-mtDNA)的片段组学特征,包括片段化谱、5'端碱基偏好和基序多样性,目前了解甚少。在此,我们使用基于捕获的下一代测序技术生成了1607份血浆样本的ccf-mtDNA测序数据。我们首先发现,ccf-mtDNA的片段组学特征与循环游离核DNA的显著不同。此外,还观察到ccf-mtDNA的区域特异性片段组学特征,这与线粒体DNA的蛋白质结合、碱基组成和特殊结构有关。与非癌症对照相比,六种癌症患者表现出异常的片段组学特征。然后,基于片段组学特征建立了癌症检测模型。内部和外部验证队列均证明我们的模型在区分癌症患者和非癌症对照方面具有出色的能力,所有曲线下面积均高于0.9322。在两个验证队列中,六种癌症类型的组织起源总体准确率分别为89.24%和87.92%。总之,我们的研究全面描述了ccf-mtDNA的癌症特异性片段组学特征,并为基于ccf-mtDNA片段组学的多癌症检测和组织起源分类提供了原理证明。