Keukeleire Pia, Makrodimitris Stavros, Reinders Marcel
Delft Bioinformatics Lab, Delft University of Technology, Delft, The Netherlands.
Translational Cancer Genomics group, Erasmus Medical Center, Rotterdam, The Netherlands.
NAR Genom Bioinform. 2023 Jun 2;5(2):lqad048. doi: 10.1093/nargab/lqad048. eCollection 2023 Jun.
Cell-free DNA (cfDNA) are DNA fragments originating from dying cells that are detectable in bodily fluids, such as the plasma. Accelerated cell death, for example caused by disease, induces an elevated concentration of cfDNA. As a result, determining the cell type origins of cfDNA molecules can provide information about an individual's health. In this work, we aim to increase the sensitivity of methylation-based cell type deconvolution by adapting an existing method, CelFiE, which uses the methylation beta values of individual CpG sites to estimate cell type proportions. Our new method, CelFEER, instead differentiates cell types by the average methylation values within individual reads. We additionally improved the originally reported performance of CelFiE by using a new approach for finding marker regions that are differentially methylated between cell types. We show that CelFEER estimates cell type proportions with a higher correlation ( = 0.94 ± 0.04) than CelFiE ( = 0.86 ± 0.09) on simulated mixtures of cell types. Moreover, we show that the cell type proportion estimated by CelFEER can differentiate between ALS patients and healthy controls, between pregnant women in their first and third trimester, and between pregnant women with and without gestational diabetes.
游离DNA(cfDNA)是源自死亡细胞的DNA片段,可在体液(如血浆)中检测到。加速的细胞死亡,例如由疾病引起的,会导致cfDNA浓度升高。因此,确定cfDNA分子的细胞类型来源可以提供有关个体健康的信息。在这项工作中,我们旨在通过改进现有的CelFiE方法来提高基于甲基化的细胞类型反卷积的灵敏度,该方法使用单个CpG位点的甲基化β值来估计细胞类型比例。我们的新方法CelFEER则通过单个读数内的平均甲基化值来区分细胞类型。我们还通过使用一种新方法来寻找细胞类型之间甲基化差异的标记区域,改进了最初报道的CelFiE的性能。我们表明,在细胞类型的模拟混合物上,CelFEER估计细胞类型比例的相关性(= 0.94 ± 0.04)高于CelFiE(= 0.86 ± 0.09)。此外,我们表明,CelFEER估计的细胞类型比例可以区分肌萎缩侧索硬化症患者和健康对照、妊娠早期和晚期的孕妇,以及患有和未患有妊娠期糖尿病的孕妇。