Zhang Weiwei, Tian Zhonghe, Peng Ling
School of Mathematics Information, Shaoxing University, Shaoxing, China.
Front Genet. 2025 May 30;16:1570781. doi: 10.3389/fgene.2025.1570781. eCollection 2025.
Genomic and epigenomic data from complex samples reflect the average level of multiple cell types. However, differences in cell compositions can introduce bias into many relevant analyses. Consequently, the accurate estimation of cell compositions has been regarded as an important initial step in the analysis of complex samples. A large number of computational methods have been developed for estimating cell compositions; however, their applications are limited due to the absence of reference or prior information. As a result, reference-free deconvolution has the potential to be widely applied due to its flexibility. A previous study emphasized the importance of feature selection for improving estimation accuracy in reference-free deconvolution.
In this paper, we systematically evaluated five feature selection options and developed an optimal feature-selection-based reference-free deconvolution method. Our proposal iteratively searches for cell-type-specific (CTS) features by integrating cross-cell-type differential analysis between one cell type and the other cell types, as well as between two cell types and the other cell types, and performs composition estimation.
Comprehensive simulation studies and analyses of seven real datasets show the excellent performance of the proposed method. The proposed method, that is, reference-free deconvolution based on cross-cell-type differential (RFdecd), is implemented as an R package at https://github.com/wwzhang-study/RFdecd.
来自复杂样本的基因组和表观基因组数据反映了多种细胞类型的平均水平。然而,细胞组成的差异会给许多相关分析带来偏差。因此,准确估计细胞组成被视为复杂样本分析中的重要初始步骤。已经开发了大量用于估计细胞组成的计算方法;然而,由于缺乏参考或先验信息,它们的应用受到限制。因此,无参考反卷积因其灵活性而具有广泛应用的潜力。先前的一项研究强调了特征选择对提高无参考反卷积估计准确性的重要性。
在本文中,我们系统地评估了五种特征选择选项,并开发了一种基于最优特征选择的无参考反卷积方法。我们的方法通过整合一种细胞类型与其他细胞类型之间以及两种细胞类型与其他细胞类型之间的跨细胞类型差异分析,迭代地搜索细胞类型特异性(CTS)特征,并进行组成估计。
对七个真实数据集的综合模拟研究和分析表明了所提出方法的优异性能。所提出的方法,即基于跨细胞类型差异的无参考反卷积(RFdecd),作为一个R包在https://github.com/wwzhang-study/RFdecd上实现。