Suppr超能文献

基于跨细胞类型差异分析的复杂样本无参考去卷积:使用各种特征选择选项的系统评估

Reference-free deconvolution of complex samples based on cross-cell-type differential analysis: Systematic evaluations with various feature selection options.

作者信息

Zhang Weiwei, Tian Zhonghe, Peng Ling

机构信息

School of Mathematics Information, Shaoxing University, Shaoxing, China.

出版信息

Front Genet. 2025 May 30;16:1570781. doi: 10.3389/fgene.2025.1570781. eCollection 2025.

Abstract

INTRODUCTION

Genomic and epigenomic data from complex samples reflect the average level of multiple cell types. However, differences in cell compositions can introduce bias into many relevant analyses. Consequently, the accurate estimation of cell compositions has been regarded as an important initial step in the analysis of complex samples. A large number of computational methods have been developed for estimating cell compositions; however, their applications are limited due to the absence of reference or prior information. As a result, reference-free deconvolution has the potential to be widely applied due to its flexibility. A previous study emphasized the importance of feature selection for improving estimation accuracy in reference-free deconvolution.

METHODS

In this paper, we systematically evaluated five feature selection options and developed an optimal feature-selection-based reference-free deconvolution method. Our proposal iteratively searches for cell-type-specific (CTS) features by integrating cross-cell-type differential analysis between one cell type and the other cell types, as well as between two cell types and the other cell types, and performs composition estimation.

RESULTS AND DISCUSSION

Comprehensive simulation studies and analyses of seven real datasets show the excellent performance of the proposed method. The proposed method, that is, reference-free deconvolution based on cross-cell-type differential (RFdecd), is implemented as an R package at https://github.com/wwzhang-study/RFdecd.

摘要

引言

来自复杂样本的基因组和表观基因组数据反映了多种细胞类型的平均水平。然而,细胞组成的差异会给许多相关分析带来偏差。因此,准确估计细胞组成被视为复杂样本分析中的重要初始步骤。已经开发了大量用于估计细胞组成的计算方法;然而,由于缺乏参考或先验信息,它们的应用受到限制。因此,无参考反卷积因其灵活性而具有广泛应用的潜力。先前的一项研究强调了特征选择对提高无参考反卷积估计准确性的重要性。

方法

在本文中,我们系统地评估了五种特征选择选项,并开发了一种基于最优特征选择的无参考反卷积方法。我们的方法通过整合一种细胞类型与其他细胞类型之间以及两种细胞类型与其他细胞类型之间的跨细胞类型差异分析,迭代地搜索细胞类型特异性(CTS)特征,并进行组成估计。

结果与讨论

对七个真实数据集的综合模拟研究和分析表明了所提出方法的优异性能。所提出的方法,即基于跨细胞类型差异的无参考反卷积(RFdecd),作为一个R包在https://github.com/wwzhang-study/RFdecd上实现。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/605c/12162504/ce6570460a66/fgene-16-1570781-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验