Program of Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06511, USA.
Department of Biostatistics, School of Public Health, Yale University, New Haven, CT, 06511, USA.
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac166.
The development of single-cell RNA-sequencing (scRNA-seq) technologies has offered insights into complex biological systems at the single-cell resolution. In particular, these techniques facilitate the identifications of genes showing cell-type-specific differential expressions (DE). In this paper, we introduce MARBLES, a novel statistical model for cross-condition DE gene detection from scRNA-seq data. MARBLES employs a Markov Random Field model to borrow information across similar cell types and utilizes cell-type-specific pseudobulk count to account for sample-level variability. Our simulation results showed that MARBLES is more powerful than existing methods to detect DE genes with an appropriate control of false positive rate. Applications of MARBLES to real data identified novel disease-related DE genes and biological pathways from both a single-cell lipopolysaccharide mouse dataset with 24 381 cells and 11 076 genes and a Parkinson's disease human data set with 76 212 cells and 15 891 genes. Overall, MARBLES is a powerful tool to identify cell-type-specific DE genes across conditions from scRNA-seq data.
单细胞 RNA 测序 (scRNA-seq) 技术的发展为单细胞分辨率的复杂生物系统提供了深入了解。特别是,这些技术有助于识别表现出细胞类型特异性差异表达 (DE) 的基因。在本文中,我们介绍了 MARBLES,这是一种用于从 scRNA-seq 数据中检测跨条件 DE 基因的新统计模型。MARBLES 采用马尔可夫随机场模型在相似的细胞类型之间借用信息,并利用细胞类型特异性的伪总体计数来解释样本水平的变异性。我们的模拟结果表明,MARBLES 比现有方法更强大,能够以适当的假阳性率控制来检测 DE 基因。MARBLES 在真实数据上的应用从一个包含 24381 个细胞和 11076 个基因的单细胞脂多糖小鼠数据集和一个包含 76212 个细胞和 15891 个基因的帕金森病人类数据集,鉴定出了与疾病相关的新的 DE 基因和生物学途径。总体而言,MARBLES 是一种从 scRNA-seq 数据中识别跨条件细胞类型特异性 DE 基因的强大工具。