• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

iDESC:基于多个样本的单细胞 RNA 测序数据差异表达识别。

iDESC: identifying differential expression in single-cell RNA sequencing data with multiple subjects.

机构信息

Department of Biostatistics, Yale School of Public Health, New Haven, CT, 06520, USA.

Section of Pulmonary, Critical Care and Sleep Medicine, Yale School of Medicine, New Haven, CT, 06520, USA.

出版信息

BMC Bioinformatics. 2023 Aug 22;24(1):318. doi: 10.1186/s12859-023-05432-8.

DOI:10.1186/s12859-023-05432-8
PMID:37608264
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10463720/
Abstract

BACKGROUND

Single-cell RNA sequencing (scRNA-seq) technology has enabled assessment of transcriptome-wide changes at single-cell resolution. Due to the heterogeneity in environmental exposure and genetic background across subjects, subject effect contributes to the major source of variation in scRNA-seq data with multiple subjects, which severely confounds cell type specific differential expression (DE) analysis. Moreover, dropout events are prevalent in scRNA-seq data, leading to excessive number of zeroes in the data, which further aggravates the challenge in DE analysis.

RESULTS

We developed iDESC to detect cell type specific DE genes between two groups of subjects in scRNA-seq data. iDESC uses a zero-inflated negative binomial mixed model to consider both subject effect and dropouts. The prevalence of dropout events (dropout rate) was demonstrated to be dependent on gene expression level, which is modeled by pooling information across genes. Subject effect is modeled as a random effect in the log-mean of the negative binomial component. We evaluated and compared the performance of iDESC with eleven existing DE analysis methods. Using simulated data, we demonstrated that iDESC had well-controlled type I error and higher power compared to the existing methods. Applications of those methods with well-controlled type I error to three real scRNA-seq datasets from the same tissue and disease showed that the results of iDESC achieved the best consistency between datasets and the best disease relevance.

CONCLUSIONS

iDESC was able to achieve more accurate and robust DE analysis results by separating subject effect from disease effect with consideration of dropouts to identify DE genes, suggesting the importance of considering subject effect and dropouts in the DE analysis of scRNA-seq data with multiple subjects.

摘要

背景

单细胞 RNA 测序(scRNA-seq)技术使人们能够在单细胞分辨率下评估转录组的变化。由于个体之间环境暴露和遗传背景的异质性,个体效应是多个体 scRNA-seq 数据中主要的变异来源之一,严重干扰了细胞类型特异性差异表达(DE)分析。此外,scRNA-seq 数据中普遍存在缺失事件,导致数据中出现大量零值,进一步加剧了 DE 分析的挑战。

结果

我们开发了 iDESC 来检测 scRNA-seq 数据中两组个体之间的细胞类型特异性 DE 基因。iDESC 使用零膨胀负二项混合模型来考虑个体效应和缺失值。缺失事件的发生率(缺失率)被证明与基因表达水平有关,这是通过跨基因信息池化来建模的。个体效应在负二项成分的对数均值中被建模为随机效应。我们评估并比较了 iDESC 与 11 种现有的 DE 分析方法的性能。使用模拟数据,我们表明 iDESC 具有良好的控制型 I 错误和比现有方法更高的功效。将那些具有良好控制型 I 错误的方法应用于来自同一组织和疾病的三个真实 scRNA-seq 数据集,表明 iDESC 的结果在数据集之间具有最佳的一致性,并且与疾病的相关性最好。

结论

iDESC 通过考虑缺失值来分离个体效应和疾病效应,从而能够更准确和稳健地进行 DE 分析,以识别 DE 基因,这表明在多个体 scRNA-seq 数据的 DE 分析中考虑个体效应和缺失值的重要性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/844b/10463720/8eb2f9a3420d/12859_2023_5432_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/844b/10463720/dcb61098ac5c/12859_2023_5432_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/844b/10463720/86b722b962a8/12859_2023_5432_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/844b/10463720/27096682c240/12859_2023_5432_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/844b/10463720/0bd987ead1e0/12859_2023_5432_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/844b/10463720/5d7cedde2011/12859_2023_5432_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/844b/10463720/8eb2f9a3420d/12859_2023_5432_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/844b/10463720/dcb61098ac5c/12859_2023_5432_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/844b/10463720/86b722b962a8/12859_2023_5432_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/844b/10463720/27096682c240/12859_2023_5432_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/844b/10463720/0bd987ead1e0/12859_2023_5432_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/844b/10463720/5d7cedde2011/12859_2023_5432_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/844b/10463720/8eb2f9a3420d/12859_2023_5432_Fig6_HTML.jpg

相似文献

1
iDESC: identifying differential expression in single-cell RNA sequencing data with multiple subjects.iDESC:基于多个样本的单细胞 RNA 测序数据差异表达识别。
BMC Bioinformatics. 2023 Aug 22;24(1):318. doi: 10.1186/s12859-023-05432-8.
2
SwarnSeq: An improved statistical approach for differential expression analysis of single-cell RNA-seq data.SwarnSeq:一种用于单细胞 RNA-seq 数据差异表达分析的改进统计方法。
Genomics. 2021 May;113(3):1308-1324. doi: 10.1016/j.ygeno.2021.02.014. Epub 2021 Mar 1.
3
Observation weights unlock bulk RNA-seq tools for zero inflation and single-cell applications.观测权重为零膨胀和单细胞应用解锁了批量 RNA-seq 工具。
Genome Biol. 2018 Feb 26;19(1):24. doi: 10.1186/s13059-018-1406-4.
4
Bayesian gamma-negative binomial modeling of single-cell RNA sequencing data.贝叶斯伽马负二项式模型在单细胞 RNA 测序数据分析中的应用。
BMC Genomics. 2020 Sep 9;21(Suppl 9):585. doi: 10.1186/s12864-020-06938-8.
5
Recommendations of scRNA-seq Differential Gene Expression Analysis Based on Comprehensive Benchmarking.基于全面基准测试的单细胞RNA测序差异基因表达分析建议
Life (Basel). 2022 Jun 7;12(6):850. doi: 10.3390/life12060850.
6
Detection of high variability in gene expression from single-cell RNA-seq profiling.从单细胞RNA测序分析中检测基因表达的高变异性。
BMC Genomics. 2016 Aug 22;17 Suppl 7(Suppl 7):508. doi: 10.1186/s12864-016-2897-6.
7
CDSImpute: An ensemble similarity imputation method for single-cell RNA sequence dropouts.CDSImpute:一种用于单细胞 RNA 序列缺失的集成相似性插补方法。
Comput Biol Med. 2022 Jul;146:105658. doi: 10.1016/j.compbiomed.2022.105658. Epub 2022 May 21.
8
A comprehensive assessment of hurdle and zero-inflated models for single cell RNA-sequencing analysis.单细胞 RNA 测序分析中障碍和零膨胀模型的综合评估。
Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad272.
9
DEsingle for detecting three types of differential expression in single-cell RNA-seq data.DEsingle 用于检测单细胞 RNA-seq 数据中的三种差异表达。
Bioinformatics. 2018 Sep 15;34(18):3223-3224. doi: 10.1093/bioinformatics/bty332.
10
Imputing single-cell RNA-seq data by considering cell heterogeneity and prior expression of dropouts.考虑细胞异质性和缺失值先前表达来推断单细胞 RNA-seq 数据。
J Mol Cell Biol. 2021 Apr 10;13(1):29-40. doi: 10.1093/jmcb/mjaa052.

引用本文的文献

1
Simulating paired and longitudinal single-cell RNA sequencing data with rescueSim.使用rescueSim模拟配对和纵向单细胞RNA测序数据。
Bioinformatics. 2025 Aug 2;41(8). doi: 10.1093/bioinformatics/btaf442.
2
DiSC: a statistical tool for fast differential expression analysis of individual-level single-cell RNA-seq data.DiSC:一种用于个体水平单细胞RNA测序数据快速差异表达分析的统计工具。
Bioinformatics. 2025 Jun 2;41(6). doi: 10.1093/bioinformatics/btaf327.
3
Sex-specific single cell-level transcriptomic signatures of Rett syndrome disease progression.

本文引用的文献

1
Confronting false discoveries in single-cell differential expression.单细胞差异表达中虚假发现的应对策略。
Nat Commun. 2021 Sep 28;12(1):5692. doi: 10.1038/s41467-021-25960-2.
2
Identification of Hub Genes and Pathways Associated With Idiopathic Pulmonary Fibrosis Bioinformatics Analysis.与特发性肺纤维化相关的枢纽基因和通路的鉴定:生物信息学分析
Front Mol Biosci. 2021 Aug 12;8:711239. doi: 10.3389/fmolb.2021.711239. eCollection 2021.
3
A practical solution to pseudoreplication bias in single-cell studies.单细胞研究中拟似重复偏倚的实用解决方案。
雷特综合征疾病进展的性别特异性单细胞水平转录组特征。
Commun Biol. 2024 Oct 10;7(1):1292. doi: 10.1038/s42003-024-06990-0.
4
Sex-specific single cell-level transcriptomic signatures of Rett syndrome disease progression.雷特综合征疾病进展的性别特异性单细胞水平转录组特征
bioRxiv. 2024 May 19:2024.05.16.594595. doi: 10.1101/2024.05.16.594595.
5
eSVD-DE: cohort-wide differential expression in single-cell RNA-seq data using exponential-family embeddings.eSVD-DE:使用指数族嵌入进行单细胞 RNA-seq 数据的全队列差异表达分析。
BMC Bioinformatics. 2024 Mar 15;25(1):113. doi: 10.1186/s12859-024-05724-7.
6
eSVD-DE: Cohort-wide differential expression in single-cell RNA-seq data using exponential-family embeddings.eSVD-DE:使用指数族嵌入的单细胞RNA测序数据中的全队列差异表达
bioRxiv. 2024 Mar 1:2023.11.22.568369. doi: 10.1101/2023.11.22.568369.
7
Consistent analysis of differentially expressed genes across 7 cell types in papillary thyroid carcinoma.甲状腺乳头状癌中7种细胞类型差异表达基因的一致性分析
Comput Struct Biotechnol J. 2023 Oct 27;21:5337-5349. doi: 10.1016/j.csbj.2023.10.045. eCollection 2023.
8
stVAE deconvolves cell-type composition in large-scale cellular resolution spatial transcriptomics.stVAE 可对大规模细胞分辨率空间转录组学中的细胞类型组成进行去卷积。
Bioinformatics. 2023 Oct 3;39(10). doi: 10.1093/bioinformatics/btad642.
9
Correction: iDESC: identifying differential expression in single-cell RNA sequencing data with multiple subjects.更正:iDESC:在具有多个样本的单细胞RNA测序数据中识别差异表达
BMC Bioinformatics. 2023 Oct 19;24(1):394. doi: 10.1186/s12859-023-05523-6.
Nat Commun. 2021 Feb 2;12(1):738. doi: 10.1038/s41467-021-21038-1.
4
muscat detects subpopulation-specific state transitions from multi-sample multi-condition single-cell transcriptomics data.Muscat 可从多样本多条件单细胞转录组学数据中检测到亚群特异性状态转变。
Nat Commun. 2020 Nov 30;11(1):6077. doi: 10.1038/s41467-020-19894-4.
5
Single-cell RNA-seq reveals ectopic and aberrant lung-resident cell populations in idiopathic pulmonary fibrosis.单细胞 RNA 测序揭示特发性肺纤维化中异位和异常的肺驻留细胞群体。
Sci Adv. 2020 Jul 8;6(28):eaba1983. doi: 10.1126/sciadv.aba1983. eCollection 2020 Jul.
6
Single-cell RNA sequencing reveals profibrotic roles of distinct epithelial and mesenchymal lineages in pulmonary fibrosis.单细胞 RNA 测序揭示了肺纤维化中不同上皮和间充质谱系的促纤维化作用。
Sci Adv. 2020 Jul 8;6(28):eaba1972. doi: 10.1126/sciadv.aba1972. eCollection 2020 Jul.
7
Bioinformatics analysis on differentially expressed genes of alveolar macrophage in IPF.特发性肺纤维化中肺泡巨噬细胞差异表达基因的生物信息学分析
Exp Lung Res. 2019 Nov-Dec;45(9-10):288-296. doi: 10.1080/01902148.2019.1680765. Epub 2019 Nov 24.
8
A systematic evaluation of single cell RNA-seq analysis pipelines.单细胞 RNA 测序分析流程的系统评价。
Nat Commun. 2019 Oct 11;10(1):4667. doi: 10.1038/s41467-019-12266-7.
9
DMSO cryopreservation is the method of choice to preserve cells for droplet-based single-cell RNA sequencing.DMSO 冷冻保存是基于液滴的单细胞 RNA 测序中保存细胞的首选方法。
Sci Rep. 2019 Jul 23;9(1):10699. doi: 10.1038/s41598-019-46932-z.
10
Proliferating SPP1/MERTK-expressing macrophages in idiopathic pulmonary fibrosis.特发性肺纤维化中增殖的 SPP1/MERTK 表达巨噬细胞。
Eur Respir J. 2019 Aug 22;54(2). doi: 10.1183/13993003.02441-2018. Print 2019 Aug.