• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用具有相关随机效应的障碍模型检测离散单细胞RNA测序数据中的差异表达基因。

Detection of differentially expressed genes in discrete single-cell RNA sequencing data using a hurdle model with correlated random effects.

作者信息

Sekula Michael, Gaskins Jeremy, Datta Susmita

机构信息

Department of Bioinformatics and Biostatistics, University of Louisville, Louisville, Kentucky.

Department of Biostatistics, University of Florida, Gainesville, Florida.

出版信息

Biometrics. 2019 Dec;75(4):1051-1062. doi: 10.1111/biom.13074. Epub 2019 Jun 17.

DOI:10.1111/biom.13074
PMID:31009065
Abstract

Single-cell RNA sequencing (scRNA-seq) technologies are revolutionary tools allowing researchers to examine gene expression at the level of a single cell. Traditionally, transcriptomic data have been analyzed from bulk samples, masking the heterogeneity now seen across individual cells. Even within the same cellular population, genes can be highly expressed in some cells but not expressed (or lowly expressed) in others. Therefore, the computational approaches used to analyze bulk RNA sequencing data are not appropriate for the analysis of scRNA-seq data. Here, we present a novel statistical model for high dimensional and zero-inflated scRNA-seq count data to identify differentially expressed (DE) genes across cell types. Correlated random effects are employed based on an initial clustering of cells to capture the cell-to-cell variability within treatment groups. Moreover, this model is flexible and can be easily adapted to an independent random effect structure if needed. We apply our proposed methodology to both simulated and real data and compare results to other popular methods designed for detecting DE genes. Due to the hurdle model's ability to detect differences in the proportion of cells expressed and the average expression level (among the expressed cells), our methods naturally identify some genes as DE that other methods do not, and we demonstrate with real data that these uniquely detected genes are associated with similar biological processes and functions.

摘要

单细胞RNA测序(scRNA-seq)技术是具有革命性的工具,使研究人员能够在单细胞水平上检测基因表达。传统上,转录组数据是从大量样本中分析的,这掩盖了现在在单个细胞中看到的异质性。即使在同一细胞群体中,基因在某些细胞中可能高度表达,但在其他细胞中不表达(或低表达)。因此,用于分析大量RNA测序数据的计算方法不适用于scRNA-seq数据的分析。在这里,我们提出了一种针对高维零膨胀scRNA-seq计数数据的新型统计模型,以识别跨细胞类型的差异表达(DE)基因。基于细胞的初始聚类采用相关随机效应,以捕获治疗组内细胞间的变异性。此外,该模型具有灵活性,如果需要,可以轻松适应独立的随机效应结构。我们将我们提出的方法应用于模拟数据和真实数据,并将结果与其他用于检测DE基因的流行方法进行比较。由于障碍模型能够检测表达细胞比例和平均表达水平(在表达细胞中)的差异,我们的方法自然会识别出一些其他方法未识别的DE基因,并且我们用真实数据证明这些独特检测到的基因与相似的生物学过程和功能相关。

相似文献

1
Detection of differentially expressed genes in discrete single-cell RNA sequencing data using a hurdle model with correlated random effects.使用具有相关随机效应的障碍模型检测离散单细胞RNA测序数据中的差异表达基因。
Biometrics. 2019 Dec;75(4):1051-1062. doi: 10.1111/biom.13074. Epub 2019 Jun 17.
2
Detection of high variability in gene expression from single-cell RNA-seq profiling.从单细胞RNA测序分析中检测基因表达的高变异性。
BMC Genomics. 2016 Aug 22;17 Suppl 7(Suppl 7):508. doi: 10.1186/s12864-016-2897-6.
3
A Comprehensive Survey of Statistical Approaches for Differential Expression Analysis in Single-Cell RNA Sequencing Studies.单细胞 RNA 测序研究中差异表达分析的统计方法综合综述。
Genes (Basel). 2021 Dec 2;12(12):1947. doi: 10.3390/genes12121947.
4
A Markov random field model-based approach for differentially expressed gene detection from single-cell RNA-seq data.基于马尔可夫随机场模型的方法用于从单细胞 RNA-seq 数据中检测差异表达基因。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac166.
5
A comprehensive assessment of hurdle and zero-inflated models for single cell RNA-sequencing analysis.单细胞 RNA 测序分析中障碍和零膨胀模型的综合评估。
Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad272.
6
DIMM-SC: a Dirichlet mixture model for clustering droplet-based single cell transcriptomic data.DIMM-SC:一种基于 Dirichlet 混合模型的用于聚类基于液滴的单细胞转录组学数据的方法。
Bioinformatics. 2018 Jan 1;34(1):139-146. doi: 10.1093/bioinformatics/btx490.
7
Single-Cell RNA Sequencing Analysis: A Step-by-Step Overview.单细胞 RNA 测序分析:分步概述。
Methods Mol Biol. 2021;2284:343-365. doi: 10.1007/978-1-0716-1307-8_19.
8
Bias, robustness and scalability in single-cell differential expression analysis.单细胞差异表达分析中的偏差、稳健性和可扩展性。
Nat Methods. 2018 Apr;15(4):255-261. doi: 10.1038/nmeth.4612. Epub 2018 Feb 26.
9
scMEB: a fast and clustering-independent method for detecting differentially expressed genes in single-cell RNA-seq data.scMEB:一种在单细胞 RNA-seq 数据中检测差异表达基因的快速且与聚类无关的方法。
BMC Genomics. 2023 May 25;24(1):280. doi: 10.1186/s12864-023-09374-6.
10
Random forest based similarity learning for single cell RNA sequencing data.基于随机森林的单细胞 RNA 测序数据相似性学习。
Bioinformatics. 2018 Jul 1;34(13):i79-i88. doi: 10.1093/bioinformatics/bty260.

引用本文的文献

1
MaAsLin 3: Refining and extending generalized multivariable linear models for meta-omic association discovery.MaAsLin 3:改进和扩展用于宏基因组关联发现的广义多变量线性模型。
bioRxiv. 2024 Dec 14:2024.12.13.628459. doi: 10.1101/2024.12.13.628459.
2
scMultiGAN: cell-specific imputation for single-cell transcriptomes with multiple deep generative adversarial networks.scMultiGAN:使用多个深度生成对抗网络进行单细胞转录组的细胞特异性插补。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad384.
3
Inferring Cell-Cell Communications from Spatially Resolved Transcriptomics Data Using a Bayesian Tweedie Model.
基于贝叶斯 Tweedie 模型从空间分辨转录组学数据推断细胞间通讯。
Genes (Basel). 2023 Jun 28;14(7):1368. doi: 10.3390/genes14071368.
4
SAREV: A review on statistical analytics of single-cell RNA sequencing data.SAREV:单细胞RNA测序数据的统计分析综述
Wiley Interdiscip Rev Comput Stat. 2022 Jul-Aug;14(4). doi: 10.1002/wics.1558. Epub 2021 May 20.
5
Differential Expression Analysis of Single-Cell RNA-Seq Data: Current Statistical Approaches and Outstanding Challenges.单细胞RNA测序数据的差异表达分析:当前的统计方法与突出挑战
Entropy (Basel). 2022 Jul 18;24(7):995. doi: 10.3390/e24070995.
6
Differential expression of single-cell RNA-seq data using Tweedie models.基于 Tweedie 模型的单细胞 RNA-seq 数据差异表达分析。
Stat Med. 2022 Aug 15;41(18):3492-3510. doi: 10.1002/sim.9430. Epub 2022 Jun 2.
7
Single-Cell Differential Network Analysis with Sparse Bayesian Factor Models.基于稀疏贝叶斯因子模型的单细胞差异网络分析
Front Genet. 2022 Feb 4;12:810816. doi: 10.3389/fgene.2021.810816. eCollection 2021.
8
A Comprehensive Survey of Statistical Approaches for Differential Expression Analysis in Single-Cell RNA Sequencing Studies.单细胞 RNA 测序研究中差异表达分析的统计方法综合综述。
Genes (Basel). 2021 Dec 2;12(12):1947. doi: 10.3390/genes12121947.
9
Bayesian correlation is a robust gene similarity measure for single-cell RNA-seq data.贝叶斯相关性是一种用于单细胞RNA测序数据的稳健基因相似性度量。
NAR Genom Bioinform. 2020 Jan 24;2(1):lqaa002. doi: 10.1093/nargab/lqaa002. eCollection 2020 Mar.
10
Alternative splicing: Human disease and quantitative analysis from high-throughput sequencing.可变剪接:人类疾病与高通量测序的定量分析
Comput Struct Biotechnol J. 2020 Dec 24;19:183-195. doi: 10.1016/j.csbj.2020.12.009. eCollection 2021.