• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

贝叶斯单细胞 RNA 测序数据图形模型中技术噪声的解释。

Accounting for technical noise in Bayesian graphical models of single-cell RNA-sequencing data.

机构信息

Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvannia, 423 Guardian Drive, Philadelphia, PA 19104, USA.

出版信息

Biostatistics. 2022 Dec 12;24(1):161-176. doi: 10.1093/biostatistics/kxab011.

DOI:10.1093/biostatistics/kxab011
PMID:34520533
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9748577/
Abstract

Single-cell RNA-sequencing (scRNAseq) data contain a high level of noise, especially in the form of zero-inflation, that is, the presence of an excessively large number of zeros. This is largely due to dropout events and amplification biases that occur in the preparation stage of single-cell experiments. Recent scRNAseq experiments have been augmented with unique molecular identifiers (UMI) and External RNA Control Consortium (ERCC) molecules which can be used to account for zero-inflation. However, most of the current methods on graphical models are developed under the assumption of the multivariate Gaussian distribution or its variants, and thus they are not able to adequately account for an excessively large number of zeros in scRNAseq data. In this article, we propose a single-cell latent graphical model (scLGM)-a Bayesian hierarchical model for estimating the conditional dependency network among genes using scRNAseq data. Taking advantage of UMI and ERCC data, scLGM explicitly models the two sources of zero-inflation. Our simulation study and real data analysis demonstrate that the proposed approach outperforms several existing methods.

摘要

单细胞 RNA 测序 (scRNAseq) 数据包含高水平的噪声,特别是零膨胀的形式,即存在大量的零值。这主要是由于单细胞实验准备阶段的丢包事件和扩增偏差引起的。最近的 scRNAseq 实验已经添加了独特分子标识符 (UMI) 和外部 RNA 对照协会 (ERCC) 分子,这些分子可用于解释零膨胀。然而,目前图形模型上的大多数方法都是在假设多元高斯分布或其变体的情况下开发的,因此它们不能充分考虑 scRNAseq 数据中大量的零值。在本文中,我们提出了一种单细胞潜在图形模型 (scLGM)——一种使用 scRNAseq 数据估计基因间条件依赖网络的贝叶斯层次模型。利用 UMI 和 ERCC 数据,scLGM 明确地对两种零膨胀源进行建模。我们的模拟研究和实际数据分析表明,所提出的方法优于几种现有的方法。

相似文献

1
Accounting for technical noise in Bayesian graphical models of single-cell RNA-sequencing data.贝叶斯单细胞 RNA 测序数据图形模型中技术噪声的解释。
Biostatistics. 2022 Dec 12;24(1):161-176. doi: 10.1093/biostatistics/kxab011.
2
Bayesian model selection reveals biological origins of zero inflation in single-cell transcriptomics.贝叶斯模型选择揭示了单细胞转录组学中零膨胀的生物学起源。
Genome Biol. 2020 Jul 27;21(1):183. doi: 10.1186/s13059-020-02103-2.
3
Accounting for technical noise in differential expression analysis of single-cell RNA sequencing data.单细胞RNA测序数据差异表达分析中的技术噪声处理
Nucleic Acids Res. 2017 Nov 2;45(19):10978-10988. doi: 10.1093/nar/gkx754.
4
BADGE: a novel Bayesian model for accurate abundance quantification and differential analysis of RNA-Seq data.标记:一种用于 RNA-Seq 数据精确丰度定量和差异分析的新型贝叶斯模型。
BMC Bioinformatics. 2014;15 Suppl 9(Suppl 9):S6. doi: 10.1186/1471-2105-15-S9-S6. Epub 2014 Sep 10.
5
Bayesian gamma-negative binomial modeling of single-cell RNA sequencing data.贝叶斯伽马负二项式模型在单细胞 RNA 测序数据分析中的应用。
BMC Genomics. 2020 Sep 9;21(Suppl 9):585. doi: 10.1186/s12864-020-06938-8.
6
Network analysis for count data with excess zeros.针对具有过多零值的计数数据的网络分析。
BMC Genet. 2017 Nov 6;18(1):93. doi: 10.1186/s12863-017-0561-z.
7
Disease mapping of zero-excessive mesothelioma data in Flanders.比利时弗拉芒地区零超额间皮瘤数据的疾病地图绘制。
Ann Epidemiol. 2017 Jan;27(1):59-66.e3. doi: 10.1016/j.annepidem.2016.10.006. Epub 2016 Nov 1.
8
Measurement error models with zero inflation and multiple sources of zeros, with applications to hard zeros.具有零膨胀和多个零源的测量误差模型,及其在硬零中的应用。
Lifetime Data Anal. 2024 Jul;30(3):600-623. doi: 10.1007/s10985-024-09627-w. Epub 2024 May 28.
9
M3Drop: dropout-based feature selection for scRNASeq.M3Drop:用于单细胞RNA测序的基于缺失值的特征选择
Bioinformatics. 2019 Aug 15;35(16):2865-2867. doi: 10.1093/bioinformatics/bty1044.
10
Detecting differential alternative splicing events in scRNA-seq with or without Unique Molecular Identifiers.在有或没有独特分子标识符的 scRNA-seq 中检测差异剪接事件。
PLoS Comput Biol. 2020 Jun 5;16(6):e1007925. doi: 10.1371/journal.pcbi.1007925. eCollection 2020 Jun.

引用本文的文献

1
Current Applications of Single-Cell RNA Sequencing in Glioblastoma: A Scoping Review.单细胞RNA测序在胶质母细胞瘤中的当前应用:一项范围综述
Brain Sci. 2025 Mar 14;15(3):309. doi: 10.3390/brainsci15030309.

本文引用的文献

1
GRAPHICAL MODELS FOR ZERO-INFLATED SINGLE CELL GENE EXPRESSION.零膨胀单细胞基因表达的图形模型
Ann Appl Stat. 2019 Jun;13(2):848-873. doi: 10.1214/18-AOAS1213. Epub 2019 Jun 17.
2
DrImpute: imputing dropout events in single cell RNA sequencing data.DrImpute:在单细胞 RNA 测序数据中推断缺失事件。
BMC Bioinformatics. 2018 Jun 8;19(1):220. doi: 10.1186/s12859-018-2226-y.
3
Missing data and technical variability in single-cell RNA-sequencing experiments.单细胞 RNA 测序实验中的数据缺失和技术变异性。
Biostatistics. 2018 Oct 1;19(4):562-578. doi: 10.1093/biostatistics/kxx053.
4
Accounting for technical noise in differential expression analysis of single-cell RNA sequencing data.单细胞RNA测序数据差异表达分析中的技术噪声处理
Nucleic Acids Res. 2017 Nov 2;45(19):10978-10988. doi: 10.1093/nar/gkx754.
5
UMI-tools: modeling sequencing errors in Unique Molecular Identifiers to improve quantification accuracy.UMI-tools:对独特分子标识符中的测序错误进行建模以提高定量准确性。
Genome Res. 2017 Mar;27(3):491-499. doi: 10.1101/gr.209601.116. Epub 2017 Jan 18.
6
KEGG: new perspectives on genomes, pathways, diseases and drugs.京都基因与基因组百科全书(KEGG):关于基因组、通路、疾病和药物的新视角。
Nucleic Acids Res. 2017 Jan 4;45(D1):D353-D361. doi: 10.1093/nar/gkw1092. Epub 2016 Nov 28.
7
Design and computational analysis of single-cell RNA-sequencing experiments.单细胞RNA测序实验的设计与计算分析
Genome Biol. 2016 Apr 7;17:63. doi: 10.1186/s13059-016-0927-y.
8
CONDITIONAL DISTANCE CORRELATION.条件距离相关性
J Am Stat Assoc. 2015;110(512):1726-1734. doi: 10.1080/01621459.2014.993081. Epub 2015 Jan 23.
9
Gene regulation network inference with joint sparse Gaussian graphical models.基于联合稀疏高斯图形模型的基因调控网络推断
J Comput Graph Stat. 2015 Oct 1;24(4):954-974. doi: 10.1080/10618600.2014.956876. Epub 2014 Sep 17.
10
The huge Package for High-dimensional Undirected Graph Estimation in R.R语言中用于高维无向图估计的庞大软件包。
J Mach Learn Res. 2012 Apr;13:1059-1062.