• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

贝叶斯模型选择揭示了单细胞转录组学中零膨胀的生物学起源。

Bayesian model selection reveals biological origins of zero inflation in single-cell transcriptomics.

机构信息

The Jackson Laboratory, 600 Main Street, Bar Harbor, 04609, ME, USA.

University of Michigan, 500 South State Street, Ann Arbor, 48109, MI, USA.

出版信息

Genome Biol. 2020 Jul 27;21(1):183. doi: 10.1186/s13059-020-02103-2.

DOI:10.1186/s13059-020-02103-2
PMID:32718323
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7384222/
Abstract

BACKGROUND

Single-cell RNA sequencing is a powerful tool for characterizing cellular heterogeneity in gene expression. However, high variability and a large number of zero counts present challenges for analysis and interpretation. There is substantial controversy over the origins and proper treatment of zeros and no consensus on whether zero-inflated count distributions are necessary or even useful. While some studies assume the existence of zero inflation due to technical artifacts and attempt to impute the missing information, other recent studies argue that there is no zero inflation in scRNA-seq data.

RESULTS

We apply a Bayesian model selection approach to unambiguously demonstrate zero inflation in multiple biologically realistic scRNA-seq datasets. We show that the primary causes of zero inflation are not technical but rather biological in nature. We also demonstrate that parameter estimates from the zero-inflated negative binomial distribution are an unreliable indicator of zero inflation.

CONCLUSIONS

Despite the existence of zero inflation in scRNA-seq counts, we recommend the generalized linear model with negative binomial count distribution, not zero-inflated, as a suitable reference model for scRNA-seq analysis.

摘要

背景

单细胞 RNA 测序是一种强大的工具,可用于描述基因表达中的细胞异质性。然而,高变异性和大量的零计数给分析和解释带来了挑战。关于零的起源和正确处理存在很大争议,对于是否需要零膨胀计数分布,甚至是否有用,也没有共识。虽然有些研究由于技术因素假设存在零膨胀,并尝试推断缺失信息,但其他最近的研究则认为 scRNA-seq 数据中没有零膨胀。

结果

我们应用贝叶斯模型选择方法明确证明了多个生物学上合理的 scRNA-seq 数据集存在零膨胀。我们表明,零膨胀的主要原因不是技术上的,而是生物学上的。我们还表明,零膨胀负二项分布的参数估计是零膨胀的不可靠指标。

结论

尽管 scRNA-seq 计数中存在零膨胀,但我们建议使用具有负二项计数分布的广义线性模型,而不是零膨胀,作为 scRNA-seq 分析的合适参考模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc72/7384222/f21e5224c206/13059_2020_2103_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc72/7384222/ea257f132ae3/13059_2020_2103_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc72/7384222/66085de0ae85/13059_2020_2103_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc72/7384222/07334e1824a3/13059_2020_2103_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc72/7384222/829a6322fbae/13059_2020_2103_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc72/7384222/f21e5224c206/13059_2020_2103_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc72/7384222/ea257f132ae3/13059_2020_2103_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc72/7384222/66085de0ae85/13059_2020_2103_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc72/7384222/07334e1824a3/13059_2020_2103_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc72/7384222/829a6322fbae/13059_2020_2103_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc72/7384222/f21e5224c206/13059_2020_2103_Fig5_HTML.jpg

相似文献

1
Bayesian model selection reveals biological origins of zero inflation in single-cell transcriptomics.贝叶斯模型选择揭示了单细胞转录组学中零膨胀的生物学起源。
Genome Biol. 2020 Jul 27;21(1):183. doi: 10.1186/s13059-020-02103-2.
2
Bayesian gamma-negative binomial modeling of single-cell RNA sequencing data.贝叶斯伽马负二项式模型在单细胞 RNA 测序数据分析中的应用。
BMC Genomics. 2020 Sep 9;21(Suppl 9):585. doi: 10.1186/s12864-020-06938-8.
3
Identifying Subpopulations of Cells in Single-Cell Transcriptomic Data: A Bayesian Mixture Modeling Approach to Zero Inflation of Counts.基于零膨胀计数的贝叶斯混合模型方法鉴定单细胞转录组数据中的细胞亚群。
J Comput Biol. 2023 Oct;30(10):1059-1074. doi: 10.1089/cmb.2022.0273.
4
Observation weights unlock bulk RNA-seq tools for zero inflation and single-cell applications.观测权重为零膨胀和单细胞应用解锁了批量 RNA-seq 工具。
Genome Biol. 2018 Feb 26;19(1):24. doi: 10.1186/s13059-018-1406-4.
5
C-ziptf: stable tensor factorization for zero-inflated multi-dimensional genomics data.C-ziptf:用于零膨胀多维基因组学数据的稳定张量分解。
BMC Bioinformatics. 2024 Oct 5;25(1):323. doi: 10.1186/s12859-024-05886-4.
6
Differential expression of single-cell RNA-seq data using Tweedie models.基于 Tweedie 模型的单细胞 RNA-seq 数据差异表达分析。
Stat Med. 2022 Aug 15;41(18):3492-3510. doi: 10.1002/sim.9430. Epub 2022 Jun 2.
7
Modeling zero inflation is not necessary for spatial transcriptomics.对空间转录组学而言,对零膨胀进行建模并非必需。
Genome Biol. 2022 May 18;23(1):118. doi: 10.1186/s13059-022-02684-0.
8
Statistics or biology: the zero-inflation controversy about scRNA-seq data.统计学还是生物学:关于 scRNA-seq 数据的零膨胀争议。
Genome Biol. 2022 Jan 21;23(1):31. doi: 10.1186/s13059-022-02601-5.
9
bayNorm: Bayesian gene expression recovery, imputation and normalization for single-cell RNA-sequencing data.bayNorm:用于单细胞 RNA-seq 数据的贝叶斯基因表达恢复、插补和标准化。
Bioinformatics. 2020 Feb 15;36(4):1174-1181. doi: 10.1093/bioinformatics/btz726.
10
Modeling dynamic correlation in zero-inflated bivariate count data with applications to single-cell RNA sequencing data.对零膨胀双变量计数数据中的动态相关性进行建模,并将其应用于单细胞 RNA 测序数据。
Biometrics. 2022 Jun;78(2):766-776. doi: 10.1111/biom.13457. Epub 2021 Mar 30.

引用本文的文献

1
Cytoplasmic dynamics are overlooked in single nuclei RNA-seq but can be rescued by CytoRescue, a generative AI model to recover cytoplasm enriched gene.细胞质动力学在单核RNA测序中被忽视,但可以通过CytoRescue挽救,CytoRescue是一种用于恢复富含细胞质基因的生成式人工智能模型。
bioRxiv. 2025 Aug 21:2025.08.15.670239. doi: 10.1101/2025.08.15.670239.
2
scCOSMIX: A Mixed-Effects Framework for Differential Coexpression and Transcriptional Interactions Modeling in Single-Cell RNA-Seq.scCOSMIX:用于单细胞RNA测序中差异共表达和转录相互作用建模的混合效应框架
Stat Med. 2025 Aug;44(18-19):e70213. doi: 10.1002/sim.70213.
3
A scalable gut epithelial organoid model reveals the genome-wide colonization landscape of a human-adapted pathogen.

本文引用的文献

1
Droplet scRNA-seq is not zero-inflated.液滴单细胞RNA测序不存在零膨胀问题。
Nat Biotechnol. 2020 Feb;38(2):147-150. doi: 10.1038/s41587-019-0379-5.
2
Normalization and variance stabilization of single-cell RNA-seq data using regularized negative binomial regression.使用正则化负二项式回归进行单细胞 RNA-seq 数据的归一化和方差稳定化。
Genome Biol. 2019 Dec 23;20(1):296. doi: 10.1186/s13059-019-1874-1.
3
Feature selection and dimension reduction for single-cell RNA-Seq based on a multinomial model.基于多项模型的单细胞 RNA-Seq 特征选择和降维。
一种可扩展的肠道上皮类器官模型揭示了一种适应人类病原体的全基因组定殖情况。
Nat Genet. 2025 Jun 12. doi: 10.1038/s41588-025-02218-x.
4
Cell-cycle dependence of bursty gene expression: insights from fitting mechanistic models to single-cell RNA-seq data.爆发性基因表达的细胞周期依赖性:通过将机制模型拟合到单细胞RNA测序数据获得的见解
Nucleic Acids Res. 2025 Apr 10;53(7). doi: 10.1093/nar/gkaf295.
5
Protocol for directly selecting cell type marker genes for single-cell clustering analyses by Festem.通过Festem直接选择用于单细胞聚类分析的细胞类型标记基因的方案。
STAR Protoc. 2025 Mar 21;6(1):103514. doi: 10.1016/j.xpro.2024.103514. Epub 2024 Dec 18.
6
Statistically principled feature selection for single cell transcriptomics.单细胞转录组学的统计学原理特征选择
bioRxiv. 2024 Oct 15:2024.10.11.617709. doi: 10.1101/2024.10.11.617709.
7
Leveraging gene correlations in single cell transcriptomic data.利用单细胞转录组数据中的基因相关性。
BMC Bioinformatics. 2024 Sep 18;25(1):305. doi: 10.1186/s12859-024-05926-z.
8
Directly selecting cell-type marker genes for single-cell clustering analyses.直接选择用于单细胞聚类分析的细胞类型标记基因。
Cell Rep Methods. 2024 Jul 15;4(7):100810. doi: 10.1016/j.crmeth.2024.100810. Epub 2024 Jul 8.
9
Systematic evaluation with practical guidelines for single-cell and spatially resolved transcriptomics data simulation under multiple scenarios.系统评估及多种场景下单细胞和空间分辨转录组数据模拟的实用指南。
Genome Biol. 2024 Jun 3;25(1):145. doi: 10.1186/s13059-024-03290-y.
10
Niche-DE: niche-differential gene expression analysis in spatial transcriptomics data identifies context-dependent cell-cell interactions.Niche-DE:空间转录组学数据中的生态位差异基因表达分析确定了依赖于上下文的细胞-细胞相互作用。
Genome Biol. 2024 Jan 12;25(1):14. doi: 10.1186/s13059-023-03159-6.
Genome Biol. 2019 Dec 23;20(1):295. doi: 10.1186/s13059-019-1861-6.
4
Continuous and Discrete Neuron Types of the Adult Murine Striatum.成年鼠纹状体的连续和离散神经元类型。
Neuron. 2020 Feb 19;105(4):688-699.e8. doi: 10.1016/j.neuron.2019.11.004. Epub 2019 Dec 5.
5
False signals induced by single-cell imputation.单细胞插补诱导的假信号。
F1000Res. 2018 Nov 2;7:1740. doi: 10.12688/f1000research.16613.2. eCollection 2018.
6
Sensitivity and specificity of information criteria.信息准则的灵敏度和特异性。
Brief Bioinform. 2020 Mar 23;21(2):553-565. doi: 10.1093/bib/bbz016.
7
Clustering trees: a visualization for evaluating clusterings at multiple resolutions.聚类树:一种用于在多个分辨率下评估聚类的可视化方法。
Gigascience. 2018 Jul 1;7(7). doi: 10.1093/gigascience/giy083.
8
DrImpute: imputing dropout events in single cell RNA sequencing data.DrImpute:在单细胞 RNA 测序数据中推断缺失事件。
BMC Bioinformatics. 2018 Jun 8;19(1):220. doi: 10.1186/s12859-018-2226-y.
9
UMI-count modeling and differential expression analysis for single-cell RNA sequencing.UMI 计数建模和单细胞 RNA 测序的差异表达分析。
Genome Biol. 2018 May 31;19(1):70. doi: 10.1186/s13059-018-1438-9.
10
Single-cell transcriptomics of the mouse kidney reveals potential cellular targets of kidney disease.单细胞转录组学分析揭示了肾脏疾病的潜在细胞靶标。
Science. 2018 May 18;360(6390):758-763. doi: 10.1126/science.aar2131. Epub 2018 Apr 5.