A compositional model to assess expression changes from single-cell RNA-seq data.

Authors

Ma Xiuyu, Korthauer Keegan, Kendziorski Christina, Newton Michael A

Affiliations

Department of Statistics, University of Wisconsin-Madison.

Department of Statistics, University of British Columbia.

Publication information

Ann Appl Stat. 2021 Jun;15(2):880-901. doi: 10.1214/20-aoas1423. Epub 2021 Jul 12.

Abstract

On the problem of scoring genes for evidence of changes in the distribution of single-cell expression, we introduce an empirical Bayesian mixture approach and evaluate its operating characteristics in a range of numerical experiments. The proposed approach leverages cell-subtype structure revealed in cluster analysis in order to boost gene-level information on expression changes. Cell clustering informs gene-level analysis through a specially-constructed prior distribution over pairs of multinomial probability vectors; this prior meshes with available model-based tools that score patterns of differential expression over multiple subtypes. We derive an explicit formula for the posterior probability that a gene has the same distribution in two cellular conditions, allowing for a gene-specific mixture over subtypes in each condition. Advantage is gained by the compositional structure of the model not only in which a host of gene-specific mixture components are allowed but also in which the mixing proportions are constrained at the whole cell level. This structure leads to a novel form of information sharing through which the cell-clustering results support gene-level scoring of differential distribution. The result, according to our numerical experiments, is improved sensitivity compared to several standard approaches for detecting distributional expression changes.
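
As a rough illustration of the kind of calculation the abstract describes, the sketch below computes a posterior probability that two conditions share the same subtype proportions, using a standard Dirichlet-multinomial comparison. It is a deliberate simplification and not the paper's formula: the abstract's prior over pairs of multinomial probability vectors is specially constructed and gene-specific, whereas this sketch assumes a plain symmetric Dirichlet prior and works only with cell-level cluster counts. The function names, the default `alpha`, and the prior weight `prior_same` are all hypothetical choices made for the example.

```python
from math import exp, lgamma, log


def log_marginal(counts, alpha):
    """Log marginal likelihood of an ordered sequence of subtype labels with
    the given per-subtype counts, under a Dirichlet(alpha) prior on the
    subtype proportions (standard Dirichlet-multinomial result)."""
    n = sum(counts)
    a0 = sum(alpha)
    out = lgamma(a0) - lgamma(a0 + n)
    for z, a in zip(counts, alpha):
        out += lgamma(a + z) - lgamma(a)
    return out


def prob_same_proportions(counts1, counts2, alpha=None, prior_same=0.5):
    """Posterior probability that two conditions share one vector of subtype
    proportions, versus each having its own independent vector.

    Assumption (not from the paper): both hypotheses use the same
    Dirichlet(alpha) prior, with alpha defaulting to all ones.
    """
    k = len(counts1)
    if len(counts2) != k:
        raise ValueError("both conditions need counts for the same subtypes")
    if alpha is None:
        alpha = [1.0] * k

    # Shared proportions: cells from both conditions are pooled.
    pooled = [a + b for a, b in zip(counts1, counts2)]
    log_same = log_marginal(pooled, alpha)
    # Separate proportions: each condition gets its own marginal likelihood.
    log_diff = log_marginal(counts1, alpha) + log_marginal(counts2, alpha)

    log_odds = log(prior_same) - log(1.0 - prior_same) + log_same - log_diff
    # Numerically stable logistic transform of the log posterior odds.
    if log_odds >= 0:
        return 1.0 / (1.0 + exp(-log_odds))
    e = exp(log_odds)
    return e / (1.0 + e)


if __name__ == "__main__":
    # Hypothetical cluster counts for three subtypes in two conditions.
    print(prob_same_proportions([50, 30, 20], [50, 30, 20]))
    print(prob_same_proportions([90, 5, 5], [5, 5, 90]))
```

With matching counts the posterior favors shared proportions, and with strongly divergent counts it falls close to zero. The method in the abstract goes further by combining subtype-level evidence of this kind with gene-level scores of differential expression patterns across subtypes, which this sketch does not attempt.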
