Suppr超能文献

CellCODE:一种用于异质细胞群体差异表达分析的稳健潜在变量方法。

CellCODE: a robust latent variable approach to differential expression analysis for heterogeneous cell populations.

作者信息

Chikina Maria, Zaslavsky Elena, Sealfon Stuart C

机构信息

Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA 15217, USA and Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA.

出版信息

Bioinformatics. 2015 May 15;31(10):1584-91. doi: 10.1093/bioinformatics/btv015. Epub 2015 Jan 11.

Abstract

MOTIVATION

Identifying alterations in gene expression associated with different clinical states is important for the study of human biology. However, clinical samples used in gene expression studies are often derived from heterogeneous mixtures with variable cell-type composition, complicating statistical analysis. Considerable effort has been devoted to modeling sample heterogeneity, and presently, there are many methods that can estimate cell proportions or pure cell-type expression from mixture data. However, there is no method that comprehensively addresses mixture analysis in the context of differential expression without relying on additional proportion information, which can be inaccurate and is frequently unavailable.

RESULTS

In this study, we consider a clinically relevant situation where neither accurate proportion estimates nor pure cell expression is of direct interest, but where we are rather interested in detecting and interpreting relevant differential expression in mixture samples. We develop a method, Cell-type COmputational Differential Estimation (CellCODE), that addresses the specific statistical question directly, without requiring a physical model for mixture components. Our approach is based on latent variable analysis and is computationally transparent; it requires no additional experimental data, yet outperforms existing methods that use independent proportion measurements. CellCODE has few parameters that are robust and easy to interpret. The method can be used to track changes in proportion, improve power to detect differential expression and assign the differentially expressed genes to the correct cell type.

摘要

动机

识别与不同临床状态相关的基因表达变化对于人类生物学研究至关重要。然而,基因表达研究中使用的临床样本通常来自细胞类型组成各异的异质混合物,这使得统计分析变得复杂。人们已投入大量精力对样本异质性进行建模,目前有许多方法可以从混合数据中估计细胞比例或纯细胞类型的表达。然而,尚无一种方法能在不依赖额外比例信息(这种信息可能不准确且常常无法获得)的情况下,全面解决差异表达背景下的混合分析问题。

结果

在本研究中,我们考虑一种临床相关情况,即准确的比例估计和纯细胞表达都不是直接关注的重点,而我们更感兴趣的是检测和解释混合样本中的相关差异表达。我们开发了一种方法,即细胞类型计算差异估计法(CellCODE),该方法直接解决特定的统计问题,无需混合成分的物理模型。我们的方法基于潜在变量分析,计算过程透明;它不需要额外的实验数据,但性能优于使用独立比例测量的现有方法。CellCODE的参数很少,且稳健易解释。该方法可用于追踪比例变化、提高检测差异表达的能力,并将差异表达基因分配到正确的细胞类型。

相似文献

3
Differential correlation for sequencing data.测序数据的差异相关性
BMC Res Notes. 2017 Jan 19;10(1):54. doi: 10.1186/s13104-016-2331-9.
9

引用本文的文献

本文引用的文献

4
CellMix: a comprehensive toolbox for gene expression deconvolution.CellMix:一个全面的基因表达解卷积工具包。
Bioinformatics. 2013 Sep 1;29(17):2211-2. doi: 10.1093/bioinformatics/btt351. Epub 2013 Jul 3.
9
Cell subset prediction for blood genomic studies.血液基因组研究的细胞亚群预测。
BMC Bioinformatics. 2011 Jun 24;12:258. doi: 10.1186/1471-2105-12-258.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验