Computational Biology Group, Institute of Infectious Diseases and Molecular Medicine, University of Cape Town, South Africa.
Bioinformatics. 2013 Sep 1;29(17):2211-2. doi: 10.1093/bioinformatics/btt351. Epub 2013 Jul 3.
Gene expression data are typically generated from heterogeneous biological samples that are composed of multiple cell or tissue types, in varying proportions, each contributing to global gene expression. This heterogeneity is a major confounder in standard analysis such as differential expression analysis, where differences in the relative proportions of the constituent cells may prevent or bias the detection of cell-specific differences. Computational deconvolution of global gene expression is an appealing alternative to costly physical sample separation techniques and enables a more detailed analysis of the underlying biological processes at the cell-type level. To facilitate and popularize the application of such methods, we developed CellMix, an R package that incorporates most state-of-the-art deconvolution methods, into an intuitive and extendible framework, providing a single entry point to explore, assess and disentangle gene expression data from heterogeneous samples.
The CellMix package builds on R/BioConductor and is available from http://web.cbio.uct.ac.za/∼renaud/CRAN/web/CellMix. It is currently being submitted to BioConductor. The package's vignettes notably contain additional information, examples and references.
基因表达数据通常是从由多种细胞或组织类型组成的异质生物样本中生成的,这些细胞或组织类型以不同的比例存在,每个细胞或组织类型都对全局基因表达有贡献。这种异质性是标准分析(如差异表达分析)中的一个主要混杂因素,其中组成细胞的相对比例的差异可能会阻止或偏向于检测细胞特异性差异。全局基因表达的计算去卷积是一种有吸引力的替代昂贵的物理样本分离技术的方法,能够更详细地分析细胞水平的潜在生物学过程。为了促进和推广此类方法的应用,我们开发了 CellMix,这是一个 R 包,将最先进的去卷积方法集成到一个直观和可扩展的框架中,为探索、评估和分解异质样本中的基因表达数据提供了一个单一的入口点。
CellMix 包建立在 R/BioConductor 之上,可从 http://web.cbio.uct.ac.za/∼renaud/CRAN/web/CellMix 获得。它目前正在提交给 BioConductor。该包的示例特别包含了其他信息、示例和参考资料。