Liu Fang, Wu Baolin
Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN 55455, USA.
Comput Biol Chem. 2007 Apr;31(2):65-71. doi: 10.1016/j.compbiolchem.2007.02.004. Epub 2007 Feb 16.
It has recently been shown that cancer genes (oncogenes) tend to have heterogeneous expressions across disease samples. So it is reasonable to assume that in a microarray data only a subset of disease samples will be activated (often referred to as outliers), which presents some new challenges for statistical analysis. In this paper, we study the multi-class cancer outlier differential gene expression detection. Statistical methods will be proposed to take into account the expression heterogeneity. Through simulation studies and application to public microarray data, we will show that the proposed methods could provide more comprehensive analysis results and improve upon the traditional differential gene expression detection methods, which often ignore the expression heterogeneity and may loss power. Supplementary information can be found at http://www.biostat.umn.edu/~baolin/research/orf.html.
最近的研究表明,癌症基因(癌基因)在不同疾病样本中往往具有异质性表达。因此,可以合理地假设,在微阵列数据中,只有一部分疾病样本会被激活(通常称为异常值),这给统计分析带来了一些新的挑战。在本文中,我们研究多类癌症异常值差异基因表达检测。将提出统计方法以考虑表达异质性。通过模拟研究和对公共微阵列数据的应用,我们将表明所提出的方法可以提供更全面的分析结果,并改进传统的差异基因表达检测方法,传统方法往往忽略表达异质性,可能会降低功效。补充信息可在http://www.biostat.umn.edu/~baolin/research/orf.html上找到。