Xiao Guanghua, Martinez-Vaz Betsy, Pan Wei, Khodursky Arkady B
Division of Biostatistics, School of Public Health, University of Minnesota, A460 Mayo Building, Minneapolis, MN 55455-0378, USA.
BMC Genomics. 2006 Apr 21;7:87. doi: 10.1186/1471-2164-7-87.
In prokaryotic genomes, genes are organized in operons, and the genes within an operon tend to have similar levels of expression. Because of co-transcription of genes within an operon, borrowing information from other genes within the same operon can improve the estimation of relative transcript levels; the estimation of relative levels of transcript abundances is one of the most challenging tasks in experimental genomics due to the high noise level in microarray data. Therefore, techniques that can improve such estimations, and moreover are based on sound biological premises, are expected to benefit the field of microarray data analysis
In this paper, we propose a hierarchical Bayesian model, which relies on borrowing information from other genes within the same operon, to improve the estimation of gene expression levels and, hence, the detection of differentially expressed genes. The simulation studies and the analysis of experiential data demonstrated that the proposed method outperformed other techniques that are routinely used to estimate transcript levels and detect differentially expressed genes, including the sample mean and SAM t statistics. The improvement became more significant as the noise level in microarray data increases.
By borrowing information about transcriptional activity of genes within classified operons, we improved the estimation of gene expression levels and the detection of differentially expressed genes.
在原核生物基因组中,基因以操纵子的形式组织,操纵子内的基因往往具有相似的表达水平。由于操纵子内基因的共转录,从同一操纵子内的其他基因借用信息可以改进相对转录水平的估计;由于微阵列数据中的高噪声水平,转录本丰度相对水平的估计是实验基因组学中最具挑战性的任务之一。因此,能够改进此类估计且基于合理生物学前提的技术有望造福微阵列数据分析领域。
在本文中,我们提出了一种层次贝叶斯模型,该模型依赖于从同一操纵子内的其他基因借用信息,以改进基因表达水平的估计,从而改进差异表达基因的检测。模拟研究和经验数据分析表明,所提出的方法优于其他常用于估计转录水平和检测差异表达基因的技术,包括样本均值和SAM t统计量。随着微阵列数据中噪声水平的增加,这种改进变得更加显著。
通过借用关于分类操纵子内基因转录活性的信息,我们改进了基因表达水平的估计和差异表达基因的检测。