Wit Ernst, McClure John
Department of Statistics, University of Glasgow, Glasgow G12 8QW, UK.
Bioinformatics. 2003 Jun 12;19(9):1055-60. doi: 10.1093/bioinformatics/btg003.
Numerical output of spotted microarrays displays censoring of pixel intensities at some software dependent threshold. This reduces the quality of gene expression data, because it seriously violates the linearity of expression with respect to signal intensity. Statistical methods based on typically available spot summaries together with some parametric assumptions can suggest ways to correct for this defect.
A maximum likelihood approach is suggested together with a sensible approximation to the joint density of the mean, median and variance-which are typically available to the biological end-user. The method 'corrects' the gene expression values for pixel censoring. A by-product of our approach is a comparison between several two-parameter models for pixel intensity values. It suggests that pixels separated by one or two other pixels can be considered independent draws from a Lognormal or a Gamma distribution.
The R/S-Plus code is available at http://www.stats.gla.ac.uk/~microarray/software.
点阵微阵列的数值输出在某些软件相关阈值处显示像素强度的删失。这降低了基因表达数据的质量,因为它严重违反了表达相对于信号强度的线性关系。基于通常可用的斑点汇总以及一些参数假设的统计方法可以提出纠正此缺陷的方法。
提出了一种最大似然方法以及对均值、中位数和方差的联合密度的合理近似——这些通常是生物终端用户可以获得的。该方法对像素删失的基因表达值进行“校正”。我们方法的一个副产品是对像素强度值的几个双参数模型进行比较。结果表明,被一两个其他像素隔开的像素可以被视为来自对数正态分布或伽马分布的独立抽样。
R/S-Plus代码可在http://www.stats.gla.ac.uk/~microarray/software获取。