Hoffmann Reinhard, Seidl Thomas, Dugas Martin
Department of Bacteriology, Max von Pettenkofer Institut, Pettenkoferstrasse 9a, 80336 Munich, Germany.
Genome Biol. 2002 Jun 14;3(7):RESEARCH0033. doi: 10.1186/gb-2002-3-7-research0033.
Oligonucleotide microarrays measure the relative transcript abundance of thousands of mRNAs in parallel. A large number of procedures for normalization and detection of differentially expressed genes have been proposed. However, the relative impact of these methods on the detection of differentially expressed genes remains to be determined.
We have employed four different normalization methods and all possible combinations with three different statistical algorithms for detection of differentially expressed genes on a prototype dataset. The number of genes detected as differentially expressed differs by a factor of about three. Analysis of lists of genes detected as differentially expressed, and rank correlation coefficients for probability of differential expression shows that a high concordance between different methods can only be achieved by using the same normalization procedure.
Normalization has a profound influence of detection of differentially expressed genes. This influence is higher than that of three subsequent statistical analysis procedures examined. Algorithms incorporating more array-derived information than gene-expression values alone are urgently needed.
寡核苷酸微阵列可同时测量数千种mRNA的相对转录本丰度。已经提出了大量用于差异表达基因标准化和检测的程序。然而,这些方法对差异表达基因检测的相对影响仍有待确定。
我们采用了四种不同的标准化方法以及与三种不同统计算法的所有可能组合,用于在一个原型数据集上检测差异表达基因。检测为差异表达的基因数量相差约三倍。对检测为差异表达的基因列表的分析以及差异表达概率的秩相关系数表明,只有使用相同的标准化程序才能在不同方法之间实现高度一致性。
标准化对差异表达基因的检测有深远影响。这种影响高于随后检查的三种统计分析程序的影响。迫切需要比单独的基因表达值纳入更多阵列衍生信息的算法。