Stoyanova Radka, Querec Troy D, Brown Truman R, Patriotis Christos
Division of Population Science, Fox Chase Cancer Center, 333 Cottman Avenue, Philadelphia, PA 19111-2497, USA.
Bioinformatics. 2004 Jul 22;20(11):1772-84. doi: 10.1093/bioinformatics/bth170. Epub 2004 Mar 22.
Detailed comparison and analysis of the output of DNA gene expression arrays from multiple samples require global normalization of the measured individual gene intensities from the different hybridizations. This is needed for accounting for variations in array preparation and sample hybridization conditions.
Here, we present a simple, robust and accurate procedure for the global normalization of datasets generated with single-channel DNA arrays based on principal component analysis. The procedure makes minimal assumptions about the data and performs well in cases where other standard procedures produced biased estimates. It is also insensitive to data transformation, filtering (thresholding) and pre-screening.
对来自多个样本的DNA基因表达阵列的输出进行详细比较和分析,需要对不同杂交中测得的各个基因强度进行全局归一化。这对于考虑阵列制备和样本杂交条件的变化是必要的。
在此,我们提出了一种基于主成分分析的单通道DNA阵列生成数据集的全局归一化的简单、稳健且准确的程序。该程序对数据的假设最少,在其他标准程序产生有偏差估计的情况下表现良好。它对数据转换、过滤(阈值化)和预筛选也不敏感。