Dublin City University, Dublin 9, Ireland.
Adv Exp Med Biol. 2010;680:139-47. doi: 10.1007/978-1-4419-5913-3_16.
Data preprocessing in microarray technology is a crucial initial step before data analysis is performed. Many preprocessing methods have been proposed but none has proved to be ideal to date. Frequently, datasets are limited by laboratory constraints so that the need is for guidelines on quality and robustness, to inform further experimentation while data are yet restricted. In this paper, we compared the performance of four popular methods, namely MAS5, Li & Wong pmonly (LWPM), Li & Wong subtractMM (LWMM), and Robust Multichip Average (RMA). The comparison is based on the analysis carried out on sets of laboratory-generated data from the Bioinformatics Lab, National Institute of Cellular Biotechnology (NICB), Dublin City University, Ireland. These experiments were designed to examine the effect of Bromodeoxyuridine (5-bromo-2-deoxyuridine, BrdU) treatment in deep lamellar keratoplasty (DLKP) cells. The methodology employed is to assess dispersion across the replicates and analyze the false discovery rate. From the dispersion analysis, we found that variability is reduced more effectively by LWPM and RMA methods. From the false positive analysis, and for both parametric and nonparametric approaches, LWMM is found to perform best. Based on a complementary q-value analysis, LWMM approach again is the strongest candidate. The indications are that, while LWMM is marginally less effective than LWPM and RMA in terms of variance reduction, it has considerably improved discrimination overall.
在进行数据分析之前,微阵列技术中的数据预处理是一个至关重要的初始步骤。已经提出了许多预处理方法,但迄今为止没有一种被证明是理想的。通常,数据集受到实验室限制,因此需要有关质量和稳健性的指南,以便在数据受到限制时为进一步的实验提供信息。在本文中,我们比较了四种流行方法的性能,即 MAS5、Li 和 Wong pmonly (LWPM)、Li 和 Wong subtractMM (LWMM) 和 Robust Multichip Average (RMA)。比较基于在爱尔兰都柏林城市大学国家细胞生物技术研究所 (NICB) 的生物信息学实验室生成的数据集上进行的分析。这些实验旨在研究溴脱氧尿苷 (5-bromo-2-deoxyuridine, BrdU) 处理对深层板层角膜移植 (DLKP) 细胞的影响。所采用的方法是评估重复之间的分散程度并分析错误发现率。从分散分析中,我们发现 LWPM 和 RMA 方法更有效地降低了变异性。从假阳性分析来看,对于参数和非参数方法,LWMM 表现最佳。基于互补的 q 值分析,LWMM 方法再次是最强的候选方法。这表明,虽然 LWMM 在方差减少方面略逊于 LWPM 和 RMA,但总体上具有更好的区分能力。