Department of Arthritis and Immunology, Oklahoma Medical Research Foundation, Oklahoma City, Oklahoma, United States of America.
PLoS One. 2010 Sep 9;5(9):e12657. doi: 10.1371/journal.pone.0012657.
The number of methods for pre-processing and analysis of gene expression data continues to increase, often making it difficult to select the most appropriate approach. We present a simple procedure for comparative estimation of a variety of methods for microarray data pre-processing and analysis. Our approach is based on the use of real microarray data in which controlled fold changes are introduced into 20% of the data to provide a metric for comparison with the unmodified data. The data modifications can be easily applied to raw data measured with any technological platform and retains all the complex structures and statistical characteristics of the real-world data. The power of the method is illustrated by its application to the quantitative comparison of different methods of normalization and analysis of microarray data. Our results demonstrate that the method of controlled modifications of real experimental data provides a simple tool for assessing the performance of data preprocessing and analysis methods.
用于基因表达数据预处理和分析的方法数量不断增加,这使得选择最合适的方法变得困难。我们提出了一种简单的程序,用于比较评估各种微阵列数据预处理和分析方法。我们的方法基于使用真实的微阵列数据,其中将受控的倍数变化引入到数据的 20%中,以便与未修改的数据进行比较。可以轻松地将数据修改应用于使用任何技术平台测量的原始数据,并保留真实世界数据的所有复杂结构和统计特征。该方法的强大功能通过其在定量比较不同的微阵列数据归一化和分析方法中的应用得到了说明。我们的结果表明,真实实验数据的受控修改方法为评估数据预处理和分析方法的性能提供了一种简单的工具。