一种用于标记弱点的统计方法可改善微阵列中的标准化和比率估计。

A statistical method for flagging weak spots improves normalization and ratio estimates in microarrays.

作者信息

Yang M C, Ruan Q G, Yang J J, Eckenrode S, Wu S, McIndoe R A, She J X

机构信息

Department of Statistics, University of Florida, Department of Pathology, Center for Mammalian Genetics and Diabetes Center of Excellence, University of Florida, Gainesville, Florida, 32610-0275, USA.

出版信息

Physiol Genomics. 2001 Oct 10;7(1):45-53. doi: 10.1152/physiolgenomics.00020.2001.

DOI:10.1152/physiolgenomics.00020.2001

PMID:11595791

Abstract

Over the last few years, there has been a dramatic increase in the use of cDNA microarrays to monitor gene expression changes in biological systems. Data from these experiments are usually transformed into expression ratios between experimental samples and a common reference sample for subsequent data analysis. The accuracy of this critical transformation depends on two major parameters: the signal intensities and the normalization of the experiment vs. reference signal intensities. Here we describe and validate a new model for microarray signal intensity that has one multiplicative variation and one additive background variation. Using replicative experiments and simulated data, we found that the signal intensity is the most critical parameter that influences the performance of normalization, accuracy of ratio estimates, reproducibility, specificity, and sensitivity of microarray experiments. Therefore, we developed a statistical procedure to flag spots with weak signal intensity based on the standard deviation (delta(ij)) of background differences between a spot and the neighboring spots, i.e., a spot is considered as too weak if the signal is weaker than cdelta(ij). Our studies suggest that normalization and ratio estimates were unacceptable when this threshold (c) is small. We further showed that when a reasonable compromise of c (c = 6) is applied, normalization using trimmed mean of log ratios performed slightly better than global intensity and mean of ratios. These studies suggest that decreasing the background noise is critical to improve the quality of microarray experiments.

摘要

在过去几年中，利用cDNA微阵列监测生物系统中基因表达变化的情况急剧增加。这些实验的数据通常会转化为实验样本与共同参考样本之间的表达比率，用于后续的数据分析。这一关键转化的准确性取决于两个主要参数：信号强度以及实验信号强度与参考信号强度的归一化。在此，我们描述并验证了一种新的微阵列信号强度模型，该模型具有一个乘性变化和一个加性背景变化。通过重复实验和模拟数据，我们发现信号强度是影响归一化性能、比率估计准确性、重现性、特异性以及微阵列实验灵敏度的最关键参数。因此，我们开发了一种统计程序，根据一个点与相邻点之间背景差异的标准差（δ(ij)）来标记信号强度较弱的点，即如果信号比cδ(ij)弱，则该点被认为过弱。我们的研究表明，当这个阈值（c）较小时，归一化和比率估计是不可接受的。我们进一步表明，当应用合理的c值折衷（c = 6）时，使用对数比率截尾均值进行归一化的效果略优于全局强度和比率均值。这些研究表明，降低背景噪声对于提高微阵列实验质量至关重要。