Li Hao, Wood Constance L, Getchell Thomas V, Getchell Marilyn L, Stromberg Arnold J
Department of Statistics, 815 Patterson Office Tower, University of Kentucky, Lexington, Kentucky 40506-0027, USA.
BMC Bioinformatics. 2004 Dec 30;5:209. doi: 10.1186/1471-2105-5-209.
Two or more factor mixed factorial experiments are becoming increasingly common in microarray data analysis. In this case study, the two factors are presence (Patients with Alzheimer's disease) or absence (Control) of the disease, and brain regions including olfactory bulb (OB) or cerebellum (CER). In the design considered in this manuscript, OB and CER are repeated measurements from the same subject and, hence, are correlated. It is critical to identify sources of variability in the analysis of oligonucleotide array experiments with repeated measures and correlations among data points have to be considered. In addition, multiple testing problems are more complicated in experiments with multi-level treatments or treatment combinations.
In this study we adopted a linear mixed model to analyze oligonucleotide array experiments with repeated measures. We first construct a generalized F test to select differentially expressed genes. The Benjamini and Hochberg (BH) procedure of controlling false discovery rate (FDR) at 5% was applied to the P values of the generalized F test. For those genes with significant generalized F test, we then categorize them based on whether the interaction terms were significant or not at the alpha-level (alphanew = 0.0033) determined by the FDR procedure. Since simple effects may be examined for the genes with significant interaction effect, we adopt the protected Fisher's least significant difference test (LSD) procedure at the level of alphanew to control the family-wise error rate (FWER) for each gene examined.
A linear mixed model is appropriate for analysis of oligonucleotide array experiments with repeated measures. We constructed a generalized F test to select differentially expressed genes, and then applied a specific sequence of tests to identify factorial effects. This sequence of tests applied was designed to control for gene based FWER.
在微阵列数据分析中,两个或更多因素的混合析因实验正变得越来越普遍。在本案例研究中,两个因素分别是疾病的存在(阿尔茨海默病患者)或不存在(对照),以及包括嗅球(OB)或小脑(CER)在内的脑区。在本手稿所考虑的设计中,OB和CER是来自同一受试者的重复测量值,因此是相关的。在分析具有重复测量且数据点之间存在相关性的寡核苷酸阵列实验时,识别变异性来源至关重要。此外,在具有多级处理或处理组合的实验中,多重检验问题更为复杂。
在本研究中,我们采用线性混合模型来分析具有重复测量的寡核苷酸阵列实验。我们首先构建一个广义F检验来选择差异表达基因。将控制错误发现率(FDR)为5%的Benjamini和Hochberg(BH)程序应用于广义F检验的P值。对于那些广义F检验显著的基因,然后我们根据交互项在由FDR程序确定的α水平(α新 = 0.0033)上是否显著来对它们进行分类。由于对于具有显著交互效应的基因可以检验简单效应,我们在α新水平上采用受保护的Fisher最小显著差异检验(LSD)程序来控制每个检验基因的家族wise错误率(FWER)。
线性混合模型适用于分析具有重复测量的寡核苷酸阵列实验。我们构建了一个广义F检验来选择差异表达基因,然后应用特定的检验序列来识别析因效应。所应用的这个检验序列旨在控制基于基因的FWER。