Oral Cancer Research Institute, Yonsei University College of Dentistry, Seoul, 120-752, Republic of Korea.
J Biosci Bioeng. 2009 Sep;108(3):252-8. doi: 10.1016/j.jbiosc.2009.03.017.
Microarray experiments are often performed to detect differently expressed genes among different clinical phenotypes. The method used to calculate the appropriate sample size for this purpose differs from the sample size calculation used for general clinical experiments, because microarrays include tens of thousands of genes. We proposed a sample size calculation method that considers variance among an entire gene set and used the Bonferroni correction to address the multiplicity problem. Specifically, by adjusting for the multiplicity problem, the existing equation for sample size calculation was modified based on the Bonferroni correction. By k-means cluster analysis, the variances across all genes can be divided into several groups with similar values, and the sample sizes for each group were subsequently calculated and weight-averaged. The results of this study show that the sample size was related to the number of genes on a chip. The weighted sample size, calculated by the proposed method, preserved the Type I error for selection of significant genes within a microarray data set.
微阵列实验常用于检测不同临床表型之间差异表达的基因。为此目的计算合适样本量的方法与用于一般临床实验的样本量计算方法不同,因为微阵列包含数万种基因。我们提出了一种考虑整个基因集方差的样本量计算方法,并使用 Bonferroni 校正来解决多重性问题。具体来说,通过调整多重性问题,根据 Bonferroni 校正修改了现有的样本量计算方程。通过 k-均值聚类分析,可以将所有基因的方差分为几组具有相似值的组,然后计算每组的样本量并进行加权平均。本研究的结果表明,样本量与芯片上的基因数量有关。通过所提出的方法计算的加权样本量保留了在微阵列数据集内选择显著基因的 I 型错误率。