Department of Molecular and Cell Biology, University of Connecticut at Storrs, Storrs, CT 06269-3125.
Mol Biol Cell. 2024 Jan 1;35(1):br1. doi: 10.1091/mbc.E23-05-0159. Epub 2023 Nov 1.
Power of statistical tests for differences in means is the probability of obtaining a statistically significant value when means differ. When samples in experimental replicates come from a single cell culture, they are matched or paired because they share between-trials biological variability. This can cause positive correlation between values from conditions in a replicate. Correlation can also be caused in otherwise independent samples by shared technical variability. However, correlation is reduced by noise that affects samples individually. I investigated how to maximize power in experiments with two conditions over a range of correlations. Normalizing data to control increases the rate of false positives, if Student's test is used. Paired tests, theoretically the correct test for matched samples, have higher power than Student's test when correlation is high, but lower power when correlation is low. Testing correlation to select a test for differences in mean can affect the subsequent rate of false positives. Ultimately, components of experimental variability must be considered to choose the most powerful two sample test for differences in mean. This contrasts with experiments with more than two conditions, where random-block ANOVA, a matched samples test, can be used as a default.
均值差异的统计检验功效是指当均值存在差异时,获得统计学显著值的概率。当实验重复中的样本来自单个细胞培养时,由于它们共享试验间的生物学变异性,因此它们是匹配的或配对的。这可能导致重复中条件值之间存在正相关。相关性也可能由共享技术变异性引起在其他独立的样本中。然而,相关性会因单独影响样本的噪声而降低。我研究了在一系列相关性下,如何使具有两种条件的实验达到最大功效。如果使用学生 t 检验,将数据标准化到对照会增加假阳性率。配对检验是针对匹配样本的正确检验,当相关性高时,其功效高于学生 t 检验,但当相关性低时,其功效低于学生 t 检验。检验相关性以选择用于均值差异的检验可能会影响后续的假阳性率。最终,必须考虑实验变异性的组成部分,以选择用于均值差异的最强大的双样本检验。这与具有两个以上条件的实验形成对比,在具有两个以上条件的实验中,可以使用随机区组 ANOVA(一种匹配样本检验)作为默认检验。