Feise Ronald J
Institute of Evidence-Based Chiropractic 6252 Rookery Road, Fort Collins, Colorado 80528, USA.
BMC Med Res Methodol. 2002 Jun 17;2:8. doi: 10.1186/1471-2288-2-8.
Readers may question the interpretation of findings in clinical trials when multiple outcome measures are used without adjustment of the p-value. This question arises because of the increased risk of Type I errors (findings of false "significance") when multiple simultaneous hypotheses are tested at set p-values. The primary aim of this study was to estimate the need to make appropriate p-value adjustments in clinical trials to compensate for a possible increased risk in committing Type I errors when multiple outcome measures are used.
The classicists believe that the chance of finding at least one test statistically significant due to chance and incorrectly declaring a difference increases as the number of comparisons increases. The rationalists have the following objections to that theory: 1) P-value adjustments are calculated based on how many tests are to be considered, and that number has been defined arbitrarily and variably; 2) P-value adjustments reduce the chance of making type I errors, but they increase the chance of making type II errors or needing to increase the sample size.
Readers should balance a study's statistical significance with the magnitude of effect, the quality of the study and with findings from other studies. Researchers facing multiple outcome measures might want to either select a primary outcome measure or use a global assessment measure, rather than adjusting the p-value.
当使用多个结局指标而未对p值进行校正时,读者可能会质疑临床试验结果的解读。出现这个问题的原因是,当以设定的p值同时检验多个假设时,I类错误(错误地得出“显著性”结果)的风险会增加。本研究的主要目的是评估在临床试验中进行适当的p值校正的必要性,以弥补使用多个结局指标时可能增加的犯I类错误的风险。
传统主义者认为,由于偶然因素而发现至少一项检验具有统计学显著性并错误地宣称存在差异的可能性会随着比较次数的增加而增加。理性主义者对该理论有以下异议:1)p值校正是根据要考虑的检验数量来计算的,而该数量是任意定义且变化不定的;2)p值校正降低了犯I类错误的可能性,但增加了犯II类错误的可能性或需要增加样本量。
读者应在研究的统计学显著性与效应大小、研究质量以及其他研究结果之间进行权衡。面对多个结局指标的研究人员可能希望要么选择一个主要结局指标,要么使用一个整体评估指标,而不是对p值进行校正。