Simpson Helen Blair, Petkova Eva, Cheng Jianfeng, Huppert Jonathan, Foa Edna, Liebowitz Michael R
Department of Psychiatry, Columbia University, USA.
J Psychiatr Res. 2008 Jul;42(8):631-8. doi: 10.1016/j.jpsychires.2007.07.012. Epub 2007 Sep 24.
Longitudinal clinical trials in psychiatry have used various statistical methods to examine treatment effects. The validity of the inferences depends upon the different method's assumptions and whether a given study violates those assumptions. The objective of this paper was to elucidate these complex issues by comparing various methods for handling missing data (e.g., last observation carried forward [LOCF], completer analysis, propensity-adjusted multiple imputation) and for analyzing outcome (e.g., end-point analysis, repeated-measures analysis of variance [RM-ANOVA], mixed-effects models [MEMs]) using data from a multi-site randomized controlled trial in obsessive-compulsive disorder (OCD). The trial compared the effects of 12 weeks of exposure and ritual prevention (EX/RP), clomipramine (CMI), their combination (EX/RP&CMI) or pill placebo in 122 adults with OCD. The primary outcome measure was the Yale-Brown Obsessive Compulsive Scale. For most comparisons, inferences about the relative efficacy of the different treatments were impervious to different methods for handling missing data and analyzing outcome. However, when EX/RP was compared to CMI and when CMI was compared to placebo, traditional methods (e.g., LOCF, RM-ANOVA) led to different inferences than currently recommended alternatives (e.g., multiple imputation based on estimation-maximization algorithm, MEMs). Thus, inferences about treatment efficacy can be affected by statistical choices. This is most likely when there are small but potentially clinically meaningful treatment differences and when sample sizes are modest. The use of appropriate statistical methods in psychiatric trials can advance public health by ensuring that valid inferences are made about treatment efficacy.
精神病学领域的纵向临床试验采用了各种统计方法来检验治疗效果。推断的有效性取决于不同方法的假设以及特定研究是否违反这些假设。本文的目的是通过比较处理缺失数据的各种方法(例如,末次观察值结转[LOCF]、完全分析、倾向调整多重插补)以及使用强迫症(OCD)多中心随机对照试验的数据进行结果分析的方法(例如,终点分析、重复测量方差分析[RM-ANOVA]、混合效应模型[MEMs])来阐明这些复杂问题。该试验比较了12周暴露与仪式预防(EX/RP)、氯米帕明(CMI)、它们的联合使用(EX/RP&CMI)或安慰剂对122名成年强迫症患者的影响。主要结局指标是耶鲁-布朗强迫症量表。对于大多数比较而言,关于不同治疗相对疗效的推断不受处理缺失数据和分析结果的不同方法的影响。然而,当比较EX/RP与CMI以及CMI与安慰剂时,传统方法(例如,LOCF、RM-ANOVA)得出的推断与当前推荐的替代方法(例如,基于期望最大化算法的多重插补、MEMs)不同。因此,关于治疗疗效的推断可能会受到统计选择的影响。当存在小但可能具有临床意义的治疗差异且样本量适中时,这种情况最有可能发生。在精神病学试验中使用适当的统计方法可以通过确保对治疗疗效做出有效的推断来促进公共卫生。