Department of Epidemiology and Biostatistics and Urology Service, Memorial Sloan-Kettering Cancer Center, New York, New York 10065, USA.
J Urol. 2010 Apr;183(4):1510-4. doi: 10.1016/j.juro.2009.12.034. Epub 2010 Feb 20.
Surgeons routinely evaluate and modify their surgical technique to improve patient outcome. It is also common for surgeons to analyze results before and after a change in technique to determine whether the change led to better results. Simple comparison of results before and after surgical modification may be confounded by the surgical learning curve. We developed a statistical method applicable to analyzing before/after surgical studies.
We used simulation studies to compare different statistical analyses of before/after studies. We evaluated a simple 2-group comparison of results before and after the modification by the chi-square test and a novel bootstrap method that adjusts for the surgical learning curve.
In the presence of the learning curve a simple 2-group comparison almost always showed an ineffective surgical modification to be of benefit. When the surgical modification was harmful, leading to a 10% decrease in the success rate, 2-group comparison nonetheless showed a statistically significant improvement in outcome about 80% of the time. The bootstrap method had only moderate power but did not show that ineffective surgical modifications were beneficial more than would be expected by chance.
Simplistic approaches to the analysis of before/after surgical studies may lead to grossly erroneous results under the surgical learning curve. A straightforward alternative statistical method allows investigators to separate the effects of the learning curve from those of the surgical modification.
外科医生通常会评估和修改手术技术,以改善患者的预后。在技术改变前后分析结果以确定改变是否导致更好的结果,这也是外科医生的常见做法。简单地比较手术修改前后的结果可能会受到手术学习曲线的影响。我们开发了一种适用于分析前后手术研究的统计方法。
我们使用模拟研究比较了前后研究的不同统计分析。我们通过卡方检验评估了修改前后结果的简单 2 组比较,以及一种新的 bootstrap 方法,该方法可调整手术学习曲线。
在存在学习曲线的情况下,简单的 2 组比较几乎总是表明无效的手术修改是有益的。当手术修改是有害的,导致成功率降低 10%时,2 组比较仍然有大约 80%的时间显示出统计学上显著的结果改善。bootstrap 方法的功效只有中等水平,但并没有表明无效的手术修改比偶然更有益。
在手术学习曲线下,对前后手术研究的分析过于简单可能会导致严重错误的结果。一种简单的替代统计方法可以让研究人员将学习曲线的影响与手术修改的影响分开。