基于统计学的研究——盲目探索？

Statistics-based research--a pig in a poke?

机构信息

Scunthorpe General Hospital, Cliff Gardens, Scunthorpe, North Lincolnshire, UK.

出版信息

J Eval Clin Pract. 2011 Oct;17(5):862-7. doi: 10.1111/j.1365-2753.2011.01717.x. Epub 2011 Aug 11.

DOI:10.1111/j.1365-2753.2011.01717.x

Abstract

Much of medical research involves large-scale randomized controlled trials designed to detect small differences in outcome between the study groups. This approach is believed to produce reliable evidence on which the management of patients is based. But can we be sure that the demonstration of a small, albeit statistically significant, difference is sufficient to infer the presence of a causal relationship between the drug and the outcome? A study is claimed to have internal validity when other explanations for the observed difference - namely, inequalities between the groups, bias in the assessment of the outcome and chance - have been excluded. Despite the various processes that are put into place - including, for example, randomization, allocation concealment, double-blinding and intention-to-treat analysis - it remains doubtful whether the groups are equal in terms of all factors relevant to the outcome and whether bias has been excluded. As for the exclusion of chance, not only may inappropriate statistical tests be used, but also frequentist statistics has been subjected to serious criticisms in recent years that further bring internal validity into question. But the problems do not end with the flaws in internal validity. The philosophical basis of large-scale randomized controlled trials and epidemiological studies is unsound. When examined closely, many obstacles emerge that threaten the inference from a small, statistically significant difference to the presence of a causal relationship between the drug and the outcome. Given the influence of statistics-based research on the practice of medicine, it is of the utmost importance that the flaws in this methodology are brought to the fore.

摘要

许多医学研究都涉及大规模的随机对照试验，旨在检测研究组之间在结果上的微小差异。这种方法被认为可以产生可靠的证据，为患者的治疗提供依据。但是，我们能否确定，即使存在统计学上显著的微小差异，也足以推断药物与结果之间存在因果关系呢？当排除了其他导致观察到的差异的解释（即组间的不平等、结果评估中的偏差和偶然性）时，研究就被认为具有内部有效性。尽管采取了各种措施，包括随机化、分配隐藏、双盲和意向治疗分析，但仍难以确定两组在与结果相关的所有因素方面是否平等，以及是否排除了偏差。至于偶然性的排除，不仅可能使用不适当的统计检验，而且近年来，频率主义统计学也受到了严重的批评，这进一步使内部有效性受到质疑。但问题并没有随着内部有效性的缺陷而结束。大规模随机对照试验和流行病学研究的哲学基础并不健全。仔细研究后，会出现许多障碍，威胁到从药物与结果之间的统计学显著差异推断因果关系的推断。鉴于基于统计学的研究对医学实践的影响，将这种方法的缺陷暴露出来至关重要。