研究结果呈现方式对研究解读及医学决策的影响：PRIMER合作组随机研究

Shakespeare Thomas Philip, Gebski Val, Tang Johann, Lim Keith, Lu Jiade Jay, Zhang Xiaojian, Jiang Guoliang

North Coast Cancer Institute, Coffs Harbour Health Campus, Coffs Harbour, New South Wales, Australia.

Med Decis Making. 2008 Jan-Feb;28(1):127-37. doi: 10.1177/0272989X07309640. Epub 2007 Dec 14.

BACKGROUND

The manner of presentation of research results may affect how clinicians interpret research and make clinical decisions. The authors evaluate whether the use of confidence levels improve research interpretation and decision making compared with P values and 95% confidence intervals.

METHODS

The 2 Presentation and Interpretation of Medical Research (PRIMER) studies were 3-arm randomized trials. PRIMER 1 presented results of 5 fictitious scenarios with P values (P), P plus 95% confidence intervals (P + CI), or P, CI, and confidence levels (P + CI + CL); PRIMER 2 compared P + CI + CL, P + CI, and P + CL. Clinicians were asked to identify the correct interpretation of scenarios in terms of statistical and clinical significance and then indicate the intended decision making in terms of treatment recommendation.

RESULTS

Seventy-five and 246 clinicians participated in PRIMER 1 and PRIMER 2, respectively. In PRIMER 1, P+CI+CL was superior to P + CI and P (P < 0.05); the latter 2 arms did not differ significantly. Decision making was not significantly different between arms. In PRIMER 2, P+CI+CL resulted in better interpretation than P + CI (P = 0.03), with no difference between P + CI and P + CL. In combined analysis, the odds of correct interpretation were higher for P+CI+CL than P+CI (odds ratio = 1.73, P=0.005, 95% CI= 1.19--2.52). Decision making was better for P + CI+ CL (P = 0.03). On multivariate analysis, the P + CI+ CL arm and clinicians with statistics training, not in private practice, or participating in PRIMER 1 had better interpretation. The P + CI+ CL arm was the only factor improving decision making.

CONCLUSIONS

Presenting research with a combination of P values, confidence intervals, and confidence levels leads to better interpretation and decision making by clinicians.

背景

研究结果的呈现方式可能会影响临床医生对研究的解读以及做出临床决策。作者评估与P值和95%置信区间相比，使用置信水平是否能改善研究解读和决策。

方法

两项医学研究的呈现与解读（PRIMER）研究为三臂随机试验。PRIMER 1呈现了5个虚拟场景的结果，分别采用P值（P）、P值加95%置信区间（P + CI）或P值、置信区间和置信水平（P + CI + CL）；PRIMER 2比较了P + CI + CL、P + CI和P + CL。要求临床医生根据统计学和临床意义确定对各场景的正确解读，然后指出在治疗建议方面的预期决策。

结果

分别有75名和246名临床医生参与了PRIMER 1和PRIMER 2。在PRIMER 1中，P + CI + CL优于P + CI和P（P < 0.05）；后两者之间无显著差异。各臂之间的决策无显著差异。在PRIMER 2中，P + CI + CL的解读优于P + CI（P = 0.03），P + CI和P + CL之间无差异。在综合分析中，P + CI + CL正确解读的几率高于P + CI（优势比 = 1.73，P = 0.005，95%置信区间 = 1.19 - 2.52）。P + CI + CL的决策更好（P = 0.03）。多变量分析显示，采用P + CI + CL的组以及接受过统计学培训、不在私人诊所工作或参与PRIMER 1的临床医生解读更好。采用P + CI + CL的组是改善决策的唯一因素。