Division of Rheumatology, Department of Medicine, University Health Network, Toronto, Ontario M5T 2S8, Canada.
J Clin Epidemiol. 2010 Apr;63(4):355-69. doi: 10.1016/j.jclinepi.2009.06.003. Epub 2009 Aug 27.
Bayesian analysis can incorporate clinicians' beliefs about treatment effectiveness into models that estimate treatment effects. Many elicitation methods are available, but it is unclear if any confer advantages based on principles of measurement science. We review belief-elicitation methods for Bayesian analysis and determine if any of them had an incremental value over the others based on its validity, reliability, and responsiveness.
A systematic review was performed. MEDLINE, EMBASE, CINAHL, Health and Psychosocial Instruments, Current Index to Statistics, MathSciNet, and Zentralblatt Math were searched using the terms (prior OR prior probability distribution) AND (beliefs OR elicitation) AND (Bayes OR Bayesian). Studies were evaluated on: design, question stem, response options, analysis, consideration of validity, reliability, and responsiveness.
We identified 33 studies describing methods for elicitation in a Bayesian context. Elicitation occurred in cross-sectional studies (n=30, 89%), to derive point estimates with individual-level variation (n=19; 58%). Although 64% (n=21) considered validity, 24% (n=8) reliability, 12% (n=4) responsiveness of the elicitation methods, only 12% (n=4) formally tested validity, 6% (n=2) tested reliability, and none tested responsiveness.
We have summarized methods of belief elicitation for Bayesian priors. The validity, reliability, and responsiveness of elicitation methods have been infrequently evaluated. Until comparative studies are performed, strategies to reduce the effects of bias on the elicitation should be used.
贝叶斯分析可以将临床医生对治疗效果的信念纳入估计治疗效果的模型中。有许多启发式方法可用,但不清楚是否根据测量科学原理,任何一种方法都具有优势。我们回顾了用于贝叶斯分析的信念启发方法,并确定它们中的任何一种方法是否基于其有效性、可靠性和响应性而具有增量价值。
进行了系统评价。使用术语(先验或先验概率分布)和(信念或启发)和(贝叶斯或贝叶斯),在 MEDLINE、EMBASE、CINAHL、健康和社会心理仪器、当前统计索引、MathSciNet 和 Zentralblatt Math 中进行了搜索。研究评估的内容包括:设计、问题题头、应答选项、分析、有效性、可靠性和响应性的考虑。
我们确定了 33 项描述贝叶斯背景下启发方法的研究。启发发生在横断面研究中(n=30,89%),以得出具有个体水平变化的点估计(n=19;58%)。尽管 64%(n=21)考虑了启发方法的有效性,24%(n=8)考虑了可靠性,12%(n=4)考虑了响应性,但只有 12%(n=4)正式测试了有效性,6%(n=2)测试了可靠性,没有测试响应性。
我们总结了贝叶斯先验信念启发的方法。启发方法的有效性、可靠性和响应性很少被评估。在进行比较研究之前,应使用策略来减少启发偏差的影响。