Stürmer Til, Joshi Manisha, Glynn Robert J, Avorn Jerry, Rothman Kenneth J, Schneeweiss Sebastian
Division of Pharmacoepidemiology and Pharmacoeconomics, Brigham and Women's Hospital, Harvard Medical School, 1620 Tremont Street, Suite 3030, Boston, MA 02120, USA.
J Clin Epidemiol. 2006 May;59(5):437-47. doi: 10.1016/j.jclinepi.2005.07.004. Epub 2005 Oct 13.
Propensity score (PS) analyses attempt to control for confounding in nonexperimental studies by adjusting for the likelihood that a given patient is exposed. Such analyses have been proposed to address confounding by indication, but there is little empirical evidence that they achieve better control than conventional multivariate outcome modeling.
Using PubMed and Science Citation Index, we assessed the use of propensity scores over time and critically evaluated studies published through 2003.
Use of propensity scores increased from a total of 8 reports before 1998 to 71 in 2003. Most of the 177 published studies abstracted assessed medications (N=60) or surgical interventions (N=51), mainly in cardiology and cardiac surgery (N=90). Whether PS methods or conventional outcome models were used to control for confounding had little effect on results in those studies in which such comparison was possible. Only 9 of 69 studies (13%) had an effect estimate that differed by more than 20% from that obtained with a conventional outcome model in all PS analyses presented.
Publication of results based on propensity score methods has increased dramatically, but there is little evidence that these methods yield substantially different estimates compared with conventional multivariable methods.
倾向评分(PS)分析试图通过调整给定患者暴露的可能性来控制非实验性研究中的混杂因素。有人提出此类分析以解决指征性混杂问题,但几乎没有实证证据表明它们比传统的多变量结局建模能实现更好的控制。
我们利用PubMed和科学引文索引评估了倾向评分随时间的使用情况,并对截至2003年发表的研究进行了严格评估。
倾向评分的使用从1998年之前总共8篇报告增加到2003年的71篇。在抽取摘要的177项已发表研究中,大多数评估了药物(N = 60)或手术干预(N = 51),主要涉及心脏病学和心脏外科(N = 90)。在那些可以进行此类比较的研究中,使用PS方法还是传统结局模型来控制混杂因素对结果影响不大。在所有呈现的PS分析中,69项研究中只有9项(13%)的效应估计与使用传统结局模型得到的结果相差超过20%。
基于倾向评分方法的结果发表量大幅增加,但几乎没有证据表明这些方法与传统多变量方法相比能产生实质上不同的估计值。