不同倾向评分法在观察性研究中估计比例差异（风险差异或绝对风险降低）的表现。

The performance of different propensity-score methods for estimating differences in proportions (risk differences or absolute risk reductions) in observational studies.

机构信息

Institute for Clinical Evaluative Sciences, Toronto, ON, Canada.

出版信息

Stat Med. 2010 Sep 10;29(20):2137-48. doi: 10.1002/sim.3854.

DOI:10.1002/sim.3854

PMID:20108233

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3068290/

Abstract

Propensity score methods are increasingly being used to estimate the effects of treatments on health outcomes using observational data. There are four methods for using the propensity score to estimate treatment effects: covariate adjustment using the propensity score, stratification on the propensity score, propensity-score matching, and inverse probability of treatment weighting (IPTW) using the propensity score. When outcomes are binary, the effect of treatment on the outcome can be described using odds ratios, relative risks, risk differences, or the number needed to treat. Several clinical commentators suggested that risk differences and numbers needed to treat are more meaningful for clinical decision making than are odds ratios or relative risks. However, there is a paucity of information about the relative performance of the different propensity-score methods for estimating risk differences. We conducted a series of Monte Carlo simulations to examine this issue. We examined bias, variance estimation, coverage of confidence intervals, mean-squared error (MSE), and type I error rates. A doubly robust version of IPTW had superior performance compared with the other propensity-score methods. It resulted in unbiased estimation of risk differences, treatment effects with the lowest standard errors, confidence intervals with the correct coverage rates, and correct type I error rates. Stratification, matching on the propensity score, and covariate adjustment using the propensity score resulted in minor to modest bias in estimating risk differences. Estimators based on IPTW had lower MSE compared with other propensity-score methods. Differences between IPTW and propensity-score matching may reflect that these two methods estimate the average treatment effect and the average treatment effect for the treated, respectively.

摘要

倾向评分法越来越多地被用于使用观察性数据估计治疗对健康结果的影响。使用倾向评分估计治疗效果有四种方法：使用倾向评分进行协变量调整、倾向评分分层、倾向评分匹配和使用倾向评分进行逆概率治疗加权（IPT）。当结果为二分类时，治疗对结果的影响可以用优势比、相对风险、风险差异或需要治疗的人数来描述。一些临床评论员认为，风险差异和需要治疗的人数比优势比或相对风险更有助于临床决策。然而，关于不同倾向评分方法估计风险差异的相对性能的信息很少。我们进行了一系列蒙特卡罗模拟来研究这个问题。我们检查了偏差、方差估计、置信区间覆盖、均方误差（MSE）和 I 型错误率。IPT 的双重稳健版本与其他倾向评分方法相比具有更好的性能。它导致风险差异的无偏估计、具有最低标准误差的治疗效果、具有正确覆盖率的置信区间和正确的 I 型错误率。倾向评分分层、倾向评分匹配和使用倾向评分进行协变量调整会导致风险差异估计的轻微到适度偏差。基于 IPT 的估计量与其他倾向评分方法相比具有更低的 MSE。IPT 和倾向评分匹配之间的差异可能反映了这两种方法分别估计治疗效果和治疗效果。

相似文献

The performance of different propensity-score methods for estimating differences in proportions (risk differences or absolute risk reductions) in observational studies.

Stat Med. 2010 Sep 10;29(20):2137-48. doi: 10.1002/sim.3854.

The performance of different propensity score methods for estimating marginal hazard ratios.

Stat Med. 2013 Jul 20;32(16):2837-49. doi: 10.1002/sim.5705. Epub 2012 Dec 12.

The performance of different propensity score methods for estimating marginal odds ratios.

Stat Med. 2007 Jul 20;26(16):3078-94. doi: 10.1002/sim.2781.

The performance of inverse probability of treatment weighting and full matching on the propensity score in the presence of model misspecification when estimating the effect of treatment on survival outcomes.

Stat Methods Med Res. 2017 Aug;26(4):1654-1670. doi: 10.1177/0962280215584401. Epub 2015 Apr 30.

Variance estimation when using inverse probability of treatment weighting (IPTW) with survival analysis.

Stat Med. 2016 Dec 30;35(30):5642-5655. doi: 10.1002/sim.7084. Epub 2016 Aug 22.

Type I error rates, coverage of confidence intervals, and variance estimation in propensity-score matched analyses.

Int J Biostat. 2009 Apr 14;5(1):Article 13. doi: 10.2202/1557-4679.1146.

The performance of different propensity-score methods for estimating relative risks.

J Clin Epidemiol. 2008 Jun;61(6):537-45. doi: 10.1016/j.jclinepi.2007.07.011. Epub 2008 Feb 14.

The performance of different propensity score methods for estimating absolute effects of treatments on survival outcomes: A simulation study.

Stat Methods Med Res. 2016 Oct;25(5):2214-2237. doi: 10.1177/0962280213519716. Epub 2014 Jan 23.

Optimal caliper widths for propensity-score matching when estimating differences in means and differences in proportions in observational studies.

Pharm Stat. 2011 Mar-Apr;10(2):150-61. doi: 10.1002/pst.433.

Estimating the effect of treatment on binary outcomes using full matching on the propensity score.

Stat Methods Med Res. 2017 Dec;26(6):2505-2525. doi: 10.1177/0962280215601134. Epub 2015 Sep 1.

引用本文的文献

Real-world comparative effectiveness of biologic therapies in severe asthma: EU-ADVANTAGE.

ERJ Open Res. 2025 Aug 4;11(4). doi: 10.1183/23120541.01217-2024. eCollection 2025 Jul.

Dynamic effects of COVID-19 vaccination on major acute cardiovascular events and mortality following SARS-CoV-2 infection in a target trial emulation study.

Sci Rep. 2025 Jul 29;15(1):27530. doi: 10.1038/s41598-025-13043-x.

Effects of Screening Hemoglobin A1C on Complications in Implant-Based Breast Reconstruction.

Eplasty. 2024 Dec 4;24:e63. eCollection 2024.

Impact of long-term N-acetylcysteine use on cancer risk.

Am J Cancer Res. 2025 Feb 15;15(2):618-630. doi: 10.62347/VCDJ1296. eCollection 2025.

Inverse Probability of Treatment Weighting Using the Propensity Score With Competing Risks in Survival Analysis.

Stat Med. 2025 Feb 28;44(5):e70009. doi: 10.1002/sim.70009.

The use of propensity score matching to assess the effectiveness of the endometrial receptivity analysis in patients with recurrent implantation failure.

Front Endocrinol (Lausanne). 2025 Jan 13;15:1402575. doi: 10.3389/fendo.2024.1402575. eCollection 2024.

The Impact of Comorbid Sleep-Disordered Breathing on Hospitalization Risk Related to Diabetes and Atherosclerotic Disease: A Retrospective Cohort Analysis.

J Clin Med. 2024 Dec 18;13(24):7715. doi: 10.3390/jcm13247715.

Prognostic impact of lymph node dissection in intrahepatic cholangiocarcinoma: a propensity score analysis.

Langenbecks Arch Surg. 2024 Dec 11;410(1):3. doi: 10.1007/s00423-024-03564-w.

Conditional cash transfers and mortality in people hospitalised with psychiatric disorders: A cohort study of the Brazilian Bolsa Família Programme.

PLoS Med. 2024 Dec 2;21(12):e1004486. doi: 10.1371/journal.pmed.1004486. eCollection 2024 Dec.

Differences in the Distribution of Aβ in the Brain between U.S. Veterans and Adults aged 62+ and suffering from Alzheimer's Disease.

Ann Biostat Biom Appl. 2024;6(1). doi: 10.33552/abba.2024.06.000630. Epub 2024 Jun 26.

本文引用的文献

Type I error rates, coverage of confidence intervals, and variance estimation in propensity-score matched analyses.

Int J Biostat. 2009 Apr 14;5(1):Article 13. doi: 10.2202/1557-4679.1146.

Primer on statistical interpretation or methods report card on propensity-score matching in the cardiology literature from 2004 to 2006: a systematic review.

Circ Cardiovasc Qual Outcomes. 2008 Sep;1(1):62-7. doi: 10.1161/CIRCOUTCOMES.108.790634.

Balance diagnostics for comparing the distribution of baseline covariates between treatment groups in propensity-score matched samples.

Stat Med. 2009 Nov 10;28(25):3083-107. doi: 10.1002/sim.3697.

A substantial and confusing variation exists in handling of baseline covariates in randomized controlled trials: a review of trials published in leading medical journals.

J Clin Epidemiol. 2010 Feb;63(2):142-53. doi: 10.1016/j.jclinepi.2009.06.002. Epub 2009 Aug 27.

The relative ability of different propensity score methods to balance measured covariates between treated and untreated subjects in observational studies.

Med Decis Making. 2009 Nov-Dec;29(6):661-77. doi: 10.1177/0272989X09341755. Epub 2009 Aug 14.

Some methods of propensity-score matching had superior performance to others: results of an empirical investigation and Monte Carlo simulations.

Biom J. 2009 Feb;51(1):171-84. doi: 10.1002/bimj.200810488.

The performance of different propensity-score methods for estimating relative risks.

J Clin Epidemiol. 2008 Jun;61(6):537-45. doi: 10.1016/j.jclinepi.2007.07.011. Epub 2008 Feb 14.

A critical appraisal of propensity-score matching in the medical literature between 1996 and 2003.

Stat Med. 2008 May 30;27(12):2037-49. doi: 10.1002/sim.3150.

Propensity-score matching in the cardiovascular surgery literature from 2004 to 2006: a systematic review and suggestions for improvement.

J Thorac Cardiovasc Surg. 2007 Nov;134(5):1128-35. doi: 10.1016/j.jtcvs.2007.07.021.

The performance of different propensity score methods for estimating marginal odds ratios.

Stat Med. 2007 Jul 20;26(16):3078-94. doi: 10.1002/sim.2781.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

不同倾向评分法在观察性研究中估计比例差异（风险差异或绝对风险降低）的表现。

The performance of different propensity-score methods for estimating differences in proportions (risk differences or absolute risk reductions) in observational studies.

机构信息

Institute for Clinical Evaluative Sciences, Toronto, ON, Canada.

出版信息

Stat Med. 2010 Sep 10;29(20):2137-48. doi: 10.1002/sim.3854.

DOI:10.1002/sim.3854

PMID:20108233

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3068290/

Abstract

摘要

不同倾向评分法在观察性研究中估计比例差异（风险差异或绝对风险降低）的表现。

The performance of different propensity-score methods for estimating differences in proportions (risk differences or absolute risk reductions) in observational studies.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

不同倾向评分法在观察性研究中估计比例差异（风险差异或绝对风险降低）的表现。

The performance of different propensity-score methods for estimating differences in proportions (risk differences or absolute risk reductions) in observational studies.

机构信息

出版信息