评价在小样本量情况下估计边缘比值比的倾向评分方法。

Evaluation of the propensity score methods for estimating marginal odds ratios in case of small sample size.

机构信息

Service de Biostatistique et Information Médicale, Hôpital Saint-Louis, UMR-S717 Inserm; Sorbonne Paris Cité, Université Paris Diderot, 1 avenue Claude Vellefaux, Paris 75010, France.

出版信息

BMC Med Res Methodol. 2012 May 30;12:70. doi: 10.1186/1471-2288-12-70.

DOI:10.1186/1471-2288-12-70

PMID:22646911

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3511219/

Abstract

BACKGROUND

Propensity score (PS) methods are increasingly used, even when sample sizes are small or treatments are seldom used. However, the relative performance of the two mainly recommended PS methods, namely PS-matching or inverse probability of treatment weighting (IPTW), have not been studied in the context of small sample sizes.

METHODS

We conducted a series of Monte Carlo simulations to evaluate the influence of sample size, prevalence of treatment exposure, and strength of the association between the variables and the outcome and/or the treatment exposure, on the performance of these two methods.

RESULTS

Decreasing the sample size from 1,000 to 40 subjects did not substantially alter the Type I error rate, and led to relative biases below 10%. The IPTW method performed better than the PS-matching down to 60 subjects. When N was set at 40, the PS matching estimators were either similarly or even less biased than the IPTW estimators. Including variables unrelated to the exposure but related to the outcome in the PS model decreased the bias and the variance as compared to models omitting such variables. Excluding the true confounder from the PS model resulted, whatever the method used, in a significantly biased estimation of treatment effect. These results were illustrated in a real dataset.

CONCLUSION

Even in case of small study samples or low prevalence of treatment, PS-matching and IPTW can yield correct estimations of treatment effect unless the true confounders and the variables related only to the outcome are not included in the PS model.

摘要

背景

倾向评分（PS）方法越来越多地被使用，即使在样本量较小或治疗方法很少使用的情况下。然而，在样本量较小的情况下，两种主要推荐的 PS 方法，即 PS 匹配或治疗反概率加权（IPTW）的相对性能尚未得到研究。

方法

我们进行了一系列蒙特卡罗模拟，以评估样本量、治疗暴露的流行率以及变量与结局和/或治疗暴露之间的关联强度对这两种方法性能的影响。

结果

将样本量从 1000 减少到 40 个，对 I 型错误率没有显著影响，并导致相对偏差低于 10%。IPTW 方法的性能优于 PS 匹配，直至 60 个样本。当 N 设置为 40 时，PS 匹配估计值与 IPTW 估计值的偏差要么相似，要么甚至更小。在 PS 模型中纳入与暴露无关但与结局相关的变量，与排除此类变量的模型相比，会降低偏差和方差。无论使用哪种方法，将真正的混杂因素从 PS 模型中排除，都会导致治疗效果的估计产生显著偏差。这些结果在一个真实的数据集得到了说明。

结论

即使在研究样本量较小或治疗方法流行率较低的情况下，PS 匹配和 IPTW 也可以得出正确的治疗效果估计，除非真正的混杂因素和仅与结局相关的变量未被纳入 PS 模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6130/3511219/c2d9535c8127/1471-2288-12-70-1.jpg

相似文献

Evaluation of the propensity score methods for estimating marginal odds ratios in case of small sample size.

BMC Med Res Methodol. 2012 May 30;12:70. doi: 10.1186/1471-2288-12-70.

The performance of inverse probability of treatment weighting and full matching on the propensity score in the presence of model misspecification when estimating the effect of treatment on survival outcomes.

Stat Methods Med Res. 2017 Aug;26(4):1654-1670. doi: 10.1177/0962280215584401. Epub 2015 Apr 30.

Propensity score trimming mitigates bias due to covariate measurement error in inverse probability of treatment weighted analyses: A plasmode simulation.

Stat Med. 2021 Apr;40(9):2101-2112. doi: 10.1002/sim.8887. Epub 2021 Feb 23.

The performance of different propensity score methods for estimating marginal odds ratios.

Stat Med. 2007 Jul 20;26(16):3078-94. doi: 10.1002/sim.2781.

The performance of different propensity-score methods for estimating differences in proportions (risk differences or absolute risk reductions) in observational studies.

Stat Med. 2010 Sep 10;29(20):2137-48. doi: 10.1002/sim.3854.

On the use of propensity scores in case of rare exposure.

BMC Med Res Methodol. 2016 Mar 31;16:38. doi: 10.1186/s12874-016-0135-1.

The performance of different propensity score methods for estimating marginal hazard ratios.

Stat Med. 2013 Jul 20;32(16):2837-49. doi: 10.1002/sim.5705. Epub 2012 Dec 12.

An evaluation of inverse probability weighting using the propensity score for baseline covariate adjustment in smaller population randomised controlled trials with a continuous outcome.

BMC Med Res Methodol. 2020 Mar 23;20(1):70. doi: 10.1186/s12874-020-00947-7.

A comparison of the ability of different propensity score models to balance measured variables between treated and untreated subjects: a Monte Carlo study.

Stat Med. 2007 Feb 20;26(4):734-53. doi: 10.1002/sim.2580.

Propensity score methods and regression adjustment for analysis of nonrandomized studies with health-related quality of life outcomes.

Pharmacoepidemiol Drug Saf. 2019 May;28(5):690-699. doi: 10.1002/pds.4756. Epub 2019 Feb 19.

引用本文的文献

Use of Machine Learning to Compare Disease Risk Scores and Propensity Scores Across Complex Confounding Scenarios: A Simulation Study.

Pharmacoepidemiol Drug Saf. 2025 Jun;34(6):e70165. doi: 10.1002/pds.70165.

Comparison of Postoperative Nausea and Vomiting Between Sedation with Remimazolam and Dexmedetomidine in Transcatheter Aortic Valve Replacement Patients: A Single-Center Retrospective Observational Study.

J Clin Med. 2025 Mar 5;14(5):1759. doi: 10.3390/jcm14051759.

Treatment Patterns and Healthcare Resource Utilization by Gender and Migraine Frequency in Adult Patients Receiving Galcanezumab Versus Standard of Care Preventive Medications Over 24 months: A United States Retrospective Claims Study.

Patient Prefer Adherence. 2025 Mar 1;19:543-567. doi: 10.2147/PPA.S492300. eCollection 2025.

Long-term clinical outcomes in patients between the age of 50-70 years receiving biological versus mechanical aortic valve prostheses.

Eur J Cardiothorac Surg. 2025 Feb 4;67(2). doi: 10.1093/ejcts/ezaf033.

Dynamic assessment of signal entropy for prognostication and secondary brain insult detection after traumatic brain injury.

Crit Care. 2024 Dec 30;28(1):436. doi: 10.1186/s13054-024-05228-z.

Endovascular Treatment of Patients With Acute Ischemic Stroke With Tandem Lesions Presenting With Low Alberta Stroke Program Early Computed Tomography Score.

J Am Heart Assoc. 2024 Nov 19;13(22):e035977. doi: 10.1161/JAHA.124.035977. Epub 2024 Nov 7.

Persistent delirium is associated with cerebrospinal fluid markers of neuronal injury.

Brain Commun. 2024 Sep 18;6(5):fcae319. doi: 10.1093/braincomms/fcae319. eCollection 2024.

Early statin use is associated with improved survival and cardiovascular outcomes in patients with atrial fibrillation and recent ischaemic stroke: A propensity-matched analysis of a global federated health database.

Eur Stroke J. 2025 Mar;10(1):116-127. doi: 10.1177/23969873241274213. Epub 2024 Sep 10.

Challenges in optimizing the treatment of Pneumocystis pneumonia in the intensive care unit.

Intensive Care Med. 2024 Oct;50(10):1719-1720. doi: 10.1007/s00134-024-07568-4. Epub 2024 Aug 5.

Comparison of antimicrobial therapy termination in febrile and afebrile patients with acute cholangitis after drainage.

Sci Rep. 2024 Aug 1;14(1):17858. doi: 10.1038/s41598-024-68999-z.

本文引用的文献

Model misspecification and robustness in causal inference: comparing matching with doubly robust estimation.

Stat Med. 2012 Jul 10;31(15):1572-81. doi: 10.1002/sim.4496. Epub 2012 Feb 23.

Optimal caliper widths for propensity-score matching when estimating differences in means and differences in proportions in observational studies.

Pharm Stat. 2011 Mar-Apr;10(2):150-61. doi: 10.1002/pst.433.

Propensity scores in intensive care and anaesthesiology literature: a systematic review.

Intensive Care Med. 2010 Dec;36(12):1993-2003. doi: 10.1007/s00134-010-1991-5. Epub 2010 Aug 6.

Reasons for refusal of admission to intensive care and impact on mortality.

Intensive Care Med. 2010 Oct;36(10):1772-1779. doi: 10.1007/s00134-010-1933-2. Epub 2010 Jun 9.

Long-term safety and efficacy of stenting versus coronary artery bypass grafting for unprotected left main coronary artery disease: 5-year results from the MAIN-COMPARE (Revascularization for Unprotected Left Main Coronary Artery Stenosis: Comparison of Percutaneous Coronary Angioplasty Versus Surgical Revascularization) registry.

J Am Coll Cardiol. 2010 Jul 6;56(2):117-24. doi: 10.1016/j.jacc.2010.04.004. Epub 2010 May 6.

Tandem autologous non-myeloablative allogeneic transplantation in patients with multiple myeloma relapsing after a first high dose therapy.

Bone Marrow Transplant. 2011 Feb;46(2):250-6. doi: 10.1038/bmt.2010.90. Epub 2010 Apr 19.

Long-term TNF-alpha blockade in patients with amyloid A amyloidosis complicating rheumatic diseases.

Am J Med. 2010 May;123(5):454-61. doi: 10.1016/j.amjmed.2009.11.010.

Estimators and confidence intervals for the marginal odds ratio using logistic regression and propensity score stratification.

Stat Med. 2010 Mar 30;29(7-8):760-9. doi: 10.1002/sim.3811.

Balance diagnostics for comparing the distribution of baseline covariates between treatment groups in propensity-score matched samples.

Stat Med. 2009 Nov 10;28(25):3083-107. doi: 10.1002/sim.3697.

The relative ability of different propensity score methods to balance measured covariates between treated and untreated subjects in observational studies.

Med Decis Making. 2009 Nov-Dec;29(6):661-77. doi: 10.1177/0272989X09341755. Epub 2009 Aug 14.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

评价在小样本量情况下估计边缘比值比的倾向评分方法。

Evaluation of the propensity score methods for estimating marginal odds ratios in case of small sample size.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSION

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献