对倾向得分模型是否小题大做？比较倾向得分匹配方法。

Thomson-Medstat, Ann Arbor, MI, USA.

Value Health. 2006 Nov-Dec;9(6):377-85. doi: 10.1111/j.1524-4733.2006.00130.x.

OBJECTIVE

A large number of possible techniques are available when conducting matching procedures, yet coherent guidelines for selecting the most appropriate application do not yet exist. In this article we evaluate several matching techniques and provide a suggested guideline for selecting the best technique.

METHODS

The main purpose of a matching procedure is to reduce selection bias by increasing the balance between the treatment and control groups. The following approach, consisting of five quantifiable steps, is proposed to check for balance: 1) Using two sample t-statistics to compare the means of the treatment and control groups for each explanatory variable; 2) Comparing the mean difference as a percentage of the average standard deviations; 3) Comparing percent reduction of bias in the means of the explanatory variables before and after matching; 4) Comparing treatment and control density estimates for the explanatory variables; and 5) Comparing the density estimates of the propensity scores of the control units with those of the treated units. We investigated seven different matching techniques and how they performed with regard to proposed five steps. Moreover, we estimate the average treatment effect with multivariate analysis and compared the results with the estimates of propensity score matching techniques. The Medstat MarketScan Data Base provided data for use in empirical examples of the utility of several matching methods. We conducted nearest neighborhood matching (NNM) analyses in seven ways: replacement, 2 to 1 matching, Mahalanobis matching (MM), MM with caliper, kernel matching, radius matching, and the stratification method.

RESULTS

Comparing techniques according to the above criteria revealed that the choice of matching has significant effects on outcomes. Patients with asthma are compared with patients without asthma and cost of illness ranged from 2040 dollars to 4463 dollars depending on the type of matching. After matching, we looked at the insignificant differences or larger P-values in the mean values (criterion 1); low mean differences as a percentage of the average standard deviation (criterion 2); 100% reduction bias in the means of explanatory variables (criterion 3); and insignificant differences when comparing the density estimates of the treatment and control groups (criterion 4 and criterion 5). Mahalanobis matching with caliber yielded the better results according all five criteria (Mean = 4463 dollars, SD = 3252 dollars). We also applied multivariate analysis over the matched sample. This decreased the deviation in cost of illness estimates more than threefold (Mean = 4456 dollars, SD = 996 dollars).

CONCLUSION

Sensitivity analysis of the matching techniques is especially important because none of the proposed methods in the literature is a priori superior to the others. The suggested joint consideration of propensity score matching and multivariate analysis offers an approach to assessing the robustness of the estimates.

目的

在进行匹配程序时，有大量可能的技术可供使用，但尚无关于选择最合适应用的连贯指南。在本文中，我们评估了几种匹配技术，并提供了选择最佳技术的建议指南。

方法

匹配程序的主要目的是通过提高治疗组和对照组之间的平衡性来减少选择偏倚。建议采用以下由五个可量化步骤组成的方法来检查平衡性：1）使用两个样本t统计量来比较每个解释变量在治疗组和对照组中的均值；2）将均值差异作为平均标准差的百分比进行比较；3）比较匹配前后解释变量均值的偏差减少百分比；4）比较解释变量的治疗组和对照组密度估计值；5）比较对照组单位与治疗组单位倾向得分的密度估计值。我们研究了七种不同的匹配技术以及它们在上述五个步骤中的表现。此外，我们使用多变量分析估计平均治疗效果，并将结果与倾向得分匹配技术的估计值进行比较。Medstat MarketScan数据库提供了数据，用于几种匹配方法效用的实证示例。我们以七种方式进行最近邻匹配（NNM）分析：替换、2对1匹配、马氏匹配（MM）、带卡尺的MM、核匹配、半径匹配和分层方法。

结果

根据上述标准比较技术发现，匹配的选择对结果有显著影响。将哮喘患者与非哮喘患者进行比较，疾病成本根据匹配类型在2040美元至4463美元之间。匹配后，我们查看了均值中的无显著差异或更大的P值（标准1）；均值差异作为平均标准差百分比的低值（标准2）；解释变量均值的偏差减少100%（标准3）；以及比较治疗组和对照组密度估计值时的无显著差异（标准4和标准5）。带卡尺的马氏匹配在所有五个标准下都产生了更好的结果（均值 = 4463美元，标准差 = 3252美元）。我们还对匹配样本应用了多变量分析。这使疾病成本估计的偏差降低了三倍多（均值 = 4456美元，标准差 = 996美元）。

结论

匹配技术的敏感性分析尤为重要，因为文献中提出的方法没有一种在先天条件上优于其他方法。建议联合考虑倾向得分匹配和多变量分析，提供了一种评估估计稳健性的方法。

相似文献

Too much ado about propensity score models? Comparing methods of propensity score matching.

Value Health. 2006 Nov-Dec;9(6):377-85. doi: 10.1111/j.1524-4733.2006.00130.x.

A critical appraisal of propensity-score matching in the medical literature between 1996 and 2003.

Stat Med. 2008 May 30;27(12):2037-49. doi: 10.1002/sim.3150.

One-to-many propensity score matching in cohort studies.

Pharmacoepidemiol Drug Saf. 2012 May;21 Suppl 2:69-80. doi: 10.1002/pds.3263.

Propensity score interval matching: using bootstrap confidence intervals for accommodating estimation errors of propensity scores.

BMC Med Res Methodol. 2015 Jul 28;15:53. doi: 10.1186/s12874-015-0049-3.

Methods to assess intended effects of drug treatment in observational studies are reviewed.

J Clin Epidemiol. 2004 Dec;57(12):1223-31. doi: 10.1016/j.jclinepi.2004.03.011.

[Meta-analysis of the Italian studies on short-term effects of air pollution].

Epidemiol Prev. 2001 Mar-Apr;25(2 Suppl):1-71.

Model misspecification and robustness in causal inference: comparing matching with doubly robust estimation.

Stat Med. 2012 Jul 10;31(15):1572-81. doi: 10.1002/sim.4496. Epub 2012 Feb 23.

An application of propensity score matching using claims data.

Pharmacoepidemiol Drug Saf. 2005 Jul;14(7):465-76. doi: 10.1002/pds.1062.

The concept of the marginally matched subject in propensity-score matched analyses.

Pharmacoepidemiol Drug Saf. 2009 Jun;18(6):469-82. doi: 10.1002/pds.1733.

Use of propensity score technique to account for exposure-related covariates: an example and lesson.

Med Care. 2007 Oct;45(10 Supl 2):S143-8. doi: 10.1097/MLR.0b013e318074ce79.

引用本文的文献

Estimating the under-five malaria risk in Uganda based on the nearest neighbour matched analysis technique.

Afr Health Sci. 2024 Jun;24(2):173-180. doi: 10.4314/ahs.v24i2.20.

Positive Childhood Experiences and Adult Health and Opportunity Outcomes in 4 US States.

JAMA Netw Open. 2025 Jul 1;8(7):e2524435. doi: 10.1001/jamanetworkopen.2025.24435.

Answer to Letter to the Editor of L. Dai, et al. concerning "The association between postoperative physical therapy and opioid prescription after posterior lumbar interbody fusion: a retrospective cohort study of United States academic health centers" by Baumann AN, et al. (Eur Spine J [2025]: doi: 10.1007/s00586-025-08824-x).

Eur Spine J. 2025 Jul 14. doi: 10.1007/s00586-025-09145-9.

Obesity in Italy: An Empirical Analysis of Healthcare Consumption, Quality of Life and Comorbidities.

Medicina (Kaunas). 2025 Jun 9;61(6):1061. doi: 10.3390/medicina61061061.

Open Access to Antipsychotics in State Medicaid Programs: Effect on Healthcare Resource Utilization and Costs among Patients with Serious Mental Illness.

J Health Econ Outcomes Res. 2025 Jun 17;12(1):222-229. doi: 10.36469/001c.137909. eCollection 2025.

Healthcare utilisation and economic burden of cancer on Indian households.

Sci Rep. 2025 May 14;15(1):16780. doi: 10.1038/s41598-025-01279-6.

Association of overactive bladder with all-cause and cardiovascular mortality in women: A propensity-matched NHANES study.

BJUI Compass. 2025 Apr 29;6(5):e70022. doi: 10.1002/bco2.70022. eCollection 2025 May.

Effects of rural-to-urban migration on healthcare utilization of middle-aged and older adults: evidence from the China health and retirement longitudinal study.

Front Public Health. 2025 Apr 14;13:1576285. doi: 10.3389/fpubh.2025.1576285. eCollection 2025.

Antihypertensive drugs and survival outcomes in oropharyngeal squamous cell carcinoma patients.

J Natl Cancer Inst. 2025 Jul 1;117(7):1410-1420. doi: 10.1093/jnci/djaf056.

A comparison of time-varying propensity score vs sequential stratification approaches to longitudinal matching with a time-varying treatment.

BMC Med Res Methodol. 2024 Nov 13;24(1):280. doi: 10.1186/s12874-024-02391-3.

Suppr 超能文献

核心技术专利：CN118964589B侵权必究

相似文献

Too much ado about propensity score models? Comparing methods of propensity score matching.

Value Health. 2006 Nov-Dec;9(6):377-85. doi: 10.1111/j.1524-4733.2006.00130.x.

A critical appraisal of propensity-score matching in the medical literature between 1996 and 2003.

Stat Med. 2008 May 30;27(12):2037-49. doi: 10.1002/sim.3150.

One-to-many propensity score matching in cohort studies.

Pharmacoepidemiol Drug Saf. 2012 May;21 Suppl 2:69-80. doi: 10.1002/pds.3263.

Propensity score interval matching: using bootstrap confidence intervals for accommodating estimation errors of propensity scores.

BMC Med Res Methodol. 2015 Jul 28;15:53. doi: 10.1186/s12874-015-0049-3.

Methods to assess intended effects of drug treatment in observational studies are reviewed.

J Clin Epidemiol. 2004 Dec;57(12):1223-31. doi: 10.1016/j.jclinepi.2004.03.011.

[Meta-analysis of the Italian studies on short-term effects of air pollution].

Epidemiol Prev. 2001 Mar-Apr;25(2 Suppl):1-71.

Model misspecification and robustness in causal inference: comparing matching with doubly robust estimation.

Stat Med. 2012 Jul 10;31(15):1572-81. doi: 10.1002/sim.4496. Epub 2012 Feb 23.

An application of propensity score matching using claims data.

Pharmacoepidemiol Drug Saf. 2005 Jul;14(7):465-76. doi: 10.1002/pds.1062.

The concept of the marginally matched subject in propensity-score matched analyses.

Pharmacoepidemiol Drug Saf. 2009 Jun;18(6):469-82. doi: 10.1002/pds.1733.

Use of propensity score technique to account for exposure-related covariates: an example and lesson.

Med Care. 2007 Oct;45(10 Supl 2):S143-8. doi: 10.1097/MLR.0b013e318074ce79.

引用本文的文献

Estimating the under-five malaria risk in Uganda based on the nearest neighbour matched analysis technique.

Afr Health Sci. 2024 Jun;24(2):173-180. doi: 10.4314/ahs.v24i2.20.

Positive Childhood Experiences and Adult Health and Opportunity Outcomes in 4 US States.

JAMA Netw Open. 2025 Jul 1;8(7):e2524435. doi: 10.1001/jamanetworkopen.2025.24435.

Eur Spine J. 2025 Jul 14. doi: 10.1007/s00586-025-09145-9.

Obesity in Italy: An Empirical Analysis of Healthcare Consumption, Quality of Life and Comorbidities.

Medicina (Kaunas). 2025 Jun 9;61(6):1061. doi: 10.3390/medicina61061061.

Open Access to Antipsychotics in State Medicaid Programs: Effect on Healthcare Resource Utilization and Costs among Patients with Serious Mental Illness.

J Health Econ Outcomes Res. 2025 Jun 17;12(1):222-229. doi: 10.36469/001c.137909. eCollection 2025.

Healthcare utilisation and economic burden of cancer on Indian households.

Sci Rep. 2025 May 14;15(1):16780. doi: 10.1038/s41598-025-01279-6.

Association of overactive bladder with all-cause and cardiovascular mortality in women: A propensity-matched NHANES study.

BJUI Compass. 2025 Apr 29;6(5):e70022. doi: 10.1002/bco2.70022. eCollection 2025 May.

Effects of rural-to-urban migration on healthcare utilization of middle-aged and older adults: evidence from the China health and retirement longitudinal study.

Front Public Health. 2025 Apr 14;13:1576285. doi: 10.3389/fpubh.2025.1576285. eCollection 2025.

Antihypertensive drugs and survival outcomes in oropharyngeal squamous cell carcinoma patients.

J Natl Cancer Inst. 2025 Jul 1;117(7):1410-1420. doi: 10.1093/jnci/djaf056.

A comparison of time-varying propensity score vs sequential stratification approaches to longitudinal matching with a time-varying treatment.

BMC Med Res Methodol. 2024 Nov 13;24(1):280. doi: 10.1186/s12874-024-02391-3.

Too much ado about propensity score models? Comparing methods of propensity score matching.

作者信息

机构信息

出版信息

OBJECTIVE

METHODS

RESULTS

CONCLUSION

目的

方法

结果

结论

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献