Andrew M. Ryan, James F. Burgess, Justin B. Dimick
University of Michigan School of Public Health, 1415 Washington Heights, Ann Arbor, MI.
Veterans Affairs Boston Health Care System, US Department of Veterans Affairs; Boston University School of Public Health, Boston, MA.
Health Serv Res. 2015 Aug;50(4):1211-35. doi: 10.1111/1475-6773.12270. Epub 2014 Dec 11.
To evaluate the effects of specification choices on the accuracy of estimates in difference-in-differences (DID) models.
Process-of-care quality data from Hospital Compare between 2003 and 2009.
We performed a Monte Carlo simulation experiment to estimate the effect of an imaginary policy on quality. The experiment covered three scenarios in which the probability of treatment was (1) unrelated to pre-intervention performance; (2) positively correlated with pre-intervention levels of performance; and (3) positively correlated with pre-intervention trends in performance. We estimated alternative DID models that varied in the choice of data intervals, the comparison group, and the method of obtaining inference. We assessed estimator bias as the mean absolute deviation between estimated program effects and their true value, and we evaluated the accuracy of inference through statistical power and rates of false rejection of the null hypothesis.
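The logic of the experiment can be sketched in a few lines. The following is a minimal, self-contained illustration, not the paper's actual data-generating process: all parameter values, the logistic selection rule, and the level-dependent trend are assumptions chosen to show how correlating treatment with pre-intervention levels biases a naive DID estimate of a placebo (zero-effect) policy.

```python
import numpy as np

rng = np.random.default_rng(0)

def simulate_did(n_units=200, true_effect=0.0, level_corr=True):
    """One Monte Carlo draw: a two-period panel and the 2x2 DID estimate.

    Illustrative parameters only -- not the paper's data-generating process.
    """
    baseline = rng.normal(70, 10, n_units)        # pre-period quality score
    if level_corr:
        # Scenario (2): treatment probability rises with pre-period level
        p_treat = 1.0 / (1.0 + np.exp(-(baseline - 70.0) / 5.0))
    else:
        # Scenario (1): treatment unrelated to pre-period performance
        p_treat = np.full(n_units, 0.5)
    treated = rng.random(n_units) < p_treat
    # Level-dependent trend: high-baseline units improve faster, so a naive
    # DID on data selected on levels is biased even when the effect is zero
    trend = 2.0 + 0.05 * baseline
    post = baseline + trend + true_effect * treated + rng.normal(0, 3, n_units)
    gain = post - baseline
    return gain[treated].mean() - gain[~treated].mean()

# Bias criterion as in the abstract: mean absolute deviation from the truth
ests = np.array([simulate_did() for _ in range(500)])
bias = np.abs(ests - 0.0).mean()
```

Re-running with `level_corr=False` removes the selection, and the estimates center on the true effect of zero.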
Performance of alternative specifications varied dramatically when the probability of treatment was correlated with pre-intervention levels or trends. In these cases, propensity score matching resulted in much more accurate point estimates. The use of permutation tests resulted in lower false rejection rates for the highly biased estimators, but the use of clustered standard errors resulted in slightly lower false rejection rates for the matching estimators.
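The permutation-based inference referred to above can be illustrated with a short sketch: reassign treatment labels at the unit level, recompute the DID statistic under each reassignment, and compare the observed statistic to the resulting null distribution. This is the general idea only; the paper's exact permutation scheme may differ, and the toy data here are assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)

def did_stat(pre, post, treated):
    """2x2 difference-in-differences statistic."""
    gain = post - pre
    return gain[treated].mean() - gain[~treated].mean()

def permutation_pvalue(pre, post, treated, n_perm=1000):
    """Two-sided permutation test: shuffle treatment labels across units
    and locate the observed DID statistic in the permutation distribution."""
    observed = did_stat(pre, post, treated)
    perm_stats = np.empty(n_perm)
    for i in range(n_perm):
        shuffled = rng.permutation(treated)
        perm_stats[i] = did_stat(pre, post, shuffled)
    return np.mean(np.abs(perm_stats) >= abs(observed))

# Toy data with no true effect, so the test should rarely reject
pre = rng.normal(70, 10, 100)
post = pre + 2 + rng.normal(0, 3, 100)
treated = rng.random(100) < 0.5
p = permutation_pvalue(pre, post, treated)
```

Because the permutation distribution is built under the sharp null of no effect, this procedure controls false rejection without relying on asymptotic standard-error formulas.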
When treatment and comparison groups differed on pre-intervention levels or trends, our results supported DID specifications that use matching to obtain more accurate point estimates, together with clustered standard errors or permutation tests for more reliable inference. Based on our findings, we propose a checklist for DID analysis.
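A minimal stand-in for the matching step can be sketched as follows. For simplicity this matches each treated unit to its nearest comparison unit on standardized pre-intervention levels and trends, rather than on an estimated propensity score as in the paper; it is purely illustrative.

```python
import numpy as np

rng = np.random.default_rng(2)

def match_on_pretrends(pre_levels, pre_trends, treated):
    """Nearest-neighbor match (with replacement) of each treated unit to the
    comparison unit closest on pre-intervention level and trend.

    A simple stand-in for propensity score matching, not the paper's method.
    """
    X = np.column_stack([pre_levels, pre_trends])
    X = (X - X.mean(axis=0)) / X.std(axis=0)      # standardize both dimensions
    treated_idx = np.flatnonzero(treated)
    control_idx = np.flatnonzero(~treated)
    matches = []
    for i in treated_idx:
        dist = np.linalg.norm(X[control_idx] - X[i], axis=1)
        matches.append(control_idx[np.argmin(dist)])
    return treated_idx, np.array(matches)

# Toy usage: 50 hypothetical units with illustrative levels and trends
pre_levels = rng.normal(70, 10, 50)
pre_trends = rng.normal(2, 1, 50)
treated = rng.random(50) < 0.4
t_idx, m_idx = match_on_pretrends(pre_levels, pre_trends, treated)
```

The DID estimate is then computed on the matched pairs only, so treated and comparison groups are balanced on pre-intervention levels and trends by construction.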