Huth Manuel, Garavito Carolina Alvarez, Seep Lea, Cirera Laia, Saúte Francisco, Sicuri Elisa, Hasenauer Jan
Institute for Computational Biology, Helmholtz Munich - German Research Center for Environmental Health, Munich, Germany.
LIMES, Faculty of Mathematics and Natural Sciences, University of Bonn, Bonn, Germany.
iScience. 2024 Oct 9;27(11):111025. doi: 10.1016/j.isci.2024.111025. eCollection 2024 Nov 15.
Difference-in-differences (DID) is a key tool for causal impact evaluation but faces challenges when applied to sensitive data restricted by privacy regulations. Obtaining consent can shrink sample sizes and reduce statistical power, limiting the analysis's effectiveness. Federated learning addresses these issues by sharing aggregated statistics rather than individual data, though advanced federated DID software is limited. We developed a federated version of the Callaway and Sant'Anna difference-in-differences (CSDID), integrated into the DataSHIELD platform, adhering to stringent privacy protocols. Our approach reproduces key estimates and standard errors while preserving confidentiality. Using simulated and real-world data from a malaria intervention in Mozambique, we demonstrate that federated estimates increase sample sizes, reduce estimation uncertainty, and enable analyses when data owners cannot share treated or untreated group data. Our work contributes to facilitating the evaluation of policy interventions or treatments across centers and borders.
双重差分法(DID)是因果影响评估的关键工具,但在应用于受隐私法规限制的敏感数据时面临挑战。获得同意会缩小样本规模并降低统计效力,限制分析的有效性。联邦学习通过共享汇总统计数据而非个体数据来解决这些问题,不过先进的联邦DID软件有限。我们开发了Callaway和Sant'Anna双重差分法(CSDID)的联邦版本,并将其集成到DataSHIELD平台中,遵循严格的隐私协议。我们的方法在保持保密性的同时重现关键估计值和标准误差。利用来自莫桑比克疟疾干预的模拟数据和真实世界数据,我们证明联邦估计增加了样本规模,降低了估计不确定性,并在数据所有者无法共享处理组或未处理组数据时能够进行分析。我们的工作有助于促进跨中心和跨边界的政策干预或治疗评估。