Tong Jiayi, Chen Zhaoyi, Duan Rui, Lo-Ciganic Wei-Hsuan, Lyu Tianchen, Tao Cui, Merkel Peter A, Kranzler Henry R, Bian Jiang, Chen Yong
Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, The University of Pennsylvania, Philadelphia, PA, USA.
Department of Epidemiology, College of Medicine & College of Public Health and Health Professions, University of Florida, Gainesville, FL, USA.
AMIA Annu Symp Proc. 2021 Jan 25;2020:1220-1229. eCollection 2020.
Electronic health record (EHR) systems contain detailed individual-level data on patient characteristics, including medical conditions and treatment histories, and have therefore been widely adopted as an efficient data source for health research. Compared with data from a single health system, real-world data (RWD) from multiple clinical sites provide a larger and more generalizable population for estimation, which supports better health care decision making. However, concerns over patient privacy make it challenging to share individual patient-level data across sites in practice. To address this issue, many distributed algorithms have been developed that transfer only summary-level statistics while still deriving accurate estimates. Many of these algorithms, however, require multiple rounds of communication to exchange intermediate results across sites. The One-shot Distributed Algorithm for Logistic regression (ODAL) was developed to reduce this communication overhead while protecting patient privacy. In this paper, we applied ODAL to RWD from a large clinical data research network, the OneFlorida Clinical Research Consortium, and estimated the associations between risk factors and the diagnosis of opioid use disorder (OUD) among individuals who received at least one opioid prescription. ODAL provided consistent findings for the associated risk factors and yielded better estimates than meta-analysis.
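The one-shot idea can be illustrated with a minimal sketch. The abstract does not give the estimator, so the code assumes a surrogate-likelihood construction: a lead site keeps its own patient-level data, every other site sends only its sample size and the gradient of its average log-likelihood evaluated at an initial estimate, and the lead site maximizes its local log-likelihood plus a linear correction built from those gradients. All function names, the simulated data, the Newton solver, and the choice of the lead site's local fit as the initial estimate are illustrative assumptions, not the authors' implementation.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def avg_gradient(beta, X, y):
    """Gradient of the average logistic log-likelihood at beta."""
    return X.T @ (y - sigmoid(X @ beta)) / len(y)


def newton_fit(X, y, shift=None, init=None, n_iter=100, tol=1e-10):
    """Maximize (average log-likelihood + shift . beta) by Newton's method."""
    d = X.shape[1]
    shift = np.zeros(d) if shift is None else shift
    beta = np.zeros(d) if init is None else init.copy()
    for _ in range(n_iter):
        p = sigmoid(X @ beta)
        grad = X.T @ (y - p) / len(y) + shift
        # Negative Hessian of the average log-likelihood (the linear
        # shift term does not change the curvature).
        info = (X * (p * (1 - p))[:, None]).T @ X / len(y)
        step = np.linalg.solve(info, grad)
        beta = beta + step
        if np.max(np.abs(step)) < tol:
            break
    return beta


def odal(lead, site_summaries, beta_bar):
    """One-shot surrogate estimate (assumed form, see lead-in).

    lead           : (X, y) patient-level data held at the lead site
    site_summaries : list of (n_j, gradient_j) sent once by the other sites
    beta_bar       : initial estimate at which the gradients were evaluated
    """
    X1, y1 = lead
    n1 = len(y1)
    total_n = n1 + sum(n for n, _ in site_summaries)
    local_grad = avg_gradient(beta_bar, X1, y1)
    pooled_grad = (n1 * local_grad + sum(n * g for n, g in site_summaries)) / total_n
    # Surrogate: L_1(beta) + (pooled_grad - local_grad) . beta
    return newton_fit(X1, y1, shift=pooled_grad - local_grad, init=beta_bar)


# Simulated, homogeneous sites (purely illustrative data).
rng = np.random.default_rng(0)
true_beta = np.array([-1.0, 0.5, -0.5])


def simulate_site(n):
    X = np.column_stack([np.ones(n), rng.normal(size=(n, 2))])
    y = rng.binomial(1, sigmoid(X @ true_beta)).astype(float)
    return X, y


sites = [simulate_site(2000) for _ in range(5)]
lead, others = sites[0], sites[1:]

beta_bar = newton_fit(*lead)  # initial estimate from the lead site only
# The only communication: each site sends (n_j, gradient at beta_bar).
summaries = [(len(y), avg_gradient(beta_bar, X, y)) for X, y in others]
beta_odal = odal(lead, summaries, beta_bar)

# Pooled fit, available here only because the data are simulated.
X_all = np.vstack([X for X, _ in sites])
y_all = np.concatenate([y for _, y in sites])
beta_pooled = newton_fit(X_all, y_all)

print("local :", beta_bar)
print("ODAL  :", beta_odal)
print("pooled:", beta_pooled)
```

In this sketch each non-lead site transmits one gradient vector and one count, which is what keeps communication to a single round. The pooled fit is shown only as a benchmark; in a real network it is exactly the quantity that cannot be computed directly.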