• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ODAL:一种用于对来自多个临床站点的电子健康记录数据进行逻辑回归的一次性分布式算法。

ODAL: A one-shot distributed algorithm to perform logistic regressions on electronic health records data from multiple clinical sites.

作者信息

Duan Rui, Boland Mary Regina, Moore Jason H, Chen Yong

机构信息

Department of Biostatistics, Epidemiology & Informatics, University of Pennsylvania, 423 Guardian Drive, PA, 19104, USA#Co-first author.

出版信息

Pac Symp Biocomput. 2019;24:30-41.

PMID:30864308
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6417819/
Abstract

Electronic Health Records (EHR) contain extensive information on various health outcomes and risk factors, and therefore have been broadly used in healthcare research. Integrating EHR data from multiple clinical sites can accelerate knowledge discovery and risk prediction by providing a larger sample size in a more general population which potentially reduces clinical bias and improves estimation and prediction accuracy. To overcome the barrier of patient-level data sharing, distributed algorithms are developed to conduct statistical analyses across multiple sites through sharing only aggregated information. The current distributed algorithm often requires iterative information evaluation and transferring across sites, which can potentially lead to a high communication cost in practical settings. In this study, we propose a privacy-preserving and communication-efficient distributed algorithm for logistic regression without requiring iterative communications across sites. Our simulation study showed our algorithm reached comparative accuracy comparing to the oracle estimator where data are pooled together. We applied our algorithm to an EHR data from the University of Pennsylvania health system to evaluate the risks of fetal loss due to various medication exposures.

摘要

电子健康记录(EHR)包含有关各种健康结果和风险因素的广泛信息,因此已在医疗保健研究中广泛使用。整合来自多个临床站点的EHR数据可以通过在更广泛的人群中提供更大的样本量来加速知识发现和风险预测,这有可能减少临床偏差并提高估计和预测准确性。为了克服患者层面数据共享的障碍,人们开发了分布式算法,通过仅共享汇总信息来跨多个站点进行统计分析。当前的分布式算法通常需要在站点之间进行迭代信息评估和传输,这在实际应用中可能会导致高昂的通信成本。在本研究中,我们提出了一种用于逻辑回归的隐私保护且通信高效的分布式算法,该算法无需在站点之间进行迭代通信。我们的模拟研究表明,与将数据集中在一起的神谕估计器相比,我们的算法达到了相当的准确性。我们将我们的算法应用于宾夕法尼亚大学医疗系统的EHR数据,以评估各种药物暴露导致胎儿丢失的风险。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0bb/6417819/440ddedbcc75/nihms-999765-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0bb/6417819/e7df7cb5361e/nihms-999765-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0bb/6417819/eee6dc81b5d3/nihms-999765-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0bb/6417819/5d5232f27dcf/nihms-999765-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0bb/6417819/440ddedbcc75/nihms-999765-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0bb/6417819/e7df7cb5361e/nihms-999765-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0bb/6417819/eee6dc81b5d3/nihms-999765-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0bb/6417819/5d5232f27dcf/nihms-999765-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0bb/6417819/440ddedbcc75/nihms-999765-f0004.jpg

相似文献

1
ODAL: A one-shot distributed algorithm to perform logistic regressions on electronic health records data from multiple clinical sites.ODAL:一种用于对来自多个临床站点的电子健康记录数据进行逻辑回归的一次性分布式算法。
Pac Symp Biocomput. 2019;24:30-41.
2
Learning from electronic health records across multiple sites: A communication-efficient and privacy-preserving distributed algorithm.从多个站点的电子健康记录中学习:一种通信高效且隐私保护的分布式算法。
J Am Med Inform Assoc. 2020 Mar 1;27(3):376-385. doi: 10.1093/jamia/ocz199.
3
Robust-ODAL: Learning from heterogeneous health systems without sharing patient-level data.鲁棒性 ODAL:在不共享患者级数据的情况下从异构健康系统中学习。
Pac Symp Biocomput. 2020;25:695-706.
4
Learning from local to global: An efficient distributed algorithm for modeling time-to-event data.从局部到全局学习:一种用于建模事件时间数据的高效分布式算法。
J Am Med Inform Assoc. 2020 Jul 1;27(7):1028-1036. doi: 10.1093/jamia/ocaa044.
5
Identifying Clinical Risk Factors for Opioid Use Disorder using a Distributed Algorithm to Combine Real-World Data from a Large Clinical Data Research Network.使用分布式算法结合大型临床数据研究网络的真实世界数据来识别阿片类药物使用障碍的临床风险因素。
AMIA Annu Symp Proc. 2021 Jan 25;2020:1220-1229. eCollection 2020.
6
Distributed Quasi-Poisson regression algorithm for modeling multi-site count outcomes in distributed data networks.分布式准泊松回归算法在分布式数据网络中对多点计数结果进行建模。
J Biomed Inform. 2022 Jul;131:104097. doi: 10.1016/j.jbi.2022.104097. Epub 2022 May 25.
7
An efficient and accurate distributed learning algorithm for modeling multi-site zero-inflated count outcomes.一种高效准确的分布式学习算法,用于对多站点零膨胀计数结果进行建模。
Sci Rep. 2021 Oct 4;11(1):19647. doi: 10.1038/s41598-021-99078-2.
8
One-shot distributed algorithms for addressing heterogeneity in competing risks data across clinical sites.单步分布式算法用于解决临床站点间竞争风险数据的异质性。
J Biomed Inform. 2024 Feb;150:104595. doi: 10.1016/j.jbi.2024.104595. Epub 2024 Jan 18.
9
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
10
ODACH: a one-shot distributed algorithm for Cox model with heterogeneous multi-center data.ODACH:一种用于异质多中心 Cox 模型的单步分布式算法。
Sci Rep. 2022 Apr 22;12(1):6627. doi: 10.1038/s41598-022-09069-0.

引用本文的文献

1
Federated Adaptive Causal Estimation (FACE) of Target Treatment Effects.目标治疗效果的联合自适应因果估计(FACE)
J Am Stat Assoc. 2025 Mar 17. doi: 10.1080/01621459.2025.2453249.
2
DisCo-HD: Distributed causal inference with covariates shift for analyzing real-world high-dimensional data.DisCo-HD:用于分析真实世界高维数据的带协变量偏移的分布式因果推断
J Mach Learn Res. 2025;26.
3
COLA-GLM: collaborative one-shot and lossless algorithms of generalized linear models for decentralized observational healthcare data.COLA-GLM:用于分散式观察性医疗数据的广义线性模型的协作式一次性无损算法

本文引用的文献

1
DISTRIBUTED TESTING AND ESTIMATION UNDER SPARSE HIGH DIMENSIONAL MODELS.稀疏高维模型下的分布式测试与估计
Ann Stat. 2018 Jun;46(3):1352-1382. doi: 10.1214/17-AOS1587. Epub 2018 May 3.
2
Uncovering exposures responsible for birth season - disease effects: a global study.揭示导致出生季节与疾病关联的暴露因素:一项全球研究。
J Am Med Inform Assoc. 2018 Mar 1;25(3):275-288. doi: 10.1093/jamia/ocx105.
3
Development of A Machine Learning Algorithm to Classify Drugs Of Unknown Fetal Effect.开发一种机器学习算法以分类具有未知胎儿效应的药物。
NPJ Digit Med. 2025 Jul 15;8(1):442. doi: 10.1038/s41746-025-01781-1.
4
Federated Learning in Healthcare: A Benchmark Comparison of Engineering and Statistical Approaches for Structured Data Analysis.医疗保健中的联邦学习:结构化数据分析的工程方法与统计方法的基准比较
Health Data Sci. 2024 Dec 4;4:0196. doi: 10.34133/hds.0196. eCollection 2024.
5
TARGETING UNDERREPRESENTED POPULATIONS IN PRECISION MEDICINE: A FEDERATED TRANSFER LEARNING APPROACH.精准医学中针对代表性不足人群:一种联邦迁移学习方法。
Ann Appl Stat. 2023 Dec;17(4):2970-2992. doi: 10.1214/23-AOAS1747. Epub 2023 Oct 30.
6
Centralized and Federated Models for the Analysis of Clinical Data.集中式和联邦式临床数据分析模型。
Annu Rev Biomed Data Sci. 2024 Aug;7(1):179-199. doi: 10.1146/annurev-biodatasci-122220-115746. Epub 2024 Jul 24.
7
Evaluating site-of-care-related racial disparities in kidney graft failure using a novel federated learning framework.利用新型联邦学习框架评估与照护地点相关的肾移植失败中的种族差异。
J Am Med Inform Assoc. 2024 May 20;31(6):1303-1312. doi: 10.1093/jamia/ocae075.
8
Collaborative inference for treatment effect with distributed data-sharing management in multicenter studies.多中心研究中通过分布式数据共享管理进行的治疗效果协同推断。
Stat Med. 2024 May 20;43(11):2263-2279. doi: 10.1002/sim.10068. Epub 2024 Mar 29.
9
Learning competing risks across multiple hospitals: one-shot distributed algorithms.在多家医院学习竞争风险:单次分布式算法。
J Am Med Inform Assoc. 2024 Apr 19;31(5):1102-1112. doi: 10.1093/jamia/ocae027.
10
Individual Data Protected Integrative Regression Analysis of High-Dimensional Heterogeneous Data.高维异构数据的个体数据保护整合回归分析
J Am Stat Assoc. 2022;117(540):2105-2119. doi: 10.1080/01621459.2021.1904958. Epub 2021 May 19.
Sci Rep. 2017 Oct 9;7(1):12839. doi: 10.1038/s41598-017-12943-x.
4
Large-scale adverse effects related to treatment evidence standardization (LAERTES): an open scalable system for linking pharmacovigilance evidence sources with clinical data.与治疗证据标准化相关的大规模不良反应(LAERTES):一个将药物警戒证据来源与临床数据相链接的开放式可扩展系统。
J Biomed Semantics. 2017 Mar 7;8(1):11. doi: 10.1186/s13326-017-0115-3.
5
Characterizing treatment pathways at scale using the OHDSI network.使用 Observational Health Data Sciences and Informatics (OHDSI) 网络大规模描述治疗途径。
Proc Natl Acad Sci U S A. 2016 Jul 5;113(27):7329-36. doi: 10.1073/pnas.1510502113. Epub 2016 Jun 6.
6
Big data for health.健康大数据。
IEEE J Biomed Health Inform. 2015 Jul;19(4):1193-208. doi: 10.1109/JBHI.2015.2450362. Epub 2015 Jul 10.
7
WebDISCO: a web service for distributed cox model learning without patient-level data sharing.WebDISCO:一种用于分布式Cox模型学习且无需患者级数据共享的网络服务。
J Am Med Inform Assoc. 2015 Nov;22(6):1212-9. doi: 10.1093/jamia/ocv083. Epub 2015 Jul 9.
8
Birth month affects lifetime disease risk: a phenome-wide method.出生月份影响一生的疾病风险:一种全表型组方法。
J Am Med Inform Assoc. 2015 Sep;22(5):1042-53. doi: 10.1093/jamia/ocv046. Epub 2015 Jun 2.
9
PEDSnet: a National Pediatric Learning Health System.PEDSnet:国家儿科学习健康系统。
J Am Med Inform Assoc. 2014 Jul-Aug;21(4):602-6. doi: 10.1136/amiajnl-2014-002743. Epub 2014 May 12.
10
pSCANNER: patient-centered Scalable National Network for Effectiveness Research.pSCANNER:以患者为中心的可扩展全国有效性研究网络。
J Am Med Inform Assoc. 2014 Jul-Aug;21(4):621-6. doi: 10.1136/amiajnl-2014-002751. Epub 2014 Apr 29.