• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多站点高维异质数据的联合学习及其在 5 个临床站点的 15000 名患者阿片类药物使用障碍研究中的应用。

Multisite learning of high-dimensional heterogeneous data with applications to opioid use disorder study of 15,000 patients across 5 clinical sites.

机构信息

Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, 423 Guardian Drive, Philadelphia, PA, 19104, USA.

Department of Biostatistics, Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA, USA.

出版信息

Sci Rep. 2022 Jun 30;12(1):11073. doi: 10.1038/s41598-022-14029-9.

DOI:10.1038/s41598-022-14029-9
PMID:35773438
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9245877/
Abstract

Integrating data across institutions can improve learning efficiency. To integrate data efficiently while protecting privacy, we propose A one-shot, summary-statistics-based, Distributed Algorithm for fitting Penalized (ADAP) regression models across multiple datasets. ADAP utilizes patient-level data from a lead site and incorporates the first-order (ADAP1) and second-order gradients (ADAP2) of the objective function from collaborating sites to construct a surrogate objective function at the lead site, where model fitting is then completed with proper regularizations applied. We evaluate the performance of the proposed method using both simulation and a real-world application to study risk factors for opioid use disorder (OUD) using 15,000 patient data from the OneFlorida Clinical Research Consortium. Our results show that ADAP performs nearly the same as the pooled estimator but achieves higher estimation accuracy and better variable selection than the local and average estimators. Moreover, ADAP2 successfully handles heterogeneity in covariate distributions.

摘要

跨机构整合数据可以提高学习效率。为了在保护隐私的同时高效地整合数据,我们提出了一种基于单样本、汇总统计的、适用于跨多个数据集的惩罚(ADAP)回归模型的分布式算法。ADAP 利用来自主导站点的患者水平数据,并结合协作站点的目标函数的一阶(ADAP1)和二阶梯度(ADAP2)来构建主导站点的替代目标函数,然后在适当的正则化应用下完成模型拟合。我们使用模拟和真实应用来评估所提出方法的性能,以使用来自 OneFlorida 临床研究联盟的 15000 名患者数据研究阿片类药物使用障碍(OUD)的风险因素。我们的结果表明,ADAP 的性能几乎与汇总估计器相同,但比本地和平均估计器具有更高的估计准确性和更好的变量选择。此外,ADAP2 成功处理了协变量分布的异质性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec4/9246956/7d903d2ceeab/41598_2022_14029_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec4/9246956/d39095ef1f12/41598_2022_14029_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec4/9246956/6064dfa551d1/41598_2022_14029_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec4/9246956/0c66891bd382/41598_2022_14029_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec4/9246956/7d903d2ceeab/41598_2022_14029_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec4/9246956/d39095ef1f12/41598_2022_14029_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec4/9246956/6064dfa551d1/41598_2022_14029_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec4/9246956/0c66891bd382/41598_2022_14029_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eec4/9246956/7d903d2ceeab/41598_2022_14029_Fig4_HTML.jpg

相似文献

1
Multisite learning of high-dimensional heterogeneous data with applications to opioid use disorder study of 15,000 patients across 5 clinical sites.多站点高维异质数据的联合学习及其在 5 个临床站点的 15000 名患者阿片类药物使用障碍研究中的应用。
Sci Rep. 2022 Jun 30;12(1):11073. doi: 10.1038/s41598-022-14029-9.
2
Communication-efficient federated learning of temporal effects on opioid use disorder with data from distributed research networks.利用分布式研究网络的数据进行通信高效的阿片类药物使用障碍时间效应联合学习。
J Am Med Inform Assoc. 2025 Apr 1;32(4):656-664. doi: 10.1093/jamia/ocae313.
3
Learning from electronic health records across multiple sites: A communication-efficient and privacy-preserving distributed algorithm.从多个站点的电子健康记录中学习:一种通信高效且隐私保护的分布式算法。
J Am Med Inform Assoc. 2020 Mar 1;27(3):376-385. doi: 10.1093/jamia/ocz199.
4
Distributed Quasi-Poisson regression algorithm for modeling multi-site count outcomes in distributed data networks.分布式准泊松回归算法在分布式数据网络中对多点计数结果进行建模。
J Biomed Inform. 2022 Jul;131:104097. doi: 10.1016/j.jbi.2022.104097. Epub 2022 May 25.
5
Identifying Clinical Risk Factors for Opioid Use Disorder using a Distributed Algorithm to Combine Real-World Data from a Large Clinical Data Research Network.使用分布式算法结合大型临床数据研究网络的真实世界数据来识别阿片类药物使用障碍的临床风险因素。
AMIA Annu Symp Proc. 2021 Jan 25;2020:1220-1229. eCollection 2020.
6
Learning from local to global: An efficient distributed algorithm for modeling time-to-event data.从局部到全局学习:一种用于建模事件时间数据的高效分布式算法。
J Am Med Inform Assoc. 2020 Jul 1;27(7):1028-1036. doi: 10.1093/jamia/ocaa044.
7
Learning from vertically distributed data across multiple sites: An efficient privacy-preserving algorithm for Cox proportional hazards model with variable selection.从多个站点的垂直分布数据中学习:一种用于具有变量选择的Cox比例风险模型的高效隐私保护算法。
J Biomed Inform. 2024 Jan;149:104581. doi: 10.1016/j.jbi.2023.104581. Epub 2023 Dec 23.
8
One-shot distributed algorithms for addressing heterogeneity in competing risks data across clinical sites.单步分布式算法用于解决临床站点间竞争风险数据的异质性。
J Biomed Inform. 2024 Feb;150:104595. doi: 10.1016/j.jbi.2024.104595. Epub 2024 Jan 18.
9
Distributed learning for heterogeneous clinical data with application to integrating COVID-19 data across 230 sites.用于异构临床数据的分布式学习及其在整合230个地点的新冠肺炎数据中的应用
NPJ Digit Med. 2022 Jun 14;5(1):76. doi: 10.1038/s41746-022-00615-8.
10
ODACH: a one-shot distributed algorithm for Cox model with heterogeneous multi-center data.ODACH:一种用于异质多中心 Cox 模型的单步分布式算法。
Sci Rep. 2022 Apr 22;12(1):6627. doi: 10.1038/s41598-022-09069-0.

引用本文的文献

1
Centralized and Federated Models for the Analysis of Clinical Data.集中式和联邦式临床数据分析模型。
Annu Rev Biomed Data Sci. 2024 Aug;7(1):179-199. doi: 10.1146/annurev-biodatasci-122220-115746. Epub 2024 Jul 24.
2
Learning across diverse biomedical data modalities and cohorts: Challenges and opportunities for innovation.跨多种生物医学数据模式和队列的学习:创新面临的挑战与机遇
Patterns (N Y). 2024 Jan 17;5(2):100913. doi: 10.1016/j.patter.2023.100913. eCollection 2024 Feb 9.

本文引用的文献

1
Individual Data Protected Integrative Regression Analysis of High-Dimensional Heterogeneous Data.高维异构数据的个体数据保护整合回归分析
J Am Stat Assoc. 2022;117(540):2105-2119. doi: 10.1080/01621459.2021.1904958. Epub 2021 May 19.
2
Communication-Efficient Accurate Statistical Estimation.通信高效的精确统计估计
J Am Stat Assoc. 2023;118(542):1000-1010. doi: 10.1080/01621459.2021.1969238. Epub 2021 Sep 24.
3
Distributed learning for heterogeneous clinical data with application to integrating COVID-19 data across 230 sites.
用于异构临床数据的分布式学习及其在整合230个地点的新冠肺炎数据中的应用
NPJ Digit Med. 2022 Jun 14;5(1):76. doi: 10.1038/s41746-022-00615-8.
4
Distributed Quasi-Poisson regression algorithm for modeling multi-site count outcomes in distributed data networks.分布式准泊松回归算法在分布式数据网络中对多点计数结果进行建模。
J Biomed Inform. 2022 Jul;131:104097. doi: 10.1016/j.jbi.2022.104097. Epub 2022 May 25.
5
dPQL: a lossless distributed algorithm for generalized linear mixed model with application to privacy-preserving hospital profiling.dPQL:一种用于广义线性混合模型的无损分布式算法及其在隐私保护医院分析中的应用。
J Am Med Inform Assoc. 2022 Jul 12;29(8):1366-1371. doi: 10.1093/jamia/ocac067.
6
ODACH: a one-shot distributed algorithm for Cox model with heterogeneous multi-center data.ODACH:一种用于异质多中心 Cox 模型的单步分布式算法。
Sci Rep. 2022 Apr 22;12(1):6627. doi: 10.1038/s41598-022-09069-0.
7
DLMM as a lossless one-shot algorithm for collaborative multi-site distributed linear mixed models.作为一种无损的一次性算法,DLMM 适用于协作式多站点分布式线性混合模型。
Nat Commun. 2022 Mar 30;13(1):1678. doi: 10.1038/s41467-022-29160-4.
8
Truly privacy-preserving federated analytics for precision medicine with multiparty homomorphic encryption.多方同态加密实现精准医学真正隐私保护的联邦分析。
Nat Commun. 2021 Oct 11;12(1):5910. doi: 10.1038/s41467-021-25972-y.
9
An efficient and accurate distributed learning algorithm for modeling multi-site zero-inflated count outcomes.一种高效准确的分布式学习算法,用于对多站点零膨胀计数结果进行建模。
Sci Rep. 2021 Oct 4;11(1):19647. doi: 10.1038/s41598-021-99078-2.
10
Emergency Department Visits for Nonfatal Opioid Overdose During the COVID-19 Pandemic Across Six US Health Care Systems.在六个美国医疗保健系统中,COVID-19 大流行期间非致命类阿片药物过量的急诊就诊情况。
Ann Emerg Med. 2022 Feb;79(2):158-167. doi: 10.1016/j.annemergmed.2021.03.013. Epub 2021 Mar 19.