• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于分布式数据源水平划分数据的广义混合效应模型(GLMM)的联邦学习算法。

Federated learning algorithms for generalized mixed-effects model (GLMM) on horizontally partitioned data from distributed sources.

机构信息

School of Biomedical Informatics, UTHealth, 7000 Fannin St, Houston, 77030, TX, USA.

Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania, 3400 Civic Center Boulevard, Philadelphia, 19104, PA, USA.

出版信息

BMC Med Inform Decis Mak. 2022 Oct 16;22(1):269. doi: 10.1186/s12911-022-02014-1.

DOI:10.1186/s12911-022-02014-1
PMID:36244993
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9569919/
Abstract

OBJECTIVES

This paper developed federated solutions based on two approximation algorithms to achieve federated generalized linear mixed effect models (GLMM). The paper also proposed a solution for numerical errors and singularity issues. And showed the two proposed methods can perform well in revealing the significance of parameter in distributed datasets, comparing to a centralized GLMM algorithm from R package ('lme4') as the baseline model.

METHODS

The log-likelihood function of GLMM is approximated by two numerical methods (Laplace approximation and Gaussian Hermite approximation, abbreviated as LA and GH), which supports federated decomposition of GLMM to bring computation to data. To solve the numerical errors and singularity issues, the loss-less estimation of log-sum-exponential trick and the adaptive regularization strategy was used to tackle the problems caused by federated settings.

RESULTS

Our proposed method can handle GLMM to accommodate hierarchical data with multiple non-independent levels of observations in a federated setting. The experiment results demonstrate comparable (LA) and superior (GH) performances with simulated and real-world data.

CONCLUSION

We modified and compared federated GLMMs with different approximations, which can support researchers in analyzing versatile biomedical data to accommodate mixed effects and address non-independence due to hierarchical structures (i.e., institutes, region, country, etc.).

摘要

目的

本文基于两种近似算法开发了联邦解决方案,以实现联邦广义线性混合效应模型(GLMM)。本文还提出了解决数值误差和奇异问题的方法。并表明,与集中式 GLMM 算法(来自 R 包“lme4”)作为基准模型相比,这两种方法在揭示分布式数据集参数的重要性方面表现良好。

方法

通过两种数值方法(拉普拉斯近似和高斯-赫尔墨斯近似,简称 LA 和 GH)对 GLMM 的对数似然函数进行近似,这支持了 GLMM 的联邦分解,从而将计算带到数据中。为了解决数值误差和奇异问题,使用无损失对数和指数求和估计技巧和自适应正则化策略来解决联邦设置引起的问题。

结果

我们提出的方法可以处理 GLMM,以适应具有多个非独立观测层次的层次数据的联邦设置。实验结果表明,模拟数据和真实世界数据的性能相当(LA)和优越(GH)。

结论

我们修改并比较了具有不同近似值的联邦 GLMM,可以支持研究人员分析各种生物医学数据,以适应混合效应,并解决由于层次结构(即机构、地区、国家等)引起的非独立性问题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7cd6/9571462/a6be2f3ffc85/12911_2022_2014_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7cd6/9571462/2d2b3849a4c2/12911_2022_2014_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7cd6/9571462/49aac7c394db/12911_2022_2014_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7cd6/9571462/450cc524bad2/12911_2022_2014_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7cd6/9571462/7fb54284a791/12911_2022_2014_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7cd6/9571462/8570d06d2653/12911_2022_2014_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7cd6/9571462/3bf4787463c5/12911_2022_2014_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7cd6/9571462/a6be2f3ffc85/12911_2022_2014_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7cd6/9571462/2d2b3849a4c2/12911_2022_2014_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7cd6/9571462/49aac7c394db/12911_2022_2014_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7cd6/9571462/450cc524bad2/12911_2022_2014_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7cd6/9571462/7fb54284a791/12911_2022_2014_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7cd6/9571462/8570d06d2653/12911_2022_2014_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7cd6/9571462/3bf4787463c5/12911_2022_2014_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7cd6/9571462/a6be2f3ffc85/12911_2022_2014_Fig7_HTML.jpg

相似文献

1
Federated learning algorithms for generalized mixed-effects model (GLMM) on horizontally partitioned data from distributed sources.基于分布式数据源水平划分数据的广义混合效应模型(GLMM)的联邦学习算法。
BMC Med Inform Decis Mak. 2022 Oct 16;22(1):269. doi: 10.1186/s12911-022-02014-1.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Artificial intelligence for diagnosing exudative age-related macular degeneration.人工智能在渗出性年龄相关性黄斑变性诊断中的应用。
Cochrane Database Syst Rev. 2024 Oct 17;10(10):CD015522. doi: 10.1002/14651858.CD015522.pub2.
4
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
5
Communication-efficient federated learning of temporal effects on opioid use disorder with data from distributed research networks.利用分布式研究网络的数据进行通信高效的阿片类药物使用障碍时间效应联合学习。
J Am Med Inform Assoc. 2025 Apr 1;32(4):656-664. doi: 10.1093/jamia/ocae313.
6
Measures implemented in the school setting to contain the COVID-19 pandemic.学校为控制 COVID-19 疫情而采取的措施。
Cochrane Database Syst Rev. 2022 Jan 17;1(1):CD015029. doi: 10.1002/14651858.CD015029.
7
Federated Analysis With Differential Privacy in Oncology Research: Longitudinal Observational Study Across Hospital Data Warehouses.肿瘤学研究中具有差分隐私的联邦分析:跨医院数据仓库的纵向观察性研究
JMIR Med Inform. 2025 Jul 31;13:e59685. doi: 10.2196/59685.
8
Distributed Collaborative Data Processing Framework for Unmanned Platforms Based on Federated Edge Intelligence.基于联邦边缘智能的无人平台分布式协作数据处理框架
Sensors (Basel). 2025 Aug 1;25(15):4752. doi: 10.3390/s25154752.
9
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
10
Model-based federated learning for accurate MR image reconstruction from undersampled k-space data.基于模型的联邦学习在欠采样 k 空间数据的精确磁共振图像重建中的应用。
Comput Biol Med. 2024 Sep;180:108905. doi: 10.1016/j.compbiomed.2024.108905. Epub 2024 Jul 26.

引用本文的文献

1
Unlocking efficiency in real-world collaborative studies: a multi-site international study with one-shot lossless GLMM algorithm.在实际协作研究中提高效率:一项采用一次性无损广义线性混合模型算法的多中心国际研究。
NPJ Digit Med. 2025 Jul 19;8(1):457. doi: 10.1038/s41746-025-01846-1.
2
D3MI: an efficient and powerful federated imputation method for bias reduction in the analysis of distributed incomplete data by accounting for within-site correlation and between-site heterogeneity.D3MI:一种高效且强大的联邦插补方法,通过考虑站点内相关性和站点间异质性来减少分布式不完整数据分析中的偏差。
medRxiv. 2025 May 8:2025.05.08.25327224. doi: 10.1101/2025.05.08.25327224.
3

本文引用的文献

1
A privacy-preserving and computation-efficient federated algorithm for generalized linear mixed models to analyze correlated electronic health records data.一种用于分析相关电子健康记录数据的广义线性混合模型的隐私保护和计算高效的联邦算法。
PLoS One. 2023 Jan 17;18(1):e0280192. doi: 10.1371/journal.pone.0280192. eCollection 2023.
2
dPQL: a lossless distributed algorithm for generalized linear mixed model with application to privacy-preserving hospital profiling.dPQL:一种用于广义线性混合模型的无损分布式算法及其在隐私保护医院分析中的应用。
J Am Med Inform Assoc. 2022 Jul 12;29(8):1366-1371. doi: 10.1093/jamia/ocac067.
3
Distributed Statistical Analyses: A Scoping Review and Examples of Operational Frameworks Adapted to Health Analytics.
分布式统计分析:一项范围综述及适用于健康分析的操作框架示例
JMIR Med Inform. 2024 Nov 14;12:e53622. doi: 10.2196/53622.
4
Federated systems for automated infection surveillance: a perspective.联邦系统用于自动化感染监测:一个视角。
Antimicrob Resist Infect Control. 2024 Sep 27;13(1):113. doi: 10.1186/s13756-024-01464-8.
5
A privacy-preserving and computation-efficient federated algorithm for generalized linear mixed models to analyze correlated electronic health records data.一种用于分析相关电子健康记录数据的广义线性混合模型的隐私保护和计算高效的联邦算法。
PLoS One. 2023 Jan 17;18(1):e0280192. doi: 10.1371/journal.pone.0280192. eCollection 2023.
Privacy-Preserving Federated Learning for Internet of Medical Things Under Edge Computing.
边缘计算下用于医疗物联网的隐私保护联邦学习
IEEE J Biomed Health Inform. 2023 Feb;27(2):854-865. doi: 10.1109/JBHI.2022.3157725. Epub 2023 Feb 3.
4
Federated learning for predicting clinical outcomes in patients with COVID-19.基于联邦学习的 COVID-19 患者临床结局预测
Nat Med. 2021 Oct;27(10):1735-1743. doi: 10.1038/s41591-021-01506-3. Epub 2021 Sep 15.
5
Federated Learning of Electronic Health Records to Improve Mortality Prediction in Hospitalized Patients With COVID-19: Machine Learning Approach.用于改善COVID-19住院患者死亡率预测的电子健康记录联邦学习:机器学习方法。
JMIR Med Inform. 2021 Jan 27;9(1):e24207. doi: 10.2196/24207.
6
FL-QSAR: a federated learning-based QSAR prototype for collaborative drug discovery.FL-QSAR:基于联邦学习的合作药物发现 QSAR 原型。
Bioinformatics. 2021 Apr 1;36(22-23):5492-5498. doi: 10.1093/bioinformatics/btaa1006.
7
Treating medical data as a durable asset.将医疗数据视为一种耐用资产。
Nat Genet. 2020 Oct;52(10):1005-1010. doi: 10.1038/s41588-020-0698-y. Epub 2020 Sep 14.
8
Privacy-preserving construction of generalized linear mixed model for biomedical computation.用于生物医学计算的广义线性混合模型的隐私保护构建。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i128-i135. doi: 10.1093/bioinformatics/btaa478.
9
Re-Identification Risk in HIPAA De-Identified Datasets: The MVA Attack.《健康保险流通与责任法案》(HIPAA)去标识化数据集中的重新识别风险:多元变量分析(MVA)攻击
AMIA Annu Symp Proc. 2018 Dec 5;2018:1329-1337. eCollection 2018.
10
Re-identification Risks in HIPAA Safe Harbor Data: A study of data from one environmental health study.《健康保险流通与责任法案》安全港数据中的重新识别风险:一项对来自一项环境卫生研究数据的研究
Technol Sci. 2017;2017. Epub 2017 Aug 28.