• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于协作式健康数据分析的通信高效联邦广义张量分解

Communication Efficient Federated Generalized Tensor Factorization for Collaborative Health Data Analytics.

作者信息

Ma Jing, Zhang Qiuchen, Lou Jian, Xiong Li, Ho Joyce C

机构信息

Emory University.

出版信息

Proc Int World Wide Web Conf. 2021 Apr;2021:171-182. doi: 10.1145/3442381.3449832.

DOI:10.1145/3442381.3449832
PMID:34467367
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8404412/
Abstract

Modern healthcare systems knitted by a web of entities (e.g., hospitals, clinics, pharmacy companies) are collecting a huge volume of healthcare data from a large number of individuals with various medical procedures, medications, diagnosis, and lab tests. To extract meaningful medical concepts (i.e., phenotypes) from such higher-arity relational healthcare data, tensor factorization has been proven to be an effective approach and received increasing research attention, due to their intrinsic capability to represent the high-dimensional data. Recently, federated learning offers a privacy-preserving paradigm for collaborative learning among different entities, which seemingly provides an ideal potential to further enhance the tensor factorization-based collaborative phenotyping to handle sensitive personal health data. However, existing attempts to federated tensor factorization come with various limitations, including restrictions to the classic tensor factorization, high communication cost and reduced accuracy. We propose a federated tensor factorization, which is flexible enough to choose from a variate of losses to best suit different types of data in practice. We design a three-level communication reduction strategy tailored to the generalized tensor factorization, which is able to reduce the uplink communication cost up to 99.90%. In addition, we theoretically prove that our algorithm does not compromise convergence speed despite the aggressive communication compression. Extensive experiments on two real-world electronics health record datasets demonstrate the efficiency improvements in terms of computation and communication cost.

摘要

由众多实体(如医院、诊所、制药公司)构成的现代医疗系统正在从大量个体那里收集海量的医疗数据,这些数据涵盖了各种医疗程序、药物治疗、诊断以及实验室检测。为了从这种高维关系型医疗数据中提取有意义的医学概念(即表型),张量分解已被证明是一种有效的方法,并受到了越来越多的研究关注,这是因为它具有表示高维数据的内在能力。最近,联邦学习为不同实体之间的协作学习提供了一种隐私保护范式,这似乎为进一步增强基于张量分解的协作表型分析以处理敏感的个人健康数据提供了理想的潜力。然而,现有的联邦张量分解尝试存在各种局限性,包括对经典张量分解的限制、高通信成本以及准确性降低。我们提出了一种联邦张量分解方法,它足够灵活,可以从各种损失函数中进行选择,以在实践中最适合不同类型的数据。我们针对广义张量分解设计了一种三级通信减少策略,该策略能够将上行通信成本降低高达99.90%。此外,我们从理论上证明,尽管进行了激进的通信压缩,我们的算法也不会影响收敛速度。在两个真实世界的电子健康记录数据集上进行的大量实验证明了在计算和通信成本方面的效率提升。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/739c/8404412/9d10b17a3e8a/nihms-1722423-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/739c/8404412/7b7dd0904870/nihms-1722423-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/739c/8404412/616ad2a3f345/nihms-1722423-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/739c/8404412/564d63e7bf2e/nihms-1722423-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/739c/8404412/0b9d95e8ff8e/nihms-1722423-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/739c/8404412/a12d7f228a62/nihms-1722423-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/739c/8404412/20007525d132/nihms-1722423-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/739c/8404412/6764ead32b75/nihms-1722423-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/739c/8404412/4e40684a4290/nihms-1722423-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/739c/8404412/9d10b17a3e8a/nihms-1722423-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/739c/8404412/7b7dd0904870/nihms-1722423-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/739c/8404412/616ad2a3f345/nihms-1722423-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/739c/8404412/564d63e7bf2e/nihms-1722423-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/739c/8404412/0b9d95e8ff8e/nihms-1722423-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/739c/8404412/a12d7f228a62/nihms-1722423-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/739c/8404412/20007525d132/nihms-1722423-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/739c/8404412/6764ead32b75/nihms-1722423-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/739c/8404412/4e40684a4290/nihms-1722423-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/739c/8404412/9d10b17a3e8a/nihms-1722423-f0007.jpg

相似文献

1
Communication Efficient Federated Generalized Tensor Factorization for Collaborative Health Data Analytics.用于协作式健康数据分析的通信高效联邦广义张量分解
Proc Int World Wide Web Conf. 2021 Apr;2021:171-182. doi: 10.1145/3442381.3449832.
2
Communication Efficient Tensor Factorization for Decentralized Healthcare Networks.用于分散式医疗网络的通信高效张量分解
Proc IEEE Int Conf Data Min. 2021 Dec;2021:1216-1221. doi: 10.1109/icdm51629.2021.00147. Epub 2022 Jan 24.
3
Privacy-Preserving Tensor Factorization for Collaborative Health Data Analysis.用于协作式健康数据分析的隐私保护张量分解
Proc ACM Int Conf Inf Knowl Manag. 2019 Nov;2019:1291-1300. doi: 10.1145/3357384.3357878.
4
Federated Tensor Factorization for Computational Phenotyping.用于计算表型分析的联邦张量分解
KDD. 2017 Aug;2017:887-895. doi: 10.1145/3097983.3098118.
5
Rubik: Knowledge Guided Tensor Factorization and Completion for Health Data Analytics.鲁比克:用于健康数据分析的知识引导张量分解与补全
KDD. 2015 Aug;2015:1265-1274. doi: 10.1145/2783258.2783395.
6
TASTE: Temporal and Static Tensor Factorization for Phenotyping Electronic Health Records.TASTE:用于电子健康记录表型分析的时间和静态张量分解
Proc ACM Conf Health Inference Learn (2020). 2020 Apr;2020:193-203. doi: 10.1145/3368555.3384464.
7
Privacy-Preserving Asynchronous Vertical Federated Learning Algorithms for Multiparty Collaborative Learning.用于多方协作学习的隐私保护异步垂直联邦学习算法
IEEE Trans Neural Netw Learn Syst. 2022 Nov;33(11):6103-6115. doi: 10.1109/TNNLS.2021.3072238. Epub 2022 Oct 27.
8
Distributed Tensor Decomposition for Large Scale Health Analytics.用于大规模健康分析的分布式张量分解
Proc Int World Wide Web Conf. 2019 May;2019:659-669. doi: 10.1145/3308558.3313548.
9
A privacy-preserving and computation-efficient federated algorithm for generalized linear mixed models to analyze correlated electronic health records data.一种用于分析相关电子健康记录数据的广义线性混合模型的隐私保护和计算高效的联邦算法。
PLoS One. 2023 Jan 17;18(1):e0280192. doi: 10.1371/journal.pone.0280192. eCollection 2023.
10
Boosted federated learning based on improved Particle Swarm Optimization for healthcare IoT devices.基于改进粒子群优化算法的联邦学习在医疗保健物联网设备中的应用。
Comput Biol Med. 2023 Sep;163:107195. doi: 10.1016/j.compbiomed.2023.107195. Epub 2023 Jun 22.

引用本文的文献

1
Cross-silo Federated Learning with Record-level Personalized Differential Privacy.具有记录级个性化差分隐私的跨孤岛联邦学习
Conf Comput Commun Secur. 2024 Oct;2024:303-317. doi: 10.1145/3658644.3670351. Epub 2024 Dec 9.
2
Federated Learning in Healthcare: A Benchmark Comparison of Engineering and Statistical Approaches for Structured Data Analysis.医疗保健中的联邦学习:结构化数据分析的工程方法与统计方法的基准比较
Health Data Sci. 2024 Dec 4;4:0196. doi: 10.34133/hds.0196. eCollection 2024.
3
Application of privacy protection technology to healthcare big data.

本文引用的文献

1
An Uplink Communication-Efficient Approach to Featurewise Distributed Sparse Optimization With Differential Privacy.
IEEE Trans Neural Netw Learn Syst. 2021 Oct;32(10):4529-4543. doi: 10.1109/TNNLS.2020.3020955. Epub 2021 Oct 5.
2
Privacy-Preserving Tensor Factorization for Collaborative Health Data Analysis.用于协作式健康数据分析的隐私保护张量分解
Proc ACM Int Conf Inf Knowl Manag. 2019 Nov;2019:1291-1300. doi: 10.1145/3357384.3357878.
3
Rubik: Knowledge Guided Tensor Factorization and Completion for Health Data Analytics.鲁比克:用于健康数据分析的知识引导张量分解与补全
隐私保护技术在医疗大数据中的应用。
Digit Health. 2024 Nov 4;10:20552076241282242. doi: 10.1177/20552076241282242. eCollection 2024 Jan-Dec.
4
Recent methodological advances in federated learning for healthcare.医疗保健领域联邦学习的最新方法进展。
Patterns (N Y). 2024 Jun 14;5(6):101006. doi: 10.1016/j.patter.2024.101006.
5
Creating High-Quality Synthetic Health Data: Framework for Model Development and Validation.创建高质量合成健康数据:模型开发与验证框架。
JMIR Form Res. 2024 Apr 22;8:e53241. doi: 10.2196/53241.
6
Federated and distributed learning applications for electronic health records and structured medical data: a scoping review.联邦学习和分布式学习在电子健康记录和结构化医疗数据中的应用:范围综述。
J Am Med Inform Assoc. 2023 Nov 17;30(12):2041-2049. doi: 10.1093/jamia/ocad170.
7
A Review of Privacy Enhancement Methods for Federated Learning in Healthcare Systems.联邦学习中增强医疗系统隐私保护的方法综述
Int J Environ Res Public Health. 2023 Aug 7;20(15):6539. doi: 10.3390/ijerph20156539.
8
Federated Learning in Health care Using Structured Medical Data.利用结构化医疗数据进行医疗保健中的联邦学习。
Adv Kidney Dis Health. 2023 Jan;30(1):4-16. doi: 10.1053/j.akdh.2022.11.007.
9
Communication Efficient Tensor Factorization for Decentralized Healthcare Networks.用于分散式医疗网络的通信高效张量分解
Proc IEEE Int Conf Data Min. 2021 Dec;2021:1216-1221. doi: 10.1109/icdm51629.2021.00147. Epub 2022 Jan 24.
KDD. 2015 Aug;2015:1265-1274. doi: 10.1145/2783258.2783395.
4
Distributed Tensor Decomposition for Large Scale Health Analytics.用于大规模健康分析的分布式张量分解
Proc Int World Wide Web Conf. 2019 May;2019:659-669. doi: 10.1145/3308558.3313548.
5
Federated Tensor Factorization for Computational Phenotyping.用于计算表型分析的联邦张量分解
KDD. 2017 Aug;2017:887-895. doi: 10.1145/3097983.3098118.
6
Variance Reduction in Stochastic Gradient Langevin Dynamics.随机梯度朗之万动力学中的方差缩减
Adv Neural Inf Process Syst. 2016 Dec;29:1154-1162.
7
MIMIC-III, a freely accessible critical care database.MIMIC-III,一个免费获取的重症监护数据库。
Sci Data. 2016 May 24;3:160035. doi: 10.1038/sdata.2016.35.
8
Turbo-SMT: Accelerating Coupled Sparse Matrix-Tensor Factorizations by 200×.Turbo-SMT:将耦合稀疏矩阵-张量分解加速200倍。
Proc SIAM Int Conf Data Min. 2014;2014:118-126. doi: 10.1137/1.9781611973440.14.
9
Limestone: high-throughput candidate phenotype generation via tensor factorization.石灰岩:通过张量分解进行高通量候选表型生成。
J Biomed Inform. 2014 Dec;52:199-211. doi: 10.1016/j.jbi.2014.07.001. Epub 2014 Jul 16.