用于分散式不完整纵向行为数据的联邦模糊聚类

Federated Fuzzy Clustering for Decentralized Incomplete Longitudinal Behavioral Data.

作者信息

Ngo Hieu, Fang Hua, Rumbut Joshua, Wang Honggang

机构信息

College of Engineering, University of Massachusetts Dartmouth, North Dartmouth, MA, 02747.

Department of Computer and Information Science, University of Massachusetts Dartmouth, North Dartmouth, MA, 02747 and the Department of Population and Quantitative Health Science, University of Massachusetts Chan Medical School, Worcester, MA 01655 USA.

出版信息

IEEE Internet Things J. 2024 Apr 15;11(8):14657-14670. doi: 10.1109/jiot.2023.3343719. Epub 2023 Dec 18.

DOI:10.1109/jiot.2023.3343719

PMID:38605934

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11006372/

Abstract

The use of medical data for machine learning, including unsupervised methods such as clustering, is often restricted by privacy regulations such as the Health Insurance Portability and Accountability Act (HIPAA). Medical data is sensitive and highly regulated and anonymization is often insufficient to protect a patient's identity. Traditional clustering algorithms are also unsuitable for longitudinal behavioral health trials, which often have missing data and observe individual behaviors over varying time periods. In this work, we develop a new decentralized federated multiple imputation-based fuzzy clustering algorithm for complex longitudinal behavioral trial data collected from multisite randomized controlled trials over different time periods. Federated learning (FL) preserves privacy by aggregating model parameters instead of data. Unlike previous FL methods, this proposed algorithm requires only two rounds of communication and handles clients with varying numbers of time points for incomplete longitudinal data. The model is evaluated on both empirical longitudinal dietary health data and simulated clusters with different numbers of clients, effect sizes, correlations, and sample sizes. The proposed algorithm converges rapidly and achieves desirable performance on multiple clustering metrics. This new method allows for targeted treatments for various patient groups while preserving their data privacy and enables the potential for broader applications in the Internet of Medical Things.

摘要

将医学数据用于机器学习，包括诸如聚类等无监督方法，通常受到隐私法规的限制，如《健康保险流通与责任法案》（HIPAA）。医学数据敏感且受到严格监管，匿名化往往不足以保护患者身份。传统的聚类算法也不适用于纵向行为健康试验，这类试验常常存在数据缺失的情况，并且要在不同时间段观察个体行为。在这项工作中，我们针对从不同时间段的多中心随机对照试验收集的复杂纵向行为试验数据，开发了一种新的基于分散式联邦多重插补的模糊聚类算法。联邦学习（FL）通过聚合模型参数而非数据来保护隐私。与先前的FL方法不同，该算法仅需两轮通信，并且能处理具有不同时间点数的不完整纵向数据的客户端。该模型在经验性纵向饮食健康数据以及具有不同客户端数量、效应大小、相关性和样本大小的模拟聚类上进行了评估。所提出的算法收敛迅速，并在多个聚类指标上取得了理想的性能。这种新方法在保护患者数据隐私的同时，允许针对不同患者群体进行有针对性的治疗，并为在医疗物联网中更广泛的应用创造了潜力。

相似文献

Federated Fuzzy Clustering for Decentralized Incomplete Longitudinal Behavioral Data.

IEEE Internet Things J. 2024 Apr 15;11(8):14657-14670. doi: 10.1109/jiot.2023.3343719. Epub 2023 Dec 18.

PSA-FL-CDM: A Novel Federated Learning-Based Consensus Model for Post-Stroke Assessment.

Sensors (Basel). 2024 Aug 6;24(16):5095. doi: 10.3390/s24165095.

A comparative study of federated learning methods for COVID-19 detection.

Sci Rep. 2024 Feb 16;14(1):3944. doi: 10.1038/s41598-024-54323-2.

The FeatureCloud Platform for Federated Learning in Biomedicine: Unified Approach.

J Med Internet Res. 2023 Jul 12;25:e42621. doi: 10.2196/42621.

Handling Privacy-Sensitive Medical Data With Federated Learning: Challenges and Future Directions.

IEEE J Biomed Health Inform. 2023 Feb;27(2):790-803. doi: 10.1109/JBHI.2022.3185673. Epub 2023 Feb 3.

Multiple Imputation based Clustering Validation (MIV) for Big Longitudinal Trial Data with Missing Values in eHealth.

J Med Syst. 2016 Jun;40(6):146. doi: 10.1007/s10916-016-0499-0. Epub 2016 Apr 28.

FedMCC: Federated multi-center clustering algorithm to improve privacy healthcare.

Methods. 2023 Oct;218:94-100. doi: 10.1016/j.ymeth.2023.07.006. Epub 2023 Jul 26.

Enhancing IoT Security through a Green and Sustainable Federated Learning Platform: Leveraging Efficient Encryption and the Quondam Signature Algorithm.

Sensors (Basel). 2023 Sep 26;23(19):8090. doi: 10.3390/s23198090.

Boosted federated learning based on improved Particle Swarm Optimization for healthcare IoT devices.

Comput Biol Med. 2023 Sep;163:107195. doi: 10.1016/j.compbiomed.2023.107195. Epub 2023 Jun 22.

MIFuzzy Clustering for Incomplete Longitudinal Data in Smart Health.

Smart Health (Amst). 2017 Jun;1-2:50-65. doi: 10.1016/j.smhl.2017.04.002. Epub 2017 Apr 27.

引用本文的文献

Optimized convolutional neural network using African vulture optimization algorithm for the detection of exons.

Sci Rep. 2025 Jan 30;15(1):3810. doi: 10.1038/s41598-025-86672-x.

A Greedy Tabu Dual Heuristic algorithm for the cyclic pickup and delivery problem with 3D loading constraints.

Sci Rep. 2024 Dec 30;14(1):31762. doi: 10.1038/s41598-024-82534-0.

本文引用的文献

A review of harmonization methods for studying dietary patterns.

Smart Health (Amst). 2022 Mar;23. doi: 10.1016/j.smhl.2021.100263. Epub 2022 Jan 13.

Interpretable Machine Learning Framework Reveals Robust Gut Microbiome Features Associated With Type 2 Diabetes.

Diabetes Care. 2021 Feb;44(2):358-366. doi: 10.2337/dc20-1536. Epub 2020 Dec 7.

Federated Learning for Healthcare Informatics.

J Healthc Inform Res. 2021;5(1):1-19. doi: 10.1007/s41666-020-00082-4. Epub 2020 Nov 12.

Statistical and Machine-Learning Analyses in Nutritional Genomics Studies.

Nutrients. 2020 Oct 14;12(10):3140. doi: 10.3390/nu12103140.

The future of digital health with federated learning.

NPJ Digit Med. 2020 Sep 14;3:119. doi: 10.1038/s41746-020-00323-1. eCollection 2020.

Federated learning in medicine: facilitating multi-institutional collaborations without sharing patient data.

Sci Rep. 2020 Jul 28;10(1):12598. doi: 10.1038/s41598-020-69250-1.

ANHIR: Automatic Non-Rigid Histological Image Registration Challenge.

IEEE Trans Med Imaging. 2020 Oct;39(10):3042-3052. doi: 10.1109/TMI.2020.2986331. Epub 2020 Apr 7.

Estimating the success of re-identifications in incomplete datasets using generative models.

Nat Commun. 2019 Jul 23;10(1):3069. doi: 10.1038/s41467-019-10933-3.

A longitudinal big data approach for precision health.

Nat Med. 2019 May;25(5):792-804. doi: 10.1038/s41591-019-0414-6. Epub 2019 May 8.

eFCM: An Enhanced Fuzzy C-Means Algorithm for Longitudinal Intervention Data.

Int Conf Comput Netw Commun. 2018 Mar;2018:912-916. doi: 10.1109/ICCNC.2018.8390419. Epub 2018 Jun 21.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于分散式不完整纵向行为数据的联邦模糊聚类

Federated Fuzzy Clustering for Decentralized Incomplete Longitudinal Behavioral Data.

作者信息

Ngo Hieu, Fang Hua, Rumbut Joshua, Wang Honggang

机构信息

College of Engineering, University of Massachusetts Dartmouth, North Dartmouth, MA, 02747.

出版信息

IEEE Internet Things J. 2024 Apr 15;11(8):14657-14670. doi: 10.1109/jiot.2023.3343719. Epub 2023 Dec 18.

DOI:10.1109/jiot.2023.3343719

PMID:38605934

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11006372/

Abstract

摘要

用于分散式不完整纵向行为数据的联邦模糊聚类

Federated Fuzzy Clustering for Decentralized Incomplete Longitudinal Behavioral Data.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

用于分散式不完整纵向行为数据的联邦模糊聚类

Federated Fuzzy Clustering for Decentralized Incomplete Longitudinal Behavioral Data.

作者信息

机构信息

出版信息