• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用隐私保护联邦学习基础设施,从马斯特里赫特研究和荷兰统计局的分布式数据中研究糖尿病与医疗保健费用之间的关联。

Studying the association of diabetes and healthcare cost on distributed data from the Maastricht Study and Statistics Netherlands using a privacy-preserving federated learning infrastructure.

机构信息

Institute of Data Science, Faculty of Science and Engineering, Maastricht University, Maastricht, The Netherlands.

Brightlands Institute of Smart Society, Faculty of Science and Engineering, Maastricht University, Heerlen, The Netherlands.

出版信息

J Biomed Inform. 2022 Oct;134:104194. doi: 10.1016/j.jbi.2022.104194. Epub 2022 Sep 5.

DOI:10.1016/j.jbi.2022.104194
PMID:36064113
Abstract

The mining of personal data collected by multiple organizations remains challenging in the presence of technical barriers, privacy concerns, and legal and/or organizational restrictions. While a number of privacy-preserving and data mining frameworks have recently emerged, much remains to show their practical utility. In this study, we implement and utilize a secure infrastructure using data from Statistics Netherlands and the Maastricht Study to learn the association between Type 2 Diabetes Mellitus (T2DM) and healthcare expenses considering the impact of lifestyle, physical activities, and complications of T2DM. Through experiments using real-world distributed personal data, we present the feasibility and effectiveness of the secure infrastructure for practical use cases of linking and analyzing vertically partitioned data across multiple organizations. We discovered that individuals diagnosed with T2DM had significantly higher expenses than those with prediabetes, while participants with prediabetes spent more than those without T2DM in all the included healthcare categories to different degrees. We further discuss a joint effort from technical, ethical-legal, and domain-specific experts that is highly valued for applying such a secure infrastructure to real-life use cases to protect data privacy.

摘要

在存在技术障碍、隐私问题以及法律和/或组织限制的情况下,挖掘多个组织收集的个人数据仍然具有挑战性。虽然最近出现了许多隐私保护和数据挖掘框架,但仍需要展示它们的实际效用。在这项研究中,我们使用来自荷兰统计局和马斯特里赫特研究的数据实施和利用安全基础设施,以了解 2 型糖尿病(T2DM)与医疗保健费用之间的关联,同时考虑生活方式、体育活动以及 T2DM 并发症的影响。通过使用真实分布式个人数据的实验,我们展示了安全基础设施对于跨多个组织链接和分析垂直分割数据的实际用例的可行性和有效性。我们发现,被诊断患有 T2DM 的个体的支出明显高于前驱糖尿病患者,而前驱糖尿病患者在所有包含的医疗保健类别中的支出均高于没有 T2DM 的患者,但程度不同。我们进一步讨论了技术、伦理法律和特定领域专家的共同努力,这对于将这种安全基础设施应用于现实生活中的用例以保护数据隐私具有重要价值。

相似文献

1
Studying the association of diabetes and healthcare cost on distributed data from the Maastricht Study and Statistics Netherlands using a privacy-preserving federated learning infrastructure.使用隐私保护联邦学习基础设施,从马斯特里赫特研究和荷兰统计局的分布式数据中研究糖尿病与医疗保健费用之间的关联。
J Biomed Inform. 2022 Oct;134:104194. doi: 10.1016/j.jbi.2022.104194. Epub 2022 Sep 5.
2
A Privacy-Preserving Infrastructure for Analyzing Personal Health Data in a Vertically Partitioned Scenario.一种用于在垂直分区场景中分析个人健康数据的隐私保护基础设施。
Stud Health Technol Inform. 2019 Aug 21;264:373-377. doi: 10.3233/SHTI190246.
3
Using the Personal Health Train for Automated and Privacy-Preserving Analytics on Vertically Partitioned Data.使用个人健康列车对垂直分区数据进行自动化且保护隐私的分析。
Stud Health Technol Inform. 2018;247:581-585.
4
Distributed learning on 20 000+ lung cancer patients - The Personal Health Train.2 万多名肺癌患者的分布式学习 - 个人健康训练。
Radiother Oncol. 2020 Mar;144:189-200. doi: 10.1016/j.radonc.2019.11.019. Epub 2020 Jan 3.
5
Infrastructure platform for privacy-preserving distributed machine learning development of computer-assisted theragnostics in cancer.用于癌症计算机辅助治疗学中隐私保护分布式机器学习开发的基础架构平台。
J Biomed Inform. 2022 Oct;134:104181. doi: 10.1016/j.jbi.2022.104181. Epub 2022 Aug 30.
6
Accurate training of the Cox proportional hazards model on vertically-partitioned data while preserving privacy.在保护隐私的同时,对垂直分区数据进行 Cox 比例风险模型的精确训练。
BMC Med Inform Decis Mak. 2022 Feb 24;22(1):49. doi: 10.1186/s12911-022-01771-3.
7
Learning from vertically distributed data across multiple sites: An efficient privacy-preserving algorithm for Cox proportional hazards model with variable selection.从多个站点的垂直分布数据中学习:一种用于具有变量选择的Cox比例风险模型的高效隐私保护算法。
J Biomed Inform. 2024 Jan;149:104581. doi: 10.1016/j.jbi.2023.104581. Epub 2023 Dec 23.
8
The BioRef Infrastructure, a Framework for Real-Time, Federated, Privacy-Preserving, and Personalized Reference Intervals: Design, Development, and Application.生物参考信息基础设施:一个用于实时、联合、隐私保护和个性化参考区间的框架:设计、开发和应用。
J Med Internet Res. 2023 Oct 18;25:e47254. doi: 10.2196/47254.
9
Federated Analysis for Privacy-Preserving Data Sharing: A Technical and Legal Primer.联邦分析用于隐私保护的数据共享:技术和法律基础
Annu Rev Genomics Hum Genet. 2023 Aug 25;24:347-368. doi: 10.1146/annurev-genom-110122-084756. Epub 2023 May 30.
10
Privacy Preserving Association Rule Mining on Distributed Healthcare Data: COVID-19 and Breast Cancer Case Study.分布式医疗数据上的隐私保护关联规则挖掘:以COVID-19和乳腺癌为例的案例研究
SN Comput Sci. 2021;2(6):418. doi: 10.1007/s42979-021-00801-7. Epub 2021 Aug 18.

引用本文的文献

1
Advancing Privacy-Preserving Health Care Analytics and Implementation of the Personal Health Train: Federated Deep Learning Study.推进隐私保护医疗保健分析与个人健康列车的实施:联邦深度学习研究
JMIR AI. 2025 Feb 6;4:e60847. doi: 10.2196/60847.
2
Advancing healthcare through data: the BETTER project's vision for distributed analytics.通过数据推动医疗保健发展:BETTER项目对分布式分析的愿景。
Front Med (Lausanne). 2024 Oct 2;11:1473874. doi: 10.3389/fmed.2024.1473874. eCollection 2024.
3
Artificial intelligence, ChatGPT, and other large language models for social determinants of health: Current state and future directions.
人工智能、ChatGPT 及其他用于健康社会决定因素的大语言模型:现状与未来方向。
Cell Rep Med. 2024 Jan 16;5(1):101356. doi: 10.1016/j.xcrm.2023.101356.