• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大型健康数据集的安全记录链接:混合云模型评估

Secure Record Linkage of Large Health Data Sets: Evaluation of a Hybrid Cloud Model.

作者信息

Brown Adrian Paul, Randall Sean M

机构信息

Centre for Data Linkage, Curtin University, Bentley, Australia.

出版信息

JMIR Med Inform. 2020 Sep 23;8(9):e18920. doi: 10.2196/18920.

DOI:10.2196/18920
PMID:32965236
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7542414/
Abstract

BACKGROUND

The linking of administrative data across agencies provides the capability to investigate many health and social issues with the potential to deliver significant public benefit. Despite its advantages, the use of cloud computing resources for linkage purposes is scarce, with the storage of identifiable information on cloud infrastructure assessed as high risk by data custodians.

OBJECTIVE

This study aims to present a model for record linkage that utilizes cloud computing capabilities while assuring custodians that identifiable data sets remain secure and local.

METHODS

A new hybrid cloud model was developed, including privacy-preserving record linkage techniques and container-based batch processing. An evaluation of this model was conducted with a prototype implementation using large synthetic data sets representative of administrative health data.

RESULTS

The cloud model kept identifiers on premises and uses privacy-preserved identifiers to run all linkage computations on cloud infrastructure. Our prototype used a managed container cluster in Amazon Web Services to distribute the computation using existing linkage software. Although the cost of computation was relatively low, the use of existing software resulted in an overhead of processing of 35.7% (149/417 min execution time).

CONCLUSIONS

The result of our experimental evaluation shows the operational feasibility of such a model and the exciting opportunities for advancing the analysis of linkage outputs.

摘要

背景

跨机构行政数据的关联提供了调查诸多健康和社会问题的能力,有可能带来重大的公共利益。尽管有其优势,但将云计算资源用于数据关联目的的情况却很少见,数据保管人认为在云基础设施上存储可识别信息具有高风险。

目的

本研究旨在提出一种记录关联模型,该模型利用云计算能力,同时向保管人保证可识别数据集的安全性和本地化。

方法

开发了一种新的混合云模型,包括隐私保护记录关联技术和基于容器的批处理。使用代表行政健康数据的大型合成数据集通过原型实现对该模型进行了评估。

结果

该云模型将标识符保留在本地,并使用隐私保护标识符在云基础设施上运行所有关联计算。我们的原型在亚马逊网络服务中使用了一个托管容器集群,以使用现有的关联软件来分配计算任务。虽然计算成本相对较低,但使用现有软件导致处理开销为35.7%(执行时间为149/417分钟)。

结论

我们实验评估的结果表明了这种模型的操作可行性以及推进关联输出分析的令人兴奋的机会。

相似文献

1
Secure Record Linkage of Large Health Data Sets: Evaluation of a Hybrid Cloud Model.大型健康数据集的安全记录链接:混合云模型评估
JMIR Med Inform. 2020 Sep 23;8(9):e18920. doi: 10.2196/18920.
2
Privacy preserving probabilistic record linkage (P3RL): a novel method for linking existing health-related data and maintaining participant confidentiality.隐私保护概率性记录链接(P3RL):一种链接现有健康相关数据并维护参与者隐私的新方法。
BMC Med Res Methodol. 2015 May 30;15:46. doi: 10.1186/s12874-015-0038-6.
3
Secure Secondary Use of Clinical Data with Cloud-based NLP Services. Towards a Highly Scalable Research Infrastructure.利用基于云的自然语言处理服务实现临床数据的安全二次利用。迈向高度可扩展的研究基础设施。
Methods Inf Med. 2015;54(3):276-82. doi: 10.3414/ME13-01-0133. Epub 2014 Nov 7.
4
Obfuscatable multi-recipient re-encryption for secure privacy-preserving personal health record services.用于安全隐私保护个人健康记录服务的可混淆多接收者重新加密
Technol Health Care. 2015;23 Suppl 1:S139-45. doi: 10.3233/thc-150946.
5
Secure and scalable deduplication of horizontally partitioned health data for privacy-preserving distributed statistical computation.用于隐私保护分布式统计计算的水平分区健康数据的安全且可扩展的重复数据删除
BMC Med Inform Decis Mak. 2017 Jan 3;17(1):1. doi: 10.1186/s12911-016-0389-x.
6
A cloud-based buyer-seller watermarking protocol (CB-BSWP) using semi-trusted third party for copy deterrence and privacy preserving.一种基于云的买卖双方水印协议(CB-BSWP),使用半可信第三方来防止复制并保护隐私。
Multimed Tools Appl. 2022;81(15):21417-21448. doi: 10.1007/s11042-022-12550-7. Epub 2022 Mar 15.
7
Data linkage infrastructure for cross-jurisdictional health-related research in Australia.澳大利亚跨司法管辖区健康相关研究的数据链接基础设施。
BMC Health Serv Res. 2012 Dec 29;12:480. doi: 10.1186/1472-6963-12-480.
8
An Efficient Privacy-Preserving Public Auditing Protocol for Cloud-Based Medical Storage System.一种用于基于云的医疗存储系统的高效隐私保护公共审计协议。
IEEE J Biomed Health Inform. 2022 May;26(5):2020-2031. doi: 10.1109/JBHI.2022.3140831. Epub 2022 May 5.
9
Verifiable fully outsourced attribute-based signcryption system for IoT eHealth big data in cloud computing.可验证的完全外包基于属性的签密系统,用于云计算中的物联网电子健康大数据。
Math Biosci Eng. 2019 Apr 22;16(5):3561-3594. doi: 10.3934/mbe.2019178.
10
Design of Secure Protocol for Cloud-Assisted Electronic Health Record System Using Blockchain.基于区块链的云辅助电子健康记录系统安全协议设计。
Sensors (Basel). 2020 May 21;20(10):2913. doi: 10.3390/s20102913.

引用本文的文献

1
Validating a novel deterministic privacy-preserving record linkage between administrative & clinical data: applications in stroke research.验证一种新颖的行政与临床数据确定性隐私保护记录链接方法:在中风研究中的应用。
Int J Popul Data Sci. 2022 Nov 22;7(4):1755. doi: 10.23889/ijpds.v7i4.1755. eCollection 2022.

本文引用的文献

1
A Position Statement on Population Data Science: The Science of Data about People.关于人口数据科学的立场声明:关于人群的数据科学。
Int J Popul Data Sci. 2018 Feb 22;3(1):415. doi: 10.23889/ijpds.v3i1.415.
2
Harnessing wearable device data to improve state-level real-time surveillance of influenza-like illness in the USA: a population-based study.利用可穿戴设备数据改善美国州级实时流感样疾病监测:一项基于人群的研究。
Lancet Digit Health. 2020 Feb;2(2):e85-e93. doi: 10.1016/S2589-7500(19)30222-5. Epub 2020 Jan 16.
3
Population Data Centre Profiles: Centre for Data Linkage.
人口数据中心简介:数据链接中心
Int J Popul Data Sci. 2020 Mar 11;4(2):1139. doi: 10.23889/ijpds.v4i2.1139.
4
Measuring mobility, disease connectivity and individual risk: a review of using mobile phone data and mHealth for travel medicine.测量移动性、疾病关联性和个体风险:利用移动电话数据和移动医疗进行旅行医学研究的综述。
J Travel Med. 2019 May 10;26(3). doi: 10.1093/jtm/taz019.
5
Estimating parameters for probabilistic linkage of privacy-preserved datasets.估算隐私保护数据集概率关联的参数。
BMC Med Res Methodol. 2017 Jul 10;17(1):95. doi: 10.1186/s12874-017-0370-0.
6
Using Electronic Health Records for Population Health Research: A Review of Methods and Applications.利用电子健康记录进行人群健康研究:方法与应用综述。
Annu Rev Public Health. 2016;37:61-81. doi: 10.1146/annurev-publhealth-032315-021353. Epub 2015 Dec 11.
7
Precision Public Health for the Era of Precision Medicine.精准医学时代的精准公共卫生。
Am J Prev Med. 2016 Mar;50(3):398-401. doi: 10.1016/j.amepre.2015.08.031. Epub 2015 Nov 4.
8
SOEMPI: A Secure Open Enterprise Master Patient Index Software Toolkit for Private Record Linkage.SOEMPI:用于私人记录链接的安全开放式企业主患者索引软件工具包。
AMIA Annu Symp Proc. 2014 Nov 14;2014:1105-14. eCollection 2014.
9
A transparent and transportable methodology for evaluating Data Linkage software.一种用于评估数据链接软件的透明且可移植的方法。
J Biomed Inform. 2012 Feb;45(1):165-72. doi: 10.1016/j.jbi.2011.10.006. Epub 2011 Oct 30.
10
Record Linkage.记录链接
Am J Public Health Nations Health. 1946 Dec;36(12):1412-6.