Suppr超能文献

利用预测性机器学习快速分配健康数据存储库。

Rapid health data repository allocation using predictive machine learning.

机构信息

Federation University Australia, Australia.

出版信息

Health Informatics J. 2020 Dec;26(4):3009-3036. doi: 10.1177/1460458220957486. Epub 2020 Sep 24.

Abstract

Health-related data is stored in a number of repositories that are managed and controlled by different entities. For instance, Electronic Health Records are usually administered by governments. Electronic Medical Records are typically controlled by health care providers, whereas Personal Health Records are managed directly by patients. Recently, Blockchain-based health record systems largely regulated by technology have emerged as another type of repository. Repositories for storing health data differ from one another based on cost, level of security and quality of performance. Not only has the type of repositories increased in recent years, but the quantum of health data to be stored has increased. For instance, the advent of wearable sensors that capture physiological signs has resulted in an exponential growth in digital health data. The increase in the types of repository and amount of data has driven a need for intelligent processes to select appropriate repositories as data is collected. However, the storage allocation decision is complex and nuanced. The challenges are exacerbated when health data are continuously streamed, as is the case with wearable sensors. Although patients are not always solely responsible for determining which repository should be used, they typically have some input into this decision. Patients can be expected to have idiosyncratic preferences regarding storage decisions depending on their unique contexts. In this paper, we propose a predictive model for the storage of health data that can meet patient needs and make storage decisions rapidly, in real-time, even with data streaming from wearable sensors. The model is built with a machine learning classifier that learns the mapping between characteristics of health data and features of storage repositories from a training set generated synthetically from correlations evident from small samples of experts. Results from the evaluation demonstrate the viability of the machine learning technique used.

摘要

健康相关数据存储在许多由不同实体管理和控制的存储库中。例如,电子健康记录通常由政府管理。电子病历通常由医疗保健提供者控制,而个人健康记录则由患者直接管理。最近,基于区块链的健康记录系统作为另一种存储库类型大量涌现,这些系统主要由技术监管。存储健康数据的存储库在成本、安全性和性能质量方面彼此不同。近年来,不仅存储库的类型有所增加,而且要存储的健康数据量也有所增加。例如,可捕获生理迹象的可穿戴传感器的出现导致数字健康数据呈指数级增长。存储库类型和数据量的增加促使人们需要智能流程来选择合适的存储库,以便在数据被收集时进行存储。然而,存储分配决策非常复杂。当健康数据连续流式传输时,这种挑战会更加严重,例如可穿戴传感器的情况就是如此。虽然患者并不总是独自负责确定应该使用哪个存储库,但他们通常对此决策有一定的发言权。患者可以根据自己的独特情况对存储决策有特殊的偏好。在本文中,我们提出了一种用于存储健康数据的预测模型,该模型可以满足患者的需求,并在实时、甚至从可穿戴传感器流式传输数据的情况下快速做出存储决策。该模型是使用机器学习分类器构建的,该分类器从通过对来自专家小样本的相关性进行合成生成的训练集中学习健康数据的特征与存储库特征之间的映射。评估结果证明了所使用的机器学习技术的可行性。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验