Implementation of a Big Data Accessing and Processing Platform for Medical Records in Cloud.

Author information

Yang Chao-Tung, Liu Jung-Chun, Chen Shuo-Tsung, Lu Hsin-Wen

Affiliations

Department of Computer Science, Tunghai University, Taichung, 40704, Taiwan, Republic of China.

Publication information

J Med Syst. 2017 Aug 18;41(10):149. doi: 10.1007/s10916-017-0777-5.

DOI:10.1007/s10916-017-0777-5

PMID:28822042

Abstract

Big Data analysis has become a key factor in innovation and competitiveness. With worldwide population growth and population aging in developed countries, the use of national medical care has been increasing. Because an individual's medical data are usually scattered across different institutions and stored in varied formats, integrating this continually growing data is challenging. For data platforms to scale with this load, they must be built on a sound platform architecture. Several issues must be considered in order to use cloud computing to quickly integrate big medical data into a database so that it can be easily analyzed, searched, and filtered to obtain valuable information. This work builds a cloud storage system with HBase on Hadoop for storing and analyzing big data of medical records, and improves the performance of importing data into the database. Medical record data are stored in the HBase database platform for big data analysis. The system performs distributed processing of medical record data through Hadoop MapReduce programming and provides functions for the HBase database, including keyword search, data filtering, and basic statistics. The system uses single-threaded Put and the CompleteBulkload mechanism to import medical data. The experimental results show that import performance improves when single-threaded Put is used for files smaller than 300 MB and CompleteBulkload is used for files larger than 300 MB. The system provides a web interface that lets users search data, filter out meaningful information, and analyze and convert data into suitable forms, which will be helpful for medical staff and institutions.
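The size-based choice between the two import paths can be illustrated with a minimal sketch. This is not the authors' code: the names `ImportMethod`, `THRESHOLD_BYTES`, and `choose_import_method` are hypothetical, 1 MB is assumed to mean 1024 × 1024 bytes, and the abstract does not say which method applies at exactly 300 MB, so sending that boundary case to CompleteBulkload is also an assumption. The sketch only selects a method; it does not call HBase.

```python
from enum import Enum

# 300 MB threshold taken from the abstract.
# Assumption: 1 MB = 1024 * 1024 bytes (the abstract does not specify).
THRESHOLD_BYTES = 300 * 1024 * 1024


class ImportMethod(Enum):
    """Hypothetical labels for the two import paths named in the abstract."""
    SINGLE_THREADED_PUT = "single-threaded Put"
    COMPLETE_BULKLOAD = "CompleteBulkload"


def choose_import_method(file_size_bytes: int) -> ImportMethod:
    """Pick an import path from the file size alone.

    Assumption: a file of exactly 300 MB goes to CompleteBulkload;
    the abstract leaves this boundary case undefined.
    """
    if file_size_bytes < 0:
        raise ValueError("file size must be non-negative")
    if file_size_bytes < THRESHOLD_BYTES:
        return ImportMethod.SINGLE_THREADED_PUT
    return ImportMethod.COMPLETE_BULKLOAD


if __name__ == "__main__":
    for size_mb in (50, 299, 300, 1024):
        method = choose_import_method(size_mb * 1024 * 1024)
        print(f"{size_mb} MB -> {method.value}")
```

In a real deployment the 300 MB crossover would likely depend on cluster size and hardware, so the threshold is better treated as a tunable value measured on the target system than as a fixed constant.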
