基于 Spark 引擎的疾病负担大数据平台的设计与开发。

Design and Development of a Big Data Platform for Disease Burden Based on the Spark Engine.

机构信息

School of Public Health and Management, Guangzhou University of Chinese Medicine, Guangzhou 510006, China.

College of Physical Education and Health, Guangxi Medical University, Nanning 530021, China.

出版信息

Comput Intell Neurosci. 2023 Feb 6;2023:8963053. doi: 10.1155/2023/8963053. eCollection 2023.

DOI:10.1155/2023/8963053

PMID:36793705

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9925246/

Abstract

OBJECTIVE

This study attempts to build a big data platform for disease burden that can realize the deep coupling of artificial intelligence and public health. This is a highly open and shared intelligent platform, including big data collection, analysis, and result visualization.

METHODS

Based on data mining theory and technology, the current situation of multisource data on disease burden was analyzed. Putting forward the disease burden big data management model, functional modules, and technical framework, Kafka technology is used to optimize the transmission efficiency of the underlying data. This will be an efficient and highly scalable data analysis platform through embedding embedded Sparkmlib in the Hadoop ecosystem.

RESULTS

With the concept of "Internet + medical integration," the overall architecture design of the big data platform for disease burden management was proposed based on the Spark engine and Python language. The main system composition and application scenarios are given at four levels: multisource data collection, data processing, data analysis, and the application layer, according to application scenarios and use requirements.

CONCLUSION

The big data platform of disease burden management helps to promote the multisource convergence of disease burden data and provides a new path for the standardized paradigm of disease burden measurement. Provide methods and ideas for the deep integration of medical big data and the formation of a broader standard paradigm.

摘要

目的

本研究旨在构建一个可实现人工智能与公共卫生深度耦合的疾病负担大数据平台。这是一个高度开放和共享的智能平台，包括大数据采集、分析和结果可视化。

方法

基于数据挖掘理论和技术，分析了疾病负担多源数据的现状。提出了疾病负担大数据管理模型、功能模块和技术框架，利用 Kafka 技术优化底层数据的传输效率。通过在 Hadoop 生态系统中嵌入嵌入式 Sparkmlib，这将是一个高效且具有高可扩展性的数据分析平台。

结果

基于“互联网+医疗”的理念，提出了基于 Spark 引擎和 Python 语言的疾病负担管理大数据平台的总体架构设计。根据应用场景和使用要求，给出了主要系统组成和四个层次的应用场景：多源数据采集、数据处理、数据分析和应用层。

结论

疾病负担管理大数据平台有助于促进疾病负担数据的多源融合，为疾病负担测量的规范化范式提供了新的途径。为医疗大数据的深度融合和更广泛的标准范式的形成提供了方法和思路。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c7e/9925246/f026d5ffc3b9/CIN2023-8963053.001.jpg

相似文献

Design and Development of a Big Data Platform for Disease Burden Based on the Spark Engine.

Comput Intell Neurosci. 2023 Feb 6;2023:8963053. doi: 10.1155/2023/8963053. eCollection 2023.

Big data-driven intelligent governance of college students' physical health: System and strategy.

Front Public Health. 2022 Aug 9;10:924025. doi: 10.3389/fpubh.2022.924025. eCollection 2022.

Disease-specific data processing: An intelligent digital platform for diabetes based on model prediction and data analysis utilizing big data technology.

Front Public Health. 2022 Dec 12;10:1053269. doi: 10.3389/fpubh.2022.1053269. eCollection 2022.

New Progress in Artificial Intelligence Algorithm Research Based on Big Data Processing of IOT Systems on Intelligent Production Lines.

Comput Intell Neurosci. 2022 Mar 10;2022:3283165. doi: 10.1155/2022/3283165. eCollection 2022.

Big Data Precision Marketing Approach under IoT Cloud Platform Information Mining.

Comput Intell Neurosci. 2022 Jan 12;2022:4828108. doi: 10.1155/2022/4828108. eCollection 2022.

Big Data Health Care Platform With Multisource Heterogeneous Data Integration and Massive High-Dimensional Data Governance for Large Hospitals: Design, Development, and Application.

JMIR Med Inform. 2022 Apr 13;10(4):e36481. doi: 10.2196/36481.

Big Data-Artificial Intelligence Fusion Technology in Education in the Context of the New Crown Epidemic.

Big Data. 2022 Jun;10(3):262-276. doi: 10.1089/big.2021.0245. Epub 2022 May 23.

Big Data Technology in the Macrodecision-Making Model of Regional Industrial Economic Information Applied Research.

Comput Intell Neurosci. 2022 Jul 18;2022:7400797. doi: 10.1155/2022/7400797. eCollection 2022.

Sports Economic Mining Algorithm Based on Association Analysis and Big Data Model.

Comput Intell Neurosci. 2022 May 23;2022:1518202. doi: 10.1155/2022/1518202. eCollection 2022.

Application and Effectiveness of Big Data and Artificial Intelligence in the Construction of Nursing Sensitivity Quality Indicators.

J Healthc Eng. 2021 Sep 21;2021:2087876. doi: 10.1155/2021/2087876. eCollection 2021.

本文引用的文献

The Application of a Computer Monitoring System Using IoT Technology.

Comput Intell Neurosci. 2022 Jun 6;2022:4033886. doi: 10.1155/2022/4033886. eCollection 2022.

Effective Analysis of Inpatient Satisfaction: The Random Forest Algorithm.

Patient Prefer Adherence. 2021 Apr 7;15:691-703. doi: 10.2147/PPA.S294402. eCollection 2021.

Global burden of 87 risk factors in 204 countries and territories, 1990-2019: a systematic analysis for the Global Burden of Disease Study 2019.

Lancet. 2020 Oct 17;396(10258):1223-1249. doi: 10.1016/S0140-6736(20)30752-2.

Disease burden of congenital cytomegalovirus infection in Japan.

J Infect Chemother. 2021 Feb;27(2):161-164. doi: 10.1016/j.jiac.2020.08.018. Epub 2020 Sep 7.

A Big Data Platform for Real Time Analysis of Signs of Depression in Social Media.

Int J Environ Res Public Health. 2020 Jul 1;17(13):4752. doi: 10.3390/ijerph17134752.

Social big data: Recent achievements and new challenges.

Inf Fusion. 2016 Mar;28:45-59. doi: 10.1016/j.inffus.2015.08.005. Epub 2015 Aug 28.

European burden of disease network: strengthening the collaboration.

Eur J Public Health. 2020 Feb 1;30(1):2-3. doi: 10.1093/eurpub/ckz225.

Data-Enabled Digestive Medicine: A New Big Data Analytics Platform.

IEEE/ACM Trans Comput Biol Bioinform. 2021 May-Jun;18(3):922-931. doi: 10.1109/TCBB.2019.2951555. Epub 2021 Jun 3.

The Korea Cancer Big Data Platform (K-CBP) for Cancer Research.

Int J Environ Res Public Health. 2019 Jun 28;16(13):2290. doi: 10.3390/ijerph16132290.

Health Care and Precision Medicine Research: Analysis of a Scalable Data Science Platform.

J Med Internet Res. 2019 Apr 9;21(4):e13043. doi: 10.2196/13043.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于 Spark 引擎的疾病负担大数据平台的设计与开发。

Design and Development of a Big Data Platform for Disease Burden Based on the Spark Engine.

机构信息

出版信息

OBJECTIVE

METHODS

RESULTS

CONCLUSION

目的

方法

结果

结论

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献