• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

医疗保健与精准医学研究:一个可扩展数据科学平台的分析

Health Care and Precision Medicine Research: Analysis of a Scalable Data Science Platform.

作者信息

McPadden Jacob, Durant Thomas Js, Bunch Dustin R, Coppi Andreas, Price Nathaniel, Rodgerson Kris, Torre Charles J, Byron William, Hsiao Allen L, Krumholz Harlan M, Schulz Wade L

机构信息

Department of Pediatrics, Yale University School of Medicine, New Haven, CT, United States.

Department of Laboratory Medicine, Yale University School of Medicine, New Haven, CT, United States.

出版信息

J Med Internet Res. 2019 Apr 9;21(4):e13043. doi: 10.2196/13043.

DOI:10.2196/13043
PMID:30964441
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6477571/
Abstract

BACKGROUND

Health care data are increasing in volume and complexity. Storing and analyzing these data to implement precision medicine initiatives and data-driven research has exceeded the capabilities of traditional computer systems. Modern big data platforms must be adapted to the specific demands of health care and designed for scalability and growth.

OBJECTIVE

The objectives of our study were to (1) demonstrate the implementation of a data science platform built on open source technology within a large, academic health care system and (2) describe 2 computational health care applications built on such a platform.

METHODS

We deployed a data science platform based on several open source technologies to support real-time, big data workloads. We developed data-acquisition workflows for Apache Storm and NiFi in Java and Python to capture patient monitoring and laboratory data for downstream analytics.

RESULTS

Emerging data management approaches, along with open source technologies such as Hadoop, can be used to create integrated data lakes to store large, real-time datasets. This infrastructure also provides a robust analytics platform where health care and biomedical research data can be analyzed in near real time for precision medicine and computational health care use cases.

CONCLUSIONS

The implementation and use of integrated data science platforms offer organizations the opportunity to combine traditional datasets, including data from the electronic health record, with emerging big data sources, such as continuous patient monitoring and real-time laboratory results. These platforms can enable cost-effective and scalable analytics for the information that will be key to the delivery of precision medicine initiatives. Organizations that can take advantage of the technical advances found in data science platforms will have the opportunity to provide comprehensive access to health care data for computational health care and precision medicine research.

摘要

背景

医疗保健数据的数量和复杂性正在不断增加。存储和分析这些数据以实施精准医疗计划和数据驱动型研究已超出传统计算机系统的能力范围。现代大数据平台必须适应医疗保健的特定需求,并设计为具有可扩展性和增长性。

目的

我们研究的目的是:(1)展示在大型学术医疗保健系统中基于开源技术构建的数据科学平台的实施情况;(2)描述基于此类平台构建的两个计算医疗保健应用程序。

方法

我们部署了一个基于多种开源技术的数据科学平台,以支持实时大数据工作负载。我们用Java和Python为Apache Storm和NiFi开发了数据采集工作流程,以捕获患者监测数据和实验室数据,用于下游分析。

结果

新兴的数据管理方法以及诸如Hadoop之类的开源技术可用于创建集成数据湖,以存储大型实时数据集。该基础设施还提供了一个强大的分析平台,可在该平台上对医疗保健和生物医学研究数据进行近实时分析,以用于精准医疗和计算医疗保健用例。

结论

集成数据科学平台的实施和使用为各组织提供了机会,可将包括电子健康记录数据在内的传统数据集与新兴大数据源(如持续的患者监测数据和实时实验室结果)相结合。这些平台可为对精准医疗计划的实施至关重要的信息提供具有成本效益且可扩展的分析。能够利用数据科学平台中技术进步的组织将有机会为计算医疗保健和精准医学研究提供全面的医疗保健数据访问。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e1a/6477571/7577bbd1b762/jmir_v21i4e13043_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e1a/6477571/1db9d1c33b75/jmir_v21i4e13043_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e1a/6477571/25340b0449c9/jmir_v21i4e13043_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e1a/6477571/ed18af6d5e61/jmir_v21i4e13043_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e1a/6477571/7577bbd1b762/jmir_v21i4e13043_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e1a/6477571/1db9d1c33b75/jmir_v21i4e13043_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e1a/6477571/25340b0449c9/jmir_v21i4e13043_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e1a/6477571/ed18af6d5e61/jmir_v21i4e13043_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e1a/6477571/7577bbd1b762/jmir_v21i4e13043_fig4.jpg

相似文献

1
Health Care and Precision Medicine Research: Analysis of a Scalable Data Science Platform.医疗保健与精准医学研究:一个可扩展数据科学平台的分析
J Med Internet Res. 2019 Apr 9;21(4):e13043. doi: 10.2196/13043.
2
Innovations in Genomics and Big Data Analytics for Personalized Medicine and Health Care: A Review.基因组学和大数据分析在个性化医疗和医疗保健中的创新:综述。
Int J Mol Sci. 2022 Apr 22;23(9):4645. doi: 10.3390/ijms23094645.
3
Integrative methods for analyzing big data in precision medicine.精准医学中大数据分析的整合方法。
Proteomics. 2016 Mar;16(5):741-58. doi: 10.1002/pmic.201500396.
4
A scalable, secure, and interoperable platform for deep data-driven health management.用于深度数据驱动型健康管理的可扩展、安全且互操作的平台。
Nat Commun. 2021 Oct 1;12(1):5757. doi: 10.1038/s41467-021-26040-1.
5
Applications of Artificial Intelligence and Big Data Analytics in m-Health: A Healthcare System Perspective.人工智能和大数据分析在移动医疗中的应用:医疗保健系统视角。
J Healthc Eng. 2020 Aug 30;2020:8894694. doi: 10.1155/2020/8894694. eCollection 2020.
6
A logic model for precision medicine implementation informed by stakeholder views and implementation science.基于利益相关者观点和实施科学的精准医学实施逻辑模型。
Genet Med. 2019 May;21(5):1139-1154. doi: 10.1038/s41436-018-0315-y. Epub 2018 Oct 23.
7
Why we need a small data paradigm.为什么我们需要小数据范式。
BMC Med. 2019 Jul 17;17(1):133. doi: 10.1186/s12916-019-1366-x.
8
Big Data Analytics for Genomic Medicine.基因组医学中的大数据分析
Int J Mol Sci. 2017 Feb 15;18(2):412. doi: 10.3390/ijms18020412.
9
Next-Generation Sequencing Informatics: Challenges and Strategies for Implementation in a Clinical Environment.下一代测序信息学:临床环境中实施的挑战与策略
Arch Pathol Lab Med. 2016 Sep;140(9):958-75. doi: 10.5858/arpa.2015-0507-RA. Epub 2016 Feb 22.
10
Big data analytics for personalized medicine.大数据分析在个性化医疗中的应用。
Curr Opin Biotechnol. 2019 Aug;58:161-167. doi: 10.1016/j.copbio.2019.03.004. Epub 2019 Apr 6.

引用本文的文献

1
Medical Big Data Storage in Precision Medicine: A Systematic Review.精准医学中的医学大数据存储:一项系统综述。
J Biomed Phys Eng. 2025 Jun 1;15(3):205-220. doi: 10.31661/jbpe.v0i0.2402-1730. eCollection 2025 Jun.
2
Development of Personas and Journey Maps for Artificial Intelligence Agents Supporting the Use of Health Big Data: Human-Centered Design Approach.支持健康大数据使用的人工智能代理的人物角色和旅程地图开发:以人为本的设计方法。
JMIR Form Res. 2025 Jan 8;9:e67272. doi: 10.2196/67272.
3
Enhancing Data Science and Genomics Capacity of a Historically Black Medical College Through Interdisciplinary Training and Research Collaborations.

本文引用的文献

1
The problem with Big Data in Translational Medicine. A review of where we've been and the possibilities ahead.转化医学中的大数据问题。对我们的过往及未来可能性的回顾。
Appl Transl Genom. 2015 Aug 5;6:3-6. doi: 10.1016/j.atg.2015.07.005. eCollection 2015 Sep.
2
Automated continuous vital signs predict use of uncrossed matched blood and massive transfusion following trauma.自动连续生命体征可预测创伤后未交叉配血血液的使用及大量输血情况。
J Trauma Acute Care Surg. 2016 Jun;80(6):897-906. doi: 10.1097/TA.0000000000001047.
3
Integrative methods for analyzing big data in precision medicine.
通过跨学科培训和研究合作提升一所历史悠久的黑人医学院的数据科学和基因组学能力。
J Biotechnol Biomed. 2024;7(4):425-432. doi: 10.26502/jbb.2642-91280166. Epub 2024 Oct 18.
4
Use of electronic health records to characterize patients with uncontrolled hypertension in two large health system networks.利用电子健康记录描述两大医疗系统网络中未控制的高血压患者的特征。
BMC Cardiovasc Disord. 2024 Sep 18;24(1):497. doi: 10.1186/s12872-024-04161-x.
5
Personalised Medicine-Implementation to the Healthcare System in Europe (Focus Group Discussions).个性化医疗——在欧洲医疗体系中的实施(焦点小组讨论)
J Pers Med. 2023 Feb 21;13(3):380. doi: 10.3390/jpm13030380.
6
A Case Study of Enhancing the Data Science Capacity of an RCMI Program at a Historically Black Medical College.一个强化 RCMI 项目数据科学能力的案例研究:以一所历史悠久的黑人医学院为例。
Int J Environ Res Public Health. 2023 Mar 8;20(6):4775. doi: 10.3390/ijerph20064775.
7
Design and Development of a Big Data Platform for Disease Burden Based on the Spark Engine.基于 Spark 引擎的疾病负担大数据平台的设计与开发。
Comput Intell Neurosci. 2023 Feb 6;2023:8963053. doi: 10.1155/2023/8963053. eCollection 2023.
8
Policy Preferences Regarding Health Data Sharing Among Patients With Cancer: Public Deliberations.癌症患者对健康数据共享的政策偏好:公众审议
JMIR Cancer. 2023 Jan 31;9:e39631. doi: 10.2196/39631.
9
Association between primary or booster COVID-19 mRNA vaccination and Omicron lineage BA.1 SARS-CoV-2 infection in people with a prior SARS-CoV-2 infection: A test-negative case-control analysis.既往感染人群中,与 COVID-19 mRNA 疫苗(基础免疫或加强针)接种和奥密克戎变异株 BA.1 刺突蛋白 SARS-CoV-2 感染的相关性:一项病例对照研究。
PLoS Med. 2022 Dec 1;19(12):e1004136. doi: 10.1371/journal.pmed.1004136. eCollection 2022 Dec.
10
Lifting Hospital Electronic Health Record Data Treasures: Challenges and Opportunities.挖掘医院电子健康记录数据宝藏:挑战与机遇
JMIR Med Inform. 2022 Oct 21;10(10):e38557. doi: 10.2196/38557.
精准医学中大数据分析的整合方法。
Proteomics. 2016 Mar;16(5):741-58. doi: 10.1002/pmic.201500396.
4
Precision medicine--personalized, problematic, and promising.精准医学——个性化、存在问题且充满希望。
N Engl J Med. 2015 Jun 4;372(23):2229-34. doi: 10.1056/NEJMsb1503104. Epub 2015 May 27.
5
A new initiative on precision medicine.一项关于精准医学的新倡议。
N Engl J Med. 2015 Feb 26;372(9):793-5. doi: 10.1056/NEJMp1500523. Epub 2015 Jan 30.
6
Network physiology reveals relations between network topology and physiological function.网络生理学揭示了网络拓扑结构和生理功能之间的关系。
Nat Commun. 2012 Feb 28;3:702. doi: 10.1038/ncomms1705.
7
Errors in laboratory medicine.检验医学中的误差
Clin Chem. 2002 May;48(5):691-8.
8
Toward the early diagnosis of neonatal sepsis and sepsis-like illness using novel heart rate analysis.利用新型心率分析实现新生儿败血症及败血症样疾病的早期诊断。
Pediatrics. 2001 Jan;107(1):97-104. doi: 10.1542/peds.107.1.97.