• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

云端的真实世界证据:使用亚马逊网络服务资源开发端到端数据与分析管道教程。

Real-world evidence in the cloud: Tutorial on developing an end-to-end data and analytics pipeline using Amazon Web Services resources.

作者信息

Anderson Wes, Bhatnagar Roopal, Scollick Keith, Schito Marco, Walls Ramona, Podichetty Jagdeep T

机构信息

Critical Path Institute, Tucson, Arizona, USA.

出版信息

Clin Transl Sci. 2024 Dec;17(12):e70078. doi: 10.1111/cts.70078.

DOI:10.1111/cts.70078
PMID:39670335
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11638732/
Abstract

In the rapidly evolving landscape of healthcare and drug development, the ability to efficiently collect, process, and analyze large volumes of real-world data (RWD) is critical for advancing drug development. This article provides a blueprint for establishing an end-to-end data and analytics pipeline in a cloud-based environment. The pipeline presented here includes four major components, including data ingestion, transformation, visualization, and analytics, each supported by a suite of Amazon Web Services (AWS) tools. The pipeline is exemplified through the CURE ID platform, a collaborative tool designed to capture and analyze real-world, off-label treatment administrations. By using services such as AWS Lambda, Amazon Relational Database Service (RDS), Amazon QuickSight, and Amazon SageMaker, the pipeline facilitates the ingestion of diverse data sources, the transformation of raw data into structured formats, the creation of interactive dashboards for data visualization, and the application of advanced machine learning models for data analytics. The described architecture not only supports the needs of the CURE ID platform, but also offers a scalable and adaptable framework that can be applied across various domains to enhance data-driven decision making beyond drug repurposing.

摘要

在快速发展的医疗保健和药物研发领域,有效收集、处理和分析大量真实世界数据(RWD)的能力对于推进药物研发至关重要。本文提供了一个在基于云的环境中建立端到端数据和分析管道的蓝图。这里介绍的管道包括四个主要组件,即数据摄取、转换、可视化和分析,每个组件都由一套亚马逊网络服务(AWS)工具提供支持。该管道通过CURE ID平台进行示例说明,CURE ID平台是一个协作工具,旨在捕获和分析真实世界中的非标签治疗用药情况。通过使用AWS Lambda、亚马逊关系数据库服务(RDS)、亚马逊QuickSight和亚马逊SageMaker等服务,该管道便于摄取各种数据源,将原始数据转换为结构化格式,创建用于数据可视化的交互式仪表板,并应用先进的机器学习模型进行数据分析。所描述的架构不仅支持CURE ID平台的需求,还提供了一个可扩展且适应性强的框架,该框架可应用于各个领域,以加强超越药物重新利用的数据驱动决策。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc9/11638732/ec392a58e070/CTS-17-e70078-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc9/11638732/89831b71e1ab/CTS-17-e70078-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc9/11638732/3f33f6d2a1b9/CTS-17-e70078-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc9/11638732/48403c11a377/CTS-17-e70078-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc9/11638732/781708d9c389/CTS-17-e70078-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc9/11638732/ec392a58e070/CTS-17-e70078-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc9/11638732/89831b71e1ab/CTS-17-e70078-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc9/11638732/3f33f6d2a1b9/CTS-17-e70078-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc9/11638732/48403c11a377/CTS-17-e70078-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc9/11638732/781708d9c389/CTS-17-e70078-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc9/11638732/ec392a58e070/CTS-17-e70078-g005.jpg

相似文献

1
Real-world evidence in the cloud: Tutorial on developing an end-to-end data and analytics pipeline using Amazon Web Services resources.云端的真实世界证据:使用亚马逊网络服务资源开发端到端数据与分析管道教程。
Clin Transl Sci. 2024 Dec;17(12):e70078. doi: 10.1111/cts.70078.
2
Processing shotgun proteomics data on the Amazon cloud with the trans-proteomic pipeline.使用跨蛋白质组学管道在亚马逊云中处理鸟枪法蛋白质组学数据。
Mol Cell Proteomics. 2015 Feb;14(2):399-404. doi: 10.1074/mcp.O114.043380. Epub 2014 Nov 23.
3
A Visual Dashboard to Track Learning Analytics for Educational Cloud Computing.用于跟踪教育云计算学习分析的可视化仪表板。
Sensors (Basel). 2019 Jul 4;19(13):2952. doi: 10.3390/s19132952.
4
Developing a Reproducible Microbiome Data Analysis Pipeline Using the Amazon Web Services Cloud for a Cancer Research Group: Proof-of-Concept Study.为癌症研究小组使用亚马逊网络服务云开发可重复的微生物组数据分析管道:概念验证研究
JMIR Med Inform. 2019 Nov 11;7(4):e14667. doi: 10.2196/14667.
5
HealtheDataLab - a cloud computing solution for data science and advanced analytics in healthcare with application to predicting multi-center pediatric readmissions.HealtheDataLab- 一个针对医疗保健领域的数据科学和高级分析的云计算解决方案,应用于预测多中心儿科再入院率。
BMC Med Inform Decis Mak. 2020 Jun 19;20(1):115. doi: 10.1186/s12911-020-01153-7.
6
Digital Health Dashboards for Decision-Making to Enable Rapid Responses During Public Health Crises: Replicable and Scalable Methodology.用于决策的数字健康仪表盘,以在公共卫生危机期间实现快速响应:可复制和可扩展的方法。
JMIR Res Protoc. 2023 Jun 30;12:e46810. doi: 10.2196/46810.
7
Performance Management of High Performance Computing for Medical Image Processing in Amazon Web Services.亚马逊网络服务中用于医学图像处理的高性能计算的绩效管理
Proc SPIE Int Soc Opt Eng. 2016 Feb 27;9789. doi: 10.1117/12.2217396. Epub 2016 Mar 25.
8
A Scalable, Web-Based Platform for Proteomics Data Processing, Result Storage and Analysis.一个用于蛋白质组学数据处理、结果存储和分析的可扩展的基于网络的平台。
J Proteome Res. 2025 Mar 7;24(3):1241-1249. doi: 10.1021/acs.jproteome.4c00871. Epub 2025 Feb 21.
9
Cloud-based serverless computing enables accelerated monte carlo simulations for nuclear medicine imaging.基于云的无服务器计算可实现核医学成像的加速蒙特卡罗模拟。
Biomed Phys Eng Express. 2024 Jun 25;10(4). doi: 10.1088/2057-1976/ad5847.
10
Deploying Mass Spectrometric Data Analysis in the Amazon AWS Cloud Computing Environment.在亚马逊 AWS 云计算环境中部署质谱数据分析。
Methods Mol Biol. 2021;2271:375-397. doi: 10.1007/978-1-0716-1241-5_26.

本文引用的文献

1
Unlocking the Capabilities of Large Language Models for Accelerating Drug Development.释放大语言模型在加速药物研发方面的能力。
Clin Pharmacol Ther. 2024 Jul;116(1):38-41. doi: 10.1002/cpt.3279. Epub 2024 Apr 22.
2
Real-world data: a comprehensive literature review on the barriers, challenges, and opportunities associated with their inclusion in the health technology assessment process.真实世界数据:纳入卫生技术评估过程中相关障碍、挑战和机遇的全面文献综述。
J Pharm Pharm Sci. 2024 Feb 28;27:12302. doi: 10.3389/jpps.2024.12302. eCollection 2024.
3
A cloud-based pipeline for analysis of FHIR and long-read data.
用于分析FHIR和长读长数据的基于云的管道。
Bioinform Adv. 2023 Jan 20;3(1):vbac095. doi: 10.1093/bioadv/vbac095. eCollection 2023.
4
Health and Healthcare: Assessing the Real World Data Policy Landscape in Europe.健康与医疗保健:评估欧洲现实世界数据政策格局
Rand Health Q. 2014 Jun 1;4(2):15. eCollection 2014 Summer.