• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于整合临床试验中患者数据的后绑定、分布式、非关系型仓库。

A late-binding, distributed, NoSQL warehouse for integrating patient data from clinical trials.

机构信息

Covance, the Drug Development Division of LabCorp Carnegie Center, Princeton, NJ, USA.

出版信息

Database (Oxford). 2019 Jan 1;2019. doi: 10.1093/database/baz032.

DOI:10.1093/database/baz032
PMID:30854563
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6409386/
Abstract

Clinical trial data are typically collected through multiple systems developed by different vendors using different technologies and data standards. That data need to be integrated, standardized and transformed for a variety of monitoring and reporting purposes. The need to process large volumes of often inconsistent data in the presence of ever-changing requirements poses a significant technical challenge. As part of a comprehensive clinical data repository, we have developed a data warehouse that integrates patient data from any source, standardizes it and makes it accessible to study teams in a timely manner to support a wide range of analytic tasks for both in-flight and completed studies. Our solution combines Apache HBase, a NoSQL column store, Apache Phoenix, a massively parallel relational query engine and a user-friendly interface to facilitate efficient loading of large volumes of data under incomplete or ambiguous specifications, utilizing an extract-load-transform design pattern that defers data mapping until query time. This approach allows us to maintain a single copy of the data and transform it dynamically into any desirable format without requiring additional storage. Changes to the mapping specifications can be easily introduced and multiple representations of the data can be made available concurrently. Further, by versioning the data and the transformations separately, we can apply historical maps to current data or current maps to historical data, which simplifies the maintenance of data cuts and facilitates interim analyses for adaptive trials. The result is a highly scalable, secure and redundant solution that combines the flexibility of a NoSQL store with the robustness of a relational query engine to support a broad range of applications, including clinical data management, medical review, risk-based monitoring, safety signal detection, post hoc analysis of completed studies and many others.

摘要

临床试验数据通常通过不同供应商使用不同技术和数据标准开发的多个系统收集。这些数据需要进行集成、标准化和转换,以满足各种监测和报告目的。在不断变化的需求下,需要处理大量通常不一致的数据,这带来了重大的技术挑战。作为综合临床数据存储库的一部分,我们开发了一个数据仓库,该仓库可以整合来自任何来源的患者数据,对其进行标准化,并及时提供给研究团队,以支持针对进行中和已完成研究的各种分析任务。我们的解决方案结合了 Apache HBase(一种 NoSQL 列式存储)、Apache Phoenix(一种大规模并行关系查询引擎)和用户友好的界面,以促进在不完整或模糊规范下高效加载大量数据,利用提取-加载-转换设计模式,直到查询时间才推迟数据映射。这种方法使我们能够维护数据的单一副本,并根据需要将其动态转换为任何所需格式,而无需额外的存储。可以轻松引入映射规范的更改,并同时提供数据的多个表示形式。此外,通过分别对数据和转换进行版本控制,我们可以将历史映射应用于当前数据或将当前映射应用于历史数据,从而简化数据切割的维护,并为适应性试验提供临时分析。结果是一个高度可扩展、安全且冗余的解决方案,它结合了 NoSQL 存储的灵活性和关系查询引擎的稳健性,以支持广泛的应用,包括临床数据管理、医学审查、基于风险的监测、安全信号检测、已完成研究的事后分析等。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b994/6409386/3d630be6ee56/baz032f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b994/6409386/720a57cd7ef8/baz032f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b994/6409386/d74765572c06/baz032f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b994/6409386/de60dec1b9b4/baz032f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b994/6409386/831a0d55dfc4/baz032f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b994/6409386/3d630be6ee56/baz032f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b994/6409386/720a57cd7ef8/baz032f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b994/6409386/d74765572c06/baz032f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b994/6409386/de60dec1b9b4/baz032f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b994/6409386/831a0d55dfc4/baz032f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b994/6409386/3d630be6ee56/baz032f5.jpg

相似文献

1
A late-binding, distributed, NoSQL warehouse for integrating patient data from clinical trials.用于整合临床试验中患者数据的后绑定、分布式、非关系型仓库。
Database (Oxford). 2019 Jan 1;2019. doi: 10.1093/database/baz032.
2
A dimensional warehouse for integrating operational data from clinical trials.临床试验运营数据集成的多维仓库。
Database (Oxford). 2019 Jan 1;2019. doi: 10.1093/database/baz039.
3
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
4
Examining database persistence of ISO/EN 13606 standardized electronic health record extracts: relational vs. NoSQL approaches.审视ISO/EN 13606标准化电子健康记录提取物的数据库持久性:关系型与非关系型方法对比
BMC Med Inform Decis Mak. 2017 Aug 18;17(1):123. doi: 10.1186/s12911-017-0515-4.
5
An adaptive spark-based framework for querying large-scale NoSQL and relational databases.一种适用于查询大规模 NoSQL 和关系型数据库的基于火花的自适应框架。
PLoS One. 2021 Aug 19;16(8):e0255562. doi: 10.1371/journal.pone.0255562. eCollection 2021.
6
[Standard technical specifications for methacholine chloride (Methacholine) bronchial challenge test (2023)].[氯化乙酰甲胆碱支气管激发试验标准技术规范(2023年)]
Zhonghua Jie He He Hu Xi Za Zhi. 2024 Feb 12;47(2):101-119. doi: 10.3760/cma.j.cn112147-20231019-00247.
7
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
8
Executing Complexity-Increasing Queries in Relational (MySQL) and NoSQL (MongoDB and EXist) Size-Growing ISO/EN 13606 Standardized EHR Databases.在关系型(MySQL)和非关系型(MongoDB和EXist)且规模不断增长的ISO/EN 13606标准化电子健康记录数据库中执行复杂度递增查询。
J Vis Exp. 2018 Mar 19(133):57439. doi: 10.3791/57439.
9
Evaluation of relational and NoSQL database architectures to manage genomic annotations.用于管理基因组注释的关系型和非关系型数据库架构评估。
J Biomed Inform. 2016 Dec;64:288-295. doi: 10.1016/j.jbi.2016.10.015. Epub 2016 Oct 31.
10
High dimensional biological data retrieval optimization with NoSQL technology.使用NoSQL技术进行高维生物数据检索优化
BMC Genomics. 2014;15 Suppl 8(Suppl 8):S3. doi: 10.1186/1471-2164-15-S8-S3. Epub 2014 Nov 13.

引用本文的文献

1
An Efficient Agent Based Data Management Method of NoSQL Environments for Health Care Applications.一种用于医疗保健应用的基于代理的高效NoSQL环境数据管理方法。
Healthcare (Basel). 2021 Mar 13;9(3):322. doi: 10.3390/healthcare9030322.
2
Xcellerate Investigator Portal: A New Web-Based Tool for Online Delivery of Central Laboratory Data, Reports, and Communications to Clinical Sites.加速研究者门户:一种基于网络的新工具,用于向临床站点在线提供中心实验室数据、报告和通信。
SLAS Technol. 2020 Oct;25(5):427-435. doi: 10.1177/2472630320942200. Epub 2020 Jul 29.
3
Insights from Adopting a Data Commons Approach for Large-scale Observational Cohort Studies: The California Teachers Study.

本文引用的文献

1
A dimensional warehouse for integrating operational data from clinical trials.临床试验运营数据集成的多维仓库。
Database (Oxford). 2019 Jan 1;2019. doi: 10.1093/database/baz039.
2
Risk-based Monitoring of Clinical Trials: An Integrative Approach.基于风险的临床试验监测:一种综合方法。
Clin Ther. 2018 Jul;40(7):1204-1212. doi: 10.1016/j.clinthera.2018.04.020. Epub 2018 Jul 4.
3
Quantifying and visualizing site performance in clinical trials.量化并可视化临床试验中的站点表现。
采用数据公有方法进行大规模观察性队列研究的见解:加利福尼亚教师研究。
Cancer Epidemiol Biomarkers Prev. 2020 Apr;29(4):777-786. doi: 10.1158/1055-9965.EPI-19-0842. Epub 2020 Feb 12.
4
A new risk and issue management system to improve productivity, quality, and compliance in clinical trials.一种新的风险与问题管理系统,旨在提高临床试验的生产力、质量和合规性。
JAMIA Open. 2019 Mar 19;2(2):216-221. doi: 10.1093/jamiaopen/ooz006. eCollection 2019 Jul.
5
A dimensional warehouse for integrating operational data from clinical trials.临床试验运营数据集成的多维仓库。
Database (Oxford). 2019 Jan 1;2019. doi: 10.1093/database/baz039.
6
A cross-source, system-agnostic solution for clinical data review.一种跨数据源、与系统无关的临床数据审查解决方案。
Database (Oxford). 2019 Jan 1;2019. doi: 10.1093/database/baz017.
Contemp Clin Trials Commun. 2018 Jan 31;9:108-114. doi: 10.1016/j.conctc.2018.01.005. eCollection 2018 Mar.
4
Robotic measurement of arm movements after stroke establishes biomarkers of motor recovery.机器人测量中风后手臂运动,建立运动恢复的生物标志物。
Stroke. 2014 Jan;45(1):200-4. doi: 10.1161/STROKEAHA.113.002296. Epub 2013 Dec 12.
5
Effective knowledge management in translational medicine.转化医学中的有效知识管理。
J Transl Med. 2010 Jul 19;8:68. doi: 10.1186/1479-5876-8-68.
6
Broadening access to electronic healthcare databases.扩大对电子医疗数据库的访问。
Nat Rev Drug Discov. 2010 Jan;9(1):84. doi: 10.1038/nrd2988-c1.
7
The BRIDG project: a technical report.BRIDG项目:一份技术报告。
J Am Med Inform Assoc. 2008 Mar-Apr;15(2):130-7. doi: 10.1197/jamia.M2556. Epub 2007 Dec 20.
8
Advanced biological and chemical discovery (ABCD): centralizing discovery knowledge in an inherently decentralized world.先进生物与化学发现(ABCD):在本质上分散的世界中集中发现知识。
J Chem Inf Model. 2007 Nov-Dec;47(6):1999-2014. doi: 10.1021/ci700267w. Epub 2007 Nov 1.
9
Data mining applications in healthcare.医疗保健中的数据挖掘应用。
J Healthc Inf Manag. 2005 Spring;19(2):64-72.
10
Stochastic proximity embedding.随机近似嵌入
J Comput Chem. 2003 Jul 30;24(10):1215-21. doi: 10.1002/jcc.10234.