Lee Seungwon, Li Bing, Martin Elliot A, D'Souza Adam G, Jiang Jason, Doktorchik Chelsea, Southern Danielle A, Lee Joon, Wiebe Natalie, Quan Hude, Eastwood Cathy A
Centre for Health Informatics, University of Calgary, Calgary, Alberta, Canada.
Department of Community Health Sciences, University of Calgary, Calgary, Alberta, Canada.
CJC Open. 2020 Dec 27;3(5):639-645. doi: 10.1016/j.cjco.2020.12.019. eCollection 2021 May.
The initiatives of precision medicine and learning health systems require databases with rich and accurately captured data on patient characteristics. We introduce the linical gistry, dminisrative Data and lectronic Medical Records (CREATE) database, which includes linked data from 4 population databases: lberta rovincial oject for utcome ssessment in oronary eart Disease (APPROACH; a national clinical registry), Sunrise Clinical Manager (SCM) electronic medical record (city-wide), the Discharge Abstract Database (DAD), and the National Ambulatory Care Reporting System (NACRS). The intent of this work is to introduce a cardiovascular-specific database for pursuing precision health activities using big data analytics.
We used deterministic data linkage to link SCM electronic medical record data to APPROACH clinical registry data using patient identifier variables. The APPROACH-SCM data set was subsequently linked to DAD and NACRS to obtain inpatient and outpatient cohort data. We further validated the quality of the linkage, where applicable, in these databases by comparing against the Alberta Health Insurance Care Plan registry database.
We achieved 99.96% linkage across these 4 databases. Currently, there are 30,984 patients with 35,753 catheterizations in the CREATE database. The inpatient cohort contained 65.75% (20,373/30,984) of the patient sample, whereas the outpatient cohort contained 29.78% (9226/30,984). The infrastructure and the process to update and expand the database has been established.
CREATE is intended to serve as a database for supporting big data analytics activities surrounding cardiac precision health. The CREATE database will be managed by the Centre for Health Informatics at the University of Calgary, and housed in a secure high-performance computing environment.
精准医学和学习型健康系统的倡议需要包含丰富且准确获取的患者特征数据的数据库。我们引入了临床注册、行政数据和电子病历(CREATE)数据库,该数据库包含来自4个群体数据库的链接数据:阿尔伯塔省冠状动脉疾病结局评估项目(APPROACH;一个国家级临床注册库)、Sunrise临床管理器(SCM)电子病历(全市范围)、出院摘要数据库(DAD)以及国家门诊护理报告系统(NACRS)。这项工作的目的是引入一个心血管疾病专用数据库,用于利用大数据分析开展精准健康活动。
我们使用确定性数据链接,通过患者标识符变量将SCM电子病历数据与APPROACH临床注册数据相链接。随后,将APPROACH - SCM数据集与DAD和NACRS相链接,以获取住院和门诊队列数据。在适用的情况下,我们通过与阿尔伯塔省健康保险护理计划注册数据库进行比较,进一步验证了这些数据库中链接的质量。
我们在这4个数据库之间实现了99.96%的链接率。目前,CREATE数据库中有30984名患者,进行了35753次导管插入术。住院队列包含患者样本的65.75%(20373/30984),而门诊队列包含29.78%(9226/30984)。已经建立了更新和扩展数据库的基础设施及流程。
CREATE旨在作为一个数据库,支持围绕心脏精准健康的大数据分析活动。CREATE数据库将由卡尔加里大学健康信息学中心管理,并存储在安全的高性能计算环境中。