Center of Data and Knowledge Integration for Health (Cidacs), Fiocruz-Brazil, Parque Tecnológico da Edf, Tecnocentro, R. Mundo, Salvador, BA 41745-715, Brazil.
Shaukat Khanum Memorial Cancer Hospital and Research Centre, Johar Town, Lahore, 54840, Pakistan.
J Am Med Inform Assoc. 2023 Mar 16;30(4):643-655. doi: 10.1093/jamia/ocac180.
The aim of this work is to demonstrate the use of a standardized health informatics framework to generate reliable and reproducible real-world evidence from Latin America and South Asia towards characterizing coronavirus disease 2019 (COVID-19) in the Global South.
Patient-level COVID-19 records collected in a patient self-reported notification system, hospital in-patient and out-patient records, and community diagnostic labs were harmonized to the Observational Medical Outcomes Partnership common data model and analyzed using a federated network analytics framework. Clinical characteristics of individuals tested for, diagnosed with or tested positive for, hospitalized with, admitted to intensive care unit with, or dying with COVID-19 were estimated.
Two COVID-19 databases covering 8.3 million people from Pakistan and 2.6 million people from Bahia, Brazil were analyzed. 109 504 (Pakistan) and 921 (Brazil) medical concepts were harmonized to Observational Medical Outcomes Partnership common data model. In total, 341 505 (4.1%) people in the Pakistan dataset and 1 312 832 (49.2%) people in the Brazilian dataset were tested for COVID-19 between January 1, 2020 and April 20, 2022, with a median [IQR] age of 36 [25, 76] and 38 (27, 50); 40.3% and 56.5% were female in Pakistan and Brazil, respectively. 1.2% percent individuals in the Pakistan dataset had Afghan ethnicity. In Brazil, 52.3% had mixed ethnicity. In agreement with international findings, COVID-19 outcomes were more severe in men, elderly, and those with underlying health conditions.
COVID-19 data from 2 large countries in the Global South were harmonized and analyzed using a standardized health informatics framework developed by an international community of health informaticians. This proof-of-concept study demonstrates a potential open science framework for global knowledge mobilization and clinical translation for timely response to healthcare needs in pandemics and beyond.
本研究旨在展示使用标准化的健康信息学框架从拉丁美洲和南亚生成可靠和可重复的真实世界证据,以描述全球南方的 2019 年冠状病毒病(COVID-19)。
从患者自我报告通知系统、医院门诊和住院记录以及社区诊断实验室中收集的患者水平 COVID-19 记录被协调到观察性医疗结局伙伴关系通用数据模型,并使用联邦网络分析框架进行分析。估计了接受 COVID-19 检测、诊断、检测呈阳性、住院、入住重症监护病房或死于 COVID-19 的个体的临床特征。
分析了来自巴基斯坦的 830 万人和巴西巴伊亚州的 260 万人的两个 COVID-19 数据库。109504 个(巴基斯坦)和 921 个(巴西)医学概念被协调到观察性医疗结局伙伴关系通用数据模型。在巴基斯坦数据集,共有 341505 人(4.1%)和巴西数据集 1312832 人(49.2%)在 2020 年 1 月 1 日至 2022 年 4 月 20 日期间接受了 COVID-19 检测,中位年龄为 36 [25,76]和 38(27,50);巴基斯坦和巴西分别有 40.3%和 56.5%的女性。巴基斯坦数据集有 1.2%的个体具有阿富汗民族。在巴西,52.3%的人具有混合民族。与国际研究结果一致,COVID-19 结局在男性、老年人和有基础疾病的人群中更为严重。
使用国际健康信息学家社区开发的标准化健康信息学框架对来自全球南方两个大国的 COVID-19 数据进行了协调和分析。这一概念验证研究展示了一个潜在的开放科学框架,用于全球知识动员和临床转化,以应对大流行及以后的医疗保健需求。