Core Facility Digital Medicine and Interoperability, Berlin Institute of Health (BIH) at Charité - Universitätsmedizin Berlin, Anna-Louisa-Karsch-Str.2, 10178, Berlin, Germany.
J Med Syst. 2023 Nov 14;47(1):115. doi: 10.1007/s10916-023-02012-4.
The COVID-19 pandemic has led to tremendous investment in clinical studies to generate much-needed knowledge on the prevention, diagnosis, treatment and long-term effects of the disease. Case report forms, comprised of questions and answers (variables), are commonly used to collect data in clinical trials. Maximizing the value of study data depends on data quality and on the ability to easily pool and share data from several sources. ISARIC, in collaboration with the WHO, has created a case report form that is available for use by the scientific community to collect COVID-19 trial data. One of such research initiatives collecting and analyzing multi-country and multi-cohort COVID-19 study data is the Horizon 2020 project ORCHESTRA. Following the ISO/TS 21564:2019 standard, a mapping between five ORCHESTRA studies' variables and the ISARIC Freestanding Follow-Up Survey elements was created. Measures of correspondence of shared semantic domain of 0 (perfect match), 1 (fully inclusive match), 2 (partial match), 4 (transformation required) or 4* (not present in ORCHESTRA) as compared to the target code system, ORCHESTRA study variables, were assigned to each of the elements in the ISARIC FUP case report form (CRF) which was considered the source code system. Of the ISARIC FUP CRF's variables, around 34% were found to show an exact match with corresponding variables in ORCHESTRA studies and about 33% showed a non-inclusive overlap. Matching variables provided information on patient demographics, COVID-19 testing, hospital admission and symptoms. More in-depth details are covered in ORCHESTRA variables with regards to treatment and comorbidities. ORCHESTRA's Long-Term Sequelae and Fragile population studies' CRFs include 32 and 27 variables respectively which were evaluated as a perfect match to variables in the ISARIC FUP CRF. Our study serves as an example of the kind of maps between case report form variables from different research projects needed to link ongoing COVID-19 research efforts and facilitate collaboration and data sharing. To enable data aggregation across two data systems, the information they contain needs to be connected through a map to determine compatibility and transformation needs. Combining data from various clinical studies can increase the power of analytical insights.
新型冠状病毒肺炎(COVID-19)大流行促使人们对临床研究进行了大量投资,以获取有关该疾病预防、诊断、治疗和长期影响的急需知识。病例报告表由问题和答案(变量)组成,通常用于收集临床试验数据。最大限度地提高研究数据的价值取决于数据质量以及轻松汇集和共享来自多个来源的数据的能力。ISARIC 与世界卫生组织(WHO)合作创建了一个病例报告表,供科学界用于收集 COVID-19 试验数据。正在收集和分析多国多队列 COVID-19 研究数据的此类研究计划之一是欧盟 Horizon 2020 项目 ORCHESTRA。根据 ISO/TS 21564:2019 标准,创建了五个 ORCHESTRA 研究变量与 ISARIC 独立随访调查(Freestanding Follow-Up Survey)元素之间的映射关系。与目标代码系统相比,共享语义域的匹配度为 0(完全匹配)、1(完全包含匹配)、2(部分匹配)、4(需要转换)或 4*(在 ORCHESTRA 中不存在),并将这些匹配度分配给 ISARIC 随访调查病例报告表(CRF)的每个元素,ISARIC 随访调查病例报告表被视为源代码系统。在 ISARIC 随访调查 CRF 的变量中,约 34%的变量与 ORCHESTRA 研究中的对应变量完全匹配,约 33%的变量具有非包容性重叠。匹配变量提供了有关患者人口统计学、COVID-19 检测、住院和症状的信息。ORCHESTRA 变量提供了关于治疗和合并症的更详细信息。ORCHESTRA 的长期后遗症和脆弱人群研究的 CRF 分别包含 32 个和 27 个变量,这些变量被评估为与 ISARIC 随访调查 CRF 中的变量完全匹配。我们的研究为不同研究项目之间的病例报告表变量之间的映射提供了一个示例,这些映射对于链接正在进行的 COVID-19 研究工作以及促进协作和数据共享是必要的。为了能够在两个数据系统之间进行数据汇总,需要通过映射连接它们所包含的信息,以确定兼容性和转换需求。合并来自各种临床研究的数据可以增强分析见解的效力。