Hooshafza Sepideh, Orlandi Fabrizio, Flynn Rachel, McQuaid Louise, Stephens Gaye, O'Connor Laura
Health Information and Quality Authority (HIQA), Cork, Ireland.
The SFI ADAPT Research Centre for AI-Driven Digital Content Technology, School of Computer Science and Statistics, Trinity College Dublin, Dublin, Ireland.
HRB Open Res. 2022 Aug 2;4:101. doi: 10.12688/hrbopenres.13403.2. eCollection 2021.
The benefits of having high-quality healthcare data are well established. However, high-dimensionality and irregularity of healthcare data pose challenges in their management. Knowledge graphs have gained increasing popularity in many domains, as a method for representing data to overcome such challenges. However, little is known about their suitability for use with healthcare data. One important factor in representing data is "time". Data with time related attributes are considered, temporal data. Temporal data are frequently observed in healthcare and the management of rapidly changing patient data is an ongoing challenge. Traditionally, data models have focused on presenting static data and do not account for temporal data. Temporal data models ensure time consistency in data models and assist analysing the history of data and predicting the future trends in data. Knowledge graphs can include temporal data models and are therefore of interest to the field of healthcare data management. As such, the herein aim is to outline a protocol for an inter-disciplinary systematic review of approaches, applications and challenges in modelling temporal data in knowledge graphs so that we can inform the application of knowledge graphs to healthcare data. The research questions is, what are the existing approaches in modelling temporal data in RDF based knowledge graphs. Two sub-questions on applications, and challenges will also be evaluated. ACM digital library, IEEE Xplore and Scopus will be searched for this review. The search will be limited to peer-reviewed literature referring to knowledge graphs based on Resource Description Framework (RDF). A narrative synthesis of the papers will be conducted. The findings of this systematic review will be useful for data engineers to better represent data and perform analytics through temporal data modelling. They can be applied in the context of healthcare data and the current challenges faced in managing rapidly changing patient data.
高质量医疗数据的益处已得到充分证实。然而,医疗数据的高维度和不规则性给其管理带来了挑战。知识图谱作为一种克服此类挑战的数据表示方法,在许多领域越来越受欢迎。然而,对于它们是否适用于医疗数据,人们了解甚少。表示数据的一个重要因素是“时间”。具有时间相关属性的数据被视为时态数据。时态数据在医疗领域经常出现,快速变化的患者数据管理一直是一项挑战。传统上,数据模型侧重于呈现静态数据,并未考虑时态数据。时态数据模型可确保数据模型中的时间一致性,并有助于分析数据历史和预测数据未来趋势。知识图谱可以包含时态数据模型,因此在医疗数据管理领域受到关注。因此,本文的目的是概述一个跨学科系统综述的方案,以探讨知识图谱中时态数据建模的方法、应用和挑战,从而为知识图谱在医疗数据中的应用提供参考。研究问题是,在基于RDF的知识图谱中,时态数据建模的现有方法有哪些。还将评估关于应用和挑战的两个子问题。本次综述将检索ACM数字图书馆、IEEE Xplore和Scopus。检索将限于基于资源描述框架(RDF)的知识图谱的同行评审文献。将对这些论文进行叙述性综合分析。本次系统综述的结果将有助于数据工程师通过时态数据建模更好地表示数据并进行分析。它们可应用于医疗数据背景以及管理快速变化的患者数据时面临的当前挑战。