Department of Social Medicine and Public Health, Faculty of Public Health, Medical University of Plovdiv, Plovdiv, Bulgaria.
Bulgarian Association for Promotion of Education and Science, Institute for Rare Disease, Plovdiv, Bulgaria.
Front Public Health. 2023 Sep 15;11:1214766. doi: 10.3389/fpubh.2023.1214766. eCollection 2023.
Given the increased availability of data sources such as hospital information systems, electronic health records, and health-related registries, a novel approach is required to develop artificial intelligence-based decision support that can assist clinicians in their diagnostic decision-making and shorten rare disease patients' diagnostic odyssey. The aim is to identify key challenges in the process of mapping European rare disease databases, relevant to ML-based screening technologies in terms of organizational, FAIR and legal principles.
A scoping review was conducted based on the PRISMA-ScR checklist. The primary article search was conducted in three electronic databases (MEDLINE/Pubmed, Scopus, and Web of Science) and a secondary search was performed in Google scholar and on the organizations' websites. Each step of this review was carried out independently by two researchers. A charting form for relevant study analysis was developed and used to categorize data and identify data items in three domains - organizational, FAIR and legal.
At the end of the screening process, 73 studies were eligible for review based on inclusion and exclusion criteria with more than 60% ( = 46) of the research published in the last 5 years and originated only from EU/EEA countries. Over the ten-year period (2013-2022), there is a clear cycling trend in the publications, with a peak of challenges reporting every four years. Within this trend, the following dynamic was identified: except for 2016, organizational challenges dominated the articles published up to 2018; legal challenges were the most frequently discussed topic from 2018 to 2022. The following distribution of the data items by domains was observed - (1) organizational ( = 36): data accessibility and sharing (20.2%); long-term sustainability (18.2%); governance, planning and design (17.2%); lack of harmonization and standardization (17.2%); quality of data collection (16.2%); and privacy risks and small sample size (11.1%); (2) FAIR ( = 15): findable (17.9%); accessible sustainability (25.0%); interoperable (39.3%); and reusable (17.9%); and (3) legal ( = 33): data protection by all means (34.4%); data management and ownership (22.9%); research under GDPR and member state law (20.8%); trust and transparency (13.5%); and digitalization of health (8.3%). We observed a specific pattern repeated in all domains during the process of data charting and data item identification - in addition to the outlined challenges, good practices, guidelines, and recommendations were also discussed. The proportion of publications addressing only good practices, guidelines, and recommendations for overcoming challenges when mapping RD databases in at least one domain was calculated to be 47.9% ( = 35).
Despite the opportunities provided by innovation - automation, electronic health records, hospital-based information systems, biobanks, rare disease registries and European Reference Networks - the results of the current scoping review demonstrate a diversity of the challenges that must still be addressed, with immediate actions on ensuring better governance of rare disease registries, implementing FAIR principles, and enhancing the EU legal framework.
随着医院信息系统、电子健康记录和与健康相关的登记处等数据源的可用性增加,需要采用一种新方法来开发基于人工智能的决策支持,以帮助临床医生进行诊断决策,并缩短罕见病患者的诊断探索过程。目标是确定将欧洲罕见病数据库进行映射的过程中的关键挑战,这些挑战涉及到基于机器学习的筛选技术的组织、FAIR 和法律原则。
我们进行了基于 PRISMA-ScR 清单的范围综述。主要的文章搜索在三个电子数据库(MEDLINE/Pubmed、Scopus 和 Web of Science)中进行,并在 Google 学术和组织网站上进行了二次搜索。该综述的每个步骤均由两名研究人员独立进行。制定了一个图表形式的相关研究分析表,用于将数据分类,并在三个领域(组织、FAIR 和法律)中确定数据项。
在筛选过程结束时,根据纳入和排除标准,有 73 项研究符合审查条件,其中超过 60%( = 46)的研究是在过去 5 年内发表的,并且仅来自欧盟/欧洲经济区国家。在过去十年(2013-2022 年)中,出版物呈现出明显的循环趋势,每四年报告一次挑战的高峰。在这一趋势中,确定了以下动态:除了 2016 年,组织挑战主导了截至 2018 年发表的文章;自 2018 年至 2022 年,法律挑战是最常讨论的话题。按领域观察到的数据项分布如下:(1)组织( = 36):数据可访问性和共享(20.2%);长期可持续性(18.2%);治理、规划和设计(17.2%);缺乏协调和标准化(17.2%);数据收集质量(16.2%);以及隐私风险和小样本量(11.1%);(2)FAIR( = 15):可查找性(17.9%);可访问性可持续性(25.0%);互操作性(39.3%);和可重用性(17.9%);以及(3)法律( = 33):通过各种手段保护数据(34.4%);数据管理和所有权(22.9%);在 GDPR 和成员国法律下进行研究(20.8%);信任和透明度(13.5%);以及健康数字化(8.3%)。我们在数据图表和数据项识别过程中观察到了一个在所有领域中重复出现的特定模式——除了已概述的挑战,还讨论了良好实践、准则和建议。计算出在至少一个领域中映射 RD 数据库时,仅解决挑战的良好实践、准则和建议的出版物比例为 47.9%( = 35)。
尽管创新提供了机会——自动化、电子健康记录、医院信息系统、生物库、罕见病登记处和欧洲参考网络——但当前范围综述的结果表明,仍有许多挑战需要解决,需要立即采取行动,确保更好地治理罕见病登记处,实施 FAIR 原则,并加强欧盟法律框架。