Ben-Miled Zina, Shebesh Jacob A, Su Jing, Dexter Paul R, Grout Randall W, Boustani Malaz A
Phillip M. Drayer Department of Electrical and Computer Engineering, Lamar University, Cherry Building, Beaumont, TX 77705, USA.
Department of Electrical and Computer Engineering, School of Engineering and Technology, Indiana University Purdue University at Indianapolis, 723 W. Michigan Street, Indianapolis, IN 46202, USA.
Information (Basel). 2025 Jan;16(1). doi: 10.3390/info16010054. Epub 2025 Jan 15.
Electronic health records (EHR) are now widely available in healthcare institutions to document the medical history of patients as they interact with healthcare services. In particular, routine care EHR data are collected for a large number of patients. These data span multiple heterogeneous elements (i.e., demographics, diagnosis, medications, clinical notes, vital signs, and laboratory results) which contain semantic, concept, and temporal information. Recent advances in generative learning techniques were able to leverage the fusion of multiple routine care EHR data elements to enhance clinical decision support.
A scoping review of the proposed techniques including fusion architectures, input data elements, and application areas is needed to synthesize variances and identify research gaps that can promote re-use of these techniques for new clinical outcomes.
A comprehensive literature search was conducted using Google Scholar to identify high impact fusion architectures over multi-modal routine care EHR data during the period 2018 to 2023. The guidelines from the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) extension for scoping review were followed. The findings were derived from the selected studies using a thematic and comparative analysis.
The scoping review revealed the lack of standard definition for EHR data elements as they are transformed into input modalities. These definitions ignore one or more key characteristics of the data including source, encoding scheme, and concept level. Moreover, in order to adapt to emergent generative learning techniques, the classification of fusion architectures should distinguish fusion from learning and take into consideration that learning can concurrently happen in all three layers of new fusion architectures (i.e., encoding, representation, and decision). These aspects constitute the first step towards a streamlined approach to the design of multi-modal fusion architectures for routine care EHR data. In addition, current pretrained encoding models are inconsistent in their handling of temporal and semantic information thereby hindering their re-use for different applications and clinical settings.
Current routine care EHR fusion architectures mostly follow a design-by-example methodology. Guidelines are needed for the design of efficient multi-modal models for a broad range of healthcare applications. In addition to promoting re-use, these guidelines need to outline best practices for combining multiple modalities while leveraging transfer learning and co-learning as well as semantic and temporal encoding.
电子健康记录(EHR)如今在医疗机构中广泛可用,用于记录患者在接受医疗服务时的病史。特别是,大量患者的常规护理EHR数据被收集。这些数据涵盖多个异质元素(即人口统计学、诊断、药物、临床记录、生命体征和实验室结果),其中包含语义、概念和时间信息。生成学习技术的最新进展能够利用多个常规护理EHR数据元素的融合来增强临床决策支持。
需要对包括融合架构、输入数据元素和应用领域在内的所提出技术进行范围综述,以综合差异并识别可促进这些技术用于新临床结果的研究差距。
使用谷歌学术进行全面的文献检索,以识别2018年至2023年期间对多模式常规护理EHR数据有高影响力的融合架构。遵循PRISMA(系统评价和元分析的首选报告项目)扩展版的范围综述指南。通过主题分析和比较分析从选定的研究中得出结果。
范围综述揭示,EHR数据元素在转换为输入模式时缺乏标准定义。这些定义忽略了数据的一个或多个关键特征,包括来源、编码方案和概念级别。此外,为了适应新兴的生成学习技术,融合架构的分类应区分融合与学习,并考虑到学习可以在新融合架构的所有三层(即编码、表示和决策)中同时发生。这些方面构成了为常规护理EHR数据设计多模式融合架构的简化方法的第一步。此外,当前的预训练编码模型在处理时间和语义信息方面不一致,从而阻碍了它们在不同应用和临床环境中的重用。
当前的常规护理EHR融合架构大多遵循示例设计方法。需要为广泛的医疗保健应用设计高效多模式模型的指南。除了促进重用之外,这些指南还需要概述在利用迁移学习和协同学习以及语义和时间编码时组合多种模式的最佳实践。