Wang Yuanlong, Yin Changchang, Zhang Ping
Department of Computer Science and Engineering, The Ohio State University, Columbus, OH 43210, USA.
Department of Biomedical Informatics, The Ohio State University, Columbus, OH 43210, USA.
Heliyon. 2024 Feb 28;10(5):e26772. doi: 10.1016/j.heliyon.2024.e26772. eCollection 2024 Mar 15.
The broad adoption of electronic health record (EHR) systems brings a tremendous amount of clinical data and thus provides opportunities to conduct data-driven healthcare research to solve various clinical problems in the medical domain. Machine learning and deep learning methods are widely used in medical informatics and healthcare because of their power to mine insights from raw data. When adapting deep learning models to EHR data, it is essential to consider their heterogeneous nature: EHRs contain patient records from various sources, including medical tests (e.g., blood tests, microbiology tests), medical imaging, diagnoses, medications, procedures, and clinical notes. Together, these modalities provide a holistic view of patient health status and complement one another. Combining data from intrinsically different modalities is therefore challenging but intuitively promising for deep learning on EHRs. To assess the potential of multimodal data, we introduce a comprehensive fusion framework designed to integrate temporal variables, medical images, and clinical notes in EHRs for enhanced clinical risk prediction. Early, joint, and late fusion strategies are employed to combine data from the various modalities effectively. We evaluate the model on three predictive tasks: in-hospital mortality, long length of stay, and 30-day readmission. Experimental results show that multimodal models outperform uni-modal models on all three tasks. Additionally, by training models with different combinations of input modalities, we compute the Shapley value of each modality to quantify its contribution to multimodal performance. The results show that temporal variables tend to be more helpful than chest X-ray (CXR) images and clinical notes across the three predictive tasks.
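To make the three fusion strategies concrete, the following is a minimal PyTorch sketch of how early, joint, and late fusion differ for the three modalities (temporal variables, CXR images, clinical notes). The class name, feature dimensions, and MLP encoders are illustrative assumptions only, not the paper's actual backbones.

```python
import torch
import torch.nn as nn

class FusionModel(nn.Module):
    """Three-modality risk predictor illustrating early/joint/late fusion.

    Dimensions and architectures are hypothetical stand-ins for the paper's
    backbones (e.g. a recurrent encoder for temporal variables, a CNN for
    CXR images, a text encoder for clinical notes).
    """

    def __init__(self, d_ts=76, d_img=512, d_txt=768, d_hid=128, strategy="joint"):
        super().__init__()
        self.strategy = strategy
        if strategy == "early":
            # Early fusion: concatenate raw modality features, encode jointly.
            self.head = nn.Sequential(
                nn.Linear(d_ts + d_img + d_txt, d_hid), nn.ReLU(), nn.Linear(d_hid, 1)
            )
            return
        # Joint and late fusion first encode each modality separately.
        self.enc_ts = nn.Sequential(nn.Linear(d_ts, d_hid), nn.ReLU())
        self.enc_img = nn.Sequential(nn.Linear(d_img, d_hid), nn.ReLU())
        self.enc_txt = nn.Sequential(nn.Linear(d_txt, d_hid), nn.ReLU())
        if strategy == "joint":
            # Joint fusion: concatenate learned representations, one shared head.
            self.head = nn.Linear(3 * d_hid, 1)
        else:
            # Late fusion: one classifier per modality; average the logits.
            self.heads = nn.ModuleList(nn.Linear(d_hid, 1) for _ in range(3))

    def forward(self, x_ts, x_img, x_txt):
        if self.strategy == "early":
            return self.head(torch.cat([x_ts, x_img, x_txt], dim=-1))
        h = [self.enc_ts(x_ts), self.enc_img(x_img), self.enc_txt(x_txt)]
        if self.strategy == "joint":
            return self.head(torch.cat(h, dim=-1))
        return torch.stack([f(z) for f, z in zip(self.heads, h)]).mean(dim=0)

# One risk logit per patient, e.g. in-hospital mortality for a batch of 4.
model = FusionModel(strategy="late")
logits = model(torch.randn(4, 76), torch.randn(4, 512), torch.randn(4, 768))
```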
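The modality-level Shapley attribution can likewise be sketched: treat each modality as a player and a model's validation metric on a modality subset as the coalition value, then average each modality's marginal contribution over all subsets with the exact Shapley weighting. The function name, modality keys, and AUROC numbers below are hypothetical placeholders, not results from the paper.

```python
from itertools import combinations
from math import factorial

def shapley_values(performance, modalities):
    """Exact Shapley decomposition of multimodal performance.

    `performance` maps each frozenset of modalities (including the empty
    set, e.g. a no-skill baseline) to a validation metric such as AUROC.
    """
    n = len(modalities)
    phi = {}
    for m in modalities:
        others = [x for x in modalities if x != m]
        total = 0.0
        for k in range(n):
            for coal in combinations(others, k):
                s = frozenset(coal)
                # Shapley weight: |S|! (n - |S| - 1)! / n!
                weight = factorial(k) * factorial(n - k - 1) / factorial(n)
                total += weight * (performance[s | {m}] - performance[s])
        phi[m] = total
    return phi

# Hypothetical AUROC values for illustration only (not the paper's results).
auroc = {
    frozenset(): 0.50,  # no-information baseline
    frozenset({"ts"}): 0.84,
    frozenset({"cxr"}): 0.74,
    frozenset({"note"}): 0.78,
    frozenset({"ts", "cxr"}): 0.86,
    frozenset({"ts", "note"}): 0.87,
    frozenset({"cxr", "note"}): 0.80,
    frozenset({"ts", "cxr", "note"}): 0.88,
}
print(shapley_values(auroc, ["ts", "cxr", "note"]))
```

By construction, the per-modality values sum to the gap between the full three-modality model and the baseline, so they directly quantify each modality's share of the multimodal gain.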