EAPR：用于疾病预测的可解释且增强的患者表征学习

EAPR: explainable and augmented patient representation learning for disease prediction.

作者信息

Zhang Jiancheng, Xu Yonghui, Ye Bicui, Zhao Yibowen, Sun Xiaofang, Meng Qi, Zhang Yang, Cui Lizhen

机构信息

Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Jinan, China.

School of Software, Shandong University, Jinan, China.

出版信息

Health Inf Sci Syst. 2023 Nov 14;11(1):53. doi: 10.1007/s13755-023-00256-5. eCollection 2023 Dec.

DOI:10.1007/s13755-023-00256-5

PMID:37974902

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10645955/

Abstract

Patient representation learning aims to encode meaningful information about the patient's Electronic Health Records (EHR) in the form of a mathematical representation. Recent advances in deep learning have empowered Patient representation learning methods with greater representational power, allowing the learned representations to significantly improve the performance of disease prediction models. However, the inherent shortcomings of deep learning models, such as the need for massive amounts of labeled data and inexplicability, limit the performance of deep learning-based Patient representation learning methods to further improvements. In particular, learning robust patient representations is challenging when patient data is missing or insufficient. Although data augmentation techniques can tackle this deficiency, the complex data processing further weakens the inexplicability of patient representation learning models. To address the above challenges, this paper proposes an Explainable and Augmented Patient Representation Learning for disease prediction (EAPR). EAPR utilizes data augmentation controlled by confidence interval to enhance patient representation in the presence of limited patient data. Moreover, EAPR proposes to use two-stage gradient backpropagation to address the problem of unexplainable patient representation learning models due to the complex data enhancement process. The experimental results on real clinical data validate the effectiveness and explainability of the proposed approach.

摘要

患者表征学习旨在以数学表征的形式对患者电子健康记录（EHR）中的有意义信息进行编码。深度学习的最新进展赋予了患者表征学习方法更强的表征能力，使学习到的表征能够显著提高疾病预测模型的性能。然而，深度学习模型的固有缺点，如需要大量标记数据和难以解释性，限制了基于深度学习的患者表征学习方法性能的进一步提升。特别是，当患者数据缺失或不足时，学习鲁棒的患者表征具有挑战性。尽管数据增强技术可以解决这一缺陷，但复杂的数据处理进一步削弱了患者表征学习模型的可解释性。为了应对上述挑战，本文提出了一种用于疾病预测的可解释增强患者表征学习（EAPR）方法。EAPR利用由置信区间控制的数据增强，在患者数据有限的情况下增强患者表征。此外，EAPR提出使用两阶段梯度反向传播来解决由于复杂的数据增强过程导致的患者表征学习模型难以解释的问题。在真实临床数据上的实验结果验证了所提方法的有效性和可解释性。

相似文献

EAPR: explainable and augmented patient representation learning for disease prediction.

Health Inf Sci Syst. 2023 Nov 14;11(1):53. doi: 10.1007/s13755-023-00256-5. eCollection 2023 Dec.

Deep representation learning of patient data from Electronic Health Records (EHR): A systematic review.

J Biomed Inform. 2021 Mar;115:103671. doi: 10.1016/j.jbi.2020.103671. Epub 2020 Dec 31.

Treatment effect prediction with adversarial deep learning using electronic health records.

BMC Med Inform Decis Mak. 2020 Dec 14;20(Suppl 4):139. doi: 10.1186/s12911-020-01151-9.

Learning explainable task-relevant state representation for model-free deep reinforcement learning.

Neural Netw. 2024 Dec;180:106741. doi: 10.1016/j.neunet.2024.106741. Epub 2024 Sep 20.

Explainable automated coding of clinical notes using hierarchical label-wise attention networks and label embedding initialisation.

J Biomed Inform. 2021 Apr;116:103728. doi: 10.1016/j.jbi.2021.103728. Epub 2021 Mar 9.

DeepMPM: a mortality risk prediction model using longitudinal EHR data.

BMC Bioinformatics. 2022 Oct 14;23(1):423. doi: 10.1186/s12859-022-04975-6.

RAHM: Relation augmented hierarchical multi-task learning framework for reasonable medication stocking.

J Biomed Inform. 2020 Aug;108:103502. doi: 10.1016/j.jbi.2020.103502. Epub 2020 Jul 14.

A multi-filter deep transfer learning framework for image-based autism spectrum disorder detection.

Sci Rep. 2025 Apr 24;15(1):14253. doi: 10.1038/s41598-025-97708-7.

Universal representation learning for multivariate time series using the instance-level and cluster-level supervised contrastive learning.

Data Min Knowl Discov. 2024 May;38(3):1493-1519. doi: 10.1007/s10618-024-01006-1. Epub 2024 Feb 9.

Deep representation learning for individualized treatment effect estimation using electronic health records.

J Biomed Inform. 2019 Dec;100:103303. doi: 10.1016/j.jbi.2019.103303. Epub 2019 Oct 11.

本文引用的文献

Enriching representation learning using 53 million patient notes through human phenotype ontology embedding.

Artif Intell Med. 2023 May;139:102523. doi: 10.1016/j.artmed.2023.102523. Epub 2023 Feb 28.

HealthNet: A Health Progression Network via Heterogeneous Medical Information Fusion.

IEEE Trans Neural Netw Learn Syst. 2023 Oct;34(10):6940-6954. doi: 10.1109/TNNLS.2022.3202305. Epub 2023 Oct 5.

Patient-centric characterization of multimorbidity trajectories in patients with severe mental illnesses: A temporal bipartite network modeling approach.

J Biomed Inform. 2022 Mar;127:104010. doi: 10.1016/j.jbi.2022.104010. Epub 2022 Feb 11.

Deep learning for temporal data representation in electronic health records: A systematic review of challenges and methodologies.

J Biomed Inform. 2022 Feb;126:103980. doi: 10.1016/j.jbi.2021.103980. Epub 2021 Dec 30.

Deep transfer learning and data augmentation improve glucose levels prediction in type 2 diabetes patients.

NPJ Digit Med. 2021 Jul 14;4(1):109. doi: 10.1038/s41746-021-00480-x.

PSSPNN: PatchShuffle Stochastic Pooling Neural Network for an Explainable Diagnosis of COVID-19 with Multiple-Way Data Augmentation.

Comput Math Methods Med. 2021 Mar 8;2021:6633755. doi: 10.1155/2021/6633755. eCollection 2021.

Bidirectional Representation Learning From Transformers Using Multimodal Electronic Health Record Data to Predict Depression.

IEEE J Biomed Health Inform. 2021 Aug;25(8):3121-3129. doi: 10.1109/JBHI.2021.3063721. Epub 2021 Aug 5.

Universal Physiological Representation Learning With Soft-Disentangled Rateless Autoencoders.

IEEE J Biomed Health Inform. 2021 Aug;25(8):2928-2937. doi: 10.1109/JBHI.2021.3062335. Epub 2021 Aug 5.

Language models are an effective representation learning technique for electronic health record data.

J Biomed Inform. 2021 Jan;113:103637. doi: 10.1016/j.jbi.2020.103637. Epub 2020 Dec 5.

Temporal tree representation for similarity computation between medical patients.

Artif Intell Med. 2020 Aug;108:101900. doi: 10.1016/j.artmed.2020.101900. Epub 2020 Jun 11.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

EAPR：用于疾病预测的可解释且增强的患者表征学习

EAPR: explainable and augmented patient representation learning for disease prediction.

作者信息

Zhang Jiancheng, Xu Yonghui, Ye Bicui, Zhao Yibowen, Sun Xiaofang, Meng Qi, Zhang Yang, Cui Lizhen

机构信息

Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Jinan, China.

School of Software, Shandong University, Jinan, China.

出版信息

Health Inf Sci Syst. 2023 Nov 14;11(1):53. doi: 10.1007/s13755-023-00256-5. eCollection 2023 Dec.

DOI:10.1007/s13755-023-00256-5

PMID:37974902

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10645955/

Abstract

摘要

EAPR：用于疾病预测的可解释且增强的患者表征学习

EAPR: explainable and augmented patient representation learning for disease prediction.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

EAPR：用于疾病预测的可解释且增强的患者表征学习

EAPR: explainable and augmented patient representation learning for disease prediction.

作者信息

机构信息

出版信息