
Pay attention to doctor-patient dialogues: Multi-modal knowledge graph attention image-text embedding for COVID-19 diagnosis.

Author Information

Zheng Wenbo, Yan Lan, Gou Chao, Zhang Zhi-Cheng, Jason Zhang Jun, Hu Ming, Wang Fei-Yue

Affiliations

School of Software Engineering, Xi'an Jiaotong University, Xi'an 710049, China.

State Key Laboratory for Management and Control of Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China.

Publication Information

Inf Fusion. 2021 Nov;75:168-185. doi: 10.1016/j.inffus.2021.05.015. Epub 2021 Jun 1.

Abstract

The sudden increase in coronavirus disease 2019 (COVID-19) cases puts heavy pressure on healthcare services worldwide. At this stage, fast, accurate, and early clinical assessment of disease severity is vital. In general, there are two issues to overcome: (1) current deep learning-based works suffer from multimodal data adequacy issues; (2) in this scenario, multimodal information (e.g., text, images) should be considered jointly to make accurate inferences. To address these challenges, we propose a multi-modal knowledge graph attention embedding for COVID-19 diagnosis. Our method not only learns relational embeddings from the nodes of a constructed knowledge graph but also has access to medical knowledge, aiming to improve classifier performance through a medical knowledge attention mechanism. The experimental results show that our approach significantly improves classification performance compared to other state-of-the-art techniques and is robust to each modality of the multi-modal data. Moreover, we construct a new COVID-19 multi-modal dataset based on text mining, consisting of 1393 doctor-patient dialogues about COVID-19 patients with their 3706 images (347 X-ray, 2598 CT, 761 ultrasound), 607 non-COVID-19 patient dialogues with their 10754 images (9658 X-ray, 494 CT, 761 ultrasound), and fine-grained labels for all of them. We hope this work can provide insights to researchers working in this area and shift attention from medical images alone to doctor-patient dialogues and their corresponding medical images.
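The abstract describes attending over knowledge-graph node embeddings with a fused image-text representation. The paper's actual architecture is not reproduced here; the following is only a minimal, generic sketch of that attention idea, in which `knowledge_attention`, the scaled dot-product scoring, and all tensor shapes are illustrative assumptions rather than the authors' implementation.

```python
import numpy as np

def softmax(x):
    """Numerically stable softmax over a 1-D score vector."""
    e = np.exp(x - x.max())
    return e / e.sum()

def knowledge_attention(node_embs, query):
    """Attend over knowledge-graph node embeddings with a query vector.

    node_embs: (N, d) array of node embeddings (e.g., medical concepts)
    query:     (d,) fused image-text embedding acting as the attention query
    Returns the attention-weighted knowledge context vector of shape (d,).
    """
    # Scaled dot-product scores between the query and each node
    scores = node_embs @ query / np.sqrt(len(query))
    # Attention distribution over the N knowledge nodes
    weights = softmax(scores)
    # Weighted sum of node embeddings = knowledge context
    return weights @ node_embs

# Toy usage: 4 hypothetical knowledge nodes with 8-dim embeddings
rng = np.random.default_rng(0)
nodes = rng.normal(size=(4, 8))
query = rng.normal(size=8)
context = knowledge_attention(nodes, query)
```

In a full model, the resulting context vector would typically be concatenated with the image-text embedding before the final classifier, so that graph-derived medical knowledge can reweight the diagnostic features.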

