Unit of Computer Systems and Bioinformatics, Department of Engineering, University Campus Bio-Medico of Rome, Italy; Department of Computer, Control, and Management Engineering, Sapienza University of Rome, Italy.
Unit of Computer Systems and Bioinformatics, Department of Engineering, University Campus Bio-Medico of Rome, Italy; Department of Radiation Sciences, Radiation Physics, Biomedical Engineering, Umeå, University, Umeå, Sweden.
Comput Biol Med. 2023 Mar;154:106625. doi: 10.1016/j.compbiomed.2023.106625. Epub 2023 Feb 2.
The COVID-19 pandemic has caused millions of cases and deaths and the AI-related scientific community, after being involved with detecting COVID-19 signs in medical images, has been now directing the efforts towards the development of methods that can predict the progression of the disease. This task is multimodal by its very nature and, recently, baseline results achieved on the publicly available AIforCOVID dataset have shown that chest X-ray scans and clinical information are useful to identify patients at risk of severe outcomes. While deep learning has shown superior performance in several medical fields, in most of the cases it considers unimodal data only. In this respect, when, which and how to fuse the different modalities is an open challenge in multimodal deep learning. To cope with these three questions here we present a novel approach optimizing the setup of a multimodal end-to-end model. It exploits Pareto multi-objective optimization working with a performance metric and the diversity score of multiple candidate unimodal neural networks to be fused. We test our method on the AIforCOVID dataset, attaining state-of-the-art results, not only outperforming the baseline performance but also being robust to external validation. Moreover, exploiting XAI algorithms we figure out a hierarchy among the modalities and we extract the features' intra-modality importance, enriching the trust on the predictions made by the model.
新冠疫情已导致数百万人感染和死亡,相关人工智能科学界在医学图像中检测新冠病毒迹象之后,现在正致力于开发能够预测疾病进展的方法。由于其本质的多模态性,最近在可公开获取的 AIforCOVID 数据集上取得的基线结果表明,胸部 X 光扫描和临床信息可用于识别有重症风险的患者。虽然深度学习在多个医学领域已经表现出了卓越的性能,但在大多数情况下,它仅考虑单模态数据。在这方面,何时、如何以及如何融合不同模态是多模态深度学习中的一个开放性挑战。为了解决这三个问题,我们在这里提出了一种新颖的方法,优化了端到端多模态模型的设置。它利用帕累托多目标优化,结合要融合的性能指标和多个候选单模态神经网络的多样性评分。我们在 AIforCOVID 数据集上测试了我们的方法,取得了最先进的结果,不仅超过了基线性能,而且对外部验证具有鲁棒性。此外,利用可解释性人工智能算法,我们确定了模态之间的层次结构,并提取了模态内特征的重要性,从而增强了对模型做出的预测的信任。