Suppr超能文献

DF-DM:人工智能时代多模态数据融合的基础过程模型。

DF-DM: A foundational process model for multimodal data fusion in the artificial intelligence era.

作者信息

Restrepo David, Wu Chenwei, Vásquez-Venegas Constanza, Nakayama Luis Filipe, Celi Leo Anthony, López Diego M

机构信息

Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America.

Departamento de Telemática, Universidad del Cauca, Popayán, Cauca, Colombia.

出版信息

Res Sq. 2024 Apr 23:rs.3.rs-4277992. doi: 10.21203/rs.3.rs-4277992/v1.

Abstract

In the big data era, integrating diverse data modalities poses significant challenges, particularly in complex fields like healthcare. This paper introduces a new process model for multimodal Data Fusion for Data Mining, integrating embeddings and the Cross-Industry Standard Process for Data Mining with the existing Data Fusion Information Group model. Our model aims to decrease computational costs, complexity, and bias while improving efficiency and reliability. We also propose "disentangled dense fusion," a novel embedding fusion method designed to optimize mutual information and facilitate dense inter-modality feature interaction, thereby minimizing redundant information. We demonstrate the model's efficacy through three use cases: predicting diabetic retinopathy using retinal images and patient metadata, domestic violence prediction employing satellite imagery, internet, and census data, and identifying clinical and demographic features from radiography images and clinical notes. The model achieved a Macro F1 score of 0.92 in diabetic retinopathy prediction, an R-squared of 0.854 and sMAPE of 24.868 in domestic violence prediction, and a macro AUC of 0.92 and 0.99 for disease prediction and sex classification, respectively, in radiological analysis. These results underscore the Data Fusion for Data Mining model's potential to significantly impact multimodal data processing, promoting its adoption in diverse, resource-constrained settings.

摘要

在大数据时代,整合多种数据模态带来了重大挑战,尤其是在医疗保健等复杂领域。本文介绍了一种用于数据挖掘的多模态数据融合新流程模型,将嵌入技术以及数据挖掘的跨行业标准流程与现有的数据融合信息组模型相结合。我们的模型旨在降低计算成本、复杂性和偏差,同时提高效率和可靠性。我们还提出了“解缠密集融合”,这是一种新颖的嵌入融合方法,旨在优化互信息并促进密集的跨模态特征交互,从而最大限度地减少冗余信息。我们通过三个用例展示了该模型的有效性:使用视网膜图像和患者元数据预测糖尿病视网膜病变、利用卫星图像、互联网和人口普查数据预测家庭暴力,以及从X光图像和临床记录中识别临床和人口统计学特征。该模型在糖尿病视网膜病变预测中取得了0.92的宏F1分数,在家庭暴力预测中取得了0.854的R平方和24.868的对称平均绝对百分比误差,在放射学分析中,疾病预测和性别分类的宏AUC分别为0.92和0.99。这些结果凸显了数据挖掘数据融合模型对多模态数据处理产生重大影响的潜力,促进其在各种资源受限环境中的应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c193/11092829/28e7882135c2/nihpp-rs4277992v1-f0001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验