DF-DM: A foundational process model for multimodal data fusion in the artificial intelligence era.

Authors

Restrepo David, Wu Chenwei, Vásquez-Venegas Constanza, Nakayama Luis Filipe, Celi Leo Anthony, López Diego M

Affiliations

Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America.

Departamento de Telemática, Universidad del Cauca, Popayán, Cauca, Colombia.

Publication information

Res Sq. 2024 Apr 23:rs.3.rs-4277992. doi: 10.21203/rs.3.rs-4277992/v1.

DOI:10.21203/rs.3.rs-4277992/v1

PMID:38746100

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC11092829/

Abstract

In the big data era, integrating diverse data modalities poses significant challenges, particularly in complex fields like healthcare. This paper introduces a new process model for multimodal Data Fusion for Data Mining, integrating embeddings and the Cross-Industry Standard Process for Data Mining with the existing Data Fusion Information Group model. Our model aims to decrease computational costs, complexity, and bias while improving efficiency and reliability. We also propose "disentangled dense fusion," a novel embedding fusion method designed to optimize mutual information and facilitate dense inter-modality feature interaction, thereby minimizing redundant information. We demonstrate the model's efficacy through three use cases: predicting diabetic retinopathy using retinal images and patient metadata, domestic violence prediction employing satellite imagery, internet, and census data, and identifying clinical and demographic features from radiography images and clinical notes. The model achieved a Macro F1 score of 0.92 in diabetic retinopathy prediction, an R-squared of 0.854 and sMAPE of 24.868 in domestic violence prediction, and a macro AUC of 0.92 and 0.99 for disease prediction and sex classification, respectively, in radiological analysis. These results underscore the Data Fusion for Data Mining model's potential to significantly impact multimodal data processing, promoting its adoption in diverse, resource-constrained settings.
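The abstract reports a Macro F1 score, an R-squared, and an sMAPE. As a reading aid, the following is a minimal sketch of how these three metrics are commonly defined. It is not taken from the paper: the function names are hypothetical, and the sMAPE variant shown (percent scale, mean of absolute values in the denominator) is an assumption, since the abstract does not state which variant the authors used.

```python
from math import fsum


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores (illustrative helper)."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for c in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == c and p == c)
        fp = sum(1 for t, p in pairs if t != c and p == c)
        fn = sum(1 for t, p in pairs if t == c and p != c)
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never occurs.
        scores.append(2 * tp / denom if denom else 0.0)
    return fsum(scores) / len(scores)


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = fsum(y_true) / len(y_true)
    ss_res = fsum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = fsum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def smape(y_true, y_pred):
    """Symmetric MAPE in percent (assumed variant, range 0 to 200)."""
    terms = []
    for t, p in zip(y_true, y_pred):
        denom = (abs(t) + abs(p)) / 2
        # Treat a 0/0 term as zero error.
        terms.append(abs(p - t) / denom if denom else 0.0)
    return 100.0 * fsum(terms) / len(terms)


if __name__ == "__main__":
    print(macro_f1([0, 0, 1, 1], [0, 1, 1, 1]))
    print(r_squared([1.0, 2.0, 3.0], [1.1, 1.9, 3.2]))
    print(smape([100.0, 200.0], [110.0, 180.0]))
```

Under this assumed definition, the reported sMAPE of 24.868 would be read on a percent scale; a different variant would change that reading.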

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c193/11092829/28e7882135c2/nihpp-rs4277992v1-f0001.jpg
