Reducing Annotation Burden Through Multimodal Learning.

Author information

Lopez Kevin, Fodeh Samah J, Allam Ahmed, Brandt Cynthia A, Krauthammer Michael

Affiliations

Program of Computational Biology and Bioinformatics, Yale University, New Haven, CT, United States.

Department of Emergency Medicine, Yale School of Medicine, New Haven, CT, United States.

Publication information

Front Big Data. 2020 Jun 2;3:19. doi: 10.3389/fdata.2020.00019. eCollection 2020.

DOI:10.3389/fdata.2020.00019

PMID:33693393

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7931886/

Abstract

Choosing an optimal data fusion technique is essential when performing machine learning with multimodal data. In this study, we examined deep learning-based multimodal fusion techniques for the combined classification of radiological images and associated text reports. In our analysis, we (1) compared the classification performance of three prototypical multimodal fusion techniques: Early, Late, and Model fusion; (2) assessed the performance of multimodal compared to unimodal learning; and (3) investigated the amount of labeled data needed by multimodal vs. unimodal models to yield comparable classification performance. Our experiments demonstrate the potential of multimodal fusion methods to yield competitive results using less labeled training data than their unimodal counterparts. This was more pronounced using the Early and less so using the Model and Late fusion approaches. With increasing amounts of training data, unimodal models achieved results comparable to those of multimodal models. Overall, our results suggest that multimodal learning can decrease the need for labeled training data, resulting in a lower annotation burden for domain experts.
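Multimodal fusion strategies are commonly distinguished by where the modalities are combined: at the feature level (early fusion) or at the decision level (late fusion). The following is a minimal sketch of that distinction in plain Python, not the study's implementation. The function names, the toy feature vectors, and the equal weighting of the two modalities are all assumptions made for illustration.

```python
def early_fusion(image_features, text_features):
    """Feature-level fusion: concatenate each sample's image and text
    feature vectors so that a single classifier sees both modalities."""
    return [img + txt for img, txt in zip(image_features, text_features)]


def late_fusion(image_probs, text_probs, image_weight=0.5):
    """Decision-level fusion: take a weighted average of the class
    probabilities produced by two separately trained unimodal classifiers.
    The default weight of 0.5 is an assumption, not a value from the study."""
    return [
        [image_weight * p_img + (1.0 - image_weight) * p_txt
         for p_img, p_txt in zip(img_row, txt_row)]
        for img_row, txt_row in zip(image_probs, text_probs)
    ]


# Hypothetical toy inputs: two samples, 3 image features and 2 text features each.
image_features = [[0.2, 0.8, 0.1], [0.9, 0.1, 0.4]]
text_features = [[1.0, 0.0], [0.0, 1.0]]
fused = early_fusion(image_features, text_features)  # 5 features per sample

# Hypothetical class probabilities from an image model and a text model.
image_probs = [[0.6, 0.4], [0.3, 0.7]]
text_probs = [[0.8, 0.2], [0.1, 0.9]]
combined = late_fusion(image_probs, text_probs)

# Predicted class = index of the highest fused probability.
predicted = [row.index(max(row)) for row in combined]
```

In the early-fusion case the fused vectors would be passed to one downstream classifier, whereas in the late-fusion case each modality keeps its own classifier and only the outputs are merged.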
