Khattar Anuradha, Quadri S M K
Department of Computer Science, Faculty of Natural Sciences, Jamia Millia Islamia, New Delhi, India.
Multimed Tools Appl. 2023;82(6):9083-9111. doi: 10.1007/s11042-022-13456-0. Epub 2022 Jul 20.
Labeled data scarcity at the time of an ongoing disaster has encouraged the researchers to use the labeled data from some previous disaster for training and transferring the knowledge to the current disaster task using Domain Adaptation (DA). However, often labeled data from more than one previous disaster may be available. As all deep learning models are data-hungry and perform better if fed with more annotated data, it is advisable to use data from multiple sources for training a Deep Convolutional Neural Network (DCNN). One of the easiest ways is to simply combine the data from multiple sources and use it for training. However, this arrangement is not that straightforward. The models trained on the combined data from various sources do not perform well on the target, mainly due to distribution discrepancies between multiple sources. This has motivated us to explore the challenging area of multi-source domain adaptation for disaster management. The aim is to learn the domain invariant features and representations across the domains and transfer more related knowledge to solve the target task with improved accuracy than single-source or combined-source domain adaptation. This study proposes a Multi-Source Domain Adaptation framework for Disaster Management (MSDA-DM) to classify disaster images posted on social media based on unsupervised DA with adversarial training. The empirical results obtained confirm that the proposed model MSDA-DM performs better than single-source DA by up to 10.83% and combined-source DA by up to 5.06% in terms of F1-score for different sets of source and target disaster domains. We also compare our model with current state-of-the-art models. The main challenge of multi-source DA is the choice of the relevant sources taken for training since, unlike single-source DA that handles only source-target distribution drift, the multi-source DA network has to address both source-target and source-source distribution drifts.
在持续灾难发生时,标注数据稀缺促使研究人员使用之前某些灾难的标注数据进行训练,并通过域适应(DA)将知识转移到当前的灾难任务中。然而,通常可能有来自不止一次先前灾难的标注数据。由于所有深度学习模型都需要大量数据,并且如果提供更多带注释的数据会表现得更好,因此建议使用来自多个源的数据来训练深度卷积神经网络(DCNN)。最简单的方法之一就是简单地将来自多个源的数据组合起来并用于训练。然而,这种安排并非那么简单直接。在来自各种源的组合数据上训练的模型在目标数据上表现不佳,主要是由于多个源之间的分布差异。这促使我们探索灾难管理中多源域适应这一具有挑战性的领域。目的是学习跨域的域不变特征和表示,并转移更多相关知识,以比单源或组合源域适应更高的准确率来解决目标任务。本研究提出了一种用于灾难管理的多源域适应框架(MSDA-DM),以基于带有对抗训练的无监督DA对社交媒体上发布的灾难图像进行分类。获得的实证结果证实,对于不同的源和目标灾难域集,所提出的模型MSDA-DM在F1分数方面比单源DA性能提升高达10.83%,比组合源DA性能提升高达5.06%。我们还将我们的模型与当前的最先进模型进行了比较。多源DA的主要挑战在于选择用于训练的相关源,因为与仅处理源 - 目标分布漂移的单源DA不同,多源DA网络必须同时解决源 - 目标和源 - 源分布漂移问题。